Bug 231660

Summary: Use GBK as fallback, not gb18030
Product: WebKit Reporter: Anne van Kesteren <annevk>
Component: DOMAssignee: Nobody <webkit-unassigned>
Status: NEW    
Severity: Normal CC: cdumez, kevin_neal, mmaxfield, webkit-bug-importer
Priority: P2 Keywords: InRadar
Version: WebKit Nightly Build   
Hardware: Unspecified   
OS: Unspecified   

Anne van Kesteren
Reported 2021-10-13 00:32:05 PDT
For users with a Chinese locale, it's better to use GBK as the fallback encoding. It uses the same decoder as gb18030, but it uses a more conservative encoder which is likely more compatible with servers. Chrome and Firefox already have (this) behavior (like this). (Unfortunately there are still some differences in the overall text encoding story. Mozilla would be up for standardizing those.) https://github.com/whatwg/html/pull/4714 changed this in the HTML Standard.
Attachments
Chris Dumez
Comment 1 2021-10-13 10:15:19 PDT
Test page: https://hsivonen.com/test/moz/sniff-zh-hans.htm Says "windows-1252" on my machine. But apparently, if you change your system language to Chinese and reboot, it will say "gb18030" (not "gbk").
Radar WebKit Bug Importer
Comment 2 2021-10-19 10:34:05 PDT
Note You need to log in before you can comment on or make changes to this bug.