<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>231660</bug_id>
          
          <creation_ts>2021-10-13 00:32:05 -0700</creation_ts>
          <short_desc>Use GBK as fallback, not gb18030</short_desc>
          <delta_ts>2021-10-19 10:34:05 -0700</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>DOM</component>
          <version>WebKit Nightly Build</version>
          <rep_platform>Unspecified</rep_platform>
          <op_sys>Unspecified</op_sys>
          <bug_status>NEW</bug_status>
          <resolution></resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords>InRadar</keywords>
          <priority>P2</priority>
          <bug_severity>Normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Anne van Kesteren">annevk</reporter>
          <assigned_to name="Nobody">webkit-unassigned</assigned_to>
          <cc>cdumez</cc>
    
    <cc>kevin_neal</cc>
    
    <cc>mmaxfield</cc>
    
    <cc>webkit-bug-importer</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>1803664</commentid>
    <comment_count>0</comment_count>
    <who name="Anne van Kesteren">annevk</who>
    <bug_when>2021-10-13 00:32:05 -0700</bug_when>
    <thetext>For users with a Chinese locale, it&apos;s better to use GBK as the fallback encoding. It uses the same decoder as gb18030, but it uses a more conservative encoder which is likely more compatible with servers.

Chrome and Firefox already have (this) behavior (like this). (Unfortunately there are still some differences in the overall text encoding story. Mozilla would be up for standardizing those.)

https://github.com/whatwg/html/pull/4714 changed this in the HTML Standard.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1803831</commentid>
    <comment_count>1</comment_count>
    <who name="Chris Dumez">cdumez</who>
    <bug_when>2021-10-13 10:15:19 -0700</bug_when>
    <thetext>Test page: https://hsivonen.com/test/moz/sniff-zh-hans.htm

Says &quot;windows-1252&quot; on my machine. But apparently, if you change your system language to Chinese and reboot, it will say &quot;gb18030&quot; (not &quot;gbk&quot;).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1806138</commentid>
    <comment_count>2</comment_count>
    <who name="Radar WebKit Bug Importer">webkit-bug-importer</who>
    <bug_when>2021-10-19 10:34:05 -0700</bug_when>
    <thetext>&lt;rdar://problem/84421509&gt;</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>