<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>74610</bug_id>
          
          <creation_ts>2011-12-15 08:34:25 -0800</creation_ts>
          <short_desc>Remove UTF-7 and UTF-32 support</short_desc>
          <delta_ts>2023-05-29 02:18:23 -0700</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>Platform</component>
          <version>528+ (Nightly build)</version>
          <rep_platform>Unspecified</rep_platform>
          <op_sys>Unspecified</op_sys>
          <bug_status>NEW</bug_status>
          <resolution></resolution>
          
          <see_also>https://bugs.webkit.org/show_bug.cgi?id=159651</see_also>
    
    <see_also>https://bugs.webkit.org/show_bug.cgi?id=10709</see_also>
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>Normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Anne van Kesteren">annevk</reporter>
          <assigned_to name="Nobody">webkit-unassigned</assigned_to>
          <cc>ap</cc>
    
    <cc>cdumez</cc>
    
    <cc>eoconnor</cc>
    
    <cc>jshin</cc>
    
    <cc>masa141421356</cc>
    
    <cc>mjs</cc>
    
    <cc>Ms2ger</cc>
    
    <cc>syoichi</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>521918</commentid>
    <comment_count>0</comment_count>
    <who name="Anne van Kesteren">annevk</who>
    <bug_when>2011-12-15 08:34:25 -0800</bug_when>
    <thetext>I&apos;m not sure what the right component is or who to copy on this bug exactly, but per HTML UTF-7 and UTF-32 should not be supported. Gecko and Presto have disabled these already. WebKit/Chromium would preferably follow us here.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>522217</commentid>
    <comment_count>1</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2011-12-15 13:47:38 -0800</bug_when>
    <thetext>Do you have a test case for out UTF-7 support? That would be a bug, as we were supposed to have blocked it long ago, &lt;http://trac.webkit.org/changeset/49487&gt;.

What is the rationale to ban UTF-32?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>522218</commentid>
    <comment_count>2</comment_count>
    <who name="Theresa O&apos;Connor">eoconnor</who>
    <bug_when>2011-12-15 13:53:22 -0800</bug_when>
    <thetext>ap: from the HTML spec ( http://www.whatwg.org/specs/web-apps/current-work/multipage/parsing.html#character-encodings-0 ):

Support for UTF-32 is not recommended. This encoding is rarely used, and frequently implemented incorrectly.

This specification does not make any attempt to support EBCDIC-based encodings and UTF-32 in its algorithms; support and use of these encodings can thus lead to unexpected behavior in implementations of this specification.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>522226</commentid>
    <comment_count>3</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2011-12-15 14:04:11 -0800</bug_when>
    <thetext>So, the rationale is &quot;rarely used, and frequently implemented incorrectly&quot;.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>848860</commentid>
    <comment_count>4</comment_count>
    <who name="Masahiro Yamada">masa141421356</who>
    <bug_when>2013-03-06 07:45:19 -0800</bug_when>
    <thetext>Support of UTF-7 is removed by bug 29078.
But other unrecommended encoding are still supported.
(CESU-8, UTF-7, BOCU-1 and SCSU)

Matrix of supported encoding names per browser is here:
http://l0.cm/encodings/table/</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1958587</commentid>
    <comment_count>5</comment_count>
    <who name="Anne van Kesteren">annevk</who>
    <bug_when>2023-05-29 02:18:23 -0700</bug_when>
    <thetext>What is UTF7Encoding() about? Is that for WebKit embedders? If so, I suspect this is fixed as UTF-7 is blocklisted somewhere.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>