Bug 125225 - Support fewer text encoding names, don't blindly enumerate everything from ICU
Summary: Support fewer text encoding names, don't blindly enumerate everything from ICU
Status: RESOLVED CONFIGURATION CHANGED
Alias: None
Product: WebKit
Classification: Unclassified
Component: Text (show other bugs)
Version: 528+ (Nightly build)
Hardware: Unspecified Unspecified
: P2 Normal
Assignee: Nobody
URL: data:text/html;charset=windows-65001,...
Keywords:
Depends on:
Blocks:
 
Reported: 2013-12-04 08:10 PST by Anne van Kesteren
Modified: 2020-05-05 14:53 PDT (History)
2 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Anne van Kesteren 2013-12-04 08:10:07 PST
Supporting labels such as windows-65001 for utf-8 seems bad.

Any chance you could align closer to Gecko and http://encoding.spec.whatwg.org/ ? Supporting more labels is not necessarily in your benefit. E.g. sites might rely on them not being supported.
Comment 1 Alexey Proskuryakov 2013-12-04 09:53:38 PST
Yes, I want to switch from enumerating all ICU aliases to having a built-in table in WebKit. That will have the side effect of removing support for some crazy encoding names.
Comment 2 Alexey Proskuryakov 2013-12-04 09:57:11 PST
That said, the ideal resolution would be to have an ICU "standard" for HTML5, so that the right names would be in ICU. This is exactly what those are for, and I find it less than great that HTML encoding names are tracked in the Encoding spec, not in ICU.

See <http://demo.icu-project.org/icu-bin/convexp> - there are separate namespaces for MIME, IANA, and others. HTML is just another case.
Comment 3 Anne van Kesteren 2013-12-04 12:57:11 PST
ap, I don't understand that feedback. Why would it not be good to have a standard for what ICU should implement? ICU is not the only implementation.
Comment 4 Alexey Proskuryakov 2013-12-04 13:25:07 PST
True. Maybe that would be another project under Unicode consortium umbrella (CLDR perhaps)?

The stellar history of the consortium's interaction with many interested parties makes it a desirable gatekeeper for a sensitive spec like this.
Comment 5 Anne van Kesteren 2013-12-04 13:34:58 PST
I don't really see what moving it brings us. It's mostly a research project.
Comment 6 Alexey Proskuryakov 2014-01-01 23:20:44 PST
See also: bug 28760.
Comment 7 Alexey Proskuryakov 2020-05-05 14:53:48 PDT
This was done a few years ago.