125225 – Support fewer text encoding names, don't blindly enumerate everything from ICU

RESOLVED CONFIGURATION CHANGED 125225

https://bugs.webkit.org/show_bug.cgi?id=125225

Anne van Kesteren

Reported 2013-12-04 08:10:07 PST

Supporting labels such as windows-65001 for utf-8 seems bad. Any chance you could align more closely with Gecko and http://encoding.spec.whatwg.org/ ? Supporting more labels is not necessarily to your benefit. E.g. sites might rely on them not being supported.

Alexey Proskuryakov

Comment 1 2013-12-04 09:53:38 PST

Yes, I want to switch from enumerating all ICU aliases to having a built-in table in WebKit. That will have the side effect of removing support for some crazy encoding names.
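
A minimal sketch of the built-in table idea, written in Python for brevity and not taken from any WebKit patch. The names `LABEL_TABLE`, `ascii_lower`, and `encoding_for_label` are hypothetical, and the table holds only a small illustrative subset of labels. The lookup trims ASCII whitespace and compares ASCII case-insensitively, which is an assumption modelled on the Encoding spec linked above.

```python
# Illustrative subset only: a real table would list every supported label.
# All names in this sketch are hypothetical, not WebKit's.
LABEL_TABLE = {
    "utf-8": "UTF-8",
    "utf8": "UTF-8",
    "unicode-1-1-utf-8": "UTF-8",
    "windows-1252": "windows-1252",
    "latin1": "windows-1252",
    "iso-8859-1": "windows-1252",
    "us-ascii": "windows-1252",
}

ASCII_WHITESPACE = "\t\n\f\r "


def ascii_lower(text):
    """Lowercase only A-Z so non-ASCII look-alikes never match a label."""
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in text)


def encoding_for_label(label):
    """Return the canonical encoding name, or None for an unknown label."""
    key = ascii_lower(label.strip(ASCII_WHITESPACE))
    return LABEL_TABLE.get(key)
```

With a fixed table like this, a label such as windows-65001 is simply absent, so `encoding_for_label("windows-65001")` returns `None` instead of resolving to UTF-8 through an alias inherited from the conversion library.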

Alexey Proskuryakov

Comment 2 2013-12-04 09:57:11 PST

That said, the ideal resolution would be to have an ICU "standard" for HTML5, so that the right names would be in ICU. This is exactly what those are for, and I find it less than great that HTML encoding names are tracked in the Encoding spec, not in ICU. See <http://demo.icu-project.org/icu-bin/convexp> - there are separate namespaces for MIME, IANA, and others. HTML is just another case.
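
To illustrate the namespace idea, here is a rough sketch of aliases tagged by standard and filtered on lookup. The data is placeholder: the tag assignments are invented for illustration and are not copied from ICU, `"HTML"` is the hypothetical standard proposed in this comment, and `ALIASES` and `names_for_standard` are made-up names.

```python
# Placeholder alias data: the tag sets are invented for illustration and are
# not copied from ICU. "HTML" is the hypothetical standard discussed above.
ALIASES = {
    "UTF-8": {
        "utf-8": {"IANA", "MIME", "HTML"},
        "utf8": {"HTML"},
        "windows-65001": {"WINDOWS"},
    },
}


def names_for_standard(converter, standard):
    """List the aliases of a converter that carry the given standard tag."""
    tagged = ALIASES.get(converter, {})
    return sorted(name for name, tags in tagged.items() if standard in tags)
```

Under this model a browser would ask only for the `"HTML"` names and would never see aliases that exist solely in other namespaces.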

Anne van Kesteren

Comment 3 2013-12-04 12:57:11 PST

ap, I don't understand that feedback. Why would it not be good to have a standard for what ICU should implement? ICU is not the only implementation.

Alexey Proskuryakov

Comment 4 2013-12-04 13:25:07 PST

True. Maybe that would be another project under the Unicode Consortium umbrella (CLDR perhaps)? The stellar history of the consortium's interaction with many interested parties makes it a desirable gatekeeper for a sensitive spec like this.

Anne van Kesteren

Comment 5 2013-12-04 13:34:58 PST

I don't really see what moving it brings us. It's mostly a research project.

Alexey Proskuryakov

Comment 6 2014-01-01 23:20:44 PST