6046 – Need to cleanup charset aliases support

RESOLVED FIXED 6046

Need to cleanup charset aliases support

https://bugs.webkit.org/show_bug.cgi?id=6046

Summary Need to cleanup charset aliases support

Alexey Proskuryakov

Reported 2005-12-11 22:14:34 PST

With a fix to bug 5932 in place, WebCore uses both its own and ICU's tables for charset names. There are a lot of redundant entries that should be removed (I'm not even sure if any aliases known to WebCore are missing from ICU). Also, ICU works directly with charset names, so it's unnecessary to convert them to CFStringEncodings and back.

Attachments
Add attachment proposed patch, testcase, etc.

Darin Adler

Comment 1 2005-12-18 14:25:37 PST

Note that WebCore's lookup ignores case and all non-alphanumeric characters. So that if WebCore lists "UTF-8", then "u<t>f-8-" will also work. Since ICU does not have this property, we'd need to do something if we want to preserve this behavior.

Alexey Proskuryakov

Comment 2 2005-12-19 03:38:15 PST

(In reply to comment #1) ICU does something similar: it ignores case, dashes, spaces and underscores. Dunno whether ignoring other non-alphanumeric symbols is useful.

Alexey Proskuryakov

Comment 3 2007-03-31 11:45:27 PDT

Charset handling has been refactored, this bug doesn't appear useful anymore.

Note You need to log in before you can comment on or make changes to this bug.

Status RESOLVED

Resolution FIXED

Priority P2

Severity Enhancement

Classification Unclassified

Version 420+

Hardware Mac

OS OS X 10.4

Product WebKit

Component DOM

Assignee

Nobody

Reported

2005-12-11 22:14 PST

Modified

2007-03-31 11:45 PDT History

CC List

1 user Show

URL

Keywords

Depends on

Blocks