Summary: | Windows-1253 (Greek) encoding error | |
---|---|---|---
Product: | WebKit | Reporter: | O. Andersen <pub-webkit>
Component: | Text | Assignee: | Nobody <webkit-unassigned>
Status: | RESOLVED INVALID | |
Severity: | Normal | CC: | ap
Priority: | P2 | Keywords: | InRadar
Version: | 528+ (Nightly build) | |
Hardware: | All | |
OS: | OS X 10.6 | |
Description
O. Andersen
2010-07-11 09:55:54 PDT
My testing gives somewhat different results:

1. IE converts AA to U+F8F9, which is a PUA character;
2. Firefox seems mysteriously broken, it "eats" the byte after AA.

We just use ICU, <http://icu-project.org/icu-bin/convexp?conv=ibm-5349_P100-1998&s=ALL>, so it would be more straightforward to discuss this in the ICU bug tracker. To me, the problem (if it's even a problem) doesn't seem serious enough to warrant a workaround in WebKit.

Created attachment 61255: test case

1. You are of course right about IE mapping undefined characters to PUA instead of U+FFFD. Sorry for being imprecise.
2. This is the result of a known bug in current versions of Firefox. The bug has been fixed, but the fix does not seem to have reached non-beta versions yet. Firefox did map 0xFF to U+FFFD before this bug was introduced, and future versions can be expected to do the same.

I agree that this should be fixed in ICU and that a temporary work-around is probably not needed.

(In reply to comment #3)
> 0xFF

That should of course be 0xAA.

OK, let's treat this as any bug in underlying libraries that we don't plan to work around, and mark it as INVALID then. If you file an ICU bug, please post its URL here.

Filed an ICU bug: <http://bugs.icu-project.org/trac/ticket/7818>
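
For anyone who wants to reproduce the two decoding behaviours discussed above without a browser, here is a minimal sketch in Python. It uses Python's built-in `cp1253` codec as a stand-in, not ICU's `ibm-5349_P100-1998` table and not WebKit's code, so it only illustrates the idea. The `PUA_FALLBACK` table and the `pua_or_replacement` handler name are hypothetical; the only PUA mapping taken from this thread is 0xAA to U+F8F9.

```python
import codecs

# Assumption: the single PUA mapping reported in this thread (IE: 0xAA -> U+F8F9).
# Any other undefined byte falls back to U+FFFD in this sketch.
PUA_FALLBACK = {0xAA: "\uf8f9"}


def pua_or_replacement(err):
    """Hypothetical error handler: PUA character for listed bytes, else U+FFFD."""
    if not isinstance(err, UnicodeDecodeError):
        raise err
    bad_bytes = err.object[err.start:err.end]
    text = "".join(PUA_FALLBACK.get(b, "\ufffd") for b in bad_bytes)
    # Resume decoding right after the undefined byte, so nothing is "eaten".
    return text, err.end


codecs.register_error("pua_or_replacement", pua_or_replacement)

# GREEK CAPITAL ALPHA, the undefined byte 0xAA, GREEK CAPITAL BETA
data = b"\xc1\xaa\xc2"

# Behaviour 1: undefined byte becomes U+FFFD REPLACEMENT CHARACTER.
replaced = data.decode("cp1253", errors="replace")

# Behaviour 2: undefined byte becomes a Private Use Area character.
pua = data.decode("cp1253", errors="pua_or_replacement")

print([f"U+{ord(c):04X}" for c in replaced])
print([f"U+{ord(c):04X}" for c in pua])
```

In both cases the byte following 0xAA is still decoded, which is the property the Firefox behaviour described above was missing.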