Bug 19216 - Character set detection seems to happen too late
Summary: Character set detection seems to happen too late
Status: NEW
Alias: None
Product: WebKit
Classification: Unclassified
Component: Text
Version: 528+ (Nightly build)
Hardware: Mac OS X 10.5
Importance: P2 Normal
Assignee: Nobody
URL: http://www.zoo.gov.tw/table.shtml
Keywords:
Depends on:
Blocks:
 
Reported: 2008-05-23 08:47 PDT by David Carson
Modified: 2009-01-11 14:54 PST

Description David Carson 2008-05-23 08:47:06 PDT
When I load this site, the text is garbled. In Firefox, the page also starts out garbled, but it then switches to the correct character set.
Comment 1 Alexey Proskuryakov 2009-01-11 14:54:22 PST
This document has a broken structure (part of the body content comes before the head). In such cases, WebKit only checks the first kilobyte for a meta charset declaration, while Firefox restarts decoding if it sees the meta anywhere in the document.
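
The difference between the two strategies can be sketched roughly as follows. This is a simplified illustration, not WebKit's or Firefox's actual code: the regular expression, the `sniff_charset` helper, and the sample page (including its `big5` declaration) are all assumptions made for the example.

```python
import re

# Simplified pattern; a real prescan handles many more cases
# (comments, attribute quoting rules, http-equiv parsing, and so on).
META_CHARSET = re.compile(
    rb'<meta[^>]*charset\s*=\s*["\']?([A-Za-z0-9_\-]+)', re.IGNORECASE
)

PRESCAN_LIMIT = 1024  # the "first kilobyte" described in the comment


def sniff_charset(data, limit=PRESCAN_LIMIT):
    """Return the declared charset found within `limit` bytes, or None.

    Passing limit=None scans the whole document instead.
    """
    window = data if limit is None else data[:limit]
    match = META_CHARSET.search(window)
    return match.group(1).decode("ascii").lower() if match else None


# Hypothetical page: about 2 KB of body content precedes the meta element.
late_meta = (
    b"<p>" + b"x" * 2000 + b"</p>"
    + b'<meta http-equiv="Content-Type" content="text/html; charset=big5">'
)

print(sniff_charset(late_meta))        # bounded prescan misses the declaration
print(sniff_charset(late_meta, None))  # unbounded scan finds it
```

With a bounded prescan the declaration is never seen, so the page is decoded with whatever default applies; scanning the whole document finds it, but only after earlier content has already been processed.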

HTML5 specifies the Firefox behavior, but I still have very strong doubts about it: parsing can have side effects (such as script execution), and re-parsing would repeat them.
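
The concern about repeated side effects can be shown with a toy model. Everything here is hypothetical (the token list, `parse`, and `load` are made up for illustration, and no real engine is structured this way): a script token stands in for script execution, and a late meta that disagrees with the current encoding forces a restart from the beginning.

```python
def parse(tokens, encoding, log):
    """Toy parser: return a new encoding if a meta disagrees, else None."""
    for kind, value in tokens:
        if kind == "script":
            log.append(value)  # stand-in for executing the script
        elif kind == "meta" and value != encoding:
            return value       # request a restart with the declared charset
    return None


def load(tokens, default_encoding="latin-1"):
    """Parse, restarting from the top whenever the encoding changes."""
    log = []
    encoding = default_encoding
    while True:
        new_encoding = parse(tokens, encoding, log)
        if new_encoding is None:
            return encoding, log
        encoding = new_encoding


# Hypothetical document: body content and a script come before the meta.
tokens = [
    ("text", "body content"),
    ("script", "counter += 1"),
    ("meta", "big5"),
]

encoding, log = load(tokens)
print(encoding)
print(log)  # the script entry appears once per parsing pass
```

In this model the script is recorded twice, once for the initial pass and once for the pass after the restart, which is the kind of repeated side effect the comment is worried about.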