Bug 65971

Summary: In-page search: asymmetric search for letters with diacritic mark
Product: WebKit
Reporter: Yair Yogev <progame+wk>
Component: Text
Assignee: Nobody <webkit-unassigned>
Status: NEW
Severity: Normal
CC: ap, dglazkov, jshin, mitz, playmobil, rniwa, xji
Priority: P2
Version: 528+ (Nightly build)
Hardware: All
OS: Unspecified

Yair Yogev
Reported 2011-08-10 02:54:45 PDT
In-page search isn't sensitive to Hebrew diacritics (nikud). For example, searching for שִׁ gives the same results as searching for ש: the diacritics are ignored. Ideally, IMO, a search for ש (for example) would find all occurrences of ש regardless of the diacritics, but a search for שִׁ would find only שִׁ (and not ש שׁ שֶׂ שֻׁ שֶׁ etc.). Only tested on Windows Safari (but I imagine it affects all platforms). Chromium bug: http://crbug.com/84111
Jungshik Shin
Comment 1 2011-08-10 14:02:57 PDT
This is not Hebrew-specific. It also applies to any Latin/Greek/Cyrillic letters with diacritics (and some more). What's requested here is an 'asymmetric' match, in the sense that 'A with umlaut' will match only 'A with umlaut', but 'A' will match 'A with umlaut', 'A', 'A with ring above', etc. It's being worked on by an Apple engineer (in the CLDR). In the meantime, if we just need 'diacritic-sensitive' search, it's currently possible if 'case-sensitive' search is turned on. What's not currently possible is a search that is 'case-insensitive' but 'diacritic-sensitive'.
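For illustration only, the asymmetric rule described in this comment could be sketched as below. This is not WebKit's or ICU's implementation; the function names (`clusters`, `cluster_matches`, `asymmetric_find`) are made up for this example, and it assumes that "diacritic" means a Unicode combining mark after NFD decomposition, which is a simplification.

```python
import unicodedata


def clusters(s):
    """Decompose s to NFD and group each base character with the
    combining marks that follow it, as (base, marks) pairs."""
    out = []
    for ch in unicodedata.normalize("NFD", s):
        if unicodedata.combining(ch) and out:
            base, marks = out[-1]
            out[-1] = (base, marks + ch)
        else:
            out.append((ch, ""))
    return out


def cluster_matches(pattern_cluster, text_cluster):
    """Asymmetric comparison: a bare pattern letter matches any marks,
    a marked pattern letter matches only the identical marks."""
    p_base, p_marks = pattern_cluster
    t_base, t_marks = text_cluster
    if p_base.casefold() != t_base.casefold():  # case-insensitive on the base
        return False
    return p_marks == "" or p_marks == t_marks


def asymmetric_find(pattern, text):
    """Return the cluster indices in text where pattern matches."""
    p, t = clusters(pattern), clusters(text)
    if not p:
        return []
    return [
        i
        for i in range(len(t) - len(p) + 1)
        if all(cluster_matches(pc, tc) for pc, tc in zip(p, t[i:i + len(p)]))
    ]


# "a" finds a, a-umlaut and a-ring; "a-umlaut" finds only a-umlaut.
print(asymmetric_find("a", "a \u00e4 \u00e5"))
print(asymmetric_find("\u00e4", "a \u00e4 \u00e5"))
```

The returned positions count base-plus-marks clusters, not code points. Because the base letters are compared with `casefold()` while the marks are compared exactly, the sketch also behaves as a 'case-insensitive' but 'diacritic-sensitive' search whenever the pattern carries marks.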
Jungshik Shin
Comment 2 2011-08-10 14:06:12 PDT
Actually, the corresponding Chromium bug is http://crbug.com/71741
http://crbug.com/84111 is about searching for a stand-alone diacritic mark.
Alexey Proskuryakov
Comment 3 2011-08-10 15:50:31 PDT
> This is also applicable to any Latin/Greek/Cyrillic letters with diacritics (and some more)

Each language has its own rules. In Russian search, "ё" should be fully symmetrical to "е", but "й" should never match "и" in either direction.
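As a rough sketch of what such per-language rules could look like, the example below folds only the letter pairs that a language treats as equivalent and leaves every other letter distinct. The `FOLD` table, `search_key`, and `contains` are hypothetical names for this illustration; the table holds just the Russian rule from this comment and is not taken from CLDR or any real tailoring data.

```python
import unicodedata

# Hypothetical per-language folding table (illustration only).
# Only pairs listed here are treated as the same letter when searching.
FOLD = {
    "ru": {
        "\u0451": "\u0435",  # CYRILLIC SMALL LETTER IO folds to IE (symmetric)
        # SHORT I (U+0439) is deliberately absent: it stays distinct from I.
    },
}


def search_key(s, lang):
    """Build a comparison key: NFC-compose, case-fold, then apply the
    language's folding table character by character."""
    table = FOLD.get(lang, {})
    composed = unicodedata.normalize("NFC", s).casefold()
    return "".join(table.get(ch, ch) for ch in composed)


def contains(pattern, text, lang):
    """True if pattern occurs in text under the language's rules."""
    return search_key(pattern, lang) in search_key(text, lang)


# IO and IE match in both directions; SHORT I and I match in neither.
print(contains("\u0451", "\u0435", "ru"), contains("\u0435", "\u0451", "ru"))
print(contains("\u0439", "\u0438", "ru"), contains("\u0438", "\u0439", "ru"))
```

Composing to NFC first means that a decomposed "и" followed by a combining breve is compared as the single letter "й", so a generic rule that strips marks cannot accidentally merge the two.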