Summary: | Tamil conjuncts are not selected as a single unit when styling initials |
---|---|
Product: | WebKit |
Reporter: | Fuqiao Xue <xfq.free> |
Component: | Layout and Rendering |
Assignee: | Nobody <webkit-unassigned> |
Status: | NEW |
Severity: | Normal |
CC: | ap, bfulgham, darin, mmaxfield, simon.fraser, webkit-bug-importer, zalan |
Priority: | P2 |
Keywords: | InRadar |
Version: | WebKit Nightly Build |
Hardware: | Unspecified |
OS: | Unspecified |
See Also: | https://bugs.webkit.org/show_bug.cgi?id=228992, https://bugs.webkit.org/show_bug.cgi?id=179815, https://bugs.webkit.org/show_bug.cgi?id=127519 |
Description
Fuqiao Xue
2021-12-01 23:06:02 PST
I wonder which Unicode algorithm is the basis for implementing the correct behavior here. We don’t want to come up with something novel, but I understand that to get this right we need to go beyond "grapheme cluster". For example, is "extended grapheme cluster" enough?

FWIW, following https://drafts.csswg.org/css-pseudo/#first-letter-pseudo it looks like we'd need to devise something that matches platform behavior:

> A UA must use the extended grapheme cluster (not legacy grapheme cluster), as defined in UAX29, as the basis for its typographic character unit. However, the UA should tailor the definitions as required by typographic tradition since the default rules are not always appropriate or ideal—and is expected to tailor them differently depending on the operation as needed.

Maybe it can be the same as character selection.

I’m not sure if our platform has any concept of initial letter… Maybe I should talk to the Pages engineers. It does have the concept of "shift-right-arrow to select one character", which is what Alexey was referring to.
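
To make the "tailor the definitions" idea concrete, here is a rough Python sketch of what a tailoring on top of cluster segmentation could look like. Everything named here is hypothetical: `simple_clusters` is only a crude stand-in for UAX29 (a base character plus any following combining marks), and `TAMIL_CONJUNCT_PAIRS` is an assumed tailoring table, not what WebKit or any platform actually uses. The point is only that an untailored split breaks after the Tamil virama (pulli), so a conjunct such as U+0B95 U+0BCD U+0BB7 comes out as two units unless the two halves are re-joined.

```python
import unicodedata

TAMIL_VIRAMA = "\u0BCD"  # Tamil pulli

# Assumed tailoring table (hypothetical): consonant pairs that join
# across a virama to form a single conjunct in Tamil.
TAMIL_CONJUNCT_PAIRS = {
    ("\u0B95", "\u0BB7"),  # KA + SSA
    ("\u0BB8", "\u0BB0"),  # SA + RA (as in "sri")
    ("\u0BB6", "\u0BB0"),  # SHA + RA (alternate "sri")
}


def simple_clusters(text):
    """Crude approximation of grapheme clusters, NOT full UAX29:
    each base character absorbs the combining marks that follow it."""
    clusters = []
    for ch in text:
        if clusters and unicodedata.category(ch) in ("Mn", "Mc", "Me"):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters


def tailored_clusters(text):
    """Re-join clusters that the untailored split separates inside a
    Tamil conjunct listed in TAMIL_CONJUNCT_PAIRS."""
    merged = []
    for cluster in simple_clusters(text):
        if merged:
            prev = merged[-1]
            ends_in_virama = len(prev) >= 2 and prev.endswith(TAMIL_VIRAMA)
            if ends_in_virama and (prev[-2], cluster[0]) in TAMIL_CONJUNCT_PAIRS:
                merged[-1] += cluster
                continue
        merged.append(cluster)
    return merged


def first_letter(text):
    """Unit that a first-letter style would select under this tailoring."""
    clusters = tailored_clusters(text)
    return clusters[0] if clusters else ""


word = "\u0B95\u0BCD\u0BB7\u0BA3\u0BAE\u0BCD"  # starts with the KA+SSA conjunct
print(len(simple_clusters(word)), len(tailored_clusters(word)))
print([hex(ord(c)) for c in first_letter(word)])
```

A table-driven merge is used here only because it is easy to read. Whether the real fix should live in a shared segmentation layer (so that selection and first-letter agree) or be specific to first-letter is exactly the open question above.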