Bug 181302 - Web Inspector: Zero-width joiners and other hidden Unicode characters are not displayed
Summary: Web Inspector: Zero-width joiners and other hidden Unicode characters are not...
Status: NEW
Alias: None
Product: WebKit
Classification: Unclassified
Component: Web Inspector (show other bugs)
Version: Safari Technology Preview
Hardware: All All
: P2 Normal
Assignee: Nobody
URL:
Keywords: InRadar
: 220831 (view as bug list)
Depends on:
Blocks:
 
Reported: 2018-01-04 12:46 PST by Craig Hockenberry
Modified: 2021-01-23 07:52 PST (History)
8 users (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Craig Hockenberry 2018-01-04 12:46:46 PST
Zero-width joiners and other hidden Unicode characters in #text nodes are not displayed in the Web Inspector.

To reproduce:

1) Open this web page:

https://meta.stackoverflow.com/questions/361390/can-posts-to-stack-overflow-be-fingerprinted-using-hidden-unicode-characters

2) Select the "foo" in the "foo * bar" sample code

3) Inspect the element

4) Look at the nodeValue for the #text node

Expected result:

All the Unicode code points in the element should be visible. Chrome displays the HTML entities for the Unicode code points contained within the element:

‌​‌​‌‌‌⁠‌‌​‌​​​⁠‌‌​​​​‌⁠‌‌‌​‌​​⁠‌​​​​​⁠‌‌​​‌​​⁠‌‌​‌‌‌‌⁠‌‌​​‌​‌⁠‌‌‌​​‌‌⁠‌​​​​​⁠‌‌​​‌‌​⁠‌‌​‌‌‌‌⁠‌‌​‌‌‌‌⁠‌​​​​​⁠‌‌​​‌​‌⁠‌‌‌​‌‌​⁠‌‌​​‌​‌⁠‌‌​‌‌‌​⁠‌​​​​​⁠‌‌​‌‌​‌⁠‌‌​​‌​‌⁠‌‌​​​​‌⁠‌‌​‌‌‌​⁠‌‌‌‌‌‌foo 

Actual result:

Text value is displayed as " foo ".
Comment 1 Radar WebKit Bug Importer 2018-01-04 15:25:48 PST
<rdar://problem/36307533>
Comment 2 Smoley 2021-01-21 19:37:52 PST
*** Bug 220831 has been marked as a duplicate of this bug. ***
Comment 3 Sam Sneddon [:gsnedders] 2021-01-22 11:04:32 PST
What do we want to display escaped? General category C*? Some of White_Space=yes, as well, probably?
Comment 4 Ebrahim Byagowi 2021-01-23 07:52:35 PST
> What do we want to display escaped? General category C*? Some of White_Space=yes, as well, probably?

Oh, finding a unicode category of them also would be nice, it should contain what WebKit initially started at Bug 93888 and ended up to be https://source.chromium.org/chromium/chromium/src/+/master:third_party/devtools-frontend/src/front_end/elements/ElementsTreeOutline.js;l=1679?q=MappedCharToEntity&ss=chromium I started a draft based on the two on https://pastebin.com/1hVMRLD3