Bug 159203 - Use Unicode 9’s `ID_Start` & `ID_Continue` for identifiers
Summary: Use Unicode 9’s `ID_Start` & `ID_Continue` for identifiers
Status: NEW
Alias: None
Product: WebKit
Classification: Unclassified
Component: JavaScriptCore (show other bugs)
Version: WebKit Nightly Build
Hardware: Unspecified Unspecified
: P2 Normal
Assignee: Yusuke Suzuki
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-06-28 02:32 PDT by Mathias Bynens
Modified: 2018-01-25 18:59 PST (History)
3 users (show)

See Also:


Attachments
Node.js script that outputs lists of new `ID_Start` and `ID_Continue` symbols (536 bytes, text/javascript)
2016-06-28 02:33 PDT, Mathias Bynens
no flags Details
results.js (198.34 KB, text/javascript)
2016-06-28 02:33 PDT, Mathias Bynens
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Mathias Bynens 2016-06-28 02:32:40 PDT
* Unicode 8 has 2,518 `ID_Start` symbols; Unicode 9 has 2,669, i.e. 151 more (no removals).
* Unicode 8 has 109,830 `ID_Continue` symbols; Unicode 9 has 117,007, i.e. 7,177 more (no removals).

E.g. these should not throw per Unicode 9:

    Function('var \u{1E943}'); // new ID_Start
    Function('var _\u{1E959}'); // new ID_Continue

I’ve attached `results.js` which contains the full list of new `ID_Start` and `ID_Continue` symbols in Unicode 9, and the Node.js script used to generate it.
Comment 1 Mathias Bynens 2016-06-28 02:33:21 PDT
Created attachment 282228 [details]
Node.js script that outputs lists of new `ID_Start` and `ID_Continue` symbols
Comment 2 Mathias Bynens 2016-06-28 02:33:36 PDT
Created attachment 282229 [details]
results.js
Comment 3 Mathias Bynens 2016-06-28 02:41:36 PDT
Bugzilla won’t allow me to add https://github.com/Microsoft/ChakraCore/issues/1208 to the “See also” list. Can that be fixed?
Comment 4 Alexey Proskuryakov 2016-06-28 20:09:11 PDT
Per <http://www.ecma-international.org/ecma-262/7.0/>, "A conforming implementation of ECMAScript must interpret source text input in conformance with the Unicode Standard, Version 8.0.0 or later and ISO/IEC 10646".
Comment 5 Mathias Bynens 2016-06-28 23:32:45 PDT
The intent is to use the latest available Unicode version. See https://github.com/tc39/ecma262/pull/620 for some discussion.