Bug 210816

Summary: Canonicalize JSBigInt generated by structured-cloning by calling rightTrim
Product: WebKit Reporter: Yusuke Suzuki <ysuzuki>
Component: New BugsAssignee: Yusuke Suzuki <ysuzuki>
Status: RESOLVED FIXED    
Severity: Normal CC: alecflett, beidson, darin, ews-watchlist, jsbell, keith_miller, mark.lam, msaboff, saam, tzagallo, webkit-bug-importer
Priority: P2 Keywords: InRadar
Version: WebKit Nightly Build   
Hardware: Unspecified   
OS: Unspecified   
Attachments:
Description Flags
Patch
none
Patch
none
Patch
none
Patch
keith_miller: review+
Patch for landing
none
Patch for landing none

Yusuke Suzuki
Reported 2020-04-21 14:24:45 PDT
[JSC] Call rightTrim to ensure that corrupted serialized data should not produce wrong JSBigInt
Attachments
Patch (1.88 KB, patch)
2020-04-21 14:26 PDT, Yusuke Suzuki
no flags
Patch (8.42 KB, patch)
2020-04-21 14:42 PDT, Yusuke Suzuki
no flags
Patch (8.42 KB, patch)
2020-04-21 14:59 PDT, Yusuke Suzuki
no flags
Patch (8.95 KB, patch)
2020-04-21 15:56 PDT, Yusuke Suzuki
keith_miller: review+
Patch for landing (11.96 KB, patch)
2020-04-21 17:12 PDT, Yusuke Suzuki
no flags
Patch for landing (11.93 KB, patch)
2020-04-21 17:18 PDT, Yusuke Suzuki
no flags
Yusuke Suzuki
Comment 1 2020-04-21 14:26:08 PDT
Yusuke Suzuki
Comment 2 2020-04-21 14:42:33 PDT
Yusuke Suzuki
Comment 3 2020-04-21 14:59:41 PDT
Darin Adler
Comment 4 2020-04-21 15:04:09 PDT
Comment on attachment 397128 [details] Patch View in context: https://bugs.webkit.org/attachment.cgi?id=397128&action=review > Source/WebCore/bindings/js/SerializedScriptValue.cpp:3038 > + bigInt = bigInt->rightTrim(m_lexicalGlobalObject->vm()); Wouldn’t it be better for deserialization to *fail* if rightTrim would do anything here? > Source/WebCore/bindings/js/SerializedScriptValue.cpp:3074 > + bigInt = bigInt->rightTrim(m_lexicalGlobalObject->vm()); Ditto.
Keith Miller
Comment 5 2020-04-21 15:28:46 PDT
Comment on attachment 397128 [details] Patch View in context: https://bugs.webkit.org/attachment.cgi?id=397128&action=review > Source/JavaScriptCore/ChangeLog:3 > + Call rightTrim to ensure that corrupted serialized data should not produce wrong JSBigInt to make structured-cloning for JSBigInt more robust That's a mouthful, can we just say: "Make structured-cloning for JSBigInt more robust by calling rightTrim"? We can explain why in the ChangLog. > Source/WebCore/ChangeLog:8 > + Let's assume that the serialized data is corrupted. In that case, we should (1) throw an failure if it is detected or (2) produce a right value if it is not detected. For (2) do you mean a valid value? What would the "right" value be for corrupted input? > Source/WebCore/ChangeLog:9 > + We should call `JSBigInt::rightTrim(VM&)` when finishing JSBigInt deserialization to at least ensure that (2). Nit: ensure that (2) => ensure (2) >> Source/WebCore/bindings/js/SerializedScriptValue.cpp:3038 >> + bigInt = bigInt->rightTrim(m_lexicalGlobalObject->vm()); > > Wouldn’t it be better for deserialization to *fail* if rightTrim would do anything here? The only upside I can see of not failing would be that maybe a developer could work around a bug on our end. Overall, I agree with you though.
Yusuke Suzuki
Comment 6 2020-04-21 15:41:38 PDT
Comment on attachment 397128 [details] Patch View in context: https://bugs.webkit.org/attachment.cgi?id=397128&action=review Thanks! >> Source/JavaScriptCore/ChangeLog:3 >> + Call rightTrim to ensure that corrupted serialized data should not produce wrong JSBigInt to make structured-cloning for JSBigInt more robust > > That's a mouthful, can we just say: "Make structured-cloning for JSBigInt more robust by calling rightTrim"? > > We can explain why in the ChangLog. Will fix. >> Source/WebCore/ChangeLog:8 >> + Let's assume that the serialized data is corrupted. In that case, we should (1) throw an failure if it is detected or (2) produce a right value if it is not detected. > > For (2) do you mean a valid value? What would the "right" value be for corrupted input? Valid value. Corrupt means that it is not ensuring JSBigInt's data layout invariant. For example, zero should be length = 0 and sign = false. But we can represent this value in various form like, length = 1 and value = 0. Or sign = true, length = 0. This patch is generating correct value while ignoring data layout invariant in JSBigInt. >> Source/WebCore/ChangeLog:9 >> + We should call `JSBigInt::rightTrim(VM&)` when finishing JSBigInt deserialization to at least ensure that (2). > > Nit: ensure that (2) => ensure (2) Fixed. >>> Source/WebCore/bindings/js/SerializedScriptValue.cpp:3038 >>> + bigInt = bigInt->rightTrim(m_lexicalGlobalObject->vm()); >> >> Wouldn’t it be better for deserialization to *fail* if rightTrim would do anything here? > > The only upside I can see of not failing would be that maybe a developer could work around a bug on our end. Overall, I agree with you though. My wording would be misleading. Here, what I said wrong is "wrong in terms of internal invariant of JSBigInt". For example, in this format, we have several ways to represent zero JSBigInt. One is 0 length. One is 1 length, and value is zero. One is 0-length, but sign is true. But in JSBigInt layout, we are assuming some invariant in JSBigInt. For example, if JSBigInt is zero, length should be zero, and sign flag should be false. This `rightTrim(VM&)` offers the way to canonicalize JSBigInt format even if the input is not holding JSBigInt's internal invariant. I think this direction is better. We should not expose such an internal invariant of JSBigInt in serialization code. This makes JSBigInt implementation free from the format of serialization code. Just calling rightTrim correctly canonicalize JSBigInt's internal representation regardless of how it is constructed.
Yusuke Suzuki
Comment 7 2020-04-21 15:51:59 PDT
Comment on attachment 397128 [details] Patch View in context: https://bugs.webkit.org/attachment.cgi?id=397128&action=review >>>> Source/WebCore/bindings/js/SerializedScriptValue.cpp:3038 >>>> + bigInt = bigInt->rightTrim(m_lexicalGlobalObject->vm()); >>> >>> Wouldn’t it be better for deserialization to *fail* if rightTrim would do anything here? >> >> The only upside I can see of not failing would be that maybe a developer could work around a bug on our end. Overall, I agree with you though. > > My wording would be misleading. Here, what I said wrong is "wrong in terms of internal invariant of JSBigInt". For example, in this format, we have several ways to represent zero JSBigInt. One is 0 length. One is 1 length, and value is zero. One is 0-length, but sign is true. > But in JSBigInt layout, we are assuming some invariant in JSBigInt. For example, if JSBigInt is zero, length should be zero, and sign flag should be false. > This `rightTrim(VM&)` offers the way to canonicalize JSBigInt format even if the input is not holding JSBigInt's internal invariant. > I think this direction is better. We should not expose such an internal invariant of JSBigInt in serialization code. This makes JSBigInt implementation free from the format of serialization code. > Just calling rightTrim correctly canonicalize JSBigInt's internal representation regardless of how it is constructed. Let's consider about adding a new invariant into JSBigInt's internal format. If we throw an error if serialized format does not meet JSBigInt's internal invariant, then, we cannot easily modify JSBigInt's invariant because this serialization format requires backward-compatibility. On the other hand, if this canonicalizes the generated JSBigInt, then we can freely add new invariants into JSBigInt implementation without considering the serialization format.
Darin Adler
Comment 8 2020-04-21 15:55:08 PDT
(In reply to Yusuke Suzuki from comment #6) > I think this direction is better. We should not expose such an internal > invariant of JSBigInt in serialization code. OK.
Yusuke Suzuki
Comment 9 2020-04-21 15:56:59 PDT
Yusuke Suzuki
Comment 10 2020-04-21 15:57:14 PDT
(In reply to Darin Adler from comment #8) > (In reply to Yusuke Suzuki from comment #6) > > I think this direction is better. We should not expose such an internal > > invariant of JSBigInt in serialization code. > > OK. Thanks!! I've updated.
Keith Miller
Comment 11 2020-04-21 16:13:29 PDT
Comment on attachment 397134 [details] Patch r=me. I was forgetting that the point of this is to be written to disk. This patch makes sense now.
Yusuke Suzuki
Comment 12 2020-04-21 16:15:22 PDT
I'll further add a test using `internals.deserializeBuffer` to test that this canonicalization happens correctly!
Yusuke Suzuki
Comment 13 2020-04-21 17:12:14 PDT
Created attachment 397143 [details] Patch for landing
Yusuke Suzuki
Comment 14 2020-04-21 17:18:38 PDT
Created attachment 397145 [details] Patch for landing
EWS
Comment 15 2020-04-21 19:36:02 PDT
Committed r260489: <https://trac.webkit.org/changeset/260489> All reviewed patches have been landed. Closing bug and clearing flags on attachment 397145 [details].
Radar WebKit Bug Importer
Comment 16 2020-04-21 19:37:15 PDT
Note You need to log in before you can comment on or make changes to this bug.