202356 – Add some assertions to convertUTF8ToUTF16().

RESOLVED FIXED 202356

Add some assertions to convertUTF8ToUTF16().

https://bugs.webkit.org/show_bug.cgi?id=202356

Summary Add some assertions to convertUTF8ToUTF16().

Mark Lam

Reported 2019-09-29 16:10:19 PDT

<rdar://problem/52846813>

Attachments
proposed patch. (1.86 KB, patch) 2019-09-29 16:21 PDT, Mark Lam	saam: review+	Details Formatted Diff Diff
proposed patch. (1.56 KB, patch) 2019-09-29 18:42 PDT, Mark Lam	no flags	Details Formatted Diff Diff
Show Obsolete (1) View All Add attachment proposed patch, testcase, etc.

Mark Lam

Comment 1 2019-09-29 16:21:30 PDT

Created attachment 379820 [details] proposed patch.

Saam Barati

Comment 2 2019-09-29 16:30:58 PDT

Comment on attachment 379820 [details] proposed patch. View in context: https://bugs.webkit.org/attachment.cgi?id=379820&action=review > Source/WTF/wtf/unicode/UTF8Conversion.cpp:95 > + RELEASE_ASSERT(targetEnd - target <= std::numeric_limits<int>::max()); why does the code require this?

Mark Lam

Comment 3 2019-09-29 17:18:40 PDT

Comment on attachment 379820 [details] proposed patch. View in context: https://bugs.webkit.org/attachment.cgi?id=379820&action=review Thanks for the review. >> Source/WTF/wtf/unicode/UTF8Conversion.cpp:95 >> + RELEASE_ASSERT(targetEnd - target <= std::numeric_limits<int>::max()); > > why does the code require this? For the same reason as why there's a "RELEASE_ASSERT(sourceEnd - source <= std::numeric_limits<int>::max())" above: to validate that the client didn't pass in a targetEnd that is at a lower address than the targetStart.

Mark Lam

Comment 4 2019-09-29 17:20:27 PDT

Surprisingly, looks like these assertions are failing on some tests. I'm investigating.

Mark Lam

Comment 5 2019-09-29 18:38:03 PDT

(In reply to Mark Lam from comment #4) > Surprisingly, looks like these assertions are failing on some tests. I'm > investigating. Turns out, it's too aggressive to assert that (targetEnd - target <= std::numeric_limits<int>::max()). Some clients actually did the work to calculate the exact number of UChars needed, not just the upper bound. We should not penalize these clients. I'll upload an updated patch with a better assertion.

Mark Lam

Comment 6 2019-09-29 18:42:52 PDT

Created attachment 379821 [details] proposed patch. Let's try this on the EWS first.

Mark Lam

Comment 7 2019-09-30 10:31:38 PDT

Comment on attachment 379821 [details] proposed patch. Thanks for the review. Landing now.

WebKit Commit Bot

Comment 8 2019-09-30 11:15:34 PDT

Comment on attachment 379821 [details] proposed patch. Clearing flags on attachment: 379821 Committed r250520: <https://trac.webkit.org/changeset/250520>

WebKit Commit Bot

Comment 9 2019-09-30 11:15:36 PDT

All reviewed patches have been landed. Closing bug.

Saam Barati

Comment 10 2019-09-30 11:27:58 PDT

(In reply to Mark Lam from comment #3) > Comment on attachment 379820 [details] > proposed patch. > > View in context: > https://bugs.webkit.org/attachment.cgi?id=379820&action=review > > Thanks for the review. > > >> Source/WTF/wtf/unicode/UTF8Conversion.cpp:95 > >> + RELEASE_ASSERT(targetEnd - target <= std::numeric_limits<int>::max()); > > > > why does the code require this? > > For the same reason as why there's a "RELEASE_ASSERT(sourceEnd - source <= > std::numeric_limits<int>::max())" above: to validate that the client didn't > pass in a targetEnd that is at a lower address than the targetStart. This isn't quite right. You were halving the output rang.

Mark Lam

Comment 11 2019-09-30 12:03:19 PDT

(In reply to Saam Barati from comment #10) > (In reply to Mark Lam from comment #3) > > >> Source/WTF/wtf/unicode/UTF8Conversion.cpp:95 > > >> + RELEASE_ASSERT(targetEnd - target <= std::numeric_limits<int>::max()); > > > > > > why does the code require this? > > > > For the same reason as why there's a "RELEASE_ASSERT(sourceEnd - source <= > > std::numeric_limits<int>::max())" above: to validate that the client didn't > > pass in a targetEnd that is at a lower address than the targetStart. > > This isn't quite right. You were halving the output rang. I clarified this with Saam offline: I am not halving the output range because target and targetEnd are in units of UChar*. Hence, this assertion is correct.

Note You need to log in before you can comment on or make changes to this bug.

Status RESOLVED

Resolution FIXED

Priority P2

Severity Normal

Classification Unclassified

Version WebKit Nightly Build

Hardware Unspecified

OS Unspecified

Product WebKit

Component JavaScriptCore

Assignee

Mark Lam

Reported

2019-09-29 16:10 PDT

Modified

2019-09-30 12:03 PDT History

CC List

10 users Show

URL

Keywords InRadar

Depends on

Blocks