167736 – [JSC] Add operationToInt32SensibleSlow to optimize kraken pbkdf2 and sha256

RESOLVED FIXED 167736

[JSC] Add operationToInt32SensibleSlow to optimize kraken pbkdf2 and sha256

https://bugs.webkit.org/show_bug.cgi?id=167736

Summary [JSC] Add operationToInt32SensibleSlow to optimize kraken pbkdf2 and sha256

Yusuke Suzuki

Reported 2017-02-02 09:05:20 PST

[JSC] Add operationToInt32Fallback to optimize kraken pbkdf2

Attachments
Patch (10.69 KB, patch) 2017-02-02 09:13 PST, Yusuke Suzuki	no flags	Details Formatted Diff Diff
Patch (16.15 KB, patch) 2017-02-03 05:09 PST, Yusuke Suzuki	saam: review+	Details Formatted Diff Diff
Show Obsolete (1) View All Add attachment proposed patch, testcase, etc.

Yusuke Suzuki

Comment 1 2017-02-02 09:13:11 PST

Created attachment 300414 [details] Patch WIP

Yusuke Suzuki

Comment 2 2017-02-02 09:13:44 PST

Still considering about the edge cases.

Yusuke Suzuki

Comment 3 2017-02-02 09:15:52 PST

CAUTION: Currently, this is only valid in x64 (sensible double to int32 case in DFG and FTL). And still I'm now considering about edge cases. BTW, the latest ARM64 has super fast double to int32 operation. If this instruction becomes available, we can optimize this in ARM64.

Yusuke Suzuki

Comment 4 2017-02-02 09:34:39 PST

Pf course, later, we should make DFG prediction better not to emit ValueToInt32 in such a case. But I think optimizing this sensitive double to int32 is worth doing. Note that kraken pbkdf2 consumes 1/3 time for operationToInt32. ;)

Yusuke Suzuki

Comment 5 2017-02-02 10:13:08 PST

This also improves kraken sha256 iterative.

Yusuke Suzuki

Comment 6 2017-02-02 11:17:13 PST

Comment on attachment 300414 [details] Patch View in context: https://bugs.webkit.org/attachment.cgi?id=300414&action=review > Source/JavaScriptCore/runtime/MathCommon.cpp:481 > + // If exponent < 0 there will be no bits to the left of the decimal point > + // after rounding; if the exponent is > 83 then no bits of precision can be > + // left in the low 32-bit range of the result (IEEE-754 doubles have 52 bits > + // of fractional precision). > + // Note this case handles 0, -0, and all infinite, NaN, & denormal value. > + if (exp < 0 || exp > 83) > + return 0; And I think this check is also unnecessary. 1. If exp < 0, truncate should succeed. 2. If exp > 83 (exp >= 84), the following code produces 0 as a result. (exp > 52, so static_cast<int32_t>(bits << (84 - 52)) -> 0)

Yusuke Suzuki

Comment 7 2017-02-03 05:09:23 PST

Created attachment 300518 [details] Patch

Saam Barati

Comment 8 2017-02-04 01:12:49 PST

Comment on attachment 300518 [details] Patch View in context: https://bugs.webkit.org/attachment.cgi?id=300518&action=review r=me > Source/JavaScriptCore/ChangeLog:63 > + 1ff: 66 48 0f 7e c0 movq %xmm0,%rax I wonder why this doesn't do a mov rdx, rax > Source/JavaScriptCore/runtime/MathCommon.cpp:470 > +int32_t JIT_OPERATION operationToInt32SensibleSlow(double number) IEE754 is fun

Yusuke Suzuki

Comment 9 2017-02-04 05:46:14 PST

Comment on attachment 300518 [details] Patch View in context: https://bugs.webkit.org/attachment.cgi?id=300518&action=review >> Source/JavaScriptCore/ChangeLog:63 >> + 1ff: 66 48 0f 7e c0 movq %xmm0,%rax > > I wonder why this doesn't do a mov rdx, rax Yeah, that's storage. But it's due to the GCC6's decision. >> Source/JavaScriptCore/runtime/MathCommon.cpp:470 >> +int32_t JIT_OPERATION operationToInt32SensibleSlow(double number) > > IEE754 is fun Yeah :)

Yusuke Suzuki

Comment 10 2017-02-04 05:47:51 PST

Committed r211670: <http://trac.webkit.org/changeset/211670>

Saam Barati

Comment 11 2017-02-04 13:16:37 PST

I've confirmed that this is a 1-1.5% progression on Mac Kraken.

Radar WebKit Bug Importer

Comment 12 2017-02-04 13:22:35 PST

<rdar://problem/30364482>

Note You need to log in before you can comment on or make changes to this bug.

Status RESOLVED

Resolution FIXED

Priority P2

Severity Normal

Classification Unclassified

Version WebKit Nightly Build

Hardware Unspecified

OS Unspecified

Product WebKit

Component New Bugs

Assignee

Yusuke Suzuki

Reported

2017-02-02 09:05 PST

Modified

2017-02-04 13:22 PST History

CC List

9 users Show

URL

Keywords InRadar

Depends on

Blocks