Bug 163264 - B3->Air lowering needs the same defenses in effectiveAddr() that it has in tryAppendLea()
Status: RESOLVED FIXED
Alias: None
Product: WebKit
Classification: Unclassified
Component: JavaScriptCore
Version: WebKit Nightly Build
Hardware: All
OS: All
Importance: P2 Normal
Assignee: Filip Pizlo
 
Reported: 2016-10-10 20:47 PDT by Filip Pizlo
Modified: 2016-10-11 15:01 PDT
CC: 5 users

Attachments
the patch (9.02 KB, patch)
2016-10-11 09:49 PDT, Filip Pizlo
mark.lam: review+

Description Filip Pizlo 2016-10-10 20:47:06 PDT
There are two ways we may end up matching left-shifts into address calculations: effectiveAddr() and tryAppendLea(). tryAppendLea() does it right, effectiveAddr() does it wrong.
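For illustration, here is a minimal sketch of the kind of defense tryAppendLea() has and effectiveAddr() was missing. This is not WebKit's actual code and the helper names are hypothetical; the point is the guard: a left shift may only be folded into a BaseIndex scale when its amount is a known constant that maps to a scale the target accepts.

#include <cstdint>
#include <optional>

struct BaseIndexMatch {
    int base;       // stand-in for the base tmp
    int index;      // stand-in for the index tmp
    unsigned scale; // 1, 2, 4, or 8
};

// Hypothetical stand-in for the kind of query Arg::isValidIndexForm() answers.
static bool targetAcceptsScale(unsigned scale)
{
    return scale == 1 || scale == 2 || scale == 4 || scale == 8;
}

// Only match Add(base, Shl(index, amount)) into a BaseIndex when the shift
// amount is a constant in [0, 3]; anything else (e.g. 32, as in the test in
// comment 1) must fall back to computing the address explicitly.
static std::optional<BaseIndexMatch> matchScaledIndex(int base, int index, std::optional<int64_t> shiftAmount)
{
    if (!shiftAmount || *shiftAmount < 0 || *shiftAmount > 3)
        return std::nullopt;
    unsigned scale = 1u << static_cast<unsigned>(*shiftAmount);
    if (!targetAcceptsScale(scale))
        return std::nullopt;
    return BaseIndexMatch { base, index, scale };
}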
Comment 1 Filip Pizlo 2016-10-11 09:26:24 PDT
Yup, this test generates wrong code and crashes:

void testLoadBaseIndexShift32()
{
    Procedure proc;
    BasicBlock* root = proc.addBlock();
    root->appendNew<Value>(
        proc, Return, Origin(),
        root->appendNew<MemoryValue>(
            proc, Load, Int32, Origin(),
            root->appendNew<Value>(
                proc, Add, Origin(),
                root->appendNew<ArgumentRegValue>(proc, Origin(), GPRInfo::argumentGPR0),
                root->appendNew<Value>(
                    proc, Shl, Origin(),
                    root->appendNew<ArgumentRegValue>(proc, Origin(), GPRInfo::argumentGPR1),
                    root->appendNew<Const32Value>(proc, Origin(), 32)))));
    auto code = compile(proc);
    int32_t value = 12341234;
    char* ptr = bitwise_cast<char*>(&value);
    for (unsigned i = 0; i < 10; ++i)
        CHECK_EQ(invoke<int32_t>(*code, ptr - (static_cast<intptr_t>(1) << static_cast<intptr_t>(32)) * i, i), 12341234);
}
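For context, here is a standalone check of the arithmetic the test relies on (my reading of the test, assuming a 64-bit host; this is not part of the patch). The B3 program loads 32 bits from arg0 + (arg1 << 32), and the caller passes arg0 = &value - (1 << 32) * i and arg1 = i, so the effective address is always &value and every iteration should read 12341234. A shift amount of 32 has no BaseIndex encoding (only scales 1, 2, 4, and 8, i.e. shifts 0 through 3, are expressible), which is why folding it into the address is wrong.

#include <cassert>
#include <cstdint>

int main()
{
    int32_t value = 12341234;
    char* ptr = reinterpret_cast<char*>(&value);
    for (unsigned i = 0; i < 10; ++i) {
        // What the test passes to the compiled code.
        char* arg0 = ptr - (static_cast<intptr_t>(1) << 32) * i;
        intptr_t arg1 = i;
        // What a correct lowering must compute: the shift cancels the offset.
        char* effective = arg0 + (arg1 << 32);
        assert(effective == ptr);
        assert(*reinterpret_cast<int32_t*>(effective) == 12341234);
    }
    return 0;
}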
Comment 2 Filip Pizlo 2016-10-11 09:49:51 PDT
Created attachment 291271 [details]
the patch
Comment 3 Mark Lam 2016-10-11 10:30:21 PDT
Comment on attachment 291271 [details]
the patch

r=me if this works for ARM as well.  Otherwise, you might need to make this adjustment conditional on isX86 (if that's not already the case).

I could be mistaken, but I vaguely remember that for 32-bit ARM, I think the shift operand is internally masked with 31 by the CPU before executing the shift.  Hence, it's not possible to get a shift by 32.  I'm not sure of the treatment on ARM64.  Regardless, you should run the test on ARM64 as well to make sure that this does work there (if you haven't already done so).
Comment 4 Filip Pizlo 2016-10-11 10:55:55 PDT
(In reply to comment #3)
> Comment on attachment 291271 [details]
> the patch
> 
> r=me if this works for ARM as well.  Otherwise, you might need to make this
> adjustment conditional on isX86 (if that's not already the case).
> 
> I could be mistaken, but I vaguely remember that for 32-bit ARM, I think the
> shift operand is internally masked with 31 by the CPU before executing the
> shift.  Hence, it's not possible to get a shift by 32.  I'm not sure of the
> treatment on ARM64.  Regardless, you should run the test on ARM64 as well to
> make sure that this does work there (if you haven't already done so).

This is correct on ARM64. The semantics of B3 Shl are to mask the shift amount by 63 for 64-bit shifts and by 31 for 32-bit shifts. Since this code is about emitting BaseIndex instead of Shl, it only has to worry about B3's semantics of Shl (which don't change from architecture to architecture) and the target CPU's semantics for BaseIndex (which vary wildly, hence the isValidIndexForm call).

Incidentally, ARM64, x86, and x86-64 all agree with B3 about Shl. ARMv7 disagrees (it masks shift amounts by 255). But even on ARMv7 this code is right, since it's converting a Shl to a BaseIndex, not to ARMv7's shift instruction. To support ARMv7 we would leave this code alone, keep our same Shl semantics, and give our lowering of Shl on ARMv7 an extra legalization step that masks the shift amount with 31 or 63 to make ARMv7 comply with our expectations.
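To make the semantics being compared concrete, here is an executable model (illustrative only; these are not WebKit functions):

#include <cstdint>

// B3 Shl: the amount is masked to the operand width on every target.
uint32_t b3Shl32(uint32_t value, uint32_t amount) { return value << (amount & 31); }
uint64_t b3Shl64(uint64_t value, uint64_t amount) { return value << (amount & 63); }

// ARMv7 register-controlled LSL uses the bottom 8 bits of the amount, so a
// shift by 32 shifts everything out. A lowering for ARMv7 would therefore
// mask the amount with 31 (or 63) first to preserve B3's semantics.
uint32_t armv7Shl32(uint32_t value, uint32_t amount)
{
    amount &= 255;
    return amount >= 32 ? 0 : value << amount;
}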
Comment 5 Filip Pizlo 2016-10-11 13:42:05 PDT
Landed in https://trac.webkit.org/changeset/207163
Comment 6 Saam Barati 2016-10-11 14:58:32 PDT
Comment on attachment 291271 [details]
the patch

View in context: https://bugs.webkit.org/attachment.cgi?id=291271&action=review

> Source/JavaScriptCore/b3/testb3.cpp:13679
> +    if (isX86())
> +        checkUsesInstruction(*code, "(%rdi,%rsi,4)");

It would also be nice to check we do good things for ARM64 here? Will this optimization kick in on ARM64?
Comment 7 Filip Pizlo 2016-10-11 15:01:50 PDT
(In reply to comment #6)
> Comment on attachment 291271 [details]
> the patch
> 
> View in context:
> https://bugs.webkit.org/attachment.cgi?id=291271&action=review
> 
> > Source/JavaScriptCore/b3/testb3.cpp:13679
> > +    if (isX86())
> > +        checkUsesInstruction(*code, "(%rdi,%rsi,4)");
> 
> It would also be nice to check we do good things for ARM64 here? Will this
> optimization kick in on ARM64?

The ARM optimization is more restricted. I think it might kick in here, but I wasn't sure. If/when we start trying to get this right on ARM, we should write ARM-specific tests for that.

Also, the reason for this check is to make sure that the fix for this bug didn't completely break BaseIndex. For that purpose, it's enough to run the check on any architecture on which we regularly run tests.