Bug 186765 - [Armv7] Linkbuffer: executableOffsetFor() fails for location 2
Summary: [Armv7] Linkbuffer: executableOffsetFor() fails for location 2
Status: RESOLVED FIXED
Alias: None
Product: WebKit
Classification: Unclassified
Component: JavaScriptCore (show other bugs)
Version: WebKit Nightly Build
Hardware: Other Linux
: P2 Normal
Assignee: Nobody
URL:
Keywords: InRadar
Depends on:
Blocks:
 
Reported: 2018-06-18 07:39 PDT by Guillaume Emont
Modified: 2018-06-20 10:17 PDT (History)
13 users (show)

See Also:


Attachments
Patch (1.44 KB, patch)
2018-06-18 07:53 PDT, Guillaume Emont
no flags Details | Formatted Diff | Diff
Archive of layout-test-results from ltilve-gtk-wk2-ews for gtk-wk2 (2.80 MB, application/zip)
2018-06-18 19:22 PDT, Igalia-pontevedra EWS
no flags Details
Patch (1.68 KB, patch)
2018-06-19 03:20 PDT, Guillaume Emont
no flags Details | Formatted Diff | Diff
Patch (1.68 KB, patch)
2018-06-20 03:31 PDT, Guillaume Emont
no flags Details | Formatted Diff | Diff
Archive of layout-test-results from ews106 for mac-sierra-wk2 (3.38 MB, application/zip)
2018-06-20 05:19 PDT, EWS Watchlist
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Guillaume Emont 2018-06-18 07:39:36 PDT
On THUMB2, instructions can be 2 bytes long, and therefore are not guaranteed to be 4-aligned. This is the case for jump origins and targets, which means that the parameter of executableOffsetFor() can be the value 2, in which case executableOffsetFor() returns a value taken from before the start of the buffer. Since r231961, we see in unit tests cases where executableOffsetFor() is passed 2 as a parameter, which leads to the wrong offset being used and eventually a segfault as we generate a jump to an unpredictable address.
Comment 1 Guillaume Emont 2018-06-18 07:53:10 PDT
Created attachment 342936 [details]
Patch

This seems to fix it. I'm unsure of how recordLinkOffsets()/executableOffsetFor() are supposed to work on thumb2, as they seem to assume that jump origins and destinations are 4-byte aligned, though I might be misunderstanding something, as it looks like this has been working like that since the introduction of thumb2 support.
Comment 2 Mark Lam 2018-06-18 08:56:43 PDT
Comment on attachment 342936 [details]
Patch

View in context: https://bugs.webkit.org/attachment.cgi?id=342936&action=review

> Source/JavaScriptCore/assembler/LinkBuffer.h:300
> -        if (!location)
> +        if (location < sizeof(int32_t))

Why is it ever valid to pass a location of 2 and expect it to map to an offset of 0?  It looks to me like this change is trying to mask the bug instead of fixing it.  Please provide justification as why this is valid, or if this isn't the real issue, fix the real issue instead.
Comment 3 Mark Lam 2018-06-18 09:08:05 PDT
(In reply to Guillaume Emont from comment #0)
> On THUMB2, instructions can be 2 bytes long, and therefore are not
> guaranteed to be 4-aligned. This is the case for jump origins and targets,
> which means that the parameter of executableOffsetFor() can be the value 2,
> in which case executableOffsetFor() returns a value taken from before the
> start of the buffer. 

Please explain how an offset of 2 translates to "a value taken from before the start of the buffer".  Can you point to some specific code where this happens?

> Since r231961, we see in unit tests cases where
> executableOffsetFor() is passed 2 as a parameter, which leads to the wrong
> offset being used and eventually a segfault as we generate a jump to an
> unpredictable address.

A location of 2 should be valid for ARMv7.

(In reply to Guillaume Emont from comment #1)
> Created attachment 342936 [details]
> Patch
> 
> This seems to fix it. I'm unsure of how
> recordLinkOffsets()/executableOffsetFor() are supposed to work on thumb2, as
> they seem to assume that jump origins and destinations are 4-byte aligned,
> though I might be misunderstanding something, as it looks like this has been
> working like that since the introduction of thumb2 support.

You're possibly getting closer to the real issue here.
Comment 4 Mark Lam 2018-06-18 09:33:29 PDT
(In reply to Mark Lam from comment #3)
> (In reply to Guillaume Emont from comment #0)
> > On THUMB2, instructions can be 2 bytes long, and therefore are not
> > guaranteed to be 4-aligned. This is the case for jump origins and targets,
> > which means that the parameter of executableOffsetFor() can be the value 2,
> > in which case executableOffsetFor() returns a value taken from before the
> > start of the buffer. 
> 
> Please explain how an offset of 2 translates to "a value taken from before
> the start of the buffer".  Can you point to some specific code where this
> happens?

I see what you meant here.  The code at the end of executableOffsetFor() is where this can happen if the location is 2.  However, that doesn't mean the change you made is valid.

2 things you should do:
1. Find out how the location value of 2 came to be, and determine if it's a valid value or not.
2. If a location of 2 is valid, then the code in executableOffsetFor() and recordLinkOffsets() might need a different implementation for ARMv7.

The answers to the above depends on how the branch compaction feature (guarded by ENABLE(BRANCH_COMPACTION)) works.
Comment 5 Guillaume Emont 2018-06-18 09:38:39 PDT
(In reply to Mark Lam from comment #3)
> (In reply to Guillaume Emont from comment #0)
> > On THUMB2, instructions can be 2 bytes long, and therefore are not
> > guaranteed to be 4-aligned. This is the case for jump origins and targets,
> > which means that the parameter of executableOffsetFor() can be the value 2,
> > in which case executableOffsetFor() returns a value taken from before the
> > start of the buffer. 
> 
> Please explain how an offset of 2 translates to "a value taken from before
> the start of the buffer".  Can you point to some specific code where this
> happens?

In executableOffsetFor() we do:

  return bitwise_cast<int32_t*>(m_assemblerStorage.buffer())[location / sizeof(int32_t) - 1]

If location is 2, our index (location / sizeof(int32_t) - 1) is -1.

> 
> > Since r231961, we see in unit tests cases where
> > executableOffsetFor() is passed 2 as a parameter, which leads to the wrong
> > offset being used and eventually a segfault as we generate a jump to an
> > unpredictable address.
> 
> A location of 2 should be valid for ARMv7.
> 
> (In reply to Guillaume Emont from comment #1)
> > Created attachment 342936 [details]
> > Patch
> > 
> > This seems to fix it. I'm unsure of how
> > recordLinkOffsets()/executableOffsetFor() are supposed to work on thumb2, as
> > they seem to assume that jump origins and destinations are 4-byte aligned,
> > though I might be misunderstanding something, as it looks like this has been
> > working like that since the introduction of thumb2 support.
> 
> You're possibly getting closer to the real issue here.

That's what I was half guessing, though I'm surprised because it's been like that for so long. I guess we got lucky. I see two ideas on how to fix this, and am unsure of which one is best yet:
 - force alignment of jumps and labels to 4 bytes on CPU(ARM_THUMB2). That might partly negate the benefits of jump compaction.
or
 - change the way recordLinkOffsets()/executableOffsetFor() work. The issue is that we want the offset table to fit where the code was, like it is currently (avoiding the use of extra memory), we would have to either store the offsets on 16 bits (not sure it's enough), or find a clever way to store them, if one exists, that can be proved to fit in the amount of memory available. The alternative would be to add an extra structure (hashtable?) to keep track of these offsets. Either way would likely make things slower for arm64, so we might want to have different code paths, e.g. depending on sizeof(InstructionType).

Any thoughts on that?
Comment 6 Guillaume Emont 2018-06-18 10:07:28 PDT
(In reply to Mark Lam from comment #4)
> (In reply to Mark Lam from comment #3)
> > (In reply to Guillaume Emont from comment #0)
> > > On THUMB2, instructions can be 2 bytes long, and therefore are not
> > > guaranteed to be 4-aligned. This is the case for jump origins and targets,
> > > which means that the parameter of executableOffsetFor() can be the value 2,
> > > in which case executableOffsetFor() returns a value taken from before the
> > > start of the buffer. 
> > 
> > Please explain how an offset of 2 translates to "a value taken from before
> > the start of the buffer".  Can you point to some specific code where this
> > happens?
> 
> I see what you meant here.  The code at the end of executableOffsetFor() is
> where this can happen if the location is 2.  However, that doesn't mean the
> change you made is valid.
> 
> 2 things you should do:
> 1. Find out how the location value of 2 came to be, and determine if it's a
> valid value or not.

In LinkBuffer::copyCompactAndLinkCode() we have:

            if (jumpsToLink[i].to() >= jumpsToLink[i].from())
                target = codeOutData + jumpsToLink[i].to() - offset; // Compensate for what we have collapsed so far
            else
                target = codeOutData + jumpsToLink[i].to() - executableOffsetFor(jumpsToLink[i].to());

In that case, if the code we're linking is something like:
  2 byte instruction
  label:
  more instructions
  jmp label

We would have a .to() == 2. Indeed we see this with a few unit tests since the
InstanceOf inline cache has been added. I think that these values are valid, as
it makes sense to be able to branch to the second instruction.



> 2. If a location of 2 is valid, then the code in executableOffsetFor() and
> recordLinkOffsets() might need a different implementation for ARMv7.

Yes, it looks like that's where we're heading.
> 
> The answers to the above depends on how the branch compaction feature
> (guarded by ENABLE(BRANCH_COMPACTION)) works.
Comment 7 Michael Saboff 2018-06-18 10:50:19 PDT
Comment on attachment 342936 [details]
Patch

View in context: https://bugs.webkit.org/attachment.cgi?id=342936&action=review

r=me

>> Source/JavaScriptCore/assembler/LinkBuffer.h:300
>> +        if (location < sizeof(int32_t))
> 
> Why is it ever valid to pass a location of 2 and expect it to map to an offset of 0?  It looks to me like this change is trying to mask the bug instead of fixing it.  Please provide justification as why this is valid, or if this isn't the real issue, fix the real issue instead.

I'm fine with this change.  Since the buffer records the offsets prior to the current location and the offsets are stored as int32_t values, this works.  You might want to add a comment that no compaction can happen before this point.
Comment 8 Mark Lam 2018-06-18 11:00:55 PDT
Comment on attachment 342936 [details]
Patch

View in context: https://bugs.webkit.org/attachment.cgi?id=342936&action=review

>>> Source/JavaScriptCore/assembler/LinkBuffer.h:300
>>> +        if (location < sizeof(int32_t))
>> 
>> Why is it ever valid to pass a location of 2 and expect it to map to an offset of 0?  It looks to me like this change is trying to mask the bug instead of fixing it.  Please provide justification as why this is valid, or if this isn't the real issue, fix the real issue instead.
> 
> I'm fine with this change.  Since the buffer records the offsets prior to the current location and the offsets are stored as int32_t values, this works.  You might want to add a comment that no compaction can happen before this point.

For the record, I withdraw my objection after speaking with Michael and looking at the code.  Please add the comment that the reason this works is because at (offset < sizeof(int32_t)), no compaction could have happened before this point as the assembler could not have placed a branch instruction within this space that required some compaction.  Hence, 0 is the correct result.
Comment 9 Guillaume Emont 2018-06-18 11:09:03 PDT
(In reply to Michael Saboff from comment #7)
> Comment on attachment 342936 [details]
> Patch
> 
> View in context:
> https://bugs.webkit.org/attachment.cgi?id=342936&action=review
> 
> r=me
> 
> >> Source/JavaScriptCore/assembler/LinkBuffer.h:300
> >> +        if (location < sizeof(int32_t))
> > 
> > Why is it ever valid to pass a location of 2 and expect it to map to an offset of 0?  It looks to me like this change is trying to mask the bug instead of fixing it.  Please provide justification as why this is valid, or if this isn't the real issue, fix the real issue instead.
> 
> I'm fine with this change.  Since the buffer records the offsets prior to
> the current location and the offsets are stored as int32_t values, this
> works.  You might want to add a comment that no compaction can happen before
> this point.


I'm wondering though, how can we be sure that executableOffsetFor() is always correct, since the offsets stored have a granularity of 4 bytes, but instruction addresses have a granularity of 2? Like, what if we have:

  ...
  jmp label1 ; 2-byte instruction
  label2:
  jmp label3
  ...

then could we have executableOffsetFor(label2) return the wrong offset? And if not, why?
Comment 10 Mark Lam 2018-06-18 11:28:27 PDT
(In reply to Guillaume Emont from comment #9)
> (In reply to Michael Saboff from comment #7)
> > Comment on attachment 342936 [details]
> > Patch
> > 
> > View in context:
> > https://bugs.webkit.org/attachment.cgi?id=342936&action=review
> > 
> > r=me
> > 
> > >> Source/JavaScriptCore/assembler/LinkBuffer.h:300
> > >> +        if (location < sizeof(int32_t))
> > > 
> > > Why is it ever valid to pass a location of 2 and expect it to map to an offset of 0?  It looks to me like this change is trying to mask the bug instead of fixing it.  Please provide justification as why this is valid, or if this isn't the real issue, fix the real issue instead.
> > 
> > I'm fine with this change.  Since the buffer records the offsets prior to
> > the current location and the offsets are stored as int32_t values, this
> > works.  You might want to add a comment that no compaction can happen before
> > this point.
> 
> 
> I'm wondering though, how can we be sure that executableOffsetFor() is
> always correct, since the offsets stored have a granularity of 4 bytes, but
> instruction addresses have a granularity of 2? Like, what if we have:
> 
>   ...
>   jmp label1 ; 2-byte instruction
>   label2:
>   jmp label3
>   ...
> 
> then could we have executableOffsetFor(label2) return the wrong offset? And
> if not, why?

Because it's storing the offset in the old (before compaction) buffer, which at minimum would have allocated more than 1 instruction per branch.
Comment 11 Igalia-pontevedra EWS 2018-06-18 19:22:43 PDT
Comment on attachment 342936 [details]
Patch

Attachment 342936 [details] did not pass gtk-wk2-ews (gtk-wk2):
Output: http://webkit-queues.webkit.org/results/8239453

New failing tests:
fast/files/file-constructor.html
Comment 12 Igalia-pontevedra EWS 2018-06-18 19:22:47 PDT
Created attachment 343007 [details]
Archive of layout-test-results from ltilve-gtk-wk2-ews for gtk-wk2

The attached test failures were seen while running run-webkit-tests on the gtk-wk2-ews.
Bot: ltilve-gtk-wk2-ews  Port: gtk-wk2  Platform: Linux-4.16.0-0.bpo.1-amd64-x86_64-with-debian-9.4
Comment 13 Guillaume Emont 2018-06-19 03:20:15 PDT
Created attachment 343041 [details]
Patch

Updated patch for landing that adds the required comment and adds a static_cast to make clang happy.
Comment 14 Guillaume Emont 2018-06-19 03:21:54 PDT
(In reply to Igalia-pontevedra EWS from comment #11)
> Comment on attachment 342936 [details]
> Patch
> 
> Attachment 342936 [details] did not pass gtk-wk2-ews (gtk-wk2):
> Output: http://webkit-queues.webkit.org/results/8239453
> 
> New failing tests:
> fast/files/file-constructor.html

The issues raised by the gtk-wk2-ews should be issues with the bot, since the  code modified only gets compiled on arm and arm64.
Comment 15 Guillaume Emont 2018-06-20 03:31:57 PDT
Created attachment 343148 [details]
Patch

Real patch for landing (added review)
Comment 16 EWS Watchlist 2018-06-20 05:19:48 PDT
Comment on attachment 343148 [details]
Patch

Attachment 343148 [details] did not pass mac-wk2-ews (mac-wk2):
Output: https://webkit-queues.webkit.org/results/8263727

New failing tests:
http/tests/misc/object-embedding-svg-delayed-size-negotiation-2.htm
Comment 17 EWS Watchlist 2018-06-20 05:19:50 PDT
Created attachment 343152 [details]
Archive of layout-test-results from ews106 for mac-sierra-wk2

The attached test failures were seen while running run-webkit-tests on the mac-wk2-ews.
Bot: ews106  Port: mac-sierra-wk2  Platform: Mac OS X 10.12.6
Comment 18 Guillaume Emont 2018-06-20 05:48:27 PDT
(In reply to Build Bot from comment #16)
> Comment on attachment 343148 [details]
> Patch
> 
> Attachment 343148 [details] did not pass mac-wk2-ews (mac-wk2):
> Output: https://webkit-queues.webkit.org/results/8263727
> 
> New failing tests:
> http/tests/misc/object-embedding-svg-delayed-size-negotiation-2.htm

The modified code is guarded by ENABLE(BRANCH_COMPACTION), which is only true on ARM_THUMB2 and ARM64, so I think only the output from the ios and ios-sim EWS bots is relevant.
Comment 19 WebKit Commit Bot 2018-06-20 10:16:28 PDT
Comment on attachment 343148 [details]
Patch

Clearing flags on attachment: 343148

Committed r233015: <https://trac.webkit.org/changeset/233015>
Comment 20 WebKit Commit Bot 2018-06-20 10:16:30 PDT
All reviewed patches have been landed.  Closing bug.
Comment 21 Radar WebKit Bug Importer 2018-06-20 10:17:23 PDT
<rdar://problem/41293679>