Bug 208003 - EWS fails to retry build in case of kill-old-processes exception
Summary: EWS fails to retry build in case of kill-old-processes exception
Status: RESOLVED FIXED
Alias: None
Product: WebKit
Classification: Unclassified
Component: Tools / Tests
Version: Other
Hardware: Unspecified Unspecified
Importance: P2 Normal
Assignee: Aakash Jain
URL:
Keywords: InRadar
Depends on:
Blocks:
 
Reported: 2020-02-20 07:02 PST by Aakash Jain
Modified: 2020-02-28 13:08 PST
CC List: 5 users

See Also:


Attachments
Patch (1.51 KB, patch)
2020-02-20 07:04 PST, Aakash Jain

Description Aakash Jain 2020-02-20 07:02:50 PST
EWS retries the build when the kill-old-processes step fails (since https://trac.webkit.org/changeset/252324). However, EWS does not retry the build when kill-old-processes ends in an exception. Such exceptions have been noticed only on the WinCairo queue.

e.g.:
https://ews-build.webkit.org/#/builders/12/builds/17905
https://ews-build.webkit.org/#/builders/12/builds/17860
https://ews-build.webkit.org/#/builders/12/builds/17858

It would be nice if EWS could retry the build in case of a kill-old-processes exception as well.
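
To illustrate the requested behaviour, here is a minimal sketch. It is not the attached patch. The result codes mirror the constants in Buildbot's buildbot.process.results module but are redefined locally so the snippet runs on its own, and the helper name result_after_kill_old_processes is hypothetical.

```python
# Result codes, redefined locally; the values mirror Buildbot's
# buildbot.process.results constants.
SUCCESS, WARNINGS, FAILURE, SKIPPED, EXCEPTION, RETRY, CANCELLED = range(7)


def result_after_kill_old_processes(step_result):
    """Hypothetical helper: choose the build result after kill-old-processes.

    FAILURE was already turned into RETRY (per the description above);
    this sketch treats EXCEPTION the same way.
    """
    if step_result in (FAILURE, EXCEPTION):
        return RETRY
    # Any other result is passed through unchanged.
    return step_result
```

With this mapping, an exception in kill-old-processes yields RETRY instead of leaving the build in the exception state.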
Comment 1 Aakash Jain 2020-02-20 07:04:27 PST
Created attachment 391283 [details]
Patch
Comment 2 Aakash Jain 2020-02-20 07:05:02 PST
(In reply to Aakash Jain from comment #1)
> Created attachment 391283 [details]
> Patch
Speculative fix.
Comment 3 Jonathan Bedard 2020-02-20 07:46:53 PST
(In reply to Aakash Jain from comment #0)
> EWS retries the build in case of kill-old-processes failure (since
> https://trac.webkit.org/changeset/252324). However, EWS fails to retry build
> in case of kill-old-processes Exception. Exception in case of
> kill-old-processes is noticeable only on WinCairo queue.
> 
> e.g.:
> https://ews-build.webkit.org/#/builders/12/builds/17905
> https://ews-build.webkit.org/#/builders/12/builds/17860
> https://ews-build.webkit.org/#/builders/12/builds/17858
> 
> It would be nice if EWS can retry the build in case of kill-old-processes
> exception as well.

What even causes a kill-old-processes exception? If that script is having issues, I would bet that the entire machine is in trouble.
Comment 4 Aakash Jain 2020-02-20 08:06:43 PST
(In reply to Jonathan Bedard from comment #3)
> What even causes a kill-old-processes exception? 
Probably the machine is in a very bad state.
> If that script is having issues, I would bet that the entire machine is in trouble.
Yes. The idea is to retry the build so that the patch gets re-tested on some other machine. If we don't retry, the status-bubble for that patch will remain purple until someone manually retries the build.

Even though that particular machine will keep failing at kill-old-processes, the patch being tested will eventually be tested on some other machine because of this RETRY. Hopefully someone will notice that the machine is in a bad state and fix it eventually.

For example: https://ews-build.webkit.org/#/builders/22/builds/10002 failed at kill-old-processes; however, the build was retried, picked up by another bot (in https://ews-build.webkit.org/#/builders/22/builds/10034), and passed. The status-bubble for the corresponding patch 390315 (in Bug 207516) became green without any manual intervention (by a bot-watcher or the patch author).

A similar thing happened in https://ews-build.webkit.org/#/builders/3/builds/16167 and https://ews-build.webkit.org/#/builders/3/builds/16168

The retry kept the system working even though one bot was in a bad state.
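
The reasoning above can be modelled with a toy simulation. This is only a sketch under loud assumptions: bots are picked in simple round-robin order, which is not necessarily how Buildbot assigns workers, and run_with_retry, the bot names and max_attempts are all hypothetical.

```python
from collections import deque


def run_with_retry(bots, bad_bots, max_attempts=10):
    """Toy model: each attempt goes to the next bot in round-robin order.

    A bot listed in bad_bots fails at kill-old-processes, so the build is
    retried; a healthy bot completes the build. Returns the list of
    (bot, outcome) attempts.
    """
    queue = deque(bots)
    history = []
    for _ in range(max_attempts):
        bot = queue[0]
        queue.rotate(-1)  # the next attempt goes to the following bot
        if bot in bad_bots:
            history.append((bot, "RETRY"))
            continue
        history.append((bot, "SUCCESS"))
        break
    return history


attempts = run_with_retry(["bot1", "bot2"], bad_bots={"bot1"})
```

In this model the patch is first picked up by the bad bot, retried, and then passes on the healthy one, so no manual intervention is needed. If every bot were bad, the loop would simply stop after max_attempts retries.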
Comment 5 Jonathan Bedard 2020-02-20 11:03:46 PST
Comment on attachment 391283 [details]
Patch

Do we actually have a practical example of an exception?
Comment 6 Aakash Jain 2020-02-20 11:13:38 PST
> Do we actually have a practical example of an exception?
https://ews-build.webkit.org/#/builders/12/builds/17858

However, this is a speculative fix. I am not sure whether it would really work in case of an exception. It's hard to reproduce and I don't have a machine to test it on.
Comment 7 Jonathan Bedard 2020-02-20 13:05:46 PST
(In reply to Aakash Jain from comment #6)
> > Do we actually have a practical example of an exception?
> https://ews-build.webkit.org/#/builders/12/builds/17858
> 
> Although, this is speculative fix. I am not sure if it would really work in
> case of exception. It's hard to reproduce and I don't have machine to test
> it out.

Fair enough.

As I said in the other bug, if a machine fails kill-old-processes, it probably needs to be rebooted.
Comment 8 WebKit Commit Bot 2020-02-28 13:07:32 PST
Comment on attachment 391283 [details]
Patch

Clearing flags on attachment: 391283

Committed r257654: <https://trac.webkit.org/changeset/257654>
Comment 9 WebKit Commit Bot 2020-02-28 13:07:33 PST
All reviewed patches have been landed.  Closing bug.
Comment 10 Radar WebKit Bug Importer 2020-02-28 13:08:14 PST
<rdar://problem/59899567>