<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>208003</bug_id>
          
          <creation_ts>2020-02-20 07:02:50 -0800</creation_ts>
          <short_desc>EWS fails to retry build in case of kill-old-processes exception</short_desc>
          <delta_ts>2020-02-28 13:08:14 -0800</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>Tools / Tests</component>
          <version>Other</version>
          <rep_platform>Unspecified</rep_platform>
          <op_sys>Unspecified</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>FIXED</resolution>
          
          <see_also>https://bugs.webkit.org/show_bug.cgi?id=203858</see_also>
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords>InRadar</keywords>
          <priority>P2</priority>
          <bug_severity>Normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Aakash Jain">aakash_jain</reporter>
          <assigned_to name="Aakash Jain">aakash_jain</assigned_to>
          <cc>aakash_jain</cc>
    
    <cc>ap</cc>
    
    <cc>commit-queue</cc>
    
    <cc>jbedard</cc>
    
    <cc>webkit-bug-importer</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>1621192</commentid>
    <comment_count>0</comment_count>
    <who name="Aakash Jain">aakash_jain</who>
    <bug_when>2020-02-20 07:02:50 -0800</bug_when>
    <thetext>EWS retries the build in case of kill-old-processes failure (since https://trac.webkit.org/changeset/252324). However, EWS fails to retry build in case of kill-old-processes Exception. Exception in case of kill-old-processes is noticeable only on WinCairo queue.

e.g.:
https://ews-build.webkit.org/#/builders/12/builds/17905
https://ews-build.webkit.org/#/builders/12/builds/17860
https://ews-build.webkit.org/#/builders/12/builds/17858

It would be nice if EWS can retry the build in case of kill-old-processes exception as well.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1621193</commentid>
    <comment_count>1</comment_count>
      <attachid>391283</attachid>
    <who name="Aakash Jain">aakash_jain</who>
    <bug_when>2020-02-20 07:04:27 -0800</bug_when>
    <thetext>Created attachment 391283
Patch</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1621194</commentid>
    <comment_count>2</comment_count>
    <who name="Aakash Jain">aakash_jain</who>
    <bug_when>2020-02-20 07:05:02 -0800</bug_when>
    <thetext>(In reply to Aakash Jain from comment #1)
&gt; Created attachment 391283 [details]
&gt; Patch
Speculative fix.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1621217</commentid>
    <comment_count>3</comment_count>
    <who name="Jonathan Bedard">jbedard</who>
    <bug_when>2020-02-20 07:46:53 -0800</bug_when>
    <thetext>(In reply to Aakash Jain from comment #0)
&gt; EWS retries the build in case of kill-old-processes failure (since
&gt; https://trac.webkit.org/changeset/252324). However, EWS fails to retry build
&gt; in case of kill-old-processes Exception. Exception in case of
&gt; kill-old-processes is noticeable only on WinCairo queue.
&gt; 
&gt; e.g.:
&gt; https://ews-build.webkit.org/#/builders/12/builds/17905
&gt; https://ews-build.webkit.org/#/builders/12/builds/17860
&gt; https://ews-build.webkit.org/#/builders/12/builds/17858
&gt; 
&gt; It would be nice if EWS can retry the build in case of kill-old-processes
&gt; exception as well.

What even causes a kill-old-processes exception? If that script is having issues, I would bet that the entire machine is in trouble.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1621223</commentid>
    <comment_count>4</comment_count>
    <who name="Aakash Jain">aakash_jain</who>
    <bug_when>2020-02-20 08:06:43 -0800</bug_when>
    <thetext>(In reply to Jonathan Bedard from comment #3)
&gt; What even causes a kill-old-processes exception? 
Probably the machine is in a very bad state.
&gt; If that script is having issues, I would bet that the entire machine is in trouble.
Yes. The idea is to retry the build so that the patch gets re-tested on some other machine. If we don&apos;t retry the status-bubble for that patch will remain purple, until someone manually retries the build.

Even though that particular machine will keep failing at kill-old-processes, the patch being tested will eventually be tested on some other machine because of this RETRY. Hopefully someone would notice that the machine is in bad state and fix it eventually.

For example: https://ews-build.webkit.org/#/builders/22/builds/10002 failed with kill-old-processed, however the build was retried and picked up by another bot (in https://ews-build.webkit.org/#/builders/22/builds/10034) and passed. The status-bubble for corresponding patch 390315 (in Bug 207516) became green without any manual intervention (of bot-watcher or patch author). 

Similar thing in https://ews-build.webkit.org/#/builders/3/builds/16167 and https://ews-build.webkit.org/#/builders/3/builds/16168

Retry made the system work fine even though one bot was in bad state.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1621306</commentid>
    <comment_count>5</comment_count>
      <attachid>391283</attachid>
    <who name="Jonathan Bedard">jbedard</who>
    <bug_when>2020-02-20 11:03:46 -0800</bug_when>
    <thetext>Comment on attachment 391283
Patch

Do we actually have a practical example of an exception?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1621310</commentid>
    <comment_count>6</comment_count>
    <who name="Aakash Jain">aakash_jain</who>
    <bug_when>2020-02-20 11:13:38 -0800</bug_when>
    <thetext>&gt; Do we actually have a practical example of an exception?
https://ews-build.webkit.org/#/builders/12/builds/17858

Although, this is speculative fix. I am not sure if it would really work in case of exception. It&apos;s hard to reproduce and I don&apos;t have machine to test it out.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1621378</commentid>
    <comment_count>7</comment_count>
    <who name="Jonathan Bedard">jbedard</who>
    <bug_when>2020-02-20 13:05:46 -0800</bug_when>
    <thetext>(In reply to Aakash Jain from comment #6)
&gt; &gt; Do we actually have a practical example of an exception?
&gt; https://ews-build.webkit.org/#/builders/12/builds/17858
&gt; 
&gt; Although, this is speculative fix. I am not sure if it would really work in
&gt; case of exception. It&apos;s hard to reproduce and I don&apos;t have machine to test
&gt; it out.

Fair enough.

As I said in the other bug, if a machine fails kill-old-processes, it probably needs to be rebooted.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1624085</commentid>
    <comment_count>8</comment_count>
      <attachid>391283</attachid>
    <who name="WebKit Commit Bot">commit-queue</who>
    <bug_when>2020-02-28 13:07:32 -0800</bug_when>
    <thetext>Comment on attachment 391283
Patch

Clearing flags on attachment: 391283

Committed r257654: &lt;https://trac.webkit.org/changeset/257654&gt;</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1624086</commentid>
    <comment_count>9</comment_count>
    <who name="WebKit Commit Bot">commit-queue</who>
    <bug_when>2020-02-28 13:07:33 -0800</bug_when>
    <thetext>All reviewed patches have been landed.  Closing bug.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1624087</commentid>
    <comment_count>10</comment_count>
    <who name="Radar WebKit Bug Importer">webkit-bug-importer</who>
    <bug_when>2020-02-28 13:08:14 -0800</bug_when>
    <thetext>&lt;rdar://problem/59899567&gt;</thetext>
  </long_desc>
      
          <attachment
              isobsolete="0"
              ispatch="1"
              isprivate="0"
          >
            <attachid>391283</attachid>
            <date>2020-02-20 07:04:27 -0800</date>
            <delta_ts>2020-02-28 13:07:32 -0800</delta_ts>
            <desc>Patch</desc>
            <filename>bug-208003-20200220100426.patch</filename>
            <type>text/plain</type>
            <size>1543</size>
            <attacher name="Aakash Jain">aakash_jain</attacher>
            
              <data encoding="base64">SW5kZXg6IFRvb2xzL0NoYW5nZUxvZwo9PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09
PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09Ci0tLSBUb29scy9DaGFuZ2VMb2cJKHJl
dmlzaW9uIDI1NzA1NSkKKysrIFRvb2xzL0NoYW5nZUxvZwkod29ya2luZyBjb3B5KQpAQCAtMSwz
ICsxLDE0IEBACisyMDIwLTAyLTIwICBBYWthc2ggSmFpbiAgPGFha2FzaF9qYWluQGFwcGxlLmNv
bT4KKworICAgICAgICBFV1MgZmFpbHMgdG8gcmV0cnkgYnVpbGQgaW4gY2FzZSBvZiBraWxsLW9s
ZC1wcm9jZXNzZXMgZXhjZXB0aW9uCisgICAgICAgIGh0dHBzOi8vYnVncy53ZWJraXQub3JnL3No
b3dfYnVnLmNnaT9pZD0yMDgwMDMKKworICAgICAgICBSZXZpZXdlZCBieSBOT0JPRFkgKE9PUFMh
KS4KKworICAgICAgICAqIEJ1aWxkU2xhdmVTdXBwb3J0L2V3cy1idWlsZC9zdGVwcy5weToKKyAg
ICAgICAgKEtpbGxPbGRQcm9jZXNzZXMuZXZhbHVhdGVDb21tYW5kKToKKyAgICAgICAgKEtpbGxP
bGRQcm9jZXNzZXMuZ2V0UmVzdWx0U3VtbWFyeSk6CisKIDIwMjAtMDItMjAgIERvbiBPbG1zdGVh
ZCAgPGRvbi5vbG1zdGVhZEBzb255LmNvbT4KIAogICAgICAgICBbQ01ha2VdIEFkZCBXUEU6Omxp
YndwZSB0YXJnZXQKSW5kZXg6IFRvb2xzL0J1aWxkU2xhdmVTdXBwb3J0L2V3cy1idWlsZC9zdGVw
cy5weQo9PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09
PT09PT09PT09PT09PT09Ci0tLSBUb29scy9CdWlsZFNsYXZlU3VwcG9ydC9ld3MtYnVpbGQvc3Rl
cHMucHkJKHJldmlzaW9uIDI1NzA1NSkKKysrIFRvb2xzL0J1aWxkU2xhdmVTdXBwb3J0L2V3cy1i
dWlsZC9zdGVwcy5weQkod29ya2luZyBjb3B5KQpAQCAtMTM3MSwxMiArMTM3MSwxMiBAQCBjbGFz
cyBLaWxsT2xkUHJvY2Vzc2VzKHNoZWxsLkNvbXBpbGUpOgogICAgICAgICBzdXBlcihLaWxsT2xk
UHJvY2Vzc2VzLCBzZWxmKS5fX2luaXRfXyh0aW1lb3V0PTYwLCBsb2dFbnZpcm9uPUZhbHNlLCAq
Kmt3YXJncykKIAogICAgIGRlZiBldmFsdWF0ZUNvbW1hbmQoc2VsZiwgY21kKToKLSAgICAgICAg
aWYgY21kLmRpZEZhaWwoKToKKyAgICAgICAgaWYgc2VsZi5yZXN1bHRzIGluIFtGQUlMVVJFLCBF
WENFUFRJT05dOgogICAgICAgICAgICAgc2VsZi5idWlsZC5idWlsZEZpbmlzaGVkKFsnRmFpbGVk
IHRvIGtpbGwgb2xkIHByb2Nlc3NlcywgcmV0cnlpbmcgYnVpbGQnXSwgUkVUUlkpCiAgICAgICAg
IHJldHVybiBzaGVsbC5Db21waWxlLmV2YWx1YXRlQ29tbWFuZChzZWxmLCBjbWQpCiAKICAgICBk
ZWYgZ2V0UmVzdWx0U3VtbWFyeShzZWxmKToKLSAgICAgICAgaWYgc2VsZi5yZXN1bHRzID09IEZB
SUxVUkU6CisgICAgICAgIGlmIHNlbGYucmVzdWx0cyBpbiBbRkFJTFVSRSwgRVhDRVBUSU9OXToK
ICAgICAgICAgICAgIHJldHVybiB7dSdzdGVwJzogdSdGYWlsZWQgdG8ga2lsbCBvbGQgcHJvY2Vz
c2VzJ30KICAgICAgICAgcmV0dXJuIHNoZWxsLkNvbXBpbGUuZ2V0UmVzdWx0U3VtbWFyeShzZWxm
KQogCg==
</data>

          </attachment>
      

    </bug>

</bugzilla>