<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>96842</bug_id>
          
          <creation_ts>2012-09-14 16:59:42 -0700</creation_ts>
          <short_desc>[chromium] lots of layout test failures on 10.7.4</short_desc>
          <delta_ts>2012-10-02 18:37:06 -0700</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>Tools / Tests</component>
          <version>528+ (Nightly build)</version>
          <rep_platform>Unspecified</rep_platform>
          <op_sys>Unspecified</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>WONTFIX</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>Normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Dirk Pranke">dpranke</reporter>
          <assigned_to name="Nobody">webkit-unassigned</assigned_to>
          <cc>chase</cc>
    
    <cc>dpranke</cc>
    
    <cc>nsylvain</cc>
    
    <cc>ojan</cc>
    
    <cc>rsesek</cc>
    
    <cc>thakis</cc>
    
    <cc>tony</cc>
    
    <cc>zmo</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>721254</commentid>
    <comment_count>0</comment_count>
    <who name="Dirk Pranke">dpranke</who>
    <bug_when>2012-09-14 16:59:42 -0700</bug_when>
    <thetext>it appears that things changed in 10.7.4 such that tests that run fine on 10.7.3 produce different baselines on 10.7.4. More details to follow.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>722290</commentid>
    <comment_count>1</comment_count>
    <who name="Dirk Pranke">dpranke</who>
    <bug_when>2012-09-17 14:48:53 -0700</bug_when>
    <thetext>fwiw, I have a local platform/chromium-mac-10_7_4 directory with ~160ish layout test results that I think account for most if not all of the failures. I&apos;m not posting this to the bug for now since I haven&apos;t really audited all of them, and it&apos;s unclear quite yet what to do with this.

Per zmo, it seems that we don&apos;t necessarily want to just upgrade the bots to 10.7.4 because that&apos;ll cause a lot of gpu tests to fail. On the other hand, 10.7.4 is what our customers are running, so arguably we should just suck it up.

We can:

1) create a 10.7.4 platform directory and keep it up-to-date by hand
2) spin up a 10.7.4 bot
3) wait for 10.7.5 and hope that fixes the gpu issues
4) suppress the failures as IMAGE (and IMAGE+TEXT etc) so that we don&apos;t need to track the baselines
5) ignore the problem and hope it goes away :)

Let me know if anyone has any strong leanings here.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>722329</commentid>
    <comment_count>2</comment_count>
    <who name="Ojan Vafai">ojan</who>
    <bug_when>2012-09-17 15:42:41 -0700</bug_when>
    <thetext>My vote would be for 1+2. But I&apos;d be fine with any combination of 1, 3, 4 or 5. Seems like we should at least do one of 1 or 4 though. 4 seems like the least amount of effort in the short-term.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>722332</commentid>
    <comment_count>3</comment_count>
    <who name="Dirk Pranke">dpranke</who>
    <bug_when>2012-09-17 15:45:26 -0700</bug_when>
    <thetext>I&apos;ll run locally w/ my 10.7.4 baselines for a while and see how much churn there is. Maybe updating by hand won&apos;t be too bad ... I&apos;d prefer to avoid 2) if we can, and I&apos;m not a bug fan of 4) since I&apos;m trying to move away from suppressions (and 160+ tests is a lot to suppress if we have no reason to expect this to get fixed soon).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>722964</commentid>
    <comment_count>4</comment_count>
    <who name="Tony Chang">tony</who>
    <bug_when>2012-09-18 12:04:48 -0700</bug_when>
    <thetext>How will the gpu tests fail?  Are these real failures that our users are seeing or is it a bug in our test infrastructure?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>722974</commentid>
    <comment_count>5</comment_count>
    <who name="Dirk Pranke">dpranke</who>
    <bug_when>2012-09-18 12:22:36 -0700</bug_when>
    <thetext>zmo can confirm but I believe they reflect real bugs in 10.7.4 that our users see.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>722980</commentid>
    <comment_count>6</comment_count>
    <who name="Tony Chang">tony</who>
    <bug_when>2012-09-18 12:27:32 -0700</bug_when>
    <thetext>(In reply to comment #5)
&gt; zmo can confirm but I believe they reflect real bugs in 10.7.4 that our users see.

In that case, I would just update the bots to 10.7.4 and either rebaseline or mark them in TestExpectations.  The failures are real, so it seems fine for us to mark them as such in TestExpectations.

I&apos;m not sure what benefit we get by running the tests on 10.7.3 if that&apos;s not what our users are running.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>723081</commentid>
    <comment_count>7</comment_count>
    <who name="Dirk Pranke">dpranke</who>
    <bug_when>2012-09-18 15:11:23 -0700</bug_when>
    <thetext>So, after running with a set of 10.7.4-specific baselines for a while, I&apos;m not seeing any real gpu-related weirdness during the layout tests; they seem fairly stable. Given that, and a general consensus that we should probably be testing what our customers are likely running, it seems like we should update the lion layout test bots to 10.7.4 and update the baselines.

I am not proposing updating the other chromium bots; I&apos;ll let zhenyao worry about those.

Anyone disagree?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>733305</commentid>
    <comment_count>8</comment_count>
    <who name="Dirk Pranke">dpranke</who>
    <bug_when>2012-10-02 18:37:06 -0700</bug_when>
    <thetext>We&apos;ve upgraded the canaries to 10.7.5 (and rebaselined), so I&apos;m closing this as WONTFIX ...</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>