RESOLVED FIXED 46452
[chromium] Updated test expectations to match the bots using new auto-update script.
https://bugs.webkit.org/show_bug.cgi?id=46452
Summary [chromium] Updated test expectations to match the bots using new auto-update ...
James Kozianski
Reported 2010-09-23 23:53:35 PDT
[chromium] Updated test expectations to match the bots using new auto-update script.
Attachments
Patch (4.07 KB, patch)
2010-09-23 23:56 PDT, James Kozianski
no flags
James Kozianski
Comment 1 2010-09-23 23:56:49 PDT
WebKit Commit Bot
Comment 2 2010-09-24 02:13:33 PDT
Comment on attachment 68656 [details] Patch Clearing flags on attachment: 68656 Committed r68244: <http://trac.webkit.org/changeset/68244>
WebKit Commit Bot
Comment 3 2010-09-24 02:13:38 PDT
All reviewed patches have been landed. Closing bug.
Tony Chang
Comment 4 2010-09-24 11:00:20 PDT
Where can I learn more about this script? What will the work flow be for converting BUG_AUTO into bugs? As a feature request, maybe it can guess when to use SLOW if the test is only failing in debug.
James Kozianski
Comment 5 2010-09-24 13:24:26 PDT
Ojan and I have been working on improving the webkit/tools/layout_tests/webkitpy/layout_tests/update_expectations_from_dashboard.py script. Its usage is undocumented, but in short it takes JSON data as input generated from http://test-results.appspot.com/dashboards/flakiness_dashboard.html#expectationsUpdate=true and uses it to modify test_expectations.txt. The modifications to the script are unreviewed, but once committed I'll add a section to the wiki explaining how to use it at http://dev.chromium.org/developers/testing/flakiness-dashboard. > What will the work flow be for converting BUG_AUTO into bugs? I'm not sure - Ojan, can you chime in? > As a feature request, maybe it can guess when to use SLOW if the test is only failing in debug. Yep, that sounds like a good idea. Could you provide a more specific heuristic?
Tony Chang
Comment 6 2010-09-24 14:16:27 PDT
(In reply to comment #5) > > As a feature request, maybe it can guess when to use SLOW if the test is only failing in debug. > > Yep, that sounds like a good idea. Could you provide a more specific heuristic? I'm not sure. I just bring it up because it looks like the last 5 entries to test_expectations.txt are just slow tests. But maybe not. One of the tests is for Release builds.
Ojan Vafai
Comment 7 2010-09-25 14:02:00 PDT
(In reply to comment #4) > Where can I learn more about this script? The working version is still not checked in. We'll document it once it's usable. This was our first pass at actually using it. > What will the work flow be for converting BUG_AUTO into bugs? That's a good question. I don't have a good answer. I was planning on bringing this up on chromium-dev soon. I'm open to suggestions. > As a feature request, maybe it can guess when to use SLOW if the test is only failing in debug. I've been meaning to get rid of SLOW. I think it's too complicated. Instead, we should just have a long timeout but give a short timeout to tests that we expect to timeout. How does that sound?
Tony Chang
Comment 8 2010-09-27 09:29:46 PDT
(In reply to comment #7) > (In reply to comment #4) > > What will the work flow be for converting BUG_AUTO into bugs? > > That's a good question. I don't have a good answer. I was planning on bringing this up on chromium-dev soon. I'm open to suggestions. I think whoever runs the tool should fill in bug numbers, right? We shouldn't check in expectations without bugs filed. > > As a feature request, maybe it can guess when to use SLOW if the test is only failing in debug. > > I've been meaning to get rid of SLOW. I think it's too complicated. Instead, we should just have a long timeout but give a short timeout to tests that we expect to timeout. How does that sound? I suspect that over time, more tests will time out (the long timeout) and the full test run will gradually get slower. I prefer fast by default with exceptions to make things slower. Maybe you're hoping that the auto-update script will detect and mark tests as slow? I'm not sure how easy that will be to do.
Note You need to log in before you can comment on or make changes to this bug.