A couple of times, I noticed that if you leave it open for a long time, the result is different after you refresh, which means a freshly loaded page and a long-running one are not arriving at the same state. Unfortunately, I was too distracted to capture the state in both cases :(
We keep some old data around to narrow down the regression ranges. You might be observing that state.