This is because those tests are actually testing the effects of polymorphic operands on performance, and not the correctness of operations on objects.
Created attachment 267037 [details] proposed patch.
Thanks for the review. Landed in r193855: <http://trac.webkit.org/r193855>.