These tests barely fail. The differences are extremely minor, but since they're ref tests and we can't update expectations, we've had to skip them.