There is a bug in how the GTK4 implementation takes a snapshot of the webview for pixel results that causes them to try to capture before the contents are actually rendered. This results in WTR just getting a black PNG when running a test, that is later useless for comparison. As a result of this, the GTK4 test bot has hundreds of failures, and can't be used reliably for finding regressions.