Compare runs

Pick two completed runs against the same test set. See what improved, regressed, and stayed the same.

Both runs must be completed and use the same test set.

How comparison works

Δ +Improved

B's score is higher than A's on the same case.

Δ −Regressed

B's score is lower than A's on the same case.

Δ 0Unchanged

Both runs scored the case the same.