Back to Test sets

Test sets/New

New test set

Name it, describe what it covers. You'll add cases on the next screen.

Injected into judge / cluster / compare prompts so eval generalizes beyond customer support.

Cancel