Eval: Checklist Categorization

Run evaluations against our Checklist Categorization inference pipeline to measure and track performance.

Target Dataset

Select the dataset you want to run an evaluation against.

Model Configuration

https://platform.openai.com/docs/models

1

These are the model instructions used to tell the model how to behave. They are pre-loaded with our default system prompt, but can be updated here for tinkering. Note that the instructions intentionally use Markdown for emphasis.