Run evaluations against our Checklist Categorization inference pipeline to measure and track performance.
Select the dataset you want to run an evaluation against.
https://platform.openai.com/docs/models
https://community.openai.com/t/cheat-sheet-mastering-temperature-and-top-p-in-chatgpt-api/172683
These are the model instructions used to tell the model how to behave. They are pre-loaded with our default system prompt, but can be updated here for tinkering. Note that the instructions intentionally use Markdown for emphasis.