Datasets

Datasets let you curate sets of input/output pairs you can replay against prompt or model changes to catch regressions before they reach production.
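
The idea can be sketched in a few lines of Python. This is only an illustration of the concept, not this product's API: the `Example` class, the `replay` function, and the `candidate_v2` stand-in for a model call are all hypothetical names, and exact string matching is an assumed (and deliberately simple) comparison rule.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Example:
    """One curated input/output pair (hypothetical shape, not a product schema)."""
    input: str
    expected: str


def replay(dataset: List[Example], candidate: Callable[[str], str]) -> List[Example]:
    """Run every example through `candidate` and return the examples whose
    output no longer matches the expected value (exact match after strip)."""
    return [ex for ex in dataset if candidate(ex.input).strip() != ex.expected]


# A tiny hand-curated dataset.
dataset = [
    Example(input="2 + 2", expected="4"),
    Example(input="capital of France", expected="Paris"),
]


def candidate_v2(prompt: str) -> str:
    """Stand-in for a model call after a prompt change (hypothetical)."""
    canned = {"2 + 2": "4", "capital of France": "Lyon"}
    return canned.get(prompt, "")


# Replay the dataset against the changed candidate and report mismatches.
regressions = replay(dataset, candidate_v2)
for ex in regressions:
    print(f"regression on {ex.input!r}: expected {ex.expected!r}")
```

Running the same `replay` call before and after a prompt or model change shows which curated examples started failing, which is the regression check the description refers to.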

Documentation coming soon

We're still writing the full guide for datasets. In the meantime, see the AI Evals overview to learn how datasets fit alongside evaluations and trace reviews.
