# AI Evaluation & Testing

_Methods and tools for rigorously evaluating AI models before deployment and supervising them after._

Comprehensive resources on AI evaluation and testing methodologies. From pre-deployment benchmarks to post-deployment supervision and testing strategies.

## Featured guide

- [/post/ai-evaluation-testing-comprehensive-guide](/post/ai-evaluation-testing-comprehensive-guide)

## Related articles

- [/post/ai-agent-evaluation-metrics-guide](/post/ai-agent-evaluation-metrics-guide)
- [/post/llm-evaluation-metrics-guide](/post/llm-evaluation-metrics-guide)

## Knowledge base

- [/ai-agent-evaluation](/ai-agent-evaluation)
- [/ml-model-testing](/ml-model-testing)
- [/model-evaluation-functions](/model-evaluation-functions)
- [/ai-model-performance](/ai-model-performance)
- [/ai-interrogation](/ai-interrogation)
- [/ml-model-lifecycle](/ml-model-lifecycle)

## Try Swept AI evaluation & supervision

Set the bar for your AI agents with evaluation scorecards, then supervise them in production.

→ [/offering/evaluation](/offering/evaluation)