Open-source eval framework for red teaming and comparing LLM configurations.
Evaluation and Observability
The Agentic Workflow