Benchmarks
Datasets for evaluation and monitoring
Scenarios
LLM-designed evaluation plans
Metric Sets
Scored metrics with scopes and versions