stabilize tests

This commit is contained in:
master
2026-02-01 21:37:40 +02:00
parent 55744f6a39
commit 5d5e80b2e4
6435 changed files with 33984 additions and 13802 deletions

14
docs/benchmarks/README.md Normal file
View File

@@ -0,0 +1,14 @@
# Benchmarks
This directory contains benchmark specs, datasets, and evaluation guidance.
## Index
- `docs/benchmarks/performance-baselines.md`
- `docs/benchmarks/golden-corpus-kpis.md`
- `docs/benchmarks/fidelity-metrics.md`
- `docs/benchmarks/accuracy-metrics-framework.md`
## Usage Notes
- Benchmarks must be deterministic and offline-friendly.
- Store fixtures alongside their benchmark docs.
- Record expected ceilings and variance bounds in each benchmark spec.