Files
git.stella-ops.org/bench/reachability-benchmark/tools/scorer
StellaOps Bot 8f54ffa203
Some checks failed
Docs CI / lint-and-preview (push) Has been cancelled
devportal-offline / build-offline (push) Has been cancelled
Mirror Thin Bundle Sign & Verify / mirror-sign (push) Has been cancelled
up
2025-11-29 11:37:00 +02:00
..
up
2025-11-29 11:37:00 +02:00

rb-score (placeholder)

Planned CLI to score reachability submissions against truth sets.

Future work (BENCH-SCORER-513-008):

  • Validate submission against schemas/submission.schema.json.
  • Validate truth against schemas/truth.schema.json.
  • Compute precision/recall/F1, explainability score (0-3), runtime stats, determinism rate.
  • Emit JSON report with stable ordering.

For now this folder is a stub; implementation will be added in task 513-008 once schemas stabilize.