AI Authority Classification Engine
Module
Attestor
Status
IMPLEMENTED
Description
Authority classification engine that determines whether AI outputs are evidence-backed (authoritative) or suggestion-only, with configurable thresholds and scoring across multiple artifact types.
Implementation Details
- Core Classifier: `src/Attestor/__Libraries/StellaOps.Attestor.ProofChain/Predicates/AI/AIAuthorityClassifier.cs` -- partial class with a `DetermineAuthority()` method that evaluates `citationRate`, `verifiedRate`, `confidenceScore`, and `qualityScore` against configurable thresholds.
- Artifact-Specific Scoring: Partial files implement scoring for each artifact type:
  - `AIAuthorityClassifier.Explanation.cs` / `AIAuthorityClassifier.ExplanationScore.cs` -- explanation classification and scoring
  - `AIAuthorityClassifier.PolicyDraft.cs` / `AIAuthorityClassifier.PolicyDraftScore.cs` -- policy draft classification
  - `AIAuthorityClassifier.Remediation.cs` / `AIAuthorityClassifier.RemediationScore.cs` -- remediation plan classification
  - `AIAuthorityClassifier.VexDraft.cs` / `AIAuthorityClassifier.VexDraftScore.cs` -- VEX statement draft classification
- Authority Levels: `AIArtifactAuthority.cs` -- `Suggestion` (no evidence backing), `EvidenceBacked` (citations verified, evidence resolvable), `AuthorityThreshold` (auto-processing eligible)
- Thresholds Config: `AIAuthorityThresholds.cs` -- `MinCitationRate` (0.8), `MinConfidenceScore` (0.7), `MinVerifiedCitationRate` (0.9), `AuthorityThresholdScore` (0.95), `RequireResolvableEvidence` (true)
- Classification Result: `AIAuthorityClassificationResult.cs` -- captures authority level, reasons, and individual scores
- Evidence Resolution: Constructor accepts an optional `Func<string, bool>` evidence resolver to verify that cited evidence IDs are resolvable
- Tests: `__Libraries/StellaOps.Attestor.ProofChain.Tests/AI/AIAuthorityClassifierTests.cs`
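
The threshold comparison described above can be sketched roughly as follows. This is a minimal illustration, not the actual `AIAuthorityClassifier` code: the `Sketch*` types are hypothetical stand-ins, the reason strings are invented, and the order of the checks (quality score first) is an assumption inferred from the test plan. Only the metric names and default threshold values come from this page. It assumes C# 9 or later for records.

```csharp
using System.Collections.Generic;

// Hypothetical stand-ins for illustration only. The real types live in
// AIArtifactAuthority.cs, AIAuthorityThresholds.cs and
// AIAuthorityClassificationResult.cs and may be shaped differently.
public enum SketchAuthority { Suggestion, EvidenceBacked, AuthorityThreshold }

// Defaults mirror the threshold values listed above.
public sealed record SketchThresholds(
    double MinCitationRate = 0.8,
    double MinConfidenceScore = 0.7,
    double MinVerifiedCitationRate = 0.9,
    double AuthorityThresholdScore = 0.95);

public sealed record SketchResult(SketchAuthority Authority, IReadOnlyList<string> Reasons);

public static class SketchClassifier
{
    public static SketchResult DetermineAuthority(
        double citationRate,
        double verifiedRate,
        double confidenceScore,
        double qualityScore,
        SketchThresholds thresholds)
    {
        // Assumed precedence: a high enough quality score wins outright,
        // matching the "regardless of other metrics" case in the test plan.
        if (qualityScore >= thresholds.AuthorityThresholdScore)
        {
            return new SketchResult(
                SketchAuthority.AuthorityThreshold,
                new[] { $"Quality score {qualityScore} meets authority threshold {thresholds.AuthorityThresholdScore}" });
        }

        var passed = new List<string>();
        var failed = new List<string>();

        // Records one reason per metric, in either the passed or failed list.
        void Check(string name, double value, double minimum)
        {
            if (value >= minimum)
                passed.Add($"{name} {value} meets minimum {minimum}");
            else
                failed.Add($"{name} {value} is below threshold {minimum}");
        }

        Check("Citation rate", citationRate, thresholds.MinCitationRate);
        Check("Verified rate", verifiedRate, thresholds.MinVerifiedCitationRate);
        Check("Confidence", confidenceScore, thresholds.MinConfidenceScore);

        // Any failed check downgrades the artifact to a suggestion.
        return failed.Count == 0
            ? new SketchResult(SketchAuthority.EvidenceBacked, passed)
            : new SketchResult(SketchAuthority.Suggestion, failed);
    }
}
```

In this sketch, calling `SketchClassifier.DetermineAuthority(0.9, 0.95, 0.8, 0.5, new SketchThresholds())` yields `EvidenceBacked` with three reasons, while the same metrics against `new SketchThresholds() with { MinCitationRate = 0.95 }` fall back to `Suggestion`.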
E2E Test Plan
- Classify an explanation with all metrics above thresholds and verify an `EvidenceBacked` result with three reason entries (citation rate, verified rate, confidence)
- Classify a policy draft with `qualityScore >= 0.95` and verify an `AuthorityThreshold` result regardless of other metrics
- Classify a remediation plan with `citationRate = 0.5` and verify a `Suggestion` result with a reason mentioning citation rate below threshold
- Classify a VEX draft with an evidence resolver that returns `false` for some evidence IDs and verify the verified rate drops below threshold
- Override `AIAuthorityThresholds` with stricter values (e.g., `MinCitationRate = 0.95`) and verify classification changes accordingly
- Verify all four artifact-type classifiers (Explanation, PolicyDraft, Remediation, VexDraft) produce a correct `AIAuthorityClassificationResult` with type-specific scoring
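
The evidence-resolver scenario in the plan above can be pictured with a small sketch. How the real classifier derives its verified rate from the resolver is not documented on this page, so the `VerifiedRateSketch` helper, its empty-input behaviour, and the evidence IDs below are all assumptions made for illustration. Only the `Func<string, bool>` resolver shape and the 0.9 `MinVerifiedCitationRate` default come from the source.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class VerifiedRateSketch
{
    // Hypothetical helper, not part of the real classifier: the share of cited
    // evidence IDs that the supplied resolver reports as resolvable.
    public static double Compute(
        IReadOnlyCollection<string> citedEvidenceIds,
        Func<string, bool> resolver)
    {
        if (citedEvidenceIds.Count == 0)
            return 0.0; // Assumption: nothing cited means nothing verified.

        int resolved = citedEvidenceIds.Count(resolver);
        return (double)resolved / citedEvidenceIds.Count;
    }

    // Scenario from the test plan: one of four cited IDs does not resolve.
    public static double Example()
    {
        // Made-up evidence IDs standing in for a real evidence store.
        var store = new HashSet<string> { "evidence-1", "evidence-2", "evidence-3" };
        var cited = new[] { "evidence-1", "evidence-2", "evidence-3", "evidence-missing" };

        // The resolver returns false for "evidence-missing".
        return Compute(cited, id => store.Contains(id));
    }
}
```

With three of four IDs resolving, `Example()` gives a rate of 0.75, which is under the 0.9 default, so a test along these lines would expect the draft not to reach `EvidenceBacked`.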