3.2 KiB
3.2 KiB
AI-Assisted Explanation and Classification
Module
Attestor
Status
IMPLEMENTED
Description
AI authority classifier with explanation scoring, citation references, explanation types, and model identifiers. AI artifact verification step integrates into the verification pipeline.
Implementation Details
- AIAuthorityClassifier:
src/Attestor/__Libraries/StellaOps.Attestor.ProofChain/Predicates/AI/AIAuthorityClassifier.cs(with.Explanation,.ExplanationScore,.PolicyDraft,.PolicyDraftScore,.Remediation,.RemediationScore,.VexDraft,.VexDraftScorepartials) -- classifies AI outputs intoSuggestion,EvidenceBacked, orAuthorityThresholdbased on citation rate, verified rate, and confidence score. - AIAuthorityThresholds:
AIAuthorityThresholds.cs-- configurable thresholds:MinCitationRate(default 0.8),MinConfidenceScore(default 0.7),MinVerifiedCitationRate(default 0.9),AuthorityThresholdScore(default 0.95). - AIArtifactAuthority enum:
AIArtifactAuthority.cs-- three levels: Suggestion (no evidence), EvidenceBacked (citations verified), AuthorityThreshold (meets auto-processing score). - AIExplanationPredicate:
AIExplanationPredicate.cs-- record extendingAIArtifactBasePredicatewithExplanationType,Content,Citations,ConfidenceScore,CitationRate,Subject,ContextScope. - AIExplanationCitation:
AIExplanationCitation.cs-- links claims to evidence withClaimIndex,ClaimText,EvidenceId(sha256 format),EvidenceType,Verifiedflag. - AIExplanationType enum:
AIExplanationType.cs-- Exploitability, CodePath, PolicyDecision, RiskFactors, RemediationOptions, PlainLanguageSummary, EvidenceChain. - AIModelIdentifier:
AIModelIdentifier.cs-- tracks provider/model/version with optionalWeightsDigestfor local models. - Verification Step:
src/Attestor/__Libraries/StellaOps.Attestor.ProofChain/Verification/AIArtifactVerificationStep.cs(with.Execute,.Classify,.Helpers,.Summary,.VerifyParse,.VerifyValidationpartials) -- integrates intoVerificationPipelineto verify AI artifacts in proof bundles. - Tests:
__Libraries/StellaOps.Attestor.ProofChain.Tests/AI/AIAuthorityClassifierTests.cs
E2E Test Plan
- Create an
AIExplanationPredicatewith citation rate >= 0.8, verified rate >= 0.9, and confidence >= 0.7, classify viaAIAuthorityClassifier, and verify it returnsEvidenceBacked - Create an explanation with citation rate < 0.8 and verify classifier returns
Suggestionwith appropriate reason messages - Create an explanation with quality score >= 0.95 and verify classifier returns
AuthorityThreshold - Submit a proof bundle containing AI artifacts through
AIArtifactVerificationStep.ExecuteAsyncand verify all artifacts are validated (parse, schema, classification) - Submit a proof bundle with invalid AI artifacts (malformed predicate) and verify the verification step returns
Passed = falsewith error details - Create an
AIExplanationCitationwithVerified = falseand verify it lowers the verified rate below the threshold, causing the classifier to returnSuggestion - Verify
AIModelIdentifier.ToString()produces the canonicalprovider:model:versionformat