- Added NullAdvisoryObservationEventTransport for handling advisory observation events. - Created IOrchestratorRegistryStore interface for orchestrator registry operations. - Implemented MongoOrchestratorRegistryStore for MongoDB interactions with orchestrator data. - Defined OrchestratorCommandDocument and OrchestratorCommandRecord for command handling. - Added OrchestratorHeartbeatDocument and OrchestratorHeartbeatRecord for heartbeat tracking. - Created OrchestratorRegistryDocument and OrchestratorRegistryRecord for registry management. - Developed tests for orchestrator collections migration and MongoOrchestratorRegistryStore functionality. - Introduced AirgapImportRequest and AirgapImportValidator for air-gapped VEX bundle imports. - Added incident mode rules sample JSON for notifier configuration.
4.9 KiB
Advisory AI Guardrails & Evidence Intake
Updated: 2025-11-22 · Owner: Advisory AI Docs Guild · Status: Draft (Sprint 0111)
This note captures the guardrail behaviors and evidence intake boundaries required by Sprint 0111 tasks (AIAI-DOCS-31-001, AIAI-RAG-31-003). It binds Advisory AI guardrails to upstream evidence sources and clarifies how Link-Not-Merge (LNM) documents flow into Retrieval-Augmented Generation (RAG) payloads.
1) Evidence sources and contracts
Upstream readiness gates
-
CLI + Policy artefacts (
CLI-VULN-29-001,CLI-VEX-30-001,policyVersiondigests) must be present before enabling non-default profiles. Until then, Advisory AI accepts requests but responds with409 advisory.contextUnavailablewhen those references are missing. -
LNM linksets stay the single source of truth; Advisory AI refuses ad-hoc advisory payloads even if CLI/Policy artefacts are delayed.
-
Advisory observations (LNM) — Consume immutable
advisory_observationsandadvisory_linksetsproduced perdocs/modules/concelier/link-not-merge-schema.md(frozen v1, 2025-11-17). -
VEX statements — Excititor + VEX Lens linksets with trust weights; treated as structured chunks with
source_idandconfidence. -
SBOM context —
SBOM-AIAI-31-001contract: timelines and dependency paths retrieved viaISbomContextRetriever(AddSbomContextHttpClient), default clamps 500 timeline entries / 200 paths. -
Policy explain traces — Policy Engine digests referenced by
policyVersion; cache keys include policy hash to keep outputs replayable. -
Runtime posture (optional) — Zastava signals (
exposure,admissionStatus) when provided by Link-Not-Merge-enabled tenants; optional chunks taggedruntime.
All evidence items must carry content_hash + source_id; Advisory AI never mutates or merges upstream facts (Aggregation-Only Contract).
2) Guardrail stages
- Pre-flight sanitization
- Redact secrets (AWS-style keys, PEM blobs, generic tokens).
- Strip prompt-injection phrases; enforce max input payload 16 kB (configurable, default).
- Reject requests missing
advisoryKeyor linkset-backed evidence (LNM guard).
- Prompt assembly
- Deterministic section order: advisory excerpts → VEX statements → SBOM deltas → policy traces → runtime hints.
- Vector previews capped at 600 chars + ellipsis; section budgets fixed per profile (
default,fips-local,gost-local,cloud-openai); budgets live inprofiles.catalog.jsonand are hashed into DSSE provenance.
- LLM invocation (local/remote)
- Profiles selected via
profilefield; remote profiles require Authority tenant consent andadvisory-ai:operate+aoc:verify.
- Profiles selected via
- Validation & citation enforcement
- Every emitted fact must map to an input chunk (
source_id+content_hash); citations serialized as[n]in Markdown. - Block outputs lacking citations, exceeding section budgets, or including unredacted PII.
- Every emitted fact must map to an input chunk (
- Output sealing
- Store
outputHash,inputDigest,provenanceHash; wrap in DSSE when configured. - Cache TTL defaults to 24h; regenerate only when inputs change or
forceRefresh=true.
- Store
Metrics: advisory_ai_guardrail_blocks_total, advisory_ai_outputs_stored_total, advisory_ai_citation_coverage_ratio. Logs carry output_hash, profile, and block reason; no secrets or raw prompt bodies are logged.
3) RAG payload mapping to LNM (summary)
| LNM field | RAG chunk field | Notes |
|---|---|---|
observation._id |
source_id |
Used for citations and conflict surfacing. |
observation.advisoryId |
advisory_key |
Keyed alongside task type in cache. |
observation.affected[].purl |
purl |
Included for remediation + SBOM joins. |
observation.severities[] |
severity |
Passed through unmerged; multiple severities allowed. |
linkset.conflicts[] |
conflicts |
Rendered verbatim for conflict tasks; no inference merges. |
provenance.sourceArtifactSha |
content_hash |
Drives determinism and replay. |
See docs/advisory-ai/evidence-payloads.md for full JSON examples and alignment rules.
4) Compliance with upstream artefacts
- References:
CONSOLE-VULN-29-001,CONSOLE-VEX-30-001,CLI-VULN-29-001,CLI-VEX-30-001,EXCITITOR-CONSOLE-23-001,DEVOPS-AIAI-31-001. - Guardrails must remain compatible with
docs/policy/assistant-parameters.md; configuration knobs documented there are authoritative for env vars and defaults. - Packaging tasks (AIAI-PACKAGING-31-002) must include this guardrail summary in DSSE metadata to keep Offline Kit parity.
5) Operator checklist
- LNM feed enabled and Concelier schemas at v1 (2025-11-17).
- SBOM retriever configured or
NullSbomContextClientleft as safe default. - Policy hash pinned via
policyVersionwhen reproducibility is required. - Remote profiles only after Authority consent and profile allowlist are set.
- Cache directories shared between web + worker hosts for DSSE sealing.