Refactor code structure for improved readability and maintainability; optimize performance in key functions.

2025-12-22 19:06:31 +02:00
parent dfaa2079aa
commit 0536a4f7d4
1443 changed files with 109671 additions and 7840 deletions
--- a/docs/modules/scanner/architecture.md
+++ b/docs/modules/scanner/architecture.md
@@ -1,20 +1,20 @@
-# component_architecture_scanner.md — **Stella Ops Scanner** (2025Q4)
+# component_architecture_scanner.md — **Stella Ops Scanner** (2025Q4)

-> Aligned with Epic 6 – Vulnerability Explorer and Epic 10 – Export Center.
+> Aligned with Epic 6 – Vulnerability Explorer and Epic 10 – Export Center.

-> **Scope.** Implementation‑ready architecture for the **Scanner** subsystem: WebService, Workers, analyzers, SBOM assembly (inventory & usage), per‑layer caching, three‑way diffs, artifact catalog (RustFS default + PostgreSQL, S3-compatible fallback), attestation hand‑off, and scale/security posture. This document is the contract between the scanning plane and everything else (Policy, Excititor, Concelier, UI, CLI).
+> **Scope.** Implementation‑ready architecture for the **Scanner** subsystem: WebService, Workers, analyzers, SBOM assembly (inventory & usage), per‑layer caching, three‑way diffs, artifact catalog (RustFS default + PostgreSQL, S3-compatible fallback), attestation hand‑off, and scale/security posture. This document is the contract between the scanning plane and everything else (Policy, Excititor, Concelier, UI, CLI).

 ---

 ## 0) Mission & boundaries

-**Mission.** Produce **deterministic**, **explainable** SBOMs and diffs for container images and filesystems, quickly and repeatedly, without guessing. Emit two views: **Inventory** (everything present) and **Usage** (entrypoint closure + actually linked libs). Attach attestations through **Signer→Attestor→Rekor v2**.
+**Mission.** Produce **deterministic**, **explainable** SBOMs and diffs for container images and filesystems, quickly and repeatedly, without guessing. Emit two views: **Inventory** (everything present) and **Usage** (entrypoint closure + actually linked libs). Attach attestations through **Signer→Attestor→Rekor v2**.

 **Boundaries.**

 * Scanner **does not** produce PASS/FAIL. The backend (Policy + Excititor + Concelier) decides presentation and verdicts.
-* Scanner **does not** keep third‑party SBOM warehouses. It may **bind** to existing attestations for exact hashes.
-* Core analyzers are **deterministic** (no fuzzy identity). Optional heuristic plug‑ins (e.g., patch‑presence) run under explicit flags and never contaminate the core SBOM.
+* Scanner **does not** keep third‑party SBOM warehouses. It may **bind** to existing attestations for exact hashes.
+* Core analyzers are **deterministic** (no fuzzy identity). Optional heuristic plug‑ins (e.g., patch‑presence) run under explicit flags and never contaminate the core SBOM.

 ---

@@ -22,41 +22,41 @@

 ```
 src/
- ├─ StellaOps.Scanner.WebService/            # REST control plane, catalog, diff, exports
- ├─ StellaOps.Scanner.Worker/                # queue consumer; executes analyzers
- ├─ StellaOps.Scanner.Models/                # DTOs, evidence, graph nodes, CDX/SPDX adapters
- ├─ StellaOps.Scanner.Storage/               # PostgreSQL repositories; RustFS object client (default) + S3 fallback; ILM/GC
- ├─ StellaOps.Scanner.Queue/                 # queue abstraction (Redis/NATS/RabbitMQ)
- ├─ StellaOps.Scanner.Cache/                 # layer cache; file CAS; bloom/bitmap indexes
- ├─ StellaOps.Scanner.EntryTrace/            # ENTRYPOINT/CMD → terminal program resolver (shell AST)
- ├─ StellaOps.Scanner.Analyzers.OS.[Apk|Dpkg|Rpm]/
- ├─ StellaOps.Scanner.Analyzers.Lang.[Java|Node|Bun|Python|Go|DotNet|Rust|Ruby|Php]/
- ├─ StellaOps.Scanner.Analyzers.Native.[ELF|PE|MachO]/   # PE/Mach-O planned (M2)
- ├─ StellaOps.Scanner.Symbols.Native/                    # NEW – native symbol reader/demangler (Sprint 401)
- ├─ StellaOps.Scanner.CallGraph.Native/                  # NEW – function/call-edge builder + CAS emitter
- ├─ StellaOps.Scanner.Emit.CDX/              # CycloneDX (JSON + Protobuf)
- ├─ StellaOps.Scanner.Emit.SPDX/             # SPDX 3.0.1 JSON
- ├─ StellaOps.Scanner.Diff/                  # image→layer→component three‑way diff
- ├─ StellaOps.Scanner.Index/                 # BOM‑Index sidecar (purls + roaring bitmaps)
- ├─ StellaOps.Scanner.Tests.*                # unit/integration/e2e fixtures
- └─ Tools/
-     ├─ StellaOps.Scanner.Sbomer.BuildXPlugin/   # BuildKit generator (image referrer SBOMs)
-     └─ StellaOps.Scanner.Sbomer.DockerImage/    # CLI‑driven scanner container
+ ├─ StellaOps.Scanner.WebService/            # REST control plane, catalog, diff, exports
+ ├─ StellaOps.Scanner.Worker/                # queue consumer; executes analyzers
+ ├─ StellaOps.Scanner.Models/                # DTOs, evidence, graph nodes, CDX/SPDX adapters
+ ├─ StellaOps.Scanner.Storage/               # PostgreSQL repositories; RustFS object client (default) + S3 fallback; ILM/GC
+ ├─ StellaOps.Scanner.Queue/                 # queue abstraction (Redis/NATS/RabbitMQ)
+ ├─ StellaOps.Scanner.Cache/                 # layer cache; file CAS; bloom/bitmap indexes
+ ├─ StellaOps.Scanner.EntryTrace/            # ENTRYPOINT/CMD → terminal program resolver (shell AST)
+ ├─ StellaOps.Scanner.Analyzers.OS.[Apk|Dpkg|Rpm]/
+ ├─ StellaOps.Scanner.Analyzers.Lang.[Java|Node|Bun|Python|Go|DotNet|Rust|Ruby|Php]/
+ ├─ StellaOps.Scanner.Analyzers.Native.[ELF|PE|MachO]/   # PE/Mach-O planned (M2)
+ ├─ StellaOps.Scanner.Symbols.Native/                    # NEW – native symbol reader/demangler (Sprint 401)
+ ├─ StellaOps.Scanner.CallGraph.Native/                  # NEW – function/call-edge builder + CAS emitter
+ ├─ StellaOps.Scanner.Emit.CDX/              # CycloneDX (JSON + Protobuf)
+ ├─ StellaOps.Scanner.Emit.SPDX/             # SPDX 3.0.1 JSON
+ ├─ StellaOps.Scanner.Diff/                  # image→layer→component three‑way diff
+ ├─ StellaOps.Scanner.Index/                 # BOM‑Index sidecar (purls + roaring bitmaps)
+ ├─ StellaOps.Scanner.Tests.*                # unit/integration/e2e fixtures
+ └─ Tools/
+     ├─ StellaOps.Scanner.Sbomer.BuildXPlugin/   # BuildKit generator (image referrer SBOMs)
+     └─ StellaOps.Scanner.Sbomer.DockerImage/    # CLI‑driven scanner container
 ```

 Per-analyzer notes (language analyzers):
- `docs/modules/scanner/analyzers-java.md` — Java/Kotlin (Maven, Gradle, fat archives)
- `docs/modules/scanner/dotnet-analyzer.md` — .NET (deps.json, NuGet, packages.lock.json, declared-only)
- `docs/modules/scanner/analyzers-python.md` — Python (pip, Poetry, pipenv, conda, editables, vendored)
- `docs/modules/scanner/analyzers-node.md` — Node.js (npm, Yarn, pnpm, multi-version locks)
- `docs/modules/scanner/analyzers-bun.md` — Bun (bun.lock v1, dev classification, patches)
- `docs/modules/scanner/analyzers-go.md` — Go (build info, modules)
+- `docs/modules/scanner/analyzers-java.md` — Java/Kotlin (Maven, Gradle, fat archives)
+- `docs/modules/scanner/dotnet-analyzer.md` — .NET (deps.json, NuGet, packages.lock.json, declared-only)
+- `docs/modules/scanner/analyzers-python.md` — Python (pip, Poetry, pipenv, conda, editables, vendored)
+- `docs/modules/scanner/analyzers-node.md` — Node.js (npm, Yarn, pnpm, multi-version locks)
+- `docs/modules/scanner/analyzers-bun.md` — Bun (bun.lock v1, dev classification, patches)
+- `docs/modules/scanner/analyzers-go.md` — Go (build info, modules)

 Cross-analyzer contract (identity safety, evidence locators, container layout):
- `docs/modules/scanner/language-analyzers-contract.md` — PURL vs explicit-key rules, evidence formats, bounded scanning
+- `docs/modules/scanner/language-analyzers-contract.md` — PURL vs explicit-key rules, evidence formats, bounded scanning

 Semantic entrypoint analysis (Sprint 0411):
- `docs/modules/scanner/semantic-entrypoint-schema.md` — Schema for intent, capabilities, threat vectors, and data boundaries
+- `docs/modules/scanner/semantic-entrypoint-schema.md` — Schema for intent, capabilities, threat vectors, and data boundaries

 Analyzer assemblies and buildx generators are packaged as **restart-time plug-ins** under `plugins/scanner/**` with manifests; services must restart to activate new plug-ins.

@@ -64,15 +64,15 @@ Analyzer assemblies and buildx generators are packaged as **restart-time plug-in

 The **Semantic Entrypoint Engine** enriches scan results with application-level understanding:

- **Intent Classification** — Infers application type (WebServer, Worker, CliTool, Serverless, etc.) from framework detection and entrypoint analysis
- **Capability Detection** — Identifies system resource access patterns (network, filesystem, database, crypto)
- **Threat Vector Inference** — Maps capabilities to potential attack vectors with CWE/OWASP references
- **Data Boundary Mapping** — Tracks data flow boundaries with sensitivity classification
+- **Intent Classification** — Infers application type (WebServer, Worker, CliTool, Serverless, etc.) from framework detection and entrypoint analysis
+- **Capability Detection** — Identifies system resource access patterns (network, filesystem, database, crypto)
+- **Threat Vector Inference** — Maps capabilities to potential attack vectors with CWE/OWASP references
+- **Data Boundary Mapping** — Tracks data flow boundaries with sensitivity classification

 Components:
- `StellaOps.Scanner.EntryTrace/Semantic/` — Core semantic types and orchestrator
- `StellaOps.Scanner.EntryTrace/Semantic/Adapters/` — Language-specific adapters (Python, Java, Node, .NET, Go)
- `StellaOps.Scanner.EntryTrace/Semantic/Analysis/` — Capability detection, threat inference, boundary mapping
+- `StellaOps.Scanner.EntryTrace/Semantic/` — Core semantic types and orchestrator
+- `StellaOps.Scanner.EntryTrace/Semantic/Adapters/` — Language-specific adapters (Python, Java, Node, .NET, Go)
+- `StellaOps.Scanner.EntryTrace/Semantic/Analysis/` — Capability detection, threat inference, boundary mapping

 Integration points:
 - `LanguageComponentRecord` includes semantic fields (`intent`, `capabilities[]`, `threatVectors[]`)
@@ -88,8 +88,8 @@ CLI usage: `stella scan --semantic <image>` enables semantic analysis in output.
 - **Build-id capture**: read `.note.gnu.build-id` for every ELF, store hex build-id alongside soname/path, propagate into `SymbolID`/`code_id`, and expose it to SBOM + runtime joiners. If missing, fall back to file hash and mark source accordingly.
 - **PURL-resolved edges**: annotate call edges with the callee purl and `symbol_digest` so graphs merge with SBOM components. See `docs/reachability/purl-resolved-edges.md` for schema rules and acceptance tests.
 - **Symbol hints in evidence**: reachability union and richgraph payloads emit `symbol {mangled,demangled,source,confidence}` plus optional `code_block_hash` for stripped/heuristic functions; serializers clamp confidence to [0,1] and uppercase `source` (`DWARF|PDB|SYM|NONE`) for determinism.
- **Unknowns emission**: when symbol → purl mapping or edge targets remain unresolved, emit structured Unknowns to Signals (see `docs/signals/unknowns-registry.md`) instead of dropping evidence.
- **Hybrid attestation**: emit **graph-level DSSE** for every `richgraph-v1` (mandatory) and optional **edge-bundle DSSE** (≤512 edges) for runtime/init-root/contested edges or third-party provenance. Publish graph DSSE digests to Rekor by default; edge-bundle Rekor publish is policy-driven. CAS layout: `cas://reachability/graphs/{blake3}` for graph body, `.../{blake3}.dsse` for envelope, and `cas://reachability/edges/{graph_hash}/{bundle_id}[.dsse]` for bundles. Deterministic ordering before hashing/signing is required.
+- **Unknowns emission**: when symbol → purl mapping or edge targets remain unresolved, emit structured Unknowns to Signals (see `docs/signals/unknowns-registry.md`) instead of dropping evidence.
+- **Hybrid attestation**: emit **graph-level DSSE** for every `richgraph-v1` (mandatory) and optional **edge-bundle DSSE** (≤512 edges) for runtime/init-root/contested edges or third-party provenance. Publish graph DSSE digests to Rekor by default; edge-bundle Rekor publish is policy-driven. CAS layout: `cas://reachability/graphs/{blake3}` for graph body, `.../{blake3}.dsse` for envelope, and `cas://reachability/edges/{graph_hash}/{bundle_id}[.dsse]` for bundles. Deterministic ordering before hashing/signing is required.
 - **Deterministic call-graph manifest**: capture analyzer versions, feed hashes, toolchain digests, and flags in a manifest stored alongside `richgraph-v1`; replaying with the same manifest MUST yield identical node/edge sets and hashes (see `docs/reachability/lead.md`).
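
The symbol-hint normalization mentioned in the list above (confidence clamped to [0,1], `source` uppercased to one of `DWARF|PDB|SYM|NONE`) could look roughly like the following. This is a minimal sketch in Python for illustration only; the function name `normalize_symbol_hint`, the dictionary shape, and the choice to map unrecognized sources to `NONE` are assumptions, not the Scanner's actual serializer.

```python
from typing import Optional

# Source values named in the text; anything else is an unknown input.
ALLOWED_SOURCES = {"DWARF", "PDB", "SYM", "NONE"}


def normalize_symbol_hint(
    mangled: str,
    demangled: Optional[str],
    source: str,
    confidence: float,
) -> dict:
    """Return a symbol hint with a clamped confidence and an uppercase source."""
    src = source.strip().upper()
    if src not in ALLOWED_SOURCES:
        # Assumption: unknown sources collapse to NONE rather than raising.
        src = "NONE"
    # Clamp into the closed interval [0, 1].
    conf = min(1.0, max(0.0, float(confidence)))
    return {
        "mangled": mangled,
        "demangled": demangled,
        "source": src,
        "confidence": conf,
    }
```

Normalizing before serialization means two runs that disagree only in casing or out-of-range scores still produce identical payloads, which is the determinism property the text asks for.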

 ### 1.1 Queue backbone (Redis / NATS)
@@ -121,10 +121,10 @@ scanner:

 The DI extension (`AddScannerQueue`) wires the selected transport, so future additions (e.g., RabbitMQ) only implement the same contract and register.

-**Runtime form‑factor:** two deployables
+**Runtime form‑factor:** two deployables

 * **Scanner.WebService** (stateless REST)
-* **Scanner.Worker** (N replicas; queue‑driven)
+* **Scanner.Worker** (N replicas; queue‑driven)

 ---

@@ -134,30 +134,30 @@ The DI extension (`AddScannerQueue`) wires the selected transport, so future add
 * **RustFS** (default, offline-first) for SBOM artifacts; optional S3/MinIO compatibility retained for migration; **Object Lock** semantics emulated via retention headers; **ILM** for TTL.
 * **PostgreSQL** for catalog, job state, diffs, ILM rules.
 * **Queue** (Redis Streams/NATS/RabbitMQ).
-* **Authority** (on‑prem OIDC) for **OpToks** (DPoP/mTLS).
+* **Authority** (on‑prem OIDC) for **OpToks** (DPoP/mTLS).
 * **Signer** + **Attestor** (+ **Fulcio/KMS** + **Rekor v2**) for DSSE + transparency.

 ---

 ## 3) Contracts & data model

-### 3.1 Evidence‑first component model
+### 3.1 Evidence‑first component model

 **Nodes**

 * `Image`, `Layer`, `File`
-* `Component` (`purl?`, `name`, `version?`, `type`, `id` — may be `bin:{sha256}`)
-* `Executable` (ELF/PE/Mach‑O), `Library` (native or managed), `EntryScript` (shell/launcher)
+* `Component` (`purl?`, `name`, `version?`, `type`, `id` — may be `bin:{sha256}`)
+* `Executable` (ELF/PE/Mach‑O), `Library` (native or managed), `EntryScript` (shell/launcher)

 **Edges** (all carry **Evidence**)

-* `contains(Image|Layer → File)`
-* `installs(PackageDB → Component)` (OS database row)
-* `declares(InstalledMetadata → Component)` (dist‑info, pom.properties, deps.json…)
-* `links_to(Executable → Library)` (ELF `DT_NEEDED`, PE imports)
-* `calls(EntryScript → Program)` (file:line from shell AST)
-* `attests(Rekor → Component|Image)` (SBOM/predicate binding)
-* `bound_from_attestation(Component_attested → Component_observed)` (hash equality proof)
+* `contains(Image|Layer → File)`
+* `installs(PackageDB → Component)` (OS database row)
+* `declares(InstalledMetadata → Component)` (dist‑info, pom.properties, deps.json…)
+* `links_to(Executable → Library)` (ELF `DT_NEEDED`, PE imports)
+* `calls(EntryScript → Program)` (file:line from shell AST)
+* `attests(Rekor → Component|Image)` (SBOM/predicate binding)
+* `bound_from_attestation(Component_attested → Component_observed)` (hash equality proof)
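
As a rough illustration of the node and edge shapes listed above, the sketch below models a `Component` and an evidence-carrying edge in Python. The class layout, the `"binary"` type value, and the rule that an edge without evidence is rejected at construction time are assumptions made for the example; the real model lives in `StellaOps.Scanner.Models` and may differ.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Component:
    name: str
    type: str
    id: str                        # may be "bin:{sha256}" when no purl is known
    purl: Optional[str] = None
    version: Optional[str] = None


@dataclass(frozen=True)
class Edge:
    kind: str                      # e.g. "contains", "installs", "links_to"
    source: str
    target: str
    evidence: Tuple[str, ...]      # hypothetical: opaque evidence locators

    def __post_init__(self) -> None:
        # The text says all edges carry Evidence; enforce that here.
        if not self.evidence:
            raise ValueError(f"edge {self.kind!r} must carry evidence")


def binary_component(name: str, sha256: str) -> Component:
    """Build a hash-identified component for a file with no package identity."""
    # "binary" as the type value is an assumption for this sketch.
    return Component(name=name, type="binary", id=f"bin:{sha256}")
```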

 **Evidence**

@@ -211,17 +211,20 @@ migrations.
 All under `/api/v1/scanner`. Auth: **OpTok** (DPoP/mTLS); RBAC scopes.

 ```
-POST /scans                        { imageRef|digest, force?:bool } → { scanId }
-GET  /scans/{id}                   → { status, imageDigest, artifacts[], rekor? }
-GET  /sboms/{imageDigest}          ?format=cdx-json|cdx-pb|spdx-json&view=inventory|usage → bytes
-GET  /scans/{id}/ruby-packages     → { scanId, imageDigest, generatedAt, packages[] }
-GET  /scans/{id}/bun-packages      → { scanId, imageDigest, generatedAt, packages[] }
-GET  /diff?old=<digest>&new=<digest>&view=inventory|usage → diff.json
-POST /exports                      { imageDigest, format, view, attest?:bool } → { artifactId, rekor? }
-POST /reports                      { imageDigest, policyRevision? } → { reportId, rekor? }   # delegates to backend policy+vex
-GET  /catalog/artifacts/{id}       → { meta }
+POST /scans                        { imageRef|digest, force?:bool } → { scanId }
+GET  /scans/{id}                   → { status, imageDigest, artifacts[], rekor? }
+GET  /sboms/{imageDigest}          ?format=cdx-json|cdx-pb|spdx-json&view=inventory|usage → bytes
+POST /sbom/upload                  { artifactRef, sbom|sbomBase64, format?, source? } -> { sbomId, analysisJobId }
+GET  /sbom/uploads/{sbomId}        -> upload record + provenance
+GET  /scans/{id}/ruby-packages     → { scanId, imageDigest, generatedAt, packages[] }
+GET  /scans/{id}/bun-packages      → { scanId, imageDigest, generatedAt, packages[] }
+GET  /diff?old=<digest>&new=<digest>&view=inventory|usage → diff.json
+POST /exports                      { imageDigest, format, view, attest?:bool } → { artifactId, rekor? }
+POST /reports                      { imageDigest, policyRevision? } → { reportId, rekor? }   # delegates to backend policy+vex
+GET  /catalog/artifacts/{id}       → { meta }
 GET  /healthz | /readyz | /metrics
 ```
+See docs/modules/scanner/byos-ingestion.md for BYOS workflow, formats, and troubleshooting.
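
To make the query-string conventions in the endpoint list concrete, here is a small Python sketch that builds request paths for the SBOM and diff endpoints. Only the `/api/v1/scanner` prefix, the paths, and the allowed `format`/`view` values come from the listing above; the helper names `sbom_url` and `diff_url` are hypothetical, and authentication (OpTok, DPoP/mTLS) is deliberately left out.

```python
from urllib.parse import quote, urlencode

BASE = "/api/v1/scanner"
FORMATS = {"cdx-json", "cdx-pb", "spdx-json"}
VIEWS = {"inventory", "usage"}


def sbom_url(image_digest: str, fmt: str, view: str) -> str:
    """Path for GET /sboms/{imageDigest}?format=...&view=..."""
    if fmt not in FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    if view not in VIEWS:
        raise ValueError(f"unsupported view: {view}")
    # Keep ':' readable so digests stay in the usual sha256:<hex> form.
    digest = quote(image_digest, safe=":")
    query = urlencode({"format": fmt, "view": view})
    return f"{BASE}/sboms/{digest}?{query}"


def diff_url(old_digest: str, new_digest: str, view: str) -> str:
    """Path for GET /diff?old=<digest>&new=<digest>&view=..."""
    if view not in VIEWS:
        raise ValueError(f"unsupported view: {view}")
    query = urlencode({"old": old_digest, "new": new_digest, "view": view}, safe=":")
    return f"{BASE}/diff?{query}"
```

Validating `format` and `view` on the client side is a design choice for the sketch; the service remains the authority on which values it accepts.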

 ### Report events

@@ -233,13 +236,13 @@ When `scanner.events.enabled = true`, the WebService serialises the signed repor

 ### 5.1 Acquire & verify

-1. **Resolve image** (prefer `repo@sha256:…`).
+1. **Resolve image** (prefer `repo@sha256:…`).
 2. **(Optional) verify image signature** per policy (cosign).
 3. **Pull blobs**, compute layer digests; record metadata.

 ### 5.2 Layer union FS

-* Apply whiteouts; materialize final filesystem; map **file → first introducing layer**.
+* Apply whiteouts; materialize final filesystem; map **file → first introducing layer**.
 * Windows layers (MSI/SxS/GAC) planned in **M2**.
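
The whiteout handling and layer attribution described above can be sketched as follows. This Python example assumes the OCI image-layer conventions (`.wh.<name>` deletes a path from lower layers, `.wh..wh..opq` hides a directory's lower-layer contents) and works on plain path lists rather than real tar archives. The function name `union_layers` and the reading of "first introducing layer" as "earliest layer whose copy of the path survives into the final filesystem" are assumptions.

```python
import posixpath
from typing import Dict, List, Tuple

WHITEOUT_PREFIX = ".wh."
OPAQUE_MARKER = ".wh..wh..opq"


def union_layers(layers: List[Tuple[str, List[str]]]) -> Dict[str, str]:
    """Map each surviving path to the digest of the layer that introduced it.

    `layers` is ordered base-first; paths are relative, '/'-separated.
    """
    introduced: Dict[str, str] = {}
    for digest, paths in layers:
        regular = []
        # Whiteouts only affect lower layers, so apply them before this
        # layer's own files are recorded.
        for path in paths:
            parent, name = posixpath.split(path)
            if name == OPAQUE_MARKER:
                prefix = parent + "/" if parent else ""
                doomed = [p for p in introduced if p.startswith(prefix)]
            elif name.startswith(WHITEOUT_PREFIX):
                target = posixpath.join(parent, name[len(WHITEOUT_PREFIX):])
                doomed = [
                    p for p in introduced
                    if p == target or p.startswith(target + "/")
                ]
            else:
                regular.append(path)
                continue
            for p in doomed:
                del introduced[p]
        for path in regular:
            # Keep the earliest surviving layer for a path.
            introduced.setdefault(path, digest)
    return introduced
```

A file that is deleted in one layer and re-added later is attributed to the re-adding layer, because the whiteout removes the earlier record before the new one is stored.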

 ### 5.3 Evidence harvest (parallel analyzers; deterministic only)
@@ -259,32 +262,32 @@ When `scanner.events.enabled = true`, the WebService serialises the signed repor

 **B) Language ecosystems (installed state only)**

-* **Java**: `META-INF/maven/*/pom.properties`, MANIFEST → `pkg:maven/...`
-* **Node**: `node_modules/**/package.json` → `pkg:npm/...`
-* **Bun**: `bun.lock` (JSONC text) + `node_modules/**/package.json` + `node_modules/.bun/**/package.json` (isolated linker) → `pkg:npm/...`; `bun.lockb` (binary) emits remediation guidance
-* **Python**: `*.dist-info/{METADATA,RECORD}` → `pkg:pypi/...`
-* **Go**: Go **buildinfo** in binaries → `pkg:golang/...`
-* **.NET**: `*.deps.json` + assembly metadata → `pkg:nuget/...`
+* **Java**: `META-INF/maven/*/pom.properties`, MANIFEST → `pkg:maven/...`
+* **Node**: `node_modules/**/package.json` → `pkg:npm/...`
+* **Bun**: `bun.lock` (JSONC text) + `node_modules/**/package.json` + `node_modules/.bun/**/package.json` (isolated linker) → `pkg:npm/...`; `bun.lockb` (binary) emits remediation guidance
+* **Python**: `*.dist-info/{METADATA,RECORD}` → `pkg:pypi/...`
+* **Go**: Go **buildinfo** in binaries → `pkg:golang/...`
+* **.NET**: `*.deps.json` + assembly metadata → `pkg:nuget/...`
 * **Rust**: crates only when **explicitly present** (embedded metadata or cargo/registry traces); otherwise binaries reported as `bin:{sha256}`.

 > **Rule:** We only report components proven **on disk** with authoritative metadata. Lockfiles are evidence only.
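
As one concrete instance of the "installed metadata → purl" mapping above, the sketch below derives a `pkg:pypi/...` identifier from the text of a `*.dist-info/METADATA` file. It is illustrative Python, not the analyzer's code: the function name is hypothetical, and the name normalization (lowercase, `_` replaced by `-`) follows the purl `pypi` type rules as understood here. Returning `None` when `Name` or `Version` is missing reflects the rule that nothing is reported without authoritative metadata.

```python
from email.parser import Parser
from typing import Optional
from urllib.parse import quote


def pypi_purl_from_metadata(metadata_text: str) -> Optional[str]:
    """Build a pkg:pypi purl from dist-info METADATA, or None if incomplete."""
    # METADATA uses RFC 822-style headers, so the stdlib email parser reads it.
    msg = Parser().parsestr(metadata_text, headersonly=True)
    name = msg.get("Name")
    version = msg.get("Version")
    if not name or not version:
        return None  # do not guess an identity
    normalized = name.strip().lower().replace("_", "-")
    return f"pkg:pypi/{normalized}@{quote(version.strip(), safe='')}"
```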

 **C) Native link graph**

-* **ELF**: parse `PT_INTERP`, `DT_NEEDED`, RPATH/RUNPATH, **GNU symbol versions**; map **SONAMEs** to file paths; link executables → libs.
-* **PE/Mach‑O** (planned M2): import table, delay‑imports; version resources; code signatures.
+* **ELF**: parse `PT_INTERP`, `DT_NEEDED`, RPATH/RUNPATH, **GNU symbol versions**; map **SONAMEs** to file paths; link executables → libs.
+* **PE/Mach‑O** (planned M2): import table, delay‑imports; version resources; code signatures.
 * Map libs back to **OS packages** if possible (via file lists); else emit `bin:{sha256}` components.
 * The exported metadata (`stellaops.os.*` properties, license list, source package) feeds policy scoring and export pipelines
-  directly – Policy evaluates quiet rules against package provenance while Exporters forward the enriched fields into
+  directly – Policy evaluates quiet rules against package provenance while Exporters forward the enriched fields into
  downstream JSON/Trivy payloads.
 * **Reachability lattice**: analyzers + runtime probes emit `Evidence`/`Mitigation` records (see `docs/reachability/lattice.md`). The lattice engine joins static path evidence, runtime hits (EventPipe/JFR), taint flows, environment gates, and mitigations into `ReachDecision` documents that feed VEX gating and event graph storage.
-* Sprint 401 introduces `StellaOps.Scanner.Symbols.Native` (DWARF/PDB reader + demangler) and `StellaOps.Scanner.CallGraph.Native`
+* Sprint 401 introduces `StellaOps.Scanner.Symbols.Native` (DWARF/PDB reader + demangler) and `StellaOps.Scanner.CallGraph.Native`
  (function boundary detector + call-edge builder). These libraries feed `FuncNode`/`CallEdge` CAS bundles and enrich reachability
  graphs with `{code_id, confidence, evidence}` so Signals/Policy/UI can cite function-level justifications.

-**D) EntryTrace (ENTRYPOINT/CMD → terminal program)**
+**D) EntryTrace (ENTRYPOINT/CMD → terminal program)**

-* Read image config; parse shell (POSIX/Bash subset) with AST: `source`/`.` includes; `case/if`; `exec`/`command`; `run‑parts`.
+* Read image config; parse shell (POSIX/Bash subset) with AST: `source`/`.` includes; `case/if`; `exec`/`command`; `run‑parts`.
 * Resolve commands via **PATH** within the **built rootfs**; follow language launchers (Java/Node/Python) to identify the terminal program (ELF/JAR/venv script).
 * Record **file:line** and choices for each hop; output chain graph.
 * Unresolvable dynamic constructs are recorded as **unknown** edges with reasons (e.g., `$FOO` unresolved).
@@ -293,11 +296,11 @@ When `scanner.events.enabled = true`, the WebService serialises the signed repor

 Post-resolution, the `SemanticEntrypointOrchestrator` enriches entry trace results with semantic understanding:

-* **Application Intent** — Infers the purpose (WebServer, CliTool, Worker, Serverless, BatchJob, etc.) from framework detection and command patterns.
-* **Capability Classes** — Detects capabilities (NetworkListen, DatabaseSql, ProcessSpawn, SecretAccess, etc.) via import/dependency analysis and framework signatures.
-* **Attack Surface** — Maps capabilities to potential threat vectors (SqlInjection, Xss, Ssrf, Rce, PathTraversal) with CWE IDs and OWASP Top 10 categories.
-* **Data Boundaries** — Traces I/O edges (HttpRequest, DatabaseQuery, FileInput, EnvironmentVar) with direction and sensitivity classification.
-* **Confidence Scoring** — Each inference carries a score (0.0–1.0), tier (Definitive/High/Medium/Low/Unknown), and reasoning chain.
+* **Application Intent** — Infers the purpose (WebServer, CliTool, Worker, Serverless, BatchJob, etc.) from framework detection and command patterns.
+* **Capability Classes** — Detects capabilities (NetworkListen, DatabaseSql, ProcessSpawn, SecretAccess, etc.) via import/dependency analysis and framework signatures.
+* **Attack Surface** — Maps capabilities to potential threat vectors (SqlInjection, Xss, Ssrf, Rce, PathTraversal) with CWE IDs and OWASP Top 10 categories.
+* **Data Boundaries** — Traces I/O edges (HttpRequest, DatabaseQuery, FileInput, EnvironmentVar) with direction and sensitivity classification.
+* **Confidence Scoring** — Each inference carries a score (0.0–1.0), tier (Definitive/High/Medium/Low/Unknown), and reasoning chain.

 Language-specific adapters (`PythonSemanticAdapter`, `JavaSemanticAdapter`, `NodeSemanticAdapter`, `DotNetSemanticAdapter`, `GoSemanticAdapter`) recognize framework patterns:
 * **Python**: Django, Flask, FastAPI, Celery, Click/Typer, Lambda handlers
@@ -316,7 +319,7 @@ See `docs/modules/scanner/operations/entrypoint-semantic.md` for full schema ref
 **E) Attestation & SBOM bind (optional)**

 * For each **file hash** or **binary hash**, query local cache of **Rekor v2** indices; if an SBOM attestation is found for **exact hash**, bind it to the component (origin=`attested`).
-* For the **image** digest, likewise bind SBOM attestations (build‑time referrers).
+* For the **image** digest, likewise bind SBOM attestations (build‑time referrers).

 ### 5.4 Component normalization (exact only)

@@ -326,25 +329,25 @@ See `docs/modules/scanner/operations/entrypoint-semantic.md` for full schema ref
 ### 5.5 SBOM assembly & emit

 * **Per-layer SBOM fragments**: components introduced by the layer (+ relationships).
-* **Image SBOMs**: merge fragments; refer back to them via **CycloneDX BOM‑Link** (or SPDX ExternalRef).
+* **Image SBOMs**: merge fragments; refer back to them via **CycloneDX BOM‑Link** (or SPDX ExternalRef).
 * Emit both **Inventory** & **Usage** views.
 * When the native analyzer reports an ELF `buildId`, attach it to component metadata and surface it as `stellaops:buildId` in CycloneDX properties (and diff metadata). This keeps SBOM/diff output in lockstep with runtime events and the debug-store manifest.
-* Serialize **CycloneDX JSON** and **CycloneDX Protobuf**; optionally **SPDX 3.0.1 JSON**.
-* Build **BOM‑Index** sidecar: purl table + roaring bitmap; flag `usedByEntrypoint` components for fast backend joins.
+* Serialize **CycloneDX 1.7 JSON** and **CycloneDX 1.7 Protobuf**; optionally emit **SPDX 3.0.1 JSON-LD** (`application/spdx+json; version=3.0.1`), with legacy tag-value output (`text/spdx`) when enabled. CycloneDX 1.6 remains accepted on ingest for compatibility.
+* Build **BOM‑Index** sidecar: purl table + roaring bitmap; flag `usedByEntrypoint` components for fast backend joins.

-The emitted `buildId` metadata is preserved in component hashes, diff payloads, and `/policy/runtime` responses so operators can pivot from SBOM entries → runtime events → `debug/.build-id/<aa>/<rest>.debug` within the Offline Kit or release bundle.
+The emitted `buildId` metadata is preserved in component hashes, diff payloads, and `/policy/runtime` responses so operators can pivot from SBOM entries → runtime events → `debug/.build-id/<aa>/<rest>.debug` within the Offline Kit or release bundle.

 ### 5.6 DSSE attestation (via Signer/Attestor)

 * WebService constructs **predicate** with `image_digest`, `stellaops_version`, `license_id`, `policy_digest?` (when emitting **final reports**), timestamps.
 * Calls **Signer** (requires **OpTok + PoE**); Signer verifies **entitlement + scanner image integrity** and returns **DSSE bundle**.
-* **Attestor** logs to **Rekor v2**; returns `{uuid,index,proof}` → stored in `artifacts.rekor`.
+* **Attestor** logs to **Rekor v2**; returns `{uuid,index,proof}` → stored in `artifacts.rekor`.
 * **Hybrid reachability attestations**: graph-level DSSE (mandatory) plus optional edge-bundle DSSEs for runtime/init/contested edges. See [`docs/reachability/hybrid-attestation.md`](../../reachability/hybrid-attestation.md) for verification runbooks and Rekor guidance.
 * Operator enablement runbooks (toggles, env-var map, rollout guidance) live in [`operations/dsse-rekor-operator-guide.md`](operations/dsse-rekor-operator-guide.md) per SCANNER-ENG-0015.

 ---

-## 6) Three‑way diff (image → layer → component)
+## 6) Three‑way diff (image → layer → component)

 ### 6.1 Keys & classification

@@ -360,7 +363,7 @@ B = components(imageNew, key)

 added   = B \ A
 removed = A \ B
-changed = { k in A∩B : version(A[k]) != version(B[k]) || origin changed }
+changed = { k in A∩B : version(A[k]) != version(B[k]) || origin changed }

 for each item in added/removed/changed:
   layer = attribute_to_layer(item, imageOld|imageNew)
@@ -372,13 +375,13 @@ Diffs are stored as artifacts and feed **UI** and **CLI**.
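
The set-difference pseudocode above translates almost directly into runnable form. The Python sketch below covers only the added/removed/changed classification; layer attribution (`attribute_to_layer`) is omitted because its inputs are not shown in this excerpt. The function name `diff_components` and the dictionary shape (`key -> {"version", "origin"}`) are assumptions for illustration.

```python
from typing import Dict, List, Mapping, Optional

ComponentInfo = Mapping[str, Optional[str]]


def diff_components(
    old: Mapping[str, ComponentInfo],
    new: Mapping[str, ComponentInfo],
) -> Dict[str, List[str]]:
    """Classify component keys as added, removed, or changed."""
    added = new.keys() - old.keys()        # B \ A
    removed = old.keys() - new.keys()      # A \ B
    changed = {
        k
        for k in old.keys() & new.keys()   # A ∩ B
        if old[k].get("version") != new[k].get("version")
        or old[k].get("origin") != new[k].get("origin")
    }
    # Sort so the output is stable regardless of input ordering.
    return {
        "added": sorted(added),
        "removed": sorted(removed),
        "changed": sorted(changed),
    }
```

Sorting each bucket is a choice made here to keep the result reproducible across runs, in line with the document's emphasis on deterministic output.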

 ---

-## 7) Build‑time SBOMs (fast CI path)
+## 7) Build‑time SBOMs (fast CI path)

 **Scanner.Sbomer.BuildXPlugin** can act as a BuildKit **generator**:

 * During `docker buildx build --attest=type=sbom,generator=stellaops/sbom-indexer`, run analyzers on the build context/output; attach SBOMs as OCI **referrers** to the built image.
-* Optionally request **Signer/Attestor** to produce **Stella Ops‑verified** attestation immediately; else, Scanner.WebService can verify and re‑attest post‑push.
-* Scanner.WebService trusts build‑time SBOMs per policy, enabling **no‑rescan** for unchanged bases.
+* Optionally request **Signer/Attestor** to produce **Stella Ops‑verified** attestation immediately; else, Scanner.WebService can verify and re‑attest post‑push.
+* Scanner.WebService trusts build‑time SBOMs per policy, enabling **no‑rescan** for unchanged bases.

 ---

@@ -420,26 +423,26 @@ scanner:

 ## 9) Scale & performance

-* **Parallelism**: per‑analyzer concurrency; bounded directory walkers; file CAS dedupe by sha256.
+* **Parallelism**: per‑analyzer concurrency; bounded directory walkers; file CAS dedupe by sha256.
 * **Distributed locks** per **layer digest** to prevent duplicate work across Workers.
+* **Registry throttles**: per‑host concurrency budgets; exponential backoff on 429/5xx.
+* **Registry throttles**: perâ€‘host concurrency budgets; exponential backoff on 429/5xx.
 * **Targets**:

-  * **Build‑time**: P95 ≤ 3–5 s on warmed bases (CI generator).
-  * **Post‑build delta**: P95 ≤ 10 s for 200 MB images with cache hit.
-  * **Emit**: CycloneDX Protobuf ≤ 150 ms for 5k components; JSON ≤ 500 ms.
-  * **Diff**: ≤ 200 ms for 5k vs 5k components.
+  * **Build‑time**: P95 ≤ 3–5 s on warmed bases (CI generator).
+  * **Post‑build delta**: P95 ≤ 10 s for 200 MB images with cache hit.
+  * **Emit**: CycloneDX Protobuf ≤ 150 ms for 5k components; JSON ≤ 500 ms.
+  * **Diff**: ≤ 200 ms for 5k vs 5k components.

 ---

 ## 10) Security posture

-* **AuthN**: Authority‑issued short OpToks (DPoP/mTLS).
+* **AuthN**: Authority‑issued short OpToks (DPoP/mTLS).
 * **AuthZ**: scopes (`scanner.scan`, `scanner.export`, `scanner.catalog.read`).
 * **mTLS** to **Signer**/**Attestor**; only **Signer** can sign.
 * **No network fetches** during analysis (except registry pulls and optional Rekor index reads).
+* **Sandboxing**: non‑root containers; read‑only FS; seccomp profiles; disable execution of scanned content.
+* **Release integrity**: all first‑party images are **cosign‑signed**; Workers/WebService self‑verify at startup.
+* **Sandboxing**: nonâ€‘root containers; readâ€‘only FS; seccomp profiles; disable execution of scanned content.
+* **Release integrity**: all firstâ€‘party images are **cosignâ€‘signed**; Workers/WebService selfâ€‘verify at startup.

 ---

@@ -451,8 +454,8 @@ scanner:
  * `scanner.layer_cache_hits_total`, `scanner.file_cas_hits_total`
  * `scanner.artifact_bytes_total{format}`
  * `scanner.attestation_latency_seconds`, `scanner.rekor_failures_total`
-  * `scanner_analyzer_golang_heuristic_total{indicator,version_hint}` — increments whenever the Go analyzer falls back to heuristics (build-id or runtime markers). Grafana panel: `sum by (indicator) (rate(scanner_analyzer_golang_heuristic_total[5m]))`; alert when the rate is ≥ 1 for 15 minutes to highlight unexpected stripped binaries.
-* **Tracing**: spans for acquire→union→analyzers→compose→emit→sign→log.
+  * `scanner_analyzer_golang_heuristic_total{indicator,version_hint}` — increments whenever the Go analyzer falls back to heuristics (build-id or runtime markers). Grafana panel: `sum by (indicator) (rate(scanner_analyzer_golang_heuristic_total[5m]))`; alert when the rate is ≥ 1 for 15 minutes to highlight unexpected stripped binaries.
+* **Tracing**: spans for acquire→union→analyzers→compose→emit→sign→log.
 * **Audit logs**: DSSE requests log `license_id`, `image_digest`, `artifactSha256`, `policy_digest?`, Rekor UUID on success.

 ---
@@ -461,12 +464,12 @@ scanner:

 * **Analyzer contracts:** see `language-analyzers-contract.md` for cross-analyzer identity safety, evidence locators, and container layout rules. Per-analyzer docs: `analyzers-java.md`, `dotnet-analyzer.md`, `analyzers-python.md`, `analyzers-node.md`, `analyzers-bun.md`, `analyzers-go.md`. Implementation: `docs/implplan/SPRINT_0408_0001_0001_scanner_language_detection_gaps_program.md`.

-* **Determinism:** given same image + analyzers → byte‑identical **CDX Protobuf**; JSON normalized.
-* **OS packages:** ground‑truth images per distro; compare to package DB.
-* **Lang ecosystems:** sample images per ecosystem (Java/Node/Python/Go/.NET/Rust) with installed metadata; negative tests w/ lockfile‑only.
-* **Native & EntryTrace:** ELF graph correctness; shell AST cases (includes, run‑parts, exec, case/if).
-* **Diff:** layer attribution against synthetic two‑image sequences.
-* **Performance:** cold vs warm cache; large `node_modules` and `site‑packages`.
+* **Determinism:** given same image + analyzers → byte‑identical **CDX Protobuf**; JSON normalized.
+* **OS packages:** ground‑truth images per distro; compare to package DB.
+* **Lang ecosystems:** sample images per ecosystem (Java/Node/Python/Go/.NET/Rust) with installed metadata; negative tests w/ lockfile‑only.
+* **Native & EntryTrace:** ELF graph correctness; shell AST cases (includes, run‑parts, exec, case/if).
+* **Diff:** layer attribution against synthetic two‑image sequences.
+* **Performance:** cold vs warm cache; large `node_modules` and `site‑packages`.
 * **Security:** ensure no code execution from image; fuzz parser inputs; path traversal resistance on layer extract.

 ---
@@ -474,16 +477,16 @@ scanner:
 ## 13) Failure modes & degradations

 * **Missing OS DB** (files exist, DB removed): record **files**; do **not** fabricate package components; emit `bin:{sha256}` where unavoidable; flag in evidence.
-* **Unreadable metadata** (corrupt dist‑info): record file evidence; skip component creation; annotate.
+* **Unreadable metadata** (corrupt dist‑info): record file evidence; skip component creation; annotate.
 * **Dynamic shell constructs**: mark unresolved edges with reasons (env var unknown) and continue; **Usage** view may be partial.
 * **Registry rate limits**: honor backoff; queue job retries with jitter.
 * **Signer refusal** (license/plan/version): scan completes; artifact produced; **no attestation**; WebService marks result as **unverified**.

 ---

-## 14) Optional plug‑ins (off by default)
+## 14) Optional plug‑ins (off by default)

-* **Patch‑presence detector** (signature‑based backport checks). Reads curated function‑level signatures from advisories; inspects binaries for patched code snippets to lower false‑positives for backported fixes. Runs as a sidecar analyzer that **annotates** components; never overrides core identities.
+* **Patch‑presence detector** (signature‑based backport checks). Reads curated function‑level signatures from advisories; inspects binaries for patched code snippets to lower false‑positives for backported fixes. Runs as a sidecar analyzer that **annotates** components; never overrides core identities.
 * **Runtime probes** (with Zastava): when allowed, compare **/proc/<pid>/maps** (DSOs actually loaded) with static **Usage** view for precision.

 ---
@@ -506,14 +509,14 @@ scanner:

 ## 17) Roadmap (Scanner)

-* **M2**: Windows containers (MSI/SxS/GAC analyzers), PE/Mach‑O native analyzer, deeper Rust metadata.
-* **M2**: Buildx generator GA (certified external registries), cross‑registry trust policies.
-* **M3**: Patch‑presence plug‑in GA (opt‑in), cross‑image corpus clustering (evidence‑only; not identity).
+* **M2**: Windows containers (MSI/SxS/GAC analyzers), PE/Mach‑O native analyzer, deeper Rust metadata.
+* **M2**: Buildx generator GA (certified external registries), cross‑registry trust policies.
+* **M3**: Patch‑presence plug‑in GA (opt‑in), cross‑image corpus clustering (evidence‑only; not identity).
 * **M3**: Advanced EntryTrace (POSIX shell features breadth, busybox detection).

 ---

-### Appendix A — EntryTrace resolution (pseudo)
+### Appendix A — EntryTrace resolution (pseudo)

 ```csharp
 ResolveEntrypoint(ImageConfig cfg, RootFs fs):
@@ -544,9 +547,9 @@ ResolveEntrypoint(ImageConfig cfg, RootFs fs):
  return Unknown(reason)
 ```

-### Appendix A.1 — EntryTrace Explainability
+### Appendix A.1 — EntryTrace Explainability

-### Appendix A.0 — Replay / Record mode
+### Appendix A.0 — Replay / Record mode

 - WebService ships a **RecordModeService** that assembles replay manifests (schema v1) with policy/feed/tool pins and reachability references, then writes deterministic input/output bundles to the configured object store (RustFS default, S3/Minio fallback) under `replay/<head>/<digest>.tar.zst`.
 - Bundles contain canonical manifest JSON plus inputs (policy/feed/tool/analyzer digests) and outputs (SBOM, findings, optional VEX/logs); CAS URIs follow `cas://replay/...` and are attached to scan snapshots as `ReplayArtifacts`.
@@ -567,12 +570,12 @@ EntryTrace emits structured diagnostics and metrics so operators can quickly und

 Diagnostics drive two metrics published by `EntryTraceMetrics`:

- `entrytrace_resolutions_total{outcome}` — resolution attempts segmented by outcome (`resolved`, `partiallyresolved`, `unresolved`).
- `entrytrace_unresolved_total{reason}` — diagnostic counts keyed by reason.
+- `entrytrace_resolutions_total{outcome}` — resolution attempts segmented by outcome (`resolved`, `partiallyresolved`, `unresolved`).
+- `entrytrace_unresolved_total{reason}` — diagnostic counts keyed by reason.

 Structured logs include `entrytrace.path`, `entrytrace.command`, `entrytrace.reason`, and `entrytrace.depth`, all correlated with scan/job IDs. Timestamps are normalized to UTC (microsecond precision) to keep DSSE attestations and UI traces explainable.

-### Appendix B — BOM‑Index sidecar
+### Appendix B — BOM‑Index sidecar

 ```
 struct Header { magic, version, imageDigest, createdAt }
--- a/docs/modules/scanner/byos-ingestion.md
+++ b/docs/modules/scanner/byos-ingestion.md
@@ -0,0 +1,33 @@
+# BYOS SBOM ingestion
+
+## Overview
+- Accepts external SBOMs and runs them through validation, normalization, and analysis triggers.
+- Stores the SBOM artifact in the scanner object store and records provenance metadata.
+- Emits a deterministic analysis job id tied to the upload metadata.
+
+## API
+- `POST /api/v1/sbom/upload`
+- `GET /api/v1/sbom/uploads/{sbomId}`
+
+Example request:
+```json
+{
+  "artifactRef": "example.com/app:1.0",
+  "sbomBase64": "<base64>",
+  "format": "cyclonedx",
+  "source": { "tool": "syft", "version": "1.0.0" }
+}
+```
+
+## Supported formats
+- CycloneDX JSON 1.4-1.6 (`bomFormat`, `specVersion`)
+- SPDX JSON 2.3 (`spdxVersion`)
+- SPDX JSON 3.0 (structural checks only; schema validation pending)
+
+## CLI
+`stella sbom upload --file sbom.json --artifact example.com/app:1.0`
+
+## Troubleshooting
+- Missing format: ensure the SBOM includes `bomFormat` (CycloneDX) or `spdxVersion` (SPDX).
+- Unsupported versions: CycloneDX must be 1.4-1.6; SPDX must be 2.3 or 3.0.
+- Empty component lists are accepted but reduce quality scores.
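
The format rules above can be sketched as a small detection routine. This is only an illustration, not the Scanner's actual validator: the function name `detect_sbom_format`, the error messages, and the `SPDX-2.3` / `SPDX-3.0` spellings of `spdxVersion` are assumptions made for the example.

```python
import json

# Versions taken from the "Supported formats" list above.
CYCLONEDX_VERSIONS = {"1.4", "1.5", "1.6"}
# Assumed spelling of the spdxVersion values; the source only says "2.3" and "3.0".
SPDX_VERSIONS = {"SPDX-2.3", "SPDX-3.0"}


def detect_sbom_format(raw: str) -> tuple[str, str]:
    """Return (format, version) for an uploaded SBOM, or raise ValueError."""
    doc = json.loads(raw)
    if not isinstance(doc, dict):
        raise ValueError("SBOM must be a JSON object")

    if "bomFormat" in doc:
        version = str(doc.get("specVersion", ""))
        if doc["bomFormat"] != "CycloneDX" or version not in CYCLONEDX_VERSIONS:
            raise ValueError(f"unsupported CycloneDX version: {version!r}")
        return ("cyclonedx", version)

    if "spdxVersion" in doc:
        version = str(doc["spdxVersion"])
        if version not in SPDX_VERSIONS:
            raise ValueError(f"unsupported SPDX version: {version!r}")
        return ("spdx", version.removeprefix("SPDX-"))

    # Mirrors the "Missing format" troubleshooting entry.
    raise ValueError("missing format: expected bomFormat or spdxVersion")
```

A caller would decode `sbomBase64` first and pass the resulting JSON text to this function before storing the artifact.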
--- a/docs/modules/scanner/reachability-drift.md
+++ b/docs/modules/scanner/reachability-drift.md
@@ -1,73 +1,42 @@
-# Reachability Drift Detection - Architecture
+# Reachability Drift Detection - Architecture

 **Module:** Scanner
 **Version:** 1.0
-**Status:** Implemented (Sprint 3600.2-3600.3)
+**Status:** Implemented (core drift engine + API; Node Babel integration pending)
 **Last Updated:** 2025-12-22

 ---

 ## 1. Overview

-Reachability Drift Detection tracks function-level reachability changes between scans to identify when code modifications create new paths to vulnerable sinks or mitigate existing risks. This enables security teams to:
+Reachability Drift Detection tracks function-level reachability changes between scans. It highlights when code changes create new paths to sensitive sinks or remove existing paths, producing deterministic evidence for triage and VEX workflows.

- **Detect regressions** when previously unreachable vulnerabilities become exploitable
- **Validate fixes** by confirming vulnerable code paths are removed
- **Prioritize triage** based on actual exploitability rather than theoretical risk
- **Automate VEX** by generating evidence-backed justifications
+Key outcomes:
+- Detect regressions when previously unreachable sinks become reachable.
+- Validate mitigations when reachable sinks become unreachable.
+- Provide deterministic evidence for audit and policy decisions.

 ---

 ## 2. Key Concepts

 ### 2.1 Call Graph
-
-A directed graph representing function/method call relationships in source code:
-
- **Nodes**: Functions, methods, lambdas with metadata (file, line, visibility)
- **Edges**: Call relationships with call kind (direct, virtual, delegate, reflection, dynamic)
- **Entrypoints**: Public-facing functions (HTTP handlers, CLI commands, message consumers)
- **Sinks**: Security-sensitive APIs (command execution, SQL, file I/O, deserialization)
+A directed graph of function calls:
+- Nodes: functions, methods, lambdas with file and line metadata.
+- Edges: call relationships (direct, virtual, dynamic).
+- Entrypoints: public handlers (HTTP, CLI, background services).
+- Sinks: security-sensitive APIs from the sink registry.

 ### 2.2 Reachability Analysis
-
-Multi-source BFS traversal from entrypoints to determine which sinks are exploitable:
-
-```
-Entrypoints (HTTP handlers, CLI)
-        │
-        ▼ BFS traversal
-    [Application Code]
-        │
-        ▼
-    Sinks (exec, query, writeFile)
-        │
-        ▼
-    Reachable = TRUE if path exists
-```
+Multi-source traversal from entrypoints to sinks to determine exploitability.
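
A minimal sketch of such a multi-source traversal, assuming the call graph is available as a plain adjacency mapping. The function name `reachable_sinks` and the data shapes are hypothetical and are not taken from `ReachabilityAnalyzer`; breadth-first order is used here because the earlier revision of this section described a BFS.

```python
from collections import deque


def reachable_sinks(edges, entrypoints, sinks):
    """Breadth-first traversal seeded with every entrypoint at once.

    edges: dict mapping a caller node id to an iterable of callee node ids.
    Returns the sorted list of sink ids reachable from any entrypoint.
    """
    visited = set(entrypoints)
    queue = deque(sorted(visited))  # sorted seeds keep the visit order stable
    while queue:
        node = queue.popleft()
        for callee in sorted(edges.get(node, ())):
            if callee not in visited:
                visited.add(callee)
                queue.append(callee)
    return sorted(visited & set(sinks))
```

Seeding the queue with all entrypoints means each node is visited at most once, however many entrypoints lead to it.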

 ### 2.3 Drift Detection
-
-Compares reachability between two scans (base vs head):
-
-| Transition | Direction | Risk Impact |
-|------------|-----------|-------------|
-| Unreachable → Reachable | `became_reachable` | **Increased** - New exploit path |
-| Reachable → Unreachable | `became_unreachable` | **Decreased** - Mitigation applied |
+Compares reachability between base and head scans:
+- `became_reachable`: risk increased (new path to sink).
+- `became_unreachable`: risk decreased (path removed or mitigated).
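
The two directions above amount to a set difference over the reachable sinks of each scan. The sketch below assumes each scan has already been reduced to a collection of reachable sink ids; `classify_drift` is a hypothetical helper, not the `ReachabilityDriftDetector` API.

```python
def classify_drift(base_reachable, head_reachable):
    """Compare the reachable sinks of a base scan and a head scan."""
    base, head = set(base_reachable), set(head_reachable)
    return {
        # Reachable in head only: risk increased.
        "became_reachable": sorted(head - base),
        # Reachable in base only: risk decreased.
        "became_unreachable": sorted(base - head),
    }
```

Sinks that are reachable in both scans, or in neither, do not appear in the result, and sorting keeps the output stable between runs.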

 ### 2.4 Cause Attribution
-
-Explains *why* drift occurred by correlating with code changes:
-
-| Cause Kind | Description | Example |
-|------------|-------------|---------|
-| `guard_removed` | Conditional check removed | `if (!authorized)` deleted |
-| `guard_added` | New conditional blocks path | Added null check |
-| `new_public_route` | New entrypoint created | Added `/api/admin` endpoint |
-| `visibility_escalated` | Internal → Public | Method made public |
-| `dependency_upgraded` | Library update changed behavior | lodash 4.x → 5.x |
-| `symbol_removed` | Function deleted | Removed vulnerable helper |
-| `unknown` | Cannot determine | Multiple simultaneous changes |
+Explains why drift happened by correlating code changes with paths.

 ---

@@ -75,36 +44,15 @@ Explains *why* drift occurred by correlating with code changes:

 ```mermaid
 flowchart TD
-    subgraph Scan["Scan Execution"]
-        A[Source Code] --> B[Call Graph Extractor]
-        B --> C[CallGraphSnapshot]
-    end
-
-    subgraph Analysis["Drift Analysis"]
-        C --> D[Reachability Analyzer]
-        D --> E[ReachabilityResult]
-
-        F[Base Scan Graph] --> G[Drift Detector]
-        E --> G
-        H[Code Changes] --> G
-        G --> I[ReachabilityDriftResult]
-    end
-
-    subgraph Output["Output"]
-        I --> J[Path Compressor]
-        J --> K[Compressed Paths]
-        I --> L[Cause Explainer]
-        L --> M[Drift Causes]
-
-        K --> N[Storage/API]
-        M --> N
-    end
-
-    subgraph Integration["Integration"]
-        N --> O[Policy Gates]
-        N --> P[VEX Emission]
-        N --> Q[Web UI]
-    end
+    A[Source or binary] --> B[Call graph extractor]
+    B --> C[CallGraphSnapshot]
+    C --> D[Reachability analyzer]
+    D --> E[ReachabilityResult]
+    C --> F[Code change extractor]
+    E --> G[ReachabilityDriftDetector]
+    F --> G
+    G --> H[ReachabilityDriftResult]
+    H --> I[Storage + API]
 ```

 ---
@@ -113,259 +61,109 @@ flowchart TD

 ### 4.1 Call Graph Extractors

-Per-language AST analysis producing `CallGraphSnapshot`:
+Registered extractors are configured in `CallGraphServiceCollectionExtensions`.

-| Language | Extractor | Technology | Status |
-|----------|-----------|------------|--------|
-| .NET | `DotNetCallGraphExtractor` | Roslyn semantic model | **Done** |
-| Java | `JavaCallGraphExtractor` | ASM bytecode analysis | **Done** |
-| Go | `GoCallGraphExtractor` | golang.org/x/tools SSA | **Done** |
-| Python | `PythonCallGraphExtractor` | Python AST | **Done** |
-| Node.js | `NodeCallGraphExtractor` | Babel (planned) | Skeleton |
-| PHP | `PhpCallGraphExtractor` | php-parser | **Done** |
-| Ruby | `RubyCallGraphExtractor` | parser gem | **Done** |
-
-**Location:** `src/Scanner/__Libraries/StellaOps.Scanner.CallGraph/Extraction/`
+| Language | Extractor | Status | Notes |
+|---|---|---|---|
+| .NET | `DotNetCallGraphExtractor` | Registered | Roslyn semantic model. |
+| Node.js | `NodeCallGraphExtractor` | Registered (placeholder) | Trace-based fallback; Babel integration pending (Sprint 3600.0004). |
+| Java | `JavaCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| Go | `GoCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| Python | `PythonCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| PHP | `PhpCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| Ruby | `RubyCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| JavaScript | `JavaScriptCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| Bun | `BunCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| Deno | `DenoCallGraphExtractor` | Library present, not wired | Register extractor to enable. |
+| Binary | `BinaryCallGraphExtractor` | Library present, not wired | Native call edge extraction. |

 ### 4.2 Reachability Analyzer
-
-Multi-source BFS from entrypoints to sinks:
-
-```csharp
-public sealed class ReachabilityAnalyzer
-{
-    public ReachabilityResult Analyze(CallGraphSnapshot graph);
-}
-
-public record ReachabilityResult
-{
-    ImmutableHashSet<string> ReachableNodes { get; }
-    ImmutableArray<string> ReachableSinks { get; }
-    ImmutableDictionary<string, ImmutableArray<string>> ShortestPaths { get; }
-}
-```
-
-**Location:** `src/Scanner/__Libraries/StellaOps.Scanner.CallGraph/Analysis/`
+Located in `src/Scanner/__Libraries/StellaOps.Scanner.CallGraph/Analysis/`.

 ### 4.3 Drift Detector
+`ReachabilityDriftDetector` compares base and head snapshots and produces `ReachabilityDriftResult` with compressed paths.

-Compares base and head graphs:
-
-```csharp
-public sealed class ReachabilityDriftDetector
-{
-    public ReachabilityDriftResult Detect(
-        CallGraphSnapshot baseGraph,
-        CallGraphSnapshot headGraph,
-        IReadOnlyList<CodeChangeFact> codeChanges);
-}
-```
-
-**Location:** `src/Scanner/__Libraries/StellaOps.Scanner.ReachabilityDrift/Services/`
-
-### 4.4 Path Compressor
-
-Reduces full paths to key nodes for storage/display:
-
-```
-Full Path (20 nodes):
-  entrypoint → A → B → C → ... → X → Y → sink
-
-Compressed Path:
-  entrypoint → [changed: B] → [changed: X] → sink
-  (intermediateCount: 17)
-```
-
-**Location:** `src/Scanner/__Libraries/StellaOps.Scanner.ReachabilityDrift/Services/PathCompressor.cs`
-
-### 4.5 Cause Explainer
-
-Correlates drift with code changes:
-
-```csharp
-public sealed class DriftCauseExplainer
-{
-    public DriftCause Explain(...);
-    public DriftCause ExplainUnreachable(...);
-}
-```
-
-**Location:** `src/Scanner/__Libraries/StellaOps.Scanner.ReachabilityDrift/Services/DriftCauseExplainer.cs`
+### 4.4 Path Compressor and Cause Explainer
+- `PathCompressor` reduces paths to key nodes and optionally includes full paths.
+- `DriftCauseExplainer` correlates changes to explain why drift happened.

 ---

 ## 5. Language Support Matrix

-| Feature | .NET | Java | Go | Python | Node.js | PHP | Ruby |
-|---------|------|------|-------|--------|---------|-----|------|
-| Function extraction | Yes | Yes | Yes | Yes | Partial | Yes | Yes |
-| Call edge extraction | Yes | Yes | Yes | Yes | Partial | Yes | Yes |
-| HTTP entrypoints | ASP.NET | Spring | net/http | Flask/Django | Express* | Laravel | Rails |
-| gRPC entrypoints | Yes | Yes | Yes | Yes | No | No | No |
-| CLI entrypoints | Yes | Yes | Yes | Yes | Partial | Yes | Yes |
-| Sink detection | Yes | Yes | Yes | Yes | Partial | Yes | Yes |
-
-*Requires Sprint 3600.4 completion
+| Capability | .NET | Node.js | Others (Java/Go/Python/PHP/Ruby/JS/Bun/Deno/Binary) |
+|---|---|---|---|
+| Call graph extraction | Supported | Placeholder | Library present, not wired |
+| Entrypoint detection | Supported | Partial | Library present, not wired |
+| Sink detection | Supported | Partial | Library present, not wired |

 ---

 ## 6. Storage Schema

-### 6.1 PostgreSQL Tables
+Migrations are in `src/Scanner/__Libraries/StellaOps.Scanner.Storage/Postgres/Migrations/`.

-**call_graph_snapshots:**
-```sql
-CREATE TABLE call_graph_snapshots (
-    id UUID PRIMARY KEY,
-    tenant_id UUID NOT NULL,
-    scan_id TEXT NOT NULL,
-    language TEXT NOT NULL,
-    graph_digest TEXT NOT NULL,
-    node_count INT NOT NULL,
-    edge_count INT NOT NULL,
-    entrypoint_count INT NOT NULL,
-    sink_count INT NOT NULL,
-    extracted_at TIMESTAMPTZ NOT NULL,
-    snapshot_json JSONB NOT NULL
-);
-```
-
-**reachability_drift_results:**
-```sql
-CREATE TABLE reachability_drift_results (
-    id UUID PRIMARY KEY,
-    tenant_id UUID NOT NULL,
-    base_scan_id TEXT NOT NULL,
-    head_scan_id TEXT NOT NULL,
-    language TEXT NOT NULL,
-    newly_reachable_count INT NOT NULL,
-    newly_unreachable_count INT NOT NULL,
-    detected_at TIMESTAMPTZ NOT NULL,
-    result_digest TEXT NOT NULL
-);
-```
-
-**drifted_sinks:**
-```sql
-CREATE TABLE drifted_sinks (
-    id UUID PRIMARY KEY,
-    tenant_id UUID NOT NULL,
-    drift_result_id UUID NOT NULL REFERENCES reachability_drift_results(id),
-    sink_node_id TEXT NOT NULL,
-    symbol TEXT NOT NULL,
-    sink_category TEXT NOT NULL,
-    direction TEXT NOT NULL,
-    cause_kind TEXT NOT NULL,
-    cause_description TEXT NOT NULL,
-    compressed_path JSONB NOT NULL,
-    associated_vulns JSONB
-);
-```
-
-**code_changes:**
-```sql
-CREATE TABLE code_changes (
-    id UUID PRIMARY KEY,
-    tenant_id UUID NOT NULL,
-    scan_id TEXT NOT NULL,
-    base_scan_id TEXT NOT NULL,
-    language TEXT NOT NULL,
-    file TEXT NOT NULL,
-    symbol TEXT NOT NULL,
-    change_kind TEXT NOT NULL,
-    details JSONB,
-    detected_at TIMESTAMPTZ NOT NULL
-);
-```
-
-### 6.2 Valkey Caching
-
-```
-stella:callgraph:{scan_id}:{lang}:{digest}     → Compressed CallGraphSnapshot
-stella:callgraph:{scan_id}:{lang}:reachable    → Set of reachable sink IDs
-stella:callgraph:{scan_id}:{lang}:paths:{sink} → Shortest path to sink
-```
-
-TTL: Configurable (default 24h)
-Circuit breaker: 5 failures → 30s timeout
+Core tables:
+- `call_graph_snapshots`: `scan_id`, `language`, `graph_digest`, `extracted_at`, `node_count`, `edge_count`, `entrypoint_count`, `sink_count`, `snapshot_json`.
+- `reachability_results`: `scan_id`, `language`, `graph_digest`, `result_digest`, `computed_at`, `reachable_node_count`, `reachable_sink_count`, `result_json`.
+- `code_changes`: `scan_id`, `base_scan_id`, `language`, `node_id`, `file`, `symbol`, `change_kind`, `details`, `detected_at`.
+- `reachability_drift_results`: `base_scan_id`, `head_scan_id`, `language`, `newly_reachable_count`, `newly_unreachable_count`, `detected_at`, `result_digest`.
+- `drifted_sinks`: `drift_result_id`, `sink_node_id`, `sink_category`, `direction`, `cause_kind`, `cause_description`, `compressed_path`, `associated_vulns`.
+- `material_risk_changes`: extended with `base_scan_id`, `cause`, `cause_kind`, `path_nodes`, `associated_vulns` for drift attachments.

 ---

-## 7. API Endpoints
+## 7. Cache and Determinism
+
+If the call graph cache is enabled (`CallGraph:Cache`), cached keys follow this pattern:
+- `callgraph:graph:{scanId}:{language}`
+- `callgraph:reachability:{scanId}:{language}`
+
+Determinism is enforced by stable ordering and deterministic IDs (see `DeterministicIds`).
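
As an illustration only, the key pattern and the idea of order-independent digests could look like the sketch below. The helper names, the canonical JSON layout, and the `sha256:` prefix are assumptions for this example and do not describe the actual `DeterministicIds` implementation.

```python
import hashlib
import json


def cache_keys(scan_id: str, language: str) -> dict:
    """Build the two cache keys following the pattern listed above."""
    return {
        "graph": f"callgraph:graph:{scan_id}:{language}",
        "reachability": f"callgraph:reachability:{scan_id}:{language}",
    }


def graph_digest(nodes, edges) -> str:
    """Hash a graph so that input ordering does not affect the result."""
    canonical = json.dumps(
        {
            "nodes": sorted(nodes),
            "edges": sorted([src, dst] for src, dst in edges),
        },
        sort_keys=True,
        separators=(",", ":"),
    )
    return "sha256:" + hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Sorting before serialization is what makes the digest independent of the order in which an extractor happened to emit nodes and edges.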
+
+---
+
+## 8. API Endpoints
+
+Base path: `/api/v1`

 | Method | Path | Description |
-|--------|------|-------------|
-| GET | `/scans/{scanId}/drift` | Get drift results for a scan |
-| GET | `/drift/{driftId}/sinks` | List drifted sinks (paginated) |
-| POST | `/scans/{scanId}/compute-reachability` | Trigger reachability computation |
-| GET | `/scans/{scanId}/reachability/components` | List components with reachability |
-| GET | `/scans/{scanId}/reachability/findings` | Get reachable vulnerable sinks |
-| GET | `/scans/{scanId}/reachability/explain` | Explain why a sink is reachable |
+|---|---|---|
+| GET | `/scans/{scanId}/drift` | Get or compute drift results for a scan. |
+| GET | `/drift/{driftId}/sinks` | List drifted sinks (paged). |
+| POST | `/scans/{scanId}/compute-reachability` | Trigger reachability computation. |
+| GET | `/scans/{scanId}/reachability/components` | List components with reachability. |
+| GET | `/scans/{scanId}/reachability/findings` | List findings with reachability. |
+| GET | `/scans/{scanId}/reachability/explain` | Explain reachability for a CVE and PURL. |

-See: `docs/api/scanner-drift-api.md`
+See `docs/api/scanner-drift-api.md` for details.

 ---

-## 8. Integration Points
+## 9. Integration Points

-### 8.1 Policy Module
-
-Drift results feed into policy gates for CI/CD blocking:
-
-```yaml
-smart_diff:
-  gates:
-    - condition: "delta_reachable > 0 AND is_kev = true"
-      action: block
-```
-
-### 8.2 VEX Emission
-
-Automatic VEX candidate generation on drift:
-
-| Drift Direction | VEX Status | Justification |
-|-----------------|------------|---------------|
-| became_unreachable | `not_affected` | `vulnerable_code_not_in_execute_path` |
-| became_reachable | — | Requires manual review |
-
-### 8.3 Attestation
-
-DSSE-signed drift attestations:
-
-```json
-{
-  "_type": "https://in-toto.io/Statement/v1",
-  "predicateType": "stellaops.dev/predicates/reachability-drift@v1",
-  "predicate": {
-    "baseScanId": "abc123",
-    "headScanId": "def456",
-    "newlyReachable": [...],
-    "newlyUnreachable": [...],
-    "resultDigest": "sha256:..."
-  }
-}
-```
+- Policy gates: planned in `SPRINT_3600_0005_0001_policy_ci_gate_integration.md`.
+- VEX candidate emission: planned alongside policy gates.
+- Attestation: `StellaOps.Scanner.ReachabilityDrift.Attestation` provides DSSE signing utilities (integration is optional).

 ---

-## 9. Performance Characteristics
+## 10. Performance Characteristics (Targets)

 | Metric | Target | Notes |
-|--------|--------|-------|
-| Graph extraction (100K LOC) | < 60s | Per language |
-| Reachability analysis | < 5s | BFS traversal |
-| Drift detection | < 10s | Graph comparison |
-| Memory usage | < 2GB | Large projects |
-| Cache hit improvement | 10x | Valkey lookup vs recompute |
+|---|---|---|
+| Call graph extraction (100K LOC) | < 60s | Per language extractor. |
+| Reachability analysis | < 5s | BFS traversal on trimmed graphs. |
+| Drift detection | < 10s | Graph comparison and compression. |
+| Cache hit improvement | 10x | Valkey cache vs recompute. |

 ---

-## 10. References
+## 11. References

- **Implementation Sprints:**
-  - `docs/implplan/SPRINT_3600_0002_0001_call_graph_infrastructure.md`
-  - `docs/implplan/SPRINT_3600_0003_0001_drift_detection_engine.md`
- **API Reference:** `docs/api/scanner-drift-api.md`
- **Operations Guide:** `docs/operations/reachability-drift-guide.md`
- **Original Advisory:** `docs/product-advisories/archived/17-Dec-2025 - Reachability Drift Detection.md`
- **Source Code:** `src/Scanner/__Libraries/StellaOps.Scanner.ReachabilityDrift/`
+- `docs/implplan/archived/SPRINT_3600_0002_0001_call_graph_infrastructure.md`
+- `docs/implplan/archived/SPRINT_3600_0003_0001_drift_detection_engine.md`
+- `docs/api/scanner-drift-api.md`
+- `docs/operations/reachability-drift-guide.md`
+- `docs/product-advisories/archived/17-Dec-2025 - Reachability Drift Detection.md`
+- `src/Scanner/__Libraries/StellaOps.Scanner.ReachabilityDrift/`