# Graph Fixtures Plan (SAMPLES-GRAPH-24-003)

## Goals
- Produce a deterministic, large-scale SBOM graph fixture (~40k nodes) with a policy overlay snapshot for perf/regression suites (UI/CLI/Graph API).
- Align with the current graph node/edge schema and overlay format used by `StellaOps.Graph.Indexer` and Vulnerability Explorer.
- Ensure offline parity: fixtures are packaged for Offline Kit consumption (NDJSON + manifest hashes).

## Assumptions / Pending confirmations
- Overlay format: waiting on Graph Guild decision (checkpoint 2025-11-22) regarding overlay fields and snapshot manifest layout.
- SBOM bundle source: use scanner surface mock bundle v1 unless real caches land; confirm with Graph Guild.
- Tenant: default to `demo-tenant` unless advised otherwise; all IDs and timestamps must be deterministic.
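
One way to meet the determinism requirement is to derive IDs from stable inputs and to pin timestamps, rather than reading the clock or a random source. The Python sketch below illustrates the idea; the `node_id` helper, the 32-character truncation, and the `FIXED_TIMESTAMP` value are hypothetical and not taken from the Graph Indexer code base.

```python
import hashlib
from datetime import datetime, timezone

TENANT = "demo-tenant"  # default tenant named in the assumptions above

# Hypothetical pinned timestamp; the real value would be agreed with the fixture owners.
FIXED_TIMESTAMP = datetime(2025, 1, 1, tzinfo=timezone.utc)


def node_id(tenant: str, kind: str, name: str, version: str) -> str:
    """Derive a stable ID by hashing the node's identifying fields."""
    # A NUL separator keeps ("ab", "c") and ("a", "bc") from colliding.
    key = "\x00".join((tenant, kind, name, version)).encode("utf-8")
    return hashlib.sha256(key).hexdigest()[:32]


def fixed_timestamp() -> str:
    """Return the pinned UTC timestamp, formatted to whole seconds."""
    return FIXED_TIMESTAMP.strftime("%Y-%m-%dT%H:%M:%SZ")
```

Because the ID depends only on its inputs, regenerating the fixture on another machine yields the same IDs, which is what makes the manifest hashes comparable across runs.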

## Proposed fixture contents
- `nodes.ndjson`: ~40k nodes, sorted by id; includes artifact, package, and relationship nodes.
- `edges.ndjson`: matching edges, sorted by id.
- `overlays/policy.ndjson`: policy overlay snapshot aligned with the chosen overlay schema.
- `manifest.json`: SHA-256 hashes of all files plus counts; UTC timestamps rounded to seconds.
- `README.md`: execution and verification steps, expected counts/hashes.
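
The manifest layout is still pending, so the following is only a sketch of how per-file SHA-256 hashes and record counts could be collected. The function names and the manifest keys (`generatedAt`, `files`, `sha256`, `count`) are assumptions, not the agreed format. Note that the sketch truncates to whole seconds rather than rounding.

```python
import hashlib
import json
from datetime import datetime, timezone
from pathlib import Path


def build_manifest(root: Path, names: list[str], generated_at: datetime) -> dict:
    """Hash each fixture file and count its non-empty NDJSON lines."""
    files = {}
    for name in sorted(names):
        data = (root / name).read_bytes()
        files[name] = {
            "sha256": hashlib.sha256(data).hexdigest(),
            "count": sum(1 for line in data.splitlines() if line.strip()),
        }
    # Drop sub-second precision so the timestamp is stable to the second.
    stamp = generated_at.astimezone(timezone.utc).replace(microsecond=0)
    return {"generatedAt": stamp.strftime("%Y-%m-%dT%H:%M:%SZ"), "files": files}


def write_manifest(root: Path, manifest: dict) -> None:
    """Serialize with sorted keys so the manifest bytes are reproducible."""
    text = json.dumps(manifest, sort_keys=True, indent=2) + "\n"
    (root / "manifest.json").write_bytes(text.encode("utf-8"))
```

Passing `generated_at` in explicitly, instead of calling `datetime.now()` inside the function, lets the generator supply a pinned value so that the manifest itself hashes identically between runs.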

## Generation sketch
1) Start from the existing mock SBOM bundle (scanner surface v1); a sample driver script will:
   - deterministically seed random generators;
   - produce nodes/edges via Graph Indexer schema helpers;
   - emit overlays using placeholder policy verdicts (allow/deny/defer) until the final schema is confirmed.
2) Write NDJSON with stable ordering; compute SHA-256 for each file; write the manifest.
3) Run a validation script to assert counts, schema shape, and hash reproducibility.
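
The steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not the planned driver: the real script would call the Graph Indexer schema helpers, whereas `generate_nodes`, `generate_overlays`, and the record fields here (including `nodeId` and `verdict`) are hypothetical stand-ins.

```python
import hashlib
import json
import random

VERDICTS = ("allow", "deny", "defer")  # placeholder verdicts from step 1


def generate_nodes(seed: int, count: int, tenant: str = "demo-tenant") -> list[dict]:
    """Build placeholder package nodes from a locally seeded RNG."""
    rng = random.Random(seed)  # private instance; global random state is untouched
    nodes = []
    for i in range(count):
        nodes.append({
            "id": f"pkg-{i:06d}",
            "kind": "package",
            "name": f"lib{rng.randrange(10_000):04d}",
            "version": f"{rng.randrange(10)}.{rng.randrange(20)}.0",
            "tenant": tenant,
        })
    return nodes


def generate_overlays(seed: int, nodes: list[dict]) -> list[dict]:
    """Attach one placeholder verdict per node (hypothetical overlay shape)."""
    rng = random.Random(seed)
    return [
        {"id": "overlay-" + n["id"], "nodeId": n["id"], "verdict": rng.choice(VERDICTS)}
        for n in nodes
    ]


def to_ndjson(records: list[dict]) -> bytes:
    """Serialize records sorted by id, one compact JSON object per line."""
    lines = [
        json.dumps(r, sort_keys=True, separators=(",", ":"))
        for r in sorted(records, key=lambda r: r["id"])
    ]
    return ("\n".join(lines) + "\n").encode("utf-8")


def sha256_hex(data: bytes) -> str:
    """Return the hex SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


# Reproducibility check: two runs with the same seed must hash identically.
first = sha256_hex(to_ndjson(generate_nodes(seed=42, count=1000)))
second = sha256_hex(to_ndjson(generate_nodes(seed=42, count=1000)))
assert first == second
```

Sorting keys and records before serialization removes ordering as a source of hash drift. Python only guarantees that `random.random()` stays stable across versions for a given seed, so a generator built on `randrange` or `choice` would likely need a pinned interpreter version to keep hashes reproducible.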

## Interim fixtures (delivered 2025-12-01)
- Synthetic deterministic graphs generated under `samples/graph/interim/`:
  - `graph-50k` (50k nodes, ~200k edges)
  - `graph-100k` (100k nodes, ~400k edges)
- Minimal schema (`id, kind, name, version, tenant`), seeded RNG, stable ordering, manifests with hashes.
- Purpose: unblock BENCH-GRAPH-21-001/002 while the overlay format is finalized. Overlays are not included yet.
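
A consumer of the interim fixtures may want to check a node file against the minimal schema before benchmarking. The sketch below is one possible check, assuming each NDJSON line is a JSON object carrying the five fields listed above; `validate_nodes` and its messages are hypothetical and not part of the delivered fixtures.

```python
import json

REQUIRED_FIELDS = {"id", "kind", "name", "version", "tenant"}


def validate_nodes(ndjson_text: str, expected_count: int) -> list[str]:
    """Return a list of problems; an empty list means the file passed."""
    problems = []
    records = [json.loads(line) for line in ndjson_text.splitlines() if line.strip()]
    if len(records) != expected_count:
        problems.append(f"expected {expected_count} nodes, found {len(records)}")
    for index, record in enumerate(records):
        missing = REQUIRED_FIELDS - set(record)
        if missing:
            problems.append(f"line {index + 1}: missing {sorted(missing)}")
    ids = [record.get("id", "") for record in records]
    if ids != sorted(ids):
        problems.append("nodes are not sorted by id")
    if len(set(ids)) != len(ids):
        problems.append("duplicate node ids")
    return problems
```

For `graph-50k`, the call would be `validate_nodes(text, 50_000)`, where `text` is the content of the node file read by the caller. Returning a list of problems instead of raising on the first failure makes it easier to report every mismatch in one run.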

## Open items (to resolve before canonical data generation)
- Confirm overlay field set and file naming (Graph Guild, due 2025-11-22).
- Confirm allowed mock SBOM source list and artifact naming (Graph Guild / SBOM Service Guild).
- Provide expected node/edge cardinality breakdown (packages vs files vs relationships) to guide generation.

## Next steps
- Keep SAMPLES-GRAPH-24-003 blocked until the overlay/schema is confirmed; interim fixtures remain available for benches.
- Once the overlay schema is final, extend the generator to emit overlays + CAS manifests and promote the result to the official fixture.