Files
git.stella-ops.org/docs/technical/testing/testing-strategy-models.md
2026-01-28 02:30:48 +02:00

7.3 KiB

Testing Strategy Models and Lanes (2026)

Source advisory: docs/product/advisories/22-Dec-2026 - Better testing strategy.md
Supersedes/extends: docs/product/advisories/archived/2025-12-21-testing-strategy/20-Dec-2025 - Testing strategy.md

Purpose

  • Define a single testing taxonomy for all StellaOps project types.
  • Make determinism, offline readiness, and evidence integrity testable by default.
  • Align CI lanes with a shared catalog so coverage is visible and enforceable.

Strategy in brief

  • Use test models (L0, S1, C1, W1, WK1, T1, AN1, CLI1, PERF) to encode required test types.
  • Map every module to one or more models in docs/technical/testing/TEST_CATALOG.yml.
  • Run tests through standardized CI lanes (Unit, Contract, Integration, Security, Performance, Live).

Test models (requirements)

  • L0 (Library/Core): unit + property + snapshot + determinism.
  • S1 (Storage/Postgres): migrations + idempotency + concurrency + query ordering.
  • T1 (Transport/Queue): protocol roundtrip + fuzz invalid + delivery semantics.
  • C1 (Connector/External): fixtures + snapshot + resilience + security; optional Live smoke.
  • W1 (WebService/API): contract + authz + OTel trace assertions + negative cases.
  • WK1 (Worker/Indexer): end-to-end job flow + retries + idempotency + telemetry.
  • AN1 (Analyzer/SourceGen): Roslyn harness + diagnostics + golden generated output.
  • CLI1 (Tool/CLI): exit codes + golden output + deterministic formatting.
  • PERF (Benchmark): perf smoke subset + regression thresholds (relative).

Repository foundations

  • TestKit primitives: deterministic time/random, canonical JSON asserts, snapshot helpers, Postgres/Valkey fixtures, OTel capture.
  • Determinism gate: canonical bytes and stable hashes for SBOM/VEX/verdict artifacts.
  • Hybrid reachability posture: graph DSSE mandatory; edge-bundle DSSE optional/targeted with deterministic ordering.
  • Architecture guards: enforce cross-module dependency boundaries (no lattice in Concelier/Excititor).
  • Offline defaults: no network access unless explicitly tagged Live.

CI lanes (standard filters)

  • Unit: fast, offline; includes property and snapshot sub-traits.
  • Contract: schema/OpenAPI stability and response envelopes.
  • Integration: Testcontainers-backed service and storage tests.
  • Security: authz/negative tests and security regressions.
  • Performance: perf smoke and benchmark guards.
  • Live: opt-in upstream connector checks (never PR gating by default).

Documentation moments (when to update)

  • New model or required test type: update docs/technical/testing/TEST_CATALOG.yml.
  • New lane or gate: update docs/technical/testing/TEST_SUITE_OVERVIEW.md and docs/technical/testing/ci-quality-gates.md.
  • Module-specific test policy change: update the module dossier under docs/modules/<module>/.
  • New fixtures or runnable harnesses: place under docs/benchmarks/** or tests/** and link here.
  • Test catalog (source of truth): docs/technical/testing/TEST_CATALOG.yml
  • Test suite overview: docs/technical/testing/TEST_SUITE_OVERVIEW.md
  • Quality guardrails: docs/technical/testing/testing-quality-guardrails-implementation.md
  • Code samples from the advisory: docs/benchmarks/testing/better-testing-strategy-samples.md

Turn #6 Enhancements (2026-01-27)

Source advisory: Testing Enhancements (Automation Turn #6) Sprint: docs/implplan/SPRINT_0127_002_DOCS_testing_enhancements_turn6.md

New test intent categories

Every non-trivial test must declare intent. Intent clarifies why the behavior exists.

public static class TestIntents
{
    public const string Regulatory = "Regulatory";   // Compliance, audit, legal
    public const string Safety = "Safety";           // Security, fail-secure, crypto
    public const string Performance = "Performance"; // Latency, throughput, resources
    public const string Competitive = "Competitive"; // Parity with competitor tools
    public const string Operational = "Operational"; // Observability, operability
}

// Usage
[Trait("Intent", TestIntents.Safety)]
[Trait("Category", "Unit")]
public void Signer_RejectsExpiredCertificate() { /* ... */ }

New test trait categories

Category Purpose Example Usage
Intent Test intent classification [Trait("Intent", "Safety")]
Evidence Evidence chain validation [Trait("Category", "Evidence")]
Observability OTel/log/metrics contracts [Trait("Category", "Observability")]
Longevity Time-extended stability tests [Trait("Category", "Longevity")]
Interop Cross-version/environment skew [Trait("Category", "Interop")]
PostIncident Tests from production incidents [Trait("Category", "PostIncident")]

Updated test model requirements

Model Turn #6 Additions
L0 (Library/Core) + Intent trait required for non-trivial tests
S1 (Storage/Postgres) + Interop tests for schema version migrations
W1 (WebService/API) + Observability contract tests (OTel spans, log fields, metrics)
WK1 (Worker/Indexer) + Longevity tests for memory/connection stability
CLI1 (Tool/CLI) + PostIncident regression tests

New CI lanes

Lane Purpose Cadence Gating
Evidence Evidence chain validation, traceability Per PR PR-gating for regulatory modules
Longevity Time-extended stability tests Nightly Release-gating
Interop Cross-version compatibility Weekly + pre-release Release-gating

Observability contract requirements (W1 model)

WebService tests must validate:

  • OTel spans: required spans exist, attributes present, cardinality bounded.
  • Structured logs: required fields present, no PII, appropriate log levels.
  • Metrics: required metrics exist, label cardinality bounded, counters monotonic.
[Trait("Category", "Observability")]
[Trait("Intent", "Operational")]
public async Task Scanner_EmitsRequiredTelemetry()
{
    using var otel = new OtelCapture();
    await sut.ScanAsync(request);

    OTelContractAssert.HasRequiredSpans(otel, "ScanImage", "ExtractLayers", "AnalyzeSBOM");
    OTelContractAssert.NoHighCardinalityAttributes(otel, threshold: 100);
}

Evidence traceability requirements

Regulatory tests must link to requirements:

[Requirement("REQ-EVIDENCE-001")]
[Trait("Intent", "Regulatory")]
public void EvidenceBundle_IsImmutableAfterSigning() { /* ... */ }

CI generates traceability matrix: requirement -> test -> artifact.

Cross-version testing requirements (Interop)

For modules with schema or API versioning:

  • Test N-1 compatibility (current server, previous client).
  • Test N+1 compatibility (previous server, current client).
  • Document compatibility matrix.

Time-extended testing requirements (Longevity)

For worker modules (WK1 model):

  • Memory stability: verify no growth under sustained load.
  • Connection pool stability: verify no leaks.
  • Counter drift: verify values remain bounded.

Run duration: 1+ hours for nightly, 4+ hours for release validation.

Post-incident testing requirements

For P1/P2 production incidents:

  1. Capture event sequence via replay infrastructure.
  2. Generate test scaffold from replay manifest.
  3. Include incident metadata (ID, root cause, severity).
  4. Tag with [Trait("Category", "PostIncident")].
  5. Test failures block releases.