Files
git.stella-ops.org/docs/implplan/SPRINT_174_telemetry.md
master e950474a77
Some checks failed
Docs CI / lint-and-preview (push) Has been cancelled
AOC Guard CI / aoc-guard (push) Has been cancelled
AOC Guard CI / aoc-verify (push) Has been cancelled
api-governance / spectral-lint (push) Has been cancelled
oas-ci / oas-validate (push) Has been cancelled
Policy Lint & Smoke / policy-lint (push) Has been cancelled
Policy Simulation / policy-simulate (push) Has been cancelled
SDK Publish & Sign / sdk-publish (push) Has been cancelled
up
2025-11-27 15:16:31 +02:00

5.6 KiB
Raw Blame History

Sprint 174 - Notifications & Telemetry · 170.B) Telemetry

Active items only. Completed/historic work now resides in docs/implplan/archived/tasks.md (updated 2025-11-08).

[Notifications & Telemetry] 170.B) Telemetry Depends on: Sprint 150.A - Orchestrator Summary: Notifications & Telemetry focus on Telemetry).

Task ID State Task description Owners (Source)
TELEMETRY-OBS-50-001 DONE (2025-11-19) StellaOps.Telemetry.Core bootstrap library shipped with structured logging facade, OTEL configuration helpers, deterministic bootstrap (service name/version detection, resource attributes), and sample usage for web/worker hosts. Evidence: docs/observability/telemetry-bootstrap.md. Telemetry Core Guild (src/Telemetry/StellaOps.Telemetry.Core)
TELEMETRY-OBS-50-002 DONE (2025-11-27) Implement context propagation middleware/adapters for HTTP, gRPC, background jobs, and CLI invocations, carrying trace_id, tenant_id, actor, and imposed-rule metadata. Provide test harness covering async resume scenarios. Dependencies: TELEMETRY-OBS-50-001. Telemetry Core Guild (src/Telemetry/StellaOps.Telemetry.Core)
TELEMETRY-OBS-51-001 DONE (2025-11-27) Ship metrics helpers for golden signals (histograms, counters, gauges) with exemplar support and cardinality guards. Provide Roslyn analyzer preventing unsanitised labels. Dependencies: TELEMETRY-OBS-50-002. Evidence: GoldenSignalMetrics.cs + StellaOps.Telemetry.Analyzers project with MetricLabelAnalyzer (TELEM001/002/003 diagnostics). Telemetry Core Guild, Observability Guild (src/Telemetry/StellaOps.Telemetry.Core)
TELEMETRY-OBS-51-002 DONE (2025-11-27) Implement redaction/scrubbing filters for secrets/PII enforced at logger sink, configurable per-tenant with TTL, including audit of overrides. Add determinism tests verifying stable field order and timestamp normalization. Dependencies: TELEMETRY-OBS-51-001. Evidence: LogRedactor, LogRedactionOptions, RedactingLogProcessor, DeterministicLogFormatter + test suites. Telemetry Core Guild, Security Guild (src/Telemetry/StellaOps.Telemetry.Core)
TELEMETRY-OBS-55-001 TODO Provide incident mode toggle API that adjusts sampling, enables extended retention tags, and records activation trail for services. Ensure toggle honored by all hosting templates and integrates with Config/FeatureFlag providers. Dependencies: TELEMETRY-OBS-51-002. Telemetry Core Guild (src/Telemetry/StellaOps.Telemetry.Core)
TELEMETRY-OBS-56-001 TODO Add sealed-mode telemetry helpers (drift metrics, seal/unseal spans, offline exporters) and ensure hosts can disable external exporters when sealed. Dependencies: TELEMETRY-OBS-55-001. Telemetry Core Guild (src/Telemetry/StellaOps.Telemetry.Core)

Status notes (2025-11-19 UTC)

  • TELEMETRY-OBS-50-001 DONE. Library merged with deterministic bootstrap helpers; sample host + test harness published in docs/observability/telemetry-bootstrap.md.
  • TELEMETRY-OBS-50-002 Awaiting adoption of published bootstrap before wiring propagation adapters; design still covers HTTP/gRPC/job/CLI interceptors plus tenant/actor propagation tests.
  • TELEMETRY-OBS-51-001 DONE. Golden signal metrics (GoldenSignalMetrics.cs) with exemplar support and cardinality guards already existed. Added Roslyn analyzer project (StellaOps.Telemetry.Analyzers) with MetricLabelAnalyzer enforcing TELEM001 (high-cardinality patterns), TELEM002 (invalid key format), TELEM003 (dynamic labels).
  • TELEMETRY-OBS-51-002 DONE. Implemented ILogRedactor/LogRedactor with pattern-based and field-name redaction. Per-tenant overrides with TTL and audit logging. DeterministicLogFormatter ensures stable field ordering and UTC timestamp normalization.
  • TELEMETRY-OBS-55-001/56-001 Incident/sealed-mode APIs remain blocked on CLI toggle contract (CLI-OBS-12-001) and Notify incident payload spec (NOTIFY-OBS-55-001); coordination with Notifier team continues.

Milestones & dependencies

Target date Milestone Owner(s) Notes / dependencies
2025-11-18 Land Telemetry.Core bootstrap sample in Orchestrator Telemetry Core Guild · Orchestrator Guild Demonstrates TELEMETRY-OBS-50-001 deliverable; prerequisite for propagation middleware adoption.
2025-11-19 Publish propagation adapter API draft Telemetry Core Guild Needed for TELEMETRY-OBS-50-002 and downstream service adoption.
2025-11-21 Security sign-off on scrub policy (POLICY-SEC-42-003) Telemetry Core Guild · Security Guild Unlocks TELEMETRY-OBS-51-001/51-002 implementation.
2025-11-22 Incident/CLI toggle contract agreed (CLI-OBS-12-001 + NOTIFY-OBS-55-001) Telemetry Core Guild · Notifications Service Guild · CLI Guild Required before TELEMETRY-OBS-55-001/56-001 can advance.

Coordination log

Date (UTC) Update Owner
2025-11-12 18:05 Marked TELEMETRY-OBS-50-001 as DOING and captured branch/progress details in status notes. Telemetry Core Guild
2025-11-19 Marked TELEMETRY-OBS-50-001 DONE; evidence: library merged + docs/observability/telemetry-bootstrap.md with sample host integration. Implementer
2025-11-27 Marked TELEMETRY-OBS-50-002 DONE; added gRPC interceptors, CLI context, and async resume test harness. Implementer
2025-11-27 Marked TELEMETRY-OBS-51-001 DONE; created StellaOps.Telemetry.Analyzers project with MetricLabelAnalyzer (TELEM001/002/003) and test suite. Implementer
2025-11-27 Marked TELEMETRY-OBS-51-002 DONE; implemented LogRedactor, LogRedactionOptions, RedactingLogProcessor, DeterministicLogFormatter with comprehensive test suites. Implementer