Verify live search suggestions against ingested corpus

This commit is contained in:
master
2026-03-07 18:52:18 +02:00
parent 9d3bed1d0e
commit 820fb4ec25
4 changed files with 324 additions and 16 deletions

@@ -20,7 +20,7 @@
## Delivery Tracker
### QA-ZL-001 - Add live corpus preflight and rebuild checks
Status: TODO
Status: DONE
Dependency: none
Owners: Test Automation
Task description:
@@ -28,12 +28,12 @@ Task description:
- Fail with explicit setup diagnostics when the corpus is empty or stale instead of producing misleading UI failures.
Completion criteria:
- [ ] The live suite checks rebuild/readiness before suggestion assertions.
- [ ] Failure output distinguishes ingestion failure from UI failure.
- [ ] Setup docs reference compiled CLI and HTTP rebuild fallbacks.
- [x] The live suite checks rebuild/readiness before suggestion assertions.
- [x] Failure output distinguishes ingestion failure from UI failure.
- [x] Setup docs reference compiled CLI and HTTP rebuild fallbacks.
### QA-ZL-002 - Prove every surfaced suggestion succeeds
Status: TODO
Status: DONE
Dependency: QA-ZL-001
Owners: Test Automation
Task description:
@@ -41,32 +41,37 @@ Task description:
- Include pages that rely on current-scope weighting and overflow fallback.
Completion criteria:
- [ ] The live suite iterates through each surfaced suggestion on the covered pages.
- [ ] Every rendered suggestion produces a visible non-dead-end state.
- [ ] Previously failing suggestion paths are covered explicitly.
- [x] The live suite iterates through each surfaced suggestion on the covered pages.
- [x] Every rendered suggestion produces a visible non-dead-end state.
- [x] Previously failing suggestion paths are covered explicitly.
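The "visible non-dead-end state" criterion above can be expressed as a small filter over what the suite observed for each surfaced suggestion. The sketch below is illustrative only: the `SuggestionOutcome` shape, the `findDeadEnds` helper, and the rule that a dead end means zero results and no answer panel are assumptions, not the suite's actual implementation.

```typescript
// Hypothetical record of what was observed after activating one suggestion.
// None of these field names come from the real suite.
interface SuggestionOutcome {
  page: string;            // page the suggestion was surfaced on
  suggestion: string;      // suggestion text that was activated
  resultCount: number;     // number of results rendered afterwards
  hasAnswerPanel: boolean; // whether an answer panel became visible
}

// Assumed rule: a suggestion is a dead end when activating it renders
// neither results nor an answer panel.
function findDeadEnds(outcomes: SuggestionOutcome[]): SuggestionOutcome[] {
  return outcomes.filter(
    (outcome) => outcome.resultCount === 0 && !outcome.hasAnswerPanel,
  );
}

// Build one readable failure line per dead end, so a report names the page
// and the suggestion rather than only a count.
function describeDeadEnds(outcomes: SuggestionOutcome[]): string[] {
  return findDeadEnds(outcomes).map(
    (outcome) => `${outcome.page}: "${outcome.suggestion}" led to a dead end`,
  );
}
```

Collecting every outcome first and asserting once at the end would report all dead-end suggestions in a single run instead of stopping at the first failure.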
### QA-ZL-003 - Verify search-to-chat consolidation
Status: TODO
Status: DONE
Dependency: QA-ZL-002
Owners: Test Automation
Task description:
- Verify the compact chat launcher and answer-panel handoff preserve query, page context, and evidence after the search redesign.
Completion criteria:
- [ ] Search is the tested primary entry in all covered flows.
- [ ] AdvisoryAI opens as a secondary deep-dive from search with inherited context.
- [ ] Execution log records the final full-pack commands and outcomes.
- [x] Search is the tested primary entry in all covered flows.
- [x] AdvisoryAI opens as a secondary deep-dive from search with inherited context.
- [x] Execution log records the final full-pack commands and outcomes.
## Execution Log
| Date (UTC) | Update | Owner |
| --- | --- | --- |
| 2026-03-07 | Sprint created for live corpus-backed suggestion reliability and zero-learning search verification. | Project Manager |
| 2026-03-07 | Reproduced the user-facing failure against `http://127.1.0.44`: health was up but `POST /v1/advisory-ai/index/rebuild` returned `documentCount=0`, `chunkCount=0`, and `doctorProjectionCount=0`, so suggestion preflight now treats empty-corpus services as setup failures instead of UI regressions. | Test Automation |
| 2026-03-07 | Prepared sources against the repo-controlled service, rebuilt both indexes, and verified live query `database connectivity` returned `contextAnswer.status=grounded` with knowledge cards and citations. | Test Automation |
| 2026-03-07 | Ran `npx playwright test tests/e2e/unified-search-contextual-suggestions.live.e2e.spec.ts --config playwright.config.ts` against `http://127.0.0.1:10451`; all `5/5` tests passed, covering chip viability, every surfaced suggestion, result-open follow-up chips, and the Ask-AdvisoryAI handoff. | Test Automation |
## Decisions & Risks
- Decision: live reliability gates are required because static mocks cannot prove suggestion viability against real corpora.
- Decision: a healthy service with an empty corpus is an ingestion/setup failure, not a passing baseline; live E2E must fail before UI assertions in that case.
- Risk: local environments may have partially ingested or empty corpora, especially in Doctor/knowledge projections.
- Mitigation: add explicit corpus preflight and rebuild guidance so the suite fails with actionable diagnostics.
- Mitigation: use a repo-controlled local service (`http://127.0.0.1:10451`) with `advisoryai sources prepare`, `POST /v1/advisory-ai/index/rebuild`, and `POST /v1/search/index/rebuild` before running the live suite.
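The empty-corpus decision above can be sketched as a pure classification step that runs on the rebuild response before any UI assertion. Only the three counter names (`documentCount`, `chunkCount`, `doctorProjectionCount`) come from the execution log; the `RebuildSummary` and `PreflightResult` shapes, the `classifyCorpus` helper, and the choice to treat any single zero counter as "not ready" are assumptions made for illustration.

```typescript
// Assumed shape of the counters returned by the index rebuild call.
// Only the three field names appear in the execution log.
interface RebuildSummary {
  documentCount: number;
  chunkCount: number;
  doctorProjectionCount: number;
}

// Hypothetical preflight result; `reason` is empty when the corpus is ready.
interface PreflightResult {
  ready: boolean;
  reason: string;
}

// Assumption: any counter at zero means the corpus is unusable, so the
// failure is reported as setup/ingestion rather than as a UI regression.
function classifyCorpus(summary: RebuildSummary): PreflightResult {
  const counters: (keyof RebuildSummary)[] = [
    "documentCount",
    "chunkCount",
    "doctorProjectionCount",
  ];
  const emptyCounters = counters.filter((name) => summary[name] === 0);

  if (emptyCounters.length === 0) {
    return { ready: true, reason: "" };
  }
  return {
    ready: false,
    reason:
      `Corpus setup failure: ${emptyCounters.join(", ")} reported 0. ` +
      "Prepare sources and rebuild both indexes before running the live suite.",
  };
}
```

Keeping the classification separate from the HTTP call makes it testable without a running service, and the suite can throw `reason` in a setup hook so the failure is reported ahead of any suggestion assertion.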
## Next Checkpoints
- 2026-03-09: Land live corpus preflight before broadening the suggestion matrix.
- 2026-03-10: Run the final live suggestion pack and capture exact outcomes in the execution log.
- 2026-03-09: Broaden live coverage beyond Doctor once findings/policy/VEX ingestion parity is available.
- 2026-03-10: Fold the live reliability lane into the consolidated zero-learning search redesign phases.