4.0 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	Normalized Versions Query Guide
This guide complements the Sprint 1–2 normalized versions rollout. It documents recommended indexes and aggregation patterns for querying AffectedPackage.normalizedVersions.
For a field-by-field look at how normalized rules persist in MongoDB (including provenance metadata), see Section 8 of the Concelier SemVer Merge Playbook.
1. Recommended indexes
When concelier.storage.enableSemVerStyle is enabled, advisories expose a flattened
normalizedVersions array at the document root. Create these indexes in mongosh
after the migration completes (adjust collection name if you use a prefix):
db.advisories.createIndex(
  {
    "normalizedVersions.packageId": 1,
    "normalizedVersions.scheme": 1,
    "normalizedVersions.type": 1
  },
  { name: "advisory_normalizedVersions_pkg_scheme_type" }
);
db.advisories.createIndex(
  { "normalizedVersions.value": 1 },
  { name: "advisory_normalizedVersions_value", sparse: true }
);
- The compound index accelerates $matchstages that filter by package identifier and rule style without unwindingaffectedPackages.
- The sparse index keeps storage costs low while supporting pure exact-version lookups (type exact).
The storage bootstrapper creates the same indexes automatically when the feature flag is enabled.
2. Query patterns
2.1 Determine if a specific version is affected
db.advisories.aggregate([
  { $match: { "normalizedVersions.packageId": "pkg:npm/lodash" } },
  { $unwind: "$normalizedVersions" },
  { $match: {
      $or: [
        { "normalizedVersions.type": "exact",
          "normalizedVersions.value": "4.17.21" },
        { "normalizedVersions.type": "range",
          "normalizedVersions.min": { $lte: "4.17.21" },
          "normalizedVersions.max": { $gt: "4.17.21" } },
        { "normalizedVersions.type": "gte",
          "normalizedVersions.min": { $lte: "4.17.21" } },
        { "normalizedVersions.type": "lte",
          "normalizedVersions.max": { $gte: "4.17.21" } }
      ]
  }},
  { $project: { advisoryKey: 1, title: 1, "normalizedVersions.packageId": 1 } }
]);
Use this pipeline during Sprint 2 staging validation runs. Invoke explain("executionStats") to confirm the compound index is selected.
2.2 Locate advisories missing normalized rules
db.advisories.aggregate([
  { $match: { $or: [
      { "normalizedVersions": { $exists: false } },
      { "normalizedVersions": { $size: 0 } }
    ] } },
  { $project: { advisoryKey: 1, affectedPackages: 1 } }
]);
Run this query after backfill jobs to identify gaps that still rely solely on rangeExpression.
2.3 Deduplicate overlapping rules
db.advisories.aggregate([
  { $unwind: "$normalizedVersions" },
  { $group: {
      _id: {
        identifier: "$normalizedVersions.packageId",
        scheme: "$normalizedVersions.scheme",
        type: "$normalizedVersions.type",
        min: "$normalizedVersions.min",
        minInclusive: "$normalizedVersions.minInclusive",
        max: "$normalizedVersions.max",
        maxInclusive: "$normalizedVersions.maxInclusive",
        value: "$normalizedVersions.value"
      },
      advisories: { $addToSet: "$advisoryKey" },
      notes: { $addToSet: "$normalizedVersions.notes" }
  }},
  { $match: { "advisories.1": { $exists: true } } },
  { $sort: { "_id.identifier": 1, "_id.type": 1 } }
]);
Use this to confirm the merge dedupe logic keeps only one normalized rule per unique constraint.
3. Operational checklist
- Create the indexes in staging before toggling dual-write in production.
- Capture explain plans and attach them to the release notes.
- Notify downstream services that consume advisory snapshots about the new normalizedVersionsarray.
- Update export fixtures once dedupe verification passes.
Additional background and mapper examples live in Concelier SemVer Merge Playbook.