1.3 KiB
1.3 KiB
Golden Pairs Corpus
Golden pairs are curated binary pairs (original vs patched) used to validate binary-diff logic. Binaries are stored outside git; this folder tracks metadata, hashes, and reports only.
Layout
datasets/golden-pairs/
index.json
CVE-2022-0847/
metadata.json
original/
vmlinux
vmlinux.sha256
vmlinux.sections.json
patched/
vmlinux
vmlinux.sha256
vmlinux.sections.json
diff-report.json
advisories/
USN-5317-1.txt
File Conventions
metadata.jsonfollowsdocs/schemas/golden-pair-v1.schema.json.index.jsonfollowsdocs/schemas/golden-pairs-index.schema.json.*.sha256contains a single lowercase hex digest, no prefix.*.sections.jsoncontains section hash output from the ELF hash extractor.diff-report.jsonis produced bygolden-pairs diff.
Adding a Pair
- Create a
CVE-YYYY-NNNN/metadata.jsonwith required fields. - Fetch binaries via
golden-pairs mirror CVE-.... - Generate section hashes for each binary.
- Run
golden-pairs diff CVE-...and reviewdiff-report.json. - Update
index.jsonwith status and summary counts.
Offline Notes
- Use cached package mirrors or
file://sources for air-gapped runs. - Keep hashes and timestamps deterministic; always use UTC ISO-8601 timestamps.