NHGRI Dog Genome Project
722 dogs from the Plassais 2019 NIH/NHGRI release — the public-domain breed-reference cohort.
The NHGRI Dog Genome Project at the National Human Genome Research Institute is the multi-decade public canine sequencing effort that established many of the breed reference panels modern canine genomics depends on. The 722-dog subset Sniff ingested comes from the Plassais et al. 2019 Nature Communications release (CC-BY) — a curated cross-breed cohort spanning 322 breeds, village dogs, wolves, coyotes, and other wild canids.
NHGRI 722 contributes the public-domain backbone of breed-reference dogs the atlas needed to validate its projection pipeline (78.4% out-of-sample breed concordance — 196× over chance). Includes 56 wild canids (wolves, coyotes, jackals, fox, dingo, dhole) that render in a visually distinct region of the atlas at PCA norm 70–80 — biologically correct, not a glitch. CC-BY 4.0.
- Yorkshire Terrier 75 dogs
- Village Dog China 42 dogs
- Unknown 30 dogs
- Grey Wolf 27 dogs
- Labrador Retriever 24 dogs
- Golden Retriever 20 dogs
- Bernese Mountain Dog 18 dogs
- Border Collie 15 dogs
- Plassais et al. 2019. Whole genome sequencing of canids reveals genomic regions under selection and variants influencing morphology. Nature Communications 10:1489. doi:10.1038/s41467-019-09373-w