Preprocess direct-to-consumer (DTC) genomes for research
Data | Num. | Desc. | Time freeze | Link | Size |
---|---|---|---|---|---|
OpenSNP | 100 | Random 100 individual VCFs for quick download. See an example file here. | 19.02.2020 | 684M | |
OpenSNP | 5081 | Individual VCF, GRCh37. List of genomes processed: here, and log. | 19.02.2020 | 36G | |
PGP | 734 | Individual VCFs, GRCh37. List of genomes processed: here, and log. | 19.02.2020 | 5.6G | |
OpenSNP | 5393 | Combined Plink format (bim, fam, bed) in GRCh37, including those that were originally deposited in GRCh36 format. | 19.02.2020 | 570M |
Quick check variants seen in the OpenSNP genomes: openSNP.bim
Additional data support can be requested by contacting us.
C. Lu, B. Greshake Tzovaras, J. Gough, A survey of direct-to-consumer genotype data,and quality control tool (GenomePrep) for research, Computational and Structural Biotechnology Journal(2021), doi: https://doi.org/10.1016/j.csbj.2021.06.040