Skip to main content
. 2020 Apr 2;106(4):535–548. doi: 10.1016/j.ajhg.2020.03.004

Table 1.

Quarantine and Exclusion Criteria for MVP Samples and Sample Count per Category

Category Number of Samples Percentage of Samples
Starting MVP sample set for analysis 514,383
Intentionally duplicated samples 25,291
Uniquely genotyped individuals 485,856 100.00%
Samples with call rates below 98.5% 15,436 3.18%
Positive-control samples 3,236 0.66%
Samples with sex misclassification 1,450 0.29%
Samples on plates containing 4 or more sex misclassifications 2,619 0.53%
Unintentionally duplicated samples 1,149 0.23%
Samples on plates containing an intentional duplicate with high discordance 9,975 2.05%
Samples with high heterozygosity 248 0.05%
Samples with no or multiple unique participant identifiers 71 0.01%
Intentionally duplicated samples with high discordance 413 0.08%
Samples with 7 or more “relatives” 466 0.09%
Samples excluded from the dataset 28,527 5.87%
Samples quarantined from the dataset 31,836 6.55%
Sample set in current data release 459,777

Percentages are calculated from the total number of uniquely genotyped individuals (485,856). Categories are not mutually exclusive (i.e., a sample can be removed as a result of more than one category and is counted in each applicable category in the table).