Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Feb 21;15(2):e0229352. doi: 10.1371/journal.pone.0229352

Utilization of archived neonatal dried blood spots for genome-wide genotyping

Pagna Sok 1, Philip J Lupo 1, Melissa A Richard 1, Karen R Rabin 1, Erik A Ehli 2, Noah A Kallsen 2, Gareth E Davies 2, Michael E Scheurer 1, Austin L Brown 1,*
Editor: Isabelle Chemin3
PMCID: PMC7034898  PMID: 32084225

Abstract

Introduction

Heel pricks are performed on newborns for diagnostic screenings of various pre-symptomatic metabolic and genetic diseases. Excess blood is spotted on Guthrie cards and archived by many states in biobanks for follow-up diagnoses and public health research. However, storage environment may vary across biobanks and across time within biobanks. With increased applications of DNA extracted from spots for genetic studies, identifying factors associated with genotyping success is critical to maximize DNA quality for future studies.

Method

We evaluated 399 blood spots, which were part of a genome-wide association study of childhood leukemia risk in children with Down syndrome, archived at the Michigan Neonatal Biobank between 1992 and 2008. High quality DNA was defined as having post-quality control call rate ≥ 99.0% based on the Illumina GenomeStudio 2.0 GenCall algorithm after processing the samples on the Illumina Infinium Global Screening Array. Bivariate analyses and multivariable logistic regression models were applied to evaluate effects of storage environment and storage duration on DNA genotyping quality.

Results

Both storage environment and duration were associated with sample genotyping call rates (p-values < 0.001). Sample call rates were associated with storage duration independent of storage environment (p-trend = 0.006 for DBS archived in an uncontrolled environment and p-trend = 0.002 in a controlled environment). However, 95% of the total sample had high genotyping quality with a call rate ≥ 95.0%, a standard threshold for acceptable sample quality in many genetic studies.

Conclusion

Blood spot DNA quality was lower in samples archived in uncontrolled storage environments and for samples archived for longer durations. Still, regardless of storage environment or duration, neonatal biobanks including the Michigan Neonatal Biobanks can provide access to large collections of spots with DNA quality acceptable for most genotyping studies.

Introduction

Blood obtained from heel pricks of newborn infants are spotted on Guthrie cards and collected as part of newborn screening (NBS) programs to identify potentially life-threatening metabolic and genetic disorders including sickle cell disease, phenylketonuria, and numerous other conditions associated with long-term morbidity [17]. Many states store excess or residual dried blood spots (DBS) that are not needed for the NBS in long-term repositories. In states that permit the use of these DBS for biomedical research, DBS can then be linked to disease registries, providing a rich resource for population-based epidemiologic studies for conditions not typically evaluated as part of NBS programs [8, 9]. Thus far, conditions studied have included type 1 diabetes, multiple sclerosis, and childhood cancers among many others [1012]. While DBS are increasingly being utilized to conduct genome-wide association studies for a range of phenotypes [13, 14], DBS may be in archive for decades under various conditions, which may affect the quality of DNA quality retrieved for research purposes [1517].

The Michigan Neonatal Biobank (MNB) is a storage and management center for archiving DBS collected as part of the Michigan NBS program. The MNB has archived over five million DBS to date, which represents almost every live birth in the state since 1987. We examined the effects of multiple factors, including storage environment and storage duration, on the quality of DNA extracted from DBS retrieved from the MNB for genome-wide genotyping. Our overall goal was to understand the potential compromising factors to be considered when identifying DBS while still maximizing samples to utilize for genetic epidemiologic studies. Particularly, for genomic analyses of rare childhood conditions, utilizing archived DBS that are several decades old may be necessary to amass an adequate sample size to achieve statistical power for rare variants evaluation.

Methods

Neonatal DBS were obtained from MNB as part of a genome-wide association study of childhood leukemia risk in children with Down syndrome [18], which permits all original 425 genotyped samples from children born between 1992 and 2008 to be evaluated. Birth records at the Michigan Department of Health and Human Services were linked to the records in the Michigan Cancer Surveillance Program to identify individuals with Down syndrome with and without acute lymphoblastic leukemia in the Michigan Cancer Surveillance program. This study was approved by the Committee for the Protection of Human Subjects of the Health and Human Services Agency of the State of Michigan and the Institutional Review Board of Baylor College of Medicine and did not involve the use of any personal identifiers.

DNA extraction

DNA extraction and genotyping for all DBS samples were conducted in 2017. DNA extraction was performed using the GenTegra (Pleasanton, CA) GenSolve Whole Blood DNA recovery kit according to the manufacturer protocol, which allows for DNA recovery from Whatman FTA (Flinders Technology Associates) cards, regardless of storage time. For each unique sample, six 3-mm punches were combined into a single 1.5 mL microtube containing the specified amount of GenTegra Recovery Solution A/Proteinase K solution. Following incubation in a heated shaker for a minimum of 1 hour, the tubes were centrifuged to collect any condensation. Remnant punches were removed before the solution was mixed with GenTegra Recovery Solution B. The entire volume was then transferred to either the NucleoSpin XS (Macherey-Nagel Inc., Bethlehem, PA) or Qiagen Micro (Hilden, Germany) spin column for DNA purification according to manufacturer recommendations. Total DNA was then eluted from each respective spin column by addition of approximately 35 μL of AE buffer. Quantification of the purity of the extracted DNA was performed using the Thermo Fisher Scientific NanoDrop 2000 Spectrophotometer (Wilmington, DE).

DNA genotyping and quality control

Extracted DNA was genotyped at the Avera Institute for Human Genetics (Sioux Falls, SD) using the Illumina (San Diego, CA) Infinium Global Screening Array BeadChip, which captures approximately 700,000 single nucleotide polymorphisms (SNPs). These genotyped SNPs or variants were called with GenomeStudio 2.0 software (San Diego, CA). The analysis was restricted to variants on chromosomes 1 to 20 and 22. Variants on chromosome 21 (n = 9,443) were excluded because calling for trisomic variants is not currently supported in GenomeStudio 2.0. We implemented conservative quality control criteria based on the GenomeStudio clustering algorithm metrics [19] rather than criteria often used in genome-wide association studies [20, 21], to filter poor-performing probes and specifically evaluate sample call rate from DBS. In particular, variants with a call rate < 90% (n = 17,804), minor allele frequency < 1% (n = 150,292), and cluster separation < 0.3 (n = 20,468) were removed. Final call rates were calculated among the 472,019 variants that passed quality control.

DBS call rate as primary outcome

GenomeStudio assigned variants genotyping based on a GenCall score generated through a calling algorithm. GenCall score is a quality metric between 0 and 1 that describes sample clustering, where a low score is assigned to samples that locate away from a cluster center. A low GenCall score is an indication of unreliable genotype calls, and typically, variants with a score < 0.15 are not assigned a genotype. Hence, sample call rate is the proportion of genotyped or called variants for an individual sample that were successfully determined by the GenomeStudio calling algorithm after initial removal of variants that did not pass quality control. Sample call rate was used as the primary outcome to evaluate DBS quality. Call rates were dichotomized on a stringent threshold such that a sample call rate ≥ 99.0% was classified as evidence of higher DNA quality while < 99.0% sample call rate indicated poorer DNA quality.

DBS storage characteristics as primary exposure

At MNB, DBS collected between 1996 and 2008 are kept in a controlled environment at 70 degrees Fahrenheit and 35% humidity while older DBS collected prior to 1996 are stored in an uncontrolled temperature and controlled (35%) humidity environment. Thus, the primary independent variables evaluated as potential factors differentiating DNA quality were storage environment and storage duration.

Statistical methods

Descriptive statistics, including counts and percentages for categorical variables and median and ranges for continuous variables, were calculated for the overall sample and compared across samples with call rate ≥ 99.0% or < 99.0%. Differences between sample quality and independent variables, including storage environment, storage duration, leukemia status, infant sex, maternal race/ethnicity, total DNA yield, and 260/280 measurement (indicative of DNA quality) were evaluated using Pearson chi-square, Fisher’s exact, or Wilcoxon rank-sum test. Since storage environment was correlated with year of sample collection, we fitted multivariable logistic regression models to evaluate the association between period of sample collection and sample quality, stratified by storage environment: 1) collected prior to 1996 (n = 101) and 2) collected on or after 1996 (n = 298). Logistic regression models were adjusted for total DNA yield and maternal race/ethnicity. All statistical analyses were conducted in R 3.5.2 and a significance threshold p-value < 0.05 was applied. Finally, to identify if any chromosomal regions were more susceptible to DNA degradation due to differences in storage conditions, we applied Fisher’s exact test to evaluate missing call counts across post-quality control genotyped SNPs in PLINK 1.9 using storage environments as the dependent variable [22, 23]. Variants with 0% or 100% missing call frequency were not evaluated.

Results

DBS call rate quality

To compare sample quality of the 425 DBS collected from newborns born between 1992 and 2008, 26 samples were excluded due to differences in DNA extraction methods. Among the 399 remaining samples, 75% were collected between 1996 and 2008 and stored in a controlled environment at 70 degrees Fahrenheit and 35% humidity (Table 1). The proportion of DBS collected and archived each year constituted between 4–8% of the total for the 16-year time period, with the exception of 2% in the year 2004. The majority of the samples were collected from individuals without a known diagnosis of childhood leukemia (94%), male sex (52%), and non-Hispanic white maternal race/ethnicity (77%).

Table 1. Associations between residual DBS characteristics and call rate among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

Residual DBS characteristics Overall (n = 399) Sample Call Rate p-val1
≥ 99.0% (n = 339) < 99.0% (n = 60)
Storage environments, n (%) <0.001
Uncontrolled environment2 101 (25%) 54 (16%) 47 (78%)
Controlled environment3 298 (75%) 285 (84%) 13 (22%)
Year DBS collected, n (%) <0.001
1992 26 (7%) 11 (3%) 15 (25%)
1993 24 (6%) 8 (2%) 16 (27%)
1994 29 (7%) 17 (5%) 12 (20%)
1995 22 (6%) 18 (5%) 4 (7%)
1996 26 (7%) 26 (8%) 0 (0%)
1997 24 (6%) 19 (6%) 5 (8%)
1998 24 (6%) 19 (6%) 5(8%)
1999 32 (8%) 31 (9%) 1 (2%)
2000 16 (4%) 16 (5%) 0 (0%)
2001 25 (6%) 25 (7%) 0 (0%)
2002 28 (7%) 28 (8%) 0 (0%)
2003 17 (4%) 15 (4%) 2 (3%)
2004 8 (2%) 8 (2%) 0 (0%)
2005 24 (6%) 24 (7%) 0 (0%)
2006 33 (8%) 33 (10%) 0 (0%)
2007 14 (4%) 14 (4%) 0 (0%)
2008 27 (7%) 27 (8%) 0 (0%)
Leukemia status, n (%) 0.6
Leukemia 23 (6%) 21 (6%) 2 (3%)
No leukemia 376 (94%) 318 (94%) 58 (97%)
Infant sex, n (%) 0.6
Male 206 (52%) 173 (51%) 33 (55%)
Female 193 (48%) 166 (49%) 27 (45%)
Maternal race/ethnicity, n (%) 0.04
Non-Hispanic White 308 (77%) 264 (78%) 44 (73%)
Non-Hispanic Black 57 (14%) 44 (13%) 13 (22%)
Non-Hispanic Asian 13 (3%) 10 (3%) 3 (5%)
Hispanic 21 (5%) 21 (6%) 0 (0%)
Total DNA yield in μg, median (range) 5.3 (2.3–14.4) 5.3 (2.4–14.4) 4.8 (2.3–13.3) 0.3

1Pearson chi-square, Fisher’s exact, or Wilcoxon rank-sum p-value comparing residual DBS characteristics by sample call rate.

2Uncontrolled temperature, 35% humidity.

370 degrees Fahrenheit, 35% humidity.

The median sample call rate was 98.8% (range = 79.9–99.0%) prior to variant filtering. Variant filtering statistically improved call rate (median = 99.9%, range = 80.4–99.9%, p-value < 0.001) where 339 (85%) DBS had a call rate ≥ 99.0% after filtering (Table 1). The number of samples with call rate < 99.0% occurred more frequently in those collected prior to 1996 that were stored in a laboratory retention center without a controlled environment (p-value < 0.001). In contrast, the proportion of samples with call rate ≥ 99.0% archived on or after 1996 exceeded 85% every year, with the exception of 1997 (79%) and 1998 (79%). Maternal race/ethnicity was significantly associated with call rate (p-value = 0.04); however, this association was no longer statistically significant in models accounting for year of birth, suggesting that differences in call rates between racial/ethnic groups was confounded by changes in demographic dynamics of the population during the DBS collection period.

DBS call rate quality by storage environment and duration

We observed improvements in median sample call rate by year of DBS collection and archive environment (Fig 1). This trend was apparent both prior to and following variant filtering. To evaluate if the association between sample call rates and date of sample collection remained after accounting for differences in storage conditions, we conducted multivariable logistic regression stratifying on storage environment (Table 2). Independent of storage environment, there was also a positive correlation between the proportion of DBS with call rate ≥ 99.0% and more recent collection years. This was true for both DBS collected prior to 1996 (p-value for trend = 0.006) and samples collected on or after 1996 (p-value for trend = 0.002). We did not identify statistically significant associations between storage environments (S1 Table) and leukemia status (p-value = 0.9), infant sex (p-value = 0.9), maternal race/ethnicity (p-value = 0.9), or total DNA extracted (p-value = 0.05). Although DBS placement along the genotyping array was randomized, we further evaluated DBS placement as a potentially factor affecting call rate and did not observed any significant associations with call rate, storage environment, or DBS collection year (S2 Table). Moreover, DBS placement was not statistically associated with call rate (p-value = 0.5, data not shown) in the multivariable models accounting for total DNA yield, DBS collection year, maternal race/ethnicity, and storage environment. Therefore, DBS placement was not included in the final models presented.

Fig 1. Before and after variant filtering call rate of DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

Fig 1

Table 2. Trend in proportions of call rate by collection years among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

Storage environments Year DBS collected Total DBS collected DBS proportion with call rate ≥ 99.0% p-trend1
Uncontrolled environment2 0.006
1992 26 0.42
1993 24 0.33
1994 29 0.59
1995 22 0.82
Controlled environment3 1996 26 1.00 0.002
1997 24 0.79
1998 24 0.79
1999 32 0.97
2000 16 1.00
2001 25 1.00
2002 28 1.00
2003 17 0.88
2004 8 1.00
2005 24 1.00
2006 33 1.00
2007 14 1.00
2008 27 1.00

1Multivariable logistic regression model adjusted for DNA yield and maternal race/ethnicity.

2Uncontrolled temperature, 35% humidity.

370 degrees Fahrenheit, 35% humidity.

Finally, we evaluated SNP missing frequency by storage environment and found random missing frequency across all chromosomes (Table 3, S1 Fig). Genome-wide statistically significant (p-value < 5×10−8) differential missingness was observed between storage environments for 2.17% of all variants evaluated. The frequency of differential missingness was similar across each chromosome (range: 1.33–2.77%), suggesting storage condition adversely impaired DNA quality for genotyping indiscriminately.

Table 3. Proportions of differential missing SNPs by storage environment among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

Chromosome Number of SNPs evaluated Number of SNPs with p-value ≤ 5×10−8 Proportion of SNPs with p-value ≤ 5×10−8
1 26966 521 0.019
2 28580 630 0.022
3 24467 552 0.023
4 23160 641 0.028
5 21205 512 0.024
6 26951 628 0.023
7 20210 495 0.024
8 18607 461 0.025
9 15502 328 0.021
10 17538 355 0.020
11 17105 331 0.019
12 16769 352 0.021
13 12769 353 0.028
14 11316 252 0.022
15 10529 188 0.018
16 11242 164 0.015
17 10895 187 0.017
18 10504 247 0.024
19 7836 121 0.015
20 8521 120 0.014
22 5121 68 0.013
All chromosomes 345793 7506 0.022

Discussion

Utilizing archived residual DBS for genome-wide association research has increased in recent years [24, 25]. DBS have become a rich resource for population-based epidemiologic research on genetic factors of numerous diseases because of the accumulation of stored DBS along with the possibility to link samples to demographic and clinical data. However, due to the variability in the condition and environment that DBS are collected and archived within and between biobanks, it is good practice to consider the storage conditions and duration to evaluate the quality of extracted DNA across samples being used for any genomic studies. We evaluated sample call rate, a measurement of genotyping quality, of DBS archived at the MNB under varying storage conditions since their collection at births between 1992 and 2008. While controlled storage environment and newer blood spots yielded higher call rates, overall, our findings suggest DBS from MNB are viable and a potentially valuable resource for genome-wide association studies of common genetic variants.

In our analysis, storage environment and duration were the only two factors associated with differences in call rate. Specifically, DBS archived in a temperature- and humidity-controlled environment yielded higher DNA call rates (median call rate = 99.9%) when compared to those stored in an uncontrolled environment (median call rate = 99.3%). This is consistent with previous reports suggesting DBS storage environment influences biomarker stability [26, 27]. However, it should be noted that the average call rate for samples stored in an uncontrolled environment was acceptable for most genotyping applications. In this study, we used a call rate threshold of ≥ 99.0% to identify “high-quality” genotypes. Based on this, 85% of the selected samples were considered high quality (96% for those archived in a controlled environment). Moreover, many genome-wide association studies apply a less stringent call rate threshold of ≥ 95.0% to identify samples of acceptable quality [20, 21]. Using that threshold, 95.5% of the total DBS had sample call rates exceeding 95.0%, including 83% of the samples stored in an uncontrolled environment. Therefore, the majority of DBS regardless of their archival condition and duration should pass conventional quality control filtering applied in most contemporary genetic association studies. While proper archival of residual DBS in controlled environments will likely lessen DNA degradation over time, for the purposes of genotyping on array-based technology, DBS stored in an uncontrolled environment are still useable for most genetic studies. Additionally, our results suggest that DNA quality was consistent across the genome regardless of storage environments, indicating that little if any bias exists in evaluating certain genomic regions.

As noted, storage duration affected call rates regardless of DBS storage environment, where the most recently archived DBS yielded higher DNA call rates. This association was similarly described previously [28, 29], with the effect of storage duration on DNA quality reduced among DBS archived in a controlled environment. However, despite the significant association between storage duration and call rate, the vast majority of DBS archived in a controlled environment had a call rate ≥ 99.0% (the exceptions are collection year 1997, 1998, and 2003 where there were DBS with call rate < 99.0%). Conversely, among DBS archived in an uncontrolled environment, the proportion of DBS with call rate ≥ 99.0% seems to increase linearly by storage year (Table 2), suggesting an improved call rate in more recently collected DBS.

This study evaluated the impact of environmental condition, storage duration, and demographic factors on genotyping quality obtained from nearly 400 residual DBS archived over two decades. While this study provides important information on correlates of DBS sample quality for array-based genotyping, the study findings should be considered in light of some limitations. First, our analyses are limited to assessment of DNA used for genotyping studies. More specifically, it is not clear if the quality of DNA extracted from DBS is sufficient for the detection of variants using whole-exome or whole-genome sequencing technologies. Notably, we obtained relatively high DNA yields from DBS [30, 31]; however, total yield may not accurately reflect DNA quality and DNA degradation remains a concern. For more accurate estimations of quality DNA yield, future studies using DBS may want to consider specifically measuring dsDNA, which was not done in the current study. Additionally, because of our limited sample size, we were not able to fully evaluate the quality of low frequency variants (i.e., <1%) that are sometimes included in genome-wide association studies. Finally, because the samples included in this study were collected from a single bloodspot biobank, our findings may not be generalizable to DBS obtained from other biobanks with incomparable DBS archival environments.

In summary, residual DBS are a unique resource that facilitate population-based epidemiologic investigations of inherited genetic variation, epigenetic profiles, metabolomics and other potential biomarkers of complex diseases [27, 3234], especially for rare pediatric outcomes. The results of this study illustrate that residual DBS archived at MNB provide high quality DNA for genetic association studies evaluating common variation using standard array-based genotyping. While we identified differences in average sample quality across storage environment and storage duration, these factors did not substantially limit the number of samples that would pass conventional quality control measures for most genome-wide association studies. For investigations of common phenotypes or limited sample sizes, priority may be given to more recently collected DBS and those stored in controlled environments. However, our findings generally support the use of DBS stored for extended periods in uncontrolled environments for investigations of rare phenotypes, which may require accruing cases occurring over a period of several decades in order to achieve acceptable sample sizes.

Supporting information

S1 Table. Associations between residual DBS characteristics and storage environment among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

(DOCX)

S2 Table. Counts of residual DBS call rate, storage environment, and collection period by genotyping array position among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

(DOCX)

S1 Fig. Manhattan plot showing the association of SNPs with storage environment among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

Black and red horizontal lines signify p-value ≤ 1×10−5 and ≤ 5×10−8, respectively.

(TIFF)

Acknowledgments

We want to thank the Michigan Department of Health and Human Services and the Michigan Neonatal Biobank for their input and feedback on the acquisition and use of residual newborn screening dried blood spot specimens.

Data Availability

Data cannot be shared publicly because participants did not provide informed consent for the deposition of genetic data. Data are available from the EpiCenter at Baylor College of Medicine (contact via epicenter@bcm.edu) for researchers who meet the criteria for access to confidential data.

Funding Statement

This work was supported in part by funding from the Cancer Prevention and Research Institute of Texas (RP170074, MPI: Rabin/Lupo) and the National Cancer Institute (K07 CA218362, PI: Brown). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.GUTHRIE R, SUSI A. A SIMPLE PHENYLALANINE METHOD FOR DETECTING PHENYLKETONURIA IN LARGE POPULATIONS OF NEWBORN INFANTS. Pediatrics. 1963. September;32:338–43. [PubMed] [Google Scholar]
  • 2.El-Hattab AW, Almannai M, Sutton VR. Newborn Screening: History, Current Status, and Future Directions. Pediatr Clin North Am. 2018. 04;65(2):389–405. 10.1016/j.pcl.2017.11.013 [DOI] [PubMed] [Google Scholar]
  • 3.Almannai M, Marom R, Sutton VR. Newborn screening: a review of history, recent advancements, and future perspectives in the era of next generation sequencing. Curr Opin Pediatr. 2016. 12;28(6):694–9. 10.1097/MOP.0000000000000414 [DOI] [PubMed] [Google Scholar]
  • 4.Pena LD, van Calcar SC, Hansen J, Edick MJ, Walsh Vockley C, Leslie N, et al. Outcomes and genotype-phenotype correlations in 52 individuals with VLCAD deficiency diagnosed by NBS and enrolled in the IBEM-IS database. Mol Genet Metab. 2016. 08;118(4):272–81. 10.1016/j.ymgme.2016.05.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Castellani C, Massie J, Sontag M, Southern KW. Newborn screening for cystic fibrosis. Lancet Respir Med. 2016. 08;4(8):653–61. 10.1016/S2213-2600(16)00053-9 [DOI] [PubMed] [Google Scholar]
  • 6.Chaturvedi S, DeBaun MR. Evolution of sickle cell disease from a life-threatening disease of children to a chronic disease of adults: The last 40 years. Am J Hematol. 2016. January;91(1):5–14. 10.1002/ajh.24235 [DOI] [PubMed] [Google Scholar]
  • 7.Millington DS, Kodo N, Norwood DL, Roe CR. Tandem mass spectrometry: a new method for acylcarnitine profiling with potential for neonatal screening for inborn errors of metabolism. J Inherit Metab Dis. 1990;13(3):321–4. 10.1007/bf01799385 [DOI] [PubMed] [Google Scholar]
  • 8.Choi EH, Lee SK, Ihm C, Sohn YH. Rapid DNA extraction from dried blood spots on filter paper: potential applications in biobanking. Osong Public Health Res Perspect. 2014. December;5(6):351–7. 10.1016/j.phrp.2014.09.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lewis MH, Scheurer ME, Green RC, McGuire AL. Research results: preserving newborn blood samples. Sci Transl Med. 2012. November 7;4(159):159cm12 10.1126/scitranslmed.3004474 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kyvsgaard JN, Overgaard AJ, Jacobsen LD, Thorsen SU, Pipper CB, Hansen TH, et al. Low perinatal zinc status is not associated with the risk of type 1 diabetes in children. Pediatr Diabetes. 2017. November;18(7):637–42. 10.1111/pedi.12476 [DOI] [PubMed] [Google Scholar]
  • 11.Nielsen NM, Munger KL, Koch-Henriksen N, Hougaard DM, Magyari M, Jørgensen KT, et al. Neonatal vitamin D status and risk of multiple sclerosis: A population-based case-control study. Neurology. 2017. January 3;88(1):44–51. 10.1212/WNL.0000000000003454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bhatti P, Doody DR, Mckean-Cowdin R, Mueller BA. Neonatal vitamin D and childhood brain tumor risk. Int J Cancer. 2015. May 15;136(10):2481–5. 10.1002/ijc.29291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Pedersen CB, Bybjerg-Grauholm J, Pedersen MG, Grove J, Agerbo E, Bækvad-Hansen M, et al. The iPSYCH2012 case-cohort sample: new directions for unravelling genetic and environmental architectures of severe mental disorders. Mol Psychiatry. 2018. 01;23(1):6–14. 10.1038/mp.2017.196 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Horsdal HT, Agerbo E, McGrath JJ, Vilhjálmsson BJ, Antonsen S, Closter AM, et al. Association of Childhood Exposure to Nitrogen Dioxide and Polygenic Risk Score for Schizophrenia With the Risk of Developing Schizophrenia. JAMA Netw Open. 2019. November 1;2(11):e1914401 10.1001/jamanetworkopen.2019.14401 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mitchell C, Jennings C, Brambilla D, Aldrovandi G, Amedee AM, Beck I, et al. Diminished human immunodeficiency virus type 1 DNA yield from dried blood spots after storage in a humid incubator at 37 degrees C compared to -20 degrees C. J Clin Microbiol. 2008. September;46(9):2945–9. 10.1128/JCM.00359-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hollegaard MV, Thorsen P, Norgaard-Pedersen B, Hougaard DM. Genotyping whole-genome-amplified DNA from 3- to 25-year-old neonatal dried blood spot samples with reference to fresh genomic DNA. Electrophoresis. 2009. July;30(14):2532–5. 10.1002/elps.200800655 [DOI] [PubMed] [Google Scholar]
  • 17.Olney RS, Moore CA, Ojodu JA, Lindegren ML, Hannon WH. Storage and use of residual dried blood spots from state newborn screening programs. J Pediatr. 2006. May;148(5):618–22. 10.1016/j.jpeds.2005.12.053 [DOI] [PubMed] [Google Scholar]
  • 18.Brown AL, de Smith AJ, Gant VU, Yang W, Scheurer ME, Walsh KM, et al. Inherited genetic susceptibility to acute lymphoblastic leukemia in Down syndrome. Blood. 2019. October 10;134(15):1227–37. 10.1182/blood.2018890764 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Infinium Genotyping Data Analysis: A guide for analyzing Infinium genotyping data using the GenomeStudio Genotyping Module [Internet]. illumina.com. 2014 [cited 20 August 2019]. Available from: https://www.illumina.com/Documents/products/technotes/technote_infinium_genotyping_data_analysis.pdf
  • 20.Turner S, Armstrong LL, Bradford Y, Carlson CS, Crawford DC, Crenshaw AT, et al. Quality control procedures for genome-wide association studies. Curr Protoc Hum Genet. 2011. January;Chapter 1:Unit1.19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Laurie CC, Doheny KF, Mirel DB, Pugh EW, Bierut LJ, Bhangale T, et al. Quality control and quality assurance in genotypic data for genome-wide association studies. Genet Epidemiol. 2010. September;34(6):591–602. 10.1002/gepi.20516 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7 10.1186/s13742-015-0047-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007. September;81(3):559–75. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wiemels JL, Walsh KM, de Smith AJ, Metayer C, Gonseth S, Hansen HM, et al. GWAS in childhood acute lymphoblastic leukemia reveals novel genetic associations at chromosomes 17q12 and 8q24.21. Nat Commun. 2018. January 18;9(1):286 10.1038/s41467-017-02596-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Hollegaard MV, Grove J, Thorsen P, Nørgaard-Pedersen B, Hougaard DM. High-throughput genotyping on archived dried blood spot samples. Genet Test Mol Biomarkers. 2009. April;13(2):173–9. 10.1089/gtmb.2008.0073 [DOI] [PubMed] [Google Scholar]
  • 26.Mei JV, Li L, Rasmussen SA, Collier S, Frias JL, Honein MA, et al. Effect of specimen storage conditions on newborn dried blood spots used to assess Toxoplasma gondii immunoglobulin M (IgM). Clin Chim Acta. 2011. February 20;412(5–6):455–9. 10.1016/j.cca.2010.11.028 [DOI] [PubMed] [Google Scholar]
  • 27.Koulman A, Prentice P, Wong MCY, Matthews L, Bond NJ, Eiden M, et al. The development and validation of a fast and robust dried blood spot based lipid profiling method to study infant metabolism. Metabolomics. 2014;10(5):1018–25. 10.1007/s11306-014-0628-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hollegaard MV, Grove J, Grauholm J, Kreiner-Møller E, Bønnelykke K, Nørgaard M, et al. Robustness of genome-wide scanning using archived dried blood spot samples as a DNA source. BMC Genet. 2011. July 4;12:58 10.1186/1471-2156-12-58 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Schwartz A, Baidjoe A, Rosenthal PJ, Dorsey G, Bousema T, Greenhouse B. The Effect of Storage and Extraction Methods on Amplification of Plasmodium falciparum DNA from Dried Blood Spots. Am J Trop Med Hyg. 2015. May;92(5):922–5. 10.4269/ajtmh.14-0602 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hannelius U, Lindgren CM, Melén E, Malmberg A, von Dobeln U, Kere J. Phenylketonuria screening registry as a resource for population genetic studies. J Med Genet. 2005. October;42(10):e60 10.1136/jmg.2005.032987 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Hollegaard MV, Grauholm J, Børglum A, Nyegaard M, Nørgaard-Pedersen B, Ørntoft T, et al. Genome-wide scans using archived neonatal dried blood spot samples. BMC Genomics. 2009. July 4;10:297 10.1186/1471-2164-10-297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wong N, Morley R, Saffery R, Craig J. Archived Guthrie blood spots as a novel source for quantitative DNA methylation analysis. BioTechniques. 2008. October;45(4):423–4, 426, 428 passim. 10.2144/000112945 [DOI] [PubMed] [Google Scholar]
  • 33.Staunstrup NH, Starnawska A, Nyegaard M, Christiansen L, Nielsen AL, Børglum A, et al. Genome-wide DNA methylation profiling with MeDIP-seq using archived dried blood spots. Clin Epigenetics. 2016;8:81 10.1186/s13148-016-0242-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Beger RD, Dunn W, Schmidt MA, Gross SS, Kirwan JA, Cascante M, et al. Metabolomics enables precision medicine: "A White Paper, Community Perspective". Metabolomics. 2016;12(10):149 10.1007/s11306-016-1094-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Isabelle Chemin

12 Dec 2019

PONE-D-19-32165

Utilization of archived neonatal dried blood spots for genome-wide genotyping

PLOS ONE

Dear Dr Sok,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the minor points raised during the review process.

We would appreciate receiving your revised manuscript by 6 weeks. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Isabelle Chemin, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The paper by Sok and colleagues explores how storage of dried blot spots cards may affect the final data and which factors that could compromise results. The manuscript is well written and easy to read.

The Introduction is overall a good review of the literature.

• I disagree that 13-15 are good references for GWAS, I would characterize these as methodological papers not unlike the one in review here. None-the-less the point is valid, there are numerous examples of canonical GWAS’s done on DBS, for example out of the iPSYCH/PGC.

The methodology is appropriate there are many alternative methods and here they have chosen one specific for the filter paper they use. Material on a DBS card is scarce which can be adjusted for by using more spots, here six are used. The statistics are appropriate and the authors are wise to exclude chromosome 21 since, as stated, the methodology does poorly when deviating from chr_n=2. The results are appropriately presented.

• You found no significance between in concentrations between +/- controlled, what about between concentration and call rate? Could the difference in CR>99% be confounded by low concentration?

• Also the yields are high i.e. µg. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1735924/ estimated theoretical yields, recalculating this for a 3mm spot you get 60ng as done in https://www.ncbi.nlm.nih.gov/pubmed/19575812. here your median estimate is almost a order of magnitude higher. Did you consider messuring concentrations with a dsDNA specific assay?

• Probes were excluded if the heterozygosity rate exceeds 0.4/40%, I am puzzled by this and curious as to why? For a variant with a MAF=0.5 the expected equilibrium would be 25%AA, 25%BB and 50%AB, thus such variant could exist in a natural outbreeding population meaning you exclude valid genotypes. The loss for this is high removing ~15% of the array content as per the authors own numbers (100K/700K).

• The physical location on the chip has been known to affect call rate, presumably driven by concentration gradients over the stain flow chambers. Especially the “top” of the chip has been prone to failure on the GSA v1. Did you consider chip placement as a variable?

• For the same reason above I am slightly skeptical about the trends in table 2. Only 3:13 point deviate from absolute success (prop>99%=1) meaning those three are sure to act as leverage points. Could this just be driven by experimental artifacts from samples placement on underperforming array?

I find this to be an interesting paper with few important messages. First is that when storing samples a controlled environment is preferable, second being that even in an uncontrolled environment the genotyping by array approach generates robust and reliable results despite decades of storage.

A general weakness of these papers is that there is no consensus in neonatal biobanks on how to store samples. As done here, showing that even the uncontrolled samples have great value is important. These population samples have many advantages over more typical samples of convenience.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Feb 21;15(2):e0229352. doi: 10.1371/journal.pone.0229352.r002

Author response to Decision Letter 0


8 Jan 2020

Dear editor:

Thank you for the opportunity to revise and resubmit our manuscript, entitled “Utilization of archived neonatal dried blood spots for genome-wide genotyping.” We found all of the reviewer comments to be thoughtful and well-informed. We carefully considered each suggestion as we revised the manuscript, and we believe that addressing the reviewer’s concerns strengthened the manuscript considerably.

All revisions are indicated using track changes in the attached files. Additionally, we have included a second supplementary table (S2 Table) that addresses data concerning the associations between DBS call rate and genotyping array row position. The following are our responses and edits to the reviewers’ comments.

Reviewers’ comment 1: I disagree that 13-15 are good references for GWAS, I would characterize these as methodological papers not unlike the one in review here. None-the-less the point is valid, there are numerous examples of canonical GWAS’s done on DBS, for example out of the iPSYCH/PGC.

The methodology is appropriate there are many alternative methods and here they have chosen one specific for the filter paper they use. Material on a DBS card is scarce which can be adjusted for by using more spots, here six are used. The statistics are appropriate and the authors are wise to exclude chromosome 21 since, as stated, the methodology does poorly when deviating from chr_n=2. The results are appropriately presented.

Authors’ response 1: We appreciate the reviewers careful and thoughtful consideration of the manuscript. We agree that the references in question were not the most appropriate for GWAS of DBS. We have moved these refences to more accurately reflect the content of these papers and included new references to provide examples of GWAS using DBS.

Reviewers’ comment 2: You found no significance between in concentrations between +/- controlled, what about between concentration and call rate? Could the difference in CR>99% be confounded by low concentration?

Authors’ response 2: As correctly indicated by the reviewer, we did not identify a statistically significant difference in total DNA yield between storage environments (p=0.05), as shown in supplementary table 1. As requested, we have included a comparison of total DNA yield by call rate in Table 1. Again, we did not observe a significant association (p=0.3). Given the relatively weak associations observed, we do not believe total DNA yield is a strong confounder of the association between storage environment and call rate. Notably, including DNA yield as a covariate in our multivariable regression models did not materially impact our results.

Reviewers’ comment 3: Also the yields are high i.e. µg. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1735924/ estimated theoretical yields, recalculating this for a 3mm spot you get 60ng as done in https://www.ncbi.nlm.nih.gov/pubmed/19575812. here your median estimate is almost a order of magnitude higher. Did you consider measuring concentrations with a dsDNA specific assay?

Authors’ response 3: The reviewer raises an important limitation of our study. Although the yields were quite high in the current study, we did not specifically assess dsDNA concentrations. We have noted this limitation in our discussion section.

Reviewer’s comment 4: Probes were excluded if the heterozygosity rate exceeds 0.4/40%, I am puzzled by this and curious as to why? For a variant with a MAF=0.5 the expected equilibrium would be 25%AA, 25%BB and 50%AB, thus such variant could exist in a natural outbreeding population meaning you exclude valid genotypes. The loss for this is high removing ~15% of the array content as per the authors own numbers (100K/700K).

Authors’ response 4: Although the heterozygosity rate of 0.4/40% exclusion threshold is suggested by GenomeStudio clustering algorithm metrics, indicated in table 1 of the Infinium Genotyping Data Analysis guide (link included below), we agree with the reviewer’s concern. This exclusion criterion may be overly restrictive. Therefore, we revised our quality control measures in the DNA genotyping and quality control section of the Method (line 98-111 of the manuscript). Updated call rates were calculated based on the 472,019 SNPs that passed quality control. We then re-evaluated our analyses using the new call rates. All updated findings are indicated through track changes. Overall, our updated results are consistent with our initial findings.

https://www.illumina.com/Documents/products/technotes/technote_infinium_genotyping_data_analysis.pdf

Reviewers’ comments 5: The physical location on the chip has been known to affect call rate, presumably driven by concentration gradients over the stain flow chambers. Especially the “top” of the chip has been prone to failure on the GSA v1. Did you consider chip placement as a variable?

Authors’ response 5: Thank you for raising this important point. Notably, chip position was randomly assigned to all samples included in this study, a detail we have added to our results section. Still, we agreed with the reviewers that physical location on the genotyping array can affect sample call rate. Thus, we decided to include genotyping array position as a potential factor affecting sample call rate. With row 12 indicating the top of the array and row 1 the bottom, we tabulated sample call rate and storage environment by genotyping array row position. All results were reported in the supplemental table 2; however, we did not observe significant association for call rate (p=0.92) or storage environment (p=0.99) by array positioning.

Reviewers’ comments 6: For the same reason above I am slightly skeptical about the trends in table 2. Only 3:13 point deviate from absolute success (prop>99%=1) meaning those three are sure to act as leverage points. Could this just be driven by experimental artifacts from samples placement on underperforming array?

Authors’ response 6: The reviewer raises a valid concern. As reported in the supplementary table 2, we did not observe significant association between storage environment (p=0.99) or DBS collection period (p=0.99) and array positioning. Hence, we do not believe the significant p-trends in table 2 is confounded by DBS position on the genotyping array.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Isabelle Chemin

5 Feb 2020

Utilization of archived neonatal dried blood spots for genome-wide genotyping

PONE-D-19-32165R1

Dear Dr. Sok,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Isabelle Chemin, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Very good response to the questions/concerns raised during review.

In my opinion, this is interesting reading for anyone who considers using this type of material.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Jonas Bybjerg-Grauholm

Acceptance letter

Isabelle Chemin

7 Feb 2020

PONE-D-19-32165R1

Utilization of archived neonatal dried blood spots for genome-wide genotyping

Dear Dr. Sok:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Mrs Isabelle Chemin

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Associations between residual DBS characteristics and storage environment among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

    (DOCX)

    S2 Table. Counts of residual DBS call rate, storage environment, and collection period by genotyping array position among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

    (DOCX)

    S1 Fig. Manhattan plot showing the association of SNPs with storage environment among DBS archived at the Michigan Neonatal Biobank between 1992 and 2008.

    Black and red horizontal lines signify p-value ≤ 1×10−5 and ≤ 5×10−8, respectively.

    (TIFF)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    Data cannot be shared publicly because participants did not provide informed consent for the deposition of genetic data. Data are available from the EpiCenter at Baylor College of Medicine (contact via epicenter@bcm.edu) for researchers who meet the criteria for access to confidential data.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES