Abstract
Background: DNA methylation reflects health-related environmental exposures and genetic risk, providing insights into aetiological mechanisms and potentially predicting disease onset, progression and treatment response. An increasingly recognised need for large-scale, longitudinally-profiled samples collected world-wide has made the development of efficient and straightforward sample collection and storage procedures a pressing issue. An alternative to the low-temperature storage of EDTA tubes of venous blood samples, which are frequently the source of the DNA used in such studies, is to collect and store at room temperature blood samples using purpose built filter paper, such as Whatman FTA® cards. Our goal was to determine whether DNA stored in this manner can be used to generate DNA methylation profiles comparable to those generated using blood samples frozen in EDTA tubes.
Methods: DNA methylation profiles were obtained from matched EDTA tube and Whatman FTA® card whole-blood samples from 62 Generation Scotland: Scottish Family Health Study participants using the Infinium HumanMethylation450 BeadChip. Multiple quality control procedures were implemented, the relationship between the two sample types assessed, and epigenome-wide association studies (EWASs) performed for smoking status, age and the interaction between these variables and sample storage method.
Results: Dried blood spot (DBS) DNA methylation profiles were of good quality and DNA methylation profiles from matched DBS and EDTA tube samples were highly correlated (mean r = 0.991) and could distinguish between participants. EWASs replicated established associations for smoking and age, with no evidence for moderation by storage method.
Conclusions: Our results support the use of Whatman FTA® cards for collecting and storing blood samples for DNA methylation profiling. This approach is likely to be particularly beneficial for large-scale studies and those carried out in areas where freezer access is limited. Furthermore, our results will inform consideration of the use of newborn heel prick DBSs for research use.
Keywords: dried blood spots, DNA methylation, DBS, Guthrie cards, Generation Scotland
Introduction
Recent technological advances have facilitated the high-throughput genome-wide measurement of DNA methylation, yielding many studies reporting links between methylation and several conditions, diseases and phenotypes. The majority of studies have profiled DNA methylation in peripheral blood due to ease of sampling, which also renders it a useful tissue for identifying biomarkers. Blood samples for DNA methylation profiling are typically frozen prior to DNA extraction, with some samples being stored for years before use. As the sample numbers used for studies of DNA methylation, particularly epigenome-wide association studies (EWASs), continue to grow and the value of repeat sampling over time and exposures is recognised, so the logistical and financial issues surrounding whole blood sample collection and storage escalate. Moreover, population differences in DNA methylation 1– 4 highlight the need to profile DNA methylation around the world, including in low- and middle-income countries and rural settings, which might lack the required facilities for the low-temperature storage of EDTA blood collection tubes.
A potential alternative that would permit field collection and storage as well as postal collection is to collect finger or heel prick blood samples on filter paper 5. As part of Generation Scotland: Scottish Family Health Study (GS:SFHS) 6, 7, Whatman FTA ® cards were used to spot 100μl of the peripheral (venous) blood samples obtained at the baseline research clinic appointment, which took place between 2006 and 2011. These cards contain chemicals that lyse cell membranes and denature proteins, thus protecting DNA from degradation. Samples collected on these cards can be stored at room temperature for several years 8 prior to DNA extraction, thus avoiding the financial and logistical issues associated with freezer storage. Whatman FTA ® cards have been validated for genetic studies 9– 11; however, the effect of storing samples in this manner on the epigenome is less well-established. Previous studies provide support for the possibility of profiling DNA methylation in samples stored on Whatman FTA ® cards 8, 12; however, these studies were limited by their focus on either a small number of individuals (n = 2) 12 or a single gene of interest 8 and a relatively short time interval between sample collection and DNA extraction (maximum of approximately two years). Moreover, to the best of our knowledge, no previous study has compared genome-wide DNA methylation profiles obtained from dried blood spots (DBSs) stored on Whatman FTA ® cards with those from DNA from matched whole blood samples frozen in EDTA tubes (henceforth referred to as “EDTA samples”).
The matched DBSs and EDTA samples obtained from GS:SFHS 6, 7 have permitted us to carry out this comparison using samples from 62 participants. We show that it is technically feasible to measure DNA methylation from DBSs stored on Whatman FTA ® cards, that the methylation profiles of DBS samples are highly correlated with those from EDTA samples from the same individuals, and that EWASs using DBS samples replicate well-established differentially methylated loci associated with smoking and age. These findings indicate that this storage method represents a viable alternative to freezing blood samples for DNA methylation studies.
In addition to supporting the use of DBSs as a solution to the challenges posed by the need to collect and store large numbers of blood samples for DNA methylation studies in a wide-range of sociodemographic settings, our results also provide encouragement for the ability to profile DNA methylation in DBSs routinely collected on filter paper (commonly referred to as Guthrie cards) from new-born babies. The Guthrie cards are used clinically to screen for a prescribed set of metabolic and genetic disorders. These tests do not exhaust the sample and the remaining material is typically stored, often indefinitely. The ability to profile DNA methylation from Guthrie cards would present a number of important opportunities 13, 14, including: addressing the issues of reverse causation and confounding, which plague EWASs; anchoring population studies of gene-environment interactions at the beginning of life; and permitting the development of predictors of future disease onset, thus facilitating early intervention.
In Denmark, research access to Guthrie cards is possible through the Danish Newborn Screening Biobank 15 and in 2017 it was estimated that almost 40% of the 2.1 million samples stored in this biobank had contributed to published research 16. In Scotland, however, a moratorium on research access to Guthrie cards was imposed in 2009 and remains in place pending a public consultation 17. An important step in determining whether medical researchers will be permitted access to this resource (over 2.5 million cards collected since 1965, including 9,788 GS:SFHS participants with genome-wide genotype data and extensive phenotype data) will be assessments of the feasibility of obtaining biologically meaningful and clinically valuable data from archived DBSs. Although the Whatman FTA ® card is not identical to the card types used in the NHS Scotland new-born screening programme, some of the issues addressed in the present study, such as low DNA yield, a long period of storage, and the necessity of using a sample comprising heterogeneous cell types, are relevant to the potential use of Guthrie cards for research.
Methods
Cohort description
Genome-wide DNA methylation was profiled in whole blood samples from 62 GS:SFHS participants, chosen to have matched DNA samples from whole blood stored in EDTA tubes, DBSs stored on Whatman FTA ® cards and complete phenotype data (on sex, age and smoking status). GS:SFHS has been described in detail elsewhere 6, 7; further information and information on research access can be found here. Studies arising from GS:SFHS can be found here. Briefly, GS:SFHS comprises ~24,000 individuals who were aged 18 years or over at the time of recruitment. At the time of blood sample collection, participants were deeply phenotyped for a range of demographic, health, and social variables. A favourable ethical opinion for GS:SFHS was obtained from the NHS Tayside Committee on Medical Research Ethics, on behalf of the National Health Service (reference: 05/S1401/89). GS:SFHS has Research Tissue Bank Status (reference: 15/ES/0040).
Blood sample collection and DNA extraction
Each participant provided a blood sample (9ml), which was collected in an EDTA tube for DNA extraction, when attending the baseline GS:SFHS clinical appointment. This sample was stored at -20°C for between 33 and 497 days. At this point, 100μl of the sample (approximately equivalent to two drops of blood) was blotted onto a Whatman FTA ® classic card (GE Healthcare; cat. no. WB120205) and DNA was extracted from the remaining sample using the Nucleon BACC3 Genomic DNA Extraction Kit (GE Healthcare; cat. no. RPN8512) 18, following the manufacturer’s instructions. The Whatman FTA ® cards were stored at room temperature for between 5 and 10 years until DNA extraction for this project. DNA was extracted from the entire blood spot using the QIAamp DNA Investigator Kit (Qiagen; cat. no. 56504), following the manufacturer’s instructions. The concentration of the DNA samples was measured using a Qubit 2.0 fluorometer and the Qubit dsDNA HS assay (Thermo Fisher; cat. no. Q32854). Sample storage, DNA extraction and DNA quantification took place at the Edinburgh Clinical Research Facility at the University of Edinburgh.
Genome-wide methylation profiling
Whole blood genomic DNA (EDTA samples: 500ng; DBS samples: 212–500ng) was treated with sodium bisulphite using the EZ-96 DNA Methylation Kit (Zymo Research; cat. no. D5004), following the manufacturer’s instructions. DNA methylation was profiled using the Infinium HumanMethylation450 BeadChip (Illumina Inc.; cat. no. WG-314-1001), according to the manufacturer’s protocol. Paired EDTA and DBS samples from a given participant were profiled on separate slides and matched for row position.
Raw intensity (.idat) files were read into R 19 v. 3.5.1 using minfi 20 v. 1.28.0 and quality control was carried out using shinyMethyl 21 v. 1.18.0 and functions within wateRmelon 22 v. 1.26.0. ShinyMethyl was used to create a “QC plot”, which plots the log median intensity of the unmethylated signal against the log median intensity of the methylated signal for each array, and density plots of the beta-values separated by probe type (type 1 or 2) and colour channel for the type 1 probes. These plots were assessed for outlier samples by visual inspection. The performance of each of the 14 types of control probe on the array was assessed by inspecting plots created by shinyMethyl showing the raw signal intensities for each array. Outliers were assessed by visual inspection. ShinyMethyl’s sex prediction plot was used to assess whether a participant’s predicted sex based on the difference between the median copy number intensity for the Y chromosome and the median copy number intensity for the X chromosome matched their self-reported sex. Next, the pfilter function in wateRmelon was used to exclude poor-performing samples and probes. Samples with ≥ 1% sites with a detection p-value of > 0.05 were excluded. Probes were excluded from the dataset if: (i) ≥5% samples had a beadcount of < 3; or (ii) ≥ 0.5% samples had a detection p-value of > 0.05. Probes that had been predicted to cross-hybridise were removed using the rmSNPandCH function in the R package DMRcate 23 v. 1.18.0, which removes cross-reactive probes identified by Chen et al. 24. Probes on the X and Y chromosomes and those that target loci affected by a SNP at the CpG site or at the site of single base extension (in the case of Type 1 probes) were removed prior to carrying out EWASs.
The data from all samples were normalised together using the dasen method in wateRmelon. Dasen adjusts the background difference between Type I and Type II assays (by adding the offset between Type I and II probe intensities to Type I intensities) and then performs between-array quantile normalisation for the methylated and unmethylated signal intensities separately (Type I and Type II assays normalised separately).
Assessment of the relationship between detection p-value and DBS sample characteristics
A paired t-test was performed using R 19 v. 3.5.1 to determine whether there was a difference in the total number of sites per sample with a poor detection p-value (detection p > 0.05) between the DBS and EDTA samples, with p ≤ 0.05 deemed to be significant. The relationships between the total number of sites for each sample with a detection p-value greater than 0.05 and (i) sample storage time and (ii) quantity of DNA hybridised to the array were assessed by linear modelling, with p ≤ 0.05 deemed to be significant.
Estimation of whole blood cellular composition
Estimated cell counts for B-lymphocytes, granulocytes, monocytes, natural killer cells, CD4+ T-lymphocytes and CD8+ T-lymphocytes were obtained using the estimateCellCounts function in minfi. This function implements Jaffe and Irizarry’s 25 modification of Houseman’s 26 algorithm. Paired t-tests were performed to assess whether blood storage method (EDTA tube vs. Whatman FTA ® card) affected cellular composition. Correction for multiple testing was implemented using the Benjamini-Hochberg false discovery rate (FDR), with q-values of ≤ 0.05 deemed to be significant.
Assessment of the correlation between methylation profiles obtained from EDTA tube and DBS paired samples and unsupervised hierarchical clustering
Following the method described by Hannon et al. (2015) 27, a set of probes deemed to be variable in our sample was identified. The pair-wise correlation between all samples was assessed by calculating Pearson correlation coefficients. Unsupervised hierarchical clustering plots were generated using the R package lumi 28 v. 2.34.0 using the same set of variable probes as used for the correlation analysis.
Identification of differentially methylated positions associated with smoking or age and the interaction with storage method
EWASs were carried out (i) to identify loci where DNA methylation is associated with smoking or age in the EDTA or DBS samples and (ii) to establish whether these associations were moderated by storage method. EWASs for smoking and age were implemented in the R package limma 29 v. 3.38.3 by fitting linear models in which DNA methylation (beta-value) was the outcome variable and age or smoking (a categorical variable with the following levels: current smoker, former smoker who gave up under 12 months ago, former smoker who gave up 12 months or longer ago, or never smoked) was the predictor of interest. When smoking status was the predictor of interest, the following covariates were included in the model: age, sex, slide ID, row number and estimated proportions of B-lymphocytes, granulocytes, natural killer cells, CD4+ T-lymphocytes and CD8+ T-lymphocytes. The same covariates were included for the EWAS of age with the addition of “pack years”, a measure that indicates an individual’s lifetime exposure to tobacco. Pack years were calculated by multiplying the years an individual had smoked for by the maximum number of packs of cigarettes they ever smoked per day (a pack = 20 cigarettes). A conversion was used for cigars (a cigar = four cigarettes) and rolling tobacco (a 25g pack = 50 cigarettes). EWASs to assess the interaction between smoking status or age and storage method (EDTA tube or Whatman FTA ® card) were carried out using linear mixed effects models implemented in the R package nlme 30 v. 3.1-137. The fixed effects were the same as for the main effects EWASs, with the addition of a variable representing storage method, and each participant was modelled with a random intercept. The threshold for significance was defined as p ≤ 3.6 × 10 -8, as recommended by Saffari et al. (2018) 31.
Results
Extraction of DNA from DBSs stored on Whatman FTA ® cards
The concentration of the DNA extracted from the DBSs ranged from 3.8 ng/μl to 145.8 ng/μl, with a mean yield of 18.0 ng/μl, in a total volume of 50 μl. The recommended minimum input for the Infinium HumanMethylation450 BeadChip, 500 ng DNA, was available for 43 of the 62 samples, with all but two samples having at least 250 ng DNA as input. The lowest array input was 212 ng DNA.
Quality control of DNA methylation arrays
Initial assessment of array performance using the R package shinyMethyl indicated that all arrays had performed well. Neither the QC plot ( Figure 1) nor the density plots of M-values and beta-values ( Figure 2) produced using shinyMethyl indicated any obvious outliers. Both plot types did, however, indicate a slight shift between the EDTA and DBS samples. From the QC plot it can be seen that there was a tendency for the DBS samples to have slightly lower methylated signal intensity (y axis) and a higher unmethylated signal intensity (x axis). From the density plots, a slight shift in distributions, which was most evident at the extremes of the beta-value range (i.e. 0 and 1), could be seen for type 1 probes in both the green ( Figure 2A) and red ( Figure 2B) colour channels and type 2 probes ( Figure 2C). This shift was most obvious for type 2 probes at the methylated end of the beta-value range, such that fewer sites in DBS samples showed the highest levels of DNA methylation. These differences in overall signal intensity were not, however, reflected by differences in the performance of the control probes. Plots of the output of these probes supported the conclusion that all arrays had performed well, with no obvious outliers ( Figure 3). Importantly, the bisulphite conversion control probes did not indicate a difference in bisulphite conversion efficiency between EDTA and DBS samples and the non-polymorphic control probes, which measure overall assay performance, did not appear to differ according to sample storage method.
In keeping with these observations, no samples were excluded based on the proportion of poorly detected sites. There was, however, a significant difference between the DBS and EDTA blood samples in the number of sites per sample with a poor detection p-value ( p = 0.0177), such that the DBS samples had fewer sites per sample with a poor detection p-value than the EDTA samples (mean difference = -63.7 sites/sample). For the DBS samples, the total number of sites per sample with a poor detection p-value was not related to storage time ( p = 0.498) or the amount of DNA hybridised to the array ( p = 0.220).
In total, 292 sites were excluded as their beadcount was lower than three in at least five percent of samples and 2,280 sites were excluded due to having at least one percent of samples with a detection p-value greater than 0.05. Following the exclusion of probes that have been predicted to cross-hybridise, 452,278 probes were retained in the dataset, with this number being reduced to 436,309 after excluding probes on the X and Y chromosomes and those affected by a SNP at the CpG site or the site of single base extension (for type 1 probes).
Participant and sample characteristics
The mean age of the participants was 31.8 years (range = 18.1–42.8 years; standard deviation = 7.2 years) and 62.9% of the sample were female. Current smokers numbered 14, and 32 had never smoked. The mean storage time for the DBS samples was 7.78 years (range = 5.79 – 9.96 years).
Assessment of the effect of storage method on whole blood cellular composition
Sample storage method (EDTA vs. DBS) was not observed to be associated with the proportion of any of the six estimated blood cell types (min. q = 0.103; Table 1).
Table 1. Assessment of the relationship between blood cell type proportions and sample storage method.
Cell type | t-statistic | p-value | q-value |
---|---|---|---|
B-cells | -0.63 | 0.528 | 0.736 |
CD4+ T-cells | -1.56 | 0.124 | 0.373 |
CD8+ T-cells | 2.45 | 0.0171 | 0.103 |
Granulocytes | 0.34 | 0.736 | 0.736 |
Monocytes | 0.40 | 0.687 | 0.736 |
Natural Killer cells | -1.19 | 0.240 | 0.480 |
Assessment of the comparability of DNA methylation profiles obtained from DBS and EDTA tube-stored whole blood samples
Using a subset of probes defined as showing variation in our sample (n = 190,044), Pearson correlation coefficients were calculated between all pairs of samples. For all participants, the highest correlation coefficient was observed between the two blood sample types (EDTA and DBS) from that participant (mean r = 0.991, range = 0.976 – 0.995). For comparison, the pairwise correlation coefficients between pairs of samples from different participants (across both storage methods) had a mean of r = 0.953 and ranged from 0.904 – 0.976. Consistent with these observations, unsupervised hierarchical clustering performed using the same variable set of probes indicated a high level of similarity between the DBS and EDTA samples from the same participant, with methylation differences being sufficient to distinguish between participants ( Figure 4).
Assessment of the relationship between DNA methylation and Whatman FTA ® card storage time
An EWAS was performed to establish whether the length of time that a sample is stored on a Whatman FTA ® card is associated with altered DNA methylation. This did not reveal any significant differences (min. p = 1.19 × 10 -5; summary statistics are available as Underlying data 32)
Assessment of the effect of sample storage method on the relationship between DNA methylation and age and smoking
Age and smoking have been robustly associated with altered DNA methylation by several previous studies. EWASs were carried out to (i) identify age- and smoking-associated differentially methylated positions (DMPs) in the EDTA and DBS samples when considered separately and (ii) assess whether sample storage method exerted an effect on these associations. EWASs for age and smoking status identified significant DMPs in both the EDTA and the DBS samples ( Table 2– Table 5; complete summary statistics are available as Underlying data 32), while no significant associations were identified between DNA methylation and the interaction between storage method and age or smoking (min p = 5.57 × 10 -6; summary statistics are available as Underlying data 32).
Table 2. Significant results from an epigenome-wide association study (EWAS) comparing current and never smokers carried out using dried blood spot (DBS) samples.
CpG Name | Gene Symbol | Gene Name | Chr. | Position | t | p-value |
---|---|---|---|---|---|---|
cg05575921 | AHRR | aryl-hydrocarbon receptor repressor | 5 | 373378 | -11.18 | 1.06 × 10 -13 |
cg26703534 | AHRR | aryl-hydrocarbon receptor repressor | 5 | 377358 | -8.35 | 3.35 × 10 -10 |
cg21161138 | AHRR | aryl-hydrocarbon receptor repressor | 5 | 399360 | -7.05 | 1.88 × 10 -8 |
Table 3. Significant results from an epigenome-wide association study (EWAS) comparing current and never smokers carried out using EDTA samples.
CpG Name | Gene Symbol | Gene Name | Chr. | Position | t | p-value |
---|---|---|---|---|---|---|
cg05575921 | AHRR | aryl-hydrocarbon receptor repressor | 5 | 373378 | -10.09 | 2.12 × 10 -12 |
cg26703534 | AHRR | aryl-hydrocarbon receptor repressor | 5 | 377358 | -7.54 | 4.00 × 10 -9 |
cg21161138 | AHRR | aryl-hydrocarbon receptor repressor | 5 | 399360 | -7.41 | 6.11 × 10 -9 |
Table 4. The top 10 significant results from an epigenome-wide association study (EWAS) of age carried out using dried blood spot (DBS) samples.
CpG Name | Gene Symbol | Gene Name | Chr. | Position | t | p-value |
---|---|---|---|---|---|---|
cg16867657 | ELOVL2 | ELOVL fatty acid elongase 2 | 6 | 11044877 | 16.04 | 1.85 × 10 -18 |
cg10501210 | NA | NA | 1 | 207997020 | -9.39 | 2.01 × 10 -11 |
cg07553761 | TRIM59 | tripartite motif containing 59 | 3 | 160167977 | 9.36 | 2.20 × 10 -11 |
cg24724428 | ELOVL2 | ELOVL fatty acid elongase 2 | 6 | 11044888 | 8.97 | 6.65 × 10 -11 |
cg06639320 | FHL2 | four and a half LIM domains 2 | 2 | 106015739 | 8.75 | 1.26 × 10 -10 |
cg22454769 | FHL2 | four and a half LIM domains 2 | 2 | 106015767 | 8.58 | 2.07 × 10 -10 |
cg23606718 | AMER3 | APC membrane recruitment protein 3 | 2 | 131513927 | 7.71 | 2.86 × 10 -9 |
cg21572722 | ELOVL2 | ELOVL fatty acid elongase 2 | 6 | 11044894 | 7.41 | 7.11 × 10 -9 |
cg12934382 | GRM2 | glutamate metabotropic receptor 2 | 3 | 51741135 | 7.27 | 1.09 × 10 -8 |
cg08957484 | CCNI2 | cyclin I family member 2 | 5 | 132083532 | 7.20 | 1.36 × 10 -8 |
Table 5. The top 10 significant results from an epigenome-wide association study (EWAS) of age carried out using EDTA samples.
CpG Name | Gene Symbol | Gene Name | Chr. | Position | t | p-value |
---|---|---|---|---|---|---|
cg16867657 | ELOVL2 | ELOVL fatty acid elongase 2 | 6 | 11044877 | 11.61 | 4.88 × 10 -14 |
cg22454769 | FHL2 | four and a half LIM domains 2 | 2 | 106015767 | 7.94 | 1.41 × 10 -9 |
cg10501210 | NA | NA | 1 | 207997020 | -7.91 | 1.53 × 10 -9 |
cg07547549 | SLC12A5 | solute carrier family 12 member 5 | 20 | 44658225 | 7.48 | 5.71 × 10 -9 |
cg16419235 | PENK | proenkephalin | 8 | 57360613 | 7.34 | 8.83 × 10 -9 |
cg06639320 | FHL2 | four and a half LIM domains 2 | 2 | 106015739 | 7.29 | 1.01 × 10 -8 |
cg21572722 | ELOVL2 | ELOVL fatty acid elongase 2 | 6 | 11044894 | 6.68 | 6.78 × 10 -8 |
cg23606718 | AMER3 | APC membrane recruitment protein 3 | 2 | 131513927 | 6.60 | 8.74 × 10 -8 |
cg17737621 | NA | NA | 20 | 21372480 | 6.60 | 8.80 × 10 -8 |
cg23500537 | NA | NA | 5 | 140419819 | 6.57 | 9.58 × 10 -8 |
Discussion
In agreement with previous studies, we have demonstrated that it is possible to obtain high quality DNA methylation profiles from archived DBSs stored on Whatman FTA ® cards and, for the first time, shown that these profiles are highly comparable to those obtained from matched whole blood DNA obtained from samples stored in EDTA tubes. Our findings support the use of Whatman FTA ® cards for the routine collection of blood samples for studies of DNA methylation. There are potentially substantial benefits to this approach, particularly for large-scale studies and those carried out in rural or remote settings and/or in low- and middle-income countries. The blood collection by finger prick protocol is comparable to that used for blood glucose monitoring in diabetes and could be self-administered by study participants.
DNA methylation profiles obtained from the DBSs appeared to be of good quality: on almost all comparison metrics, the performance of the DBS samples could not be distinguished from the performance of the EDTA tube samples. There was a tendency for DBS samples to have higher methylated signal intensity and lower unmethylated signal intensity than the EDTA samples, but this did not result in a difference in the performance of the control probes, or in the number of sites with a poor detection p-value. Encouragingly, the overall rate of site detection was slightly better in the DBS samples than the EDTA samples. These observations support and extend those of previous studies in which DBS DNA (both from Whatman FTA ® cards and other card types) has been profiled successfully using Infinium BeadArrays 12, 33– 37.
Two concerns cited when considering the use of archived DBSs are the effects of sample quantity and degradation with storage time 14. Here we demonstrate that despite obtaining less DNA than recommended as input for the Infinium HumanMethylation450 BeadChip from 19 of the DBS samples, high quality methylation profiles were obtained. Moreover, we did not observe an effect of storage time on the success of methylation profiling; however, as the longest time that a sample was stored on a Whatman FTA ® card in the present study was 10 years, it remains possible that longer storage times might exert an effect. An earlier demonstration of successful methylation profiling of two DBS samples that had been stored for 26–28 years on standard filter paper 35 does, however, provide additional support for the notion that extended storage time (as is the case for some potentially valuable archival sample collections) might not exert a deleterious effect.
Correlation and unsupervised hierarchical clustering analyses demonstrated a high degree of concordance between the DNA methylation profiles obtained from the Whatman FTA ® DBSs and the EDTA samples. These findings are consistent with those of Joo et al. (2013) 36 and Hollegard et al. (2013) 35 who showed methylation profiles obtained from DBS DNA samples to be highly correlated and/or cluster with those from paired buffy coat DNA or whole blood DNA samples from the same individual. These studies only considered small numbers of participants (n = two or five) for whom the DBSs had been stored for three years, rendering our findings in 62 participants with DBSs that had been stored for a mean period of eight years an important addition to previous studies.
EWASs of smoking and age in both the DBS and EDTA samples produced results that replicated previously established associations identified using peripheral blood DNA. The most significant locus identified in the EWASs of smoking status in both sample types was cg05575921 in AHRR, which was hypomethylated in smokers. This replicates multiple previous studies 38, 39, including meta-analyses 40, 41, and cg05575921 has been suggested as a biomarker of smoking status 42. Interestingly, one of the validation cohorts (Melbourne Collaborative Cohort Study 43) assessed by Fasanelli et al. (2015) 39 comprised 75% DBS samples. This cohort was used to successfully validate the two most significant DMPs (including cg05575921) identified in the discovery cohort.
The most significant locus in the EWASs of age in both the DBS and EDTA samples was cg16867657 in ELOVL2, where methylation was positively correlated with age, replicating previous studies 44, 45. Methylation at this locus has been suggested as a biomarker of aging, with potential utility for forensic investigation 46.
No significant associations were observed between DNA methylation and the interaction between smoking or age and storage method. This observation indicates that storing blood samples as DBSs at room temperature for approximately eight years is unlikely to affect the results of EWASs for traits that are robustly associated with variation in DNA methylation.
It is of interest to consider these findings in the context Dugué et al.’s (2016) 47 demonstration of lower reliability of DNA methylation profiling of DBSs using Infinium HumanMethylation450 BeadChips (compared to the moderate reliability of peripheral blood mononuclear cell DNA samples). This finding, which was based on the calculation of probe-wise intraclass correlation coefficients for technical replicates (ICC; the median of which was 0.20 for DBS technical replicates), suggests that, in general, there is a higher degree of measurement error when assaying DBS DNA. In light of this finding, Dugué et al. suggested that it may be advisable to only include probes with an ICC above a certain threshold when carrying out EWASs using DBS samples. It is clear from the present results, however, that measurement error was not so large as to obfuscate the pair-wise relationship between samples in the study or to confound EWASs of variables known to be robustly associated with variation in DNA methylation.
A disadvantage to storing blood samples using Whatman FTA ® cards is that it is not possible to sort the sample into separate blood cell types. As DNA methylation varies between blood cell types, the issue of cellular heterogeneity must be addressed in order to avoid confounding in EWASs using whole blood DNA 25, 26. Fortunately, several algorithms have been developed previously to deal with the issue of cellular heterogeneity in EWAS studies (see Kaushal et al., 2017 48 for a comparison of these methods) and our data supports the applicability of these to blood samples stored using Whatman FTA ® cards.
Taken together, the results of this study support the use of Whatman FTA ® cards as an alternative, more cost- and space-effective storage method for blood samples collected for DNA methylation studies. We demonstrate that DNA from DBSs stored for up to 10 years can be successfully profiled using Infinium BeadChips, producing methylation profiles that are highly correlated with matched EDTA samples. We envisage that this approach to collecting and storing blood samples for DNA methylation profiling will help circumvent many issues faced by researchers amassing large collections of blood samples, particularly where access to freezer facilities is limited. These findings are also relevant to the ongoing discussion regarding the potential use of new-born Guthrie cards for research purposes and supports previous work assessing the potential for methylomic profiling of these samples 34– 36, 49, 50.
Data availability
Underlying data
Due to the confidential and potentially identifiable nature of the DNA methylation data and participant phenotypic information, it is not possible to publically share the individual level data used for these analyses. Researchers can, however, request access to all GS data by contacting the GS Access Committee ( access@generationscotland.org). GS operates a managed data access process including an online application form, which will be reviewed by the GS Access Committee. Summary information to help researchers assess the feasibility and statistical power of a proposed project can be requested by contacting resources@generationscotland.org. Epigenome-wide association study (EWAS) summary statistics are available from the University of Edinburgh DataShare website.
Edinburgh DataShare: Comparison of DNA methylation profiles from Generation Scotland blood samples stored on Whatman FTA cards and in EDTA tubes: EWAS summary statistics. https://doi.org/10.7488/ds/2502 32
This project contains the following underlying data:
storage_time_DBS_EWAS_summary_stats.csv (EWAS of storage time carried out using blood samples stored on Whatman FTA cards)
age_EDTA_EWAS_summary_stats.csv (EWAS of age carried out using blood samples stored in EDTA tubes)
age_DBS_EWAS_summary_stats.csv (EWAS of age carried out using blood samples stored on Whatman FTA cards)
smoking_EDTA_EWAS_summary_stats.csv (EWAS of current vs. never smokers carried out using blood samples stored in EDTA tubes)
smoking_DBS_EWAS_summary_stats.csv (EWAS of current vs. never smokers carried out using blood samples stored on Whatman FTA cards)
age_storage_method_interaction_EWAS_summary_stats.csv (EWAS of the interaction between age and storage method (EDTA tube vs. Whatman FTA card) carried out using paired blood samples)
smoking_storage_method_interaction_EWAS_summary_stats.csv (EWAS of the interaction between smoking status (current or never) and storage method (EDTA tube vs. Whatman FTA card) carried out using paired blood samples)
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
Acknowledgements
We are grateful to everyone who took part in GS: SFHS, the GPs and Scottish School of Primary Care for their help in recruitment, and the whole GS team, which includes academic researchers, clinic staff, laboratory technicians, clerical workers, IT staff, statisticians and research managers.
Funding Statement
This work was supported by the Wellcome Trust through a Strategic Award “STratifying Resilience and Depression Longitudinally” (STRADL) to AMM, KLE and DJP [104036]. This work was also supported by an MRC Mental Health Data Pathfinder Grant [MC_PC_17209] to AMM and DJP, and a MRC GCRF Mental Health pump-priming Award [MC_PC_MR/R01910X/1] to AMM. Generation Scotland received core support from the Chief Scientist Office of the Scottish Government Health Directorates [CZD/16/6] and the Scottish Funding Council [HR03006]. RMW, AMM, DJP, and KLE are members of The University of Edinburgh Centre for Cognitive Ageing and Cognitive Epidemiology (CCACE), part of the cross-council Lifelong Health and Wellbeing Initiative [MR/K026992/1]. Funding for CCACE from the Biotechnology and Biological Sciences Research Council and Medical Research Council is gratefully acknowledged.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 1; peer review: 2 approved]
References
- 1. Barfield RT, Almli LM, Kilaru V, et al. : Accounting for population stratification in DNA methylation studies. Genet Epidemiol. 2014;38(3):231–41. 10.1002/gepi.21789 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Fraser HB, Lam LL, Neumann SM, et al. : Population-specificity of human DNA methylation. Genome Biol. 2012;13(2):R8. 10.1186/gb-2012-13-2-r8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Moen EL, Zhang X, Mu W, et al. : Genome-wide variation of cytosine modifications between European and African populations and the implications for complex traits. Genetics. 2013;194(4):987–96. 10.1534/genetics.113.151381 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Husquin LT, Rotival M, Fagny M, et al. : Exploring the genetic basis of human population differences in DNA methylation and their causal impact on immune gene regulation. Genome Biol. 2018;19(1):222. 10.1186/s13059-018-1601-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Smit PW, Elliott I, Peeling RW, et al. : An overview of the clinical use of filter paper in the diagnosis of tropical diseases. Am J Trop Med Hyg. 2014;90(2):195–210. 10.4269/ajtmh.13-0463 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Smith BH, Campbell A, Linksted P, et al. : Cohort Profile: Generation Scotland: Scottish Family Health Study (GS:SFHS). The study, its participants and their potential for genetic research on health and illness. Int J Epidemiol. 2013;42(3):689–700. 10.1093/ije/dys084 [DOI] [PubMed] [Google Scholar]
- 7. Smith BH, Campbell H, Blackwood D, et al. : Generation Scotland: the Scottish Family Health Study; a new resource for researching genes and heritability. BMC Med Genet. 2006;7:74. 10.1186/1471-2350-7-74 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Serra O, Frazzi R, Perotti A, et al. : Use of FTA ® classic cards for epigenetic analysis of sperm DNA. Biotechniques. 2018;64(2):45–51. 10.2144/btn-2017-0101 [DOI] [PubMed] [Google Scholar]
- 9. Mas S, Crescenti A, Gassó P, et al. : DNA cards: determinants of DNA yield and quality in collecting genetic samples for pharmacogenetic studies. Basic Clin Pharmacol Toxicol. 2007;101(2):132–7. 10.1111/j.1742-7843.2007.00089.x [DOI] [PubMed] [Google Scholar]
- 10. Al Safar HS, Abidi FH, Khazanehdari KA, et al. : Evaluation of different sources of DNA for use in genome wide studies and forensic application. Appl Microbiol Biotechnol. 2011;89(3):807–15. 10.1007/s00253-010-2926-3 [DOI] [PubMed] [Google Scholar]
- 11. He H, Argiro L, Dessein H, et al. : Improved technique that allows the performance of large-scale SNP genotyping on DNA immobilized by FTA technology. Infect Genet Evol. 2007;7(1):128–32. 10.1016/j.meegid.2006.06.001 [DOI] [PubMed] [Google Scholar]
- 12. Ghantous A, Saffery R, Cros MP, et al. : Optimized DNA extraction from neonatal dried blood spots: application in methylome profiling. BMC Biotechnol. 2014;14:60. 10.1186/1472-6750-14-60 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Ramagopalan SV, Rakyan VK: The promise and challenges of blood spot methylomics. Epigenetics. 2013;8(8):775–7. 10.4161/epi.25357 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Cruickshank MN, Pitt J, Craig JM: Going back to the future with Guthrie-powered epigenome-wide association studies. Genome Med. 2012;4(10):83. 10.1186/gm384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Nørgaard-Pedersen B, Hougaard DM: Storage policies and use of the Danish Newborn Screening Biobank. J Inherit Metab Dis. 2007;30(4):530–6. 10.1007/s10545-007-0631-x [DOI] [PubMed] [Google Scholar]
- 16. Nordfalk F, Ekstrøm CT: Newborn dried blood spot samples in Denmark: the hidden figures of secondary use and research participation. Eur J Hum Genet. 2019;27(2):203–210. 10.1038/s41431-018-0276-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Laurie G, Hunter K, Cunningham-Burley S: Storage, Use and Access to the Scottish Guthrie Card Collection: Ethical, Legal, and Social Issues. Scottish Government Social Research.2014. Reference Source [Google Scholar]
- 18. Kerr SM, Campbell A, Murphy L, et al. : Pedigree and genotyping quality analyses of over 10,000 DNA samples from the Generation Scotland: Scottish Family Health Study. BMC Med Genet. 2013;14:38. 10.1186/1471-2350-14-38 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Team RC: R: A language and environment for statistical computing. R Foundation for Statistical Computing: Vienna, Austria.2018. [Google Scholar]
- 20. Aryee MJ, Jaffe AE, Corrada-Bravo H, et al. : Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30(10):1363–9. 10.1093/bioinformatics/btu049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Fortin JP, Fertig E, Hansen K: shinyMethyl: interactive quality control of Illumina 450k DNA methylation arrays in R [version 2; referees: 2 approved]. F1000Res. 2014;3:175. 10.12688/f1000research.4680.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Pidsley R, Y Wong CC, Volta M, et al. : A data-driven approach to preprocessing Illumina 450K methylation array data. BMC Genomics. 2013;14:293. 10.1186/1471-2164-14-293 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Peters TJ, Buckley MJ, Statham AL, et al. : De novo identification of differentially methylated regions in the human genome. Epigenetics Chromatin. 2015;8:6. 10.1186/1756-8935-8-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Chen YA, Lemire M, Choufani S, et al. : Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013;8(2):203–9. 10.4161/epi.23470 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Jaffe AE, Irizarry RA: Accounting for cellular heterogeneity is critical in epigenome-wide association studies. Genome Biol. 2014;15(2):R31. 10.1186/gb-2014-15-2-r31 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Houseman EA, Accomando WP, Koestler DC, et al. : DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics. 2012;13:86. 10.1186/1471-2105-13-86 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Hannon E, Lunnon K, Schalkwyk L, et al. : Interindividual methylomic variation across blood, cortex, and cerebellum: implications for epigenetic studies of neurological and neuropsychiatric phenotypes. Epigenetics. 2015;10(11):1024–32. 10.1080/15592294.2015.1100786 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Du P, Kibbe WA, Lin SM: lumi: a pipeline for processing Illumina microarray. Bioinformatics. 2008;24(13):1547–8. 10.1093/bioinformatics/btn224 [DOI] [PubMed] [Google Scholar]
- 29. Ritchie ME, Phipson B, Wu D, et al. : limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47. 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Pinheiro JBD, DebRoy S, Sarkar D, et al. : nlme: Linear and Nonlinear Mixed Effects Models.2018. Reference Source [Google Scholar]
- 31. Saffari A, Silver MJ, Zavattari P, et al. : Estimation of a significance threshold for epigenome-wide association studies. Genet Epidemiol. 2018;42(1):20–33. 10.1002/gepi.22086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Walker R, MacGillivray L, McCafferty S, et al. : Comparison of DNA methylation profiles from Generation Scotland blood samples stored on Whatman FTA cards and in EDTA tubes: EWAS summary statistics, [dataset].University of Edinburgh. School of Molecular, Genetic and Population Health Sciences. Institute of Genetics and Molecular Medicine.2019. 10.7488/ds/2502 [DOI] [Google Scholar]
- 33. Asrani K, Shaw GM, Rine J, et al. : DNA Methylome Profiling on the Infinium HumanMethylation450 Array from Limiting Quantities of Genomic DNA from a Single, Small Archived Bloodspot. Genet Test Mol Biomarkers. 2017;21(8):516–519. 10.1089/gtmb.2017.0019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Beyan H, Down TA, Ramagopalan SV, et al. : Guthrie card methylomics identifies temporally stable epialleles that are present at birth in humans. Genome Res. 2012;22(11):2138–45. 10.1101/gr.134304.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Hollegaard MV, Grauholm J, Nørgaard-Pedersen B, et al. : DNA methylome profiling using neonatal dried blood spot samples: a proof-of-principle study. Mol Genet Metab. 2013;108(4):225–31. 10.1016/j.ymgme.2013.01.016 [DOI] [PubMed] [Google Scholar]
- 36. Joo JE, Wong EM, Baglietto L, et al. : The use of DNA from archival dried blood spots with the Infinium HumanMethylation450 array. BMC Biotechnol. 2013;13:23. 10.1186/1472-6750-13-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Cruickshank MN, Oshlack A, Theda C, et al. : Analysis of epigenetic changes in survivors of preterm birth reveals the effect of gestational age and evidence for a long term legacy. Genome Med. 2013;5(10):96. 10.1186/gm500 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Zeilinger S, Kuhnel B, Klopp N, et al. : Tobacco smoking leads to extensive genome-wide changes in DNA methylation. PLoS One. 2013;8(5):e63812. 10.1371/journal.pone.0063812 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Fasanelli F, Baglietto L, Ponzi E, et al. : Hypomethylation of smoking-related genes is associated with future lung cancer in four prospective cohorts. Nat Commun. 2015;6:10192. 10.1038/ncomms10192 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Gao X, Jia M, Zhang Y, et al. : DNA methylation changes of whole blood cells in response to active smoking exposure in adults: a systematic review of DNA methylation studies. Clin Epigenetics. 2015;7:113. 10.1186/s13148-015-0148-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Joehanes R, Just AC, Marioni RE, et al. : Epigenetic Signatures of Cigarette Smoking. Circ Cardiovasc Genet. 2016;9(5):436–447. 10.1161/CIRCGENETICS.116.001506 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Bojesen SE, Timpson N, Relton C, et al. : AHRR (cg05575921) hypomethylation marks smoking behaviour, morbidity and mortality. Thorax. 2017;72(7):646–653. 10.1136/thoraxjnl-2016-208789 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Giles GG, English DR: The Melbourne Collaborative Cohort Study. IARC Sci Publ. 2002;156:69–70. [PubMed] [Google Scholar]
- 44. Kananen L, Marttila S, Nevalainen T, et al. : Aging-associated DNA methylation changes in middle-aged individuals: the Young Finns study. BMC Genomics. 2016;17:103. 10.1186/s12864-016-2421-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Slieker RC, Relton CL, Gaunt TR, et al. : Age-related DNA methylation changes are tissue-specific with ELOVL2 promoter methylation as exception. Epigenetics Chromatin. 2018;11(1):25. 10.1186/s13072-018-0191-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Garagnani P, Bacalini MG, Pirazzini C, et al. : Methylation of ELOVL2 gene as a new epigenetic marker of age. Aging Cell. 2012;11(6):1132–4. 10.1111/acel.12005 [DOI] [PubMed] [Google Scholar]
- 47. Dugué PA, English DR, MacInnis RJ, et al. : Reliability of DNA methylation measures from dried blood spots and mononuclear cells using the HumanMethylation450k BeadArray. Sci Rep. 2016;6:30317. 10.1038/srep30317 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Kaushal A, Zhang H, Karmaus WJJ, et al. : Comparison of different cell type correction methods for genome-scale epigenetics studies. BMC Bioinformatics. 2017;18(1):216. 10.1186/s12859-017-1611-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Staunstrup NH, Starnawska A, Nyegaard M, et al. : Genome-wide DNA methylation profiling with MeDIP-seq using archived dried blood spots. Clin Epigenetics. 2016;8:81. 10.1186/s13148-016-0242-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Wong N, Morley R, Saffery R, et al. : Archived Guthrie blood spots as a novel source for quantitative DNA methylation analysis. Biotechniques. 2008;45(4):423–4, 426, 428 passim. 10.2144/000112945 [DOI] [PubMed] [Google Scholar]