RNA sequencing of archived neonatal dried blood spots

Jonas Bybjerg-Grauholm; Christian Munch Hagen; Sok Kean Khoo; Maria Louise Johannesen; Christine Søholm Hansen; Marie Bækvad-Hansen; Michael Christiansen; David Michael Hougaard; Mads V Hollegaard

doi:10.1016/j.ymgmr.2016.12.004

. 2016 Dec 24;10:33–37. doi: 10.1016/j.ymgmr.2016.12.004

RNA sequencing of archived neonatal dried blood spots

Jonas Bybjerg-Grauholm ^a,^b,^⁎,¹, Christian Munch Hagen ^a,^b,¹, Sok Kean Khoo ^c, Maria Louise Johannesen ^a,^b, Christine Søholm Hansen ^a,^b, Marie Bækvad-Hansen ^a,^b, Michael Christiansen ^a,^b,^d, David Michael Hougaard ^a,^b, Mads V Hollegaard ^a,^b

PMCID: PMC5198792 PMID: 28053876

Abstract

Neonatal dried blood spots (DBS) are routinely collected on standard Guthrie cards for all-comprising national newborn screening programs for inborn errors of metabolism, hypothyroidism and other diseases. In Denmark, the Guthrie cards are stored at − 20 °C in the Danish Neonatal Screening Biobank and each sample is linked to elaborate social and medical registries. This provides a unique biospecimen repository to enable large population research at a perinatal level. Here, we demonstrate the feasibility to obtain gene expression data from DBS using next-generation RNA sequencing (RNA-seq). RNA-seq was performed on five males and five females. Sequencing results have an average of > 30 million reads per sample. 26,799 annotated features can be identified with 64% features detectable without fragments per kilobase of transcript per million mapped reads (FPKM) cutoff; number of detectable features dropped to 18% when FPKM ≥ 1. Sex can be discriminated using blood-based sex-specific gene set identified by the Genotype-Tissue Expression consortium. Here, we demonstrate the feasibility to acquire biologically-relevant gene expression from DBS using RNA-seq which provide a new avenue to investigate perinatal diseases in a high throughput manner.

Abbreviations: DBS, Dried blood spots; DNSB, Danish Neonatal Screening Biobank; GTEx, Genotype-Tissue Expression consortium

Keywords: Gene expression, RNA-seq, Limited material, Neonatal Screening, Guthrie Cards, Dried blood spots

1. Introduction

Gene expression analysis captures a snapshot of cellular activity that reflects the response to genetic, environmental and epigenetic changes in a biological system. Determining phenotypes at the molecular level with differentially-expressed genes, using high-throughput technologies such as microarray and next-generation RNA-sequencing (RNA-seq), could help bridge the current gap between genotypes and phenotypes [1]. Neonatal dried blood spots (DBS) represent a unique biospecimen resource that can provide the “baseline” gene expression at birth for genotype-phenotype studies. In many developed countries, DBS are routinely collected on standard Guthrie cards for newborn screening to diagnose inborn errors of metabolism, hypothyroidism and other diseases [2]. Post-diagnostic gene expression analysis of DBS can potentially give insights into the cellular response of molecular pathological factors and thereby the etiology of a specific disease. For example, it has been shown that microarray gene expression profiles from DBS can predict children with cerebral palsy [3].

Denmark has an all-comprising comprehensive system of registries which hold and collect information on social and health related life events for each resident linked through a unique person identifiable number (CPR number) [4], [5], [6]. Since the early 1980s, the Danish Neonatal Screening Biobank (DNSB) has been systematically collecting and storing surplus DBS from the neonatal screening program. While the primary purpose of DBS is to diagnose and treat congenital disorders, it can also be used for research purposes with appropriate approval [7], [8]. Several methods have been developed, primarily concerned with utilizing DNA from DBS [9], [10], [11], [12]. In addition, we and several other groups have shown the feasibility of obtaining RNA and generating RNA gene expression microarray data from DBS for various studies [3], [13], [14], [15], [16], [17], [18], [19].

RNA microarray technologies are fast, robust and relatively inexpensive. However, they have inherent problems such as differing hybridization properties of probes, as well as predominant probe placements near the 3′ end of transcripts. RNA-seq, a cutting-edge massively parallel next-generation sequencing technique, has the capacity to sequence each transcript on a single nucleotide basis. While above median transcripts are generally reliably detected by both RNA microarrays and RNA-seq, RNA-seq offers several significant advantages over microarray such as the capability to distinguish isoforms, call sequence variants, and provide improved accuracy for low-abundance transcripts [20], [21], [22]. However, the nature of total RNA, from blood in particular, does complicate sequencing approaches. Firstly, approximately 80% of total RNA from any tissue source is ribosomal RNA (rRNA). Thus, rRNA is usually reduced or removed prior to RNA-seq [23], [24]. Secondly, in blood, 50–90% of mRNA transcripts are globin species [21], [25]; globin may contribute to uninformative reads that are present in high abundance thus compromising RNA-seq analysis. Globin depletion can remove ~ 80% of globin species, improve correlation of technical replicates, and enable an additional 3500 RNA transcripts to be detected [21]. In neonatal DBS, globin is primarily derived from fetal hemoglobin (HbF), the predominant form of hemoglobin in the developing fetus which persists in newborn blood until gradually switching over to adult hemoglobin (HbA) ~ 6 months post-birth [26], [27]. HbF could be depleted using the same methodology as HbA, but it remains untested whether sequence similarity is high enough to use the exact same method.

To date, RNA-seq for neonatal DBS has not been reported. Here, we conducted a proof-of-concept study to demonstrate the feasibility of obtaining robust and biologically-relevant gene expression data from DBS using RNA-seq.

2. Methods

2.1. DBS samples

This study does not constitute a health-related research project as defined by the Danish “Act on Research Ethics Review of Health Research Projects”. It is considered a developmental project for the Newborn Screening Program instead which does not require a separate approval from the Committees on Biomedical Research Ethics for the Capital Region of Denmark. In this study, we used a set of anonymized neonatal DBS samples that had been stripped of all information including the year of sampling. Sexes were determined using Sequenom's Sample ID panel (data not included). Sample names are PKU_M_0 to PKU_M_4 for five male DBS and PKU_F_0 to PKU_F_4 for five female DBS.

2.2. RNA isolation, library preparation and sequencing

Total RNA from two 3.2 mm DBS punches was extracted, purified and concentrated using Illustra RNAspin mini (GE Healthcare) and then subjected to DNase treatment to eliminate DNA contamination. Samples were concentrated using RNA Clean & Concentrator (Zymo Research). Sequencing libraries were prepared according to the standard protocol for the Stranded TruSeq RiboZero-Globin kit (Illumina) which includes Globin and rRNA depletion. RNA was fragmented for 4 min at 94 °C and each library was quantified using the KAPA library quantification kit (KAPA). Libraries were pooled before sequencing with a NextSeq-500 sequencer using NextSeq Control Software v1.3.0 (Illumina) on a high output flow cell (v1) with paired end reads of 76 bp (Illumina). Following sequencing, data was de-multiplexed and converted to FASTQ using bcl2fastq v2.17.1.14 (Illumina).

2.3. Data analysis

The FASTQ files were processed with the settings described by Trepnell [28] using Tophat v2.0.13 and Cufflinks v2.2.5. Filtering of genes and visualization of results was performed in cummerBund 2.8.2 [28] using the commands described by Trepnell [28] within the supplementary information for that paper. The reference genome and gene annotations used was UCSC HG19 prepackaged and downloaded via iGenomes (Illumina, downloaded June 2014).

3. Results

3.1. Library preparation and sequencing statistics

More than 325 million reads were generated from the 10 samples. The average reads per sample was 32.7 million ± 18.3 million, with 34.4 million ± 18.7 million for females and 31.1 million ± 20.0 million for males. The read length was 32–75 bp with an average of 74 bp, 91.5% of bases exceeded Q30. Detailed information of each sample's preparation and sequencing is shown in Table 1. Variation was observed in the dynamic range of transcript intensities, i.e. fragments per kilobase of transcript per million mapped reads (FPKM). Variance in FPKM was more profound across samples than between the sexes (Supplementary Fig. 1).

Table 1.

RNA input and RNA-seq data of 10 dB (5 females and 5 males).

Sample ID	Input RNA (ng)	Read length range (bp)	Average read length (bp)	Raw bases Q10 + (bp)	Raw bases Q20 + (bp)	Raw bases Q30 + (bp)	Read count	Total sequence generated (bp)
PKU_F_0	314	32–75	74.51	99.99	99.75	93.18	41,164,517	3,067,015,571
PKU_F_1	70	32–75	74.48	99.99	99.52	93.64	49,891,753	3,715,888,452
PKU_F_2	40	32–75	74.57	99.98	99.74	93.99	47,901,645	3,571,868,192
PKU_F_3	40	35–75	74.41	99.87	99.42	89.84	28,409,088	2,113,940,315
PKU_F_4	< 0.2	35–75	63.65	76.11	51.85	44.73	4,520,783	287,745,978
PKU_M_0	60	32–75	74.48	99.99	99.68	93.28	45,422,192	3,382,887,870
PKU_M_1	500	32–75	74.51	99.99	99.81	94.51	19,988,616	1,489,344,449
PKU_M_2	1000	35–75	74.44	99.88	99.44	88.95	58,476,079	4,352,678,456
PKU_M_3	1000	35–75	74.54	99.96	99.48	89.81	20,580,446	1,533,986,335
PKU_M_4	1000	35–75	74.54	99.97	99.48	89.82	10,949,721	816,165,766

Open in a new tab

Supplementary Fig. 1 — A) Group level FPKM box plots. B) Sample level FPKM box plots. C) Group level density plot. D) Sample level density plot. Variation exists within the generated data set especially at the level ofindividual samples. Looking at the group level performance is homogenous with average read counts of 34.4 ± 18.7 million and 31.1 ± 20.0 million (Mean ± SD), respectively for female and male.

3.2. Transcript detection

To evaluate transcript detection rates, we limit the read alignment to redundant annotated features (transcripts and non-coding RNA). In total, our reference contained 26,799 features. An average of 64% of all features were detected (SD = 15%, range 32–81%) without FPKM cutoff while the number of detectable features dropped to 18% (SD = 7%, range 3–26%) when FPKM ≥ 1 was applied.

To test for potential outliers, dimensional reduction analysis with Multi-Dimensional Scaling (MDS) and Principle Component Analysis (PCA) were used. Both MDS and PCA exhibited two same potential outliers PKU_M_3 and PKU_F_1, with MDS showed a potential third outlier PKU_M_0 (Supplementary Fig. 2). We did not exclude the outliers from subsequent analysis of differential expression analysis based on sexes.

Supplementary Fig. 2 — Based on the gene level data we calculated A) Multi-Dimensional Scaling B) Principle component analysis. Both dimensional reductions agree that PKU_M_3 and PKU_F_1 are outliers. PKU_M_0 is an outlayer but unique to A.

3.3. Differentiation of sex-specific gene expression

To verify whether DBS can reveal biologically-relevant gene expression using RNA-seq, we evaluated a gene set (19 sex chromosome-linked genes; 21 features) identified by the Genotype-Tissue Expression (GTEx) consortium [29] that are differentially-expressed between male and female in blood. Hierarchical clustering from FPKM showed complete separation of males from females (Fig. 1A). Gene expression heat map showed female samples expressed only X chromosome-linked genes (XIST and TXLNG) and none of Y chromosome-linked genes while male samples expressed > 50% of Y chromosome-linked genes (Fig. 1B). FPKM values for each sex-specific gene are shown in Supplementary Table 1.

Fig. 1 — A) Hierarchical clustering of FPKM values based on differentially-expressed genes between males and females in blood genes identified by the GTEx consortium. B) Heatmap of FPKM values based on differentially-expressed genes between males and females in blood genes identified by the GTEx consortium.

3.4. Effect of globin depletion

In this study, although the exact efficiency of globin depletion is unquantifiable, our results showed that globin did not represent the majority of transcripts. To access the effect of globin depletion on RNA-seq data, FPKM values of eight hemoglobin transcripts (HBBP1, HBG1, HBG2, HBE1, HBZ, HBM, HBA1 and HBQ1) were used to create hierarchical clustering and heatmaps. PKU_M_1 has higher FPKM values for HBG1 and HBG2 when compared with the rest of the samples. Interestingly, by using the entire hemoglobin gene set in the analysis, female samples can be discriminated from male (Fig. 2A and B). However, when HBA1 is removed from the analysis, sexes of the samples could not be differentiated (Fig. 2C and D). Analysis using the full set of transcripts separated the three previously identified outliers into another cluster (Fig. 2E). FPKM values for hemoglobin transcripts are shown in Supplementary Table 2.

Fig. 2 — Hierarchical clustering (HC) and heat maps (HM) based on a gene set enrichment of hemoglobin species. A) HC of all the identified blood genes. B) HM of all the identified blood genes. C) HC of the identified blood genes, excluding *HBA1*. D) HM of all the identified blood genes, excluding *HBA1*. E) HC based on the entire gene set (unfiltered).

4. Discussion

Our results show that we can perform RNA-seq on archived DBS samples, despite having some outliers in the dimensional reduction analysis. Sufficient data can be generated for hypothesis driven research such as gender differentiation shown in this study (gender were correctly called 10 out of 10). Thus, it is feasible to perform RNA-seq on neonatal DBS for transcriptome study shortly after birth. While some biological questions can be answered with DNA sequencing, other inquiries such as pathway analysis for specific disease as demonstrated by Ho et al. in cerebral palsy shows gene expression studies n DBS can contribute to new knowledge [3].

RNA input for our library preparation was restricted to two 3.2 mm DBS punches for each sample. This imposed limitation is not an arbitrary decision as it reflects what is typically granted by the DNSB steering committee. The steering committee sets this limitation in order to ensure sample availability for the primary purpose of diagnostics, but also availability for future research projects. The amount of total RNA from two punches of 3.2 mm DBS ranges from < 0.2 ng to 1 μg while the Ribo-Zero Globin reduction kit calls for 100 ng to 1 μg from either fragmented or intact total RNA input to enhance transcript coverage. Consequently, some of the preparations yielded less than the required library output before sample pooling. In cases with low library output, we used all available material and accepted a reduction in the expected read count. Thus, the RNA-seq read counts varied among individual samples. However, the read count difference is negligible when comparing group averages, as shown by 34 million and 31 million reads for females and males respectively.

Here, we detected an average of 65% annotated features (genes and non-coding RNA) at an average reads count of 32.7 million in 10 DBS samples. With FPKM ≥ 1, the number of detectable features was reduced to 18% compared with 33% in a previous study using globin reduction in human blood with ~ 30 M mapped reads [21]. A study on mouse Th2 cells reported ~ 70% detected genes at ~ 25 million reads when there is no FPKM cutoff [30]. This suggests that the number of annotated features depends on several factors such as read counts, sample types, and types of organism.

As HbA and HbF are gene complexes, we are unable to quantify their depletion directly. Thus, we evaluated the individual globin subunit transcripts within our dataset instead. While our experiment did not allow us to assess the effectiveness of the depletion directly, we can deduce the globin depletion effect by comparing our data with those in the public datasets. In GTEx, average FPKM values in blood for HBA1 and HBA2 are reported at 43,438 and 121,010 respectively. In our study, HBA1 has an average FPKM of 1318 and HBA2 is undetected (Supplementary Table 2). Thus, the majority of these globin species are been depleted in our study. Interestingly, hierarchical clustering of a globin gene enrichment data set showed two distinct clusters for females and males, driven by HBA1 (Fig. 2A and B). When HBA1 was removed from the analysis, hierarchical clustering and heat map showed a mixture of males and females (Fig. 2C and D). This observation is in concordance with a study which suggested that sex-associated factor may affects globin switching from HbF to HbA, showing HbA concentration differed significantly between female and male newborns [31]. A mixture of female and male clusters is also shown when using the entire gene set (unfiltered) (Fig. 2E). In addition, others have also shown that globin depletion does not introduce significant bias in biological and technical replicates [21]. This finding further support the feasibility of using DBS to detect reliable biologically-relevant gene expression.

Our experimental design was intended to be simple and hypothesis driven, since the inference we wish to draw upon in this proof-of-principle study is that if we can determine sex, then other biologically significant signals could also be detected. We chose to maintain this simplicity by strictly following the established RNA-seq data analysis pipeline published by Tranell et al. [28], thereby applying an established approach and avoid arbitrary decision bias of the findings. When enriching our dataset with sex-specific transcripts in blood determined by GTEx, we are able to distinguish females from males in the DBS samples, as presented by hierarchical clustering and heat map in Fig. 1. The separation of female from male cluster is close to 1, indicating a near complete segregation which agrees with the sex call from the Sequenome Sample ID panel (data not shown). However, not all sex-segregating transcripts are consistently expressed at detectable levels (Supplementary Table 1). The exact reason is unclear. One explanation can be the age difference between the GTEx cohort and DBS samples: DBS are taken from newborns while GTEx samples are taken from post-mortem donors aged 21–70 [32], [33]. Despite some discordance with the GTEx data, there are a number of relevant candidate genes for sex determination in neonatal samples within the GTEx sex-specific set. From Fig. 1, Fig. 2, as well as Supplementary Fig. 1, Supplementary Fig. 2, we found the expression of LINC00278, EIF1AY, UTY and RPS4Y1 unique in males, while XIST and HBA1 are unique in females. These genes are suitable to determine sex in DBS.

In addition to laboratory research applications, DBS technology can be expanded to patients of any age as a standard blood collection to simplify clinical procedures. Depending on the purpose of clinical visitation, DBS can be drawn from a finger in the patient's own home to reduce or eliminate frequent visits to the clinic. It would also minimize the discomfort to a prick in the finger compared to a standard blood draw. DBS may also facilitate clinical diagnosis. For example, fetal hypoxia that leads to more than a million stillbirths worldwide each year can be diagnosed with the presence of fetal induced hypoxia but requires multiple testing at multiple times [34], [35]. DBS can be collected at multiple times with minimum stress to measure hypoxia-induced RNA and identify critically hypoxic fetuses in utero. There is also a potential to collect blood as DBS samples in poor-resourced clinical settings as this methodology greatly simplifies the logistics and reduce processing costs. It has been recommended to perform microarray differentially gene expression analysis from DBS stored at ambient temperature within 6 years while there is no gene expression difference between samples stored at 10 years apart at − 20 °C. However, the effects of storage time and temperature on RNA-seq data obtained from DBS is currently unknown and warranted future investigation [16], [36].

In conclusion, this proof-of-concept study demonstrated the feasibility to obtain reliable gene expression from DBS using RNA-seq. It is also feasible to use globin depletion protocols intended for adult hemoglobin for fetal globin depletion in DBS. Sex can be differentiated from DBS and a combination of six transcripts: LINC00278, EIF1AY, UTY, RPS4Y1, XIST and HBA1 allowed high confidence of sex determinations. We can also determine novel biological changes such as differences in HbA levels between males and females. This newly established RNA-seq assay for DBS will enable high- throughput molecular epidemiology and diagnostic studies to better understand various perinatal diseases.

The following are the supplementary data related to this article.

Supplementary Table 1 — FPKM values per gene. This geneset has been identified by the GTEx consortium as being differentially expressed between the sexes in blood.

Supplementary Table 2 — Table of FPKM values for hemoglobin genes.

Authors' contributions

J.B.-G. designed the study and supervised the laboratory work, analyzed and interpreted the data and drafted the manuscript.

C.M.H. analyzed and interpreted the data and drafted the manuscript.

S.K.K. consulted on the RNA-extractions and participated in interpretation of the results and writing of the manuscript.

M.L.J. involved in the study design and did the majority of the laboratory work.

C.S.H. and M.B.-H. participated in the handling of the samples and the interpretation of the results.

M.V.H., M.C. and D.M.H. initiated the study, participated in its design and the interpretation of results.

All authors critically revised and approved the manuscript.

Competing financial interests

The authors declare no competing financial interests.

Acknowledgements

The authors would like to thank and acknowledge our dear deceased friend and colleague Mads Vilhelm Hollegaard. We are but dwarfs perched on the shoulders of giants and thus see further than they do. In all our work with Mads, he helped us see the furthest horizons.

References

1.Kim Y.A., Przytycka T.M. Bridging the gap between genotype and phenotype via network approaches. Front. Genet. 2012;3:227. doi: 10.3389/fgene.2012.00227. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Green N.S., Pass K.A. Neonatal screening by DNA microarray: spots and chips. Nat. Rev. Genet. 2005;6(2):147–151. doi: 10.1038/nrg1526. [DOI] [PubMed] [Google Scholar]
3.Ho N.T. Gene expression in archived newborn blood spots distinguishes infants who will later develop cerebral palsy from matched controls. Pediatr. Res. 2013;73(4 Pt 1):450–456. doi: 10.1038/pr.2012.200. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Thygesen L.C. Introduction to Danish (nationwide) registers on health and social issues: structure, access, legislation, and archiving. Scand. J. Public Health. 2011;39(7 Suppl):12–16. doi: 10.1177/1403494811399956. [DOI] [PubMed] [Google Scholar]
5.Schmidt M., Pedersen L., Sorensen H.T. The Danish Civil Registration System as a tool in epidemiology. Eur. J. Epidemiol. 2014;29(8):541–549. doi: 10.1007/s10654-014-9930-3. [DOI] [PubMed] [Google Scholar]
6.Pedersen C.B. The Danish civil registration system. A cohort of eight million persons. Dan. Med. Bull. 2006;53(4):441–449. [PubMed] [Google Scholar]
7.Norgaard-Pedersen B., Hougaard D.M. Storage policies and use of the Danish newborn screening biobank. J. Inherit. Metab. Dis. 2007;30(4):530–536. doi: 10.1007/s10545-007-0631-x. [DOI] [PubMed] [Google Scholar]
8.Hartlev M. Genomic databases and biobanks in Denmark. J. Law Med. Ethics. 2015;43(4):743–753. doi: 10.1111/jlme.12316. [DOI] [PubMed] [Google Scholar]
9.Hollegaard M.V. Genome-wide scans using archived neonatal dried blood spot samples. BMC Genomics. 2009;10:297. doi: 10.1186/1471-2164-10-297. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Hollegaard M.V. Archived neonatal dried blood spot samples can be used for accurate whole genome and exome-targeted next-generation sequencing. Mol. Genet. Metab. 2013;110(1–2):65–72. doi: 10.1016/j.ymgme.2013.06.004. [DOI] [PubMed] [Google Scholar]
11.Hollegaard M.V. DNA methylome profiling using neonatal dried blood spot samples: a proof-of-principle study. Mol. Genet. Metab. 2013;108(4):225–231. doi: 10.1016/j.ymgme.2013.01.016. [DOI] [PubMed] [Google Scholar]
12.Hollegaard M.V. Robustness of genome-wide scanning using archived dried blood spot samples as a DNA source. BMC Genet. 2011;12:58. doi: 10.1186/1471-2156-12-58. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Zhang Y.H., McCabe E.R. RNA analysis from newborn screening dried blood specimens. Hum. Genet. 1992;89(3) doi: 10.1007/BF00220548. [DOI] [PubMed] [Google Scholar]
14.Khoo S.K. Acquiring genome-wide gene expression profiles in Guthrie card blood spots using microarrays. Pathol. Int. 2011;61(1) doi: 10.1111/j.1440-1827.2010.02611.x. [DOI] [PubMed] [Google Scholar]
15.Gauffin F. Quantitation of RNA decay in dried blood spots during 20 years of storage. Clin. Chem. Lab. Med. 2009;47(12):1467–1469. doi: 10.1515/CCLM.2009.351. [DOI] [PubMed] [Google Scholar]
16.Grauholm J. Gene expression profiling of archived dried blood spot samples from the Danish Neonatal Screening Biobank. Mol. Genet. Metab. 2015;116(3):119–124. doi: 10.1016/j.ymgme.2015.06.011. [DOI] [PubMed] [Google Scholar]
17.Resau J.H. Evaluation of sex-specific gene expression in archived dried blood spots (DBS) Int. J. Mol. Sci. 2012;13(8):9599–9608. doi: 10.3390/ijms13089599. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Slaughter J. High correlations in gene expression between paired umbilical cord blood and neonatal blood of healthy newborns on Guthrie cards. J. Matern. Fetal Neonatal Med. 2013;26(18):1765–1767. doi: 10.3109/14767058.2013.804050. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Wei C. Comparison of frozen and unfrozen blood spots for gene expression studies. J. Pediatr. 2014;164(1):189–191.e1. doi: 10.1016/j.jpeds.2013.09.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Wang C. The concordance between RNA-seq and microarray data depends on chemical treatment and transcript abundance. Nat. Biotechnol. 2014;32(9):926–932. doi: 10.1038/nbt.3001. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Shin, H., et al., Variation in RNA-Seq transcriptome profiles of peripheral whole blood from healthy individuals with and without globin depletion. PLoS One, 2014. 9(3): p. e91041. [DOI] [PMC free article] [PubMed]
22.Marioni J.C. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9) doi: 10.1101/gr.079558.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.O'Neil, D., H. Glowatz, and M. Schlumpberger, Ribosomal RNA Depletion for efficient use of RNA-seq capacity. Curr. Protoc. Mol. Biol., 2013. Chapter 4: p. Unit 4 19. [DOI] [PubMed]
24.Zhao W. Comparison of RNA-Seq by poly (A) capture, ribosomal RNA depletion, and DNA microarray for expression profiling. BMC Genomics. 2014;15:419. doi: 10.1186/1471-2164-15-419. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Mastrokolias A. Increased sensitivity of next generation sequencing-based expression profiling after globin reduction in human blood RNA. BMC Genomics. 2012;13:28. doi: 10.1186/1471-2164-13-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Akinsheye I. Fetal hemoglobin in sickle cell anemia. Blood. 2011;118(1):19–27. doi: 10.1182/blood-2011-03-325258. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Adachi K. Characterization of two types of fetal hemoglobin: alpha 2G gamma 2 and alpha 2A gamma 2. Blood. 1990;75(10):2070–2075. [PubMed] [Google Scholar]
28.Trapnell C. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7(3) doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.The Genotype-Tissue Expression (GTEx) project. Nat. Genet., 2013. 45(6): p. 580–5. [DOI] [PMC free article] [PubMed]
30.Hebenstreit D. RNA sequencing reveals two major classes of gene expression levels in metazoan cells. Mol. Syst. Biol. 2011;7:497. doi: 10.1038/msb.2011.28. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Galacteros F., Guilloud-Bataille M., Feingold J. Sex, gestational age, and weight dependancy of adult hemoglobin concentration in normal newborns. Blood. 1991;78(4):1121–1124. [PubMed] [Google Scholar]
32.Consortium G.T. The genotype-tissue expression (GTEx) project. Nat. Genet. 2013;45(6):580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Keen J.C., Moore H.M. The genotype-tissue expression (GTEx) project: linking clinical data with molecular analysis to advance personalized medicine. J. Pers. Med. 2015;5(1):22–29. doi: 10.3390/jpm5010022. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Lawn J.E. Stillbirths: where? when? why? How to make the data count? Lancet. 2011;377(9775):1448–1463. doi: 10.1016/S0140-6736(10)62187-3. [DOI] [PubMed] [Google Scholar]
35.Whitehead C.L., Tong S. Measuring hypoxia-induced RNA in maternal blood: a new way to identify critically hypoxic fetuses in utero? Expert. Rev. Mol. Diagn. 2014;14(5):509–511. doi: 10.1586/14737159.2014.915749. [DOI] [PubMed] [Google Scholar]
36.Ho N.T. Effect of storage time on gene expression data acquired from unfrozen archived newborn blood spots. Mol. Genet. Metab. 2016;119(3):207–213. doi: 10.1016/j.ymgme.2016.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0005] 1.Kim Y.A., Przytycka T.M. Bridging the gap between genotype and phenotype via network approaches. Front. Genet. 2012;3:227. doi: 10.3389/fgene.2012.00227. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0010] 2.Green N.S., Pass K.A. Neonatal screening by DNA microarray: spots and chips. Nat. Rev. Genet. 2005;6(2):147–151. doi: 10.1038/nrg1526. [DOI] [PubMed] [Google Scholar]

[bb0015] 3.Ho N.T. Gene expression in archived newborn blood spots distinguishes infants who will later develop cerebral palsy from matched controls. Pediatr. Res. 2013;73(4 Pt 1):450–456. doi: 10.1038/pr.2012.200. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0020] 4.Thygesen L.C. Introduction to Danish (nationwide) registers on health and social issues: structure, access, legislation, and archiving. Scand. J. Public Health. 2011;39(7 Suppl):12–16. doi: 10.1177/1403494811399956. [DOI] [PubMed] [Google Scholar]

[bb0025] 5.Schmidt M., Pedersen L., Sorensen H.T. The Danish Civil Registration System as a tool in epidemiology. Eur. J. Epidemiol. 2014;29(8):541–549. doi: 10.1007/s10654-014-9930-3. [DOI] [PubMed] [Google Scholar]

[bb0030] 6.Pedersen C.B. The Danish civil registration system. A cohort of eight million persons. Dan. Med. Bull. 2006;53(4):441–449. [PubMed] [Google Scholar]

[bb0035] 7.Norgaard-Pedersen B., Hougaard D.M. Storage policies and use of the Danish newborn screening biobank. J. Inherit. Metab. Dis. 2007;30(4):530–536. doi: 10.1007/s10545-007-0631-x. [DOI] [PubMed] [Google Scholar]

[bb0040] 8.Hartlev M. Genomic databases and biobanks in Denmark. J. Law Med. Ethics. 2015;43(4):743–753. doi: 10.1111/jlme.12316. [DOI] [PubMed] [Google Scholar]

[bb0045] 9.Hollegaard M.V. Genome-wide scans using archived neonatal dried blood spot samples. BMC Genomics. 2009;10:297. doi: 10.1186/1471-2164-10-297. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0050] 10.Hollegaard M.V. Archived neonatal dried blood spot samples can be used for accurate whole genome and exome-targeted next-generation sequencing. Mol. Genet. Metab. 2013;110(1–2):65–72. doi: 10.1016/j.ymgme.2013.06.004. [DOI] [PubMed] [Google Scholar]

[bb0055] 11.Hollegaard M.V. DNA methylome profiling using neonatal dried blood spot samples: a proof-of-principle study. Mol. Genet. Metab. 2013;108(4):225–231. doi: 10.1016/j.ymgme.2013.01.016. [DOI] [PubMed] [Google Scholar]

[bb0060] 12.Hollegaard M.V. Robustness of genome-wide scanning using archived dried blood spot samples as a DNA source. BMC Genet. 2011;12:58. doi: 10.1186/1471-2156-12-58. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0065] 13.Zhang Y.H., McCabe E.R. RNA analysis from newborn screening dried blood specimens. Hum. Genet. 1992;89(3) doi: 10.1007/BF00220548. [DOI] [PubMed] [Google Scholar]

[bb0070] 14.Khoo S.K. Acquiring genome-wide gene expression profiles in Guthrie card blood spots using microarrays. Pathol. Int. 2011;61(1) doi: 10.1111/j.1440-1827.2010.02611.x. [DOI] [PubMed] [Google Scholar]

[bb0075] 15.Gauffin F. Quantitation of RNA decay in dried blood spots during 20 years of storage. Clin. Chem. Lab. Med. 2009;47(12):1467–1469. doi: 10.1515/CCLM.2009.351. [DOI] [PubMed] [Google Scholar]

[bb0080] 16.Grauholm J. Gene expression profiling of archived dried blood spot samples from the Danish Neonatal Screening Biobank. Mol. Genet. Metab. 2015;116(3):119–124. doi: 10.1016/j.ymgme.2015.06.011. [DOI] [PubMed] [Google Scholar]

[bb0085] 17.Resau J.H. Evaluation of sex-specific gene expression in archived dried blood spots (DBS) Int. J. Mol. Sci. 2012;13(8):9599–9608. doi: 10.3390/ijms13089599. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0090] 18.Slaughter J. High correlations in gene expression between paired umbilical cord blood and neonatal blood of healthy newborns on Guthrie cards. J. Matern. Fetal Neonatal Med. 2013;26(18):1765–1767. doi: 10.3109/14767058.2013.804050. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0095] 19.Wei C. Comparison of frozen and unfrozen blood spots for gene expression studies. J. Pediatr. 2014;164(1):189–191.e1. doi: 10.1016/j.jpeds.2013.09.025. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0100] 20.Wang C. The concordance between RNA-seq and microarray data depends on chemical treatment and transcript abundance. Nat. Biotechnol. 2014;32(9):926–932. doi: 10.1038/nbt.3001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0105] 21.Shin, H., et al., Variation in RNA-Seq transcriptome profiles of peripheral whole blood from healthy individuals with and without globin depletion. PLoS One, 2014. 9(3): p. e91041. [DOI] [PMC free article] [PubMed]

[bb0110] 22.Marioni J.C. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9) doi: 10.1101/gr.079558.108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0115] 23.O'Neil, D., H. Glowatz, and M. Schlumpberger, Ribosomal RNA Depletion for efficient use of RNA-seq capacity. Curr. Protoc. Mol. Biol., 2013. Chapter 4: p. Unit 4 19. [DOI] [PubMed]

[bb0120] 24.Zhao W. Comparison of RNA-Seq by poly (A) capture, ribosomal RNA depletion, and DNA microarray for expression profiling. BMC Genomics. 2014;15:419. doi: 10.1186/1471-2164-15-419. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0125] 25.Mastrokolias A. Increased sensitivity of next generation sequencing-based expression profiling after globin reduction in human blood RNA. BMC Genomics. 2012;13:28. doi: 10.1186/1471-2164-13-28. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0130] 26.Akinsheye I. Fetal hemoglobin in sickle cell anemia. Blood. 2011;118(1):19–27. doi: 10.1182/blood-2011-03-325258. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0135] 27.Adachi K. Characterization of two types of fetal hemoglobin: alpha 2G gamma 2 and alpha 2A gamma 2. Blood. 1990;75(10):2070–2075. [PubMed] [Google Scholar]

[bb0140] 28.Trapnell C. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7(3) doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0145] 29.The Genotype-Tissue Expression (GTEx) project. Nat. Genet., 2013. 45(6): p. 580–5. [DOI] [PMC free article] [PubMed]

[bb0150] 30.Hebenstreit D. RNA sequencing reveals two major classes of gene expression levels in metazoan cells. Mol. Syst. Biol. 2011;7:497. doi: 10.1038/msb.2011.28. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0155] 31.Galacteros F., Guilloud-Bataille M., Feingold J. Sex, gestational age, and weight dependancy of adult hemoglobin concentration in normal newborns. Blood. 1991;78(4):1121–1124. [PubMed] [Google Scholar]

[bb0160] 32.Consortium G.T. The genotype-tissue expression (GTEx) project. Nat. Genet. 2013;45(6):580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0165] 33.Keen J.C., Moore H.M. The genotype-tissue expression (GTEx) project: linking clinical data with molecular analysis to advance personalized medicine. J. Pers. Med. 2015;5(1):22–29. doi: 10.3390/jpm5010022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0170] 34.Lawn J.E. Stillbirths: where? when? why? How to make the data count? Lancet. 2011;377(9775):1448–1463. doi: 10.1016/S0140-6736(10)62187-3. [DOI] [PubMed] [Google Scholar]

[bb0175] 35.Whitehead C.L., Tong S. Measuring hypoxia-induced RNA in maternal blood: a new way to identify critically hypoxic fetuses in utero? Expert. Rev. Mol. Diagn. 2014;14(5):509–511. doi: 10.1586/14737159.2014.915749. [DOI] [PubMed] [Google Scholar]

[bb0180] 36.Ho N.T. Effect of storage time on gene expression data acquired from unfrozen archived newborn blood spots. Mol. Genet. Metab. 2016;119(3):207–213. doi: 10.1016/j.ymgme.2016.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

RNA sequencing of archived neonatal dried blood spots

Jonas Bybjerg-Grauholm

Christian Munch Hagen

Sok Kean Khoo

Maria Louise Johannesen

Christine Søholm Hansen

Marie Bækvad-Hansen

Michael Christiansen

David Michael Hougaard

Mads V Hollegaard

Abstract

1. Introduction