Abstract
The human immune defense mechanisms and factors associated with good versus poor health outcomes following viral respiratory tract infections (VRTI), as well as correlates of protection following vaccination against respiratory viruses, remain incompletely understood. To shed further light into these mechanisms, a number of systems-scale studies have been conducted to measure transcriptional changes in blood leukocytes of either naturally or experimentally infected individuals, or in individual’s post-vaccination. Here we are making available a public repository, for research investigators for interpretation, a collection of transcriptome datasets obtained from human whole blood and peripheral blood mononuclear cells (PBMC) to investigate the transcriptional responses following viral respiratory tract infection or vaccination against respiratory viruses. In total, Thirty one31 datasets, associated to viral respiratory tract infections and their related vaccination studies, were identified and retrieved from the NCBI Gene Expression Omnibus (GEO) and loaded in a custom web application designed for interactive query and visualization of integrated large-scale data. Quality control checks, using relevant biological markers, were performed. Multiple sample groupings and rank lists were created to facilitate dataset query and interpretation. Via this interface, users can generate web links to customized graphical views, which may be subsequently inserted into manuscripts to report novel findings. The GXB tool enables browsing of a single gene across projects, providing new perspectives on the role of a given molecule across biological systems in the diagnostic and prognostic following VRTI but also in identifying new correlates of protection. This dataset collection is available at: http://vri1.gxbsidra.org/dm3/geneBrowser/list.
Keywords: Transcriptomics, Bioinformatics, Software, Viral respiratory infection, Influenza viruses, Respiratory syncytial viruses (RSV), Rhinoviruses, Whole Blood, PBMC.
Introduction
Viral respiratory tract infections (VRTI) are responsible for the majority hospitalizations among infants and the elderly. They are caused mainly by a heterogeneous group of viruses, including rhinoviruses, influenza viruses, parainfluenza viruses, respiratory syncytial virus (RSV), enteroviruses, coronaviruses, and certain strains of adenovirus 1, 2. Few antiviral therapies are currently approved and routinely used for VRTI. Most of these are specific inhibitors of influenza viruses 3. Moreover, for most respiratory viruses, there is no licensed vaccine available 4, 5, with the exception of flu vaccines for which protection generally lasts only one flu season. Consequently, clinical management of individuals with VRTI is mostly restricted to supportive care 5.
As clinical symptoms are often overlapping and are not specific for any of the viral species, it is difficult to establish a clinical diagnosis without laboratory testing 1. Furthermore, clinical manifestations of VRTI are highly variable, ranging from asymptomatic infections or illness with mild symptoms (a common cold) to clinically severe disease with life-threatening complications, such as respiratory failure and in some cases may have a fatal outcome 6. Infants, the elderly and patients with chronic lung or heart diseases in particular are at high risk 7.
Thus, there is an evident need to better understand the molecular mechanisms underlying the disease pathogenesis, progression as well as severity of, and immunity against, VRTI among humans 8. In this context, different large scale gene expression studies have been conducted using whole blood or peripheral blood mononuclear cells (PBMCs), to assess the human immune response to natural 9– 11 and experimental viral respiratory infections 12, 13; in particular, to influenza and RSV infections, and also to vaccination 14– 17.
Here, we make available, through an interactive web application, a curated collection of datasets that were obtained from pediatric and adult patients with natural VRTI, volunteers with experimental exposition to respiratory viruses and also vaccinated volunteers. Transcriptomics datasets were obtained from whole blood and PBMCs.
A total of 31 datasets were retrieved and selected from the NCBI Gene Expression Omnibus (GEO), a public repository of transcriptome profiles. The identified datasets are particularly relevant to our interest in understanding the pathobiology of VRTI and vaccination. As described in recent publications 18, 19, these datasets were loaded into a custom interactive web application, the Gene Expression Browser (GXB), which enables easy access to large datasets and interactive visualization of our dataset collection related to VRTI and vaccination against respiratory viruses. It also provides access to demographic and clinical information. Importantly, the user can customize data plots by adding multiple layers of parameters (e.g. age, gender, sample type, type of infection, type of vaccine, sample collection time), modify the sample ordering and genes, and generate links (mini URL) that can be shared via e-mail or used in publications. Therefore, we are providing here a resource enabling browsing of datasets relevant to blood transcriptional responses to VRTI and vaccination that offers a unique opportunity to identify host genes and their regulation that may be of diagnostic and/or prognostic value, or that may be tested as novel correlates of protection in subsequent studies. For example, a comparative approach of the transcriptional response signatures between experimentally infected and vaccinated individuals could be used to identify common mechanisms that define the poor health outcomes versus strong protection. The ability to pool, compare and analyze the immune responses to different infections and vaccines, in different individuals and at various age, offers a unique opportunity for a better understanding of the pathophysiology of VRTI.
Methods
A total of 120 datasets, potentially relevant to human immune responses to VRTI and vaccination, were identified in GEO using the following search query:
Homo sapiens[Organism] AND ((“respiratory syncytial virus”[DESC] OR RSV[DESC] OR metapneumovirus[DESC] OR hMPV[DESC] OR influenza[DESC] OR parainfluenza[DESC] OR rhinovirus[DESC] OR rhinoviruses[DESC] OR adenovirus[DESC] OR adenoviruses[DESC] OR HAdV[DESC] OR coronaviruses[DESC] OR HCoV[DESC]) OR (vaccine OR vaccines OR vaccination) AND (blood[DESC] OR PBMC[DESC] OR PBMCs[DESC] OR lymphocyte[DESC] OR lymphocytes[DESC] OR “B cell”[DESC] OR “B cells”[DESC] OR “plasma cells”[DESC] OR “T cell”[DESC] OR “T cells”[DESC] OR Treg[DESC] OR Tregs[DESC] OR monocyte[DESC] OR monocytes[DESC] OR dendritic[DESC] OR DC[DESC] OR DCs[DESC] OR "natural killer"[DESC] OR NK[DESC] OR NKT[DESC] OR neutrophil[DESC] OR neutrophils[DESC]) AND (“Expression profiling by array”[gdsType] OR “Expression profiling by high throughput sequencing”[gdsType]).
Most of retrieved datasets were generated from human blood and human PBMC, using Illumina or Affymetrix commercial platforms or RNA-sequencing. All the entries that were returned with this query were manually curated. The process involved reading all the descriptions available of the datasets, the study design and the GEO-linked article in pubmed. Finally, only studies using human whole blood and human PBMCs, associated with natural or experimental VRTI, or vaccination against VRTI, were retained for our dataset collection. For the retained datasets, if the platform used to generate the transcriptome profiles was not supported by GXB or if from an in vitro study, they were exlcluded from our dataset collection. Based on these criteria, 31 datasets were retained. These include datasets that were generated from whole blood or PBMCs of individuals who were either naturally (12) or experimentally infected (3) (with influenza viruses, RSV, Rhinovirus, Rotavirus) as well as from healthy, uninfected (age-matched) volunteers. The remaining 16 datasets were generated from whole blood or PBMCs of individuals who had received flu vaccines ( Figure 1). The datasets that comprise our collection are listed in Table 1.
Table 1. List of datasets constituting the collection.
*Gender information and/or Xist probe was not available. QC – quality control, ND – no data
Once the final selection had been made, each dataset was downloaded from GEO by using the SOFT file format. Then, the datasets were uploaded on the Gene Expression Browser (GXB), an interactive web application hosted on the Amazon Web Services cloud 20. Information about samples and study design were also uploaded. The available samples were put into groups based on relevant study variables and genes were ranked according to the different groups comparisons. A detailed description of the GXB software tool is available from recent publications 19– 21. This software interface allows user to easily navigate and filter the dataset collection. A web tutorial can be easily accessed online. Annotation and functionality of the web software interface were described previously by our group 18, 19, 21, and is reproduced here so that readers can use this article as a standalone resource. Briefly, datasets of interest can be quickly identified either by filtering on criteria from pre-defined sections on the left or by entering a query term in the search box at the top of the dataset navigation page. Clicking on one of the studies listed in the dataset navigation page opens a viewer designed to provide interactive browsing and graphic representations of large-scale data in an interpretable format. This interface is designed to present ranked gene lists and display expression results graphically in a context-rich environment. Selecting a gene from the rank ordered list on the left of the data-viewing interface will display its expression values graphically in the screen’s central panel. Directly above the graphical display drop down menus give users the ability: a) To change how the gene list is ranked - this allows the user to change the method used to rank the genes, or to only include genes that are selected for specific biological interest; b) To change sample grouping (Group Set button) - in some datasets, a user can switch between groups based on cell type to groups based on disease type, for example; c) To sort individual samples within a group based on associated categorical or continuous variables (e.g. gender or age); d) To toggle between the bar chart view and a box plot view, with expression values represented as a single point for each sample. Samples are split into the same groups whether displayed as a bar chart or box plot; e) To provide a color legend for the sample groups; f) To select categorical information that is to be overlaid at the bottom of the graph - for example, the user can display gender or smoking status in this manner; g) To provide a color legend for the categorical information overlaid at the bottom of the graph; h) To download the graph as a portable network graphics (png) image. Measurements have no intrinsic utility in absence of contextual information. It is this contextual information that makes the results of a study or experiment interpretable. It is therefore important to capture, integrate and display information that will give users the ability to interpret data and gain new insights from it. We have organized this information under different tabs directly above the graphical display. The tabs can be hidden to make more room for displaying the data plots, or revealed by clicking on the blue “show info panel” button on the top right corner of the display. Information about the gene selected from the list on the left side of the display is available under the “Gene” tab. Information about the study is available under the “Study” tab. Rolling the mouse cursor over a bar chart feature while displaying the “Sample” tab lists any clinical, demographic, or laboratory information available for the selected sample. Finally, the “Downloads” tab allows advanced users to retrieve the original dataset for analysis outside this tool. It also provides all available sample annotation data for use alongside the expression data in third party analysis software. Other functionalities are provided under the “Tools” drop-down menu located in the top right corner of the user interface. Some of the notable functionalities available through this menu include: a) Annotations, which provides access to all the ancillary information about the study, samples and dataset organized across different tabs; b) Cross-project view; which provides the ability for a given gene to browse through all available studies; c) Copy link, which generates a mini-URL encapsulating information about the display settings in use and that can be saved and shared with others (clicking on the envelope icon on the toolbar inserts the URL in an email message via the local email client); d) Chart options; which gives user the option to customize chart labels.
Quality Control
Quality control checks can be performed on the datasets loaded on GXB, for example by examining concordance of the gender-specific expression of the XIST gene in those datasets for which gender information was available as metadata. The XIST gene is essential for imprinted and random X-chromosome inactivation 22 and therefore, expression is expected to be high in female and low in male samples. Respective hyperlinks are found in Table 1 allow you to visualize the XIST experession based on the gender information provided with the GEO submission. Figure 2 shows XIST gene expression in a representative dataset, along with gender information available that was recorded and made available in GEO.
Data availability
All datasets included in our curated collection are also available publically via the NCBI GEO website : https://www.ncbi.nlm.nih.gov/gds/; and are referenced throughout the manuscript by their GEO accession numbers (e.g. GSE17763). Signal files and sample description files can also be downloaded from the GXB tool under the “downloads” tab.
Acknowledgments
We would like to thank all the investigators who decided to make their datasets publically available by depositing them in GEO.
Funding Statement
All the authors listed on this publication received support from the Qatar Foundation. Support for this project was provided by the Qatar National Research Fund [NPRP10-0205-170348].
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 1; peer review: 2 approved]
References
- 1. Hodinka RL: Respiratory RNA Viruses. Microbiol Spectr. 2016;4(4). 10.1128/microbiolspec.DMIH2-0028-2016 [DOI] [PubMed] [Google Scholar]
- 2. Tan WC: Viruses in asthma exacerbations. Curr Opin Pulm Med. 2005;11(1):21–6. 10.1097/01.mcp.0000146781.11092.0d [DOI] [PubMed] [Google Scholar]
- 3. Ison MG: Clinical use of approved influenza antivirals: therapy and prophylaxis. Influenza Other Respir Viruses. 2013;7 Suppl 1:7–13. 10.1111/irv.12046 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Zhu T, Zhang C, Yu L, et al. : The preventive effect of vaccine prophylaxis on severe respiratory syncytial virus infection: A meta-analysis. Virol Sin. 2015;30(5):371–8. 10.1007/s12250-015-3630-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Caballero MT, Polack FP, Stein RT: Viral bronchiolitis in young infants: new perspectives for management and treatment. J Pediatr (Rio J). 2017;93 Suppl 1:75–83. 10.1016/j.jped.2017.07.003 [DOI] [PubMed] [Google Scholar]
- 6. Jong VL, Ahout IM, van den Ham HJ, et al. : Transcriptome assists prognosis of disease severity in respiratory syncytial virus infected infants. Sci Rep. 2016;6: 36603. 10.1038/srep36603 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Ackerson B, Tseng HF, Sy LS, et al. : Severe Morbidity and Mortality Associated With Respiratory Syncytial Virus Versus Influenza Infection in Hospitalized Older Adults. Clin Infect Dis. 2018. 10.1093/cid/ciy991 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Mejias A, Dimo B, Suarez NM, et al. : Whole blood gene expression profiles to assess pathogenesis and disease severity in infants with respiratory syncytial virus infection. PLoS Med. 2013;10(11):e1001549. 10.1371/journal.pmed.1001549 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Parnell GP, McLean AS, Booth DR, et al. : A distinct influenza infection signature in the blood transcriptome of patients with severe community-acquired pneumonia. Crit Care. 2012;16(4):R157. 10.1186/cc11477 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Tsuge M, Oka T, Yamashita N, et al. : Gene expression analysis in children with complex seizures due to influenza A(H1N1)pdm09 or rotavirus gastroenteritis. J Neurovirol. 2014;20(1):73–84. 10.1007/s13365-013-0231-5 [DOI] [PubMed] [Google Scholar]
- 11. Zhai Y, Franco LM, Atmar RL, et al. : Host Transcriptional Response to Influenza and Other Acute Respiratory Viral Infections--A Prospective Cohort Study. PLoS Pathog. 2015;11(6):e1004869. 10.1371/journal.ppat.1004869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Woods CW, McClain MT, Chen M, et al. : A host transcriptional signature for presymptomatic detection of infection in humans exposed to influenza H1N1 or H3N2. PLoS One. 2013;8(1):e52198. 10.1371/journal.pone.0052198 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Zaas AK, Chen M, Varkey J, et al. : Gene expression signatures diagnose influenza and other symptomatic respiratory viral infections in humans. Cell Host Microbe. 2009;6(3):207–17. 10.1016/j.chom.2009.07.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Parnell G, McLean A, Booth D, et al. : Aberrant cell cycle and apoptotic changes characterise severe influenza A infection--a meta-analysis of genomic signatures in circulating leukocytes. PLoS One. 2011;6(3):e17186. 10.1371/journal.pone.0017186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Nakaya HI, Hagan T, Duraisingham SS, et al. : Systems Analysis of Immunity to Influenza Vaccination across Multiple Years and in Diverse Populations Reveals Shared Molecular Signatures. Immunity. 2015;43(6):1186–98. 10.1016/j.immuni.2015.11.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Nakaya HI, Wrammert J, Lee EK, et al. : Systems biology of vaccination for seasonal influenza in humans. Nat Immunol. 2011;12(8):786–95. 10.1038/ni.2067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Tsang JS, Schwartzberg PL, Kotliarov Y, et al. : Global analyses of human immune variation reveal baseline predictors of postvaccination responses. Cell. 2014;157(2):499–513. 10.1016/j.cell.2014.03.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Rinchai D, Boughorbel S, Presnell S, et al. : A curated compendium of monocyte transcriptome datasets of relevance to human monocyte immunobiology research [version 2; referees: 2 approved]. F1000Res. 2016;5:291. 10.12688/f1000research.8182.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Rahman M, Boughorbel S, Presnell S, et al. : A curated transcriptome dataset collection to investigate the functional programming of human hematopoietic cells in early life [version 1; referees: 2 approved]. F1000Res. 2016;5:414. 10.12688/f1000research.8375.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Speake C, Presnell S, Domico K, et al. : An interactive web application for the dissemination of human systems immunology data. J Transl Med. 2015;13:196. 10.1186/s12967-015-0541-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Marr AK, Boughorbel S, Presnell S, et al. : A curated transcriptome dataset collection to investigate the development and differentiation of the human placenta and its associated pathologies [version 2; referees: 2 approved]. F1000Res. 2016;5:305. 10.12688/f1000research.8210.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Cerase A, Pintacuda G, Tattermusch A, et al. : Xist localization and function: new insights from multiple levels. Genome Biol. 2015;16:166. 10.1186/s13059-015-0733-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Thakar J, Mohanty S, West AP, et al. : Aging-dependent alterations in gene expression and a mitochondrial signature of responsiveness to human influenza vaccination. Aging (Albany NY). 2015;7(1):38–52. 10.18632/aging.100720 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Obermoser G, Presnell S, Domico K, et al. : Systems scale interactive exploration reveals quantitative and qualitative differences in response to influenza and pneumococcal vaccines. Immunity. 2013;38(4):831–44. 10.1016/j.immuni.2012.12.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Bermejo-Martin JF, Martin-Loeches I, Rello J, et al. : Host adaptive immunity deficiency in severe pandemic influenza. Crit Care. 2010;14(5):R167. 10.1186/cc9259 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Brand HK, Ahout IM, Ridder D, et al. : Olfactomedin 4 Serves as a Marker for Disease Severity in Pediatric Respiratory Syncytial Virus (RSV) Infection. PLoS One. 2015;10(7):e0131927. 10.1371/journal.pone.0131927 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Hoang LT, Tolfvenstam T, Ooi EE, et al. : Patient-based transcriptome-wide analysis identify interferon and ubiquination pathways as potential predictors of influenza A disease severity. PLoS One. 2014;9(11):e111640. 10.1371/journal.pone.0111640 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Berdal JE, Mollnes TE, Wæhre T, et al. : Excessive innate immune response and mutant D222G/N in severe A (H1N1) pandemic influenza. J Infect. 2011;63(4):308–16. 10.1016/j.jinf.2011.07.004 [DOI] [PubMed] [Google Scholar]
- 29. Franco LM, Bucasas KL, Wells JM, et al. : Integrative genomic analysis of the human immune response to influenza vaccination. eLife. 2013;2:e00299. 10.7554/eLife.00299 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Cao RG, Suarez NM, Obermoser G, et al. : Differences in antibody responses between trivalent inactivated influenza vaccine and live attenuated influenza vaccine correlate with the kinetics and magnitude of interferon signaling in children. J Infect Dis. 2014;210(2):224–33. 10.1093/infdis/jiu079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Davenport EE, Antrobus RD, Lillie PJ, et al. : Transcriptomic profiling facilitates classification of response to influenza challenge. J Mol Med (Berl). 2015;93(1):105–14. 10.1007/s00109-014-1212-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Herberg JA, Kaforou M, Gormley S, et al. : Transcriptomic profiling in childhood H1N1/09 influenza reveals reduced expression of protein synthesis genes. J Infect Dis. 2013;208(10):1664–8. 10.1093/infdis/jit348 [DOI] [PMC free article] [PubMed] [Google Scholar]