Nontypeable Haemophilus influenzae (NTHi) is a major bacterial pathogen that causes multiple infections. We report high-depth-coverage RNA-Seq data from three NTHi strains, each of which encodes a different phase-variable methyltransferase.
ABSTRACT
Nontypeable Haemophilus influenzae (NTHi) is a major bacterial pathogen that causes multiple infections. We report high-depth-coverage RNA-Seq data from three NTHi strains, each of which encodes a different phase-variable methyltransferase. Major gene expression differences occur, commensurate with modA phase variation, and data will serve as an important resource for future studies.
ANNOUNCEMENT
Nontypeable Haemophilus influenzae (NTHi) is responsible for human respiratory tract infections and invasive disease (1–3). Previous work characterizing NTHi showed that phase-variable N6-adenine DNA methyltransferases (ModA) are involved in epigenetic regulation and virulence (4–7). Phase-variable methyltransferase expression leads to genome-wide methylation differences, epigenetically regulating multiple genes, namely, a phasevarion (phase-variable regulon) (8, 9). modA alleles show high variability (<25% nucleotide identity) in their central, target recognition domain (TRD) that dictates specificity (10). Different TRDs methylate different sequences and define a phasevarion (8). We have shown that ∼65% of otitis media clinical isolates possessed one of five modA alleles, modA2, 4, 5, 9, and 10 (5). We used NTHi strains C486 (modA4), 477 (modA5), and 1209 (modA9) (5) and generated variants where we locked each modA allele on (expressed) or off (not expressed) by reducing the number of AGCC[n] repeats so that genes could not phase-vary using standard genetic techniques. Using these strains, we prepared triplicate biological replicates of total RNA with TRIzol reagent (Thermo Fisher) according to the manufacturer’s instructions from mid-log cultures grown in brain-heart infusion (BHI; Oxoid, UK) broth at 37°C with 150 rpm shaking in an aerobic atmosphere.
Libraries were prepared using the Illumina Ribo-Zero Gold protocol and assessed using an Agilent Bioanalyzer DNA 1000 chip. qPCR quantification was used to assess individual libraries before normalizing (2 nM) and pooling using the Illumina cBot system with TruSeq PE Cluster kit v3 reagents. Sequencing (150 bp paired-end runs) was performed on the Illumina NovaSeq system with TruSeq SBS kit v3 reagents. The average number of sequence reads for each triplicate sample is as follows: modA4 on, 56,173,140; modA4 off, 53,157,713; modA5 on, 50,496,014; modA5 off, 53,084,278; modA9 on, 59,341,255; modA9 off, 57,150,972). Sequence quality was assessed according to the standard protocols of the Australian Genome Research Facility (AGRF). Sequence reads were aligned against the respective reference genomes (GenBank accession numbers CP007471 [C486; modA4], CP007470 [477; modA5], and JMQP00000000 [1209; modA9]) with Bowtie 2 aligner (v2.3.3.1) using standard settings. Default software settings were used unless otherwise stated. Transcripts were assembled with StringTie v1.3.3 (http://ccb.jhu.edu/software/stringtie/) with the reads alignment and reference-annotation-based assembly option. This methodology generated assemblies for known and potentially novel transcripts. The GENCODE annotation that contained both the coding and noncoding annotation for each genome was used as a guide (http://www.gencodegenes.org/). Raw gene count values were analyzed with edgeR (https://bioconductor.org/packages/release/bioc/html/edgeR.html) to compute differential gene expression values. Counts were summarized at the gene level with the featureCounts v1.5.3 utility of the subread package (http://subread.sourceforge.net/). Gene expression differences between respective modA locked on and off were expressed as logFC (log2-fold change of expression). Our analysis generated logCPM values (average log count per million for the gene across all samples), F values (quasi-likelihood F statistic for the gene across all samples), P values for the test of a statistically different expression, and the FDR (false discovery rate/adjusted P value for multiple hypothesis testing).
By setting a cutoff for a greater than 2-fold differential expression, and excluding rRNA and tRNA genes, we show that 55 genes were downregulated and 82 genes were upregulated in strain C486 when modA4 was on, 74 genes were downregulated and 14 genes were upregulated in strain 477 when modA5 was on, and 116 genes were downregulated and 76 were upregulated in strain 1209 when modA9 was on. Differentially regulated genes include those involved in central metabolism, nutrient acquisition, and stress response. We have previously identified the sites recognized by the ModA4, ModA5, and ModA9 methyltransferases (5). By combining this information with these new RNA-Seq data, we demonstrate that gene expression changes mediated by modA phase variation are complex and likely involve both the primary and secondary regulation of genes. These data will serve as a major resource for dissecting the exact molecular mechanisms of modA-mediated gene expression differences and for studying the cascade events that result in multiple gene expression differences commensurate with genome-wide methylation differences.
Data availability.
The data announced here were deposited in GEO DataSets under series GSE121835 (full data set), GSE121832 (modA4 on versus off), GSE121833 (modA5 on versus off), and GSE121834 (modA9 on versus off).
ACKNOWLEDGMENTS
We thank the Australian Genome Research Facility (AGRF) for RNA-Seq and bioinformatic analysis.
Work was funded by Australian National Health and Medical Research Council (NHMRC) project grant 1099279 to J.M.A., program grant 1071659 and principal research fellowship 1138466 to M.P.J., a Garnett Passe and Rodney Williams grant-in-aid (supplementation) to J.M.A., NIH R01 grant DC015688 to L.O.B. and M.P.J., and Australian Research Council (ARC) discovery grants 180100976 to J.M.A. and 170104691 to M.P.J.
REFERENCES
- 1.Pichichero ME, Casey JR, Hoberman A, Schwartz R. 2008. Pathogens causing recurrent and difficult-to-treat acute otitis media, 2003–2006. Clin Pediatr 47:901–906. doi: 10.1177/0009922808319966. [DOI] [PubMed] [Google Scholar]
- 2.Sethi S, Murphy TF. 2008. Infection in the pathogenesis and course of chronic obstructive pulmonary disease. N Engl J Med 359:2355–2365. doi: 10.1056/NEJMra0800353. [DOI] [PubMed] [Google Scholar]
- 3.Langereis JD, de Jonge MI. 2015. Invasive disease caused by nontypeable Haemophilus influenzae. Emerg Infect Dis 21:1711–1718. doi: 10.3201/eid2110.150004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Srikhanta YN, Maguire TL, Stacey KJ, Grimmond SM, Jennings MP. 2005. The phasevarion: a genetic system controlling coordinated, random switching of expression of multiple genes. Proc Natl Acad Sci U S A 102:5547–5551. doi: 10.1073/pnas.0501169102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Atack JM, Srikhanta YN, Fox KL, Jurcisek JA, Brockman KL, Clark TA, Boitano M, Power PM, Jen FE-C, McEwan AG, Grimmond SM, Smith AL, Barenkamp SJ, Korlach J, Bakaletz LO, Jennings MP. 2015. A biphasic epigenetic switch controls immunoevasion, virulence and niche adaptation in non-typeable Haemophilus influenzae. Nat Commun 6:7828. doi: 10.1038/ncomms8828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Brockman KL, Branstool MT, Atack JM, Robledo-Avila F, Partida-Sanchez S, Jennings MP, Bakaletz LO. 2017. The ModA2 phasevarion of nontypeable Haemophilus influenzae regulates resistance to oxidative stress and killing by human neutrophils. Sci Rep 7:3161. doi: 10.1038/s41598-017-03552-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Brockman KL, Jurcisek JA, Atack JM, Srikhanta YN, Jennings MP, Bakaletz LO. 2016. ModA2 Phasevarion switching in nontypeable Haemophilus influenzae increases the severity of experimental otitis media. J Infect Dis 214:817–824. doi: 10.1093/infdis/jiw243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Atack JM, Tan A, Bakaletz LO, Jennings MP, Seib KL. 2018. Phasevarions of bacterial pathogens: methylomics sheds new light on old enemies. Trends Microbiol 26:715–726. doi: 10.1016/j.tim.2018.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Tan A, Atack JM, Jennings MP, Seib KL. 2016. The capricious nature of bacterial pathogens: phasevarions and vaccine development. Front Immunol 7:586. doi: 10.3389/fimmu.2016.00586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gawthorne JA, Beatson SA, Srikhanta YN, Fox KL, Jennings MP. 2012. Origin of the diversity in DNA recognition domains in phasevarion associated modA genes of pathogenic Neisseria and Haemophilus influenzae. PLoS One 7:e32337. doi: 10.1371/journal.pone.0032337. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data announced here were deposited in GEO DataSets under series GSE121835 (full data set), GSE121832 (modA4 on versus off), GSE121833 (modA5 on versus off), and GSE121834 (modA9 on versus off).
