Dataset for transcriptome analysis of Salmonella enterica subsp. enterica serovar Typhimurium strain 14028S response to starvation

Natalia E Gogoleva; Vladimir Ya Kataev; Alexander S Balkin; Andrey O Plotnikov; Elena I Shagimardanova; Anastasia M Subbot; Sergey V Cherkasov; Yuri V Gogolev

doi:10.1016/j.dib.2020.106008

. 2020 Jul 7;31:106008. doi: 10.1016/j.dib.2020.106008

Dataset for transcriptome analysis of Salmonella enterica subsp. enterica serovar Typhimurium strain 14028S response to starvation

Natalia E Gogoleva ^a,^b, Vladimir Ya Kataev ^b, Alexander S Balkin ^b, Andrey O Plotnikov ^b,^⁎, Elena I Shagimardanova ^c, Anastasia M Subbot ^d, Sergey V Cherkasov ^b, Yuri V Gogolev ^a,^c

PMCID: PMC7364113 PMID: 32695865

Abstract

Salmonella enterica is an ubiquitous pathogen throughout the world causing gastroenteritis in humans and animals. Survival of pathogenic bacteria in the external environment may be associated with the ability to overcome the stress caused by starvation. The bacterial response to starvation is well understood in laboratory cultures with a sufficiently high cell density. However, bacterial populations often have a small size when facing this challenge in natural biotopes. The aim of this work was to find out if there are differences in the transcriptomes of S. enterica depending on the factor of cell density during starvation. Here we present transcriptome data of Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S grown in carbon rich or carbon deficient medium with high or low cell density. These data will help identify genes involved in adaptation of low-density bacterial populations to starvation conditions.

Keywords: Salmonella enterica, Starvation, Low-density population, Rna-seq, Illumina

Specifications Table

Subject	Biochemistry, Genetics and Molecular Biology: Molecular Biology
Specific subject area	Transcriptomics
Type of data	Charts, tables, transcriptome sequences
How data were acquired	High-throughput RNA-sequencing
Data format	Raw reads filtered and analysed with statistical tests, FASTQ
Parameters for data collection	Total RNA was extracted from Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S cells cultured under carbon and phosphorus starvation
Description of data collection	RNA from control and starving samples subjected to RNA-sequencing and transcriptome profiling with subsequent analysis
Data source location	Kazan Scientific Centre of RAS, Kazan, Russia.
Data accessibility	Cleaned FASTQ files are deposited in a public repository: Repository name: NCBI Sequence Read Archive (SRA) Data identification number: PRJNA554270 Direct URL to data: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA554270

Open in a new tab

Value of the data

1.
For the first time, transcriptome data were obtained for salmonella starved for carbon and phosphorus at high or low cell density.
2.
The transcriptome dataset can be used to identify S. enterica genes differentially expressed under starvation conditions at high or low bacterial population density.
3.
These data can help elucidate mechanisms of persistence used by the pathogenic microorganisms for survival and growth in the oligotrophic biotopes.

1. Data description

The dataset of this article provides information on raw RNA-seq reads obtained from samples of Salmonella enterica serovar Typhimurium 14028S cultures grown in a mineral medium providing carbon and phosphorus starvation, or in the medium supplemented with glucose as a carbon source. The final content of the basic elements in the mineral medium (excluding carbon and nitrogen) is contained in Table 1. Information on the sampling time and growth rate of the cultures is presented in Table 2. This table also provides the NCBI SRA accession numbers of the cleaned FASTQ files for all biological replicates. The transcriptome data obtained are summarized in Table 3. To identify coding and non-coding transcripts, they were mapped on the reference genome. Up- and down-regulated genes were counted with the Differentially Expressed Genes (DEGs) analysis of transcriptomes of the salmonella cultures starved at high or low cell density, as well as the cultures grown in the glucose rich medium (Fig. 1). Besides, we evaluated the identity of 1000 the most variable genes associated with salmonella transcriptome responses to starvation with the heat map analysis (Fig. 2).

Table 1.

Content of the basic elements in the AB mineral medium.

Element	Concentration, mg/L
K	92.00
S	86.90
Mg	72.10
Na	6.96
Ca	2.40
Fe	0.21
P	0.10
Zn	0.04
Mn	0.01

Open in a new tab

Table 2.

Characteristics of Salmonella enterica serovar Typhimurium 14028S cultures taken for transcriptome analysis.

Biological replicates	Carbon source	Inoculation titer, CFU/ml	Duration of cultivation, hours / phase of growth or stationary state	Titer of sampled culture, CFU/ml	NCBI SRA accession number
High_4h_1	absent	1 × 10⁹	4 growth	8.0 × 10⁹	SRX7476254
High_4h_2	absent	1 × 10⁹	4 growth	8.1 × 10⁹	SRX7476255
High_4d_1	absent	1 × 10⁹	96 / stationary	5.1 × 10⁶	SRX7476246
High_4d_2	absent	1 × 10⁹	96 / stationary	6.8 × 10⁶	SRX7476247
Low_24h_1	absent	1 × 10³	24 / growth	2.2 × 10⁵	SRX7476244
Low_24h_2	absent	1 × 10³	24 / growth	2.6 × 10⁵	SRX7476245
Low_3d_1	absent	1 × 10³	72 / stationary	4.3 × 10⁵	SRX7476248
Low_3d_2	absent	1 × 10³	72 / stationary	6.1 × 10⁵	SRX7476249
Glucose_6h_1	glucose	1 × 10³	6 / growth	2.3 × 10⁸	SRX7476250
Glucose_6h_2	glucose	1 × 10³	6 / growth	4.9 × 10⁸	SRX7476251
Glucose_24h_1	glucose	1 × 10³	24 / stationary	2.0 × 10⁸	SRX7476252
Glucose_24h_2	glucose	1 × 10³	24 / stationary	3.1 × 10⁸	SRX7476253

Open in a new tab

Table 3.

Number of cleaned reads and reads mapped on the reference genome.

Library	Number of cleaned reads	Number of reads mapped on genome	% Mapped reads	Number of reads mapped on coding sequences	% Reads mapped on coding sequences	% Reads mapped on intergenic regions
High_4h_1	18,406,709	17,908,185	97.3	9475,008	51.48	48.52
High_4h_2	24,679,633	24,254,947	98.3	13,297,127	53.88	46.12
High_4d_1	10,203,859	9865,524	96.7	2477,269	24.28	75.72
High_4d_2	9545,957	9257,803	97.0	2144,698	22.47	77.53
Low_24h_1	20,478,595	19,684,154	96.1	14,737,915	71.97	28.03
Low_24h_2	10,285,237	9705,585	94.4	6413,217	62.35	37.35
Low_3d_1	26,260,086	25,040,464	95.4	8464,941	32.24	67.76
Low_3d_2	8004,395	7588,638	94.8	2107,150	26.32	73.68
Glucose_6h_1	12,440,848	12,193,924	98.6	10,810,167	87.38	12.62
Glucose_6h_ 2	12,826,518	12,443,566	97.6	11,354,492	89.03	10.97
Glucose_24h_1	9851,028	9510,553	96.5	5662,677	57.48	42.52
Glucose_24h_ 2	12,072,222	11,594,716	96.0	7208,000	59.71	40.29

Open in a new tab

Fig 1 — *Salmonella enterica* serovar Typhimurium 14028S transcriptome responses to starvation at high or low cell density. The pie charts show the number of genes with increased or decreased expression at two different time points corresponded to the periods of maximum growth or transition to the stationary phase, respectively.

Fig 2 — Heat map analysis of 1000 of the most variable genes associated with *Salmonella enterica* serovar Typhimurium 14028S transcriptome responses to starvation at high or low cell density.

Hierarchical clustering was constructed based on Pearson correlation distance.

2. Experimental design, materials and methods

2.1. Strains and growth conditions

In this work Salmonella enterica subsp. enterica serovar Typhimurium (ATCC 14,028) strain was used. This strain was cultured on a Luria-Bertani (LB) agar plate (Sigma Aldrich) and a single colony was used to inoculate 10 mL of LB broth. After 12 h aerobic incubation at 37 °C the salmonella cells were collected by centrifugation, washed twice, and transferred to the mineral carbon and phosphorus deficient AB medium containing 1.0 g/L NH₄Cl; 0.62 g/L MgSO₄ × 7H₂O; 0.15 g/L KCl; 0.013 g/L CaCl₂ × 2H₂O; 0.005 g/L FeSO₄ × 7H₂O, pH – 5.5. Previously, this medium has been used to study the starving cultures of a phytopathogenic bacterium Pectobacterium atrosepticum [1,2]. Similar nutrient limited conditions may be found in some natural water bodies and water courses. The elemental composition of the prepared AB medium was checked using an inductively coupled plasma mass spectrometer Aurora M90 (Bruker Corporation, USA). Only a small amount of phosphorus and sodium as additional components was recorded (Table 1). The cell suspensions were incubated under starvation conditions with initial population density of 10³ or 10⁹ CFU per ml in glass vials without aeration at 28 °C for 4–96 h before sampling (Table 2). The low-density cultures were also incubated in AB medium supplemented with 1% glucose.

2.2. Experiment design

In order to study the dynamics of transcriptome response during salmonella adaptation to starvation, sampling was conducted at two time points corresponded to the periods of maximum growth and transition to the stationary phase, respectively.

In our experiments the number of CFU increased after inoculation in all salmonella cultures. In starving high-density cultures, an eight-fold increase in CFU number observed over four hours was replaced by a gradual decrease, which stabilized by the fourth day of incubation at a final value of about 0.06% of the inoculation titer. Such dynamics of the salmonella culture is in a good agreement with the “altruistic” model of the bacterial stress response associated with programmed death [3]. Based on this dynamics of CFU, the starving high-density cultures were sampled after 4 h and 4 days of incubation (“High_4h” and “High_4d”, Table 2).

Low-density cultures entered the stationary phase just after the period of exponential growth was over. The control low-density cultures in carbon-supplemented AB medium (“Glucose_” in Table 2) were sampled after 6 and 24 h of incubation that corresponded to periods of the maximum growth and the transition to the stationary phase, respectively. However, the growth of experimental low-density cultures incubated in the carbon-deficient medium was significantly impaired. Thus, sampling of these cultures delayed up to 24 h and 3 days (“Low_24h” and “Low_3d”, Table 2).

Total RNA was isolated and cDNA libraries were prepared for RNA-sequencing. Directional libraries were sequenced on Illumina Hiseq 2500 in single reads. The RNA-seq raw reads were stored in FASTQ files, and further analyzed to get the clean reads.

2.3. Library construction and sequencing

Bacterial cells were harvested by filtration through nitrocellulose membranes with diameter of pores 0.22 µm (Millipore, USA). The membranes were placed into tubes containing 1 ml of Extract reagent (Evrogen, Russia) and heated at 55 °C 10 min. Then total RNA was extracted according to the manufacturer's protocol. DNA contaminants were removed using RNase-free DNase I kit (Ambion, USA). RNA integrity was checked with an Agilent 2100 bioanalyzer (Agilent,USA). rRNA depletion was performed using Ribo-Zero rRNA Removal Kit for Gram-Negative Bacteria (Illumina, USA). NEBNext Ultra Directional RNA Library Prep Kit for Illumina was used to prepare RNA-seq libraries. The resulting average size of the cDNA libraries was approximately 300 bp. The libraries were sequenced in a single lane of a flow cell on the HiSeq 2500 (Illumina) platform.

2.4. Sequence QC and filtering

A total 173,505,388 raw single-end reads 60 bp long were obtained (Table 3). Quality of the raw reads was controlled with FastQC software v. 0.11.3 [4]. Reads belonged to rRNA were removed using SortMeRNA [5]. Trimming of reads and adapters removal were performed with Trimmomatic v.0.35 [6].

2.5. Reads alignment to the reference genome and data analysis

Filtered high-quality reads were mapped onto the reference genome sequence of Salmonella enterica subsp. enterica serovar Typhimurium 14028S assembly GCA_000022165.1 [7] with TopHat 2 [8]. Before mapping the reference genome was indexed with Bowtie2 [9]. Coverage estimates and statistics of the reads mapping are presented in Table 3. Gene level count tables were obtained using the prepDE.py Python script supplemented to Stringtie tool [10]. Differential expression of genes was calculated via DESeq2 [11]. DEGs analysis was carried out for those genes with fold change (FC) 4.0 and fold discovery rate (FDR) adjusted p-value threshold of 0.05

Author's contribution and ethics statement

Natalia E. Gogoleva: Conceptualization, Investigation, Methodology, Validation, Writing - Original draft preparation, Funding acquisition. Vladimir Ya. Kataev: Investigation, Writing - Original draft preparation. Alexander S. Balkin: Software, Formal analysis, Data curation. Andrey O. Plotnikov: Writing - Original draft preparation, Review & Editing. Elena I. Shagimardanova: Resources. Anastasia M. Subbot: Investigation. Sergey V. Cherkasov: Supervision. Yuri V. Gogolev: Conceptualization, Methodology, Writing - Original draft preparation, Review & Editing, Funding acquisition.

All ethical requirements were observed in the preparation of the publication. The work was not related to the use of human objects and did not include experiments with animals.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The work was supported by the Russian Science Foundation (project 20–14–00166 for RNA sequencing experiments). The bioinformatic part of the work was supported by the RFBR project 19–315–50031. The cultural part of the work was carried out using the equipment of the Center of Shared Scientific Equipment of the Institute for Cellular and Intracellular Symbiosis of the Ural Branch of RAS. Work of NEG and YVG was supported by government assignment to FRC Kazan Scientific Center of RAS (sampling and RNA preservation procedures).

Footnotes

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.dib.2020.106008.

Appendix. Supplementary materials

mmc1.xml^{(1.3KB, xml)}

References

1.Gorshkov V.Yu., Petrova O.E., Mukhametshina N.E., Ageeva M.V., Mulyukin A.L., Gogolev Yu.V. Formation of “Nonculturable” dormant forms of the phytopathogenic enterobacterium Erwinia carotovora. Microbiology. 2009;78(5):585. doi: 10.1134/S0026261709050099. [DOI] [Google Scholar]
2.Gorshkov V., Kwenda S., Petrova O., Osipova E., Gogolev Y., Moleleki L.N. Global gene expression analysis of cross-protected phenotype of Pectobacterium atrosepticum. PLoS ONE. 2017;12(1) doi: 10.1371/journal.pone.0169536. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Tanouchi Y., Pai A., Buchler N.E., You L. Programming stress-induced altruistic death in engineered bacteria. Mol. Syst. Biol. 2012;8:626. doi: 10.1038/msb.2012.57. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Andrews S. Babraham Institute; Cambridge, United Kingdom: 2011. FastQC: a Quality Control Tool For High Throughput Sequence Data.http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed 30 April 2020) [Google Scholar]
5.Kopylova E., Noé L., Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28(24):3211–3217. doi: 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]
6.Bolger M.A., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Jarvik T., Chris S., Groisman E.A., Ochman H. Short-term signatures of evolutionary change in the Salmonella enterica serovar typhimurium 14028 genome. J. Bacteriol. 2010;192(2):560–567. doi: 10.1128/JB.01233-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Kim D., Pertea G., Trapnell C., Pimentel H., Kelley R., Salzberg S.L. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Langmead B., Salzberg S. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Kovaka S., Zimin A.V., Pertea G.M., Razaghi R., Salzberg S.L., Pertea M. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019;20:278. doi: 10.1186/s13059-019-1910-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.xml^{(1.3KB, xml)}

[bib0001] 1.Gorshkov V.Yu., Petrova O.E., Mukhametshina N.E., Ageeva M.V., Mulyukin A.L., Gogolev Yu.V. Formation of “Nonculturable” dormant forms of the phytopathogenic enterobacterium Erwinia carotovora. Microbiology. 2009;78(5):585. doi: 10.1134/S0026261709050099. [DOI] [Google Scholar]

[bib0002] 2.Gorshkov V., Kwenda S., Petrova O., Osipova E., Gogolev Y., Moleleki L.N. Global gene expression analysis of cross-protected phenotype of Pectobacterium atrosepticum. PLoS ONE. 2017;12(1) doi: 10.1371/journal.pone.0169536. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0003] 3.Tanouchi Y., Pai A., Buchler N.E., You L. Programming stress-induced altruistic death in engineered bacteria. Mol. Syst. Biol. 2012;8:626. doi: 10.1038/msb.2012.57. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0004] 4.Andrews S. Babraham Institute; Cambridge, United Kingdom: 2011. FastQC: a Quality Control Tool For High Throughput Sequence Data.http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed 30 April 2020) [Google Scholar]

[bib0005] 5.Kopylova E., Noé L., Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28(24):3211–3217. doi: 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]

[bib0006] 6.Bolger M.A., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0007] 7.Jarvik T., Chris S., Groisman E.A., Ochman H. Short-term signatures of evolutionary change in the Salmonella enterica serovar typhimurium 14028 genome. J. Bacteriol. 2010;192(2):560–567. doi: 10.1128/JB.01233-09. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0008] 8.Kim D., Pertea G., Trapnell C., Pimentel H., Kelley R., Salzberg S.L. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0009] 9.Langmead B., Salzberg S. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0010] 10.Kovaka S., Zimin A.V., Pertea G.M., Razaghi R., Salzberg S.L., Pertea M. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019;20:278. doi: 10.1186/s13059-019-1910-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0011] 11.Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Dataset for transcriptome analysis of Salmonella enterica subsp. enterica serovar Typhimurium strain 14028S response to starvation

Natalia E Gogoleva

Vladimir Ya Kataev

Alexander S Balkin

Andrey O Plotnikov

Elena I Shagimardanova

Anastasia M Subbot

Sergey V Cherkasov

Yuri V Gogolev

Abstract

Value of the data

1. Data description

Table 1.

Table 2.

Table 3.

Fig. 1.

Fig. 2.

2. Experimental design, materials and methods

2.1. Strains and growth conditions

2.2. Experiment design

2.3. Library construction and sequencing

2.4. Sequence QC and filtering

2.5. Reads alignment to the reference genome and data analysis

Author's contribution and ethics statement

Declaration of Competing Interest

Acknowledgments

Footnotes

Appendix. Supplementary materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Dataset for transcriptome analysis of Salmonella enterica subsp. enterica serovar Typhimurium strain 14028S response to starvation

Natalia E Gogoleva

Vladimir Ya Kataev

Alexander S Balkin

Andrey O Plotnikov

Elena I Shagimardanova

Anastasia M Subbot

Sergey V Cherkasov

Yuri V Gogolev

Abstract

Value of the data

1. Data description

Table 1.

Table 2.

Table 3.

Fig. 1.

Fig. 2.

2. Experimental design, materials and methods

2.1. Strains and growth conditions

2.2. Experiment design

2.3. Library construction and sequencing

2.4. Sequence QC and filtering

2.5. Reads alignment to the reference genome and data analysis

Author's contribution and ethics statement

Declaration of Competing Interest

Acknowledgments

Footnotes

Appendix. Supplementary materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases