Abstract
Salmonella enterica is an ubiquitous pathogen throughout the world causing gastroenteritis in humans and animals. Survival of pathogenic bacteria in the external environment may be associated with the ability to overcome the stress caused by starvation. The bacterial response to starvation is well understood in laboratory cultures with a sufficiently high cell density. However, bacterial populations often have a small size when facing this challenge in natural biotopes. The aim of this work was to find out if there are differences in the transcriptomes of S. enterica depending on the factor of cell density during starvation. Here we present transcriptome data of Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S grown in carbon rich or carbon deficient medium with high or low cell density. These data will help identify genes involved in adaptation of low-density bacterial populations to starvation conditions.
Keywords: Salmonella enterica, Starvation, Low-density population, Rna-seq, Illumina
Specifications Table
| Subject | Biochemistry, Genetics and Molecular Biology: Molecular Biology |
|---|---|
| Specific subject area | Transcriptomics |
| Type of data | Charts, tables, transcriptome sequences |
| How data were acquired | High-throughput RNA-sequencing |
| Data format | Raw reads filtered and analysed with statistical tests, FASTQ |
| Parameters for data collection | Total RNA was extracted from Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S cells cultured under carbon and phosphorus starvation |
| Description of data collection | RNA from control and starving samples subjected to RNA-sequencing and transcriptome profiling with subsequent analysis |
| Data source location | Kazan Scientific Centre of RAS, Kazan, Russia. |
| Data accessibility | Cleaned FASTQ files are deposited in a public repository: Repository name: NCBI Sequence Read Archive (SRA) Data identification number: PRJNA554270 Direct URL to data: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA554270 |
Value of the data
-
1.
For the first time, transcriptome data were obtained for salmonella starved for carbon and phosphorus at high or low cell density.
-
2.
The transcriptome dataset can be used to identify S. enterica genes differentially expressed under starvation conditions at high or low bacterial population density.
-
3.
These data can help elucidate mechanisms of persistence used by the pathogenic microorganisms for survival and growth in the oligotrophic biotopes.
1. Data description
The dataset of this article provides information on raw RNA-seq reads obtained from samples of Salmonella enterica serovar Typhimurium 14028S cultures grown in a mineral medium providing carbon and phosphorus starvation, or in the medium supplemented with glucose as a carbon source. The final content of the basic elements in the mineral medium (excluding carbon and nitrogen) is contained in Table 1. Information on the sampling time and growth rate of the cultures is presented in Table 2. This table also provides the NCBI SRA accession numbers of the cleaned FASTQ files for all biological replicates. The transcriptome data obtained are summarized in Table 3. To identify coding and non-coding transcripts, they were mapped on the reference genome. Up- and down-regulated genes were counted with the Differentially Expressed Genes (DEGs) analysis of transcriptomes of the salmonella cultures starved at high or low cell density, as well as the cultures grown in the glucose rich medium (Fig. 1). Besides, we evaluated the identity of 1000 the most variable genes associated with salmonella transcriptome responses to starvation with the heat map analysis (Fig. 2).
Table 1.
Content of the basic elements in the AB mineral medium.
| Element | Concentration, mg/L |
|---|---|
| K | 92.00 |
| S | 86.90 |
| Mg | 72.10 |
| Na | 6.96 |
| Ca | 2.40 |
| Fe | 0.21 |
| P | 0.10 |
| Zn | 0.04 |
| Mn | 0.01 |
Table 2.
Characteristics of Salmonella enterica serovar Typhimurium 14028S cultures taken for transcriptome analysis.
| Biological replicates | Carbon source | Inoculation titer, CFU/ml | Duration of cultivation, hours / phase of growth or stationary state | Titer of sampled culture, CFU/ml | NCBI SRA accession number |
|---|---|---|---|---|---|
| High_4h_1 | absent | 1 × 109 | 4 growth | 8.0 × 109 | SRX7476254 |
| High_4h_2 | absent | 1 × 109 | 4 growth | 8.1 × 109 | SRX7476255 |
| High_4d_1 | absent | 1 × 109 | 96 / stationary | 5.1 × 106 | SRX7476246 |
| High_4d_2 | absent | 1 × 109 | 96 / stationary | 6.8 × 106 | SRX7476247 |
| Low_24h_1 | absent | 1 × 103 | 24 / growth | 2.2 × 105 | SRX7476244 |
| Low_24h_2 | absent | 1 × 103 | 24 / growth | 2.6 × 105 | SRX7476245 |
| Low_3d_1 | absent | 1 × 103 | 72 / stationary | 4.3 × 105 | SRX7476248 |
| Low_3d_2 | absent | 1 × 103 | 72 / stationary | 6.1 × 105 | SRX7476249 |
| Glucose_6h_1 | glucose | 1 × 103 | 6 / growth | 2.3 × 108 | SRX7476250 |
| Glucose_6h_2 | glucose | 1 × 103 | 6 / growth | 4.9 × 108 | SRX7476251 |
| Glucose_24h_1 | glucose | 1 × 103 | 24 / stationary | 2.0 × 108 | SRX7476252 |
| Glucose_24h_2 | glucose | 1 × 103 | 24 / stationary | 3.1 × 108 | SRX7476253 |
Table 3.
Number of cleaned reads and reads mapped on the reference genome.
| Library | Number of cleaned reads | Number of reads mapped on genome | % Mapped reads | Number of reads mapped on coding sequences | % Reads mapped on coding sequences | % Reads mapped on intergenic regions |
|---|---|---|---|---|---|---|
| High_4h_1 | 18,406,709 | 17,908,185 | 97.3 | 9475,008 | 51.48 | 48.52 |
| High_4h_2 | 24,679,633 | 24,254,947 | 98.3 | 13,297,127 | 53.88 | 46.12 |
| High_4d_1 | 10,203,859 | 9865,524 | 96.7 | 2477,269 | 24.28 | 75.72 |
| High_4d_2 | 9545,957 | 9257,803 | 97.0 | 2144,698 | 22.47 | 77.53 |
| Low_24h_1 | 20,478,595 | 19,684,154 | 96.1 | 14,737,915 | 71.97 | 28.03 |
| Low_24h_2 | 10,285,237 | 9705,585 | 94.4 | 6413,217 | 62.35 | 37.35 |
| Low_3d_1 | 26,260,086 | 25,040,464 | 95.4 | 8464,941 | 32.24 | 67.76 |
| Low_3d_2 | 8004,395 | 7588,638 | 94.8 | 2107,150 | 26.32 | 73.68 |
| Glucose_6h_1 | 12,440,848 | 12,193,924 | 98.6 | 10,810,167 | 87.38 | 12.62 |
| Glucose_6h_ 2 | 12,826,518 | 12,443,566 | 97.6 | 11,354,492 | 89.03 | 10.97 |
| Glucose_24h_1 | 9851,028 | 9510,553 | 96.5 | 5662,677 | 57.48 | 42.52 |
| Glucose_24h_ 2 | 12,072,222 | 11,594,716 | 96.0 | 7208,000 | 59.71 | 40.29 |
Fig. 1.
Salmonella enterica serovar Typhimurium 14028S transcriptome responses to starvation at high or low cell density. The pie charts show the number of genes with increased or decreased expression at two different time points corresponded to the periods of maximum growth or transition to the stationary phase, respectively.
Fig. 2.
Heat map analysis of 1000 of the most variable genes associated with Salmonella enterica serovar Typhimurium 14028S transcriptome responses to starvation at high or low cell density.
Hierarchical clustering was constructed based on Pearson correlation distance.
2. Experimental design, materials and methods
2.1. Strains and growth conditions
In this work Salmonella enterica subsp. enterica serovar Typhimurium (ATCC 14,028) strain was used. This strain was cultured on a Luria-Bertani (LB) agar plate (Sigma Aldrich) and a single colony was used to inoculate 10 mL of LB broth. After 12 h aerobic incubation at 37 °C the salmonella cells were collected by centrifugation, washed twice, and transferred to the mineral carbon and phosphorus deficient AB medium containing 1.0 g/L NH4Cl; 0.62 g/L MgSO4 × 7H2O; 0.15 g/L KCl; 0.013 g/L CaCl2 × 2H2O; 0.005 g/L FeSO4 × 7H2O, pH – 5.5. Previously, this medium has been used to study the starving cultures of a phytopathogenic bacterium Pectobacterium atrosepticum [1,2]. Similar nutrient limited conditions may be found in some natural water bodies and water courses. The elemental composition of the prepared AB medium was checked using an inductively coupled plasma mass spectrometer Aurora M90 (Bruker Corporation, USA). Only a small amount of phosphorus and sodium as additional components was recorded (Table 1). The cell suspensions were incubated under starvation conditions with initial population density of 103 or 109 CFU per ml in glass vials without aeration at 28 °C for 4–96 h before sampling (Table 2). The low-density cultures were also incubated in AB medium supplemented with 1% glucose.
2.2. Experiment design
In order to study the dynamics of transcriptome response during salmonella adaptation to starvation, sampling was conducted at two time points corresponded to the periods of maximum growth and transition to the stationary phase, respectively.
In our experiments the number of CFU increased after inoculation in all salmonella cultures. In starving high-density cultures, an eight-fold increase in CFU number observed over four hours was replaced by a gradual decrease, which stabilized by the fourth day of incubation at a final value of about 0.06% of the inoculation titer. Such dynamics of the salmonella culture is in a good agreement with the “altruistic” model of the bacterial stress response associated with programmed death [3]. Based on this dynamics of CFU, the starving high-density cultures were sampled after 4 h and 4 days of incubation (“High_4h” and “High_4d”, Table 2).
Low-density cultures entered the stationary phase just after the period of exponential growth was over. The control low-density cultures in carbon-supplemented AB medium (“Glucose_” in Table 2) were sampled after 6 and 24 h of incubation that corresponded to periods of the maximum growth and the transition to the stationary phase, respectively. However, the growth of experimental low-density cultures incubated in the carbon-deficient medium was significantly impaired. Thus, sampling of these cultures delayed up to 24 h and 3 days (“Low_24h” and “Low_3d”, Table 2).
Total RNA was isolated and cDNA libraries were prepared for RNA-sequencing. Directional libraries were sequenced on Illumina Hiseq 2500 in single reads. The RNA-seq raw reads were stored in FASTQ files, and further analyzed to get the clean reads.
2.3. Library construction and sequencing
Bacterial cells were harvested by filtration through nitrocellulose membranes with diameter of pores 0.22 µm (Millipore, USA). The membranes were placed into tubes containing 1 ml of Extract reagent (Evrogen, Russia) and heated at 55 °C 10 min. Then total RNA was extracted according to the manufacturer's protocol. DNA contaminants were removed using RNase-free DNase I kit (Ambion, USA). RNA integrity was checked with an Agilent 2100 bioanalyzer (Agilent,USA). rRNA depletion was performed using Ribo-Zero rRNA Removal Kit for Gram-Negative Bacteria (Illumina, USA). NEBNext Ultra Directional RNA Library Prep Kit for Illumina was used to prepare RNA-seq libraries. The resulting average size of the cDNA libraries was approximately 300 bp. The libraries were sequenced in a single lane of a flow cell on the HiSeq 2500 (Illumina) platform.
2.4. Sequence QC and filtering
A total 173,505,388 raw single-end reads 60 bp long were obtained (Table 3). Quality of the raw reads was controlled with FastQC software v. 0.11.3 [4]. Reads belonged to rRNA were removed using SortMeRNA [5]. Trimming of reads and adapters removal were performed with Trimmomatic v.0.35 [6].
2.5. Reads alignment to the reference genome and data analysis
Filtered high-quality reads were mapped onto the reference genome sequence of Salmonella enterica subsp. enterica serovar Typhimurium 14028S assembly GCA_000022165.1 [7] with TopHat 2 [8]. Before mapping the reference genome was indexed with Bowtie2 [9]. Coverage estimates and statistics of the reads mapping are presented in Table 3. Gene level count tables were obtained using the prepDE.py Python script supplemented to Stringtie tool [10]. Differential expression of genes was calculated via DESeq2 [11]. DEGs analysis was carried out for those genes with fold change (FC) 4.0 and fold discovery rate (FDR) adjusted p-value threshold of 0.05
Author's contribution and ethics statement
Natalia E. Gogoleva: Conceptualization, Investigation, Methodology, Validation, Writing - Original draft preparation, Funding acquisition. Vladimir Ya. Kataev: Investigation, Writing - Original draft preparation. Alexander S. Balkin: Software, Formal analysis, Data curation. Andrey O. Plotnikov: Writing - Original draft preparation, Review & Editing. Elena I. Shagimardanova: Resources. Anastasia M. Subbot: Investigation. Sergey V. Cherkasov: Supervision. Yuri V. Gogolev: Conceptualization, Methodology, Writing - Original draft preparation, Review & Editing, Funding acquisition.
All ethical requirements were observed in the preparation of the publication. The work was not related to the use of human objects and did not include experiments with animals.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
The work was supported by the Russian Science Foundation (project 20–14–00166 for RNA sequencing experiments). The bioinformatic part of the work was supported by the RFBR project 19–315–50031. The cultural part of the work was carried out using the equipment of the Center of Shared Scientific Equipment of the Institute for Cellular and Intracellular Symbiosis of the Ural Branch of RAS. Work of NEG and YVG was supported by government assignment to FRC Kazan Scientific Center of RAS (sampling and RNA preservation procedures).
Footnotes
Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.dib.2020.106008.
Appendix. Supplementary materials
References
- 1.Gorshkov V.Yu., Petrova O.E., Mukhametshina N.E., Ageeva M.V., Mulyukin A.L., Gogolev Yu.V. Formation of “Nonculturable” dormant forms of the phytopathogenic enterobacterium Erwinia carotovora. Microbiology. 2009;78(5):585. doi: 10.1134/S0026261709050099. [DOI] [Google Scholar]
- 2.Gorshkov V., Kwenda S., Petrova O., Osipova E., Gogolev Y., Moleleki L.N. Global gene expression analysis of cross-protected phenotype of Pectobacterium atrosepticum. PLoS ONE. 2017;12(1) doi: 10.1371/journal.pone.0169536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Tanouchi Y., Pai A., Buchler N.E., You L. Programming stress-induced altruistic death in engineered bacteria. Mol. Syst. Biol. 2012;8:626. doi: 10.1038/msb.2012.57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Andrews S. Babraham Institute; Cambridge, United Kingdom: 2011. FastQC: a Quality Control Tool For High Throughput Sequence Data.http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed 30 April 2020) [Google Scholar]
- 5.Kopylova E., Noé L., Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28(24):3211–3217. doi: 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]
- 6.Bolger M.A., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Jarvik T., Chris S., Groisman E.A., Ochman H. Short-term signatures of evolutionary change in the Salmonella enterica serovar typhimurium 14028 genome. J. Bacteriol. 2010;192(2):560–567. doi: 10.1128/JB.01233-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Kim D., Pertea G., Trapnell C., Pimentel H., Kelley R., Salzberg S.L. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Langmead B., Salzberg S. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kovaka S., Zimin A.V., Pertea G.M., Razaghi R., Salzberg S.L., Pertea M. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019;20:278. doi: 10.1186/s13059-019-1910-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


