Skip to main content
Data in Brief logoLink to Data in Brief
. 2019 Dec 16;28:105001. doi: 10.1016/j.dib.2019.105001

Dataset for transcriptome analysis of abscisic acid degrading bacterium Novosphingobium sp. P6W

Natalia E Gogoleva a,b, Tatiana A Konnova a, Timur T Ismailov a, Alexander S Balkin c, Andrey A Belimov d, Yuri V Gogolev a,b,
PMCID: PMC6938937  PMID: 31909107

Abstract

Plant growth-promoting rhizobacteria (PGPR) improve plant productivity and stress resistance. The mechanisms involved in plant-microbe interactions include the modulation of plant hormone status. The Novosphingobium sp. strain P6W was previously described as the bacterium capable of abscisic acid (ABA) degradation, and its inoculation decreased ABA concentrations in planta. The metabolic pathway for the ABA degradation in bacteria is still unknown. Here we present transcriptome data of Novosphingobium sp. P6W grown in the medium supplemented with ABA or fructose as the carbon source. Cleaned FASTQ files for the RNA-seq libraries are deposited in the NCBI Sequence Read Archive (SRA, Identifier: SRP189498) and have been assigned BioProject accession PRJNA529223.

Keywords: Plant growth-promoting rhizobacteria (PGPR), Novosphingobium sp., RNA-seq, Illumina, Rhizosphere


Specifications Table

Subject Biology
Specific subject area Transcriptomics
Type of data Transcriptome sequences, table, figure
How data were acquired High-throughput RNA-sequencing with Illumina HiSeq 2500
Data format Clean data, FASTQ
Experimental factors Growth of soil bacteria in a minimal medium supplemented with ABA
Experimental features Datasets for bacterial cultures utilizing ABA or fructose and under carbon starvation conditions
Data source location Kazan Scientific Centre of RAS, Kazan, Russia.
Data accessibility Cleaned FASTQ files are deposited in a public repository:
Repository name: NCBI SRA
Data identification number: PRJNA529223
Direct URL to data: https://www.ncbi.nlm.nih.gov/bioproject/529223
Value of the Data
  • These datasets will be valuable to the PGPR research community for characterizing changes in rhizobacterial gene expression caused by phytohormones and depending on environmental conditions.

  • Downstream analysis will allow the identification of genes involved in bacterial ABA degradation.

  • Cleaned sequencing reads can be further processed by researchers using their own bioinformatic algorithms and analyzed together with their own data.

1. Data description

The dataset contains cleaned sequencing data obtained through the transcriptome sequencing of Novosphingobium sp. P6W grown in the medium supplemented with ABA or fructose as the sole carbon source and under carbon starvation conditions. Samples for transcriptome profiling were collected at the exponential and stationary growth phases. Cleaned FASTQ files were deposited in NCBI Sequence Read Archive and accessible through the BioProject PRJNA529223. Information about bacterial culture samples is presented in Table 1. Reads were mapped onto the reference genome sequence and the coverage data were obtained. Statistics of sequence reads and sequence coverage data are shown in Table 2. PCA plot of RNA-seq data presented in Fig. 1 demonstrates the variance between sample groups and sample replicates according to gene expression levels. Each dot in the Fig. 1 indicates particular sample.

Table 1.

Samples of the Novosphingobium sp P6W cultures.

Sample name Biological replicates Carbone source Duration of cultivation, hours Culture density, OD Accession number
ABA exponential phase ABA_1 ABA 24 0.23 SRX5577386
ABA_2 ABA 24 0.21 SRX5577385
ABA_3 ABA 24 0.21 SRX5577384
ABA_4 ABA 24 0.24 SRX5577383
ABA_5 ABA 24 0.21 SRX5577391
ABA_6 ABA 24 0.20 SRX5577381
ABA stationary phase ABA_7 ABA 48 0.55 SRX5577382
ABA_8 ABA 48 0.51 SRX5577380
Carbon starvation exponential phase NoCarbon_1 absent 24 0.13 SRX5577387
NoCarbon_2 absent 24 0.10 SRX5577392
Carbon starvation stationary phase NoCarbon_3 absent 48 0.16 SRX5577379
NoCarbon_4 absent 48 0.19 SRX5577378
Fructose exponential phase Fructose_1 fructose 18 0.25 SRX5577390
Fructose_2 fructose 18 0.28 SRX5577389
Fructose_4 fructose 18 0.25 SRX5577388

Table 2.

Cleaned reads and reads mapped on reference genome.

Library Number of cleaned reads Number of reads mapped on genome % Mapped reads
ABA_1 10,899,064 10,346,749 94.93
ABA_2 10,757,369 10,281,619 95.58
ABA_3 9,060,795 8,713,460 96.17
ABA_4 12,313,428 11,778,892 95.66
ABA_5 9,715,928 9,659,951 99.42
ABA_6 11,740,625 10,636,562 90.60
ABA_7 12,473,706 12,413,817 99.52
ABA_8 6,292,959 5,820,562 92.49
NoCarbon_1 9,325,126 9,184,277 98.49
NoCarbon_2 4,655,901 4,254,299 91.37
NoCarbon_3 6,234,953 5,123,816 82.18
NoCarbon_4 4,468,833 4,286,867 95.93
Fructose_1 12,282,002 11,014,354 89.68
Fructose_2 10,869,930 9,944,951 91.49
Fructose_4 12,513,546 10,247,348 81.89

Fig. 1.

Fig. 1

Principal component analysis (PCA) of the general transcriptome characteristics. The first principal component (component 1) accounted for 62% and the second principal component (component 2) for 11% of the total variance in the dataset. Legend description: “ABA_exp”and “ABA_Stat” – samples of cultures grown in ABA supplemented medium taken at the exponential and stationary phases respectively (see samples ABA 1–6 and ABA 7 and 8 in Table 1); “Fructose_Exp” – samples of exponential phase cultures grown in the medium supplemented with fructose (see samples Fructose 1–3 in Table 1); “NoCarbon_Exp”and “NoCarbon_Stat” – samples of cultures incubated under carbon starvation for 24 and 48 hours respectively (see samples NoCarbon 1 and 2 and NoCarbon 3 and 4 in Table 1).

2. Experimental design, materials, and methods

2.1. Bacterial strains and growth conditions

The Novosphingobium sp. P6W strain was initially isolated from the rhizosphere of rice (Oryza sativa L.) seedlings [1]. Complete genome sequencing for this strain was performed previously [2]. Bacterial cells were grown aerobically at 28 °C in a minimal medium (g L-1: MgSO4x7H2O - 0.3; NH4NO3 - 0.5; KH2PO4 - 1.36; FeCl3 - 0.002; pH 6.7) supplemented with 250 mg/L (±)-abscisic acid (Sigma) or 250 mg/L d-fructose (Sigma) as a sole carbon source.

2.2. Experiment design

To identify the genes involved in ABA metabolism, the transcriptome profiles of exponential phase cultures growing in the minimal medium supplemented with ABA or fructose were compared. To exclude genes associated with stress adaptation, samples of cultures incubated under carbon starvation conditions for 24 and 48 hours were taken as corresponding controls. It was important to obtain information about the genes that decrease activity at the substrate depletion. For this purpose, samples of cultures grown in the ABA supplemented medium at the stationary phase were also taken.

2.3. Library construction and sequencing

Bacterial cultures were fixed with an equal volume of cold RNA-stabilizing solution (19% ethanol, 1% acidic phenol, pH 5.5) on ice for 30 minutes. Cells were harvested by centrifugation and RNA isolation was performed using RNA Extract Reagent (Evrogen, Russia) according to the manufacturer's protocol. DNA contaminants were removed using RNase-free DNase I kit (Ambion, USA). The integrity of the RNA was checked by Agilent 2100 bioanalyzer (USA). For rRNA removal the Ribo-Zero kit for Gram-negative bacteria (Illumina, USA) was used.

NEBNext Ultra Directional RNA Library Prep Kit for Illumina was used to prepare RNA-seq libraries. The resulting average size of the cDNA libraries was approximately 300 bp. Libraries were sequenced using the Illumina HiSeq 2500 sequencing platform.

2.4. Sequence QC and filtering

144,262,494 reads were obtained in total with a length of 60 nucleotides (Table 1). FastQC software (Version 0.11.5) [3] was used to assess the quality of the raw Fastq files and clean reads. Raw reads were filtered using BBDuk (v. 37.23, http://jgi.doe.gov/data-and-tools/bb-tools/) to remove Illumina adapters, NEB indexes and to quality-trim right end to Q20 (ktrim = r k = 23 mink = 11 hdist = 1 tpe tbo minlen = 25 qtrim = r trimq = 20). Thereafter, the rRNA reads were eliminated by using SortMeRNA v2.1 program [4].

2.5. Reads alignment to the reference genome

The high-quality reads were mapped onto the genome sequence of the Novosphingobium sp. P6W strain (assembly: GCA_000876675.2) (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/876/675/GCF_000876675.2_ASM87667v2/GCF_000876675.2_ASM87667v2_genomic.fna.gz). HISAT2 version 2.1.0 [5] was used to build index of reference genome and align clean reads to reference genome with the following parameters: hisat2 -p --dta -x -U -S. SAM files of alignments created by HISAT2 were converted to BAM files using SAM-tools view [6]. Coverage estimates and reads mapping statistics are presented in Table 2. DESeq2 [7] was used to assess variance between sample groups and sample replicates using principle component analysis (PCA). PCA plot shown in the Fig. 1 demonstrates the overall quality of our sample collection, library preparation, and sequencing.

Acknowledgments

The work was supported by the Russian Science Foundation (project 17-14-01363 for RNA sequence experiments). The bioinformatic part of the work was supported by the RFBR project -17-04-01908. The cultural part of the work was supported by the Program of competitive growth of Kazan Federal University. The study was carried out using the equipment of the CSF-SAC FRC KSC RAS.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.dib.2019.105001.

Conflict of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A. Supplementary data

The following is the Supplementary data to this article:

Multimedia component 1
mmc1.xml (1.1KB, xml)

References

  • 1.Belimov A.A., Dodd I.C., Safronova V.I., Dumova V.A., Shaposhnikov A.I., Ladatko A.G., Davies W.J. Abscisic acid metabolizing rhizobacteria decrease ABA concentrations in planta and alter plant growth. Plant Physiol. Biochem. 2014;74:84–91. doi: 10.1016/j.plaphy.2013.10.032. [DOI] [PubMed] [Google Scholar]
  • 2.Gogoleva N.E., Nikolaichik Y.A., Ismailov T.T., Gorshkov V.Y., Safronova V.I., Belimov A.A., Gogolev Y.V. Complete genome sequence of the abscisic acid-utilizing strain Novosphingobium sp. P6W. 3 Biotech. 2019;9:1–8. doi: 10.1007/s13205-019-1625-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Andrews S. 2010. FastQC: a quality control tool for high throughput sequence data.http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ [Google Scholar]
  • 4.Kopylova E., Noé L., Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28:3211–3217. doi: 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]
  • 5.Kim D., Langmead B., Salzberg S.L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods. 2015;12:357–360. doi: 10.1038/nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., 1000 Genome Project Data Processing Subgroup The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(550):1–21. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.xml (1.1KB, xml)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES