Skip to main content
Genomics Data logoLink to Genomics Data
. 2015 Apr 30;4:162–164. doi: 10.1016/j.gdata.2015.04.018

Transcriptome analysis of the aquaporin AtPIP1;2 deficient line in Arabidopsis thaliana

Anastassia Boudichevskaia a,, Marlies Heckwolf a,b,c, Lea Althaus a, Ralf Kaldenhoff a
PMCID: PMC4535893  PMID: 26484207

Abstract

Atmospheric CO2 impacts all aspects of plant development. It has changed in the past and is predicted to change further on. Studies on the response of crop plants to low and elevated CO2 concerning growth, productivity and physiological processes are intense. In contrast, the molecular mechanisms of cellular CO2 exchange are still under discussion. At the same time it becomes more and more accepted that carbon dioxide is transported across cellular biomembranes by CO2 conducting aquaporins. Our recent study (Boudichevskaia et al., 2015) demonstrates that the lack of a single gene product – aquaporin AtPIP1;2 – resulted in massive transcriptional reprogramming in Arabidopsis as a consequence of reduced tissue CO2 diffusion rates. Therefore, the transcriptome data of the aquaporin AtPIP1;2 deficient line can be used in the comparative expression analyses for better understanding the role of aquaporins with regard to CO2 and water transport in plants. Here we describe a gene expression dataset generated for three biological replicates per genotype on Affymetrix platform. We provide detailed methods and analysis on microarray data which has been deposited in Gene Expression Omnibus (GEO): GSE62167. Additionally, we provide the R code for data preprocessing and quality control.

Keywords: Affymetrix, Microarray, Arabidopsis thaliana


Specifications
Organism/cell line/tissue Arabidopsis thaliana; leaf material
Sex
Sequencer or array type GeneChip®AraGene-1.0 ST arrays (Affymetrix, USA)
Data format CEL and CHP files
Experimental factors Expression in AtPIP1;2 T-DNA insertion line compared to the wild type (N60000)
Experimental features We performed microarray analysis on control and AtPIP1;2 T-DNA insertion line to determine differentially expressed genes. The aim was to compare the transcriptomic profile of the atpip1;2-1 plants with data, available in public domains, related to water stress or low CO2 content. Biological replicates: 3 control replicates and 3 atpip1;2-1 replicates
Consent N/A
Sample source location Darmstadt, Germany

Direct link to deposited data

The deposited data can be found at: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE62167.

Experimental design, materials and methods

Arabidopsis thaliana wild type (N60000) and atpip1;2-1 (N519794) plant lines were obtained from the Nottingham Stock Centre [1] and backcrossed. The atpip1;2–1 plants were previously characterized by [2]. Plants were grown under conditions described in [3]. The leaf material of three week-old plants was used for microarray analysis. Total RNA was isolated from 5 individual plants for each microarray (15 plants from each line in total) using the Spectrum™ Plant Total RNA Kit (Sigma-Aldrich, USA) according to the manufacturer's protocol. RNA quality and quantity were determined by using a Nanodrop 2000 (Peqlab). RNA was processed for use on the GeneChip® AraGene-1.0 ST arrays (Affymetrix, USA). In total, 100 ng of total RNA was subjected to Ambion WT Expression Kit (Ambion, USA) and Affymetrix GeneChip WT Terminal Labeling Kit (Affymetrix, USA), following the manufacturer's protocols for whole genome gene expression analysis. The hybridization cocktail was prepared according to the Hybridization, Wash and Stain Kit manual (Affymetrix, USA). The GeneChip®AraGene-1.0 ST arrays (3 per genotype) were hybridized in a rotating oven at 45 °C for 17 h. Standard post-hybridization wash and stain were used (Fluidics Station 450), followed by scanning of the array strips using the Affymetrix GeneChip® Scanner 3000 7G.

Data analysis

After scanning the microarray slides were pre-analyzed using Partek's Genomic Suite's workflow designed for Exon-Arrays. All samples were normalized through the Robust Multi-array Average (RMA) algorithm described in [4]. The CEL files together with the corresponding CHP files were deposited on GEO under the accession number GSE62167. The exon signals of these files were summarized to the gene-level. The reliability of data was ensured by the use of three biological replicates per genotype in the experiment. The quality of biological replicates was determined by pairwise correlations [5]. High, positive Pearson's correlation coefficients were obtained with scores ranging from 0.98 to 0.99 [3]. The correlations were visualized using scatter plots comparisons of the normalized expression values (Fig. 1). The presence of outliers on the gene-wise level was in addition proved before statistical analyses were performed. The data points that differ by more than 50% from the mean expression value of two more uniform values of the same sample were removed from further analysis. Altogether 1190 loci (4.2% from the list of 28,387 genes) were omitted from the analysis. This procedure has been performed in R. The workflow is available in Supplementary material. The data about remaining genes can be found in Supporting file 1. Statistical analyses including t-test and false discovery rate (FDR) correction [6] are described in [3]. All genes with FDR corrected p-value < 0.05 were identified as significant (Supporting file 2). Totally, of the 27,197 genes quantified by microarray 3014 genes were scored as significantly regulated. Among these, 91 genes displayed twofold or more decrease in the expression rate and 83 genes revealed twofold or more increase in level of transcript in the AtPIP1;2 T-DNA insertion line (Table 1).

Fig. 1.

Fig. 1

XY scatter plots of transcriptome data among three biological replicates. (a) XY scatter plots of the atpip1;2-1 replicates with each other. (b) XY scatter plots of the wild type (N60000) replicates. Identical values are plotted on the blue line. The red lines indicate a log2-fold difference of 1.

Table 1.

Analysis of genes significantly regulated in the mutant line atpip1;2-1 based on the GeneChip® AraGene-1.0 ST arrays (Affymetrix). The information about the whole gene number and amount of genes after removal of outliers is represented. The data are based on three biological replicates. The differentially expressed genes were determined with the parametric t-test and further corrected for multiple comparisons (FDR with p-value cutoff of < 0.05).

Total amount of genes
Amount of differentially expressed genes
Before filtering After filtering Up-regulated
Down-regulated
Total ≥ Two-fold Total ≥ Two-fold
28,387 27,197 1550 83 1464 91

In addition to the parametric t-test, the SAM (Significant Analysis of Microarrays) approach embedded in R package was applied to the Affymetrix dataset to confirm gene expression results obtained with the t-test for highly expressed genes (fold-change of two or higher). The SAM was performed using the delta parameter of 0.6 and FDR of 9.3%. The workflow is available in Supplementary material. According to SAM, 152 genes were differentially expressed (wild type vs. atpip1;2-1 line). All genes claimed as significant based on the SAM approach were also significant according to the t-test (Supporting file 2).

Discussion

In this Data in Brief article, we describe a recently obtained dataset of a knockout-line expression profile (AtPIP1;2 deficient). In the related publication [3], these data are compared to profiles that are publicly available and obtained under low CO2 conditions [7], [8] or drought stress [9], [10]. The analyses demonstrate the relevance of the microarray approach aimed to uncover the physiological function of AtPIP1;2.

The following are the supplementary data related to this article.

Supplementary material

R flow for differential gene expression analysis.

mmc1.zip (2.1KB, zip)
Supporting file 1

Affymetrix data based on GeneChip® AraGene-1.0 ST arrays (Affymetrix, USA).

mmc2.xlsx (3.7MB, xlsx)
Supporting file 2

Differentially expressed genes in atpip1;2-1 mutant line based on the t-test and SAM.

mmc3.xlsx (373.1KB, xlsx)

Conflict of interest

The authors have no conflicts of interest.

Acknowledgments

We are grateful to Dmitri Pescianschi, Progress Inc. Michigan, US, for help in VBA macros development.

References

  • 1.Alonso J.M., Stepanova A.N., Leisse T.J., Kim C.J., Chen H., Shinn P. Genome wide insertional mutagenesis of Arabidopsis thaliana. Science. 2003;301:653–657. doi: 10.1126/science.1086391. [DOI] [PubMed] [Google Scholar]
  • 2.Heckwolf M., Pater D., Hanson D.T., Kaldenhoff R. The Arabidopsis thaliana aquaporin AtPIP1;2 is a physiologically relevant CO2 transport facilitator. Plant J. 2011;67:795–804. doi: 10.1111/j.1365-313X.2011.04634.x. [DOI] [PubMed] [Google Scholar]
  • 3.Boudichevskaia A., Heckwolf M., Kaldenhoff R. T-DNA insertion in aquaporin gene AtPIP1;2 generates transcription profiles reminiscent of a low CO2 response. Plant Cell Environ. 2015 doi: 10.1111/pce.12547. [DOI] [PubMed] [Google Scholar]
  • 4.Irizarry R.A., Bolstad B.M., Collin F., Cope L.M., Hobbs B., Speed T.P. Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003;31:e15. doi: 10.1093/nar/gng015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lohse M., Nunes-Nesi A., Krüger P., Nagel A., Hannemann J., Giorgi F.M. Robin: an intuitive wizard application for R-based expression microarray quality assessment and analysis. Plant Physiol. 2010;153:642–651. doi: 10.1104/pp.109.152553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Benjamini Y., Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B. 1995;57:289–300. [Google Scholar]
  • 7.Bläsing O.E., Gibon Y., Günther M., Höhne M., Morcuende R., Osuna D. Sugars and circadian regulation make major contributions to the global regulation of diurnal gene expression in Arabidopsis. Plant Cell Online. 2005;17:3257–3281. doi: 10.1105/tpc.105.035261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Li Y., Xu J., Haq N.U., Zhang H., Zhu X.G. Was low CO2 a driving force of C4 evolution: Arabidopsis responses to long-term low CO2 stress. J. Exp. Bot. 2014;65:3657–3667. doi: 10.1093/jxb/eru193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Huang D., Wu W., Abrams S.R., Cutler A.J. The relationship of drought-related gene expression in Arabidopsis thaliana to hormonal and environmental factors. J. Exp. Bot. 2008;59:2991–3007. doi: 10.1093/jxb/ern155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Harb A., Krishnan A., Ambavaram M.M.R., Pereira A. Molecular and physiological analysis of drought stress in Arabidopsis reveals early responses leading to acclimation in plant growth. Plant Physiol. 2010;154:1254–1271. doi: 10.1104/pp.110.161752. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

R flow for differential gene expression analysis.

mmc1.zip (2.1KB, zip)
Supporting file 1

Affymetrix data based on GeneChip® AraGene-1.0 ST arrays (Affymetrix, USA).

mmc2.xlsx (3.7MB, xlsx)
Supporting file 2

Differentially expressed genes in atpip1;2-1 mutant line based on the t-test and SAM.

mmc3.xlsx (373.1KB, xlsx)

Articles from Genomics Data are provided here courtesy of Elsevier

RESOURCES