Skip to main content
Data in Brief logoLink to Data in Brief
. 2016 Jun 8;8:520–528. doi: 10.1016/j.dib.2016.06.002

Dataset of Arabidopsis plants that overexpress FT driven by a meristem-specific KNAT1 promoter

L Duplat-Bermúdez a, R Ruiz-Medrano a, D Landsman b, L Mariño-Ramírez b, B Xoconostle-Cázares a,
PMCID: PMC4919726  PMID: 27366785

Abstract

In this dataset we integrated figures comparing leaf number and rosette diameter in three Arabidopsis FT overexpressor lines (AtFTOE) driven by KNAT1 promoter, “A member of the KNOTTED class of homeodomain proteins encoded by the STM gene of Arabidopsis[5], vs Wild Type (WT) Arabidopsis plats. Also, presented in the tables are some transcriptomic data obtained by RNA-seq Illumina HiSeq from rosette leaves of Arabidopsis plants of AtFTOE 2.1 line vs WT with accession numbers SRR2094583 and SRR2094587 for AtFTOE replicates 1–3 and AtWT for control replicates 1–2 respectively. Raw data of paired-end sequences are located in the public repository of the National Center for Biotechnology Information of the National Library of Medicine, National Institutes of Health, United States of America, Bethesda, MD, USA as Sequence Read Archive (SRA). Performed analyses of differential expression genes are visualized by Mapman and presented in figures. “Transcriptomic analysis of Arabidopsis overexpressing flowering locus T driven by a meristem-specific promoter that induces early flowering” [2], described the interpretation and discussion of the obtained data.

Keywords: Differential expression, Bioinformatics, Flowering


Specifications Table

Subject area Biology
More specific subject area Plant Sciences
Type of data Figure; tables
How data was acquired PCR final point and ddPCR (The QX100 Droplet Digital PCR (ddPCR™ System), direct count of leaves and measurement of rosette diameter with image analysis software (Imagej [http://rsb.info.nih.gov]), RNA-seq by Illumina HiSeq Sequencing,
Data format Analyzed
Experimental factors Three lines overexpressing FT (AtFTOE) and WT Arabidopsis plants
Experimental features Three lines of AtFTOE and WT Arabidopsis plants were grown on hydroponic under controlled conditions at 22 °C in short day (SD) photoperiod (8 h light /16 h dark) up to day 21 and then transferred to inductive conditions of long days (LD) photoperiod (16 h light /8 h dark)
Data source location Mexico City, Mexico and at the National Center for Biotechnology Information (NCBI)
Data accessibility Data is available with this article and at NCBI accession numbers SRR2094583 and SRR2094587

Value of the data

  • The amplification of transgene by PCR and copy variation number by ddPCR is fundamental to compare independent transgenic events.

  • ANOVA and T-Student test were employed to assess statistical significance of data (α=0.05) for leaf count and rosette diameter in order to compare three AtFTOE lines and WT.

  • Differentially expressed genes in the context of cellular functions are graphically presented by Mapman to understand the integrative changes in the metabolism.

  • Raw data from the Illumina HiSeq sequencing are available for further analyses.

1. Data

In this article are presented the data analyses (figures) from leaf count and rosette diameter for three lines AtFTOE (2.1, 3.1 an 4.3) compared with WT Arabidopsis plants (Fig. 2A and B respectively). Data corresponding to differential expression (log2 fold change) from AtFTOE 2.1 line vs WT Arabidopsis are visualized by Mapman (Fig. 3). Some data corresponding to down-regulated genes are presented in Table 3.

Fig. 2.

Fig. 2.

Interval plot from AtFTOE lines and WT for (A) Leaf count and (B) Rosette diameter at 21 days in SD conditions.

Fig. 3.

Fig. 3.

Mapman visualization showing the observed differential expression patterns, based on the Log2FCs of mRNA levels, in rosette leaves of three AtFTOE versus WT. Red color indicates up-regulated genes and blue color indicates down-regulated genes (A) Cell functions overview (B) Metabolism overview each gene is symbolized by a box.

Table 3.

List of Arabidopsis floral repressors downregulated in AtFTOE 2.1 line.

Genes Locus ID Localization Description FC
At3g27200 Plasma membrane Plastocyanin-like domain-containing protein. Cupredoxin superfamily protein. 0.11


 

 

 

 


SPL2, SQUAMOSA PROMOTER BINDING PROTEIN-LIKE 2 At5g43270 Nucleus Member of the SPL (squamosa-promoter binding protein-like) gene family, a novel gene family encoding DNA binding proteins and putative transcription factors. 0.57
AP2, APETALA 2, ATAP2, FL1, FLO2, FLORAL MUTANT 2, FLOWER AT4G36920 Nucleus Encodes a floral homeotic gene, a member of the AP2/EREBP (ethylene responsive element binding protein) class of transcription factors and is involved in the specification of floral organ identity, establishment of floral meristem identity, suppression of floral meristem indeterminacy, and development of the ovule and seed coat. AP2 also has a role in controlling seed mass. 0.46
At3g45160 Putative membrane lipoprotein. 0.47


 

 

 

 


ATMYB29 At5g07690 Nucleus Encodes a putative transcription factor. 0.56
CA1 At3g01500 Apoplast, chloroplast stroma, thylakoid membrane, plasma membrane Encodes a putative beta-carbonic anhydrase betaCA1. 0.44
At3g05730 Extracellular region Encodes a defensin-like (DEFL) family protein. 0.27
At5g05960 Extracellular region Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein. 0.23


 

 

 

 


BETA CA2 At5g14740 Apoplast, chloroplast, chloroplast stroma, thylakoid membrane, cytoplasm Encodes a beta carbonic anhydrase likely to be localized in the cytoplasm. 0.41
At3g26960 Extracellular region Pollen Ole e 1 allergen and extensin family protein. 0.37


 

 

 

 


BETA GLUCOSIDASE 33, BGLU33 At2g32860 Chloroplast Beta glucosidase 0.27
XCP2, XYLEM CYSTEINE PEPTIDASE 2 At1g20850 Chloroplast, extracellular space, lysosome Cysteine-type peptidase activity, peptidase activity 0.15
At4g30650 Cell wall, chloroplast, extracellular space, lysosome Xylem cysteine peptidase 2 (XCP2) 0.50
At1g29660 GDSL-motif esterase/acyltransferase/lipase.Enzyme group with broad substrate specificity that may catalyze acyltransfer or hydrolase reactions with lipid and non-lipid substrates. 0.24


 

 

 

 


TON1 RECRUITING MOTIF 14, TRM14 At3g61380 Nucleus, plasma membrane Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related. 0.39
ATIMD3, IMD3, IPMDH1 At1g31180 Chloroplast stroma, mitochondrion, plastid, thylakoid The AtIMD3 is one out of 3 genes encoding the enzyme 3-isopropylmalate dehydrogenase involved in leucine biosynthesis in Arabidopsis. Its subcellular location has been targeted to plastids. 0.61
OEP6, OUTER ENVELOPE PROTEIN 6 At3g63160 Chloroplast, chloroplast envelope, chloroplast outer membrane, chloroplast thylakoid membrane, thylakoid Molecular_function unknown 0.48
At1g35190 Cytoplasm 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein 0.55


 

 

 

 


AGAMOUS-LIKE 19, AGL19, GL19 At4g22950 Nucleus MADS-box protein AGL19. 0.23
At3g27200 Cupredoxin superfamily protein,electron carrier activity, copper ion binding. 0.11
At2g39850 Extracellular region Subtilisin-like serine endopeptidase family protein; metabolic process, proteolysis. 0.56
At3g53100 Extracellular region GDSL-motif esterase/acyltransferase/lipase,hydrolase activity, acting on ester bonds. 0.33
At5g44400 Cell wall, cytoplasm, plasmodesma FAD-binding Berberine family protein. 0.11


 

 

 

 


MORPHOGENESIS OF ROOT HAIR 1, MRH1 At4g18640 Chloroplast Protein serine/threonine kinase activity. 0.11
ATPAP3, PAP3, PURPLE ACID PHOSPHATASE 3 At1g14700 Extracellular region, vacuole acid phosphatase activity, metal ion binding, protein serine/threonine phosphatase activity. 0.25
At3g29030 Cell wall, extracellular region, membrane Encodes an expansin. Naming convention from the Expansin Working Group. 0.17
At1g04040 Cell wall, plant-type cell wall, plasmodesma, vacuolar membrane, vacuole HAD superfamily, subfamily IIIB acid phosphatase 0.27


 

 

 

 


TPPH, TREHALOSE-6-PHOSPHATE PHOSPHATASE H At4g39770 Cytoplasm, cytosol, nucleus Trehalose biosynthetic process. 0.61
At5g63180 Pectin lyase fold/virulence factor Pectin lyase-like superfamily protein 0.35

2. Experimental design, materials and methods

2.1. Determination of transgene (FT) and copy variation number in three AtFTOE lines

Transgene FT (560 bp) was amplified by PCR from three AtFTOE (2.1,3.1 and 4.3) lines (Fig. 1A). Droplet digital PCR (ddPCR) was employed to determine transgene (FT) copy variation number (CVN) (Fig. 1B). As template were used 2.5 ng of genomic DNA previously digested with HindIII. Droplets were generated for PCR reaction with the specific primers AtFT-qPCR (F) (5′-TCCGTTTAATAGATCAATCAC-3′), FT-qPCR (R) (5′-CCACCATAACCAAAGTATAG-3) and probe TaqMan ddPCRFT [5′FAM] TCCTGAGGTCTTCTCCACCA [3′BHQ1]. The 152 bp of PCR-amplified product Arabidopsis HMGB1 (AT3G51880) was used as an internal, reference gene. The primers used as reference were HMGB1 probe [5′HEX]AGGCACCGGCTGAGAAGCCT[3′BHQ1], HMGB1F (5′-CAGAAAGGTGGGAAAGAGGA-3′) and HMGB1-R (5′-AAGGACCCAAACAAACCAAA-3′). The HMGB1 PCR-amplified product was 96 bp. After cycling, the PCR nano droplets were counted using the droplet reader Bio-Rad QX100 system (Bio-Rad 2012).

Fig. 1.

Fig. 1.

Amplification of transgene FT and copy number Variation in three AtFTOE lines (A) 560 bp amplified fragment correspondent to the size of the FT transgene visualized in 0.8% agarose gel stained with ethidium bromide. Carril 1: 1 kb plus DNA ladder (Thermo Fisher Scientific®); Carril 2: AtFTOE 2.1; Carril 3: AtFTOE 3.1; Carril 4: AtFTOE 4.3; Carril 5: WT Copy variation number of three AtFTOE lines determined by dPCR™ System (The QX100 Droplet Digital PCR) (B) Copy variation number of three AtFTOE lines determined by dPCR™ System (The QX100 Droplet Digital PCR).

2.2. Determination of leaf number and rosette diameter in three AtFTOE and WT plants

Three AtFTOE (2.1,3.1 and 4.3) lines and WT Arabidopsis Columbia-0 ecotype were employed. Seeds were stratified, kept at 4 °C for 3 days in the dark and then germinated and grown in hydroponic system [1] at 22 °C under controlled conditions, initially in short days (8 h light, 16 h dark) and after 21 days the seedlings were transferred to long-day (16 h light, 8 h dark) photoperiod under 100–120 mmol m−2 s−1. Plants grown under these conditions were used for determining rosette leaf number (Fig. 2A) by individual counting and rosette diameter (Fig. 2B) quantified by image analyses using the software Imagej (http://rsb.info.nih.gov). Cuantitative data were statistically analyzed with ANOVA and T-Student test at (α=0.05).

2.3. Massive mRNA-sequencing by Illumina, bioinformatics analysis and data processing

For RNA-Sequencing by illumina HiSeq was used total RNA from rosette leaves of 35-day old of AtFTOE 2.1 line and Wild Type (WT) plants. RNA was extracted with the RNeasy Plant Kit (Qiagen). The RNA-seq experiments were conducted with RNA isolated from three biological replicates per AtFTOE 2.1 line and two biological replicates for WT with accession numbers SRR2094583 and SRR2094587 respectively. Illumina sequencing was performed at Otogenetics (Georgia-USA). Illumina HiSeq Sequencing with PE50 yielded 20 million reads by triplicate (Table 2). The raw data files are available at the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) accession numbers SRR2094583 and SRR2094587 for AtFTOE replicates 1–3 and AtWT for control replicates 1–2 respectively. The paired-end reads were aligned to the reference Arabidopsis genome using Tophat (version 2.0.10) [7] and Bowtie2 (version 2.1.0) [4]. The reference Arabidopsis genome and gene model annotation files (TAIR10<ftp://igenome:G3nom3s4u@ussd-ftp.illumina.com/Arabidopsis_thaliana/NCBI/TAIR10/Arabidopsis_thaliana_NCBI_TAIR10.tar.gz>) were downloaded from the Illumina iGenomes (http://support.illumina.com/sequencing/sequencing_software/igenome.html). Differential expression was determined by Cufflinks (version 2.1.1) as described by Trapnell et al. [8] and then was visualized by CummeRbound, an edgeR package [3]. To analyze the variation in expression between two replicates from WT and three replicates from AtFTOE it was calculated the absolute difference of the log2 fold change and adjusted to P-value ≤0.05.

Table 2.

RNA sequences obtained by Illumina HighSeq2000/2500 PE100 sequencer.

Sample Total bases* Total reads
WT1 2,612,645,388 24,647,598
WT2 2,770,468,788 26,136,498
FT1 2,474,087,064 23,340,444
FT2 2,455,442,088 23,164,548
FT3 2,257,321,728 21,295,488

2.4. Visualization of differential expression by Mapman

In order to visualize differential expression we used Mapman tool, mapping the Mapman databases (http://mapman.gabipd.org/web/guest/mapman; Ath_AGI_LOCUS_TAIR10_Aug2012-3.m02; [6]) using raw data of differential expression log2 fold change and adjusted to P-value ≤0.05 (Fig. 3).

2.5. Primer design

Primers design (Table 1) was performed by OligoArchitect™ Primer and Probe Design Solutions (http://www.sigmaaldrich.com/technical-documents/articles/biology/probe-design-services.html). Gene sequences were obtained from Tair database (https://www.arabidopsis.org).

Table 1.

Primers to detect differentially expressed genes.

Gene Locus ID Primer Forward 5′−3′ Primer Reverse 5′−3′
AGAMOUS-LIKE 68,/MAF5 AT5g65080 MAF5- RTSYBG (F) GGAAGAAGAAGAGTAGAGAT MAF5-RTSYBG (R) ACAGAGAATTGAGAGTTGA
Squamosa promoter-binding-like protein 4 SPL4 AT5g65080 SPL4-RTSYBG (F)GTCGGAGAGGAATCAATG SPL4-RTSYBG (R) GGCATAGGAAGTGTCATC
MADS – BOX PROTEIN SOC1 AT2g45660 SOC1- RTSYBG (F) GAAGAGAATAGAGAATGC SOC1-RTSYBG (R) AGAGAAGATGATAAGAGAA
ATSTP13, MSS1, STP13, SUGAR TRANSPORT PROTEIN 13 AT5G26340 AtSTP13-qPCR(F) ACTCCGTTGACAAAGTCGGT AtSTP13-qPCR(R) TGGCGATTACGACTTGAGAG

Acknowledgments

This work was supported by Departmental funds from CINVESTAV-IPN, by CONACyT grants nos. 156162 to RR-M and 105985 to BX-C, and a SENASICA-SAGARPA grant to BX-C and RR-M. LD-B was supported by a doctoral fellowship from CONACyT no 388937. This research was partially supported by the Intramural Research Program of the NIH, NLM, NCBI.

Footnotes

Transparency document

Transparency data associated with this article can be found in the online version at 10.1016/j.dib.2016.06.002.

Appendix A

Supplementary data associated with this article can be found in the online version at 10.1016/j.dib.2016.06.002.

Transparency document. Supplementary material

Supplementary material

Supplementary material

mmc1.pdf (346.3KB, pdf)

Appendix A. Supplementary material

Supplementary material

Supplementary material

mmc2.zip (1.1KB, zip)
Supplementary material

Supplementary material

mmc3.xlsx (9.6KB, xlsx)
Supplementary material

Supplementary material

mmc4.xls (1.2MB, xls)

References

  • 1.Conn S.J., Hocking B., Dayod M., Xu B., Athman A., Henderson S., Gilliham M. Protocol: optimising hydroponic growth systems for nutritional and physiological analysis of Arabidopsis thaliana and other plants. Plant Methods. 2013;9(1):4. doi: 10.1186/1746-4811-9-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Duplat-Bermúdez L., Ruiz-Medrano R., Landsman D., Mariño-Ramírez L., Xoconostle-Cázares Transcriptomic analysis of Arabidopsis overexpressing flowering locus T driven by a meristem-specific promoter that induces early flowering. Gene. 2016 doi: 10.1016/j.gene.2016.04.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.L. Goff, C. Trapnell, D. Kelley, CummeRbund: analysis, exploration, manipulation, and visualization of Cufflinks high-throughput sequencing data. R package version 2.8.2., 2013.
  • 4.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9(4):357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Long J., Moan E.E., Medford J., Barton K. A member of the KNOTTED class of homeodomain proteins encoded by the STM gene of Arabidopsis. Lett. Nat. 1996;379:66–69. doi: 10.1038/379066a0. [DOI] [PubMed] [Google Scholar]
  • 6.Thimm O., Bläsing O., Gibon Y., Nagel A., Meyer S., Krüger P., Selbig J., Müller L.A., Rhee S.Y., Stitt M. mapman: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J. 2004;37:914–939. doi: 10.1111/j.1365-313x.2004.02016.x. [DOI] [PubMed] [Google Scholar]
  • 7.Trapnell C., Pachter L., Salzberg S.L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25(9):1105–1111. doi: 10.1093/bioinformatics/btp120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Trapnell C., Roberts A., Goff L., Pertea G., Kim D., Kelley D.R., Pachter L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7(3):562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

Supplementary material

mmc1.pdf (346.3KB, pdf)
Supplementary material

Supplementary material

mmc2.zip (1.1KB, zip)
Supplementary material

Supplementary material

mmc3.xlsx (9.6KB, xlsx)
Supplementary material

Supplementary material

mmc4.xls (1.2MB, xls)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES