Skip to main content
Data in Brief logoLink to Data in Brief
. 2016 Feb 20;7:411–417. doi: 10.1016/j.dib.2016.02.032

Data for global lysine-acetylation analysis in rice (Oryza sativa)

Yehui Xiong a, Kai Zhang a, Zhongyi Cheng b, Guo-Liang Wang a,c,⁎⁎, Wende Liu a,
PMCID: PMC4781998  PMID: 26977447

Abstract

Rice is one of the most important crops for human consumption and is a staple food for over half of the world׳s population (Yu et al., 2002) [1]. A systematic identification of the lysine acetylome was performed by our research (Xiong et al., 2016) [2]. Rice plant samples were collected from 5 weeks old seedlings (Oryza sativa, Nipponbare). After the trypsin digestion and immunoaffinity precipitation, LC–MS/MS approach was used to identify acetylated peptides. After the collected MS/MS data procession and GO annotation, the InterProScan was used to annotate protein domain. Subcellular localization of the identified acetylated proteins was predicted by WoLF PSORT. The KEGG pathway database was used to annotate identified acetylated protein interactions, reactions, and relations. The data, supplied in this article, are related to “A comprehensive catalog of the lysine-acetylation targets in rice (O. sativa) based on proteomic analyses” by Xiong et al. (2016) [2].

Keywords: Rice, Acetylation, LC–MS/MS


Specifications Table

Subject area Plant Biology
More specific subject area Rice modification proteomics
Type of data MS data, Tables, Figures
How data was acquired Data-dependent acquisition of affinity purification and LC–MS/MS.
Data format Raw and analyzed.
Experimental factors Plants were harvested and washed with double distilled water, immediately frozen in liquid nitrogen and stored at −80 °C until protein extraction and following experiment.
Experimental features The protein were extracted and digested with trypsin, peptides were enriched by antibody immunoaffinity and subjected to the LC–MS/MS analysis.
Data source location State Key Laboratory for Biology of Plant Diseases and Insect Pests, Beijing, China
Data accessibility 1. Raw data is available on PRIDE database, accession ID number is PXD002291;
2. Analyzed data are supplied with this article.

Value of the data

  • The data are useful for model plant/crop biology research and protein modification research in plant or animal.

  • We used agarose conjugated with our high affinity pan anti-acetyl-lysine antibody for immunoaffinity enrichment, which improve the recognition range and efficiency.

  • The number of lysine-acetylated sites in this data was 23-times greater and the number of acetylation proteins was 16-times greater than in the other previous report.

1. Data

As an important model plant for biological research, rice also is one of the most important crops and food for human consumption [1]. We performed a systematic identification of the lysine acetylome in rice, recently [2].

Fig. 1 show the experimental and bioinformatics workflow of the rice lysine acetylome. A total of 1337 lysine-acetylation (Kac) sites on 716 proteins were identified, and data is presented in Supplementary Table S1. The raw data were available on PRIDE database with accession number PXD002291. Most of the peptides contain 7–15 amino acids (Fig. 2A). Of the 716 proteins, 435, 143, and 55 contained 1, 2, and 3 Kac sites, respectively (Fig. 2B). The information on the Kac peptides and proteins were summarized in Table S1. Kac motif is listed in Supplementary Table S2. To better understand the global cellular functions of the acetylated proteins in rice, a GO functional classification was conducted in terms of their molecular functions, cellular components, and biological processes, the GO data is listed in Supplementary Table S3. KEGG also conducted and the detail data listed in Supplementary Table S4. We also used Cytoscape software to generate an interaction network for all of the acetylated proteins. When the threshold confidence score was set at 0.70, associations were detected among 347 proteins (Supplementary Table S5). As shown in Fig. 3, this network contained a number of acetylated proteins at key hubs. The detailed data were discussed by Xiong et al. [2]. The raw data about this research is available on PRIDE database, accession number is PXD002291.

Fig. 1.

Fig. 1.

Experimental and bioinformatics workflow of the rice lysine acetylome analysis.

Fig. 2.

Fig. 2.

Distribution of acetylated peptides based on their (A) length and on their (B) number of acetylation peptides.

Fig. 3.

Fig. 3.

Interaction networks of acetylated proteins in rice.

2. Experimental design, materials and methods

2.1. Rice plants and growth conditions

Rice (Oryza sativa, Nipponbare) seeds were sterilized with 75% ethanol, 1 min; 40% NaClO, 30 min; followed with double distilled water washing 3 times; and transferred to half-strength Murashige and Skoog medium [3]. The rice seedlings were then transplanted into soil and kept in a growth chamber at 25 °C with 80% relative humidity and a 16-h-day/8-h-night cycle. After 5 weeks growth, the whole rice plants including leaves, stems, and roots (two independent, whole rice plant samples) were collected and washed with double distilled water for further LC–MS/MS analysis.

2.2. Protein extraction and peptide digestion

The protein extraction and peptide digestion procedures were modified from a previous report [4]. Briefly, the whole rice seedlings were ground in liquid nitrogen and then supplemented with 8 M urea containing 1 mM DTT, 2 mM EDTA, 1% Protease Inhibitor Cocktail Set III (Calbiochem), and HDAC inhibitor (50 mM sodium butyrate, 30 mM nicotinamide, 3 uM trichostatin A). The sample was then sonicated with 12 short bursts, 10 s per burst and separated by 30-s cooling intervals. The preparations were centrifuged at 4 °C for 10 min at 20,000g to remove unbroken cells and debris, the 2-D Quant kit (GE Healthcare) was used to determine the protein concentration of the preparation. Proteins were then precipitated with 15% trichloroacetic acid overnight at 4 °C. The precipitate was washed three times with ice-cold acetone and air-dried at 4 °C.

For protein digestion, 100 mM NH4HCO3 was added, which was followed by the addition of trypsin (Promega) at an enzyme-to-substrate ratio of 1:50; the digestion was performed at 37 °C for 12 h. Reduction and alkylation were performed by adding 5 mM dithiothreitol (DTT) at 56 °C for 45 min and 15 mM iodoacetamide (IAM) for 30 min at room temperature in darkness. The reaction was quenched with 30 mM cysteine at room temperature for 20 min. Then, additional trypsin at an enzyme-to-substrate ratio of 1:100 was added, and the preparation was incubated at 37 °C for 4 h to ensure complete digestion. The final digest was lyophilized in a SpeedVac and stored at −80 °C.

2.3. Immunoaffinity precipitation

The Kac peptides immunoaffinity enrichment was perform as described [4], [5]. Briefly, the tryptic peptides for Kac enrichment were re-dissolved in NETN buffer (100 mM NaCl, 1 mM EDTA, 50 mM Tris, 0.5% Nonidet P-40, pH 8.0) before anti-acetyl-lysine agarose beads (catalog no. PTM-104, PTM Biolabs) were added; the mixture was incubated at 4 °C with gentle shaking overnight. The agarose beads were then gently washed three times with NETN buffer, twice with ETN buffer (100 mM NaCl, 1 mM EDTA, 50 mM Tris, pH 8.0), and once with pure water. Finally, the bound peptides were eluted from the agarose beads with 1% trifluoroacetic acid (TFA) and were lyophilized to dryness in a SpeedVac. The resulting peptides were desalted with C18 ZipTips (Millipore) before LC–MS/MS analysis.

2.4. LC–MS/MS

LC–MS/MS has been extensively used to identify modified peptides [4], [5], [6], [7]. We re-dissolved the peptides of each sample in 6 μL of mobile phase A (2% CAN/0.1% FA) and then loaded the preparation onto a trap column (Acclaim PepMap 100 C18, 75 μm DI, 2 cm length, Dionex) by EASY nLC1000 nanoUPLCsystem (ThermoScientific). For separation, the peptide was eluted onto a C18 analytical column (Acclaim PepMapRSLC, 50 μm DI, 15 cm length, Dionex). A 40-min gradient was run at 280 nL/min, starting from 5 to 30% mobile phase B (98% ACN, 1.9% H2O, 0.1% FA) for 30 min, 30–40% mobile phase B for 4 min, 40–80% mobile phase B for 2 min, and 80% mobile phase B for 4 min.

The peptides were injected into an NSI source coupled to a tandem mass spectrometry (MS/MS) in Orbitrap Q Exactive (ThermoScientific). The electrospray voltage applied to the NSI source was set at 1.8 kV. The parent ions of peptides were detected in the Orbitrap for full-MS scan; the resolution was set at 70,000, and the m/z scan range was 350–1800. The top 15 peptides were the selected for the following MS/MS fragmentation in the Orbitrap by using HCD at 28% NCE with 4% stepped NCE, and the resolution for ion fragments was set at 17,500. The MS/MS threshold ion count was set at 4×104 with 2.5-s dynamic exclusion. For generation of MS/MS spectra, 2×105 ions were accumulated.

2.5. Database search

The collected MS/MS data were processed by MaxQuant with the integrated Andromeda search engine (version 1.4.1.2), Mascot research engine (version 2.3), and pFind (version 2.8) against the Uniprot_OS Japonica Database, concatenated with the reverse decoy database. Trypsin/P was specified as the cleavage enzyme, the maximum missing cleavage was set at 3, and up to 4 modifications and 5 charges were allowed in each peptide. Mass tolerance was set at 5 ppm for precursor ions and 0.02 Da for fragment ions. Carbamidomethylation on Cys was specified as the fixed modification, and oxidation on Met, acetylation on Lys, and acetylation on protein N-terminal were specified as variable modifications. The false discovery rates (FDRs) for protein, peptide, and modification site were all specified as 0.01. The minimum peptide length was set at 7. To obtain high-confident results, Lys-acetylation sites with localization probability <0.75 or from reverse protein sequences were removed.

2.6. Acetylated protein annotation

GO annotation of identified acetylated proteins were derived from ID mapping data of the UniProt database (ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/) [8]. GO information for each protein was extracted from the ID mapping data as UniPortKB protein accession number. The KEGG pathway database was used to annotate identified acetylated protein interactions, reactions, and relations. The sequence of identified acetylated protein were used as input data for KOALA (KEGG Orthology and Links Annotation) software, and O. sativa japonica (taxonomy number: 39947) was set as the background species for search. Each input protein was then annotated by K number to a given sequence data set for subsequent analyses with reconstruct pathway [9]. InterProScan, a software package that allows protein and nucleic sequences to be scanned against InterPro׳s signatures, was used to annotate protein domain from the InterPro database, which combines signatures from multiple, diverse databases into a single searchable resource [10]. WoLF PSORT was used to predict the subcellular localization of the identified acetylated proteins [11].

Acknowledgments

This work was supported by National Natural Science Foundation of China grants (31272034 and 31422045) to W.D.L., the 973 Project (2012CB114005) of the Ministry of Science and Technology China to G.-L.W., and the National Transgenic Crop Initiative (2012ZX08009001) to G.-L.W.

Footnotes

Appendix A

Supplementary data associated with this article can be found in the online version at doi:10.1016/j.dib.2016.02.032.

Contributor Information

Guo-Liang Wang, Email: wang.620@osu.edu.

Wende Liu, Email: liuwende@caas.cn.

Appendix A. Supplementary material

Supplementary material

mmc1.pdf (1.2MB, pdf)

Supplementary material

mmc2.xlsx (248KB, xlsx)

Supplementary material

mmc3.xlsx (276.2KB, xlsx)

Supplementary material

mmc4.xlsx (1,000.4KB, xlsx)

Supplementary material

mmc5.xlsx (31KB, xlsx)

Supplementary material

mmc6.xlsx (55.6KB, xlsx)

References

  • 1.Yu J., Hu S., Wang J., Wong G.K.-S., Li S., Liu B. A draft sequence of the rice genome (Oryza sativa L. ssp. indica) Science. 2002;296:79–92. doi: 10.1126/science.1068037. [DOI] [PubMed] [Google Scholar]
  • 2.Xiong Y., Peng X., Cheng Z., Liu W., Wang G.L. A comprehensive catalog of the lysine-acetylation targets in rice (Oryza sativa) based on proteomic analyses. J. Proteom. 2016;30:30022–30027. doi: 10.1016/j.jprot.2016.01.019. [DOI] [PubMed] [Google Scholar]
  • 3.Park C.-H., Chen S., Shirsekar G., Zhou B., Khang C.H., Songkumarn P. The magnaporthe oryzae effector AvrPiz-t targets the RING E3 ubiquitin ligase APIP6 to suppress pathogen-associated molecular pattern–triggered immunity in rice. Plant Cell. 2012;24:4748–4762. doi: 10.1105/tpc.112.105429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Liao G., Xie L., Li X., Cheng Z., Xie J. Unexpected extensive lysine acetylation in the trump-card antibiotic producer Streptomyces roseosporus revealed by proteome-wide profiling. J. Proteom. 2014;106:260–269. doi: 10.1016/j.jprot.2014.04.017. [DOI] [PubMed] [Google Scholar]
  • 5.Zhang K., Zheng S., Yang J.S., Chen Y., Cheng Z. Comprehensive profiling of protein lysine acetylation in Escherichia coli. J. Proteome Res. 2013;12:844–851. doi: 10.1021/pr300912q. [DOI] [PubMed] [Google Scholar]
  • 6.Pan J., Ye Z., Cheng Z., Peng X., Wen L., Zhao F. Systematic analysis of the lysine acetylome in Vibrio parahemolyticus. J. Proteome Res. 2014;13:3294–3302. doi: 10.1021/pr500133t. [DOI] [PubMed] [Google Scholar]
  • 7.Wu X., Oh M.H., Schwarz E.M., Larue C.T., Sivaguru M., Imai B.S. Lysine acetylation is a widespread protein modification for diverse proteins in arabidopsis. Plant Physiol. 2011;155:1769–1778. doi: 10.1104/pp.110.165852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Barrell D., Dimmer E., Huntley R.P., Binns D., O׳Donovan C., Apweiler R. The GOA database in 2009--an integrated gene ontology annotation resource. Nucleic Acids Res. 2009;37:D396–D403. doi: 10.1093/nar/gkn803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Kanehisa M., Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Zdobnov E.M., Apweiler R. InterProScan--an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001;17:847–848. doi: 10.1093/bioinformatics/17.9.847. [DOI] [PubMed] [Google Scholar]
  • 11.Horton P., Park K.J., Obayashi T., Fujita N., Harada H., Adams-Collier C.J. WoLF PSORT: protein localization predictor. Nucleic Acids Res. 2007;35:W585–W587. doi: 10.1093/nar/gkm259. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.pdf (1.2MB, pdf)

Supplementary material

mmc2.xlsx (248KB, xlsx)

Supplementary material

mmc3.xlsx (276.2KB, xlsx)

Supplementary material

mmc4.xlsx (1,000.4KB, xlsx)

Supplementary material

mmc5.xlsx (31KB, xlsx)

Supplementary material

mmc6.xlsx (55.6KB, xlsx)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES