Abstract
The Seed Proteome Web Portal (SPWP; http://www.seed-proteome.com/) gives access to information both on quantitative seed proteomic data and on seed-related protocols. Firstly, the SPWP provides access to the 475 different Arabidopsis seed proteins annotated from two-dimensional electrophoresis (2DE) maps. Quantitative data are available for each protein according to its accumulation profile during the germination process. These proteins can be retrieved either in list format or directly on scanned 2DE maps. These proteomic data reveal that 40% of seed proteins maintain a stable abundance over germination, up to radicle protrusion. During sensu stricto germination (24 h upon imbibition) about 50% of the proteins display quantitative variations, exhibiting an increased abundance (35%) or a decreased abundance (15%). Moreover, during radicle protrusion (24–48 h upon imbibition), 41% of the proteins display quantitative variations with an increased (23%) or a decreased abundance (18%). In addition, an analysis of the seed proteome revealed the importance of protein post-translational modifications, as demonstrated by the poor correlation (r2 = 0.29) between the theoretical (predicted from the Arabidopsis genome) and the observed protein isoelectric points. Secondly, the SPWP is a relevant technical resource for protocols specifically dedicated to Arabidopsis seed proteome studies. Concerning 2D electrophoresis, the user can find efficient procedures for sample preparation, electrophoresis coupled with gel analysis, and protein identification by mass spectrometry, which we have routinely used during the last 12 years. Particular applications such as the detection of oxidized proteins or de novo synthesized proteins radiolabeled by [35S]-methionine are also described in great detail. Future developments of this portal will include proteomic data from studies such as dormancy release and protein turnover through de novo protein synthesis analyses during germination.
Keywords: seed, proteome, website, Arabidopsis, germination, dormancy, longevity, plant
Introduction
Biologically, the seed might be the most critical stage of Angiosperm development. Indeed, the number of seeds per plant, their size and their ability to germinate are key components of plant fitness (Donohue et al., 2005). Moreover, the seed structure not only helps plants to survive adverse environmental conditions but also helps them to colonize new environments. Concerning seed biology, our comprehension of fundamental biological processes such as dormancy or germination was greatly enhanced by the plant model Arabidopsis thaliana used in combination with global “omics” approaches such as proteomics (North et al., 2010; Rajjou et al., 2012). Thus, modern functional genomics allows the characterization of cellular responses, gene network activation, and metabolic adaptation among a wide range of seed physiological states (Nambara and Nonogaki, 2012). These novel technologies represent powerful tools to accelerate basic and translational seed research. Yet, to our knowledge, there are no publicly accessible websites dedicated to the Arabidopsis seed proteome. Thus, we decided to build a “Seed Proteome Web Portal” to give free access to relevant data on the Arabidopsis seed proteome.
Overview of the Seed Proteome Web Portal
The SPWP harbors a presentation of the founding laboratories, proteome data as well as detailed protocols and various links to bioinformatics resources or proteomic journals (please see site map).
The first main part of the SPWP is focused on seed proteome data that can be accessed either from the protein maps or from the protein catalog (Figure 1A). As of today, information on the 475 protein spots identified during Arabidopsis seed germination is available. In the protein map section, the user can retrieve a protein spot on a reference 2D gel obtained from Arabidopsis seeds (Figure 1A). The protein spot can be selected directly on the 2D gel image to open a new page containing protein spot data (Figure 1B). On that new page, the “essential” table gives information on the protein spot class, spot number, protein name, description, gene number (AGI-ID), and Mascot peptide matches (Figure 1B). In the “data” and “sequences” tables, the user can also find experimental proteomic data such as the expected/observed molecular weight and isoelectric point together with the peptide sequences that allowed identification of the protein (Figure 1B). Protein spot data can also be accessed via the protein catalog section.
The second main part of the SPWP offers a technical goldmine to scientists working on the seed proteome (Figure 1C). The user can find detailed protocols for a complete seed proteome experiment: seed germination, preparation of total protein extracts, 2D electrophoresis, gel staining, analysis of 2D gels, and protein identification by mass spectrometry are all explained in a detailed manner (please also see Rajjou et al., 2011). To our knowledge, there was previously no publicly and freely available technical resource entirely dedicated to the study of the seed proteome. In addition, the SPWP also gives some specific protocols to study seed sub-proteomes such as the oxidized or de novo synthesized proteomes (Figure 1D).
The Seed Proteome Illustrates the Developmental Switch Occurring during Arabidopsis Germination
Previous studies on the Arabidopsis seed demonstrated the presence of a high number of long-lived mRNAs in the mature dry seed (Nakabayashi et al., 2005) together with the absolute requirement for protein synthesis to germinate (Rajjou et al., 2004). During Arabidopsis seed germination, we found a total of 475 protein spots corresponding unambiguously to 241 non-redundant proteins, each matched to a single AGI-ID. A total of 18 protein spots were matched to multiple AGI-IDs (Table S1 in Supplementary Material). Indeed, because of mRNA processing, protein proteolysis and chemical protein modifications, one gene can produce many different protein isoforms, resulting in a wide proteome diversity. Each spot corresponds to a single protein isoform resulting from post-transcriptional or post-translational regulation of gene expression (1 mRNA = X protein isoforms). A total of 250 protein spots showed an increasing abundance during germination sensu stricto (0–24 h) or during radicle protrusion (24–48 h; Figure 2). Conversely, 143 protein spots displayed a decreasing abundance over the course of seed germination (Figure 2). These results illustrate the major developmental switch that occurs during the germination phase in preparation for seedling establishment. Some of the variation in protein spot abundance is due to post-translational modifications. Perhaps the best example comes from the 12S globulin subunits. Indeed, while there are only three genes coding for 12S globulin seed storage proteins in Arabidopsis (At1g03880, At4g28250, At5g44120), a total of 104 protein spots corresponding to 12S globulin seed storage proteins can be retrieved on a 2D gel of seed proteins (Arc et al., 2011). Spots corresponding to the precursor form of the globulin seed storage proteins can be processed by cleavage and transformed into their mature protein form (Gallardo et al., 2001).
Yet, at the seed proteome level, post-translational modifications by cleavage are not the most abundant modification, as we found a good correlation between the theoretical and the observed protein molecular weights (Figure 3). In contrast, we found a very poor correlation between the theoretical and the observed protein isoelectric points (Figure 3; Table S1 in Supplementary Material), illustrating the major impact of post-translational modifications, e.g., phosphorylation (Arc et al., 2011), glycosylation (Vuylsteker et al., 2000), or oxidation (Job et al., 2005), on the seed proteome.
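The comparison between theoretical and observed values described above can be illustrated with a minimal Python sketch that computes a squared Pearson correlation coefficient (r2). The function name and the numbers below are hypothetical and serve only as an illustration; they are not taken from Table S1, and the source does not state which software was used for the published analysis.

```python
def r_squared(xs, ys):
    """Squared Pearson correlation between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations around the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one sequence is constant")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical values for a handful of spots (illustration only)
theoretical_pi = [5.2, 6.1, 7.4, 8.9, 5.8]
observed_pi = [5.9, 5.5, 6.8, 6.2, 7.1]

if __name__ == "__main__":
    print("r2 for isoelectric point:", round(r_squared(theoretical_pi, observed_pi), 2))
```

Applied once to the molecular weight columns and once to the isoelectric point columns of a spot table, the same function would give the two values that are contrasted in the text.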
Originality of the Seed Proteome
We investigated the specificity of the Arabidopsis seed proteome by comparison with the leaf proteome data from the Plant Proteomics Database (Sun et al., 2009) and with the entire Arabidopsis proteome (TAIR10 Gene Annotation Data). First, we extracted the unambiguously identified non-redundant AGI-IDs from the 475 protein spots of the seed proteome and classified them using the FunCat catalog (Ruepp et al., 2004). We also classified the leaf proteome and the whole genome with the FunCat catalog (Baerenfaller et al., 2008). After classification, we observed that the “cell rescue, defense and virulence,” “energy,” “protein fate,” and “storage protein” categories were overrepresented in the seed proteome as compared to the leaf proteome or entire proteome datasets (Figure 4, Table S2 in Supplementary Material). The seed proteome “cell rescue” category contains catalase, superoxide dismutase and peroxiredoxin proteins that detoxify reactive oxygen species produced very early during germination (Bailly, 2004). The “energy” category encompasses many proteins from glycolysis (e.g., glyceraldehyde 3-phosphate dehydrogenase, pyruvate carboxylase) as well as mitochondrial proteins from the tricarboxylic acid cycle (e.g., citrate synthase, succinyl-CoA ligase) or the glyoxylate cycle (isocitrate lyase, malate synthase). Thus, the seed proteome illustrates the high energy demand in the seed upon metabolic resumption, and it was recently shown that a majority of a subset of 775 germination-specific genes was related to mitochondrial biogenesis (Narsai et al., 2011). The “protein fate” class of the seed proteome is composed of proteins involved in protein degradation (e.g., subunits of the 20S/26S proteasome), protein maturation (e.g., leucine aminopeptidase 1) or protein folding (e.g., heat shock proteins, peptidyl-prolyl cis-trans isomerase 1, protein disulfide isomerase).
Finally, the “storage protein” category is overrepresented in the seed proteome due to its developmental specificity. Storage proteins serve both as an amino acid storage pool and in supplemental roles, since the alpha-subunits are preferentially oxidized during seed germination (Job et al., 2005; Rajjou et al., 2006, 2007). Altogether, contrasting the FunCat classification of the seed proteome, the leaf proteome and the entire Arabidopsis proteome highlights the specificity of the seed proteome, particularly concerning the energy demand and the correct protein folding necessary for seed germination.
In the Arabidopsis Seed, the Transcriptome and Proteome Yield Non-Redundant Biological Information
The most highly regulated genes during Arabidopsis seed development tended to be expressed preferentially in seeds compared with other plant organs (Ruuska et al., 2002). Moreover, a recent paper on both transcriptome and proteome during Arabidopsis seed development showed that 56% of 319 protein/transcript pairs had concordant expression patterns (Hajduch et al., 2010). In the Arabidopsis dry seed, more than 12,000 stored mRNA species were detected (Nakabayashi et al., 2005). This transcriptome is characterized both by a great number of stored mRNAs and by the rapid and extensive changes occurring a few hours after imbibition (Nakabayashi et al., 2005; Preston et al., 2009; Narsai et al., 2011). Thus, given the availability of the seed proteome, it was interesting to correlate the transcript and protein abundance variations during germination. Indeed, we wondered if we could expect similar correlations between protein and transcript accumulation profiles during seed development and germination. The 475 protein spots were matched to their corresponding AGI-IDs, and we obtained a non-redundant seed proteome of 241 proteins. Then, the normalized abundances of all protein isoforms corresponding to the same AGI-ID were summed. Employing the transcriptome data from Nakabayashi et al. (2005), we analyzed the mRNA accumulation corresponding to the 241 genes found in the seed non-redundant proteome. We obtained a probe signal for 218 genes and computed the correlation between the transcript change and the protein change between 0 and 24 h after imbibition of non-dormant Arabidopsis seeds (Figure 5, Table S3 in Supplementary Material). During seed germination, transcript and protein levels were essentially uncorrelated (r2 = 0.02).
This is in accordance with the fact that long-lived stored mRNAs are present in the dry seed state and that protein synthesis is required for seed germination while transcription is not (Rajjou et al., 2004; Kimura and Nambara, 2010; Sano et al., 2012). These observations suggest that there is no correlation between mRNA and protein half-lives in Arabidopsis seeds. Therefore, both the transcriptome and the proteome analyses in the Arabidopsis seed provide relevant information about the developmental switch from the dry quiescent state to the metabolically active state.
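The data preparation described above (summing the normalized abundances of all spots that share an AGI-ID, then pairing each gene with its transcript signal when a probe is available) can be sketched as follows. This is a minimal illustration under stated assumptions: the identifiers and numbers are hypothetical, and expressing the change between 0 and 24 h as a log2 ratio is our assumption, since the text does not specify how the change was computed.

```python
from collections import defaultdict
from math import log2


def sum_by_agi(spots):
    """Sum normalized spot abundances per AGI-ID.

    spots: iterable of (agi_id, abundance_0h, abundance_24h) tuples,
    one tuple per protein spot (isoform).
    """
    totals = defaultdict(lambda: [0.0, 0.0])
    for agi_id, abundance_0h, abundance_24h in spots:
        totals[agi_id][0] += abundance_0h
        totals[agi_id][1] += abundance_24h
    return {agi_id: tuple(values) for agi_id, values in totals.items()}


def paired_changes(protein_totals, transcript_signal):
    """Return {agi_id: (transcript_change, protein_change)} as log2 ratios.

    Genes without a transcript probe signal are skipped, mirroring the
    reduction from 241 proteins to 218 genes described in the text.
    """
    pairs = {}
    for agi_id, (p0, p24) in protein_totals.items():
        if agi_id not in transcript_signal:
            continue  # no probe signal available for this gene
        t0, t24 = transcript_signal[agi_id]
        pairs[agi_id] = (log2(t24 / t0), log2(p24 / p0))
    return pairs


# Hypothetical data: gene_A is represented by two spots (isoforms)
spots = [
    ("gene_A", 1.0, 3.0),
    ("gene_A", 1.0, 1.0),
    ("gene_B", 4.0, 2.0),
    ("gene_C", 2.0, 2.0),
]
transcripts = {"gene_A": (8.0, 2.0), "gene_B": (1.0, 1.0)}

totals = sum_by_agi(spots)
pairs = paired_changes(totals, transcripts)

if __name__ == "__main__":
    for agi_id, (transcript_change, protein_change) in sorted(pairs.items()):
        print(agi_id, transcript_change, protein_change)
```

The resulting pairs of transcript and protein changes are the kind of input from which a correlation coefficient such as the one reported for Figure 5 would be calculated.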
In a complementary approach, we took advantage of two independent studies designed to identify tissue-specific genes in Arabidopsis by genome-wide transcriptome (Schmid et al., 2005) or proteome (Baerenfaller et al., 2008) approaches. To compare both transcriptome and proteome studies, we restricted our analysis to the “flower,” “leaves,” “root,” and “seed” tissues as they were common to the two studies. Altogether, 151 tissue-specific genes could be identified in these four tissues by transcriptomics while 469 could be identified by proteomics (Figure 6A, Table S4 in Supplementary Material). Surprisingly, we found that only 29 tissue-specific genes were commonly identified by the two approaches (Figure 6A). On closer examination, we found that 20 seed-specific genes were identified by both approaches out of a total of 190 seed-specific genes, i.e., 57 seed-specific genes identified by transcriptomics plus 133 seed-specific genes identified by proteomics (Figure 6B, Table S4 in Supplementary Material). This is in accordance with the poor correlation between RNA and protein levels that we found for the 241 non-redundant proteins of the seed proteome (Figure 5). Finally, in a study on 319 protein/transcript pairs during Arabidopsis seed filling, the observed correlation was equal to 56% (Hajduch et al., 2010). This suggests that the correlation between RNA and protein levels is strongly reduced during seed germination, a key transition in a plant's life. Altogether, these analyses show that the seed transcriptome and proteome are not redundant and that each technique yields complementary biological information that reflects the importance of post-transcriptional as well as post-translational modifications in the seed.
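The overlap between gene lists obtained by two approaches, as summarized in Figure 6, amounts to a simple set comparison. The sketch below uses hypothetical gene identifiers and a hypothetical function name; it only illustrates how counts of genes found by one approach, by the other, or by both can be derived, and does not reproduce the published numbers.

```python
def overlap_summary(transcriptome_ids, proteome_ids):
    """Count genes specific to each approach and genes shared by both."""
    transcriptome = set(transcriptome_ids)
    proteome = set(proteome_ids)
    return {
        "transcriptome_only": len(transcriptome - proteome),
        "proteome_only": len(proteome - transcriptome),
        "both": len(transcriptome & proteome),
        "union": len(transcriptome | proteome),
    }


# Hypothetical tissue-specific gene lists (illustration only)
seed_specific_transcriptome = ["gene_1", "gene_2", "gene_3"]
seed_specific_proteome = ["gene_3", "gene_4", "gene_5", "gene_6"]

summary = overlap_summary(seed_specific_transcriptome, seed_specific_proteome)

if __name__ == "__main__":
    print(summary)
```

Running the same comparison per tissue and across all four tissues would give the kind of counts displayed in the two panels of Figure 6.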
Future Developments
As outlined here, the SPWP gives precise information for seed biologists. These data are complementary and non-redundant in comparison to seed transcriptome data, in particular during Arabidopsis seed germination. In addition, the great number of protein isoforms revealed by the 2D analysis highlights the diversity of the seed proteome, which is currently underestimated due to the difficulty of detecting low-abundance proteins. Future developments of the SPWP will first include proteins regulated by dormancy release in Arabidopsis (Chibani et al., 2006) as well as protein turnover (L. Rajjou, unpublished data). Moreover, we will include proteomic studies on sugar beet (Beta vulgaris) and rice (Oryza sativa) germination. In addition, LC-MS/MS data on specific protein modifications such as carbonylation or phosphorylation during seed germination will be included to highlight the extent of post-translational modifications at this key developmental stage. Finally, the recent progress in laser-assisted microdissection combined with shotgun proteomics by LC-MS/MS could be applied to describe the metabolic compartmentalization in the Arabidopsis seed, as was done in other species (Gallardo et al., 2007; Finnie and Svensson, 2009).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary Material
The Supplementary Material for this article can be found online at http://www.frontiersin.org/Plant_Proteomics/10.3389/fpls.2012.00098/abstract
References
- Arc E., Galland M., Cueff G., Godin B., Lounifi I., Job D., Rajjou L. (2011). Reboot the system thanks to protein post-translational modifications and proteome diversity: how quiescent seeds restart their metabolism to prepare seedling establishment. Proteomics 11, 1606–1618 10.1002/pmic.201000641
- Baerenfaller K., Grossmann J., Grobei M. A., Hull R., Hirsch-Hoffmann M., Yalovsky S., Zimmermann P., Grossniklaus U., Gruissem W., Baginsky S. (2008). Genome-scale proteomics reveals Arabidopsis thaliana gene models and proteome dynamics. Science 320, 938–941 10.1126/science.1157956
- Bailly C. (2004). Active oxygen species and antioxidants in seed biology. Seed Sci. Res. 14, 93–107 10.1079/SSR2004159
- Chibani K., Ali-Rachedi S., Job C., Job D., Jullien M., Grappin P. (2006). Proteomic analysis of seed dormancy in Arabidopsis. Plant Physiol. 142, 1493–1510 10.1104/pp.106.087452
- Donohue K., Dorn L., Griffith C., Kim E., Aguilera A., Polisetty C. R., Schmitt J. (2005). The evolutionary ecology of seed germination of Arabidopsis thaliana: variable natural selection on germination timing. Evolution 59, 758–770 10.1111/j.0014-3820.2005.tb01752.x
- Finnie C., Svensson B. (2009). Barley seed proteomics from spots to structures. J. Proteomics 72, 315–234 10.1016/j.jprot.2008.12.001
- Gallardo K., Firnhaber C., Zuber H., Héricher D., Belghazi M., Henry C., Küster H., Thompson R. (2007). A combined proteome and transcriptome analysis of developing Medicago truncatula seeds: evidence for metabolic specialization of maternal and filial tissues. Mol. Cell. Proteomics 6, 2165–2179 10.1074/mcp.M700171-MCP200
- Gallardo K., Job C., Groot S. P., Puype M., Demol H., Vandekerckhove J., Job D. (2001). Proteomic analysis of Arabidopsis seed germination and priming. Plant Physiol. 126, 835–848 10.1104/pp.126.2.835
- Hajduch M., Hearne L. B., Miernyk J. A., Casteel J. E., Joshi T., Agrawal G. K., Song Z., Zhou M., Xu D., Thelen J. J. (2010). Systems analysis of seed filling in Arabidopsis: using general linear modeling to assess concordance of transcript and protein expression. Plant Physiol. 152, 2078–2087 10.1104/pp.109.152413
- Job C., Rajjou L., Lovigny Y., Belghazi M., Job D. (2005). Patterns of protein oxidation in Arabidopsis seeds and during germination. Plant Physiol. 138, 790–802 10.1104/pp.105.062778
- Kimura M., Nambara E. (2010). Stored and neosynthesized mRNA in Arabidopsis seeds: effects of cycloheximide and controlled deterioration treatment on the resumption of transcription during imbibition. Plant Mol. Biol. 73, 119–129 10.1007/s11103-010-9603-x
- Nakabayashi K., Okamoto M., Koshiba T., Kamiya Y., Nambara E. (2005). Genome-wide profiling of stored mRNA in Arabidopsis thaliana seed germination: epigenetic and genetic regulation of transcription in seed. Plant J. 41, 697–709 10.1111/j.1365-313X.2005.02337.x
- Nambara E., Nonogaki H. (2012). Seed biology in the 21st century: perspectives and new directions. Plant Cell Physiol. 53, 1–4 10.1093/pcp/pcr165
- Narsai R., Law S. R., Carrie C., Xu L., Whelan J. (2011). In-depth temporal transcriptome profiling reveals a crucial developmental switch with roles for RNA processing and organelle metabolism that are essential for germination in Arabidopsis. Plant Physiol. 157, 1342–1362 10.1104/pp.111.183129
- North H., Baud S., Debeaujon I., Dubos C., Dubreucq B., Grappin P., Jullien M., Lepiniec L., Marion-Poll A., Miquel M., Rajjou L., Routaboul J. M., Caboche M. (2010). Arabidopsis seed secrets unravelled after a decade of genetic and omics-driven research. Plant J. 61, 971–981 10.1111/j.1365-313X.2009.04095.x
- Preston J., Tatematsu K., Kanno Y., Hobo T., Kimura M., Jikumaru Y., Yano R., Kamiya Y., Nambara E. (2009). Temporal expression patterns of hormone metabolism genes during imbibition of Arabidopsis thaliana seeds: a comparative study on dormant and non-dormant accessions. Plant Cell Physiol. 50, 1786–1800 10.1093/pcp/pcp121
- Rajjou L., Belghazi M., Catusse J., Ogé L., Arc E., Godin B., Chibani K., Ali-Rachidi S., Collet B., Grappin P., Jullien M., Gallardo K., Job C., Job D. (2011). Proteomics and posttranslational proteomics of seed dormancy and germination. Methods Mol. Biol. 773, 215–236 10.1007/978-1-61779-231-1_14
- Rajjou L., Belghazi M., Huguet R., Robin C., Moreau A., Job C., Job D. (2006). Proteomic investigation of the effect of salicylic acid on Arabidopsis seed germination and establishment of early defense mechanisms. Plant Physiol. 141, 910–923 10.1104/pp.106.082057
- Rajjou L., Duval M., Gallardo K., Catusse J., Bally J., Job C., Job D. (2012). Seed germination and vigor. Annu. Rev. Plant Biol. 63, 507–533 10.1146/annurev-arplant-042811-105550
- Rajjou L., Gallardo K., Debeaujon I., Vandekerckhove J., Job C., Job D. (2004). The effect of alpha-amanitin on the Arabidopsis seed proteome highlights the distinct roles of stored and neosynthesized mRNAs during germination. Plant Physiol. 134, 1598–1613 10.1104/pp.103.036293
- Rajjou L., Lovigny Y., Job C., Belghazi M., Groot S., Job D. (2007). “Seed quality and germination,” in Seeds: Biology, Development and Ecology, eds Navie S., Adkins S., Ashmore S. (Cambridge: CAB International Publishing), 324–332
- Ruepp A., Zollner A., Maier D., Albermann K., Hani J., Mokrejs M., Tetko I., Güldener U., Mannhaupt G., Münsterkötter M., Mewes H. W. (2004). The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Res. 32, 5539–5545 10.1093/nar/gkh894
- Ruuska S. A., Girke T., Benning C., Ohlrogge J. B. (2002). Contrapuntal networks of gene expression during Arabidopsis seed filling. Plant Cell 14, 1191–1206 10.1105/tpc.000877
- Sano N., Permana H., Kumada R., Shinozaki Y., Tanabata T., Yamada T., Hirasawa T., Kanekatsu M. (2012). Proteomic analysis of embryonic proteins synthesized from long-lived mRNAs during germination of rice seeds. Plant Cell Physiol. 53, 687–698 10.1093/pcp/pcs024
- Schmid M., Davison T. S., Henz S. R., Pape U. J., Demar M., Vingron M., Schölkopf B., Weigel D., Lohmann J. U. (2005). A gene expression map of Arabidopsis thaliana development. Nat. Genet. 37, 501–506 10.1038/ng1543
- Sun Q., Zybailov B., Majeran W., Friso G., Olinares P. D., van Wijk K. J. (2009). PPDB, the Plant Proteomics Database at Cornell. Nucleic Acids Res. 37, D969–D974 10.1093/nar/gkp493
- Vuylsteker C., Cuvellier G., Berger S., Faugeron C., Karamanos Y. (2000). Evidence of two enzymes performing the de-N-glycosylation of proteins in barley: expression during germination, localization within the grain and set-up during grain formation. J. Exp. Bot. 51, 839–845 10.1093/jexbot/51.346.839