Skip to main content
Plant Physiology logoLink to Plant Physiology
. 2003 Mar;131(3):1104–1123. doi: 10.1104/pp.102.019034

Mapping the Proteome of Barrel Medic (Medicago truncatula)1,[w]

Bonnie S Watson 1, Victor S Asirvatham 1, Liangjiang Wang 1, Lloyd W Sumner 1,*
PMCID: PMC166875  PMID: 12644662

Abstract

A survey of six organ-/tissue-specific proteomes of the model legume barrel medic (Medicago truncatula) was performed. Two-dimensional polyacrylamide gel electrophoresis reference maps of protein extracts from leaves, stems, roots, flowers, seed pods, and cell suspension cultures were obtained. Five hundred fifty-one proteins were excised and 304 proteins identified using peptide mass fingerprinting and matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Nanoscale high-performance liquid chromatography coupled with tandem quadrupole time-of-flight mass spectrometry was used to validate marginal matrix-assisted laser desorption ionization time-of-flight mass spectrometry protein identifications. This dataset represents one of the most comprehensive plant proteome projects to date and provides a basis for future proteome comparison of genetic mutants, biotically and abiotically challenged plants, and/or environmentally challenged plants. Technical details concerning peptide mass fingerprinting, database queries, and protein identification success rates in the absence of a sequenced genome are reported and discussed. A summary of the identified proteins and their putative functions are presented. The tissue-specific expression of proteins and the levels of identified proteins are compared with their related transcript abundance as quantified through EST counting. It is estimated that approximately 50% of the proteins appear to be correlated with their corresponding mRNA levels.


Legumes are valuable agricultural and commercial crops that serve as important nutrient sources for both humans and animals. For example, alfalfa (Medicago sativa) is an important forage crop with over 24 million acres planted annually with an annual U.S. value approaching 6 billion dollars (U.S. Department of Agriculture-National Agricultural Statistics Service, 2002). Legumes are characterized by symbiotic relationships with both nitrogen-fixing bacteria and arbuscular mycorrhizal fungi (Barker et al., 1990). These host-symbiont interactions result in the ability to fix atmospheric nitrogen and effect mutualistic and defense-related biosynthetic pathways such as the isoflavones, which have been reported to possess antimicrobial, anticarcinogenic, and other health-promoting properties (Dixon, 1999). Other secondary metabolites in legumes such as the triterpenes have been associated with defense and are of particular interest as novel pharmaceuticals (Small, 1996; Haridas et al., 2001).

The study of legume biology using many of the agriculturally important legumes such as soybean (Glycine max) and alfalfa is complicated by the large genome size and complex ploidy of these species. Fortunately, barrel medic (Medicago truncatula) has a smaller diploid genome that yields more manageable genetics. These traits, along with its autogamous nature, short generation time, and prolific seed production have made barrel medic a useful model legume (Barker et al., 1990; Cook et al., 1997; Cook, 1999; Bell et al., 2000; Trieu et al., 2000).

The impressive achievements in genome and expressed sequence tag (EST) sequencing have yielded a wealth of information for many model organisms, including the plants Arabidopsis and barrel medic. Unfortunately, sequence information alone is insufficient to answer questions concerning gene function, developmental/regulatory biology, and the biochemical kinetics of life. To address these questions, more comprehensive approaches that include quantitative and qualitative analyses of gene expression products are necessary at the transcriptome, proteome, and metabolome levels. Transcriptome approaches using microarray and serial analysis of gene expression technologies are powerful tools; however, mRNA abundances may only represent putative function because there is still a questionable correlation between mRNA and protein levels (Futcher et al., 1999; Gygi et al., 1999). In contrast, proteomics provides a more direct assessment of biochemical processes by monitoring the actual proteins performing the enzymatic, regulatory, and structural functions encoded by the genome and transcriptome. Recent improvements in high-resolution two-dimensional PAGE (2-DE; Klose and Kobalz, 1995; Görg et al., 1999), increased content of protein and nucleotide databases, and increased capabilities for protein identification utilizing modern mass spectrometry methods such as matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOFMS; Pappin et al., 1993; Yates, 1998a, 1998b; Corthals et al., 2000) have made the large-scale profiling and identification of proteins a dynamic new area of research in plant biology.

Although there is a substantial amount of work in the literature on bacterial (Guerreiro et al., 1999; Morris and Djordevic, 2001), yeast (Futcher et al., 1999), and human proteomes (Anderson et al., 2001; Stensballe and Jensen, 2001), there is relatively less information on plant proteomes (van Wijk, 2001). Costa and coworkers have identified proteins from xylem and needles of maritime pine (Pinus pinaster; Costa et al., 1998, 1999), and Tsugita and coworkers have worked on the rice (Oryza sativa) proteome with some success (Tsugita et al., 1994). Both of these groups have relied heavily on Edman sequencing, which suffers due to the inability to sequence proteins blocked at the N terminus. More recently, researchers have reported on subcellular proteomes such as the chloroplast membrane (Peltier et al., 2000, 2002) whereas others have focused on single tissues including Arabidopsis seeds (Gallardo et al., 2001), Arabidopsis mitochondria (Kruft et al., 2001; Millar et al., 2001), maize (Zea mays) root tips (Chang et al., 2000), and barrel medic roots (Mathesius et al., 2001, 2002). To date, there has been no large-scale project to identify proteins from multiple tissues of the same plant species.

The objective of the present work was to survey the organ-/tissue-specific proteomes of the model legume barrel medic, to provide an overview of the barrel medic proteome, and to serve as a basis for future proteome comparisons of genetic mutants, biotically, abiotically, and/or environmentally challenged plants. The survey was accomplished using 2-DE to produce reference maps of protein extracts from leaves, stems, roots, flowers, seed pods, and cell suspension cultures. MALDI-TOFMS peptide mass fingerprinting was used to identify 304 proteins. HPLC coupled with quadrupole time-of-flight tandem mass spectrometry (LC/MS/MS) was used to validate marginal MALDI-TOFMS protein identifications. The identified proteins are discussed and classified based on putative functions determined through similarity (Bevan et al., 1998). Database search results are quantified and strategies discussed. The expression levels quantified by 2-DE are compared with mRNA levels quantified by EST counting.

RESULTS AND DISCUSSION

2-DE Reference Maps and Protein Identifications of Barrel Medic Tissues

2-DE reference maps were obtained for barrel medic leaves, stems, roots, flowers, seed pods, and cell suspension cultures and are provided in Figure 1. To qualitatively survey the proteins visualized by 2-DE, a total of 551 proteins (i.e. approximately 96 arbitrary protein spots per gel including positive molecular mass marker controls and negative gel blank controls) were excised from each of the organ-/tissue-specific Coomassie-stained 2-DE gels and analyzed by mass spectrometry. Typically, high-quality MALDI-TOFMS peptide mass maps were obtained, and representative spectra are provided in Figure 2. Of the 551 protein spots processed, 304 proteins were successfully identified and are listed in Table I.

Figure 1.

Figure 1

2-DE proteome reference maps were obtained for A, leaf; B, stem; C, root; D, flowers; E, seed pods; and F, cell suspension cultures. Proteins that were identified in this study are marked with arrows and numbers. The numbers correlate with protein identifications listed in Table I. 2-DE was performed using 0.75 to1.0 mg of protein, linear 11-cm IPG strips (pH 3–10), and a 12% (w/v) total acrylamide SDS second dimension. Gels were stained overnight with Coomassie Brilliant Blue R-250, destained the next day, and images recorded.

Figure 2.

Figure 2

Representative peptide mass maps obtained using MALDI-TOFMS illustrating good data quality but differences in protein identification success dependent upon the database queried. Mass spectral peaks are labeled with monoisotopic mass-to-charge ratio (m/z) values used for database searching. A, Stromal 70-kD heat shock-related protein (HSP70, accession no. Q02028) was successfully identified in seed pods (pds#7) using the NCBI databases. B, Isoflavone reductase (accession no. BE325778) from seed pods (pds#39) was identifiable only through use of the EST databases.

Table I.

Proteins identified in barrel medic tissues

Tissue Spot # Identification Accession Number Databases # Peptides/LC/MS/MS
lvs 35 er ATPase (CDC48-like protein)a NP190891 N 11
lvs 39 DNA mismatch repair proteina O66652 S 8
lvs 47 Rubisco BE420942 E/Hv, (E) 7
lvs 52 Rubisco BE420942 E/Hv 5
lvs 51 F23N19.10, TPR repeat protein AW694998 E/Mt, (E) 4/(513)
lvs 63 Cell division prt. FTSK homologa P45264 S, N 8
lvs 78 Rubisco BAA20039 N, S, E/Hv 11
lvs 82 Rubisco AAF97663 N, S, E/Hv 9
lvs 84 Rubisco AAF15326 N, S, E/Hv 9
lvs 92 Rubisco X69528 N NA/(389)
lvs 98 Transcription factora BF004459 (E) 5
lvs 105 S-adenosyl-Met synthetase BG581653 E/Mt, N, S 12
lvs 108 S-adenosyl-Met synthetase P50303 S 8
lvs 111 Rubisco activase AF251264 N NA/(372)
lvs 113b ATP synthase beta chain NP077960 N 8
lvs 113b Rubisco activase Q42450 S, E/Sb 6
lvs 124 Rubisco activase AAG61120 N, S 9
lvs 126 Rubisco activase AAG61120 N, S 8
lvs 123 Aminomethyl transferase, mito. Precursora BF521422 E/Mt, (E) 12
lvs 128 Aminomethyl transferase, (T protein)a P49364 N, S, E/Mt 11
lvs 136 Fru biphosphate aldolase BI309468 (E), N, S, E/Mt 9
lvs 138 Spermine synthase BE204391 E/Mt 5
lvs 139 Putative Arabidopsis thaliana proteina AW685607 E/Mt 5
lvs 141 Ankyrin repeat protein AL388433 E/Mt 5
lvs 144b Glyceraldehyde-3-phosphate dehydrogenase BF003409 E/Mt, (E) 10
lvs 144b Possible tartrate dehydrogenase P70792 S, N 6
lvs 149b Glyceraldehyde-3-phosphate dehydrogenase BG453922 (E) 7
lvs 149b Tartrate dehydrogenase P70792 N, S 6
lvs 155 Leu2 (3-isopropyl malate dehydrogenase)a P18120 N, S 7
lvs 158 Malate dehydrogenase T09286 N, E/Mt, (E) 14
lvs 196b Ascorbate peroxidase BG587041 (E) 5
lvs 187 Oxygen evolving enhancer protein P14226 S, N, E/Mt, (E) 7
lvs 191 Oxygen evolving enhancer protein P14226 S, N, E/Mt, (E) 9
lvs 188 Remorina BG588209 (E) 7
lvs 189 Remorina BG588209 (E) 8
lvs 196b Rubisco AAC35045 N, S 8
lvs 206b Mitotic cyclin B1-1a AAC24244 N 8
lvs 206b ATP synthasea BG582863 (E) 9
lvs 205 Oxygen-evolving enhancer protein 1 BG449793 (E) 5
lvs 219 Cystathione-B-lyasea P53780 S 6
lvs 222 Chloro membrane-associated 30-kD protein/transit pepta AW776774 E/Mt, (E) 9
lvs 223 RNA-binding protein BF641320 E/Mt, (E) 8
lvs 238 l-ascorbate peroxidase P48534 N, S, E/Mt, (E) 6
lvs 241 Ascorbate peroxidase AAL15164 N, S, (E) 6
lvs 237 Acid phosphatase BG588612 (E) 8
lvs 239 Acid phosphatase BG588612 (E) 11
lvs 251 Triose phosphate isomerase, cytosolic BF642390 E/Mt, (E) 10
lvs 250 ABC transportera NP488322 N 7
lvs 258b Pyrimidine-nucleoside phosphorylasea P39N9 S 8
lvs 263 Chaperonin 21 precursora AW775755 E/Mt 6
lvs 265 Chaperonin 21 precursora AW776607 E/Mt, (E) 5
lvs 261 Patatin-like proteina AAF98369 N 5
lvs 258b Transcription factor VSE-1a CAA05898 N 6
lvs 270 Oxygen-evolving enhancer protein P16059 S, E/Mt, (E) 7
lvs 280 Oxygen-evolving enhancer protein BF521386 E/Mt, S, (E) 12
lvs 281 Plastid specific ribosomal proteina BE318731 (E) 4
lvs 287 Gly-rich cell wall structural protein 2a AL366848 E/Mt, (E) 8
lvs 284 Oxygen-evolving enhancer protein P16059 S, E/Mt, (E) 5
lvs 338 Hypothetical proteina NP180029 N 5
lvs 363 Aspartate 1-decarboxylase precursora P52999 S 4
lvs 388 Rubisco small subunit BF520627 E/Mt, (E) 9
lvs 387 Rubisco small subunit BF519126 E/Mt 5
lvs 397 Plastocyanine precursor AW776926 E/M. t., (E) 7
lvs 422 Photosystem I iron-sulfur proteina NP039445 N, S 8
stm 5 Cell division (valosin-containing) protein P54774 N, S 8
stm 7 Heat shock protein 70 1909352A N, S 10
stm 10 TPR repeat protein AW694998 (E), E/Mt 7
stm 9 Heat shock protein 70 P37900 N, S, E/Mt 8
stm 17 Rubisco CAA93074 N, S 8
stm 16 Rubisco P28400 N, S 7
stm 18b ATP synthasea CAB85681 N 7
stm 18b Rubisco P30401 S, N 7
stm 19 Rubisco P04991 S, N, E/Mt 13
stm 20 Rubisco AAF97641 N, S, E/Mt 6
stm 22 Tubulin alpha chain Q43473 N, S, E/Mt 13
stm 23 26S proteasome AAA-ATPase subunita BE325937 E/Mt 5
stm 24 26S proteasome (TAT binding)a NP187204 N, S, E/Mt 7
stm 26 SAM synthetase P46611 N, S, E/Mt 7
stm 29 Actin Q96483 N, S, E/Mt 7
stm 36 ATPase or P loop kinasea NP347611 N 9
stm 37 Fru 1,6 biphosphate aldolase O65735 N, S, E/Mt 8
stm 39 Adenosine kinasea BF004017 (E) 10
stm 40 Malate dehydrogenase BI310064 (E) 6
stm 42 Annexina T09552 N, E/Mt 6
stm 43 Fructokinase AW584645 E/Mt, (E) 7
stm 44 Ribose-phosphate pyrophosphokinasea P47304 S 7
stm 45 IFR-like oxidoreductase BF644624 E/Mt, (E) 5
stm 48 Atran bp1a (Ran-binding protein 1 domain)a AW686211 (E) 7
stm 46 Cinnamoyl-CoA reductasea BF635045 (E) 7
stm 49 G protein beta subunita Q39836 N, S, E/Mt, (E) 4/(265)
stm 51 Oxygen-evolving enhancer protein I P14226 S, N 10
stm 52 RNA-binding protein-like NP196048 N 6
stm 57 Ascorbate peroxidase BG648814 (E) 4
stm 58 Proteasome subunit alpha type 7 (20S) Q9SXU1 S, N, E/Mt, (E) 5
stm 55 SAM:trans-caffeoyl CoA 3-O methyl transf.a T09399 N, E/Mt 10
stm 54 RNA-binding protein-like BF641320 E/Mt 5
stm 60 Ascorbate peroxidase BG648814 (E), E/Mt 5
stm 61 Acid phosphatase BG588612 (E), E/Mt 6
stm 62 Triosphosphate isomerase BF642390 E/Mt, (E) 10
stm 64 Expressed proteina AI774799 E/Le 5
stm 66 Uridylate monophosphate kinase AW981222 E/Mt, (E) 8
stm 70 23-kD O2-evolving pht. sys. II precursora P16059 N, S, E/Mt, (E) 7
stm 72 ATP synthase, delta chaina Q41000 S 4
stm 81 vcCYP AW775250 E/Mt, (E), N 8
stm 85 40S ribosomal protein S12a AL375805 E/Mt 4
stm 86 Gly-rich RNA binding protein AL379229 E/Mt, (E) 4
stm 88 60S ribosomal protein AW776748 E/Mt, (E) 4
stm 90 Nucleoside diphosphate kinase I P47922 S 5
stm 89 Gly-rich RNA binding protein AA660717 E/Mt, (E), N 9
stm 95 Hypotheticala NP174644 N 6
rts 2 Heat shock 70 Q02028 N, S, E/Mt 22
rts 5 Phosphoglyceromutase BG585916 (E) 5
rts 6 Protein disulfide isomerase BI309490 E/Mt, (E), N, S 10
rts 9 Putative methyl binding domain AL378817 E/Mt 4
rts 12 ATPase beta subunit CAA75477 N, S, E/Mt, (E) 13
rts 19 Actin isoform B T51183 N, S, E/Mt, (E) 13
rts 20 Peroxidase precursor AL369822 (E), E/Mt 8
rts 22 Ankyrin repeat protein HBP1 BI311773 (E), E/Mt 12
rts 28 Glyceraldehyde-3-phosphate dehydrogenase BG453922 (E), S 10
rts 32 Cationic peroxidase precursor BG584470 (E) 7
rts 33 Isoflavone reductase BG645198 (E) 13
rts 36 Isoflavone reductase homolog BI312226 E/Mt, (E) 5
rts 39 Acidic glucanasea BF650084 (E), E/Mt, N 4
rts 40 Cytochrome c oxidase subunit 6b-1 BI310278 (E), E/Mt 8
rts 41 Gluco endo-1,3-beta-d-glucosidase BE239884 E/Mt 4
rts 42 Hydroxyacyl glutathione hydrolasea BG584417 (E) 8
rts 43 Chitinasea CAA71402 N, S, E/Mt, (E) 9
rts 44 Chitinasea CAA71402 N, S, E/Mt, (E) 12
rts 45 Chitinasea CAA71402 N, S, E/Mt, (E) 10
rts 46 Cys proteinase BI269594 (E), E/Mt 4
rts 47 Cys proteinase precursor BG645760 (E), E/Mt 6
rts 48 Ascorbate peroxidase BG648814 (E) 4
rts 51 Ascorbate peroxidase P48534 S, N, E/Mt, (E) 4
rts 52 Triose phosphate isomerase BG584164 (E), E/Mt 11
rts 53 In2-1 proteina BF635446 E/Mt, (E) 7
rts 54 Uridylate kinase (UDP kinase) AW981222 E/Mt, (E) 4
rts 56 Chalcone-flavone isomerase AW559891 (E), E/Mt, N, S 6
rts 60 Unknown proteina AW686250 (E), E/Mt 9
rts 61 Alpha fucosidasea BE942130 (E) 6
rts 63 Putative protein T25B15.70a BF520168 E/Mt, (E) 8
rts 65 Seed protein precursor AL371551 E/Mt, (E) 4
rts 66 Vc Cyp (peptidyl isomerase) BE316900 (E), E/Mt 4
rts 69 Profucosidasea AW126318 (E) 5
rts 75 Putative proteina BF005271 (E), E/Mt 8
rts 77 Glyceraldehyde-3-phosphate dehydrogenase BF635050 E/Mt, (E), N 9
rts 79 Cu/Zn superoxide dismutasea AL387737 E/Mt 4
rts 80 aba-Responsive protein ABR17 BF648027 E/Mt, (E) 8
rts 81 Unknown proteina AL365549 E/Mt, (E) 6
rts 82 Putative ripening-related proteina BE943167 E/Mt, (E) 4
rts 92 Thioredoxin BE997543 (E), E/Mt 4
flw 3 Valosin-containing cell division protein P54774 S, N 16
flw 6 NADH ubiquinone oxidoreductase AW587332 E/Mt, (E) 6
flw 7 Heat shock 70 Q02028 N, S 10
flw 8 Poly(A+)-binding proteina BG584083 (E) 5
flw 11 Phosphoglyceromutase BG585916 (E), E/Mt, N, S 10
flw 13 Putative methyl-binding domain AL378817 (E), E/Mt 5
flw 14 Calreticulin AW773889 E/Mt 4/(348)
flw 17 ATPase beta subunit CAA75477 N, S, E/Mt, (E) 8
flw 18 Enolase CAB75428 N, S, E/Mt, (E) 7
flw 21 S-adenosyl Met synthetase AAL16064 N, S, E/Mt, (E) 8
flw 23 Rubisco activase AAK25798 N, S, E/Mt, (E) 9
flw 27 Ankyrin repeat protein HBP1 BI311773 (E) 9
flw 26 Fru-1,6-biphosphate aldolase O65735 N, S, E/Mt, (E) 7
flw 28 Aspartate aminotransferase P46643 S, N, E/Mt 4/(115)
flw 31 1-Aminocyclopro. carboxylic acid oxidasea AY062251 N, (E) 6
flw 32 Pyruvate dehydrogenase beta unita BF645846 (E), E/Mt 6
flw 30 Glyceraldehyde-3-phosphate dehydrogenase P34922 N, S, E/Mt, (E) 8
flw 33 Malate dehydrogenase O48905 S, N 4
flw 34 Malate dehydrogenase O48905 N, S, (E) 8
flw 35 Ripening-induced proteina BI308422 (E) 8
flw 39 Cytochrome c oxidase subunit 6b BI310278 (E), E/Mt 5
flw 40 Stromal ascorbate peroxidase Z67113 (E), E/Mt 7
flw 50 Acid phosphatase BG588612 (E), E/Mt 7
flw 51 Acid phosphatase BF004054 (E), E/Mt 6
flw 53 Triose phosphate isomerase BG584164 (E) 11
flw 55 Osmotin-like protein BI270608 (E) 8
flw 56 Chalcone isomerase BI310352 (E), E/Mt 6
flw 57 Ascorbate peroxidase AAL15164 N, S, E/Mt, (E) 5
flw 60 Oxygen-evolving enhancer protein 2 BF636854 E/Mt, (E) 5
flw 71 Peptidyl prolyl isomerase BE999037 (E), E/Mt 5
flw 73 Acid phosphatase AW584917 (E), E/Mt 4/(131)
flw 74 Peptidyl prolyl isomerase BE997455 (E), E/Mt 6
flw 75 Gly-rich RNA binding protein BF637655 E/Mt 4
flw 77 Peroxiredoxin (peroxidase) AW585033 (E), E/Mt 12
flw 78 Ubiquitin-like SMT3 protein AL376595 (E), E/Mt, S 8
flw 79 Ubiquitin-like SMT3 protein P55852 S, N, E/Mt mt 7
flw 81 60S acidic ribosomal protein p3 BF003585 (E), E/Mt 4/(123)
flw 82 Gly cleavage system h precursora BF518986 (E), E/Mt 5
flw 88 Acid ribosomal protein P2a2 AW329482 (E) 5
flw 91 Immunophilin AW574158 (E), E/Mt 6
flw 92 Rubisco small chain BI268542 (E), E/Mt 5
flw 94 Profilin 1a AL373653 (E) 9
flw 96 NADH plastoquinone oxidoreductase 4a BF631701 (E) 6
pds 5 Convicilina BI312063 (E) 5
pds 6 Convicilina BI310979 E/Mt 10
pds 7 Heat shock 70 Q02028 N, S 9
pds 8 Legumin a2 precursora BI312252 E/Mt 9
pds 9 Protein disulfide isomerase P29828 N, S, (E), E/Mt 7
pds 10 Glycinina BI308459 (E) 7
pds 13 Legumin a2 precursora BI311943 (E) 9
pds 12b Rubisco AAK70985 N, S 8
pds 12b NAp1p (plasma membrane intrinsic protein)a AW774263 E/Mt, (E) 10
pds 14 Vicilin 47kD precursora BI310576 (E) 9
pds 15 Provicilin precursora BI312400 (E) 6
pds 16 Vicilin 47kD precursora BI311712 (E) 4
pds 19 Vicilin 47kD precursora BI311712 (E) 7
pds 22 Glycinina BI311592 (E) 13
pds 20 Glycinina BI311729 (E) 4
pds 21 Glycinina BI308883 (E) 12
pds 23 Glycinina BI308883 (E) 6
pds 24 Legumin a2 precursora BI309500 (E) 11
pds 28 Legumin a2 precursora BI311943 (E) 14
pds 27 Legumin a2 precursora BI311943 (E) 11
pds 31 Fru 1,6-biphosphate aldolase O65735 N, S, (E), E/Mt 7
pds 30 Legumin a2 precursora BI311943 (E) 11
pds 29 Legumin a2 precursora BI311943 (E) 8
pds 32 Legumin a2a BI307938 (E) 5
pds 34 Cytosolic malate dehydrogenase BG583001 (E), N, S 6
pds 38 Malate dehydrogenase precursor AW688679 E/Mt, N 7
pds 37 Glycinina BI311164 (E) 10
pds 39 IFR-like NADH-dependent oxidoreductase BE325778 E/Mt 7
pds 41 Peroxidase 2 CAC38106 N, E/Mt 16
pds 43b Rubisco CAA62888 N, S 10
pds 43b Enolase BG362941 E/Gm 7
pds 44 Vicilin 47-kD precursora BI310576 (E) 9
pds 47 Acid phosphatase BF004054 E/Mt 5
pds 46 Acid phosphatase BF004054 E/Mt 7
pds 51 Acid phosphatase BG588612 (E) 9
pds 52 Proteosome 20S subunit BE922062 E/St 7
pds 53 Ascorbate peroxidase BG648703 (E), E/Mt 10
pds 54 Osmotin like protein precursor BG582096 (E), E/Mt 8
pds 55 Putative GSH-dependent dehydroascorbate reductasea BF636747 E/Mt, (E) 6
pds 56 Legumin a2 precursora BI307938 (E) 7
pds 57 Legumin a2 precursora BI309895 (E) 6
pds 58 Oxygen-evolving enhancer protein 2 AW775879 E/Mt, (E), S 12
pds 59 Legumin a2 precursora BI309155 (E) 9
pds 62 Legumin b (minor small)a BI311720 (E) 9
pds 65 Legumin b (minor small)a BI311720 (E) 12
pds 66 Legumin b (minor small)a BI311437 (E) 6
pds 68 Legumin-related high-Mr polypeptidea BI310430 (E) 8
pds 70 Hypothetical proteina AW685677 E/Mt, (E) 9
pds 71 Vicilin 4-kD precursora BI312335 (E) 10
pds 75 LEA proteina BG454568 (E) 4
pds 73 Legumin a2 precursora BI309500 (E) 4
pds 72 Eukaryotic initiation factor 5aa AL389124 E/Mt 4
pds 77 VcCyP peptidylprolyl isomerase BE997455 E/Mt, (E) 10
pds 78 VcCyP peptidylprolyl isomerase BE997455 E/Mt, (E) 13
pds 80 Ubiquitin-like protein AL376595 E/Mt 5
pds 82 aba-Responsive protein abr 17 BF648027 (E) 5
pds 84 Gly-rich RNA-binding protein BI309824 (E), E/Mt 11
pds 85 Acidic ribosomal protein AL383563 E/Mt 5
pds 91 Legumin (minor small)a BI311437 (E) 7
pds 95 Rubisco small subunit BF519894 E/Mt 8
pds 94 Plastocyanin precursor BF005687 E/Mt 6
cls 2 Cell division cycle prt 48 (valosin contain. prt) P54774 S 10
cls 5 Heat shock protein 70-kD (Bip A) T06598 E/Hv, N, S 10
cls 6 Luminal-binding protein CAC14168 N, S, E/Hv 8
cls 7 Psst 70 Q02028 N, S, E/Hv 9
cls 8 Putative-luminal binding protein CAC14168 N, S, (E) 9
cls 10 Leucyl aminopeptidasea S57811 N, S 4
cls 9 70-kD heat shock protein Q01899 N, S, E/Mt, (E) 8
cls 20 Catalasea P49315 N, S, (E) 6
cls 15 Selenium-binding proteina CAC67501 N, (E) 12
cls 16 Selenium-binding proteina CAC67501 N, (E) 7
cls 14b Calreticulin Q40401 N, E/Mt 4/(135)
cls 14b Nucleosome assembly protein 1a S60893 E/Mt NA/(208)
cls 18 ATP synthase beta subunit CAA75478 N, S, E/Mt, (E) 6
cls 19 Inosine-5′-monophosphate dehydrogenasea AAL18815 N, E/Mt, (E) 5
cls 21 Hydroxymethyltransferasea AW980652 E/Mt, (E) 4
cls 22 Enolase CAB75428 N, E/Mt 6
cls 23 SAM synthetase AAG17666 N, E/Mt, (E) 5
cls 24b Glc-6-phosphate 1 dehydrogenase Q42919 S 5
cls 24b SAM synthetase 2 Q96552 N, E/Mt, E/Gm 5
cls 28 Aspartate aminotransferase P28011 N, S, E/Mt 17
cls 29 Putative heat shock protein AAK63929 N 5
cls 30 12-Oxophytodienoic acid 10,11-reductasea BG648922 E/Mt, (E) 6
cls 31 12-Oxophytodienoate reductase (OPR2)a AW776305 E/Mt 6
cls 27 RAD23 (ubiquitin-like protein)a AW586882 (E) 4
cls 33 Probable mannitol dehydrogenase AW981164 (E) 6
cls 32 Alcohol dehydrogenasea P12886 S, (E) 5
cls 35b Catalasea P45739 S 5
cls 35b Fru-1,6-biphosphate aldolase P46257 N, S, E/Mt 5
cls 37 2-Nitropropane dioxygenase-like proteina BF518520 E/Mt, (E) 7
cls 39 Fructokinase AW584645 E/Mt, N, S 8
cls 46 Beta-1,3-glucanase BF650622 E/Mt 4/(378)
cls 49 Stromal l-ascorbate peroxidase precursor BE941206 E/Mt, (E) 5
cls 53 Cytochrome b5 reductasea NP 568391 E/Mt NA/(406)
cls 60 Glyceraldehyde-3-phosphate dehydrogenase P54270 N, S 5
cls 62 Rubisco, small subunit PS6577 S 4
cls 64 Proteosome subunit α-type 5 (20S subunit) Q9M4T8 E/Mt, N, S NA/(449)
cls 67 NADH ubiquinone oxidoreductase BG448277 (E) 4
cls 74 vcCyp (peptidylprolyl isomerase) AW775250 E/Mt 9
cls 78 Peroxiredoxin TPx1 (thioredoxin peroxidase) AW559683 (E), E/Mt 11
cls 81 Peptidylprolyl isomerase (immunophilin) BF635887 E/Mt, (E) 5
cls 83 Disease resistance response proteina BE942549 E/Mt, S 9
cls 82b aba-Responsive protein BF648027 E/Mt, (E) 10
cls 82b Leghemoglobin 2 (Pprg2)a P27993 S 4/(680)
cls 87 Class 10 PR proteina Q43560 N, S, E/Mt 5
cls 85 Cytochrome C-555a P00124 S 5
cls 88 Nucleoside diphosphate kinase P47922 S 7
cls 86 Gly-rich RNA binding protein AAF06329 E/Mt, N 9
cls 90 Immunophilin AL377066 E/Mt 4
cls 92 Acidic ribosomal protein (60S) AL378424 E/Mt 5
cls 96 10-kD chaperonina AL377948 E/Mt 4/(121)

Table I contains a list of identified proteins from specific tissues of Medicago truncatula. The data are separated by tissue and include: an assigned protein spot no. (see Fig. 1), database accession no. of the best match, databases that yielded concurrent identifications, and the number of MALDI-TOFMS peptides matched. LC/MS/MS was performed on select proteins, and Mascot scores for these proteins are provided in parentheses. Not applicable (NA) denotes that no MALDI data was used in the identification. Significantly more detailed data supporting the protein identifications can be found in Supplemental Table I. Accession no. is GenBank no. Databases have following notations: N, NCBI; S, SwissProt; E, pdbESTothers; (E), MtESTonly. Species are noted as Hv, Hordeum vulgare; Sb, Sorghum bicolor, LE, Lycopersicon esculentum; and Mt, Medicago truncatula.

a

 Putative unique protein identified only in one tissue. 

b

 Multiple proteins identified in this 2-DE spot. 

Supplemental Table I (see www.plantphysiol.org) contains extensive data that document the analytical rigor of the protein identifications. These data include an assigned protein spot number (see Fig. 1), an arbitrary peptide mass fingerprint data quality (PMFQ) score of 1 to 5 (with 5 being best, see “Materials and Methods”) to allow assessment of data quality, the number of peptides matched, m/z accuracy and sd of peptides matched, percent protein coverage, theoretical molecular mass and pI, experimental molecular mass and pI, the database accession number of the best match and the databases that yielded concurrent identifications, LC/MS/MS data for select proteins, and the organism to which the matching protein was identified through similarity. For protein identifications determined using the SwissProt and National Center for Biotechnology Information (NCBI) databases, the organism reported in supplemental Table I is that from which the protein or gene was directly sequenced. In the case of most ESTs, protein identifications were first made to barrel medic ESTs that were not annotated. These ESTs were annotated by comparison with The Institute for Genomic Research (TIGR) gene indices or through similarity to other organisms via BLAST. The organism yielding the highest similarity score is the organism reported for EST database identifications in Supplemental Table I. Protein function is also classified and recorded in Supplemental Table I. A minimum of four peptides is statistically necessary to qualify as a confident match (Pappin et al., 1993). Use of additional criteria such as those listed above are advised and increase the confidence in the protein identification. Most proteins identified in Table I had high confidence identifications; however, a small number (23) of the original proteins were identified using only four peptides that had poor m/z accuracies (i.e. above 30 ppm). These protein identifications were considered marginal and were further interrogated using LC/MS/MS. LC/MS/MS data were queried against the same three databases (NCBI, SwissProt, and dbESTothers) used to query MALDI-TOFMS data. The majority of identifications were found to be valid, but four MALDI-TOFMS proteins were revealed as misidentified. The correct LC/MS/MS identifications for these four are reported in Table I. Tandem data was also used to confirm a specific MALDI-TOFMS identified protein questioned by a reviewer in leaves (spot no. 51) that had a minimal four matching peptides and low sequence coverage. This identification was confirmed using LC/MS/MS. These results are provided in Figure 3 and include a search score from dbESTothers (12 peptides matched and Mascot score of 513), representative TOFMS data, and tandem TOF/MS/MS data. Nine proteins from the original list of 23 marginal identifications could not be validated by LC/MS/MS due to limited sample, therefore, were omitted from Table I.

Figure 3.

Figure 3

Representative LC/MS/MS data obtained on an ABI Qstar Pulsar for leaves (spot no. 51) confirming the identification of this protein as a TPR repeat protein (accession no. AW694998) as suggested by MALDI-TOFMS peptide mass fingerprinting. The data include: A, database search score and peptides successfully identified; B, example TOF/MS; and C, tandem TOF/MS/MS mass spectra for the peptide observed at m/z 677.62.

Database Query Strategies and Success Rates

In an attempt to maximize our protein identification success rate for barrel medic proteins, we have used protein (SwissProt), nucleotide (NCBI), and EST databases (dbESTothers, and barrel medic-only ESTs from NCBI) for queries of experimental peptide mass maps (Mann and Wilm, 1994; Pappin et al., 1993; Yates, 1998a,a, 1998b; Choudhary et al., 2001). The specific databases used to successfully identify each individual protein are reported in Table I, and a summary of the protein identification success rates is provided in Table II. In most cases, the resulting peptide mass maps were of high quality; however, this did not always translate to successful protein identification.

Table II.

Summary of protein identification success rates

Tissue Protein Databases EST Databases Overlap Total
Leaves 37 (44%) 42 (50%) 15 (18%) 64/84 (76%)
Stems 28 (30%) 33 (38%) 15 (16%) 46/94 (49%)
Roots 12 (13%) 40 (43%) 12 (13%) 40/94 (43%)
Flowers 16 (17%) 40 (43%) 13 (14%) 43/94 (46%)
Pods 9 (10%) 59 (65%) 7 (8%) 61/91 (67%)
Cells 33 (35%) 40 (40%) 23 (24%) 50/94 (53%)
 Total 135/551 (25%) 254/551 (46%) 85/551 (15%) 304/551 (55%)

Success rates are reported as total no. of proteins identified and as a percentage of those identified relative to those processed in parentheses.

The average protein identification success rate for all tissues using only the protein databases (SwissProt and NCBInr) was 25%, whereas the average protein identification success rate for all tissues using the EST database was 46% (see Table II). Interestingly, the average overlap in the number of proteins identified in both databases was only 15%; thus, searching both databases was complementary and not necessarily redundant. For example, the peptide maps provided in Figure 2 are of similar high quality; however, spectra 2b could not be identified successfully in the SwissProt or NCBI databases and could only be identified successfully through EST database queries. This complementary searching strategy yielded a final protein identification success rate of 55% for our representative protein set.

Strategies using multiple database queries have enhanced our ability to identify proteins even in the absence of a genomic sequence. Our overall success rate of 55% is good when compared with other reports focused on organisms without sequenced genomes. For example, a recent publication concerning pea (Pisum sativum) chloroplast proteins reported a success rate of 15% using mass spectrometry and Edman sequencing (Peltier et al., 2000), whereas a barrel medic root proteome article reported a success rate of 37% (Mathesius et al., 2001). Our protein identification success rates are approaching those for organisms with sequenced genomes. For example, identification success rates of 54% using MS only (Kruft et al., 2001) and 69% (Millar et al., 2001) using MS, immunoblotting, and Edman sequencing were reported for Arabidopsis mitochondrial proteomes. Further, protein identification success rates in human proteome projects are approximately 60% (Stensballe and Jensen, 2001). We expect protein identification success rates to continually increase as the population of unique ESTs continues to increase, as full-length EST sequences are generated, and as genomic sequence of barrel medic becomes available (Comment, 2002).

The average length of barrel medic ESTs used to successfully identify proteins in all organ/tissues was 597 ± 177 nucleotides (or 199 ± 59 amino acids). For proteins in the 30-kD range or less, this represents complete or almost complete sequence coverage by the EST; thus, our confidence in these identifications is very high. For larger proteins this only represents partial protein sequence; however, our data demonstrate that the current EST information is sufficient to allow confident identifications. Additional experimental data such as number of peptides matched, m/z accuracy, molecular mass, and pI provide additional confirmation of identification. It is logical that a strategy including both protein and nucleotide databases would yield greater protein identification rates as some mRNAs, such as mitochondrial and chloroplast-encoded mRNAs (i.e. Rubisco large subunit), do not contain poly(A+) tails (Sugiura and Takeda, 2000). These poly(A+) tails are used in the initial stages of affinity purification of mRNAs in the cDNA/EST library generation process (Sambrook et al., 1989). Messenger RNAs without poly(A+) tails pass through the affinity purification process and are unlikely to be sequenced. These proteins are poorly represented in the EST libraries but are present in many of the protein databases. Therefore, querying both provides greater identification success rates.

Protein Identifications and Functional Classifications

Putative protein functional classifications were assigned based on similarity to better understand the biological processes encompassed by the proteins identified using a 2-DE proteomics approach. Summaries of protein functions observed in the barrel medic proteome are provided in Figure 4. Protein functions were assigned using the protein function database Pfam (http://www.sanger.ac.uk/Software/Pfam/; Bateman et al., 2002) or Inter-Pro (http://www.ebi.ac.uk/interpro/; Apweiler et al., 2001). Protein function was categorized into 13 classes as previously described for Arabidopsis (Bevan et al., 1998). The “unclear” protein class included proteins that were successfully matched to putative proteins from such sources as the Arabidopsis genomic sequence but do not yet have a known function. Most proteins could be unambiguously classified; however, a small number of proteins were associated with multiple functions. Classifications for these proteins were based on their predominate function. Discussions concerning a portion of the proteins observed and their functional role are presented below in relation to the tissue in which they were observed.

Figure 4.

Figure 4

Summary of the distribution of tissue specific identified protein classes as determined using the protein function database Pfam (http://www.sanger.ac.uk/Software/Pfam/) and classification schema previously reported for Arabidopsis (Bevan et al., 1998).

Leaves

Photosynthetic enzymes dominated the 2-DE profiles of leaf tissue. Approximately 40% of the leaf protein mass visualized with Coomassie staining can be attributed to a small number of enzymes including the large subunit of Rubisco (26.1%), Rubisco small subunit (2.8%), Rubisco activase (3.2%), and oxygen-evolving protein (6.4%). Most of these proteins appear as multiple spots, and the reported percentages are estimates including all identified spots. The relatively high concentrations of the abundant photosynthetic enzymes demonstrate the importance of these enzymes; however, the prominence of these proteins, specifically Rubisco, in specific regions of the gel, generally contributes to lower quality 2-DE gels and prevents the observation of moderate or lower abundance proteins due to their relatively lower concentrations and the limited dynamic range of common 2-DE staining techniques including Coomassie. Other proteins involved in photosynthesis and carbon fixation were observed in leaf, including: PS1 iron-sulfur protein, ATP synthase, glyceraldehyde 3-phosphate dehydrogenase, malate dehydrogenase, triose phosphate isomerase, tartrate dehydrogenase, and Fru biphosphate aldolase. Many of these photosynthetic enzymes were also observed at lower levels in other green tissues such as stems and immature seed pods.

Several signal transduction proteins were observed in leaves, including the multiple domain protein remorin. Remorin binds simple and complex galacturonide and its C-terminal region has functional similarities to viral intercellular communication proteins (Reymond et al., 1996). Other proteins involved in protein destination or transport included chaperonin 21 precursor, an ankryin repeat protein, and an ATP-binding cassette transporter. Ankyrin repeat proteins have been associated with protein-protein interaction (Gorina and Pavletich, 1996), transcriptional regulation (Batchelor et al., 1998), and transcription inhibition (Jacobs and Harrison, 1998). ATP-binding cassette transporters are membrane-localized proteins that transport small hydrophilic molecules across membranes and include an ATP-binding domain (Higgins, 1992; Jasiñski et al., 2001). Interestingly, other membrane localized proteins were identified and included a chloroplast membrane-associated 30-kD protein (Li et al., 1994) and ATP synthase. The identifications of membrane proteins are important because these proteins are generally underrepresented in 2-DE proteomic studies due to low solubility (Molloy et al., 1998). The observation of plant proteins in 2-DE relative to their general average hydropathicity score has been discussed recently (Millar et al., 2001). Additional proteins identified in leaf tissues included: two cell division proteins, filamentous temperature sensitive protein K homolog cell division protein, miotic cyclin B1-1, DNA mismatch repair protein, RNA-binding protein, transcription factor, and a Gly-rich cell wall structural protein.

Stems

The 2-DE reference map of barrel medic stem proteins was of better quality than that of leaves, primarily due to a lower abundance of Rubisco. Many of the same photosynthetic and carbon metabolism enzymes reported above for leaf were also identified in stems. In addition, several members of the ATP complex associated with energy metabolism were observed. Proteins involved in protein destination and storage were also identified and included the 26S proteasome AAA-ATPase subunit and a 20S proteasome subunit alpha type 7 protein. The 26S proteasome is responsible for protein degradation of endogenous proteins.

Proteins involved in secondary metabolism are of specific interest to our functional genomics project focused on natural products (National Science Foundation Plant Genome Research Project no. 0109732). Several secondary metabolic enzymes were identified in stems and included cinnamoyl-CoA reductase, which plays a role in lignin biosynthesis, and isoflavone reductase-like oxidoreductase, an enzyme involved in phytoalexin production. Stems also revealed several kinases including adenosine kinase, fructokinase, Rib-phosphate pyrophosphokinase, uridylate monophosphate kinase, and nucleoside diphosphate kinase1. A number of RNA binding proteins thought to be important in transcription were also observed. Multiple ribosomal proteins including 40S and 60S ribosomal proteins were identified and function in protein synthesis.

Roots

The roots of legumes are of special interest because of their role in the characteristic symbiotic relationships formed with microorganisms. Although recent articles have been published on the proteomes of barrel medic nodulated root (Bestel-Corre et al., 2002) and uninoculated root (Mathesius et al., 2001), we have included roots as part of our survey for completeness and comparison. Approximately 24% of root proteins identified in this report were associated with plant disease/defense and included peroxidases, superoxide dismutases, ripening related protein, abscisic acid (ABA)-responsive protein, and chitinase. Peroxidases are generally involved in hydrogen peroxide detoxification and are induced by bacterial infection (Cook et al., 1995; Peng et al., 1996). Peroxidases also play a major role in lignin biosynthesis (Lewis and Yamamoto, 1990; Davin and Lewis, 1992). Several glucanases were also identified. These normally constitutively expressed proteins are induced in response to fungal and viral elicitation (Meins et al., 1992). Proteins involved in secondary metabolism of the flavonoid/isoflavonoid pathway made up another 8% of the identified root proteins. Similar to leaves, several membrane-localized proteins such as ATPase and cytochrome C oxidase were also observed in roots.

Relative to other tissues, a larger percentage (i.e. 15%) of the barrel medic root proteins were identified as putative proteins or unannotated proteins. These proteins could be confidently linked to specific ESTs or predicted open reading frames whose functions are still unknown. The observation of unannotated proteins provides experimental evidence of putative/predicted proteins that offer exceptional opportunities in gene annotation (Mann and Pandey, 2001). Because roots appear to have the largest percentage of proteins of unknown function, it is possible that many of these proteins may be specific to legumes and may be involved in microbial interactions characteristic of legumes.

The root 2-DE reference map and protein identifications reported here are consistent with the previous studies by Mathesius et al. (2001) in young, uninoculated barrel medic roots, and by Bestel-Corre et al. (2002) using roots inoculated with Glomus mosseae or Sinorhizobium meliloti. Similar to our results listed above, Mathesius and coworkers reported 5% of their identified root proteins to be associated with flavonoid metabolism and 18% with defense and stress response, yielding a total of 23% defense-related proteins. Further, the total overlap in identified root proteins between the current study and the detailed report by Mathesius and coworkers was over 50%. These included heat shock 70 protein, protein disulfide isomerase, glyceraldehyde-3-phosphate dehydrogenase, isoflavone reductase and chalcone isomerase, a glucosidase and a Cys proteinase, ascorbate peroxidase, alpha-fucosidase, and a ripening-related protein. Many of these proteins had very similar molecular mass and pI values in both studies. For example, cytochrome c oxidase was reported to have a gel molecular mass/pI of 37 kD/4.2 by Mathesius and coworkers, whereas it was observed at a molecular mass/pI of 36 kD/4.9 in the present study. Similarly, ripening related protein had an experimental molecular mass/pI of 16 kD/5.8 in this study and 18 kD/5.5 or 17 kD/6.2 (isoforms) in the Mathesius et al. work. Interestingly, some proteins demonstrated varied slightly between studies. For example, VcCyp was observed at a gel molecular mass/pI of 22 kD/6.3 in the current study as opposed to molecular mass/pI 20 kD/8.8 in Mathesius and coworkers. These slight inconsistencies may represent real differences in posttranslational modifications of the proteins or may be the result of experimental variability.

Proteins identified in all three investigations include a peroxidase precursor, cytochrome c oxidase subunit 6, VcCyp (cyclophilin), a superoxide dismutase, and ABA-responsive protein. Only the ABA-responsive protein and VcCyp were reported to be constitutively expressed by Bestel-Corre et al. (2002), whereas the others proteins common to all three investigations were identified by them as symbiosis-related proteins. Bestel-Corre also identified and reported profucosidase as a symbiosis-related protein. This protein was identified in the current report using uninoculated roots.

Interestingly, two proteins identified in this investigation were not found in either of the other two studies. Acidic glucanase was observed as a relatively abundant protein in the present report (rts#39), but due to its pI of 8.4 and the fact that Mathesius and coworkers' first dimension immobilized pH gradient (IPG) pH range was 4 to 7, it was not present on their gels. We also identified three isoforms of chitinase, all with a pI above 7, that are missing in the Mathesius et al. work. Bestel-Corre et al. (2002) used a pH of 3 to 10 first dimension IPG; thus, these proteins should be visible in their gels. Unfortunately, the total number of identified proteins in the Bestel-Corre report was limited, and these proteins were not identified by them.

Overall, these three reports (this report; Mathesius et al., 2001; Bestel-Corre et al., 2002) provide a wealth of information on the barrel medic root proteome. There are significant similarities between the reference maps that serve as landmarks and can be used for navigation through the root proteome. For example, ABA-responsive protein is one of the most abundant root proteins in each of these investigations. Its relative position can be used to locate PR10 (a highly abundant low-molecular mass protein reported by Mathesius and coworkers next to ABA-responsive protein, rts#80, that was not identified in the present study) in the present and other studies based on similarity. Unfortunately, absolute comparisons of the proteome reference maps are not always straightforward as demonstrated by the differences in molecular mass and pI values shown above for VcCyp.

Flowers

The proteome of flowers contained proteins from almost every functional category. The major portion (38%) of the identified proteins was associated with energy production including glycolysis, pyruvate metabolism, and the tricarbonylic acid (TCA) cycle. Another 21% of the identified proteins were involved with protein synthesis or protein destination. For example, peptidyl prolyl isomerase accelerates protein folding by catalyzing cis-trans isomerization in oligopeptides. Several proteins identified were related to disease/defense or involved in secondary metabolism, such as chalcone isomerase. These enzymes are commonly associated with flower pigmentation or UV protection and serve as important defense proteins in developing seeds. One of the proteins identified specifically in the flower proteome was profilin. Profilin normally binds to monomeric actin to prevent polymerization, although under certain conditions it can promote the polymerization of actin. It occurs in all organs, but is most abundant in mature pollen, making it more likely to be identified in flowers. Many proteins associated with oxidative responses were also identified in flowers. Low levels of a few photosynthetic enzymes were observed due to collection of green sepals with the flowers.

Seed Pods

The intact seed pod proteome was generated from tissue containing both seed and pod tissue. The proteins visualized and identified in the barrel medic seed pod proteome consisted primarily of globulins or seed storage proteins that serve as a nitrogen/nutritional source for developing plants. Several members of the superfamily of “cupins” were identified in barrel medic seed and included 7S and 11S globulins (Dunwell, 1998). The 11S globulins are non-glycosylated proteins and include glycinin and legumin (Hayashi et al., 1988; Duranti et al., 1995). The 7S proteins are a series of similar but progressively larger variations of the same subunit and include vicilin, convicilin, and legumin. It is also interesting to note that 85% of the proteins in this group have been matched to other legumes, suggesting a high level of sequence similarity in legume storage proteins. All of the barrel medic seed storage proteins were observed at multiple molecular masses and pIs. These may represent various stages of protein synthesis and degradation, posttranslational processing not observable at the genome or transcriptome level, or may be the products of multigene families. Similar variations in observed isoforms have been reported for Arabidopsis 12S seed storage proteins in mature and developing seeds (Gallardo et al., 2001).

A significant number of disease-/defense-related proteins were observed in seed pods including peroxidases, osmotin, and ABA-responsive protein. These proteins help defend the plant in early stages of development. Other proteins associated with carbon metabolism, nutrient acquisition, and protein syntheses were also observed. These proteins supply necessary nutrients to the developing plant. Several photosynthetic proteins were observed and are attributed to the collection of immature green seed pods.

Cell Suspension Cultures

Cell suspension cultures were initiated from barrel medic root calli (Dixon, 1980) and their proteome surveyed. Cell culture proteins were extracted with a Tris buffer and, thus, consisted primarily of cytosolic proteins. Most of the identified proteins from cell cultures could be classified in four categories: energy (24%), protein destination and storage (24%), metabolism (22%), and disease/defense (18%). The defense proteins were primarily composed of pathogenesis-related proteins. The most abundant proteins identified were an ABA-responsive protein and a class 10 PR protein. Other disease/defense proteins identified included selenium-binding protein, catalase, and peroxiredoxin. Several of the metabolic enzymes identified in cells were not identified in any other tissue. One of these, 12-oxophytodienoate reductase, is associated with the conversion of 12-oxophytodienoic acid to jasmonic acid.

In some instances, more than one protein was identified with high confidence in each protein spot. For example, spot cls#82 contained peptides that could be associated with both ABA-responsive protein and leghemoglobin. Interestingly, leghemoglobin was identified as a root nodule-specific isoform (Gallusci et al., 1991). This protein is root specific and is induced during nodulation; however, it is generally not observed at appreciable levels in uninoculated roots. Thus, the observation of leghemoglobin is unique here, and this protein may be induced by the cell culturing process. Further, it may also suggest a “memory” effect or root-specific expression pattern observed in the cell cultures that were originally generated from root material (Dixon, 1980). Although many flavonoid-related proteins were observed in other tissues such as root and stem, none were identified in the limited set of proteins surveyed in unchallenged cell cultures.

The proteome of suspension cell cultures is of special interest because the tissue is relatively homogeneous and, therefore, provides a good model tissue system for experiments directed toward integrated functional genomic studies of natural products (https://www.fastlane.nsf.gov/servlet/showaward?award=0109732). Future work will focus on generation of an extensive 2-DE proteome reference map of suspension cell cultures and the changes in the proteome after biotic and abiotic elicitation.

Tissue-/Organ-Specific Expression of Proteins

Many of the proteins identified were redundant as an average of 61% were identified in one or more tissues of barrel medic. The remaining 39% were identified in only one tissue and have the potential of being uniquely expressed in specific tissues/organs based on our limited dataset. The quantities of redundant and potentially unique proteins identified in each specific tissue are summarized in Figure 5. Many of the putative unique proteins are related to the primary function of the specific tissue. For example, photosynthetic enzymes such as PSI iron-sulfur protein and plastid specific ribosomal proteins were only identified in leaves. Other proteins identified only in a specific tissue include the seed storage proteins glycinin, convicilin, and legumin in seed pods. Profilin, a known pollen allergen, was also identified in flowers. These are limited examples illustrating the unique nature of the proteome, but we are hopeful that continued evaluation of the tissue- and organelle-specific proteomes of barrel medic will yield further insight into the specialized functionality of these tissues.

Figure 5.

Figure 5

Bar graph summarizing the number of redundant proteins identified in more than one tissue (A) and the number of putative tissue-specific proteins identified in a single tissue only (B). The graph is segregated by tissue. A total of 61% of the proteins were found to be redundant and 39% were found to be putatively tissue specific. Guarantee of specificity at this stage is difficult due to the limited size of the reported protein dataset relative to the total proteome.

Comparison of Barrel Medic Proteome and Transcriptome

A better understanding of the relationship between mRNA and protein abundances is needed to elucidate the processes and regulation of transcription and translation. Several recent publications present conflicting views concerning the correlation of mRNA and protein levels. Gygi et al. (1999) suggested that there is a poor correlation between most yeast mRNAs and protein levels with the exception of only the most abundant proteins. In contrast, Futcher et al. (1999) reported a good correlation between yeast mRNA abundances, measured by both SAGE and microarray chips, and protein abundances.

Given the large abundance of EST information for barrel medic (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi/), a simple comparison of identified protein levels with their corresponding mRNA levels was performed. Currently, over 145,000 EST sequences from approximately 20 different non-subtractive, non-normalized (J. White, TIGR, personal communication) cDNA libraries are available (Covitz et al., 1998; Cook, 1999; Bell et al., 2000; Gyorgyey et al., 2000). It is possible that a select few sequences from these libraries are being held back by the contributors, but these are few and specialized, and should have a minimal affect on the following comparisons. The cDNA libraries were used to estimate or “count” the relative expression level of a particular barrel medic transcript based on the repetitive occurrence of sequences from the same mRNA (Audic and Claverie, 1997; Ewing et al., 1999). The relative abundances of the top 200 ESTs for barrel medic leaves, stems, uninoculated roots, flowers, seed pods, and elicited cell cultures were quantified in this manner and are provided in Supplemental Table II (see www.plantphysiol.org). The relative abundances for the ESTs were generated using cDNA libraries originating from similar tissues; however, these tissues were from multiple and separate origins. Comparisons were based on functional annotation and not necessarily on specific protein or GenBank numbers, i.e. oxygen-evolving protein as opposed to P14226. Although this comparison is not of high analytical rigor, it does provide insight into correlation of protein and mRNA levels.

Although the proteins were arbitrarily chosen across pI and molecular mass ranges, most represent relatively abundant proteins typical of 2-DE and CBB 250 staining. Based on the 2-DE protein quantification results presented here, 67% of the identified proteins were in the top 100 most abundant proteins visualized with Coomassie, whereas 97% of the proteins identified were in the top 200 most abundant proteins. Thus, identified proteins were compared with the top 200 most abundant tissue-specific ESTs in related cDNA libraries. The percentages of the identified proteins observed by 2-DE that were also observed in the top tissue specific ESTs are summarized in Table III. This summary reveals that an average of 50% of the identified proteins were observed in the top 200 tissue-specific ESTs. An evaluation of the top 100 tissue-specific ESTs shows that 40% of proteins identified in 2-DE experiments were also observed in the 100 most abundant tissue-specific ESTs. These results suggest a moderate level of correlation between mRNA and protein. For example, leaf proteins such as the photosynthetic enzymes Rubisco small subunit and oxygen-evolving protein appear to be highly correlated with their respective mRNA levels.

Table III.

Summary of the correlated protein and EST libraries

Tissue No. of Proteins Matched in Top 100 ESTs/No. of Identified Proteins No. of Proteins Matched in Top 200 ESTs/No. of Identified Proteins
Leaves (7,831 ESTs) 21/64 (33%) 30/64 (47%)
Stems (10,314 ESTs) 12/46 (26%) 16/46 (35%)
Roots (6,593 ESTs) 16/40 (40%) 19/40 (48%)
Flowers (3,404 ESTs) 16/43 (37%) 19/43 (44%)
Pods (4,587 ESTs) 45/61 (74%) 48/61 (79%)
Suspension cells (8,926 ESTs) 12/50 (24%) 19/50 (38%)
 Total (41,655 ESTs) 122/304 (40%) 151/304 (50%)

Ninety-seven percent of all identified proteins were quantified as being in the top 200 most abundant proteins observed in Coomassie-stained 2-DE gels. The occurrence of these identified proteins in the top 100 and 200 ESTs is reported. The no. of EST sequences used for EST counting is listed in parentheses under each tissue identifier.

Interestingly, some highly expressed proteins such as Rubisco large subunit were not observed in the EST libraries. As mentioned earlier, we believe that this is due to the chloroplast-encoded nature of certain mRNAs, such as Rubisco large subunit, which do not contain poly(A+) tails necessary for purification and cDNA library preparation (Sambrook et al., 1989).

Highly abundant leaf ESTs not represented in the protein data to date included aquaporins, chlorophyll-binding proteins, and cytochrome B6. This apparent lack of correlation can be explained by the integral thylakoid membrane nature of these proteins. It is commonly accepted that integral membrane proteins are underrepresented in 2-DE due to poor solubilization. Lipoxygenase also appeared in the top 100 clones of five tissue-specific EST libraries; however, it was never identified in the protein dataset. Plants express both cytosolic and chloroplast isoforms of lipoxygenase, most of which have a molecular mass of approximately 100 kD. A possible explanation for the absence of this protein from the protein data could be the inherent discrimination against high-molecular mass proteins encountered during isoelectric focusing using IPG strips of fixed gel composition (Candiano et al., 2002).

The lack of correlation between mRNA and protein could not always be explained. For example, identified stem proteins included acid phosphatase, actin, and osmotin; however, these proteins were absent or of very low abundance in the stem-specific EST library. Other proteins identified but not represented in the EST libraries included: RNA-binding protein and ankyrin repeat protein in flowers and hydroxyacyl glutathione hydrolase in roots. Interestingly, elongation factor 1-alpha was observed as a highly expressed EST (top 50) in all tissues but was not observed in the protein set. The lack of correlation may be due to the relative turnover rates of both transcripts and proteins, or translational controls such as codon bias (Gygi et al., 1999), mRNA secondary structure (Wang and Wessler, 2001), or upstream open reading frame repression (Wang and Wessler, 1998).

Based on the limited comparison above, we estimate a moderate 50% correlation between protein and mRNA levels. This value suggests a correlation that is higher than that reported by Gygi et al. (1999) but lower than that reported by Futcher et al. (1999). If the limitations imposed by the chloroplast-encoded proteins, poor representation of membrane proteins in 2-DE, and our limited protein dataset are taken into account, a higher correlation than that reported may be possible. Although a significant level of correlation is perceived, there are still many specific examples that show poor correlation.

CONCLUSIONS

To date, we have identified over 300 proteins in specific tissues of barrel medic. Protein identifications using only protein databases were 25% successful even with good peptide mass fingerprints. Significant increases in protein identification success rates were achieved by using EST sequence databases. Using complementary protein, nucleotide, and EST sequence libraries, we were able to achieve a protein identification success rate of 55% for our representative protein dataset. We consider this a relatively high success rate in the absence of a genomic sequence and in comparison with other plant proteomic projects. Tentative consensus searches currently are being performed and confirm many of the proposed identifications in this study (Asirvatham et al., 2002b); however, this topic will be discussed in a separate publication.

The 2-DE profiles of various barrel medic tissues provide reference maps for future proteomic comparisons of genetic mutants, biotically and abiotically challenged plants, and/or environmentally challenged plants. The identified proteins provide a survey of those proteins observable using current technology and also serve to define the limitations of the reported proteomics approach. For example, it will be difficult to study other physiological processes besides photosynthesis and carbon metabolism in leaves using current proteomic technologies due to the very high level of these proteins in leaves. Further, the proteins identified serve as physiological markers of tissue-specific protein expression. Based on the limited dataset, 39% of all the identified proteins were only identified in a single tissue. These putative unique proteins provide valuable insight into the specialized physiological function of each of the tissues. For example, a comparison of roots and root-derived cell cultures can yield insights into the physiological phenomena associated with the dedifferentiation of root tissue during establishment of a suspension cell culture.

A comparison between the levels of the identified proteins and mRNA levels quantified through EST counting was performed. It is estimated that on average 50% of the proteins appear to be correlated with their corresponding mRNA levels; conversely, 50% are not. Information on both transcript and protein levels can be utilized for targeting potential regulatory genes that are characterized by high transcript but low protein levels.

The proteins identified in this study as unclear or putative represent unique opportunities to probe molecular function. Systematic perturbations and monitoring of these proteins would be expected to yield insight into function. These abundant but unclassified proteins have been linked to specific ESTs and, thus, establish the feasibility to experimentally monitor both the protein and mRNA. The relatively high abundance of these proteins further stresses the biological but unknown importance of these proteins in barrel medic.

This report provides a comprehensive overview of the barrel medic proteome and provides a good foundation for future comparative proteomic efforts associated with this important model plant. The importance of barrel medic is further emphasized by the recent recommendation from the National Academy of Sciences that the goals of the National Plant Genome Initiative for 2003 through 2008 should focus on a small number of key species including barrel medic (http://books.nap.edu/books/0309085292/html/index.html). This work serves as a major step in this direction for a key plant species. As we seek to better understand gene function and to study the holistic biology of systems, it is inevitable that we study the proteome.

MATERIALS AND METHODS

Plant Material and Protein Extraction

Differentiated plant tissues were collected from barrel medic (Medicago truncatula cv Jemalong A17) grown in an environmentally controlled growth chamber and maintained under standard conditions (Asirvatham et al., 2002a). Eight-week-old plants were used for leaf and stem tissue. The top two apical unfolded trifoliates were sampled for leaf tissue, and stem tissue was restricted to the first two apical internodes. Flowers included all stages from buds until petal browning and all parts except the peduncles. Green seed pods were collected from a variety of developmental stages (including very young pods to those with maturing seeds) of 3-month-old plants. Roots were collected from seedlings grown in perlite 2 weeks after planting. Total protein from these tissues was extracted according to a reported method (Tsugita et al., 1994). In brief, tissues (0.4–1.0 g) were ground in liquid N2 and proteins precipitated at −20°C with 10% (w/v) TCA in acetone containing 0.07% (w/v) 2-mercaptoethanol for at least 45 min. The mixture was centrifuged at 35,000g at 4°C for 15 min, and the precipitates were washed with acetone containing 0.07% (w/v) 2-mercaptoethanol, 1 mm phenylmethylsulfonyl fluoride, and 2 mm EDTA. Pellets were dried by vacuum centrifugation and solubilized in 8 m urea, 4% (w/v) CHAPS, 20 mm DTT, 0.1% (v/v) Biolytes (pH 3–10; Bio-Rad Laboratories, Hercules, CA; Molloy et al., 1998).

Cell cultures derived from barrel medic cv Jemalong A17 roots were grown in the dark in shaker flasks and suspended in Schenk and Hildebrandt (SH) medium with transfer to fresh medium every 2 weeks. Cells were harvested 4 d after transfer, washed once with fresh SH medium and once with SH:water (1:1 [v/v]), ground in liquid N2, and extracted with 40 mm Tris (pH 9.5), 50 mm MgCl2, 2% (w/v) polyvinylpolypyrrolidone, 1 mm phenylmethylsulfonyl fluoride, and 120 units mL−1 endonuclease (catalogue no. E8263, Sigma, St. Louis) by sonication (Molloy et al., 1998). After centrifuging at 12,000g, 4°C, for 10 min, proteins in the supernatant were precipitated on ice with 12% (w/v) TCA, centrifuged, and washed with cold acetone. The pellet was air dried and resuspended in solubilization buffer.

Protein Quantification and Electrophoresis

Protein concentrations of all tissue extracts were quantified using the Bradford method (Bradford, 1976) and a commercial dye reagent (Bio-Rad) with bovine serum albumin as a standard. Eleven-centimeter immobilized pH gradient (IPG) strips (linear, pH 3–10) from Bio-Rad were rehydrated at 20°C with 0.75 to 1.0 mg of protein in 300 μL for 15 to 16 h. Focusing was carried out in a Bio-Rad Protean IEF Cell for a total of 35,000 volt hours. After focusing, strips were equilibrated with reduction and then with alkylation buffers, loaded onto a 12% (w/v) acrylamide gel, and run at 25 mA gel−1 (Asirvatham et al., 2002a). Gels were stained overnight with Coomassie Brilliant Blue R-250 and destained the next day. Gel images were digitized with a Bio-Rad FluorS equipped with a 12-bit camera. Experimental molecular mass and pI were calculated from digitized 2-DE images using standard molecular mass marker proteins and the linear calibration option of Genomic Solutions HT Analyzer software (Genomic Solutions, Ann Arbor, MI).

Digestions and MALDI-TOFMS

Protein spots were excised from the gel, washed twice with water for 15 min, and destained with a 1:1 (v/v) solution of acetonitrile and 50 mm ammonium bicarbonate while changing solutions every 30 min until the blue color of Coomassie was removed. 2-DE gel spots were then dehydrated by washing twice with 100% acetonitrile and dried by vacuum centrifugation. Gel plugs were rehydrated with a solution of 10 ng μL−1 bovine trypsin (Roche) in 25 mm ammonium bicarbonate and digested for 4 to 6 h at 37°C. The enzymatic digestions were stopped with the addition of 10% (v/v) formic acid, and the supernatant was saved. Gel plugs were extracted once with 25 μL acetonitrile:water (1:1 [v/v]) and once with 25 μL of 100% (w/v) acetonitrile. Supernatants were combined and taken to dryness. Peptides were resuspended in 2% (w/v) formic acid:acetonitrile (1:1 [w/v]), mixed 1:1 with matrix (10 mg mL−1 α-cyano-4-hydroxycinnamic acid in same solvent), and spotted for MALDI-TOFMS. Mass spectra were obtained with a PerSeptive Biosystems DE-STR at an instrument resolution exceeding 10,000 and internally mass calibrated by matching to at least one and often more autolytic trypsin peaks (906.5049, 1153.5741, 2163.0570, and 2273.1602). Database search results were reprocessed with a reiterative search algorithm (Intellical, XXXX, XX) at 20 ppm that recalibrates m/z based on the best hit. Intellical software is part of the ABI Proteomics Solutions 1 software. If the best match is a real match, the identification confidence score will increase after reiterative calibration. If the best match is a false positive, the score will generally decline. The process was especially useful when trypsin autolytic peaks were of low abundance or absent. Resultant peptide mass fingerprints were assigned an arbitrary quality score (PMFQ) to quantify the quality of the peptide fingerprint and are reported in Supplemental Table I. The PMFQ scores were assigned based on the relative number of analyte peptides observed and their relative intensities as compared with the most abundant trypsin autolytic peptide peaks (2,163 and 2,273). If no peptides were observed or if analyte peptides were less than 10% of the trypsin autolytic peaks, a PMFQ value of 0 was assigned. If fewer than five peptides with relative intensities less than the trypsin peaks were observed, then a PMFQ of 1 was assigned. If five or more analyte peptides with intensities approximately equal to the trypsin autolytic peaks were observed, then a PMFQ value of 3 was assigned. If significantly more peptides were observed with a relative intensity greater than the trypsin autolytic peaks (but trypsin peaks still >10% for internal m/z calibration) were observed, then a PMFQ value of 4 (approximately 10 peptides) or 5 (>10 peptides) was assigned. Both MALDI-TOFMS peptide fingerprints illustrated in Figure 2 have a PMFQ of 5.

Database Queries and Protein Identifications

The peptide mass fingerprints were compared with sequences in: (a) NCBInr database (release January 1, 2002), (b) SwissProt database (release January 1, 2002), and/or (c) dbESTothers (NCBI; release January 1, 2002), (d) and/or a subset of dbESTothers (NCBI) consisting of approximately 145,000 barrel medic EST sequences, dated November 15, 2001, and queried using MS-Fit (http://prospector.ucsf.edu) in an automated mode using Proteomic Solutions 1 software from Applied Biosystems (Foster City, CA). Mass spectra were de-isotoped, baseline corrected, and threshold adjusted before database searching. Database searches were performed using a 100-ppm mass accuracy with a minimum requirement of four peptide matches from a submission list of typically 30 peptides. The maximum number of missed cleavages was set at one. The only user-defined modification specified was carbamidomethylation of Cys; however, the software default considered possible modifications of N-terminal Gln to pyro-Glu, oxidation of Met, and protein N terminus acetylation. When peptide mass fingerprints were matched to sequences in the EST databases, functional information was obtained by BLASTX (NCBI; http://www.ncbi.nlm.nih.gov/BLAST/) of the sequence or reference of the clone identifier to the barrel medic gene index (MtGI; http://www.tigr.org/tdb/mtgi/). The theoretical molecular mass and pI of the identified protein were then calculated using GPMAW (Lighthouse data) and compared with the experimental molecular mass calculated from the digitized 2-DE images. Protein identifications were evaluated on the basis of multiple variables including the number of peptides matched, mass error (m/z accuracy), percent coverage of the matched protein with 10% of the full-length protein set as the minimum value, quality of the peptide maps, intensity of the matched peaks (18%–20% minimum), similarity of experimental and theoretical protein molecular masses and pIs, and species from which the sequence was matched. For EST matches, the percent coverage was calculated by dividing the number of matched amino acids by the total number of amino acids in the protein sequence returned from the BLASTX or MtGI searches.

LC/MS/MS

Select digest mixtures were analyzed by nanoscale HPLC coupled with LC/MS/MS. Data were obtained using an ABI QSTAR Pulsar (Applied Biosystems) hybrid quadrupole time-of-flight mass spectrometer. The instrument m/z was calibrated with standards supplied by the manufacturer. Separated peptides were introduced into the mass spectrometer from an HPLC system equipped with an autosampler (LC Packings, San Francisco). Separations were achieved using an LC Packings nanoscale pepmap column (15 cm × 75 μm i.d., 3 μm, 100 Å, C18) and a linear binary gradient (solvent A was 1% [v/v] formic acid in 95%:5% [v/v] water:acetonitrile, whereas solvent B was a 0.8% [v/v] formic acid in 5%:95% [v/v] water:acetonitrile). The linear gradient was 95% (w/v) A:5% (w/v) B (0 min) to 60% (w/v) A:40% (w/v) B over 33 min, then ramped to 5% (w/v) A:95% (w/v) B at 37 min and held at 5% (w/v) A:95% (w/v) B until 42 min, where it was returned to 95% (w/v) A:5% (w/v) B 48 min and allowed to reequilibrate to 95% (w/v) A:5% (w/v) B 60 min. Nanoscale-ESI was performed using a Protona interface and nanoelectrospray needles (silver-coated glass capillary, New Objective, Woburn, MA). Mass spectra datasets were searched against NCBInr, SwissProt, dbESTothers, and mtEST databases using Mascot (http://www.matrixscience.com). The search results were validated as described for the peptide mass fingerprint results.

EST Counting and Protein Relative Abundance Estimates

Barrel medic ESTs were extracted from dbEST (http://www.ncbi.nlm.nih.gov/, accessed November 4, 2001). ESTs were assembled into tentative consensus sequences by TIGR to generate the barrel medic gene index (MtGI, http://www.tigr.org/tdb/tgi.shtml). The MtGI release of September 7, 2001 was used to count the occurrence of barrel medic genes in six different EST datasets including leaf (one cDNA library of developing leaf, 7,831 ESTs), stem (one library of developing stem, 10,314 ESTs), root (three libraries of uninoculated root, 6,593 ESTs), flower (one library of developing flower, 3,404 ESTs), seed pod (one library of developing seed and one library of developing pod, 4,587 ESTs), and cell suspensions (one library of elicited cell suspensions, 8,926 ESTs). The barrel medic genes were then sorted in the descending order on their EST counts for each dataset and used in the comparison with proteomic data.

Protein abundances were calculated using the normalized spot volume of each protein determined with HT Analyzer software (Genomic Solutions) as previously reported (Asirvatham et al., 2002a).

ACKNOWLEGMENTS

We thank Dr. Richard Dixon for scientific discussion and editorial comments. We thank Drs. Zhentian Lei and Aaron Elmer for their assistance in performing LC/MS/MS analyses.

Supplementary Material

Supplemental Data

Footnotes

1

This work was supported by the Samuel Roberts Noble Foundation and by the National Science Foundation (Plant Genome Research Project no. 0109732).

[w]

The online version of this article contains Web-only data. The supplemental material is available at www.plantphysiol.org.

Article, publication date, and citation information can be found at www.plantphysiol.org/cgi/doi/10.1104/pp.102.019034.

LITERATURE CITED

  1. Anderson NG, Matheson A, Anderson NL. Back to the future: the human protein index and the agenda for post-proteomic biology. Proteomics. 2001;1:3–12. doi: 10.1002/1615-9861(200101)1:1<3::AID-PROT3>3.0.CO;2-T. [DOI] [PubMed] [Google Scholar]
  2. Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, Bucher P, Cerutti L, Corpet F, Croning MDR et al. The InterPro database, an integrated documentation resource for protein families, domains, and functional sites. Nucleic Acids Res. 2001;29:37–40. doi: 10.1093/nar/29.1.37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Asirvatham VS, Watson BS, Sumner LW. Analytical and biological variances associated with proteomic studies of Medicago truncatulaby 2-DE. Proteomics. 2002a;2:960–968. doi: 10.1002/1615-9861(200208)2:8<960::AID-PROT960>3.0.CO;2-2. [DOI] [PubMed] [Google Scholar]
  4. Asirvatham VS, Watson BS, Wang L, Sumner LW. Protein identification success rates in proteomics studies of Medicago truncatula using peptide mass fingerprints to search protein, nucleotide and EST databases in a species without sequenced genomes. Proceedings of the 50th ASMS Conference on Mass Spectrometry and Allied Topics, FL: Orlando; 2002b. , June 2–6. American Society for Mass Spectrometry, Santa Fe, NM, pp xxx–xxx. [Google Scholar]
  5. Audic S, Claverie J-M. The significance of digital gene expression profiles. Genome Res. 1997;7:986–995. doi: 10.1101/gr.7.10.986. [DOI] [PubMed] [Google Scholar]
  6. Barker DG, Bianchi S, Blondon F, Dattée Y, Duc G, Essad S, Flament P, Gallusci P, Génier G, Pierre G et al. Medicago truncatula, a model plant for studying the molecular genetics of the Rhizobium-legume symbiosis. Plant Mol Biol Rep. 1990;8:40–49. [Google Scholar]
  7. Batchelor AH, Piper DE, de la Brousse FC, McKnight SL, Wolberger C. The structure of GABPalpha/beta: an EST domain-ankryin repeat heterodimer bound to DNA. Science. 1998;279:1037–1041. doi: 10.1126/science.279.5353.1037. [DOI] [PubMed] [Google Scholar]
  8. Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy SR, Griffiths-Jones S, Howe KL, Marshall M, Sonnhammer EL. The Pfam protein families database. Nucleic Acids Res. 2002;30:276–280. doi: 10.1093/nar/30.1.276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bell CA, Dixon RA, Farmer AD, Flores R, Inman J, Gonzales RA, Harrison MJ, Paiva NL, Scott AD, Weller JW et al. The Medicagogenome initiative: a model legume database. Nucleic Acids Res. 2000;29:1–4. doi: 10.1093/nar/29.1.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bestel-Corre G, Dumas-Gaudot E, Poinsot V, Dieu M, Dierick J-F, van Tuinen D, Remacle J, Gianinassi-Pearson V, Gianinazzi S. Proteome analysis and identification of symbiosis-related proteins from Medicago truncatulaGaertn. by two-dimensional electrophoresis and mass spectrometry. Electrophoresis. 2002;23:122–137. doi: 10.1002/1522-2683(200201)23:1<122::AID-ELPS122>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
  11. Bevan M, Bancroft I, Bent E, Love K, Goodman H, Dean C, Bergkamp R, Dirske W, Van Staveren M, Stiekema W et al. Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana. Nature. 1998;391:485–488. doi: 10.1038/35140. [DOI] [PubMed] [Google Scholar]
  12. Bradford MM. A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal Biochem. 1976;72:248–254. doi: 10.1016/0003-2697(76)90527-3. [DOI] [PubMed] [Google Scholar]
  13. Candiano G, Musante L, Bruschi M, Ghiggeri GM, Herbert B, Antonucci F, Righetti PG. Two-dimensional maps in soft immobilized pH gradient gels: a new approach to the proteome of the third millennium. Electrophoresis. 2002;23:292–297. doi: 10.1002/1522-2683(200202)23:2<292::AID-ELPS292>3.0.CO;2-5. [DOI] [PubMed] [Google Scholar]
  14. Chang WWP, Huang L, Shen M, Webster C, Burlingame AL, Roberts JKM. Patterns of protein synthesis and tolerance of anoxia in root tips of maize seedlings acclimated to a low-oxygen environment, and identification of proteins by mass spectrometry. Plant Physiol. 2000;122:295–317. doi: 10.1104/pp.122.2.295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Choudhary JS, Blackstock WP, Creasy DM, Cottrell JS. Matching peptide mass spectra to EST and genomic DNA databases. Trends Biotechnol. 2001;19:S17–S22. doi: 10.1016/S0167-7799(01)01795-4. [DOI] [PubMed] [Google Scholar]
  16. Comment. World's first complete legume genome sequencing project. Trends Plant Sci. 2002;7:101. [Google Scholar]
  17. Cook D, Dreyer D, Bonnet D, Howell M, Nony E, VandenBosch K. Transient induction of a peroxidase gene in Medicago truncatula precedes infection by Rhizobium meliloti. Plant Cell. 1995;7:43–55. doi: 10.1105/tpc.7.1.43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Cook DR. Medicago truncatula: a model in the making! Curr Opin Plant Biol. 1999;2:301–304. doi: 10.1016/s1369-5266(99)80053-3. [DOI] [PubMed] [Google Scholar]
  19. Cook DR, VandenBosch K, de Bruijn FJ, Huguet T. Model legumes get the nod. Plant Cell. 1997;9:275–281. [Google Scholar]
  20. Corthals G, Gygi S, Aebersold R, Patterson SD. Identification of proteins by mass spectrometry. In: Rabilloud T, editor. Proteome Research: Two-Dimensional Gel Electrophoresis and Identification Methods. Berlin: Springer-Verlag; 2000. pp. 197–231. [Google Scholar]
  21. Costa P, Bahrman N, Frigerio J-M, Kremer A, Plomion C. Water-deficit-responsive proteins in maritime pine. Plant Mol Biol. 1998;38:587–596. doi: 10.1023/a:1006006132120. [DOI] [PubMed] [Google Scholar]
  22. Costa P, Pionneau C, Bauw G, Dubos C, Bahrmann N, Kremer A, Frigerio J-M, Plomion C. Separation and characterization of needle and xylem maritime pine proteins. Electrophoresis. 1999;20:1098–1108. doi: 10.1002/(SICI)1522-2683(19990101)20:4/5<1098::AID-ELPS1098>3.0.CO;2-Z. [DOI] [PubMed] [Google Scholar]
  23. Covitz PA, Smith LS, Long SR. Expressed sequence tags from a root-hair-enriched Medicago truncatulacDNA library. Plant Physiol. 1998;117:1325–1332. doi: 10.1104/pp.117.4.1325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Davin LB, Lewis NG. Phenylpropanoid metabolism: biosynthesis of monolignols, lignans and neolignans, lignins and suberins. In: Stafford HA, Ibrahim RK, editors. Recent Advances in Phytochemistry, Phenolic Metabolism in Plants. New York: Plenum Press; 1992. pp. 325–376. [Google Scholar]
  25. Dixon RA. Plant tissue culture methods in the study of phytoalexin induction. In: Ingram DS, Helgeson JP, editors. Tissue Culture Methods for Plant Pathologists. Oxford: Blackwell Scientific Publications; 1980. pp. 185–186. [Google Scholar]
  26. Dixon RA. Isoflavonoids: biochemistry, molecular biology, and biological function. In: Barton D, Nakanishi K, Meth-Cohn O, editors. Comprehensive Natural Product Chemistry. New York: Elsevier; 1999. pp. 774–821. [Google Scholar]
  27. Dunwell JM. Cupins: a new superfamily of functionally diverse proteins that include germins and plant storage proteins. Biotechnol Genet Eng Rev. 1998;15:1–32. doi: 10.1080/02648725.1998.10647950. [DOI] [PubMed] [Google Scholar]
  28. Duranti M, Horstamann C, Gilroy J, Croy RR. The molecular basis for N-glycosylation in the 11S globulin (legumin) of lupin seed. J Protein Chem. 1995;14:107–110. doi: 10.1007/BF01888368. [DOI] [PubMed] [Google Scholar]
  29. Ewing RM, Kahla AB, Poirot O, Lopez F, Audic S, Claaverie J-M. Large-scale statistical analyses of rice ESTs reveal correlated patterns of gene expression. Genome Res. 1999;9:950–959. doi: 10.1101/gr.9.10.950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Futcher B, Latter GI, Monardo P, McLaughlin CS, Garrels JI. A sampling of the yeast proteome. Mol Cell Biol. 1999;19:7357–7368. doi: 10.1128/mcb.19.11.7357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gallardo K, Job C, Groot SPC, Puype M, Demol H, Vanderkerckhove J, Job D. Proteomic analysis of Arabidopsisseed germination and priming. Plant Physiol. 2001;126:835–848. doi: 10.1104/pp.126.2.835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Gallusci P, Dedieu A, Journet EP, Huguet T, Barker DG. Synchronous expression of leghaemoglobin in Medicago truncatuladuring nitrogen-fixing root nodule development and response to exogenously supplied nitrate. Plant Mol Biol. 1991;17:335–349. doi: 10.1007/BF00040629. [DOI] [PubMed] [Google Scholar]
  33. Görg A, Obermaler C, Boguth G, Weiss W. Recent developments in two-dimensional gel electrophoresis with immobilized pH gradients: wide pH gradients up to pH 12, longer separation distances and simplified procedures. Electrophoresis. 1999;20:712–717. doi: 10.1002/(SICI)1522-2683(19990101)20:4/5<712::AID-ELPS712>3.0.CO;2-Y. [DOI] [PubMed] [Google Scholar]
  34. Gorina S, Pavletich NP. Structure of the p53 tumor suppressor bound to the ankyrin and SH3 domains of 53BP2. Science. 1996;274:1001–1005. doi: 10.1126/science.274.5289.1001. [DOI] [PubMed] [Google Scholar]
  35. Guerreiro N, Djordjevic MA, Rolfe BG. Proteome analysis of the model microsymbiont Sinorhizobium meliloti: isolation and characterisation of novel proteins. Electrophoresis. 1999;20:818–825. doi: 10.1002/(SICI)1522-2683(19990101)20:4/5<818::AID-ELPS818>3.0.CO;2-6. [DOI] [PubMed] [Google Scholar]
  36. Gygi SP, Rochon Y, Franza BR, Aebersold R. Correlation between protein and mRNA abundance in yeast. Mol Cell Biol. 1999;19:1720–1730. doi: 10.1128/mcb.19.3.1720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Gyorgyey J, Vaubert D, Jimenez-Zurdo JI, Charon C, Troussard L, Kondorosi A, Kondorosi E. Analysis of Medicago truncatulanodule expressed sequence tags. Mol Plant-Microbe Interact. 2000;13:62–71. doi: 10.1094/MPMI.2000.13.1.62. [DOI] [PubMed] [Google Scholar]
  38. Haridas V, Higuchi M, Jayatilake GS, Bailey D, Mujoo K, Blake ME, Arntzen CJ, Gutterman JU. Avicins: triterpenoid saponins from Acacia victoriae(Bentham) induce apoptosis by mitochondrial perturbation. Proc Natl Acad Sci USA. 2001;98:5821–5826. doi: 10.1073/pnas.101619098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hayashi M, Mori H, Nishimura M, Akazawa T, Hara-Nishimura I. Nucleotide sequence of cloned cDNA coding for pumpkin 11-S globulin beta subunit. Eur J Biochem. 1988;172:627–632. doi: 10.1111/j.1432-1033.1988.tb13935.x. [DOI] [PubMed] [Google Scholar]
  40. Higgins CF. ABC transporters: from microorganisms to man. Annu Rev Cell Biol. 1992;8:67–113. doi: 10.1146/annurev.cb.08.110192.000435. [DOI] [PubMed] [Google Scholar]
  41. Jacobs MD, Harrison SC. Structure of an IkappaBalpha/NF-kappaB complex. Cell. 1998;95:749–758. doi: 10.1016/s0092-8674(00)81698-0. [DOI] [PubMed] [Google Scholar]
  42. Jasiñski M, Stukkens Y, Degand H, Purnelle B, Marchand-Brynaert J, Boutry M. A plant plasma membrane ATP binding cassette-type transporter is involved in antifungal terpenoid secretion. Plant Cell. 2001;13:1095–1107. [PMC free article] [PubMed] [Google Scholar]
  43. Klose J, Kobalz U. Two-dimensional electrophoresis of proteins: an updated protocol and implications for a functional analysis of the genome. Electrophoresis. 1995;16:1034–1059. doi: 10.1002/elps.11501601175. [DOI] [PubMed] [Google Scholar]
  44. Kruft V, Holger E, Jänsch L, Wolf W, Braun H-P. Proteomic approach to identify novel mitochondrial proteins in Arabidopsis. Plant Physiol. 2001;127:1694–1710. [PMC free article] [PubMed] [Google Scholar]
  45. Lewis NG, Yamamoto E. Lignins: occurrence, biosynthesis and biodegradation. Annu Rev Plant Physiol Plant Mol Biol. 1990;41:455–496. doi: 10.1146/annurev.pp.41.060190.002323. [DOI] [PubMed] [Google Scholar]
  46. Li HM, Kaneko Y, Keegstra K. Molecular cloning of a chloroplastic protein associated with both the envelope and thylakoid membranes. Plant Mol Biol. 1994;25:619–632. doi: 10.1007/BF00029601. [DOI] [PubMed] [Google Scholar]
  47. Mann M, Pandey A. Use of mass spectrometry-derived data to annotate nucleotide and protein sequence databases. Trends Biochem Sci. 2001;26:54–60. doi: 10.1016/s0968-0004(00)01726-6. [DOI] [PubMed] [Google Scholar]
  48. Mann M, Wilm M. Error-tolerant identification of peptide sequence tags. Anal Chem. 1994;66:4390–4399. doi: 10.1021/ac00096a002. [DOI] [PubMed] [Google Scholar]
  49. Mathesius U, Imin N, Chen H, Djordjevic MA, Weinman JJ, Natera SHA, Morris AC, Kerim T, Paul S, Menzel C et al. Evaluation of proteome reference maps for cross-species identification of proteins by peptide mass fingerprinting. Proteomics. 2002;2:1288–1303. doi: 10.1002/1615-9861(200209)2:9<1288::AID-PROT1288>3.0.CO;2-H. [DOI] [PubMed] [Google Scholar]
  50. Mathesius U, Keijzers G, Natera SHA, Winman JJ, Djordjevic MA, Rolfe BG. Establishment of a root proteome reference map for the model legume Medicago truncatulausing the expressed sequence tag database for peptide mass fingerprinting. Proteomics. 2001;1:1424–1440. doi: 10.1002/1615-9861(200111)1:11<1424::AID-PROT1424>3.0.CO;2-J. [DOI] [PubMed] [Google Scholar]
  51. Meins FJ, Neuhaus J-M, Sperisen C, Ryals J. Plant Gene Research: The primary structure of plant pathogenesis-related glucanohydrolases and their genes. In: Boller T, Meins F, editors. Genes Involved in Plant Defense. New York: Springer-Verlag Wien; 1992. pp. 245–282. [Google Scholar]
  52. Millar AH, Sweetlove LJ, Giegé P, Leaver CJ. Analysis of the Arabidopsismitochondrial proteome. Plant Physiol. 2001;127:1711–1727. [PMC free article] [PubMed] [Google Scholar]
  53. Molloy MP, Herbert BR, Walsh BJ, Tyler MI, Traini M, Sanchez J-C, Hochstrasser DF, Williams KL, Gooley AA. Extraction of membrane proteins by differential solubilization for separation using two-dimensional gel electrophoresis. Electrophoresis. 1998;19:837–844. doi: 10.1002/elps.1150190539. [DOI] [PubMed] [Google Scholar]
  54. Morris AC, Djordevic MA. Proteome analysis of cultivar-specific interactions between Rhizobium leguminosarum biovar trifoliiand subterranean clover cultivar woogenenellup. Electrophoresis. 2001;22:586–598. doi: 10.1002/1522-2683(200102)22:3<586::AID-ELPS586>3.0.CO;2-L. [DOI] [PubMed] [Google Scholar]
  55. Pappin DJC, Hojrup P, Bleasby AJ. Rapid identification of proteins by peptide-mass fingerprinting. Curr Biol. 1993;3:327–332. doi: 10.1016/0960-9822(93)90195-t. [DOI] [PubMed] [Google Scholar]
  56. Peltier J-B, Friso G, Kalume DE, Roepstorff P, Nilsson F, Adamska I, van Wijk KJ. Proteomics of the chloroplast: systematic identification and targeting analysis of lumenal and peripheral thylakoid proteins. Plant Cell. 2000;12:319–341. doi: 10.1105/tpc.12.3.319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Peltier J-B, Emanuelsson O, Kalume DE, Yetterberg J, Friso G, Rudella A, Liberles DA, Söderberg L, Roepstorff P, von Heijne G et al. Central functions of the lumenal and peripheral thylakoid proteome of Arabidopsis determined by experimentation and genome-wide prediction. Plant Cell. 2002;14:211–236. doi: 10.1105/tpc.010304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Peng HM, Dreyer DA, VandenBosch KA, Cook D. Gene structure and differential regulation of the Rhizobium-induced peroxidase gene rip1. Plant Physiol. 1996;112:1437–1446. doi: 10.1104/pp.112.4.1437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Reymond P, Kunz B, Paul-Pletzer K, Grimm R, Eckerskorn C, Farmer EE. Cloning of a cDNA encoding a plasma membrane-associated, uronide binding phosphoprotein with physical proteins similar to viral movement proteins. Plant Cell. 1996;8:2265–2276. doi: 10.1105/tpc.8.12.2265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Sambrook J, Fritsch EF, Maniatis T. Construction and analysis of cDNA libraries. In: Ford N, Nolan C, Feruson M, editors. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; 1989. pp. 8.1–8.86. [Google Scholar]
  61. Small E. Adaptations to herbivory in alfalfa (Medicago sativa) Can J Bot. 1996;74:807–822. [Google Scholar]
  62. Stensballe A, Jensen ON. Simplified sample preparation method for protein identification by matrix-assisted laser desorption/ionization mass spectrometry: in-gel digestion on the probe surface. Proteomics. 2001;1:955–966. doi: 10.1002/1615-9861(200108)1:8<955::AID-PROT955>3.0.CO;2-P. [DOI] [PubMed] [Google Scholar]
  63. Sugiura M, Takeda Y. Nucleic acids. In: Buchanan B, Gruissem W, Jones R, editors. Biochemistry and Molecular Biology of Plants. Waldorf, MA: American Society for Plant Physiologists; 2000. pp. 298–300. [Google Scholar]
  64. Trieu AT, Burleigh SH, Kardailsky IV, Maldonado-Mendoze IE, Versaw WK, Blaylock LA, Shin H, Chiou T-J, Katagi H, Dewbre GR et al. Transformation of Medicago truncatula via infiltration of seedlings or flowering plants with Agrobacterium. Plant J. 2000;22:531–541. doi: 10.1046/j.1365-313x.2000.00757.x. [DOI] [PubMed] [Google Scholar]
  65. Tsugita A, Kawakami T, Uchiyama Y, Kamo M, Miyatake N, Nozu Y. Separation and characterization of rice proteins. Electrophoresis. 1994;15:708–720. doi: 10.1002/elps.1150150198. [DOI] [PubMed] [Google Scholar]
  66. U.S. Department of Agriculture-National Agricultural Statistics Service (2002) Agricultural statistics 2002. 2000 Agricultural Statistics. http://www.usda.gov/nass/pubs/agr02/acro02.htm.
  67. van Wijk K. Challenges and prospects of plant proteomics. Plant Physiol. 2001;126:501–508. doi: 10.1104/pp.126.2.501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Wang L, Wessler SR. Inefficient reinitiation is responsible for upstream open reading frame-mediated translation repression of the maize R gene. Plant Cell. 1998;10:1733–1745. doi: 10.1105/tpc.10.10.1733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Wang L, Wessler SR. Role of mRNA secondary structure in translational repression of the maize transcriptional activator Lc1,2. Plant Physiol. 2001;125:1380–1387. doi: 10.1104/pp.125.3.1380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Yates JR., III Mass spectrometry and the age of the proteome. J Mass Spectrom. 1998a;33:1–19. doi: 10.1002/(SICI)1096-9888(199801)33:1<1::AID-JMS624>3.0.CO;2-9. [DOI] [PubMed] [Google Scholar]
  71. Yates JR., III Database searching using mass spectrometry data. Electrophoresis. 1998b;19:893–900. doi: 10.1002/elps.1150190604. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data

Articles from Plant Physiology are provided here courtesy of Oxford University Press

RESOURCES