Abstract
In soybeans, eighteen members constitute the serine hydroxymethyltransferase (GmSHMT) gene family, of which the cytosolic-targeted GmSHMT08c member has been reported to mediate resistance to soybean cyst nematode (SCN). This work presents a comprehensive study of the SHMT gene family members, including synteny, phylogeny, subcellular localizations, haplotypes, protein homology modeling, mutational, and expression analyses. Phylogenetic analysis showed that SHMT genes are divided into four classes reflecting their subcellular distribution (cytosol, nucleus, mitochondrion, and chloroplast). Subcellular localization of selected GmSHMT members supports their in-silico predictions and phylogenetic distribution. Expression and functional analyses showed that GmSHMT genes display many overlapping, but some divergent responses during SCN infection. Furthermore, mutational analysis reveals that all isolated EMS mutants that lose their resistance to SCN carry missense and nonsense mutations at the GmSHMT08c, but none of the Gmshmt08c mutants carried mutations in the other GmSHMT genes. Haplotype clustering analysis using the whole genome resequencing data from a collection of 106 diverse soybean germplams (15X) was performed to identify allelic variants and haplotypes within the GmSHMT gene family. Interestingly, only the cytosolic-localized GmSHMT08c presented SNP clusters that were associated with SCN resistance, supporting our mutational analysis. Although eight GmSHMT members respond to the nematode infestation, functional and mutational analysis has shown the absence of functional redundancy in resistance to SCN. Structural analysis and protein homology modeling showed the presence of spontaneous mutations at important residues within the GmSHMT proteins, suggesting the presence of altered enzyme activities based on substrate affinities. Due to the accumulation of mutations during the evolution of the soybean genome, the other GmSHMT members have undergone neofunctionalization and subfunctionalization events.
Introduction
Soybean [Glycine max (L.) Merr.] is the most widely consumed legume crop worldwide. However, soybean production is limited by the presence of the soybean cyst nematode (SCN; Heterodera glycines I.), causing over $1 billion in yield losses annually in the U.S.1. Most of the SCN resistant soybean lines are mainly derived from two types; PI 88788 and Peking. Peking-type resistance to SCN requires resistant alleles at two loci, the Rhg4 and the rhg1-a2. The two genes underlying resistance to SCN have been isolated, encoding the soluble NSF attachment protein (GmSNAP18) at the rhg1-a locus and the serine hydroxymethyltransferase (GmSHMT08c) at the Rhg4 locus3,4.
The serine hydroxylmethyltransferase (SHMT; EC 2.1.2.1) gene family is present in all plant and animal lineages. SHMT is an ubiquitous, homotetrameric enzyme with a key role in one-carbon metabolism, methionine synthesis, and maintenance of redox homeostasis during photorespiration5,6. SHMT is involved in the catalysis of reversible hydroxymethyl group transfer and interconversion of serine/glycine and tetrahydrofolate (THF)/5,10-methyleneTHF via transaldimination reactions7–9. In humans, mutations in SHMT have been shown to be involved in multiple diseases including cancers and cardiovascular diseases10–12.
In plants, the SHMT gene family has been studied in the plant model Arabidopsis thaliana. All seven members were reported to be localized within four intracellular compartments, with two in the nucleus and two in the cytosol, two in the mitochondrion, and one in the chloroplast13. In eudicots including pea, potato, spinach, and A. thaliana, SHMT activity was detected in most cell compartments including cytosol, mitochondria, nucleus, and plastids14–17. However, in monocots, only Hordeum vulgare has reported the presence of plastid-targeted SHMT enzyme activity13,14, but no orthologous gene could be found, inferring an aneupleurotic pathway or enzyme. In soybean, a member of the dicot gene family (GmSHMT08c) underlying the Rhg4 loci has been reported to be involved in SCN resistance. It has been suggested that the SCN-resistant Rhg4 allele emerged via artificial selection during the soybean domestication process18. However, little is known about the role of the other GmSHMT genes in plant abiotic stresses, and if the rest of the GmSHMT gene family (or some members) can play similar roles as the GmSHMT08 and may present functional redundancy or additive effect in resistance to SCN, as it has been reported recently in case of the GmSNAP gene family19. Analyzing the function of SHMT genes or any other gene family in soybean is a difficult task, since about 80% percent of soybean genes are duplicates20–22. Soybean breeding and targeting essential genes to improve important agronomic traits (i.e. oil, protein, yield, resistant to biotic and abiotic stresses) is challenging due to those duplication events and the presence of multiple gene copies23,24. The soybean genome encodes multiple chloroplastic, mitochondrial, nuclear, and cytosolic-localized GmSHMT classes. Mutations at the cytosol-targeted GmSHMT08c but not on the other GmSHMT members result in the loss of resistance to SCN in the resistant c.v Forrest lines3. Here, we report a detailed characterization of the GmSHMT gene family in soybean including structure, synteny, phylogeny, expression, homology modeling, subcellular localization, and mutational analyses.
Results
Duplication of GmSHMT in the soybean genome
Genome-wide analysis showed that the GmSHMT gene family in soybean is composed of at least twelve members named as GmSHMT02, GmSHMT04, GmSHMT05, GmSHMT06, GmSHMT08 (with three genes on chromosome 08), GmSHMT09, GmSHMT12, GmSHMT13, GmSHMT14, and GmSHMT18 (the number indicated the chromosome locations), in addition to a thirteenth member (GmSHMT15) corresponding to a truncated protein (Supplementary Fig. S1). Chromosome 08 carries three basal members of the GmSHMT gene family; GmSHMT08c (cytosolic), GmSHMT08n (nucleic), and GmSHMT08m (mitochondrial). Chromosome 13 carries a basal GmSHMT13ch (chloroplastic) and a multifunctional GmSHMT13m (mitochondrial). GmSHMT gene family members encode proteins that vary in size between 471aa and 603aa, except the GmSHMT15 gene that encodes a truncated protein resulting in 244aa (Supplementary Fig. S1).
A number of genes with lower similarity to SHMT or associated with other protein domains were also found, increasing the number of GmSHMT gene family to eighteen members on the soybean genome (Supplementary Fig. S2). In fact, five additional GmSHMTs with multiple-bifunctional activities including a SHMT/threoninealdolase (Glyma.12g159200; GmSHMT12c, Glyma.15G090000; GmSHMT15c, and Glyma.16G108100; GmSHMT16c), a SHMT/wall-associated receptor kinase galacturonan-binding (Glyma.13G077700; GmSHMT13c), and a SHMT/bZIP transcription factor (Glyma.09G184300; GmSHMT09n) have been annotated in Phytozome, and their orthologs from the plant model were studied earlier (Supplementary Fig. S2)25. It has been reported that the SHMT/threoninealdolases and SHMT/wall-associated receptor kinase galacturonan-binding presented a cytosolic localization in yeast and Arabidopsis, respectively26,27. However, the Arabidopsis SHMT/bZIP transcription factor presented a nucleic localization28. Protein alignments showed that these GmSHMTs have diverged in their structure and function from the basal SHMT proteins. In this study, we studied only members of the GmSHMT family annotated as basal serine hydroxymethyltransferases.
To test the contribution of the soybean segmental duplications to the increase of soybean SHMT genes, we analyzed the distributions of GmSHMTs on chromosomal duplicated segments compiled from the Plant Genome Duplication Database29–31. Synteny analysis showed the presence of several commonly linked genes in duplicated blocks (Supplementary Fig. S3). Gene pairs GmSHMT08c/GmSHMT05c, GmSHMT12n/GmSHMT08n, GmSHMT06n/GmSHMT04n, GmSHMT13ch/GmSHMT15ch, and GmSHMT13m/GmSHMT14m were located on highly conserved duplicated blocks of ch08/ch05, ch12/ch08, ch06/ch04, ch13/ch15, in addition to an old duplication between ch13/ch14, respectively. These blocks encompassed 551, 14, 711, 391, and 24 conserved genes or anchors, respectively (Supplementary Table S1). Furthermore, GmSHMT08m was present in two different more diverged duplicated pairs (ch08/ch09, and ch08/ch02), with the presence of 24 and 10 additional conserved linked genes (Supplementary Fig. S3D). In addition, GmSHMT02m was found in two duplicated blocks located on ch02/14 (recent duplication) and ch02/18 (older duplication), with the presence of 248 and 7 additional conserved genes surrounding GmSHMT02m, respectively (Supplementary Fig. S3D; Supplementary Table S1).
Phylogenetic analysis of the SHMT genes
To elucidate the evolution of the GmSHMT gene family in soybean, phylogenetic analysis was conducted using genes from 22 sequenced plant species. The analysis separately grouped SHMTs into two groups and four classes corresponding to the cellular localization of these genes; nuclear, cytosol, mitochondria, and chloroplast-localized GmSHMTs. The two nuclear and cytosol-localized GmSHMTs classes clustered together, whereas the mitochondria and chloroplast-localized GmSHMTs classes formed a second cluster. Each class included sequences from various lineages: monocots, eudicots, mosses, lycophytes, and algae (Fig. 1). Surprisingly, the chloroplast-localized class did not include a monocot species. Extensive searches employing a large number of monocots including O. sativa, Z. mays, S. bicolor, S. italica, H. vulgare, P. hallii, P. virgatum, and T. aestivum failed to identify any GmSHMT member within the chloroplastic clade (Supplementary Fig. S4). The phylogenetic distribution of SHMT genes showed that several within-class duplicates are the result of duplication or polyploidization events at the species or lineage levels (Fig. 1).
Structure analysis of intron-exons
Structural analysis revealed that organellar GmSHMTs had the highest number of exons and introns. As shown in Supplementary Fig. S5, all the five mitochondrial-localized GmSHMTs had 15 exons. The two chloroplastic genes have different number of exons; GmSHMT13ch had 11 exons while GmSHMT15ch presented 6 exons only and encodes for a truncated protein. Moreover, all four nuclear-localized GmSHMT members contained 4 exons. In contrast, the two cytosol-localized GmSHMTs had different numbers of exons, with GmSHMT05c and GmSHMT08c having 4 and 3 exons, respectively (Supplementary Fig. S5).
Chlamydomonas reinhardtii represents the polyphyletic chlorophytes, one of which was hypothesized to be a relative of the aquatic ancestor of all land plants, the green algae32. Comparison between C. reinhardtii and the land plant species showed that they present similar number of exons and introns (with land plants having 1 exon and 1 intron less) (Fig. 2). However, Cre06.g293950 SHMT gene from algae (clustering with both cytosolic and nucleic SHMTs) contained eight exons, while their cytosolic and nucleic counterparts have four exons only in both. This is maybe due to four intron loss/gain events that may have occurred in the ancestor of land plant (Fig. 2). For instance, either the algae or the land plant SHMT genes may have acquired or lost these introns during their lineage-specific evolution.
Moreover, the cytosolic GmSHMT08c presented different exon and intron structure between Forrest and WI82 soybean lines. The analysis shows evidence of alternative splicing variants and exon skipping events that may have occurred between the cytosolic GmSHMT08c from Forrest and GmSHMT08c from WI82 (Supplemental Fig. S5). The exon skipping event is common in plants and animals33.
Subcellular localization of selected GmSHMTs
To confirm in silico GmSHMT subcellular localization’s predictions, two cytosol-targeted GmSHMT05c and GmSHMT08c, two nucleus-targeted GmSHMT06n and GmSHMT08n, two mitochondrial-targeted GmSHMT02m and GmSHMT14m, as well as two chloroplastic-targeted GmSHMT13ch and GmSHMT15ch were studied for their subcellular localization by transforming onion epidermal cells using YFP fusions. The obtained results confirmed the subcellular localization predictions of the four GmSHMT classes. Indeed, GmSHMT05c::YFP and GmSHMT08c::YFP were localized in the cytosol of the transformed onion epidermal cells, whereas GmSHMT06n::YFP and GmSHMT08n::YFP accumulated in the nucleus (Fig. 3). GmSHMT02m::YFP presented a mitochondrial-like subcellular localization, while the GmSHMT14m::YFP presented both mitochondrial-like and cytosolic localization. The GmSHMT13ch::YFP presented a chloroplastic-like localization, and as expected, no signal has been detected for the GmSHMT15ch::YFP (coding for a truncated protein).
Organ-specific expression of the SHMT genes
To gain insight into the expression profile divergence of different GmSHMT family members in soybean, expression data of each GmSHMT gene were compiled from the publically available RNA-seq database (Soybase.org)34. This dataset contains seven tissues from various developmental stages including vegetative (leaves, root and nodules) and seed development. While no RNAseq data were available for the GmSHMT15ch, the rest of the GmSHMT gene members presented different gene expression patterns (Fig. 4). Whereas most of the GmSHMT members presented an ubiquitous expression in all the tissues analyzed, the two duplicated cytosol-targeted GmSHMT08c and GmSHMT05c were highly expressed in roots and were predominantly expressed in pods. GmSHMT08c, GmSHMT05c, GmSHMT08m, GmSHMT09m, and GmSHMT18m were mainly expressed in young leaflets. GmSHMT08c, GmSHMT05c, GmSHMT08m, GmSHMT09m, and GmSHMT13ch were abundantly expressed in flowers.
GmSHMT13ch was highly abundant in the nodules. Expression data compiled from the public bio-analytic resource for plant biology database (http://bar.utoronto.ca) showed that the chloroplast-targeted MtSHMT2ch gene from Medicago truncatula, was abundantly expressed in leaves and seeds, but not in nodules, as was the case for the chloroplast-targeted GmSHMT13ch in soybean (Supplementary Fig. S6). Nucleus-targeted MtSHMT2n from M. truncatula was highly expressed in nodules35. Moreover, chloroplast-targeted SHMTs from the non-leguminous plant Solanum lycopersicum were abundantly expressed in fruit, leaves, and roots36. The chloroplastic AtSHMT3ch from A. thaliana was mainly expressed in seeds (Supplementary Fig. S6)37,38.
The mitochondria-targeted GmSHMT09m was mainly expressed in soybean seeds. Mitochondria-targeted AtSHMT2m from A. thaliana was also mainly expressed in seed, as well as in the root and 1st node. However, the mitochondria-targeted MtSHMT5m from M. truncatula and the SlSHMT2m and SlSHMT4m from S. lycopersicum were mainly expressed in the pods/fruit, leaves, and roots (Supplementary Fig. S6). These data suggest that GmSHMT from the four different classes evolved to play different roles, and the function of a determined class is not organ specific.
Expression analysis of the GmSHMT genes following soybean infection with SCN
Unlike GmSHMT family members, the GmSHMT08c has been associated with SCN resistance3. To gain insight into the function of the GmSHMT family members, their specific response to SCN was investigated in the susceptible line Essex and the SCN resistant line Forrest (Peking-type; Rhg4a). Expression analysis using qRT-PCR at three, five, and ten days post SCN inoculation demonstrates that all cytosol, mitochondria, and chloroplast-targeted GmSHMT members were induced under SCN infection. Transcript abundance analyses showed that GmSHMT gene family members follow three types of expression patterns in response to SCN infection (Fig. 5). Whereas the cytosol-targeted GmSHMT05c and the mitochondria-targeted GmSHMT14m transcripts increased in both compatible and incompatible reactions to SCN, the chloroplast-targeted GmSHMT13ch, the cytosol-targeted GmSHMT08c, and the four mitochondria-targeted GmSHMT02m, GmSHMT08m, GmSHMT09m, and GmSHMT18m transcripts were significantly increased only in the incompatible reaction. However, transcripts from a third group including all the four nucleus-targeted GmSHMT04n, GmSHMT06n, GmSHMT08n, and GmSHMT12n were not induced during SCN infection (Fig. 5). The GmSHMT15 gene was not expressed, which is coherent with the resuts obtained from the subcellular localization analysis (Fig. 3) and the RNAseq data (Fig. 4), suggesting that the GmSHMT15 gene diverged from the others or has become pseudogenized.
Domain variations within GmSHMT members
Investigation of GmSHMT08c for conserved domain showed the presence of forty amino acid residues that play an important role in maintaining dimer interfaces (16 residues), tetrahydrofolate (THF) binding sites (7 residues), pyridoxal phosphate (PLP) cofactor binding sites (13 residues), and active sites for catalysis (4 residues) (Fig. 6A,B)39. In silico analysis showed that the cytosolic GmSHMT05c member presents the same conserved residues like GmSHMT08c (Fig. 6C), confirming their segmental duplication inferred from syntenic analysis. However, nucleic, cytosolic, and mitochondrial-targeted GmSHMTs show variations within their oligomeric structures, catalytic sites, and cofactor binding residues (Fig. 6D).
Nucleic GmSHMT members did not conserve most of the residues at positions Q77, G106, S107, H121, L129, Y153, V301, A306, and N374. In addition, E18, H134, and L135 residues differ in GmSHMT08n and GmSHMT12n, whereas the F284 residue was not conserved in both GmSHMT04n and GmSHMT06n. Moreover, Q77, Y104, R257, A302, and Q305 residues were not conserved in all chloroplastic-targeted GmSHMTs.
Mitochondrial-targeted GmSHMTs did not conserve K25, Q77, Y104, and Y153 residues. The GmSHMT09m gene family member had differing residues from the two cytosolic-targeted GmSHMTs at positions K58, Y59, S190, and G310 (Fig. 6D).
Domain variation analysis of the GmSHMT classes showed that the highest domain variations were observed within the nucleic-targeted GmSHMT classes, with 14 domain variation out of 40 (35%), affecting protein structure (dimerization and tetramerization), substrate binding (including THF and PLP binding), and catalysis (Figs 1 and 6). However, less domain variation was observed in the mitochondrial-targeted GmSHMT classes; 6 out of 40 (15%), suggesting functional conservation during evolution. Similarly, the mitochondrial-targeted GmSHMT class presented limited variations; 4 out of 40 (10%), except for the GmSHMT09m that conferred extra 3 domains variation (17.5%). As expected, no domain variation has been observed between the two cytosolic-targeted GmSHMT05c and GmSHMT08c.
SCN susceptible EMS-induced mutant lines carry mutations at the GmSHMT08c only
To gain more insight into the possible involvement of the GmSHMT gene family members in resistance to SCN, we developed EMS mutagenized soybean lines from the SCN resistant Forrest. Next, forward genetic screening was employed to screen for mutants that lost their resistance to SCN. The susceptible Forrest EMS mutagenized population was screened for mutations within all the GmSHMT gene family members. In the current study, three new EMS mutants were found to lose their resistance to SCN, in addition to the fifteen EMS mutants that we reported recently3,40. Targeted sequencing analysis showed that all eighteen mutations were located in the GmSHMT08c gene. Interestingly, none of the selected Gmshmt08c mutants carried mutations in the coding or non coding regions of the other 12 GmSHMT genes corresponding to the four classes including the cytosolic-targeted GmSHMT05c member. In total, sixteen Gmshmt08 missense mutants were identified. The Gmshmt08 EMS mutants F6266 (E61K) and F6756 (M125I) were identified earlier by reverse genetics3, F427 (G71D), F1336 (L299F), F891 (A302V), F1460 (G326E), F1433 (G138S), F1722 (G138D), F473 (G106S), F40 (S44F), F347 (G62S), F546 (A149T), F650 (P285S) were reported recently40, in addition to the newly isolated Gmshmt08 EMS mutants F1801 (N368T), F1927 (G132D), and F1817 (R257Q) were identified by forward genetics. Two additional Gmshmt08 nonsense mutants F1261 (Q30*) and F234 (Q226*) have been identified to carry a premature stop codon after 30 and 226 amino acid of the GmSHMT08 protein, respectively. The presence of the new induced GmSHMT08 haplotypes resulted in the increase of SCN female indexes in all the identified missense mutants, up to 93.4%, indicating that Forrest mutants became susceptible to SCN, although these Gmshmt08 EMS mutants do not carry any mutations on the other 12 GmSHMT genes (Fig. 7). Thus, none of the other GmSHMT gene family could replace the function of the GmSHMT08c in all the identified EMS mutagenized Gmshmt08c mutants. These findings support the hypothesis of the absence of functional redundancy among the GmSHMT gene family in resistance to SCN.
SNP variants and haplotype analysis of the GmSHMT gene family members
The Soybase database provides a beneficial integrated genetic linkage map tool that can be used to infer whether a gene may belong to a given QTL (Cregan et al., 1999). In the last two decades, 189 SCN QTLs have been mapped and reported in the soybean genome, distributed across 19 out of 20 soybean chromosomes. Extensive search of the GmSHMT genes showed that GmSHMT08c had been consistently mapped across a variety of soybean germplasm and represented the major source of resistance in most soybean cultivars at the Rhg4 locus2,3,41. Beside the GmSHMT08c, the other cytosol-targeted paralog GmSHMT05c, in addition to the other three mitochondria-targeted GmSHMT09m, GmSHMT08m, and GmSHMT18m have also been found in a QTL for resistance to SCN42. However, all four nucleic and chloroplast-targeted GmSHMT13, in addition to the other two mitochondria-targeted GmSHMT14m and GmSHMT02m were mapped in QTL that were linked to other agronomic traits including seed content and composition (oil, protein, etc.), seed yield, and plant development, but none of those have been found within QTL for resistance to SCN (Table 1).
Table 1.
Gene name | Gene ID | Gene position | QTL | QTL position | Parents | Number loci tested | Lod score | Interval length | Reference |
---|---|---|---|---|---|---|---|---|---|
GmSHMT08c | Glyma.08g108900 | Gm08: 08,358,422 – 08,363,343 | SCN | Gm08: 03,828,727 − 08,388,481 | Magellan X PI 404198A | 194 | 5.8 | 25.6 | Guo et al., 2006 |
GmSHMT05c | Glyma.05G152100 | Gm05: 34,563,195 – 34,565,889 | SCN | Gm05: 14,157,044 – 35,074,014 | PI438489B X Hamilton | 115 | ND | 18.3 | Yue et al., 2001 |
GmSHMT13ch | Glyma.13G222300 | Gm13: 33,522,900 – 33,527,302 | Seed protein | Gm13: 31,220,086 – 38,929,324 | Essex X Williams | ND | ND | ND | Hyten et al., 2004 |
GmSHMT04n | Glyma.04G254300 | Gm04: 48,571,800 – 48,577,505 | Seed weight | Gm04: 48,708,390 – 52,389,145 | Charleston X Dong Nong 594 | ND | ND | 2 | Teng et al., 2009 |
GmSHMT06n | Glyma.06G107800 | Gm06: 08,663,690 – 08,668,262 | Pod maturity | Gm06: 07,057,089 – 08,964,865 | Minsoy X Noir 1 | 665 | ND | ND | Specht et al., 2001 |
GmSHMT12n | Glyma.12G170300 | Gm12: 32,509,864 – 32,514,684 | Neutral detergent fiber | Gm12: 32,410,129 – 35,695,155 | PI 483463 X Hutcheson | ND | 10.48 | ND | Asekova et al., 201 |
GmSHMT08n | Glyma.08g187800 | Gm08: 15,060,906 – 15,065,634 | Seed genistein | Gm08: 04,776,921 – 20,678,814 | Zhongdou 27 X Jiunong 20 | 606 | ND | 73.1 | Han et al.,. 2015 |
GmSHMT02m | Glyma.02g217100 | Gm02: 40,402,376 – 40,409,818 | First flower | Gm 02: 31,189,638 – 41,513,786 | JP036034 X Ryuhou | 720 | ND | 13.85 | Kuroda et al., 2013 |
GmSHMT09m | Glyma.09g202000 | Gm09: 42,616,719 – 42,623,320 | SCN | Gm09: 41,477,149 – 47,446,419 | S08-80 X PI 464925B | 118 | 2.08 | 24.67 | Winter et al., 2007 |
Gm09: 42,616,719 – 42,623,320 | Pod number | Gm09: 36,924,281 – 45,989,139 | BARC-8 X Garimpo | 75 | 2.06 | 24.5 | Vieira et al., 2006 | ||
GmSHMT08m | Glyma.08g274400 | Gm08: 36,279,886 – 36,286,158 | SCN | Gm08: 16,438,021 – 40,476,678 | Magellan X PI 567516 C | 252 | 4.3 | 18.55 | Vuong et al., 2010 |
Gm08: 36,279,886 – 36,286,158 | Seed oil plus protein | Gm08: 20,683,774 – 47,837,939 | Charleston X Dong Nong 594 | 164 | 2.6 | 38 | Chen et al., 2007 | ||
GmSHMT18m | Glyma.08g274400 | Gm18: 27,237,675 – 27,243,626 | Seed glycinin | Gm18: 20,151,954 – 48,589,101 | Kefeng No. 1 X Nannong 1138-2 | 221 | ND | 11.11 | Ma et al., 2016 |
Gm18: 27,237,675 – 27,243,626 | SCN | Gm18: 03,729,034 – 50,158,095 | PI438489B X Hamilton | 115 | ND | 26.1 | Yue et al., 2001 | ||
GmSHMT14m | Glyma.08g274400 | Gm14: 44,788,273 – 44,795,824 | Fe effic | Gm14: 40,368,007 – 47,207,943 | Pride B216 X A15 | 102 | 2.7 | 14 | Lin et al., 1997 |
Haplotype diversity at the Rhg4 loci determines the SCN resistance at different re-sequenced soybean germplasms3. In fact, only lines carrying H1, H2, and H3 haplotypes at the GmSHMT08c have been shown to be resistant to SCN3. However, soybean lines carrying other GmSHMT08 haplotypes (i.e. H4, H5, and H8) were susceptible to SCN. The sequence diversity and identification of these Rhg4 haplotypes from a wide range of soybean belonging to the USDA soybean germplasm collection would be beneficial for breeders. In the current study, the possible role of the GmSHMT members in SCN resistance was explored using the natural variations in GmSHMT genes. Correlation with SCN resistance to the five SCN races using whole genome resequencing data (WGRS) was used. To infer the allelic variation in 106 diverse soybean lines43, all GmSHMT genes were analyzed for synonymous and non-synonymous SNPs; premature stop codons; and indels. The WGRS dataset included non-domesticated; semi-domesticated, and elite domesticated introductions belonging to the USDA soybean collection. Interestingly, haplotyping analysis of the 106 lines showed that none of the GmSHMT gene family members presented specific GmSHMT haplotypes that correlate with SCN resistance (Supplementary Figs S7–S17), with the exception of the GmSHMT08c (Fig. 8). In fact, only GmSHMT08c presented two non-synonymous SNPs, resulting in proline to arginine (P130R) and asparagine to tyrosine/serine (N358Y/H) changes in 14 out of 106 lines analyzed (Fig. 8). All 14 soybean lines (belonging to Peking-type of SCN resistance) carrying these non-synonymous SNPs at GmSHMT08c correlated with resistance to SCN.
Discussion
GmSHMT family evolved in early land plant evolution through genome duplication and divergence
The soybean genome contains a high number of SHMT genes (about 18 members), compared to the plant model A. thaliana (7 members), S. lycopersicum (7), or M. truncatula (12) (Supplementary Table S2). Previous studies suggested that segmental duplications, tandem rearrangement, and polyploidy events contributed to their evolution20,21,23,44. The conservation of duplicates is likely the result of functional divergence through sub-functionalization and neo-functionalization under selection pressure imposed by developmental and environmental conditions.
Phylogenetic analyses showed that land plants have four subclasses of GmSHMTs, reflecting their subcellular distribution (cytosol, nucleus, mitochondrion, and chloroplast), which is in agreement with a previous study18. Subcellular localization of selected GmSHMT members confirmed their in-silico prediction and phylogenetic distribution. The existence of SHMT sequences representing all classes in C. reinhardtii suggests that at least three SHMT classes (mitochondria, chloroplast, and a common nucleus/cytosol) evolved early in the ancestor of land plants through duplication and functional divergence. The presence of a single C. reinhardtii (Cre06.g293950) SHMTn/c sequence clustering with nucleus and cytosol SHMT suggests that SHMT from these classes evolved from a single ancestor gene through duplication. An alternative explanation is that one of the copies may get lost after duplication. In addition, the phylogenetic distribution of sequences from the same species suggests that duplication/polyploidization events occurred at the species level and contributed to the duplication of SHMT genes. This is in accordance with previous studies reporting large segmental duplications or whole genome duplication in various plant species45–47.
Structural analysis revealed various patterns of intron/exon loss or gain that may happen during the evolution of the SHMT. It has been reported earlier the presence of an intron loss that occurred within the cytosolic SHMTc18. The current study reveals the presence of several intron loss/gain events that may have occurred early in land plant ancestor.
It is noteworthy that no chloroplast-targeted SHMTs exist in monocotyledon genomes. This is consistent with previous studies reporting the absence of the SHMT activity in wheat chloroplasts14. Chloroplastic SHMT is essential in providing one-carbon units for the biosynthesis of 5,10-methyleneTHF, which is then oxidized by other catalysts9,48,49. The oxidized 5,10-methyleneTHF is essential as a light-harvesting cofactor in plastid-localized cryptochrome50. It’s well documented that the products of photosynthesis feedback into the circadian clocks to set the plant’s rhythm, increase the photosynthesis rate, growth, survival, and competitiveness advantage51–53. This process helps plants regulate physiology and triggers metabolism and stress responses. Recent studies have shown that the circadian clock’s effect on the daily growth rhythms is different between monocots and eudicots54,55. We speculate that the absence of chloroplastic GmSHMTs in monocots could be linked to these reported differences between monocots and eudicots. However, this hypothesis needs further investigation.
Domain variation suggest the presence of altered affinities in nucleic, chloroplastic, and mitochondria targeted GmSHMTs
Structural analysis of the GmSHMT gene family members showed that the four nucleic GmSHMTs did not conserve the same residues involved in subunit assembly, THF and PLP binding if compared to the cytosolic GmSHMTs (Figs 1B and 6D). For instance, it was inferred that the nucleus-targeted GmSHMT may have acquired new substrate affinities, lost their PLP cofactor binding, catalysis, and/or subunit assembly (tetramerization), as it was shown using induced mutagenesis within the H134N, H147N, and H150N mutants of sheep liver serine hydroxymethyltransferase (ScSHMT) corresponding to His121 and His134 residues in soybean GmSHMT0856. Interestingly, both spontaneously occurring mutations at His121 and His134 were found to be located four, two, and four amino acids away from the identified M125I, G132D, and G138S/G138D mutations at the GmSHMT08c gene, respectively. These two mutants from a mutagenized EMS Forrest soybean population lost their resistance to SCN.
Moreover, the Gln77 and Tyr104 residues involved in SHMT dimer interface were not conserved in the chloroplast-targeted GmSHMTs; in addition to the Ala302 and Gln305, which have been demonstrated to be essential for maintaining the SHMT oligomeric structure57,58. Gln77 and Tyr104 natural occurring mutations are located six and two residues away from G71D and G106S Gmshmt08c EMS induced mutants, respectively. Ala302 and Gln305 residues were found to be located three and six residues away from another GmSHMT08c L299F EMS induced Gmshmt08c mutant. Interestingly, all three reported mutations in the resistant Forrest soybean lost their SCN resistance. The Gmshmt08c A302V mutant which corresponds to the Ala302 increases also susceptibility to SCN up to 52.4%. Chloroplastic GmSHMT13 and GmSHMT15 did not conserve Arg257 residue. The Arg257 is important for maintaining SCN resistance in soybean since we observed that the induced EMS R257Q mutation at the GmSHMT08c increases susceptibility to SCN up to 50%.
Furthermore, Lys25, Gln77, Tyr104, and Tyr153 residues involved in SHMT dimer interface were different among the mitochondria and cytosol-targeted GmSHMTs. Tyr153 is found two residues away from the A149T Gmshmt08c mutation, which may have increased SCN resistance up to 10.3%. The mitochondrial GmSHMT09 member did not present the same Tyr59 and Ser190 conserved residues, both involved in PLP binding and catalysis. Tyr59 residue is located two amino acids away from the EMS induced E61K Gmshmt08c mutant, which increases SCN susceptibility up to 60%.
Additionally, in vitro kinetic studies of Gmshmt08c mutated alleles when compared to the Forrest GmSHMT08c allele, used as a positive control, showed differences in enzymatic activity of the GmSHMT08c protein3,40. GmSHMT08c alleles carrying the E61K, G71D, G326E, A302V, and M125I mutations resulted in proteins that are enzymatically inactive, as they were unable to support the growth of bacteria. In contrast, the GmSHMT08c allele from the wild type Forrest supported growth of the mutant bacteria3,40.
Therefore, we hypothesize that the accumulation of these natural spontaneous mutations may alter the function and/or oligomeric structure of the nucleus, mitochondria, and chloroplast-targeted GmSHMT proteins, or may alter their substrate affinities, ultimately leading to their sub-functionalization or/and neo-functionalization contributing to the loss of SCN resistance and gain of new functions3,40,59. Thus, modeling of the GmSHMT enzyme structures of all soybean isozyme classes presented features that may explain the differences in activity and SCN resistance between the GmSHMT classes.
Soybean GmSHMT genes display overlapping responses and divergent functions in resistance to SCN
All four nucleus-targeted GmSHMT members were not induced under SCN infection. Protein modeling showed that most of the domain variations occurred within the nucleus-targeted GmSHMT members only. Thus, we suggest that the high number of domain variations may alter the enzymatic activity and change the affinity of the protein and thus its function toward SCN resistance. Although cytosol, chloroplast, and mitochondria-targeted GmSHMT members were induced in SCN incompatible interaction; only the cytosolic GmSHMT08c has been shown to be involved in resistance to SCN. This was observed within the soluble NSF attachment protein (GmSNAP) gene family, where only the GmSNAP18 was the major gene for SCN resistance. In fact, although four GmSNAP gene members were induced in response to SCN inoculation, GmSNAP14 and GmSNAP02 were not involved in SCN resistance, but only GmSNAP18 and GmSNAP1119. Although the last duplications happened about 13 million years ago23, GmSHMT genes have likely accumulated mutations that led to the divergence of their protein sequences, ultimately leading to a neofunctionalization or subfunctionalization.
Despite the observed divergence in the structure and function within the cytosol, chloroplast, and mitochondria-targeted GmSHMT gene family, the overlapped expression of certain members is most likely due to their response to the same regulatory elements. In fact, although different members of the GmSHMT family have evolved to mediate different functions due to the presence of natural occurring mutations, certain members are still responding to the same stimuli. Similar functional patterns have been observed within the orthologs of the soybean SWEET gene family, which acquired new functions during evolution60. Whereas some SWEET genes have been involved in seed development, other SWEET members play a major role in disease resistance. Furthermore, it has been reported that duplications resulting in multiple gene families represent functional redundancy and/or divergence19,61,62. Previous studies in Arabidopsis have shown that three members (TTL1, TTL3, and TTL4) of the Tetratricopeptide repeat thioredoxin-like (TTL) gene family are required for osmotic stress tolerance presenting an additive effect and are essential for root growth and integrity61. However, the TTL2 member diverged and acquired a novel function in male sporogenesis. This is a good case of gene functional redundancy and gene divergence that occurred within the same gene family. It has been reported that duplication of the CLAVATA gene family follows a process of divergence in the function of their gene members. In fact, GmCLV1A acts on shoot architecture, whereas GmCLV1B (GmNARK) functions in controlling nodule numbers63. In the current study, characterization of the GmSHMT members demonstrates that this gene family diverged over time and gave rise to different GmSHMT members with the absence of functional redundancy in resistance to SCN.
This characteristic has been supported by the SNP variant analysis of the GmSHMT08c that was extended to the rest of the GmSHMT gene family members. In fact, only GmSHMT08c member presented distinct haplotype formed by these SNPs which correlates with resistance and susceptibility to SCN within the 106 soybean lines analyzed. All 14 lines belonging to Peking-type of resistance, which requires in addition to the rhg1-a (GmSNAP18) the presence of the Rhg4 locus (GmSHMT08c), presented different GmSHMT08c haplotype from the susceptible lines that cluster with SCN resistance (Fig. 8). The rest of the lines showing resistance to SCN but carrying susceptible haplotype at the Rhg4 locus (GmSHMT08c) could be explained by their PI88788 source of resistance that does not require the resistant Rhg4 allele (GmSHMT08c); and thus, their resistance is due to the presence of the rhg1-b locus only (Fig. 8), which haplotype has been shown to be different than the rest of soybean SCN susceptible lines3,4.
Although the role of the other GmSHMT members needs to be elucidated, this study revealed that GmSHMT genes might have diverged or acquired new function during the evolution. While several members showed an overlap in their responses to SCN stimulus, only GmSHMT08c function in resistance to SCN. The functional divergence of GmSHMT genes had likely happened through exon skipping, alternative splicing variants, amino acid polymorphism, organ-specific expression, and subcellular localization. Considering the function of soybean GmSHMT08 in SCN resistance, the duplication and retention of SHMT genes in plants suggest that SHMT genes may play a key role in soybean adaptation, which is important for proper responses to changing environments.
Material and Methods
SHMT sequences and phylogenetic analysis
SHMT sequences used in phylogenetic analyses include sequences retrieved from different databases including NCBI, Soybase (W82.a2.v1), and Phytozome (v12.1). Sequences were identified by querying Arabidopsis sequences against sequences from these databases employing tblastn using default parameters. The retrieved sequences were checked for motifs (PLP, THF, catalysis, dimerization, and tetramerization) that are specific to this family. Only sequences with similarity to GmSHMT genes that have the previously mentioned motifs were considered in this study. GmSHMT gene organization (introns and exons), and their predicted amino acid sequences were retrieved from the Phytozome database (v12.1). GmSHMT genomic and corresponding protein sequences from the first splicing presenting the translated amino acid sequences from all the exons have been used in the analyses. We used sequences from sets of plants with fully sequenced genomes representing key positions on the angiosperm phylogenetic tree. Sequences were carefully inspected and corrected for annotation errors before use. The analysis included the GmSHMTs identified in soybean, in addition to SHMTs from eudicot species including the seven cytosolic, mitochondrial, chloroplastic, and nucleic SHMTs from the eudicot model (A. thaliana), other monocots including a monocot model (O. sativa), and the most primitive lineage models including a lycophyte (S. moellendorfii), a moss (P. patens), and a chlorophytic algae (C. reinhardtii).
SHMT nucleotide cDNA sequences were translated into protein sequences. The inferred protein sequences were then aligned using Muscle with default parameters, and manually adjusted. Phylogenetic analyses were performed on the aligned amino acid sequences, in PHYML using the WAG model and assuming among site rate heterogeneity (WAG + G). 1000 bootstrap replicates were run to estimate branch support.
GmSHMT cloning and subcellular localization
The coding sequences of the GmSHMT genes (Glyma.02G217100, Glyma.05g152100, Glyma.08G108900, Glyma.06g107800, Glyma.08G187800, Glyma.13G222300, Glyma.14G184500, and Glyma.15G089900) were amplified from Forrest cDNA using forward and reverse primers containing EcoRI or HindIII and SalI restriction enzyme sites, respectively. PCR products were digested and then ligated to the N-terminus of the yellow fluorescent protein (YFP) reporter gene in the pSAT6-EYFP-N1 vector. The fusion constructs were verified by sequencing. Gold particles were coated with plasmid DNA and delivered into onion epidermal cells using biolistic bombarded as previously described47. The bombarded onion epidermal peels were kept in the dark at 26 °C for at least 20 hours before being visualized using EVOS® FL Auto Cell Imaging System (Life Technologies) to determine the subcellular localization of the fused proteins.
qRT-PCR analysis of GmSHMT gene family members
Soybean seedlings from the SCN susceptible line Essex and from the SCN resistant line Forrest were grown in autoclaved silt loam and sandy soil in the growth chamber for one week and then infected with eggs from the PA3 population. Total RNA was isolated from root samples after three, five, and ten days following SCN infection as described previously19. Experiments were repeated threefold with similar results. Results from one biological replicate are shown. All presented results were performed with the analysis of variance ANOVA, using JMP Pro V12 software as described earlier19,64. Primers used for qRT-PCR are listed in Supplemental Table S3.
Homology modeling of GmSHMT08c and mutational analysis
Homology modeling of a putative asymmetric homotetrameric SHMT protein structure was conducted with Deepview and Swiss Model Workspace software using the SHMT protein sequence from Forrest and the available SHMT crystal structure from Homo sapiens as a template; PDB accession 1BJ4 chain A65. All THF and PLP binding sites, active sites, and catalysis residues retrieved from NCBI conserved domain database were modeled against this template with a sequence identity of 60%59. GmSHMT EMS induced and natural mutations and haplotypes identified were then mapped onto the model.
Development of the EMS mutagenesis Forrest population
The soybean c.v. Forrest seed was from Southern Illinois University-Carbondale Agricultural Research Center, and was used to develop an EMS mutagenized population. The wild type Forrest seed was mutagenized with 0.6% (w/v) EMS as described by66, and planted to harvest 1,536 and 2,827 M2 families of seed in 2011 and 2013 respectively, which was then advanced to M3 generation in 2014.
SCN-infection phenotyping and forward genetics screeening
SCN-infection screening was performed on the EMS induced Forrest mutants, as described by67. More than 3,000 mutant families from both 2011 and 2013 M2 generations were screened for SCN resistance. Eighteen EMS mutants identified from Forrest lost their resistance to SCN. All eighteen mutants carried missense (16 mutants) or nonsense (2 mutants) mutations at the GmSHMT08c, but not on the rest of the GmSHMT members.
Haplotype clustering analysis
The whole genome re-sequencing data43 of 106 diverse soybean lines (15X coverage) was utilized to identify allelic variants and haplotype in GmSHMT gene family. SNP based haplotypes were examined by generating map and genotype data files using TASSEL 5.0 program and clustering pictorial output for SACPD gene. The clusters were created and visualized using FLAPJACK software68 as described in details previously69. Additionally, the phenotypic data for soybean cyst nematode screening was obtained from Nguyen Lab (unpublished) and clustered with SNP matrix. Transcript sequence-based annotation (W82.a2. v1) was used to classify synonymous and non-synonymous SNPs by translation into amino acid sequences.
Supplementary information
Acknowledgements
This research was supported by the Southern Illinois University graduate school fellowship, in part from the United Soybean Board to KM, by the grant# W911NF-11-1-0178 to MAK from the Department of Defense (DoD), by industrial grants to DAL. Support was also provided by a startup fund from the University of South Dakota to AB and was partially supported by funds from the Tennessee Soybean Promotion Board to TH. We also acknowledge Dr. Nicholas Provart for granting us the permission to use the images created by the open-source eFP Browser software available at Utoronto.ca website.
Author Contributions
N.L. drafted the manuscript, performed structural and synteny analyses, haplotyping studies, performed protein homology modeling, mutational analysis, and genotyping. N.L. and Z.Z. performed SCN phenotyping and EMS mutagenesis. N.L. and A.b.B. performed phylogenetic, comparative genomic, and expression analyses. G.P. and H.N. performed haplotype analysis and phenotypic clustering using soybean whole genome sequence data and edited the manuscript. A.z.B., M.A.K., and Z.Z. performed data analysis. M.A.K., D.A.L., A.b.B., and K.M. edited the manuscript. T.H. and S.P. did the subcellular localization and edited the manuscript. N.L., A.b.B., and K.M. conceived the project, designed, and planned the experiment. K.M. supervised students and oversea the work.
Data Availability
The developed EMS mutagenized Forrest mutants are the property of Southern Illinois University but released to all requestors. The new serine hydroxylmethyltransferase (SHMT; EC 2.1.2.1) alleles are deposited at NCBI. GenBank accession numbers are listed in Supplemental Table S4.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Gunvant Patil, Sarbottam Piya and Zhou Zhou contributed equally.
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-018-37815-w.
References
- 1.Koenning, S. R. & Wrather, J. A. Suppression of soybean yield potential in the continental United States by plant diseases from 2006 to 2009. Plant Health Progress, 10.1094/PHP-2010-122-01-RS (2010).
- 2.Meksem K, et al. ‘Forrest’ resistance to the soybean cyst nematode is bigenic: saturation mapping of the Rhg1and Rhg4 loci. Theoretical and Applied Genetics. 2001;103:710–717. doi: 10.1007/s001220100597. [DOI] [Google Scholar]
- 3.Liu S, et al. soybean cyst nematode resistance gene points to a new mechanism of plant resistance to pathogens. Nature. 2012;492:256–260. doi: 10.1038/nature11651. [DOI] [PubMed] [Google Scholar]
- 4.Liu S, et al. The soybean GmSNAP18 gene underlies two types of resistance to soybean cyst nematode. Nature Communications. 2017;8:14822. doi: 10.1038/ncomms14822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Schirch L. Serine hydroxymethyltransferase. Advances in enzymology and related areas of molecular biology. 1982;53:83–112. doi: 10.1002/9780470122983.ch3. [DOI] [PubMed] [Google Scholar]
- 6.Appaji Rao N, Ambili M, Jala VR, Subramanya HS, Savithri HS. Structure-function relationship in serine hydroxymethyltransferase. Biochimica et biophysica acta. 2003;1647:24–29. doi: 10.1016/S1570-9639(03)00043-8. [DOI] [PubMed] [Google Scholar]
- 7.Stover PV. S. Serine Hydroxymethyltransferase Catalyzes the Hydrolysis of 5,10-Methyltetrahydrofolate to 5-Formyltetrahydrofolate. The Journal of biological chemistry. 1990;265:14227–14233. [PubMed] [Google Scholar]
- 8.Narkewicz MR, Sauls SD, Tjoa SS, Teng C, Fennessey PV. Evidence for intracellular partitioning of serine and glycine metabolism in Chinese hamster ovary cells. The Biochemical journal. 1996;313(Pt 3):991–996. doi: 10.1042/bj3130991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hanson AD, Gage DA, Shachar-Hill Y. Plant one-carbon metabolism and its engineering. Trends in plant science. 2000;5:206–213. doi: 10.1016/S1360-1385(00)01599-5. [DOI] [PubMed] [Google Scholar]
- 10.Skibola CF, et al. Polymorphisms in the thymidylate synthase and serine hydroxymethyltransferase genes and risk of adult acute lymphocytic leukemia. Blood. 2002;99:3786–3791. doi: 10.1182/blood.V99.10.3786. [DOI] [PubMed] [Google Scholar]
- 11.Lim U, et al. Polymorphisms in cytoplasmic serine hydroxymethyltransferase and methylenetetrahydrofolate reductase affect the risk of cardiovascular disease in men. The Journal of nutrition. 2005;135:1989–1994. doi: 10.1093/jn/135.8.1989. [DOI] [PubMed] [Google Scholar]
- 12.Amelio I, Cutruzzolá F, Antonov A, Agostini M, Melino G. Serine and glycine metabolism in cancer. Trends in biochemical sciences. 2014;39:191–198. doi: 10.1016/j.tibs.2014.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Zhang Y, Sun K, Sandoval FJ, Santiago K, Roje S. One-carbon metabolism in plants: characterization of a plastid serine hydroxymethyltransferase. The Biochemical journal. 2010;430:97–105. doi: 10.1042/bj20100566. [DOI] [PubMed] [Google Scholar]
- 14.Gardeström P, Edwards GE, Henricson D, Ericson I. The localization of serine hydroxymethyltransferase in leaves of C3 and C4 species. Physiologia Plantarum. 1985;64:29–33. doi: 10.1111/j.1399-3054.1985.tb01208.x. [DOI] [Google Scholar]
- 15.Turner SR, Ireland R, Morgan C, Rawsthorne S. Identification and localization of multiple forms of serine hydroxymethyltransferase in pea (Pisum sativum) and characterization of a cDNA encoding a mitochondrial isoform. The Journal of biological chemistry. 1992;267:13528–13534. [PubMed] [Google Scholar]
- 16.Besson V, Neuburger M, R´ebeill´e F, Douce R. Evidence for three serinehydroxymethyltransferases in green leaf cells: purification and characterization of the mitochondrial and chloroplastic isoforms. Plant physiology. 1995;33:665–673. [Google Scholar]
- 17.Neuburger M, Rebeille F, Jourdain A, Nakamura S, Douce R. Mitochondria are a major site for folate and thymidylate synthesis in plants. The Journal of biological chemistry. 1996;271:9466–9472. doi: 10.1074/jbc.271.16.9466. [DOI] [PubMed] [Google Scholar]
- 18.Wu, X.-Y. et al. Soybean Cyst Nematode Resistance Emerged via Artificial Selection of Duplicated Serine HydroxymethyltransferaseGenes. Frontiers in Plant Science 7, 10.3389/fpls.2016.00998 (2016). [DOI] [PMC free article] [PubMed]
- 19.Lakhssassi N, et al. Characterization of the Soluble NSF Attachment Protein gene family identifies two members involved in additive resistance to a plant pathogen. Scientific Reports. 2017;7:45226. doi: 10.1038/srep45226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Yin G, et al. The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups. BMC plant biology. 2013;13:148. doi: 10.1186/1471-2229-13-148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zhu Y, et al. Soybean (Glycine max) expansin gene superfamily origins: segmental and tandem duplication events followed by divergent selection among subfamilies. BMC plant biology. 2014;14:93. doi: 10.1186/1471-2229-14-93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Singh VK, Jain M. Genome-wide survey and comprehensive expression profiling of Aux/IAA gene family in chickpea and soybean. Front Plant Sci. 2015;6:918. doi: 10.3389/fpls.2015.00918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Schmutz, J. et al. Genome sequence of the palaeopolyploid soybean. Nature463, 178–183, http://www.nature.com/nature/journal/v463/n7278/suppinfo/nature08670_S1.html (2010). [DOI] [PubMed]
- 24.Lakhssassi, N. et al. Characterization of the FAD2 Gene Family in Soybean Reveals the Limitations of Gel-Based TILLING in Genes with High Copy Number. Frontiers in Plant Science8, 10.3389/fpls.2017.00324 (2017). [DOI] [PMC free article] [PubMed]
- 25.Goodstein DM, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Research. 2012;40:D1178–D1186. doi: 10.1093/nar/gkr944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Edgar AJ. Mice have a transcribed L-threonine aldolase/GLY1 gene, but the human GLY1 gene is a non-processed pseudogene. BMC genomics. 2005;6:32–32. doi: 10.1186/1471-2164-6-32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.He ZH, Cheeseman I, He D, Kohorn BD. A cluster of five cell wall-associated receptor kinase genes, Wak1-5, are expressed in specific organs of Arabidopsis. Plant molecular biology. 1999;39:1189–1196. doi: 10.1023/a:1006197318246. [DOI] [PubMed] [Google Scholar]
- 28.Tsugama D, Liu S, Takano T. A bZIP protein, VIP1, is a regulator of osmosensory signaling in Arabidopsis. Plant physiology. 2012;159:144–155. doi: 10.1104/pp.112.197020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Tang H, et al. Synteny and Collinearity in Plant Genomes. Science (New York, N.Y.) 2008;320:486–488. doi: 10.1126/science.1153917. [DOI] [PubMed] [Google Scholar]
- 30.Tang H, et al. Unraveling ancient hexaploidy through multiply-aligned angiosperm gene maps. Genome Res. 2008;18:1944–1954. doi: 10.1101/gr.080978.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lee T-H, Tang H, Wang X, Paterson AH. PGDD: a database of gene and genome duplication in plants. Nucleic acids research. 2012;41:D1152–D1158. doi: 10.1093/nar/gks1104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Delaux PM, et al. Algal ancestor of land plants was preadapted for symbiosis. Proc. Natl. Acad. Sci. USA. 2015;112:13390–13395. doi: 10.1073/pnas.1515426112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Patil, G. & Nicander, B. Identification of two additional members of the tRNA isopentenyltransferase family in Physcomitrella patens. Plant molecular biology82, 10.1007/s11103-013-0072-x (2013). [DOI] [PubMed]
- 34.Severin AJ, et al. RNA-Seq Atlas of Glycine max: a guide to the soybean transcriptome. BMC plant biology. 2010;10:160. doi: 10.1186/1471-2229-10-160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Benedito VA, et al. A gene expression atlas of the model legume Medicago truncatula. The Plant journal: for cell and molecular biology. 2008;55:504–513. doi: 10.1111/j.1365-313X.2008.03519.x. [DOI] [PubMed] [Google Scholar]
- 36.Matas AJ, et al. Tissue- and cell-type specific transcriptome profiling of expanding tomato fruit provides insights into metabolic and regulatory specialization and cuticle formation. The Plant cell. 2011;23:3893–3910. doi: 10.1105/tpc.111.091173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Schmid M, et al. A gene expression map of Arabidopsis thaliana development. Nature genetics. 2005;37:501–506. doi: 10.1038/ng1543. [DOI] [PubMed] [Google Scholar]
- 38.Winter D, et al. An “Electronic Fluorescent Pictograph” browser for exploring and analyzing large-scale biological data sets. PLoS One. 2007;2:e718. doi: 10.1371/journal.pone.0000718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Marchler-Bauer A, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017;45:D200–d203. doi: 10.1093/nar/gkw1129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kandoth, P. K. et al. Systematic mutagenesis of serine hydroxymethyltransferase reveals essential role in nematode resistance. Plant physiology, 10.1104/pp.17.00553 (2017). [DOI] [PMC free article] [PubMed]
- 41.Concibido VC, Diers BW, Arelli PR. A Decade of QTL Mapping for Cyst Nematode Resistance in Soybean. Crop Science. 2004;44:1121–1131. doi: 10.2135/cropsci2004.1121. [DOI] [Google Scholar]
- 42.Yue P, Arelli PR, Sleper DA. Molecular characterization of resistance to Heterodera glycines in soybean PI 438489B. Theoretical and Applied Genetics. 2001;102:921–928. doi: 10.1007/s001220000453. [DOI] [Google Scholar]
- 43.Valliyodan, B. et al. Landscape of genomic diversity and trait discovery in soybean. Scientific Reports6, 23598, 10.1038/srep23598 https://www.nature.com/articles/srep23598#supplementary-information (2016). [DOI] [PMC free article] [PubMed]
- 44.Clarindo WR, de Carvalho CR, Alves BMG. Mitotic evidence for the tetraploid nature of Glycine max provided by high quality karyograms. Plant Systematics and Evolution. 2007;265:101–107. doi: 10.1007/s00606-007-0522-5. [DOI] [Google Scholar]
- 45.Hanada K, Zou C, Lehti-Shiu MD, Shinozaki K, Shiu SH. Importance of lineage-specific expansion of plant tandem duplicates in the adaptive response to environmental stimuli. Plant physiology. 2008;148:993–1003. doi: 10.1104/pp.108.122457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Dassanayake M, et al. The genome of the extremophile crucifer Thellungiella parvula. Nature genetics. 2011;43:913–918. doi: 10.1038/ng.889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Liu J, et al. Soybean kinome: functional classification and gene expression patterns. Journal of experimental botany. 2015;66:1919–1934. doi: 10.1093/jxb/eru537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Luo M, Orsi R, Patrucco E, Pancaldi S, Cella R. Multiple transcription start sites of the carrot dihydrofolate reductase-thymidylate synthase gene, and sub-cellular localization of the bifunctional protein. Plant molecular biology. 1997;33:709–722. doi: 10.1023/A:1005798207693. [DOI] [PubMed] [Google Scholar]
- 49.Zrenner R, Stitt M, Sonnewald U, Boldt R. Pyrimidine and purine biosynthesis and degradation in plants. Annual review of plant biology. 2006;57:805–836. doi: 10.1146/annurev.arplant.57.032905.105421. [DOI] [PubMed] [Google Scholar]
- 50.Huang Y, et al. Crystal structure of cryptochrome 3 from Arabidopsis thaliana and its implications for photolyase activity. Proc. Natl. Acad. Sci. USA. 2006;103:17701–17706. doi: 10.1073/pnas.0608554103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Blasing OE, et al. Sugars and circadian regulation make major contributions to the global regulation of diurnal gene expression in Arabidopsis. The Plant cell. 2005;17:3257–3281. doi: 10.1105/tpc.105.035261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Dodd AN, et al. Plant circadian clocks increase photosynthesis, growth, survival, and competitive advantage. Science. 2005;309:630–633. doi: 10.1126/science.1115581. [DOI] [PubMed] [Google Scholar]
- 53.Dalchau N, et al. The circadian oscillator gene GIGANTEA mediates a long-term response of the Arabidopsis thaliana circadian clock to sucrose. Proc. Natl. Acad. Sci. USA. 2011;108:5104–5109. doi: 10.1073/pnas.1015452108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Covington MF, Maloof JN, Straume M, Kay SA, Harmer SL. Global transcriptome analysis reveals circadian regulation of key pathways in plant growth and development. Genome biology. 2008;9:R130. doi: 10.1186/gb-2008-9-8-r130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Campoli C, Shtaya M, Davis SJ, von Korff M. Expression conservation within the circadian clock of a monocot: natural variation at barley Ppd-H1 affects circadian expression of flowering time genes, but not clock orthologs. BMC plant biology. 2012;12:97. doi: 10.1186/1471-2229-12-97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Jagath JR, Sharma B, Rao NA, Savithri HS. The role of His-134, -147, and -150 residues in subunit assembly, cofactor binding, and catalysis of sheep liver cytosolic serine hydroxymethyltransferase. The Journal of biological chemistry. 1997;272:24355–24362. doi: 10.1074/jbc.272.39.24355. [DOI] [PubMed] [Google Scholar]
- 57.Jala VR, Ambili M, Prakash V, Rao NA, Savithri HS. Disruption of distal interactions of Arg 262 and of substrate binding to Ser 52 affect catalysis of sheep liver cytosolic serine hydroxymethyltransferase. Indian journal of biochemistry & biophysics. 2003;40:226–237. [PubMed] [Google Scholar]
- 58.Vivoli M, et al. Role of a conserved active site cation-pi interaction in Escherichia coli serine hydroxymethyltransferase. Biochemistry. 2009;48:12034–12046. doi: 10.1021/bi901568b. [DOI] [PubMed] [Google Scholar]
- 59.Lakhssassi, N. et al. Physical and Functional Interactions Between the Soybean Cyst Nematode Resistance Proteins GmSHMT08 and GmSNAP18. Under review, Plant Cell (2017).
- 60.Patil G, et al. Soybean (Glycine max) SWEET gene family: insights through comparative genomics, transcriptome profiling and whole genome re-sequence analysis. BMC Genomics. 2015;16:520. doi: 10.1186/s12864-015-1730-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Lakhssassi N, et al. The Arabidopsis tetratricopeptide thioredoxin-like gene family is required for osmotic stress tolerance and male sporogenesis. Plant Physiol. 2012;158:1252–1266. doi: 10.1104/pp.111.188920. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Perez-Perez JM, et al. Functional Redundancy and Divergence within the Arabidopsis RETICULATA-RELATED Gene Family. Plant Physiol. 2013;162:589–603. doi: 10.1104/pp.113.217323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Mirzaei S, et al. Neodiversification of homeologous CLAVATA1-like receptor kinase genes in soybean leads to distinct developmental outcomes. Sci Rep. 2017;7:8878. doi: 10.1038/s41598-017-08252-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Lakhssassi, N. et al. Stearoyl-acyl carrier protein desaturase mutations uncover an impact of stearic acid in leaf and nodule structure. Plant physiology, 10.1104/pp.16.01929 (2017). [DOI] [PMC free article] [PubMed]
- 65.Renwick SB, Snell K, Baumann U. The crystal structure of human cytosolic serine hydroxymethyltransferase: a target for cancer chemotherapy. Structure (London, England: 1993) 1998;6:1105–1116. doi: 10.1016/S0969-2126(98)00112-9. [DOI] [PubMed] [Google Scholar]
- 66.Meksem, K. et al. In The Handbook of Plant Functional Genomics 251–265 (Wiley-VCH Verlag GmbH & Co. KGaA, 2008).
- 67.Liu X, et al. Soybean cyst nematode resistance in soybean is independent of the Rhg4 locus LRR-RLK gene. Funct Integr Genomics. 2011;11:539–549. doi: 10.1007/s10142-011-0225-4. [DOI] [PubMed] [Google Scholar]
- 68.Milne I, et al. Flapjack—graphical genotype visualization. Bioinformatics. 2010;26:3133–3134. doi: 10.1093/bioinformatics/btq580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Patil G, et al. Genomic-assisted haplotype analysis and the development of high-throughput SNP markers for salinity tolerance in soybean. Scientific Reports. 2016;6:19199. doi: 10.1038/srep19199. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The developed EMS mutagenized Forrest mutants are the property of Southern Illinois University but released to all requestors. The new serine hydroxylmethyltransferase (SHMT; EC 2.1.2.1) alleles are deposited at NCBI. GenBank accession numbers are listed in Supplemental Table S4.