Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2014 Aug 22;13(2):177–187. doi: 10.1111/pbi.12238

The QQS orphan gene of Arabidopsis modulates carbon and nitrogen allocation in soybean

Ling Li 1,*, Eve Syrkin Wurtele 1,*
PMCID: PMC4345402  PMID: 25146936

Abstract

The genome of each species contains as high as 8% of genes that are uniquely present in that species. Little is known about the functional significance of these so-called species specific or orphan genes. The Arabidopsis thaliana gene Qua-Quine Starch (QQS) is species specific. Here, we show that altering QQS expression in Arabidopsis affects carbon partitioning to both starch and protein. We hypothesized QQS may be conserved in a feature other than primary sequence, and as such could function to impact composition in another species. To test the potential of QQS in affecting composition in an ectopic species, we introduced QQS into soybean. Soybean T1 lines expressing QQS have up to 80% decreased leaf starch and up to 60% increased leaf protein; T4 generation seeds from field-grown plants contain up to 13% less oil, while protein is increased by up to 18%. These data broaden the concept of QQS as a modulator of carbon and nitrogen allocation, and demonstrate that this species-specific gene can affect the seed composition of an agronomic species thought to have diverged from Arabidopsis 100 million years ago.

Keywords: QQS, orphan, carbon and nitrogen allocation, protein, starch, Glycine max

Introduction

The ability to optimize protein productivity of plant-based foods could have far-ranging impacts to both world health and to sustainability (Godfray et al., 2010; Heitschmidt et al., 1996; Pimentel and Pimentel, 2003). Dietary protein is essential for animals, whereas photosynthetic organisms can biosynthesize all amino acids required for protein synthesis. Over four billion of the seven billion people on our planet obtain the majority of their dietary protein from plants (Pimentel and Pimentel, 2003; Young and Pellett, 1994). However, for many people, protein intake is insufficient, and its deficiency results in mental retardation, stunting of growth, and greatly increased susceptibility to disease, predominantly affecting children (Gomes et al., 2009; Muller and Krawinkel, 2005; Victora et al., 2010).

Because plants are solar-powered heterotrophs in the food web, consumption of plant-derived proteins has far less impact on the environment than consumption of animal protein sources, especially considering the earth's dwindling water resources (Pimentel and Pimentel, 2003). Thus, increasing the use of plants as a protein source, rather than animals, would have a major ecological significance.

Plant composition is determined by a metabolic network that mediates the conversion of imported, photosynthetically-derived carbon and nitrogen into protein, oil and carbohydrate. Many of the pathways by which plants synthesize or degrade protein, oil and starch have been delineated; however, much less is understood about the mechanisms that integrate these pathways and that regulate carbon and nitrogen allocation to and within this network (Eastmond, 2006; Eastmond et al., 1997; Higashi et al., 2006; Li et al., 2009, 2012; Schiltz et al., 2004; Sulpice et al., 2013). Understanding this process holistically is a major biological challenge.

To identify genes that impact plant composition, our strategy leveraged the model species Arabidopsis. Based on the postulate that the basic molecular genetic mechanisms for homeostasis are conserved across species (Li et al., 2009), we selected Arabidopsis single-gene mutants that appeared morphologically like the wild type (WT) control, but differed in composition, and then determined the transcripts whose expression was impacted in the mutants. We anticipated identifying a combination of metabolic and regulatory genes by this approach. The ss3 knockout mutant of Arabidopsis is high in starch, but has a normal morphological phenotype (Zhang et al., 2005, 2008); among the genes whose expression is altered in Atss3 mutants relative to WT plants is Qua-Quine Starch (QQS, locus At3g30720) (Li et al., 2009). Reduction of QQS expression in transgenic QQS RNAi lines of Arabidopsis resulted in plants that were morphologically indistinguishable from control lines, but that expressed a 15%–30% increase in leaf starch content (Li et al., 2009).

This study further defines how QQS functions in Arabidopsis, showing that altering expression of QQS in either over-expression or QQS RNAi lines results in shifts in leaf protein and starch content.

The QQS gene encodes a protein of only 59 amino acids whose homolog is not identifiable by primary sequence comparisons to any other sequenced species, not even the closely related Brassica napus (Li et al., 2009) or Arabidopsis lyrata; as such, QQS is considered an orphan gene. Orphan genes (also referred to as species-specific genes, or, in prokaryotes, ORFans) can be defined as genes that encode proteins that are unique to a given species, having no identifiable sequence homologs in other species (Gollery et al., 2006, 2007; Li et al., 2009). The concept of orphan genes was first described by Fischer and Eisenberg in 1999 from studies of microbial genomes (Fischer and Eisenberg, 1999).

Although many have predicted that genes considered species specific would later turn out to be an artifact of sparse genome sequence, this has proved not to be the case (Arendsee et al., 2014; Gollery et al., 2006, 2007; Marsden et al., 2006; Neme and Tautz, 2013; Silveira et al., 2013; Tautz and Domazet-Loso, 2011). Orphan genes appear to be present in all species, and represent a significant fraction (approximately 0.5% to >8%) of analysed eukaryotic and prokaryotic genomes. The function of most orphan genes is obscure; however, they have been considered to be a determinant of species character (Gollery et al., 2006; Tautz and Domazet-Loso, 2011).

Orphan genes are thought to arise by multiple mechanisms (Carvunis et al., 2012; Donoghue et al., 2011; Tautz and Domazet-Loso, 2011; Wissler et al., 2013). Nongenic sequence can be defined as the sequence that is not a part of an organism's genes (Adler, 1992). In Saccharomyces cervesiae, nongenic sequences have been shown to be transcribed widely (Nagalakshmi et al., 2008) and some is also translated (Carvunis et al., 2012). Genes could arise de novo from such nongenic sequences via a noncoding or coding proto-gene that becomes more stabilized during evolution into an orphan gene (Carvunis et al., 2012). Orphan genes also arise via pre-existing genes, whose sequence can be highly modified by combinations of gene duplication, domain shuffling, shifting of location of translation frames and subsequent diversification (Carvunis et al., 2012; Ohno, 1987; Tautz and Domazet-Loso, 2011). The lack of even remote footprints of any A. thaliana genic sequence within QQS indicates it is likely an orphan gene that arose de novo (Silveira et al., 2013).

Another feature of orphans may be a general lack of co-expression with other genes. One way to consider this feature is to evaluate their representation in regulons. Regulons of eukaryotes can be defined as clusters of genes that have a prevailing pattern of co-expression across multiple/thousands of diverse conditions (Biehl et al., 2005; Feng et al., 2012; Mentzen et al., 2008); these have been detailed for the Arabidopsis transcriptome using a Markov chain cluster (MCL) approach (Mentzen and Wurtele, 2008). Only some of the genes of an organism can be classified into regulons; others do not have a prevailing pattern of co-expression with other genes (Feng et al., 2012; Mentzen and Wurtele, 2008). The Arabidopsis transcriptome can be partitioned into 872 regulons; about 49% of all Arabidopsis genes are members of these regulons (Mentzen and Wurtele, 2008). However, only 4.3% of orphans are members of regulons. Thus orphans are highly underrepresented among the co-expressed gene clusters.

We conjecture that some orphan genes might have arisen and stabilized because they confer a selective advantage by interacting with a previously existing protein. This previously existing protein could be relatively conserved, that is, present in many lineages. Because QQS influences composition, a process critical to plants in general, our working hypothesis is that QQS contains structural features that would interact with a conserved protein, such that introduction of the QQS gene into another species would influence its compositional traits.

We tested the hypothesis that QQS could function to regulate composition in another species by introducing the QQS gene into the major food crop, soybean. Here, we demonstrate the expression of the QQS gene in soybean increases leaf and seed protein and decreases leaf and seed carbohydrate. These results reveal that expression of the QQS gene increases carbon and nitrogen allocation to protein and that it can exert this function even in an ectopic species.

Results

Arabidopsis plants over-expressing and under-expressing QQS have altered carbon and nitrogen allocation to protein and starch

Our findings that down-regulation of QQS increases starch content in Arabidopsis (Li et al., 2009) might indicate that altered QQS impacts starch content without changing other aspects of leaf composition. Alternately, QQS might have a more general effect on carbon and nitrogen allocation, affecting protein and/or lipid content. To distinguish between these alternatives, we evaluated the protein and starch content of transgenic Arabidopsis plants that either over-expressed [under control of the constitutive cauliflower mosaic virus (CaMV) 35S promoter, see Figure S1] or suppressed (using QQS RNAi, Li et al., 2009) the accumulation of the QQS coding sequence. A total of ten independent QQS over-expression mutant lines and four independent QQS RNAi mutant lines were grown and evaluated using a randomized complete block design. The visual phenotype of these transgenic Arabidopsis lines appeared identical to the control lines throughout development, from seedlings to senescence (Figure1a,b). However, leaf starch content in lines that over-expressed QQS (QQS-OE) was decreased by up to 23% (Figure S2). Conversely, QQS RNAi lines showed an increase in leaf starch content (Figures1b and S2), consistent with our previous publication (Li et al., 2009). Leaf protein content was altered in each of the QQS-OE and QQS RNAi mutant lines that we tested; specifically, protein content was increased by about 3% in QQS-OE mutants and decreased by 3%–7% in QQS RNAi mutants (Figure1c). These data indicate that QQS acts either directly or indirectly as a regulator of carbon and nitrogen metabolism and affects not only the accumulation of starch, but also protein.

Figure 1.

Figure 1

Phenotype and leaf composition of transgenic lines of Arabidopsis with down-regulation or over-expression of Qua-Quine Starch (QQS). Seedling shoots of QQS RNAi plants (Li et al., 2009) derived from four independent transformation events, and QQS-OE plants derived from ten independent transformation events, together with wild type (WT) control plants, were sampled at the end of the light period. QQS RNAi plants are known to have increased starch (Li et al., 2009). (a) All of these lines of transgenic QQS RNAi and QQS-OE plants are similar in morphology to WT plants throughout development. This phenotype (i.e. the QQS-OE plants were indistinguishable from WT in appearance) contrasts with the 35S:QQS transgenic phenotype of slower growth and rounded leaves that was described by Seo et al. (2011); possibly this is due to a difference in experimental conditions in which the plant were grown. (b) Starch staining shows increased starch in QQS RNAi mutants and decreased starch in QQS-OE mutants compared with WT. (c) Leaf protein is significantly decreased in QQS RNAi mutants and increased in QQS-OE mutants compared with WT. QQS mutants are each in the T2 generation, and each independent transformation events are designated by T2-independent transformation event #. All data in bar charts show mean ± SE (standard error), n = 3. Student's t-test was used to compare QQS RNAi and QQS-OE with WT, *P < 0.05; **P < 0.01.

The BLink algorithm (NCBI, http://www.ncbi.nlm.nih.gov/sutils/blink.cgi?mode=query) identifies 1155 orphan genes in Arabidopsis, based on the gene models described in Arabidopsis using the TAIR10 genome release (http://ftp://ftp.arabidopsis.org/home/tair/Genes/TAIR10_genome_release/), including 30 mitochondrial genes (as dated on September 23, 2013). In addition to these 1115 A. thaliana-specific genes, 839 genes can be identified using BLink that are unique to A. thaliana and A. lyrata but not identifiable outside of the Arabidopsis genus; these can be referred to as clade-specific genes or genus-specific genes, rather than the more restrictive term, orphans.

Orphans are typically shorter than typical genes (Amiri et al., 2003; Knowles and McLysaght, 2009; Wu et al., 2011). In A. thaliana, the median length of the predicted protein models from the 1155 orphan genes is 57 amino acid (range, 16–445 amino acid) while the median length of predicted protein models from all genes is 349 amino acid (range, 16–5393 amino acid) (Figure2). When an alternate, more inclusive, algorithm for orphan genes is used, the predicted proteins have a similarly short length (Gollery et al., 2006).

Figure 2.

Figure 2

Distribution of orphan protein lengths in Arabidopsis. The median length of the predicted protein models from Arabidopsis thaliana orphan genes (blue bar) is smaller (57 amino acid) than the median length of all A. thaliana predicted protein models (orange bar) (349 amino acid) (range of orphan protein models: 16–445 amino acid; range of all protein models: 16–5393 amino acid). The X-axis represents the protein length (in amino acids); the Y-axis represents the number of genes. Orphan genes are predicted by NCBI BLink (http://www.ncbi.nlm.nih.gov/sutils/blink.cgi?mode=query). The insert provides a magnified view of the distribution of the sizes of orphan genes.

Eukaryotic orphan genes generally tend to have greater tendency of being associated with transposable elements, relatively low expression that is organ specific, and less helical and sheet structure (as predicted computationally) (Arendsee et al., 2014; Donoghue et al., 2011; Wilson et al., 2007). QQS has only some of these characteristics. Like a typical orphan gene, it is a short peptide (Figure2) with a GC content of 40%, slightly higher than the average for Arabidopsis genes (this average GC content is 36% for Arabidopsis, Mishra et al., 2009). QQS is embedded in a neighbourhood of chromosome 3 that is highly enriched in transposons (Figure S3) (Li et al., 2009) and generally unusual. In the 3′ direction, QQS is 2.5 kb from At3g30725, a ‘glutamine dumper-like’ protein of unclear function; however, in the 5′ direction, QQS is a surprising 77.6 kb from the nearest gene, At3g30705, which is also an orphan gene and has no known function. The sequence immediately 5′ of the QQS start site contains multiple 5′ flanking siRNA repeated elements (Lister et al., 2008; Silveira et al., 2013).

Unlike what has been reported as general orphan characteristics, QQS expression in Arabidopsis ecotype Columbia (Col-0) is relatively not low and QQS is expressed in most plant organs; furthermore, its expression responds strongly to genetic and environmental perturbations (Li et al., 2009). We evaluated qualitatively whether QQS expression is affected under two environmental conditions that are known to increase starch accumulation in Arabidopsis: high sucrose (Aloni et al., 1997) and low temperature (Espinoza et al., 2010) using transgenic plants containing the QQS promoter driving the GUS coding sequence (Figure S4). QQS expression is decreased in plants grown in medium contain 5% sucrose compared with medium without added sucrose. QQS expression is also decreased in plants growing at 4 °C compared with 22 °C. These results further demonstrate the strong sensitivity of QQS expression to environmental perturbations.

Disorder prediction tools (http://www.disprot.org/predictors.php) indicate that QQS protein has a disordered N-terminal tail (approximately 20 first residues). The remainder of QQS protein is predicted to contain two α helixes.

Soybean plants expressing QQS have increased leaf protein

To test the hypothesis that QQS could affect carbon and nitrogen allocation in an ectopic species, we determined whether expression of the QQS transgene would impact composition in the major crop plant, soybean (Glycine max). We chose soybean based on its evolutionary divergence from Arabidopsis and because of its importance as a direct or indirect source of dietary protein. Indeed, soy is the predominant source of protein for humans (Pathan and Sleper, 2008). In addition to its potential agronomic value, increasing the protein content in an already protein-rich crop could provide a stringent test of how QQS may affect compositional traits. Soybean (cultivar Williams 82) was transformed with the QQS coding sequence under the control of the constitutive CaMV 35S promoter (Figure3). Transgenic soybean lines that expressed the QQS transgene were identified by selection for herbicide resistance conferred by the introduced vector, followed by real-time PCR analysis (Table S1).

Figure 3.

Figure 3

Characterization of leaves of growth chamber-grown transgenic soybean plants expressing Qua-Quine Starch (QQS). (a) Transgenic QQS-E plants (engineered in the Williams 82 background) are not visually distinguishable from Williams 82 control plants. (b) Leaf starch is decreased in QQS-E plants compared with wild type (WT) controls. (c) Leaf protein is increased in QQS-E plants compared with WT controls. WT1 and WT2, Williams 82 controls from two growth chambers; QQS-E, transgenic Williams 82 lines expressing the QQS coding sequence, selected by herbicide resistance. QQS-E mutants of T1 generation derived from 17 lines of four independent transformation events (for starch) and from 33 lines of five independent transformation events (for protein) are designated by: independent transformation event #–line #. All data in bar charts show mean ± SE, n = 3. Student's t-test was used to compare QQS-E and WT, *P < 0.05; **P < 0.01.

T1-generation lines of soybean expressing the QQS gene survived from herbicide selection, from each of five independent transformation events (a total of 33 lines), together with Williams 82 controls, were evaluated after they were grown in growth chambers in a completely randomized design within each chamber. Visual examinations of the plants throughout development from seedling to senescence indicated that the morphology of the transgenic soybean plants expressing the QQS transcript was similar to the WT control plants (Figure3a). However, in leaves of the transgenic QQS-expressing lines (QQS-E), starch content was reduced to levels as low as 20% of the WT control plants (Figure3b). In contrast, leaf protein content was increased by up to 60% in the transgenic soybean plants (Figure3c). The effect of QQS on the starch and protein accumulation in lines from the same transformation event may vary as the expression level of the transgene in different lines of the same transformation event may be different (Shou et al., 2004). The plots of leaf starch and protein versus QQS transcript accumulation in Arabidopsis and in soybean indicate that for both species, when QQS RNA accumulation is elevated, protein accumulation is increased and starch accumulation is decreased. The relationship between QQS RNA level and starch and protein concentrations was not linear over the range of QQS RNA accumulation tested (Figure S5).

Soybean plants expressing QQS have increased seed protein

The increase in leaf protein associated with QQS gene expression in soybean led us to assess whether QQS had a broader impact, also affecting seed composition. As a preliminary indication of whether QQS expression alters seed protein content, T2 seeds from soybean plants that had been grown in growth chambers and survived from herbicide selection were screened by Nuclear Magnetic Resonance Spectrometry (NMR). These data (Figure S6) indicated a significant increase in seed protein in the QQS-expressing lines. To evaluate this finding in more detail, plants were propagated via single-seed descent. Progeny were grown in a completely randomized design in a greenhouse. The transgenic soybean plants expressing the QQS transcript survived from herbicide selection were indistinguishable in morphology and development from the WT control plants (Figure S7a). T3 seeds from the greenhouse-grown lines (seeds from the offspring of the same T1 plant were pooled and harvested together) were evaluated for composition by destructive chemical analysis. These seeds also showed a significant increase in seed protein in the QQS-expressing lines, compared with WT controls; the seed oil content of these lines was similar or slightly decreased (Figure S7b).

Based on the results of these analyses, we tested the effect of QQS expression in field-grown plants. Segregating seeds of independent transgenic lines were planted in a field in a randomized complete block design. Plants were monitored weekly during development. Plants were sprayed by herbicide and data on survival was monitored (Table S2). One line is possibly a homozygous line (16-6); in that line, all plants were herbicide resistant. WT nontransgenic siblings were identified by PCR analysis of DNA of leaf pieces from individual progeny of self-propagated T3 generation plants derived from three independent transformation events. Morphology and development of field-grown plants was indistinguishable among the populations of these WT-sibling lines and QQS-expressing lines (Figures4a and S8a).

Figure 4.

Figure 4

Characterization of seeds of field-grown transgenic soybean plants expressing Qua-Quine Starch (QQS). QQS-E plants derived from three independent transformation events (a total of four lines) and segregating wild type (WT)-sibling control plants were grown in a randomized block design in the field. Composition was determined in mature T4 seeds by chemical methods. Figure S8c shows near infrared spectroscopy analysis of different seed batches. (a) Transgenic QQS-E soybean plants are similar in morphology from WT-sibling plants throughout development and in visual phenotypes (Figure S8a). (b) Seed development and seed size are similar in QQS-E compared to WT-siblings. (c) Seed composition in QQS-E compared with that of the WT-sibling controls: seed protein content is increased, oil content is similar or decreased, and protein + oil content is increased. DAP, days after planting; DAF, days after flowering. WT, Williams 82 siblings identified by PCR from segregating populations of T3 plants (from transformation events 7 and 32); QQS-E, transgenic Williams 82 expressing the QQS coding sequence, selected by herbicide resistance. QQS-E mutants are designated by independent transformation event #–line #. All data in bar charts show mean ± SE, n = 3. Student's t-test was used to compare QQS-E and WT, *P < 0.05; **P < 0.01. Scale bar, 1 cm.

Seeds were harvested at maturity. Seed morphology, seed size and shape, seed weight per seed and per plant, and moisture content were similar among the populations of field-grown plants (Figures4b and S8b,c). However, seed composition was significantly affected, as determined by independent methods: near infrared spectroscopy (NIRS, for eight lines, Table S3) and destructive chemical analyses (for four lines) to mature pooled seeds from herbicide-resistant mutant plants transformed with QQS compared to mature pooled seeds from WT plants. Protein content was from 10% to 18% higher in seeds of QQS-expressing lines as compared to those of WT-sibling controls (Figures4c and S8c). The levels of several free amino acids were also increased (Table S4). The increase in the seed protein content did not affect the relative distribution of amino acids in hydrolysed proteins while the amino acid contents are significantly increased (Figure S8d and Table S5). Seed oil content in different lines ranged from a value similar to that of WT-sibling seeds to a 13% decrease (Figures4c and S8c). The level of C16:0 is slightly decreased in the mutants (Table S6). There was no detectable change in levels of free fatty acids. Carbohydrate and fibre contents were decreased (Figure S8c). The total protein and oil content in seeds of QQS-expressing soybean was increased by up to 6.5% as compared with that of the WT-siblings (Figures4c and S8c). Thus, individual lines of soybean expressing the QQS transgene appeared morphologically indistinguishable but showed increases of 18.0%, 10.3%, 13.1%, 12.9% seed protein and 6.5%, 6.3%, 5.5% and 4.3 % seed protein + oil, compared with their sibling WTs. Segregation study of individual transgenic plants and their segregated siblings identified by PCR analysis of DNA of leaf pieces, and these individual plants’ seed composition analysis by NIRS indicated that the high-protein trait was associated with QQS expression (Table S7).

Discussion

Orphan gene function in plant biology

Little is understood about the functional significance of the vast majority of orphan genes in any species. Indeed, to our knowledge, QQS is the only plant orphan gene that has been studied in any detail. In general, orphan genes have been considered to confer a species-specific function, for example, immunity to particular pathogens, self-recognition, or resistance/adaptation to an environmental stress (Gollery et al., 2007; Khalturin et al., 2009). Of the plant orphan genes that anything is known about (other than QQS), most have been identified in mutant screens for genes that alter resistance to abiotic or biotic stresses (Gollery et al., 2006, 2007; Luhua et al., 2013). In the most comprehensive analysis of plant orphan genes to date, Mittler and colleagues (Luhua et al., 2013) evaluated the responses to abiotic stresses of knockout mutants of 1007 Arabidopsis genes randomly selected from those annotated as of ‘unknown function’ (TAIR, 2005 gene model release). Of these genes annotated as of ‘unknown function’, 12 were also orphan genes. Knockout mutants of nine of these 12 orphan genes conferred an altered response to one or more abiotic stresses (Luhua et al., 2013).

Among the 839 clade-specific genes that are common to A. thaliana and A. lyrata are a small cluster of genes that, although divergent across the two species, are still recognizable by sequence; these genes have been shown to play a role in self-recognition (Takeuchi and Higashiyama, 2012). The genes encode cysteine-rich peptides (CRP810_1; AtLUREs) that are among the 300 defensin-like (DEFL) genes in Arabidopsis and function in self-recognition associated with pollen attraction (Takeuchi and Higashiyama, 2012). AtLURE1 (AT5G43285) from A. thaliana, when introduced into Torenia fournieri, enables A. thaliana pollen to be attracted to and penetrate T. fournieri ovules (Takeuchi and Higashiyama, 2012). Thus, with the possible exception of QQS, the few plant orphan or near-orphan (clade-specific) genes with functional information appear to play a role in recognition or defence-related processes (Gollery et al., 2007; Kim et al., 2009; Li et al., 2009).

QQS and carbon and nitrogen allocation

QQS plays a role, direct or indirect, in regulating carbon and nitrogen allocation to starch (Li et al., 2009) and protein, a process that would be expected to have considerable commonality among plant species. Although carbon and nitrogen allocation might be considered as very distinct from species-specific recognition, immune response, or defence, it actually is closely intertwined. Plants are ‘planted’ in one place, and therefore must respond with tremendous sensitivity to environmental cues. Global approaches to understand the processes of photosynthesis, nutrient supply and carbon and nitrogen allocation are revealing the intricate relationship among what some might consider distinct processes (Stitt et al., 2010; Sulpice et al., 2013; Thum et al., 2008).

Among the most pronounced of the extremely varied pattern of QQS expression across changes in genotypes, environments and developments (Li et al., 2009), QQS expression level is strongly changed in a variety of knockout mutants. For example, QQS expression is significantly higher (compared with WT controls) in mutants of genes as diverse as PEN3, a putative ATP binding cassette transporter that contributes to pathogen resistance (Stein et al., 2006); starch synthase 3 (SS3) (Li et al., 2009); WIN1, involved in regulating cuticular wax deposition (Kannangara et al., 2007); and the brassinosteroid-induced FER, which functions in pollen tube-ovule interaction (Guo et al., 2009). Plants expressing the NahG transgene, a bacterial gene that hydroxylates salicylic acid (SA) and reduces SA-mediated signalling (Takahashi et al., 2004), have increased QQS expression (ArrayExpress experiment ID ‘E-GEOD-5727’, data submitted by Buchanan-Wollaston). SA plays a role in plant defense against pathogens (Lin et al., 2013). Consistent with our direct demonstration that QQS impacts carbon and nitrogen allocation, QQS has been implicated in the ability of Arabidopsis to adjust to reduced carbon and energy environments, based on its altered expression in knockout lines of the EXORDIUM-LIKE1 (EXL1) gene (At1g35140) (Schroder et al., 2011). In addition, the inverse relationship between QQS expression and starch accumulation across multiple environments fits our model that plants adjust QQS expression in response to stresses. Overall, the extremely variable expression pattern of QQS, combined with its lack of co-expression with other genes and association with compositional changes under changing environments (Li et al., 2009), is consistent with the concept that the QQS-induced compositional changes in A. thaliana may aid in the metabolic adaptation of that species to its environment.

Ectopic function of QQS

Our demonstration that the introduction of the QQS gene into soybean results in a significant increase in protein accumulation and decrease in lipid accumulation in seeds indicate the potential of QQS as a molecular tool to increase the protein content of agronomic species. Comparison of composition in plants grown in growth chamber, greenhouse and field materials indicates that the general trend of high-protein content in QQS-E soybean holds for plants grown under these very different environments. Protein (and oil) content typically is extremely responsive to both genotype and environment (Arslanoglu et al., 2011; Jing et al., 2003; Singh et al., 1993). Fertilizers containing nitrogen were applied to growth chamber- and greenhouse-grown soybeans, while no fertilizer was applied to field-grown soybeans. Therefore, the differences in overall composition (e.g. seed protein and oil) between the growth chamber-, greenhouse- and field-grown material, as well as the difference in composition between the QQS-E and control lines could be due to either genotype or environment, or a genotype and environment effect.

QQS expression in soybean causes greater increases in protein in soybean than in Arabidopsis. One possible explanation for the larger effect of QQS on soybean is that over time Arabidopsis has evolved mechanisms for homeostatic balances for the QQS gene; however, these mechanisms are not present in soybean. An alternate explanation is that the signalling mechanisms in soybean respond with different sensitivity than those in Arabidopsis.

Interestingly, leaves of QQS-E soybean lines with a ratio as low as 1.9 QQS RNA/18S rRNA by real-time PCR display the high-protein, low-starch trait. However, QQS-E 7-1 did not have high leaf protein, despite having significant QQS expression. Possible explanations are that Event 7 had multiple transgene insertion sites, and these multiple transgenes may have been retained in line 7-1, but not in 7-7. It may be that the insertion site of one of these transgenes interfered with some metabolic process, or that the expression of QQS was not stable in QQS-E 7-1. Indeed, there does not appear statistically significant linear regression of leaf starch and protein contents with leaf QQS expression (as determined by transcript level) in either Arabidopsis (R2 = 0.83 for starch and 0.64 for protein) or soybean (R2 = 0.78 for starch and 0.49 for protein); these calculations are complicated by the fact that the relationship between the composition traits and QQS transcript level is not necessarily linear, but complex, and have not yet been fully defined. This is perhaps not surprising given the very low level of QQS protein that accumulates in Arabidopsis even in mutant lines that highly express QQS (Li et al., 2009). It is not unusual for regulatory proteins to have very low levels of expression (Nagaraj et al., 2011). Thus, it may be that only a very small concentration of QQS saturates the QQS receptor, and any increases over this concentration do not affect protein and starch contents. Other possible explanations for a lack of strong linear correlation between levels of QQS transcript and composition are that QQS translational efficiency or stability or the effectiveness of the QQS protein to biochemically express its function or post-translational modification is limiting; also, a variety of post-translational regulatory mechanisms can come into play, as have been described for other transgenes (Lillo et al., 2004; Vaucheret et al., 2001). These considerations present interesting questions about the mechanism by which QQS acts in soybean; our current working hypothesis is that soybean and Arabidopsis have a common protein with which QQS interacts, and that QQS-interactor becomes saturated at low levels of QQS expression.

The experiments described do not distinguish whether the high-protein trait in the seeds is a maternal effect or a seed effect. The 35S promoter, which we are using in these studies, drives GUS expression in both leaf and in seed (Anderson and Botella, 2007; Wu et al., 2010), which is consistent with either a seed or a maternal effect. Thus, it is possible that the QQS expression in the seed causes the high-protein trait (a seed effect). Alternately, a signalling molecule or a larger flux of organic carbon and nitrogen from the leaves might drive the high-protein trait in the soybean seeds- this would represent a maternal effect of QQS.

An increase in soybean protein of the magnitude reported in this study has societal relevance, as soybean provides a major source of global dietary vegetable protein (Pathan and Sleper, 2008; Wilson, 2008). Over 70 years of soybean breeding efforts have not been able to break the inverse relationship between seed protein content and oil content, or the inverse relationship between seed protein content and yield. However, the transgenic expression of QQS increases seed protein content in soybean grown under three diverse conditions (growth chamber, greenhouse and field) without detectably affecting plant or seed morphology or seed weight (these factors were determined under all three conditions). In seeds of field-grown plants, for which we made more detailed determinations, there were no significant differences in seed yield per plant, the relative composition of amino acids in the hydrolysed protein from seeds, the moisture content or the yield. We analysed free amino acids to determine whether the increase in protein we observed was associated with a very substantial increase in any free amino acid; this does not seem to be the case. Some free amino acid levels were altered in QQS-E soybean seeds. Glutamic acid and arginine are reported to help to avoid protein aggregating and precipitating (Golovanov et al., 2004); it is possible that these shifts in free glutamic acid and arginine might help to adapt to the increased protein content. Lysine and arginine are among the essential amino acid group according to their importance to nutrition and physiology values (Belitz et al., 2009). Thus increased free lysine and arginine in QQS-E seeds could potentially provide an increased value; however, the absolute level of free lysine and arginine is very low.

Taken together, our findings suggest that QQS could be introduced by breeding or transformation into an elite soybean variety with specific desirable agronomic traits to increase protein content. For example, QQS could be introduced into an Iowa soybean variety that is resistant to the soybean cyst nematode but relatively low in protein (http://www.cad.iastate.edu/gensoyrel.html), or in African soybean varieties highly resistant to rust, bacterial blight and leaf spot (http://www.iita.org/soybean-asset/-/asset_publisher/t3fI/content/better-soybean-varieties-offer-african-farmers-new-opportunities?redirect=%2Fsoybean#.U8ib1fldWSp).

Conclusions

Our data demonstrate that QQS expression alters plant composition. We show that expressing this gene in a plant species that has no QQS sequence homolog increases protein content of leaves and seeds, yet the morphology and development of the soybean expressing QQS cannot be distinguished from the WT sibling controls. Thus, the QQS-expressing mutant appears to preserve overall homeostasis while selectively effecting composition. QQS might be acting upstream of the process that controls carbon partitioning, or might be central to this process. These results also illustrate that orphan genes, although often poorly annotated and even ignored, may provide a valuable resource for new traits.

The evolutionary changes that resulted in the de novo origin (Silveira et al., 2013) of the QQS gene of A. thaliana must have been rapid and extensive as there is no gene homolog in even the closely related species, A. lyrata. Yet, soybean, a species that diverged from Arabidopsis approximately 100 million years ago (Hedges and Kumar, 2009), appears to contain a conserved receptor or mechanism that recognizes QQS and responds to its occurrence by conferring a compositional phenotype. This research reveals the fundamental capacity of a species-specific gene to act across species to impact the major metabolic function of carbon and nitrogen allocation.

Experimental procedures

Construction of QQS over-expression vector and Arabidopsis transformation and selection are provided in Appendix S1.

Soybean transformation, selection and nomenclature

Glycine max cultivar Williams 82 was transformed by Agrobacterium-mediated soybean transformation using half-seed explants (Paz et al., 2004). The transformation and selection of T1 plants were performed at the Plant Transformation Facility at Iowa State University (ISU) (http://www.agron.iastate.edu/ptf/index.aspx). Soybean plants from independent transformation events were selected based on herbicide resistance (segregated WTs were killed) and confirmed by real-time PCR analysis to identify transgenic lines that expressed the QQS transgene.

The progeny of each independent soybean transformation is referred to as an ‘event’ (a transformation event is considered independent if it is taken from an individual plate). Each plant germinated from one T1 seed is called a ‘line’ and the line designation continues throughout generations. So multiple lines stem from one independent transformation event; because each line is the result of sexual reproduction; these lines may not be genetically identical. A total of 33 lines from five independent transformation events were confirmed on the basis of BAR selection followed by PCR analysis for presence of the QQS gene.

Plant growth

This study used WT A. thaliana ecotype Columbia (Col-0), and transgenic lines derived from Col-0. Detailed information is provided in Appendix S1.

Detailed information about transgenic QQS-expressing (QQS-E) soybean grown in growth chambers (T1 generation) and in a greenhouse (T2 generation, selected on herbicide resistance and segregated WTs were killed), is provided in Appendix S1.

Transgenic QQS-E soybean (T3 generation) and WT (Williams 82) plants were planted at a randomized block design in the field at ISU Curtiss Farm in Ames, IA. One line (60 seeds) was planted in one row, with a total of three replicates in three rows. Each row was ten feet long and the rows were 2.5 feet apart. The criteria used for selecting the events-lines to study further in the field were: having sufficient seeds for NIRS analysis and planting in the field; having an increased leaf protein/decreased leaf starch trait; and the presence of the QQS transgene (as determined by PCR). The field conditions were harsh prior to germination, and there was considerable flooding; some seeds in flooded area were eaten by ground squirrels or other animals, and a number of lines were lost (including QQS-E 7-1). Eight mutant lines that survived germination were sprayed by herbicide, and numbers of herbicide-resistant and herbicide-sensitive progeny were counted. For yield trials, only seeds harvested from plants from the middle seven feet were used for yield estimate (grams of seed weight per plant). The seeds from different plants of the same line were pooled and used as different replicates for seed composition analysis by NIRS (all eight lines) and by chemical methods (four lines). Some plants were randomly marked for genotype screening by PCR to identify transgenic plants and their WT-siblings. Seeds from these plants were harvested per individual plant and were not pooled with seeds from other plants.

QQS determination by PCR is provided in Appendix S1.

Leaf composition analyses

For screening starch using I2/KI staining in Arabidopsis, shoots were harvested at the end of the light period, and processed as described before (Li et al., 2007). Detailed information is provided in Appendix S1.

Leaf starch and protein were determined at the end of the light period in Arabidopsis seedling shoots of 20 days after planting (DAP) grown in a growth chamber, and in soybean leaves that were newly, fully expanded, harvested from branches two to four from shoot apex from 58-DAP T1 plants (starch: four independent transformation events, protein: five). Three (for starch) or five (for protein) plants per replicate and three replicates from each independent T2 lines (Arabidopsis), and leaves from three positions that were used as three replicates in the T1 plants (soybean), were analysed. Detailed information about leaf starch and protein determination is provided in Appendix S1.

Soybean seed composition analyses

Information about NMR and NIRS screening is provided in Appendix S1.

Destructive chemical analyses were mostly conducted at Eurofins (Des Moines, IA). Methods were: protein content, AOCS Ba 4e-93 (American Oil Chemists’ Society, 1997); oil content, AOCS Ac 3-44 (American Oil Chemists’ Society, 1997); moisture content, AOCS Ac 2-41 (American Oil Chemists’ Society, 1997); hydrolysed fatty acid profiling, AOCS Ce 2-66 and AOCS Ce 1-62 (American Oil Chemists’ Society, 1991); free amino acid profiling, AOAC 999.13 modified (Fontaine et al., 2000). About 30 g of seeds (approximately 170 seeds from nine plants, for protein content), 10 g (for oil content), 10 g (for moisture content), 10 g (for hydrolysed fatty acid profiling) and 20 g (for free amino acid profiling) per replicate were tested, respectively, with three biological replicates for each sample.

Hydrolysed amino acid composition analysis was conducted at the Experiment Station Chemical Laboratories, University of Missouri (http://www.aescl.missouri.edu/), using method AOAC 982.30 E (a,b,c) Ch. 45.3.05 (Association of Official Analytical Chemists (AOAC), 2006). About 30 g of seeds per replicate were tested, with three biological replicates for each sample.

Statistical analyses

For each experiment, plants were collected and analysed in a randomized complete block design or completely randomized design. Plant composition tests were conducted with a minimum of three biological tests. For all composition analyses, plant samples were assigned randomized numbers and provided to the analysis facilities for determination in a randomized order with no designator of genotype.

Data are presented as mean ± SE. Two sets of independent samples were compared using Student's t-test (two-tailed) with assumption of equal variances (n = 3). P < 0.05 was considered significant (*); P < 0.01 was considered very significant (**).

Bioinformatics analyses is provided in Appendix S1.

Acknowledgments

We are grateful to Walter Fehr for advice on soybean breeding, Zebulun Arendsee and Ruoran Li for orphan gene analyses, and Basil Nikolau and Jianming Yu for helpful suggestions on the manuscript. We thank Kan Wang and Diane Luth from the ISU Plant Transformation Facility for generating transgenic lines of soybean; Jianling Peng for the QQS-OE construct; Taner Sen and Vladimir Uversky for information on prediction of QQS protein structure and disorder; Kent Berns for field management; Wenguang Zheng, Sheng Huang, Marah Hoel, Xiaoran Shang, Alan Kading, Le Song, Hiwot Abebe, Ana Boehm and Sean Wefel for assistance with soybean growth and harvest; Charles Hurburgh and Glen Rippke from ISU Grain Quality Laboratory, for NIRS analysis; Dan Duvick for seed free fatty acid determinations; Jian Li for statistical consultation; Waitent Sow, Kevin Wenceslao, Dallas Jones, and ISU W. M. Keck Plant Metabolomics Laboratory for contributions to composition analysis; Eurofins (Des Moines, IA) for analysis of seed composition; ISU Soil and Plant Analysis Laboratory for leaf combustion analysis; Experiment Station Chemical Laboratories at University of Missouri for amino acid profiling; Pennington Caroline from UEA Consulting Limited for PCR analyses of QQS transcript level; and Sabry Elias, Oregon State University Seed Lab for NMR Spectrometry screening. This material is based in part upon work supported by National Science Foundation EEC-0813570 (to E.S.W.) and MCB-0951170 (to E.S.W. and L.L.); United Soybean Board award 2287 (to L.L.); ISU Research Foundation (to L.L.); and Center for Metabolic Biology (to E.S.W.). Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Author contributions

L.L. and E.S.W. conceived the project. L.L. performed the experiments. L.L and E.S.W. co-supervised the project and contributed equally to data analysis and preparation of the paper.

Supporting information

Additional Supporting information may be found in the online version of this article:

Figure S1 QQS-OE construct used to transform Arabidopsis thaliana and soybean (Glycine max).

Figure S2 Starch quantification shows increased starch in QQS RNAi mutants and decreased starch in QQS-OE mutants compared to WT.

Figure S3 Mapping of Arabidopsis thaliana orphan genes and transposons on chromosomes.

Figure S4 Spatial expression of QQS and starch accumulation with 5% sucrose and in the cold.

Figure S5 The plots of leaf starch and protein versus QQS transcript in Arabidopsis and in soybean.

Figure S6 Composition screening of seeds from transgenic lines of growth chamber-grown transgenic soybean plants expressing QQS.

Figure S7 Phenotype and seed composition of transgenic lines of greenhouse-grown soybean plants expressing QQS.

Figure S8 Phenotype and seed composition of transgenic lines of field-grown soybean plants expressing QQS.

Table S1 QQS transcript accumulation (by real-time PCR) in leaves of QQS-expressing transgenic lines and WT lines.

Table S2 Chi-square analysis assuming a 3:1 (Resistant:Sensitive) ratio for T3 generation of progeny of QQS-E containing 35S:QQS and pNos:Bar.

Table S3 Seed composition of transgenic lines of field-grown soybean plants expressing QQS.

Table S4 Profiles of most abundant free amino acids from seeds of QQS-expressing transgenic lines, compared with WTs.

Table S5 Profiles of amino acids from hydrolysed proteins: almost all amino acids analysed are increased in seeds of QQS-expressing transgenic lines, compared with WTs.

Table S6 Hydrolysed fatty acid profiles in the seeds of QQS-expressing transgenic lines, compared with WTs.

Table S7 Seed composition of individual plants of transgenic lines and their segregated siblings of field-grown soybean plants expressing QQS.

Appendix S1 Supplementary experimental procedures.

pbi0013-0177-sd1.pdf (2.2MB, pdf)

References

  1. Adler RG. Genome research: fulfilling the public's expectations for knowledge and commercialization. Science. 1992;257:908–914. doi: 10.1126/science.1502557. [DOI] [PubMed] [Google Scholar]
  2. Aloni B, Karni L, Zaidman Z, Schaffer AA. The relationship between sucrose supply, sucrose-cleaving enzymes and flower abortion in pepper. Ann. Bot. 1997;79:601–605. [Google Scholar]
  3. American Oil Chemists’ Society (AOCS) Official and Tentative Method of the American Oil Chemists’ Society. 4th edn. Champaign, IL: AOCS Press; 1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. American Oil Chemists’ Society (AOCS) Official Methods and Recommended Practices of the American Oil Chemists’ Society. 5th edn. Champaign, IL: AOCS Press; 1997. [Google Scholar]
  5. Amiri H, Davids W, Andersson SG. Birth and death of orphan genes in Rickettsia. Mol. Biol. Evol. 2003;20:1575–1587. doi: 10.1093/molbev/msg175. [DOI] [PubMed] [Google Scholar]
  6. Anderson DJ, Botella JR. Expression analysis and subcellular localization of the Arabidopsis thaliana G-protein β-subunit AGB1. Plant Cell Rep. 2007;26:1469–1480. doi: 10.1007/s00299-007-0356-1. [DOI] [PubMed] [Google Scholar]
  7. Arendsee Z, Li L, Wurtele ES. Coming of age: the species-specific (orphan) genes of plants. Trends Plant Sci. 2014 doi: 10.1016/j.tplants.2014.07.003. In press. [DOI] [PubMed] [Google Scholar]
  8. Arslanoglu F, Aytac S, Oner EK. Effect of genotype and environment interaction on oil and protein content of soybean (Glycine max (L.) Merrill) seed. Afr. J. Biotechnol. 2011;10:18409–18417. [Google Scholar]
  9. Association of Official Analytical Chemists (AOAC) Official Methods of Analysis of the Association of Official Analytical Chemists. Gaithersburg, MD: AOAC International; 2006. [Google Scholar]
  10. Belitz HD, Grosch W, Schieberle P. Amino acids, peptides, protein. In: Belitz HD, Grosch W, Schieberle P, editors. Food Chemistry. 4th revised and extended edn. Berlin: Springer; 2009. pp. 8–34. [Google Scholar]
  11. Biehl A, Richly E, Noutsos C, Salamini F, Leister D. Analysis of 101 nuclear transcriptomes reveals 23 distinct regulons and their relationship to metabolism, chromosomal gene distribution and co-ordination of nuclear and plastid gene expression. Gene. 2005;344:33–41. doi: 10.1016/j.gene.2004.09.009. [DOI] [PubMed] [Google Scholar]
  12. Carvunis AR, Rolland T, Wapinski I, Calderwood MA, Yildirim MA, Simonis N, Charloteaux B, Hidalgo CA, Barbette J, Santhanam B, Brar GA, Weissman JS, Regev A, Thierry-Mieg N, Cusick ME, Vidal M. Proto-genes and de novo gene birth. Nature. 2012;487:370–374. doi: 10.1038/nature11184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Donoghue MT, Keshavaiah C, Swamidatta SH, Spillane C. Evolutionary origins of Brassicaceae specific genes in Arabidopsis thaliana. BMC Evol. Biol. 2011;11:47. doi: 10.1186/1471-2148-11-47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Eastmond PJ. SUGAR-DEPENDENT1 encodes a patatin domain triacylglycerol lipase that initiates storage oil breakdown in germinating Arabidopsis seeds. Plant Cell. 2006;18:665–675. doi: 10.1105/tpc.105.040543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Eastmond PJ, Dennis DT, Rawsthorne S. Evidence that a malate/inorganic phosphate exchange translocator imports carbon across the leucoplast envelope for fatty acid synthesis in developing castor seed endosperm. Plant Physiol. 1997;114:851–856. doi: 10.1104/pp.114.3.851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Espinoza C, Degenkolbe T, Caldana C, Zuther E, Leisse A, Willmitzer L, Hincha DK, Hannah MA. Interaction with diurnal and circadian regulation results in dynamic metabolic and transcriptional changes during cold acclimation in Arabidopsis. PLoS ONE. 2010;5:e14101. doi: 10.1371/journal.pone.0014101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Feng Y, Hurst J, Almeida-De-Macedo M, Chen X, Li L, Ransom N, Wurtele ES. Massive human co-expression network and its medical applications. Chem. Biodivers. 2012;9:868–887. doi: 10.1002/cbdv.201100355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fischer D, Eisenberg D. Finding families for genomic ORFans. Bioinformatics. 1999;15:759–762. doi: 10.1093/bioinformatics/15.9.759. [DOI] [PubMed] [Google Scholar]
  19. Fontaine J, Eudaimon M, Fontaine J, Eudaimon M. Liquid chromatographic determination of lysine, methionine, and threonine in pure amino acids (feed grade) and premixes: collaborative study. J. AOAC Int. 2000;83:771–783. [PubMed] [Google Scholar]
  20. Godfray HC, Beddington JR, Crute IR, Haddad L, Lawrence D, Muir JF, Pretty J, Robinson S, Thomas SM, Toulmin C. Food security: the challenge of feeding 9 billion people. Science. 2010;327:812–818. doi: 10.1126/science.1185383. [DOI] [PubMed] [Google Scholar]
  21. Gollery M, Harper J, Cushman J, Mittler T, Girke T, Zhu JK, Bailey-Serres J, Mittler R. What makes species unique? The contribution of proteins with obscure features. Genome Biol. 2006;7:R57. doi: 10.1186/gb-2006-7-7-r57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gollery M, Harper J, Cushman J, Mittler T, Mittler R. POFs: what we don't know can hurt us. Trends Plant Sci. 2007;12:492–496. doi: 10.1016/j.tplants.2007.08.018. [DOI] [PubMed] [Google Scholar]
  23. Golovanov AP, Hautbergue GM, Wilson SA, Lian LY. A simple method for improving protein solubility and long-term stability. J. Am. Chem. Soc. 2004;126:8933–8939. doi: 10.1021/ja049297h. [DOI] [PubMed] [Google Scholar]
  24. Gomes SP, Nyengaard JR, Misawa R, Girotti PA, Castelucci P, Blazquez FH, de Melo MP, Ribeiro AA. Atrophy and neuron loss: effects of a protein-deficient diet on sympathetic neurons. J. Neurosci. Res. 2009;87:3568–3575. doi: 10.1002/jnr.22167. [DOI] [PubMed] [Google Scholar]
  25. Guo H, Li L, Ye H, Yu X, Algreen A, Yin Y. Three related receptor-like kinases are required for optimal cell elongation in Arabidopsis thaliana. Proc. Natl Acad. Sci. USA. 2009;106:7648–7653. doi: 10.1073/pnas.0812346106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hedges SB, Kumar S. The Timetree of Life. New York, NY: Oxford University Press; 2009. [Google Scholar]
  27. Heitschmidt RK, Short RE, Grings EE. Ecosystems, sustainability, and animal agriculture. J. Anim. Sci. 1996;74:1395–1405. doi: 10.2527/1996.7461395x. [DOI] [PubMed] [Google Scholar]
  28. Higashi Y, Hirai MY, Fujiwara T, Naito S, Noji M, Saito K. Proteomic and transcriptomic analysis of Arabidopsis seeds: molecular evidence for successive processing of seed proteins and its implication in the stress response to sulfur nutrition. Plant J. 2006;48:557–571. doi: 10.1111/j.1365-313X.2006.02900.x. [DOI] [PubMed] [Google Scholar]
  29. Jing Q, Jiang D, Dai T, Cao W. Effects of genotype and environment on wheat grain quality and protein components. Chin. J. Appl. Ecol. 2003;14:1649–1653. [PubMed] [Google Scholar]
  30. Kannangara R, Branigan C, Liu Y, Penfield T, Rao V, Mouille G, Hofte H, Pauly M, Riechmann JL, Broun P. The transcription factor WIN1/SHN1 regulates Cutin biosynthesis in Arabidopsis thaliana. Plant Cell. 2007;19:1278–1294. doi: 10.1105/tpc.106.047076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Khalturin K, Hemmrich G, Fraune S, Augustin R, Bosch TC. More than just orphans: are taxonomically-restricted genes important in evolution? Trends Genet. 2009;25:404–413. doi: 10.1016/j.tig.2009.07.006. [DOI] [PubMed] [Google Scholar]
  32. Kim MJ, Shin R, Schachtman DP. A nuclear factor regulates abscisic acid responses in Arabidopsis. Plant Physiol. 2009;151:1433–1445. doi: 10.1104/pp.109.144766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Knowles DG, McLysaght A. Recent de novo origin of human protein-coding genes. Genome Res. 2009;19:1752–1759. doi: 10.1101/gr.095026.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Li L, Ilarslan H, James MG, Myers AM, Wurtele ES. Genome wide co-expression among the starch debranching enzyme genes AtISA1, AtISA2, and AtISA3 in Arabidopsis thaliana. J. Exp. Bot. 2007;58:3323–3342. doi: 10.1093/jxb/erm180. [DOI] [PubMed] [Google Scholar]
  35. Li L, Foster CM, Gan Q, Nettleton D, James MG, Myers AM, Wurtele ES. Identification of the novel protein QQS as a component of the starch metabolic network in Arabidopsis leaves. Plant J. 2009;58:485–498. doi: 10.1111/j.1365-313X.2009.03793.x. [DOI] [PubMed] [Google Scholar]
  36. Li Z, Gao J, Benning C, Sharkey TD. Characterization of photosynthesis in Arabidopsis ER-to-plastid lipid trafficking mutants. Photosynth. Res. 2012;112:49–61. doi: 10.1007/s11120-012-9734-9. [DOI] [PubMed] [Google Scholar]
  37. Lillo C, Meyer C, Lea US, Provan F, Oltedal S. Mechanism and importance of post-translational regulation of nitrate reductase. J. Exp. Bot. 2004;55:1275–1282. doi: 10.1093/jxb/erh132. [DOI] [PubMed] [Google Scholar]
  38. Lin J, Mazarei M, Zhao N, Zhu JJ, Zhuang X, Liu W, Pantalone VR, Arelli PR, Stewart CN, Jr, Chen F. Overexpression of a soybean salicyclic acid methlyltransferase confers resistance to soybean cyst nematode. Plant Biotechnol. J. 2013;11:1135–1145. doi: 10.1111/pbi.12108. [DOI] [PubMed] [Google Scholar]
  39. Lister R, O'Malley RC, Tonti-Filippini J, Gregory BD, Berry CC, Millar AH, Ecker JR. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 2008;133:523–536. doi: 10.1016/j.cell.2008.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Luhua S, Hegie A, Suzuki N, Shulaev E, Luo X, Cenariu D, Ma V, Kao S, Lim J, Gunay MB, Oosumi T, Lee SC, Harper J, Cushman J, Gollery M, Girke T, Bailey-Serres J, Stevenson RA, Zhu JK, Mittler R. Linking genes of unknown function with abiotic stress responses by high-throughput phenotype screening. Physiol. Plant. 2013;148:322–333. doi: 10.1111/ppl.12013. [DOI] [PubMed] [Google Scholar]
  41. Marsden RL, Lee D, Maibaum M, Yeats C, Orengo CA. Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res. 2006;34:1066–1080. doi: 10.1093/nar/gkj494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Mentzen WI, Wurtele ES. Regulon organization of Arabidopsis. BMC Plant Biol. 2008;8:99. doi: 10.1186/1471-2229-8-99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Mentzen WI, Peng J, Ransom N, Nikolau BJ, Wurtele ES. Articulation of three core metabolic processes in Arabidopsis: fatty acid biosynthesis, leucine catabolism and starch metabolism. BMC Plant Biol. 2008;8:76. doi: 10.1186/1471-2229-8-76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Mishra AK, Agarwal S, Jain CK, Rani V. High GC content: critical parameter for predicting stress regulated miRNAs in Arabidopsis thaliana. Bioinformation. 2009;4:151–154. doi: 10.6026/97320630004151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Muller O, Krawinkel M. Malnutrition and health in developing countries. CMAJ. 2005;173:279–286. doi: 10.1503/cmaj.050342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008;320:1344–1349. doi: 10.1126/science.1158441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Nagaraj N, Wisniewski JR, Geiger T, Cox J, Kircher M, Kelso J, Paabo S, Mann M. Deep proteome and transcriptome mapping of a human cancer cell line. Mol. Syst. Biol. 2011;7:548. doi: 10.1038/msb.2011.81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Neme R, Tautz D. Phylogenetic patterns of emergence of new genes support a model of frequent de novo evolution. BMC Genomics. 2013;14:117. doi: 10.1186/1471-2164-14-117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Ohno S. Early genes that were oligomeric repeats generated a number of divergent domains on their own. Proc. Natl Acad. Sci. USA. 1987;84:6486–6490. doi: 10.1073/pnas.84.18.6486. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Pathan MS, Sleper DA. Advances in soybean breeding. In: Stacey G, editor. Genetics and Genomics of Soybean. New York: Springer; 2008. pp. 113–134. [Google Scholar]
  51. Paz MM, Shou H, Guo Z, Zhang Z, Banerjee AK, Wang K. Assessment of conditions affecting Agrobacterium-mediated soybean transformation using the cotyledonary node explant. Euphytica. 2004;136:167–179. [Google Scholar]
  52. Pimentel D, Pimentel M. Sustainability of meat-based and plant-based diets and the environment. Am. J. Clin. Nutr. 2003;78:660S–663S. doi: 10.1093/ajcn/78.3.660S. [DOI] [PubMed] [Google Scholar]
  53. Schiltz S, Gallardo K, Huart M, Negroni L, Sommerer N, Burstin J. Proteome reference maps of vegetative tissues in pea. An investigation of nitrogen mobilization from leaves during seed filling. Plant Physiol. 2004;135:2241–2260. doi: 10.1104/pp.104.041947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Schroder F, Lisso J, Mussig C. EXORDIUM-LIKE1 promotes growth during low carbon availability in Arabidopsis. Plant Physiol. 2011;156:1620–1630. doi: 10.1104/pp.111.177204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Seo PJ, Kim MJ, Ryu JY, Jeong EY, Park CM. Two splice variants of the IDD14 transcription factor competitively form nonfunctional heterodimers which may regulate starch metabolism. Nat. Commun. 2011;2:303. doi: 10.1038/ncomms1303. [DOI] [PubMed] [Google Scholar]
  56. Shou H, Frame BR, Whitham SA, Wang K. Assessment of transgenic maize events produced by particle bombardment or Agrobacterium-mediated transformation. Mol. Breed. 2004;13:201–208. [Google Scholar]
  57. Silveira AB, Trontin C, Cortijo S, Barau J, Del Bem LE, Loudet O, Colot V, Vincentz M. Extensive natural epigenetic variation at a de novo originated gene. PLoS Genet. 2013;9:e1003437. doi: 10.1371/journal.pgen.1003437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Singh KB, Bejiga G, Malhotra RS. Genotype–environment interactions for protein content in chickpea. J. Sci. Food Agric. 1993;63:87–90. [Google Scholar]
  59. Stein M, Dittgen J, Sanchez-Rodriguez C, Hou BH, Molina A, Schulze-Lefert P, Lipka V, Somerville S. Arabidopsis PEN3/PDR8, an ATP binding cassette transporter, contributes to nonhost resistance to inappropriate pathogens that enter by direct penetration. Plant Cell. 2006;18:731–746. doi: 10.1105/tpc.105.038372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Stitt M, Lunn J, Usadel B. Arabidopsis and primary photosynthetic metabolism—more than the icing on the cake. Plant J. 2010;61:1067–1091. doi: 10.1111/j.1365-313X.2010.04142.x. [DOI] [PubMed] [Google Scholar]
  61. Sulpice R, Flis A, Ivakov AA, Apelt F, Krohn N, Encke B, Abel C, Feil R, Lunn JE, Stitt M. Arabidopsis coordinates the diurnal regulation of carbon allocation and growth across a wide range of photoperiods. Mol. Plant. 2013;7:137–155. doi: 10.1093/mp/sst127. [DOI] [PubMed] [Google Scholar]
  62. Takahashi H, Kanayama Y, Zheng MS, Kusano T, Hase S, Ikegami M, Shah J. Antagonistic interactions between the sa and ja signaling pathways in Arabidopsis modulate expression of defense genes and gene-for-gene resistance to cucumber mosaic virus. Plant Cell Physiol. 2004;45:803–809. doi: 10.1093/pcp/pch085. [DOI] [PubMed] [Google Scholar]
  63. Takeuchi H, Higashiyama T. A species-specific cluster of defensin-like genes encodes diffusible pollen tube attractants in Arabidopsis. PLoS Biol. 2012;10:e1001449. doi: 10.1371/journal.pbio.1001449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Tautz D, Domazet-Loso T. The evolutionary origin of orphan genes. Nat. Rev. Genet. 2011;12:692–702. doi: 10.1038/nrg3053. [DOI] [PubMed] [Google Scholar]
  65. Thum KE, Shin MJ, Gutierrez RA, Mukherjee I, Katari MS, Nero D, Shasha D, Coruzzi GM. An integrated genetic, genomic and systems approach defines gene networks regulated by the interaction of light and carbon signaling pathways in Arabidopsis. BMC Syst. Biol. 2008;2:31. doi: 10.1186/1752-0509-2-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Vaucheret H, Béclin C, Fagard M. Post-transcriptional gene silencing in plants. J. Cell Sci. 2001;114:3083–3091. doi: 10.1242/jcs.114.17.3083. [DOI] [PubMed] [Google Scholar]
  67. Victora CG, de Onis M, Hallal PC, Blossner M, Shrimpton R. Worldwide timing of growth faltering: revisiting implications for interventions. Pediatrics. 2010;125:e473–e480. doi: 10.1542/peds.2009-1519. [DOI] [PubMed] [Google Scholar]
  68. Wilson RF. Soybean: market driven research needs. In: Stacey G, editor. Genetics and Genomics of Soybean. New York: Springer; 2008. pp. 3–16. [Google Scholar]
  69. Wilson GA, Feil EJ, Lilley AK, Field D. Large-scale comparative genomic ranking of taxonomically restricted genes (TRGs) in bacterial and archaeal genomes. PLoS ONE. 2007;2:e324. doi: 10.1371/journal.pone.0000324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Wissler L, Gadau J, Simola DF, Helmkampf M, Bornberg-Bauer E. Mechanisms and dynamics of orphan gene emergence in insect genomes. Genome Biol. Evol. 2013;5:439–455. doi: 10.1093/gbe/evt009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Wu L, EL-Mezawy A, Duong M, Shah S. Two seed coat-specific promoters are functionally conserved between Arabidopsis thaliana and Brassica napus. In Vitro Cell Dev. Biol. Plant. 2010;46:338–347. [Google Scholar]
  72. Wu DD, Irwin DM, Zhang YP. De novo origin of human protein-coding genes. PLoS Genet. 2011;7:e1002379. doi: 10.1371/journal.pgen.1002379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Young VR, Pellett PL. Plant proteins in relation to human protein and amino acid nutrition. Am. J. Clin. Nutr. 1994;59:1203S–1212S. doi: 10.1093/ajcn/59.5.1203S. [DOI] [PubMed] [Google Scholar]
  74. Zhang X, Myers AM, James MG. Mutations affecting starch synthase III in Arabidopsis alter leaf starch structure and increase the rate of starch synthesis. Plant Physiol. 2005;138:663–674. doi: 10.1104/pp.105.060319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Zhang X, Szydlowski N, Delvalle D, D'Hulst C, James MG, Myers AM. Overlapping functions of the starch synthases SSII and SSIII in amylopectin biosynthesis in Arabidopsis. BMC Plant Biol. 2008;8:96. doi: 10.1186/1471-2229-8-96. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1 QQS-OE construct used to transform Arabidopsis thaliana and soybean (Glycine max).

Figure S2 Starch quantification shows increased starch in QQS RNAi mutants and decreased starch in QQS-OE mutants compared to WT.

Figure S3 Mapping of Arabidopsis thaliana orphan genes and transposons on chromosomes.

Figure S4 Spatial expression of QQS and starch accumulation with 5% sucrose and in the cold.

Figure S5 The plots of leaf starch and protein versus QQS transcript in Arabidopsis and in soybean.

Figure S6 Composition screening of seeds from transgenic lines of growth chamber-grown transgenic soybean plants expressing QQS.

Figure S7 Phenotype and seed composition of transgenic lines of greenhouse-grown soybean plants expressing QQS.

Figure S8 Phenotype and seed composition of transgenic lines of field-grown soybean plants expressing QQS.

Table S1 QQS transcript accumulation (by real-time PCR) in leaves of QQS-expressing transgenic lines and WT lines.

Table S2 Chi-square analysis assuming a 3:1 (Resistant:Sensitive) ratio for T3 generation of progeny of QQS-E containing 35S:QQS and pNos:Bar.

Table S3 Seed composition of transgenic lines of field-grown soybean plants expressing QQS.

Table S4 Profiles of most abundant free amino acids from seeds of QQS-expressing transgenic lines, compared with WTs.

Table S5 Profiles of amino acids from hydrolysed proteins: almost all amino acids analysed are increased in seeds of QQS-expressing transgenic lines, compared with WTs.

Table S6 Hydrolysed fatty acid profiles in the seeds of QQS-expressing transgenic lines, compared with WTs.

Table S7 Seed composition of individual plants of transgenic lines and their segregated siblings of field-grown soybean plants expressing QQS.

Appendix S1 Supplementary experimental procedures.

pbi0013-0177-sd1.pdf (2.2MB, pdf)

Articles from Plant Biotechnology Journal are provided here courtesy of Wiley

RESOURCES