Skip to main content
Plants logoLink to Plants
. 2021 Jul 16;10(7):1465. doi: 10.3390/plants10071465

Identification and Expression Analysis of the Genes Involved in the Raffinose Family Oligosaccharides Pathway of Phaseolus vulgaris and Glycine max

Ramon de Koning 1, Raphaël Kiekens 1, Mary Esther Muyoka Toili 1,2, Geert Angenon 1,*
Editor: Vagner A Benedito
PMCID: PMC8309293  PMID: 34371668

Abstract

Raffinose family oligosaccharides (RFO) play an important role in plants but are also considered to be antinutritional factors. A profound understanding of the galactinol and RFO biosynthetic gene families and the expression patterns of the individual genes is a prerequisite for the sustainable reduction of the RFO content in the seeds, without compromising normal plant development and functioning. In this paper, an overview of the annotation and genetic structure of all galactinol- and RFO biosynthesis genes is given for soybean and common bean. In common bean, three galactinol synthase genes, two raffinose synthase genes and one stachyose synthase gene were identified for the first time. To discover the expression patterns of these genes in different tissues, two expression atlases have been created through re-analysis of publicly available RNA-seq data. De novo expression analysis through an RNA-seq study during seed development of three varieties of common bean gave more insight into the expression patterns of these genes during the seed development. The results of the expression analysis suggest that different classes of galactinol- and RFO synthase genes have tissue-specific expression patterns in soybean and common bean. With the obtained knowledge, important galactinol- and RFO synthase genes that specifically play a key role in the accumulation of RFOs in the seeds are identified. These candidate genes may play a pivotal role in reducing the RFO content in the seeds of important legumes which could improve the nutritional quality of these beans and would solve the discomforts associated with their consumption.

Keywords: common bean, Fabaceae, legumes, qRT-PCR, raffinose family oligosaccharides (RFOs), RNA-seq, soybean

1. Introduction

The common bean (Phaseolus vulgaris) and soybean (Glycine max) are of outstanding importance for human and animal nutrition. They have a high nutritional quality, containing large amounts of proteins, carbohydrates, vitamins, minerals and dietary fibers. Beans are used as a major source of calories and proteins in developing countries, and they are the main legumes in vegetarian diets [1,2,3,4]. Furthermore, diets consisting mainly of cereals can be complemented with legumes to increase the essential nutrients acquired, making them ideal components to include in our diets [5].

A major drawback of the consumption of beans in large quantities is the presence of several anti-nutritional factors. Beans contain high amounts of raffinose family oligosaccharides (RFOs) [1,6,7,8,9]. These sugars contain one or more α-1,6 galactosyl bonds which make them indigestible for humans and monogastric animals because they lack α-galactosidase, the enzyme required to break down these sugars. Instead, these sugars undergo fermentation by gut microbiota in the large intestine resulting in the production of gases such as methane and carbon dioxide [1,10]. The production of gases does not pose any problems for humans or animals when small quantities of beans are consumed. On the contrary, the breakdown of small amounts of RFOs might stimulate the growth of beneficial bacteria in the large intestine providing RFOs a potential small probiotic effect [11,12]. However, the consumption of large amounts of beans results in the production of gases which leads to unwanted flatulence and digestive disturbances [13]. The presence of RFOs in the feed also results in shorter transit times through the digestive tract, reducing the absorption of nutrients and causing poor weight gain [14,15]. Lowering the amount of RFOs in the seeds could improve the nutritional quality of these beans and would solve the discomforts associated with their consumption [14]. Since RFOs have many functions in the plant, a thorough understanding of the galactinol and RFO biosynthetic gene families and the expression patterns of the individual genes is needed to be able to reduce the RFO content in the seeds, without compromising normal plant development and functioning.

RFOs play an important role in the transport and storage of carbon within the plant [16]. Sucrose produced as a result of photosynthesis in mesophyll cells (source), can move via the bundle sheath cells located at the minor veins, into specialized companion cells which are also known as intermediary cells. Here RFO biosynthetic enzymes are present which convert sucrose into RFOs. The RFOs are then loaded into the phloem according to a polymer trapping model [17]. This model specifies that, due to the large size of the RFOs, the diffusion back into the mesophyll cells is prevented. The osmotic pressure facilitates the migration towards the sieve elements where the RFOs are then transported to other parts of the plant (sinks) where they can be broken down by alkaline α-galactosidase [16,17,18,19,20]. The production of RFOs in the intermediary cells allows plants to maintain a high sugar concentration in the phloem [19]. It is also suggested that RFOs play a role in the abiotic stress responses of plants. This is mainly based on the observations that the RFO content increases in plants as a response to various stresses such as drought, heat, cold and salt stress [21,22,23,24,25,26]. Several different mechanisms in which RFOs provide protection against abiotic stress have been proposed. During abiotic stress, RFOs could stabilize membranes and proteins with hydrophilic interactions, act as reactive oxygen species scavengers or serve as osmolytes to reduce water loss [25,27,28,29,30]. In leguminous crops, RFOs are mainly stored in the seeds under normal growth conditions. They are produced de novo during the maturation of the seed and protect the seed against seed desiccation and provide seed longevity for storage [31,32,33,34,35,36]. During germination, they are rapidly broken down by acidic α-galactosidases providing energy and carbon to the young seedling [37,38]. However, it needs to be noted that they are not essential for germination in soybean [39].

Four different members of the RFO family are found in plants: raffinose, stachyose, verbascose and ajugose, which are respectively, tri-, tetra-, penta- and hexasaccharides [37]. Soybean and common bean primarily produce, raffinose and stachyose, and to a lesser extent, verbascose in their seeds [10,14]. The production of RFOs is mainly dependent on the influx of galactinol which functions as a galactosyl donor for the RFO synthesis. Galactinol is produced by galactinol synthase (GolS) which catalyzes the galactosyl transfer from uridine diphosphate galactose to myo-inositol forming galactinol [37]. Raffinose synthase (RS) catalyzes the formation of raffinose, by transferring a galactosyl moiety from galactinol to sucrose. This reversible reaction produces the first RFO in the RFO biosynthesis pathway [40]. The larger RFOs, stachyose and verbascose, can both be formed by stachyose synthase (SS) which has, contrary to RS, a broader substrate specificity [41]. For the formation of stachyose and subsequently verbascose, SS uses respectively, raffinose or stachyose as a galactosyl acceptor. As a galactosyl donor, either galactinol, stachyose or galactosyl cyclitols can be used. Furthermore, SS also has the ability to synthesize these galactosyl cyclitols by facilitating the galactosyl transfer from galactinol to a cyclitol [37]. Galactosyl cyclitols contain, like RFOs, an α-1,6 bond. It is important to point out that galactosyl cyclitols potentially have a similar function as RFOs during seed development due to their high similarity in structure and contribute to the digestive problems humans and monogastric animals face after consumption of beans [37,42]. An alternative route to produce higher RFOs (stachyose, verbascose and ajugose), independent of galactinol, has only been reported in two species of the Lamiaceae family (Ajuga reptans and Coleus blumei) where the enzyme galactan:galactan galactosyltransferase (GGT) catalyzes the transfer of a galactosyl moiety from one RFO to another [43,44]. The GGT enzyme has only been found in the vacuoles of leaves, which makes the synthesis of higher RFOs in seeds unlikely to be dependent on this pathway [41].

Previous attempts to lower the amount of RFOs in the seeds have shown to be successful in G. max. Valentine et al. (2017) reduced the raffinose and stachyose content in seeds from respectively, 0.63% and 3.79% in the wild type to 0.11% and 1.21% in a transgenic line using a silencing construct targeting a RS isoform [14]. In a recent study by Le et al. (2020) two GolS genes were knocked out, reducing the total RFO content by 35.2% [45]. However, more progress can be made to lower the RFO content in the seeds more efficiently without compromising normal plant development and functioning. An in-depth study of all genes involved can elucidate which strategies can be best perused to achieve this. In contrast to G. max, the RFO biosynthesis pathway is still understudied in P. vulgaris. In this paper, an overview of the annotation and genetic structure of all galactinol- and RFO biosynthesis genes is given for soybean and common bean. In P. vulgaris, three GolS genes, two RS genes and one SS gene were identified for the first time. To discover the expression patterns of these genes in different tissues, two expression atlases have been created through a re-analysis of publicly available RNA-seq studies. Furthermore, little is known about the expression patterns of these galactinol- and RFO biosynthesis genes during seed development in common bean. De novo expression analysis through an RNA-seq study during seed development in three varieties of common bean gave more insight into the expression patterns of these genes and seed specific genes involved in the RFO production have been identified. The results of the expression analysis indicate tissue-specific expression patterns of different classes of galactinol- and RFO synthase genes in soybean and common bean. With the obtained knowledge, suitable candidate genes to alter the expression levels of the galactinol- and RFO synthase genes are proposed to lower the amount of RFOs in the seeds.

2. Results

2.1. In Silico Identification of the Galactinol- and RFO Biosynthetic Enzymes in Phaseolus vulgaris and Glycine max

Using a customized bioinformatics pipeline, three GolS genes were identified in P. vulgaris, (Phvul.001G215300, Phvul.001G223700 and Phvul.007G203400) and six GolS genes in G. max (Glyma.03G229800, Glyma.19G227800, Glyma.20G094500, Glyma.03G222000, Glyma.19G219100 and Glyma.10G145300). Some genes encode multiple predicted isoforms. In P. vulgaris, only for Phvul.001G215300 two isoforms are predicted. In G. max, two isoforms for Glyma.19G227800 and three isoforms for Glyma.10G145300 are predicted. A phylogenetic tree has been made using the amino acid sequences of the predicted proteins of the GolS enzymes of P. vulgaris, G. max and well-characterized GolS enzymes of Arabidopsis thaliana and Cicer arietinum (Figure 1a). Furthermore, the gene structures of the GolS enzymes are presented in Figure 1b. Based on the clustering of the P. vulgaris and G. max enzymes in the phylogenetic tree, the GolS enzymes of G. max and P. vulgaris were categorized into three classes: galactinol synthase 1 (GolS1), galactinol synthase 2 (GolS2) and galactinol synthase 3 (GolS3). The predicted primary transcript isoforms of the GolS1 genes consist of three exons and two introns. The GolS2 and GolS3 genes consist of four exons and three introns.

Figure 1.

Figure 1

Analysis of the phylogenetic relationships and gene structures of galactinol synthase (GolS). (a) Evolutionary relationship of GolS enzymes of G. max, P. vulgaris and other well-characterized taxa (A. thaliana and C. arietinum). The GolS enzymes of P. vulgaris and G. max are subdivided into three different classes based on their clustering (GolS1, GolS2 and GolS3. The phylogenetic tree was made in MEGA X (v10.2.4) using the neighbor-joining algorithm combined with a bootstrap test of 1000 replicates (next to the branches, the percentage of replicate trees in which the associated taxa clustered together is shown) [46,47,48]. The Poisson correction method was used to compute the evolutionary distances with the number of amino acid substitutions per site as a unit [49]. The phylogenetic tree was drawn to scale using the calculated evolutionary distances as branch lengths. This analysis involved 23 amino acid sequences. For each sequence pair, all ambiguous positions were removed using the pairwise deletion option and the final dataset consisted of 355 positions. The multiple sequence alignment was created using the MUSCLE algorithm; (b) structures of genes encoding the GolS enzymes were visualized using GSDS software [50].

Nine possible RFO biosynthesis genes have been predicted in P. vulgaris and fourteen in G. max using our bioinformatics pipeline. Most proteins encoded by these genes are annotated as potential RFO biosynthesis enzymes but could also be wrongly annotated. To distinguish RS from alkaline α-galactosidase two conserved motifs (FMxLGTEAxxLG and SGDPxGTxWLQGCHMVHC) were used (Figure A1 in Appendix A). These motifs are present in the amino acid sequence of RS but absent in alkaline α-galactosidase [51]. RS can be distinguished from SS because of an amino acid insert present only in SS (Figure A2 in Appendix A) [40,52]. A phylogenetic tree of the amino acid sequences has been made to evaluate the evolutionary relationship of the different RFO biosynthetic and hydrolytic enzymes of P. vulgaris, G. max and other well-characterized taxa: A. thaliana, Pisum sativum, Vigna angularis, Solanum lycopersicum, Oryza sativa and Zea mays (Figure 2a). The phylogenetic tree clusters 4 different groups of enzymes with high certainty (bootstrap score > 88): RS, SS, alkaline α-galactosidase and α-galactosidase. In addition, gene features of these enzymes were visualized (Figure 2b). The RS genes in G. max and P. vulgaris all consist of 5 exons and 4 introns. The genes encoding SS consist of 4 exons and 3 introns. This is in contrast with the alkaline α-galactosidase genes which mostly consist of 13 exons although quite a large variation can be seen in this group. Some of the alkaline α-galactosidase genes consist of 1, 7, 12 or 14 predicted exons. From these results, it can be concluded that P. vulgaris most likely contains at least two RS genes (Phvul.009G175400 and Phvul.004G007100) and only one SS gene (Phvul.001G214300). G. max has three RS genes (Glyma.05G003900, Glyma.06G179200 and Glyma.05G040300) and one SS gene (Glyma.19G217700). Based on the phylogenetic tree, the RS enzymes of G. max and P. vulgaris have consequently been categorized into two classes: raffinose synthase 1 (RS1) and raffinose synthase 2 (RS2). An overview of the genes involved in the RFO biosynthesis pathway in G. max and P. vulgaris can be found in Table 1. The corresponding names of the genes found in Table 1 will be used in the rest of the text.

Figure 2.

Figure 2

Analysis of the phylogenetic relationships and gene structures of the raffinose family oligosaccharides’ biosynthetic and catabolic enzymes. (a) Evolutionary relationship of the raffinose synthase (RS), stachyose synthase (SS) and (alkaline) α-galactosidase (AGA) enzymes of G. max, P. vulgaris and other well-characterized enzymes of different taxa (A. thaliana, P. sativum, V. angularis, S. lycopersicum, O. sativa and Z. mays). The RS enzymes of P. vulgaris and G. max are subdivided into two different classes (RS1 and RS2) based on their clustering. The phylogenetic tree was made in MEGA X (v10.2.4) using the neighbor-joining algorithm combined with a bootstrap test of 1000 replicates (next to the branches are the percentage of replicate trees in which the associated taxa clustered together is shown) [46,47,48]. The Poisson correction method was used to compute the evolutionary distances with the number of amino acid substitutions per site as a unit [49]. The phylogenetic tree was drawn to scale using the calculated evolutionary distances as branch lengths. This analysis involved 43 amino acid sequences. For each sequence pair, all ambiguous positions were removed using the pairwise deletion option and the final dataset consisted of 1030 positions. The multiple sequence alignment was created using the MUSCLE algorithm. (b) Structures of genes encoding the RFO synthase enzymes were visualized using GSDS software [50].

Table 1.

Overview of the enzymes involved in the RFO biosynthesis pathway and corresponding genes.

Species Gene Name Accession Number * Function Chromo-Some Start
(bp)
End
(bp)
Protein Length
(AA)
#
Exons
#
Transcripts
P. vulgaris PvGolS1 Phvul.001G215300 Galactinol synthase Chr1 47,166,933 47,168,674 339 3 2
G. max GmGolS1_A Glyma.03G222000 Chr3 42,494,623 42,497,111 339 3 1
G. max GmGolS1_B Glyma.19G219100 Chr19 47,148,225 47,150,373 335 3 1
P. vulgaris PvGolS2 Phvul.001G223700 Chr1 47,870,097 47,872,698 327 4 1
G. max GmGolS2_A Glyma.03G229800 Chr3 43,172,457 43,175,687 331 4 1
G. max GmGolS2_B Glyma.19G227800 Chr19 47,911,130 47,914,214 330 4 2
P. vulgaris PvGolS3 Phvul.007G203400 Chr7 32,610,928 32,612,577 326 4 1
G. max GmGolS3_A Glyma.10G145300 Chr10 38,014,453 38,016,396 328 4 3
G. max GmGolS3_B Glyma.20G094500 Chr20 33,759,417 33,761,555 324 4 1
P. vulgaris PvRS1 Phvul.004G007100 Raffinose synthase Chr4 519,197 523,594 763 5 1
G. max GmRS1 Glyma.05G003900 Chr5 307,461 312,091 758 5 1
P. vulgaris PvRS2 Phvul.009G175400 Chr9 26,053,801 26,057,657 777 5 1
G. max GmRS2_A Glyma.06G179200 Chr6 15,217,419 15,223,877 810 5 2
G. max GmRS2_B Glyma.05G040300 Chr5 3,593,378 3,598,821 782 5 1
P. vulgaris PvSS Phvul.001G214300 Stachyose synthase Chr1 47,049,258 47,052,441 857 4 2
G. max GmSS Glyma.19G217700 Chr19 47,033,812 47,037,286 860 4 1

* Accession numbers were adopted from Phytozome v12.1 (P. vulgaris v2.1; G. max Wm82.a2.v1).

2.2. Transcriptomic Analysis of the RFO Biosynthetic Pathway

2.2.1. Expression Analysis of Galactinol- and RFO Biosynthesis Genes in Glycine max and Phaseolus vulgaris by RNA-seq Re-Analysis

An expression atlas has been made from the RNA-seq data of studies SRP038111 (G. max cv. Wm82) and SRP030614 (P. vulgaris cv. BAT93) [53,54]. The normalized expression data, represented as transcripts per million (TPM), of the different GolS, RS and SS genes is shown in Figure 3 for G. max cv. Wm82 and Figure 4 for P. vulgaris cv. BAT93 during five different growth stages. The different classes of GolS (GolS1, GolS2 and GolS3) and RS (RS1 and RS2), as defined by the phylogenetic tree analysis, show tissue-specific expression patterns for both G. max cv. Wm82 and P. vulgaris cv. BAT93.

Figure 3.

Figure 3

Heatmap of the normalized expression values (TPM) of the galactinol synthase (GmGolS1-3), raffinose synthase (GmRS1-2) and stachyose synthase (GmSS) genes in various plant tissues during different developmental stages in G. max cv. Wm82 (SRP038111). The rows represent plant tissues in five different developmental stages: emergence stage, early vegetative stage, late vegetative stage, flowering stage and seed developmental stage. More detailed information of the exact growth stage of the tissues can be found after the tissues names between brackets, with growth stage abbreviations (VE: emergence stage; V1: first-node stage; V7: seventh-node stage; EM: early maturation stage of the seed; MM: mid maturation stage of the seed; DAF: days after flowering) adapted from Fehr et al. [55]. The columns represent the different genes and their expression levels, represented with a color gradient in which white indicates no expression (0 TPM) and red the highest expression (200+ TPM). Hierarchical clustering was performed between the columns to obtain gene clusters.

Figure 4.

Figure 4

Heatmap of the normalized expression values, presented in transcripts per million (TPM), of the galactinol synthase (PvGolS1-3), raffinose synthase (PvRS1-2) and stachyose synthase (PvSS) genes in various plant tissues during different developmental stages in P. vulgaris cv. BAT93 (SRP030614). The rows represent plant tissues in five different developmental stages: emergence stage, early vegetative stage, late vegetative stage, flowering stage and seed developmental stage. More detailed information of the exact growth stage of the tissues can be found after the tissues name between brackets, indicated in days after sowing (DAS) and with growth stage abbreviations (V1: emergence stage; V2: primary leaves stage; V3: first trifoliate leaf stage; V4: third trifoliate leaf stage; R5: preflowering stage; R6: flowering stage; R9: maturity stage; MM: mid maturation stage of the seed) adapted from Vlasova et al. [53]. The columns represent the different genes and their expression levels, represented with a color gradient in which white indicated no expression (0 TPM) and red the highest expression (200+ TPM). Hierarchical clustering was performed between the columns to obtain gene clusters.

The normalized expression data of RNA-seq study SRP038111 in G. max cv. Wm82 shows that the GolS1 genes (GmGolS1_A and GmGolS1_B) are expressed highly in the hypocotyl during the emergence stage (resp. 59.4 and 128.2 TPM) and early vegetative stage (resp. 38.0 and 113.7 TPM). During the late vegetative stage, both genes are expressed in different types of tissues; GmGolS1_B, however, is mainly expressed in the stem node (122.4 TPM), flower bud (80.5 TPM), leaf bud (51.6 TPM) and trifoliate leaf (39.4 TPM) during the late vegetative stage. Furthermore, GmGolS1_B shows a higher expression in all stages of flower development. These two genes are highly expressed during the mid-maturation stage of the seed, with GmGolS1_A being expressed two times higher than GmGolS1_B in the seeds with expression levels of 721.7 TPM and 328.2 TPM, respectively. At the mid-maturation stage of the seed, both genes are expressed more in comparison with any other galactinol or RFO synthase gene. When observing the GolS2 genes (GmGolS2_A and GmGolS2_B) it can be seen that GmGolS2_A is highly expressed in the flower during the flowering stage, with expression levels of 176.2 TPM during the opening and 183.4 TPM five days after the opening of the flower. During the other developmental stages, no significant expression is observed. GmGolS2_B is mainly expressed in the seeds during the mid-maturation stage of the seeds (280.9 TPM) and less pronounced in the flower during the opening (30.0 TPM) and five days after the opening (25.1 TPM). The GolS3 genes (GmGolS3_A and GmGolS3_B) are both expressed in the roots (resp. 190.8 and 41.0 TPM) and hypocotyl (resp. 73.9 and 125.7 TPM) during the emergence stage. GmGolS3_B is mainly expressed in the hypocotyl (109.6 TPM) and to a lower extent in the primary leaf (30.8 TPM) during the early vegetative stage. In the late vegetative stage, GmGolS3_B is mainly expressed in the flower bud (103.6 TPM), stem node (54.3 TPM) and trifoliate leaf (53.5 TPM). During the flowering stage, both genes are expressed in the flower bud (resp. 20.3 and 49.4 TPM) and in the flower during the opening (resp. 26.7 and 87.0 TPM), 5 days after the opening (resp. 37.8 and 64.1 TPM) and during the senescence (resp. 29.2 and 80.3 TPM). During the mid-maturation stage of the seed, GmGolS3_A is primarily expressed (95.2 TPM). The soybean RS1 gene, GmRS1, is highly expressed in the primary leaf (355.9 TPM) during the early vegetative stage and in the trifoliate leaf (46.6 TPM) in the late vegetative stage. In the early vegetative stage, it is also expressed in the leaf bud (42.6 TPM) and hypocotyl (39.7 TPM). During the seed developmental stage, it is almost not expressed in the seed (13.6 TPM). This is in contrast with the RS2 genes (GmRS2_A and GmRS2_B) that are only expressed in the seed during the mid-maturation stage of the seed (resp. 123.0 and 109.4 TPM). In soybean, the SS gene, GmSS, is expressed in the hypocotyl during the emergence stage and early vegetative stage (resp. 34.8 and 20.6 TPM). During the late vegetative stage, it is expressed in the trifoliate leaf (23.4 TPM) and flower bud (21.5 TPM). In the seed developmental stage, it is expressed in the seed during the mid-maturation stage of the seed (214.0 TPM). The hierarchical clustering of the genes in the heatmap in Figure 3 also shows that the different GolS, RS and SS genes within one class have tissue-specific expression patterns and generally cluster close together. However, the GS2 genes do not cluster together mainly because of the expression difference in the seeds.

In P. vulgaris cv. BAT93, the normalized expression data of the RNA-seq study SRP030614 (Figure 4) shows that PvGolS1 is expressed mainly in the primary leaf (334.7 TPM), epicotyl (214.6 TPM), hypocotyl (208.1 TPM) and to a lesser extent in the primary root (48.0 TPM) and cotyledons (47.5 TPM) during the emergence stage. During the early vegetative stage, it is mainly expressed in the hypocotyl (105.8 TPM) and to a lesser extent in the primary leaf (22.3 TPM), first trifoliate leaf (19.7 TPM) and the neck of the root (14.2 TPM). During the late vegetative stage, it is mainly expressed in the stem (107.3 TPM). During this stage, expression can also be seen in the hypocotyl (27.4 TPM) and trifoliate leaf (13.4 TPM) and the axial meristem (10.6 TPM). In the flowering stage, it is expressed in the stem node (56.7 TPM), the root (37.1 TPM), the trifoliate leaf (28.3 TPM) and axial meristem (8.9 TPM). During the seed developmental stage, it is the only gene of this pathway that is expressed in the seed during the mid-maturation stage (196.8 TPM). In addition, PvGolS1 shows the most TPM variation throughout the tissues in comparison with the other genes analyzed. Furthermore, in the primary root, a contrasting expression is seen, with low PvGolS1 expression compared to higher expression for all other genes except PvRS2. The common bean GolS2 gene, PvGOLS2, is almost exclusively expressed in the primary root (438.26 TPM) during the emergence stage. The GolS3 gene, PvGolS3, is also primarily expressed in the primary root (512.2 TPM) and to a lesser extent in the hypocotyl (140.0 TPM), the cotyledons (118.0 TPM), epicotyl (74.4 TPM) and primary leaf (66.0 TPM) during the emergence stage. In the early vegetative stage, it is expressed in the neck of the root (11.9 TPM) and during the late vegetative stage, expression is observed in the axial meristem (24.6 TPM) and hypocotyl (21.9 TPM). During the flowering stage, PvGolS3 is also mainly expressed in the roots (40.6 TPM) and to a lesser extent in the stem node (12.3 TPM). In common bean, the RS1 gene, PvRS1, is primarily expressed in the primary root (298.2 TPM) during the emergence stage and to a lesser extent in the primary leaf (47.0 TPM), the cotyledons (42.8 TPM), the epicotyl (37.2 TPM) and hypocotyl (35.9 TPM). During the early vegetative stage, it is expressed in the primary leaf (10.5 TPM) and during the late vegetative stage in the trifoliate leaf (5.3 TPM). PvRS2 is only expressed during the emergence stage, where it shows expression in the primary leaf (224.2 TPM) and to a lesser extent in the epicotyl (36.9 TPM), cotyledons (36.9 TPM) and hypocotyl (32.2 TPM). Finally, the common bean SS gene, PvSS, is primarily expressed in the primary root (153.3 TPM) and hypocotyl (147.8 TPM) and to a lesser extent in the primary leaf (58.5 TPM), cotyledons (42.1 TPM) and epicotyl (36.5 TPM) during the emergence stage.

2.2.2. De Novo Expression Analysis of Galactinol- and RFO Biosynthesis Genes during Seed Development in P. vulgaris

Only a limited amount of information is available on the seed development in common bean. The RNA-seq study SRP030614 only contains one datapoint of the expression in the seeds (79 DAS, R9). To get a better understanding of the expression of the galactinol- and RFO biosynthesis genes during the seed development, an expression atlas was made by RNA-seq analysis for the common bean cultivars Canadian wonder, Pinto and Rosecoco. The expression levels were measured during early (15 DAF), mid (20 DAF) and late (30 and 35 DAF) maturation stages of seed development using four biological repeats. In all cultivars, PvGolS1 is expressed at a much higher level in comparison with the other galactinol- and RFO biosynthesis genes for all developmental stages (Figure 5). During the early stage of the seed development (15 DAF), PvGolS1 is expressed in all three cultivars with expression levels of 8.5 TPM (SE = 1.6) in Canadian wonder, 132.6 TPM (SE = 54.5) in Pinto and 11.1 TPM (SE = 2.7) in Rosecoco. From here, the expression level increases over time, with expression levels during the mid-maturation stage of the seed (20 DAF) of 58.3 TPM (SE = 30.1) in Canadian wonder, 445.6 TPM (SE = 77.9) in Pinto and 128.1 TPM (SE = 31.7) in Rosecoco. The highest expression of PvGolS1 was measured in the late stage of seed development (30 and 35 DAF). Here the expression level increased up to 4684.2 TPM (SE = 212.0) in Canadian wonder and 3351.0 TPM (SE = 1107.2) in Rosecoco at 35 DAF. In Pinto, the highest expression was measured at 30 DAF, with an expression level of 1305.9 TPM (SE = 190.4). In all cultivars, expression of the other galactinol- and RFO biosynthesis genes was low to undetectable in the early and mid-maturation stages. Only a slight expression of PvSS, and to a minor extent PvRS2 and PvGolS2, was measured during the mid-maturation stage of seed development in Pinto with expression levels of respectively, 16.8 TPM (SE = 2.1), 4.6 TPM (SE = 1.2) and 3.2 TPM (SE = 0.7). The expression of PvGolS2 increased over time during the late stage of the seed development with the highest expression level measured at 35 DAF in Canadian wonder (154.6 TPM; SE = 11.3), Pinto (188.7 TPM; SE = 9.4) and Rosecoco (156.9 TPM; SE = 21.3). PvRS2 was also mainly expressed during the late stage of the seed development with the highest expression level measured at 35 DAF in Canadian wonder (89.4 TPM; SE = 7.0) and Rosecoco (61.7 TPM; SE = 13.7). In Pinto, the highest expression was measured at 30 DAF (29.9 TPM; SE = 8.0). PvGolS3 and PvRS1 were not expressed in the seeds.

Figure 5.

Figure 5

RNA-seq expression analysis of the galactinol- and RFO biosynthesis genes during seed development in P. vulgaris cvs. Canadian wonder, Pinto and Rosecoco. The normalized expression values of GolS1 (PvGolS1), GolS2 (PvGolS2), GolS3 (PvGolS3), RS1 (PvRS1), RS2 (PvRS2) and SS (PvSS) in the seed are shown during different developmental stages: 15, 20, 30 and 35 days after flowering (DAF). The expression values are log2 transformed and expressed in transcripts per million (TPM). Error bars represent the standard error.

2.2.3. Validation of RNA-Sequencing Results

The RNA-seq results of the galactinol- and RFO biosynthesis genes during the seed development were validated by measuring the expression values of the six galactinol and RFO synthase genes in the different developmental stages of the seeds of P. vulgaris cv. Rosecoco using qRT-PCR. When the differential gene expression levels of these genes were compared for the different developmental stages of the seed, a similar expression pattern was observed in both the RNA-seq and qRT-PCR data (Figure 6). Because most of these genes exhibit very low expression levels during the early and mid-maturation stages of seed development (15 DAF vs. 20 DAF), it is more difficult to compare the differential gene expression patterns for these stages. However, the overall correlation between the RNA-seq and qPCR results was high, indicated by a Spearman’s rank correlation coefficient (rs) of 0.77. Especially when comparing the differential gene expressions during the mid and late maturity stages of the seeds, a high correlation could be seen (rs = 0.93). In addition, the qRT-PCR results confirmed the absence of the expression of PvGolS3 and PvRS1 in the seeds during the seed development.

Figure 6.

Figure 6

Validation of the RNA-seq data using qRT-PCR. The gene expression ratios of GolS1 (PvGolS1), GolS2 (PvGolS2), RS2 (PvRS2) and SS (PvSS) are represented as a result of the relative comparison of the different developmental stages of the seeds of P. vulgaris cv. Rosecoco: 15 vs. 20 days after flowering (DAF), 20 vs. 30 DAF and 30 vs. 35 DAF. These ratios are expressed as log2fold changes (Log2 FC). The housekeeping gene β-tubulin was used for the normalization of the qRT-PCR data [56]. Error bars represent the standard error.

3. Discussion

A lot of progress has already been made for G. max in the identification of the genes involved in the RFO biosynthesis pathway. Our study confirms the results of the earlier identified genes involved in the RFO pathway. Le et al. (2020) identified the same six GolS genes and showed that GmGolS1_A and GmGolS2_B both contribute to the production of RFOs through two CRISPR/Cas9-mediated knockout lines [45]. Dierking and Bilyeu (2008) identified the same three RS genes and Valentine et al. (2017) proved that GmRS2 contributes to the production of raffinose with the use of a soybean line in which GmRS2 was silenced [7,14]. Qiu et al. (2015) identified and characterized the same SS gene as in our study [57]. This validates the performance of our bioinformatics pipeline and no new additional galactinol and RFO biosynthesis genes were found in G. max. In contrast, the galactinol- and RFO synthase genes have not yet been identified in P. vulgaris until now. We identified three GolS genes in P. vulgaris, named PvGolS1, PvGolS2 and PvGolS3. PvGolS1 consists of 3 exons, while both PvGolS2 and PvGolS3 contain 4 exons. This is in accordance with the genetic structures of other well-characterized GolS genes [23,51,57,58,59,60]. Previous research showed that AtSIP2 in A. thaliana was incorrectly annotated as an RS gene and encodes an alkaline α-galactosidase with substrate specificity for raffinose [61]. We found that in P. vulgaris, many genes annotated as potential RS genes in the databases were incorrectly annotated and had similar gene sequences and structures as those of alkaline α-galactosidases. We identified two RS genes, PvRS1 and PvRS2, and one SS gene, PvSS, in P. vulgaris. Their genetic structure corresponds with well-characterized RS and SS genes of species other than P. vulgaris [40,51,52,57,62]. The presence of PvSS and PvGolS1 in P. vulgaris was also reported by Moghaddam et al. (2018) in a Genome-Wide Association Study (GWAS), where they showed their involvement in the RFO biosynthesis pathway [9].

G. max and P. vulgaris are two closely related species within the Fabaceae family. Around 11 million years ago, G. max underwent a whole-genome duplication (WGD) and consistently we found, for almost all galactinol- and RFO synthase genes in P. vulgaris, a duplicate in G. max [63,64]. However, the RS1 and SS genes appear to have only one copy in G. max as in P. vulgaris. After the WGD, chromosomes were subject to rearrangements and deletions resulting in gene loss [65]. It is possible that duplicates of the RS1 and SS genes were lost from the G. max genome during these rearrangements. We subdivided the different galactinol and RFO biosynthesis genes into different classes based on the sequence and structure similarity of the genes between P. vulgaris and G. max. The chromosomal location of the genes in the different classes corresponds with syntenic relationship between these two species (Figure S1 of the Supplementary Materials) [63,64]. Indeed, duplication and speciation events can lead to the formation of genes with novel functionalities or different expression patterns. However, the high homology between the newly identified genes indicates a high probability that these genes still have the same function which is supported by the research done in G. max [14,45,57]. When the expression patterns are compared in G. max cv. Wm82 it is interesting to see that in general the different classes of galactinol- and RFO biosynthesis genes cluster together in the expression heatmap generated from the re-analysis data of the RNA-seq study. This indicates that the galactinol- and RFO biosynthesis genes within one class have a conserved expression pattern. This is further supported when the expression patterns of the different classes were compared between G. max and P. vulgaris. However, not all plant tissues are equally represented in the RNA-seq re-analysis studies of P. vulgaris cv. BAT93 and G. max cv. Wm82, which needs to be taken into account. Our novel RNA-seq study measured the expression levels of the galactinol- and RFO synthase genes in the seeds of P. vulgaris cv. Canadian wonder, cv. Rosecoco and cv. Pinto during the early, mid and late maturation stage of the seed. A similar expression pattern can be seen for all the galactinol- and RFO biosynthesis genes in these three cultivars and hence the expression patterns were only validated in one variety of P. vulgaris, namely, cv. Rosecoco.

In multiple plant species such as Coffea arabica and Zea mays, tissue-specific expression patterns for GolS genes have been observed [58,66,67]. In C. arabica, for example, higher expression levels of CaGolS1 were measured in the leaves in comparison with CaGolS2 and CaGolS3. Little to no expression was measured for CaGolS2 in all tissues and CaGolS3 was mainly expressed in the roots and flowers [66]. In this paper, we show this is also the case for G. max and P. vulgaris. In G. max, GmGolS1_A and GmGolS1_B are both highly expressed during the mid-maturation stage of the seeds, with GmGolS1_A being expressed two times higher than GmGolS1_B. At this stage, both genes are expressed more in soybean in comparison with the other galactinol- and RFO synthase genes indicating that they have a possible significant role during seed development. Indeed, their involvement in the RFO biosynthesis pathway during seed development in soybean is confirmed by Le et al. (2020) [45]. Knocking out GmGolS1_A in G. max resulted in a significant reduction of the total RFO content in the seed, whereas the double knockout line of both GmGolS1_A and GmGolS1_B further reduced the total RFO content but to a lesser extent. This indicates that especially GmGolS1_A is important during seed development. The function of the GolS1 genes is not limited to seed development. Both GolS1 genes in G. max show expression in other organs, including the roots, hypocotyl, stem node, leaves and flower during the different developmental stages, however, GmGolS1_B shows a higher expression level in these tissues in comparison with GmGolS1_A. In P. vulgaris, PvGolS1 also shows a relatively high expression level during seed development, especially in the mid and late stages of seed maturation. This was observed in our novel RNA-seq study in P. vulgaris cvs. Rosecoco, Pinto and Canadian wonder and was validated with qRT-PCR in P. vulgaris cv. Rosecoco. The transcript levels of PvGolS1 were higher than other galactinol- and RFO synthase genes. This result is uniform between the RNA-seq re-analysis data of P. vulgaris cv. BAT93 and our RNA-seq data. Besides the expression in the seeds, PvGolS1 also shows expression in all vegetative tissues of P. vulgaris during the different developmental stages. The expression of PvGolS1 is especially high in the hypocotyl, epicotyl, stem, leaves and roots which corresponds with the expression pattern of the GolS1 genes in G. max. This indicates that the general expression pattern of the GolS1 class genes is conserved between G. max and P. vulgaris. However, in G. max, GmGolS1_A is mostly active during the seed development, whereas GmGolS1_B is mostly active in the other vegetative tissues. The GolS2 genes in G. max are the only genes that do not cluster together in the expression heatmap of G. max cv. Wm82 and show a distinct expression pattern compared to each other. GmGolS2_A is mainly expressed in the flower, whereas GmGolS2_B shows its highest expression in the seeds during the seed mid-maturation stage. Both genes are to a lesser extent expressed in the hypocotyls. In P. vulgaris, PvGolS2 shows a high expression in the primary root during the emergence stage and to a lower extent in the hypocotyl, epicotyl and stem. During seed development, PvGolS2 is expressed mainly in the late stage of seed development in all three P. vulgaris cultivars. These results suggest that PvGolS2 of P. vulgaris and GmGolS2_B of G. max play a role during the seed development, whereas GmGolS2_A seems to have a specific role in the flower in G. max. It is interesting to note that the GolS3 genes in P. vulgaris and G. max do not play a major role during the seed development in comparison with the other GolS genes. In G. max, only a low expression can be seen in the seeds whereas in P. vulgaris no expression of PvGolS3 was detected in the RNA-seq re-analysis data of cv. BAT93 and the novel RNA-seq analysis data of cvs. Rosecoco, Pinto and Canadian wonder. The GolS3 genes are primarily expressed in the roots, cotyledon and hypocotyl in both G. max and P. vulgaris. In G. max, GmGolS3_A and GmGolS3_B are also expressed in the flower and leaves. The GolS3 genes show, in comparison with the other galactinol and RFO synthase genes, the highest expression in the roots, indicating a possible function for this class in the roots.

Tissue-specific expression patterns can also be seen for the RS genes. The RS1 gene in G. max and P. vulgaris is mainly expressed in the leaves and hypocotyl. In P. vulgaris, PvRS1 is additionally expressed in the primary root during the emergence stage. Expression of the RS1 gene cannot be seen in either species during seed development. In contrast, the RS2 genes are mainly active in the seeds during the mid and late maturation stage in P. vulgaris and G. max. In soybean and chickpea seeds, raffinose mainly accumulates in the late stage of seed development which corresponds with the expression pattern seen for the RS2 genes in common bean and soybean [34,68,69]. Furthermore, research done by Valentine et al. (2017) showed that by silencing the GmRS2_A gene in G. max, the raffinose and stachyose content in the seeds significantly decreased [14]. In P. vulgaris, expression of PvRS2 can also be seen during the emergence stage of the plant, mainly in the primary leaves. However, no expression of PvRS2 can be seen in the later developmental stages. This suggests that the RS1 genes in P. vulgaris and G. max are mainly active in the vegetative tissue, whereas the RS2 genes are primarily active in the seeds. These results are partly in contrast with the qRT-PCR results of Dierking and Bilyeu (2008) that showed comparable expression levels for GmRS1 and GmRS2_A genes in the seeds and leaves of G. max [7]. These authors further showed that GmRS1 was not associated with the seed raffinose and stachyose content, which corresponds with our observations. They suggested that that GmRS2_B is probably not associated with the seeds’ raffinose and stachyose content because GmRS1 and GmRS2_B are located on the same chromosome. However, our observations suggest that GmRS2_B is also responsible for the production of raffinose in the seeds. The SS single-copy gene in G. max and P. vulgaris mainly shows expression in the primary and trifoliate leaves, the hypocotyl and cotyledon after emergence. It also shows expression in the seeds, primarily during the seeds’ mid and late maturation stage. In P. vulgaris, PvSS is also expressed in the primary root during the emergence stage. The elevated expression level of SS in the seeds during the mid and late maturation stage of the seed development in G. max and P. vulgaris corresponds with the observations made in the seeds of soybean and chickpea where the stachyose content also increased during this period [34,68,69]. The combined results of the gene expression atlases demonstrate that the different classes of galactinol- and RFO synthase genes show tissue-specific expression patterns in soybean and common bean grown under standard conditions (Figure 7). The different classes of galactinol- and RFO synthase genes most likely also show a specific expression pattern during abiotic stress conditions [23,26,31,32]. For example, in C. arabica, the GolS gene CaGolS1 showed an increased expression level during drought, heat and salt stress. The GolS gene CaGolS1, which was almost not expressed during standard growth conditions, had elevated transcript levels during drought and salt stress and CaGolS3 was primarily expressed during drought stress [66]. Further research in P. vulgaris and G. max could give a better understanding of the role of the different galactinol- and RFO synthase classes during abiotic stress.

Figure 7.

Figure 7

Overview of the RFO metabolic pathway and corresponding genes in P. vulgaris and G. max. The galactinol- and RFO synthase genes for P. vulgaris and G. max are shown to the right of the metabolic pathway. The main expression of these genes, either in the seed or the vegetative tissue, is indicated behind the gene names and the expression level of these genes in the seed are indicated in color, ranging from red (high), orange (average) to blue (low or no expression).

Lowering the amount of RFOs in the seeds could improve the nutritional quality of these beans and would solve the discomforts associated with their consumption [14]. However, RFOs also play an important role in the seed, protecting it against desiccation, providing longevity during storage and as an energy source during germination [31,32,33,34,35,36,37,38]. In A. thaliana, it was shown that seed vigor is not correlated to the absolute amount of a specific RFO molecule but rather to the total amount of RFOs and the ratio of RFOs to sucrose [51]. Contrastingly, when a wild-type soybean was compared with a low RFO variety, no significant difference was observed in terms of germination rate [39]. When choosing target genes this should be taken into account and further research should determine the optimal balance between seed health and the benefits for the consumer. Furthermore, RFOs are also needed for sugar transport through the phloem and protect the plant against abiotic stress. Gene targets that would not compromise normal plant development and functioning should be selected. One possible strategy is the targeting of seed-specific galactinol- and RFO synthase genes responsible for the production of RFOs in the seeds. With the current insights of the expression patterns, suitable candidate genes to alter the expression levels of the galactinol- and RFO synthase genes are proposed to lower the amount of RFOs in the seeds. In soybean, the GolS gene, GmGolS1_A, and the RS genes, GmRS2_A and GmRS2_B, form interesting genes to target because of their expression patterns, indicating that their main activity is during seed development (Figure 7). In common bean, PvRS2 is the only gene with a seed-specific expression pattern, making it an interesting candidate to knock out. The GolS1 gene, PvGolS1, shows a very high expression level in the seeds of the common bean, indicating its importance in the accumulation of galactinol in the seeds. However, this gene is also expressed in many other plant tissues. Targeting this gene could potentially lead to unwanted phenotypic side effects. However, this does not necessarily have to be the case as was observed in a soybean double mutant where both the GolS1 genes were knocked out, with no adverse effects on the plants’ growth [45]. Besides the formation of stachyose, SS also facilitates the formation of galactinol cyclitols which additionally contribute to the digestive problems associated with RFOs [37,42]. Stachyose is also the main RFO compound present in common bean and soybean seeds. This makes the SS gene also an interesting candidate gene [1,69]. Targeting a combination of these proposed GolS, RS and SS genes will most likely lead to the best result to lower the quantities of RFOs in the seeds. This should increase the nutritional value and decrease the flatulence and stomach discomforts associated with the consumption of common bean and soybean.

4. Materials and Methods

4.1. Plant Material

Seeds from P. vulgaris cvs. Pinto, Rosecoco and Canadian wonder, obtained from the Kenya Agricultural and Livestock Research Organization (Nairobi, Kenya) were grown in a greenhouse (Brussels, Belgium) under a 16/8 h light/dark regime. Seeds were harvested at 15, 20, 30 and 35 days after flowering using the flash freezing method with liquid nitrogen and stored at −80 °C.

4.2. In Silico Identification of the Galactinol and RFO Biosynthetic Genes in Phaseolus vulgaris and Glycine max

The amino acid sequences of well-characterized galactinol- and RFO synthase enzymes of Arabidopsis thaliana (AT2G47180.1, AT1G56600.1, AT1G09350.1, AT1G60470.1, AT5G23790.1, AT4G26250.1, AT1G60450.1, AT5G30500.1, NP_198855.1, NP_192106.3), Cicer arietinum (AMP59727.1, AMP59729.1), Pisum sativum (CAD20127.2), Oryza sativa (XP_015621501.1), Zea mays (NP_001354805.1) and Vigna angularis (CAB64363.1) were initially used as queries against the protein and gene databases of the National Center for Biotechnology Information (NCBI) and Phytozome databases to search for the orthologs in P. vulgaris and G. max using BLASTP and TBLASTN (Table S1) [23,40,51,52,60,61,62,70,71,72,73,74,75,76,77,78]. The resulting gene/protein sequences were further used to perform a combination of BLASTP, TBLASTN and BLASTN to find all potential galactinol and RFO biosynthetic genes in P. vulgaris and G. max. A multiple sequence alignment of the amino acid sequences was made using the MUSCLE algorithm in MEGA X (v10.2.4) and screened for conserved motifs to distinguish the different RFO synthase enzymes [48]. Two conserved motifs (FMxLGTEAxxLG and SGDPxGTxWLQGCHMVHC) were used by the motif search method of MEGA X(v10.2.4) to distinguish RS from alkaline α-galactosidase [51]. To further distinguish SS from RS the presence of a specific amino acid insert, only present in SS as described in Peterbauer et al. (1999), has been evaluated through multiple sequence alignment [52]. Sequence similarity and identity were evaluated and compared with other well-characterized galactinol- and RFO synthase enzymes of A. thaliana (AT2G47180.1, AT1G56600.1, AT1G09350.1, AT1G60470.1, AT5G23790.1, AT4G26250.1, AT1G60450.1, AT5G30500.1, NP_198855.1, NP_192106.3), C. arietinum (AMP59727.1, AMP59729.1), P. sativum (CAD20127.2), O. sativa (XP_015621501.1), Z. mays (NP_001354805.1) and V. angularis (CAB64363.1). Isoforms where predicted based on the annotation of the Phytozome database. To evaluate the evolutionary relationship, a phylogenetic tree was made in MEGA X (v10.2.4) using the amino acid sequences of the different galactinol and RFO biosynthetic and hydrolytic enzymes of P. vulgaris, G. max and well-annotated species A. thaliana (AT2G47180.1, AT1G56600.1, AT1G09350.1, AT1G60470.1, AT5G23790.1, AT4G26250.1, AT1G60450.1, AT5G30500.1, NP_198855.1, NP_192106.3, OAP05273.1, NP_191190.2, NP_001031855.1), C. arietinum (AMP59727.1, AMP59729.1), P. sativum (CAD20127.2), O. sativa (XP_015621501.1), Z. mays (NP_001354805.1, AAQ07251.2, NP_001105794.2, NP_001105775.2) and V. angularis (CAB64363.1), Cucumis melo (AAM75139.1, AAM75140.1) and Solanum lycopersicum (AAF04591.1) using the neighbor-joining algorithm combined with a bootstrap test of 1000 replicates (Table S1) [46,48,51,58,79]. The Poisson correction method was used to compute the evolutionary distances with the number of amino acid substitutions per site as a unit [49]. The phylogenetic tree was drawn to scale using the calculated evolutionary distances as branch lengths. The features of the galactinol and RFO biosynthesis genes were visualized using Gene Structure Display Server 2.0 (GSDS) software [50].

4.3. Transcriptome Analysis of the RFO Biosynthesis Pathway

4.3.1. Expression Atlas of Galactinol- and RFO Biosynthesis Genes in G. max and P. vulgaris by RNA-seq Re-Analysis

The sequence read archive (SRA) of the International Nucleotide Sequence Database Collaboration (INSDC) was screened for publicly available RNA-seq studies that could be used for the creation of an expression atlas of G. max and P. vulgaris under normal conditions. For P. vulgaris, study SRP030614 containing RNA-seq data of genotype BAT93, as described in Vlasova et al. (2016), was used for the creation of an expression atlas [53]. The common bean plants of this study were grown under a 16 h light and 8 h dark photoperiod at ±25 °C and 80% humidity. The study consists of 61 run accession files (SRR) containing raw read files (FASTQ) comprising 453 247 Mbases. The RNA-seq data of study SRP038111, as described in Shen et al. (2014), was used for the creation of an expression atlas for G. max cv. Wm82 [54]. The soybean plants in this study were grown during the growing season at the experimental station of the Institute of Genetics and Developmental Biology of the Chinese Academy of Sciences (38°06′56″ N 114°32′00″ E). This study consists of 28 SRR files containing raw read files (FASTQ) comprising 181 065 Mbases. FASTQC (v0.11.8) was used for the quality control of the FASTQ files. The raw read files were converted into gene count files using a customized Bash script adapted from Kiekens et al. (unpublished; manuscript in preparation) [80]. In short, raw read files were trimmed using Sickle (v1.33) with the threshold value for the fragment length set at 35 bp and the threshold for the quality score set at 30 [81]. STAR (v2.6.0) was used to map the trimmed reads on the Wm82.a2.v1 genome assembly for G. max and the Pvulgaris_442_v2.1 genome assembly for P. vulgaris (http://phytozome.jgi.doe.gov/; accessed on 15 May 2020) [74,82]. The number of sequence reads per gene was calculated using HTSeq (v0.9.1) [83]. The resulting count files were normalized for both the gene length and sequencing depth and were expressed in transcripts per million (TPM) [84]. Heatmaps were created using Pheatmap (v1.0.12) within R studio (v1.4.1106) [85].

4.3.2. Expression Atlas of Galactinol- and RFO Biosynthesis Genes during the Seed Development in P. vulgaris by De Novo RNA-seq Analysis

At four different developmental stages (15, 20, 30 and 35 days after flowering (DAF)) seed samples of P. vulgaris cvs. Pinto, Rosecoco and Canadian wonder were used to extract RNA from using the RNeasy PowerPlant Kit (Qiagen, Hilden, Germany, CAT #13500-50). The quality and quantity of the RNA samples were measured using a Thermo Scientific (Waltham, MA, USA) NanoDrop 1000 Spectrophotometer and the RNA integrity was determined using the bleach gel electrophoresis protocol described by Aranda et al. [86]. Four biological repeats for each developmental stage were sent for sequencing to the Genomics Core facility (Leuven, Belgium). The library was prepared using the QuantSeq 3′ mRNA-Seq Library Prep Kit (FWD) (Lexogen, Wien, Austria, CAT #015.96) and Illumina’s HiSeq4000 was used for sequencing. The resulting raw read files were converted into gene count files using a customized Bash script based on the optimized QuantSeq FWD/REV Data Analysis Pipeline (Available at https://www.lexogen.com/wp-content/uploads/2021/01/015UG108V0310_QuantSeq-Data-Analysis-Pipeline-on-BlueBee-Platform_2021-01-20.pdf; accessed on 15 Aug 2020). The quality of the raw read files was checked using FASTQC (v0.11.8) and trimmed using BBDuk from the BBmap suite (v38.50b, settings: -k13 -ktrim r -useshortkmers t -mink 5 -qtrim r -trimq 10 -minlength 20). FastQC (v0.11.8) was also used to check the quality of the trimmed files. Mapping of the reads on the Pvulgaris_442_v2.1 genome assembly (P. vulgaris v2.1, DOE-JGI and USDA-NIFA, http://phytozome.jgi.doe.gov/; accessed on 15 May 2020) was performed using STAR (v2.6.0) after which Qualimap 2 (v2.2.1) was used to control the quality of the mapping [74,82,87]. The number of sequence reads per gene was calculated using HTSeq (v0.9.1) [83]. The resulting count files were normalized for both the gene length and sequencing depth and were expressed in transcripts per million (TPM) [84]. A more detailed analysis of the RNA-seq data will be published elsewhere by Toili et al. (unpublished; manuscript in preparation) [88].

4.4. Validation of RNA-Sequencing Results in P. vulgaris cv. Rosecoco

The results of the novel RNA-seq study were validated in one variety of P. vulgaris, namely, cv. Rosecoco. In this variety, the expression levels of the galactinol- and RFO biosynthesis genes were measured during different stages of the seed development using qRT-PCR. The primers used for qRT-PCR were designed using primer-BLAST [89]. Standard settings were used for the product size ranging from 70 to 200 bases and melting temperature ranging from 58.0 to 62.0 °C with an optimum of 60.0 °C and a maximum temperature difference of 2 °C. The concentration of dNTPs was set to 0.4 and the concentration of divalent cations was set to 3.0 in the advanced settings. To ensure no genomic DNA could be amplified, only primers that spanned an exon-exon junction were chosen. Standard curves were made to verify the specificity and amplification efficiency of each primer pair. β-tubulin was used as a reference gene [56]. An overview of the primers used for qRT-PCR can be found in Table S2 of the Supplementary Materials.

RNA samples to be used for the qRT-PCR were extracted from 15, 20, 30 and 35 DAF seeds of P. vulgaris cv. Rosecoco (see Section 4.3.2) and treated with RQ1 RNase-Free DNase (Promega, Madison, WI, USA, CAT #M6101) to remove genomic DNA. cDNA was synthesized using the RevertAid H Minus First Strand cDNA Synthesis Kit (Thermo Scientific, CAT #K1631). GoTaq qPCR Master Mix (Promega, CAT # A6001) was used to load the samples on the CFX96 TouchTM Real-Time PCR Detection System (Bio-Rad, Hercules, CA, USA) to perform qRT-PCR. The following settings were used: 95 °C for 3 min followed by 40 cycles of 15 s at 95 °C and 1 min at 60 °C each. The fluorescence was measured after each cycle. The dissociation was analyzed starting at 65.0 °C and increasing till 95.0 °C with increments of 0.5 °C every 5 s. After every increment, the fluorescence signal was measured. For each developmental stage, the gene expression was measured of three biological repeats. The comparative ΔCT method was used to normalize the cDNAs threshold cycle (Ct) values observed by qRT-PCR using β-tubulin as a reference gene [56,90,91]. The differential expression levels of the galactinol- and RFO synthase genes between the different developmental stages (15 vs. 20 DAF, 20 vs. 30 DAF and 30 vs. 35 DAF) were calculated using the 2−ΔΔCt method [91].

Within R studio (v1.4.1106), DESeq2 (v1.30.1) was used to calculate the differential expression levels of the galactinol- and RFO synthase genes between the different developmental stages (15 vs. 20 DAF, 20 vs. 30 DAF and 30 vs. 35 DAF) using the RNA-seq count files of P. vulgaris cv. Rosecoco [92]. To compare the results of the RNA-seq and qRT-PCR data, the log2 fold changes were represented in a bar chart and the correlation between the RNA-seq and qRT-PCR results were calculated using Spearman’s rank correlation coefficient.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/plants10071465/s1, Figure S1: Visualization of the syntenic relationship between P. vulgaris and G. max; Table S1: Overview of the enzymes used for the identification of the galactinol- and RFO synthase genes in P. vulgaris and G. max; Table S2: Primers used for qRT-PCR.

Appendix A

Figure A1.

Figure A1

Visualization of the amino acid motifs present in raffinose synthase (RS) but absent in alkaline α-galactosidase. A multiple sequence alignment was performed using the MUSCLE algorithm in MEGA X of the protein sequences of potential RFO synthase enzymes of G. max and P. vulgaris together with well-characterized RFO synthase and alkaline α-galactosidase enzymes of O. sativa, C. melo, Z. mays, A. thaliana, P. sativum, V. angularis, Stachys affinis and Alonsoa meridionalis. To distinguish RS from alkaline α-galactosidase, two conserved motifs, FMxLGTEAxxLG and SGDPxGTxWLQGCHMVHC, were used. These motifs are present in the amino acid sequence of RS but absent in alkaline α-galactosidase [51].

Figure A2.

Figure A2

Visualization of the amino acid insert only present in stachyose synthase (SS). A multiple sequence alignment was performed using the MUSCLE algorithm in MEGA X of the protein sequences of potential RFO synthase enzymes of G. max and P. vulgaris together with well-characterized RFO synthase and alkaline α-galactosidase enzymes of O. sativa, C. melo, Z. mays, A. thaliana, P. sativum, V. angularis, S. affinis and A. meridionalis. RS can be distinguished from SS because of an insert present only in SS [52].

Author Contributions

Conceptualization, R.d.K., R.K. and G.A.; methodology, R.d.K., R.K. and G.A.; software, R.d.K. and R.K.; validation, R.d.K., G.A., R.d.K. and M.E.M.T.; formal analysis, R.d.K., R.K. and M.E.M.T.; investigation, R.d.K. and M.E.M.T.; resources, G.A.; data curation, R.d.K. and R.K.; writing—original draft preparation, R.d.K.; writing—review and editing, R.d.K., G.A., M.E.M.T. and R.K.; visualization, R.d.K. and R.K.; supervision, R.d.K. and G.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by VLI-UOS, grant number KE2017IUC037A101.

Data Availability Statement

The data generated and analyzed in this study is available in this paper and the Supplementary Materials. The raw reads have been uploaded to the European Nucleotide Archive database (study PRJEB45523).

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Doria E., Campion B., Sparvoli F., Tava A., Nielsen E. Anti-nutrient components and metabolites with health implications in seeds of 10 common bean (Phaseolus vulgaris L. and Phaseolus lunatus L.) landraces cultivated in southern Italy. J. Food Compos. Anal. 2012;26:72–80. doi: 10.1016/j.jfca.2012.03.005. [DOI] [Google Scholar]
  • 2.Ganesan K., Xu B. Polyphenol-Rich Dry Common Beans (Phaseolus vulgaris L.) and Their Health Benefits. Int. J. Mol. Sci. 2017;18:2331. doi: 10.3390/ijms18112331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Campos-Vega R., Oomah B.D., Loarca-Piña G., Vergara-Castañeda H.A. Common Beans and Their Non-Digestible Fraction: Cancer Inhibitory Activity—An Overview. Foods. 2013;2:374–392. doi: 10.3390/foods2030374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Keefe S.O., Bianchi L., Sharman J. Soybean Nutrition. SM J. Nutr. Metab. 2015;1:1–9. [Google Scholar]
  • 5.Maphosa Y., Jideani V. The Role of Legumes in Human Nutrition. In: Chávarri Hueda M., editor. Functional Food—Improve Health through Adequate Food. IntechOpen; London, UK: 2017. [DOI] [Google Scholar]
  • 6.John K.M., Khan F., Luthria D.L., Garrett W., Natarajan S. Proteomic analysis of anti-nutritional factors (ANF’s) in soybean seeds as affected by environmental and genetic factors. Food Chem. 2017;218:321–329. doi: 10.1016/j.foodchem.2016.09.072. [DOI] [PubMed] [Google Scholar]
  • 7.Dierking E.C., Bilyeu K.D. Association of a Soybean Raffinose Synthase Gene with Low Raffinose and Stachyose Seed Phenotype. Plant Genome. 2008;1:135–145. doi: 10.3835/plantgenome2008.06.0321. [DOI] [Google Scholar]
  • 8.McPhee K.E., Zemetra R.S., Brown J., Myers J.R. Genetic Analysis of the Raffinose Family Oligosaccharides in Common Bean. J. Am. Soc. Hortic. Sci. 2002;127:376–382. doi: 10.21273/JASHS.127.3.376. [DOI] [Google Scholar]
  • 9.Moghaddam S.M., Brick M.A., Echeverria D., Thompson H.J., Brick L.A., Lee R., Mamidi S., McClean P.E. Genetic Architecture of Dietary Fiber and Oligosaccharide Content in a Middle American Panel of Edible Dry Bean. Plant Genome. 2018;11:170074. doi: 10.3835/plantgenome2017.08.0074. [DOI] [PubMed] [Google Scholar]
  • 10.Yamaguishi C.T., Sanada C.T., Gouvêa P.M., Pandey A., Woiciechowski A.L., Parada J.L., Soccol C.R. Biotechnological process for producing black bean slurry without stachyose. Food Res. Int. 2009;42:425–429. doi: 10.1016/j.foodres.2009.01.019. [DOI] [Google Scholar]
  • 11.Voragen A.G. Technological aspects of functional food-related carbohydrates. Trends Food Sci. Technol. 1998;9:328–335. doi: 10.1016/S0924-2244(98)00059-4. [DOI] [Google Scholar]
  • 12.Rycroft C.E., Jones M.R., Gibson G.R., Rastall R.A. A comparative in vitro evaluation of the fermentation properties of prebiotic oligosaccharides. J. Appl. Microbiol. 2001;91:878–887. doi: 10.1046/j.1365-2672.2001.01446.x. [DOI] [PubMed] [Google Scholar]
  • 13.Tomlin J., Lowis C., Read N.W. Investigation of normal flatus production in healthy volunteers. Gut. 1991;32:665–669. doi: 10.1136/gut.32.6.665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Valentine M.F., De Tar J.R., Mookkan M., Firman J.D., Zhang Z.J. Silencing of Soybean Raffinose Synthase Gene Reduced Raffinose Family Oligosaccharides and Increased True Metabolizable Energy of Poultry Feed. Front. Plant Sci. 2017;8:692. doi: 10.3389/fpls.2017.00692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Coon C.N., Leske K.L., Akavanichan O., Cheng T.K. Effect of Oligosaccharide-Free Soybean Meal on True Metabolizable Energy and Fiber Digestion in Adult Roosters. Poult. Sci. 1990;69:787–793. doi: 10.3382/ps.0690787. [DOI] [PubMed] [Google Scholar]
  • 16.Sengupta S., Mukherjee S., Basak P., Majumder A.L. Significance of galactinol and raffinose family oligosaccharide synthesis in plants. Front. Plant Sci. 2015;6:656. doi: 10.3389/fpls.2015.00656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zhang C., Turgeon R. Mechanisms of phloem loading. Curr. Opin. Plant Biol. 2018;43:71–75. doi: 10.1016/j.pbi.2018.01.009. [DOI] [PubMed] [Google Scholar]
  • 18.Haritatos E., Keller F., Turgeon R. Raffinose oligosaccharide concentrations measured in individual cell and tissue types in Cucumis melo L. leaves: Implications for phloem loading. Planta. 1996;198:614–622. doi: 10.1007/BF00262649. [DOI] [PubMed] [Google Scholar]
  • 19.Eom J.-S., Choi S.-B., Ward J., Jeon J.-S. The mechanism of phloem loading in rice (Oryza sativa) Mol. Cells. 2012;33:431–438. doi: 10.1007/s10059-012-0071-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Yadav U.P., Ayre B.G., Bush D.R. Transgenic approaches to altering carbon and nitrogen partitioning in whole plants: Assessing the potential to improve crop yields and nutritional quality. Front. Plant Sci. 2015;6:275. doi: 10.3389/fpls.2015.00275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zuther E., Buchel K., Hundertmark M., Stitt M., Hincha D.K., Heyer A.G. The role of raffinose in the cold acclimation response of Arabidopsis thaliana. FEBS Lett. 2004;576:169–173. doi: 10.1016/j.febslet.2004.09.006. [DOI] [PubMed] [Google Scholar]
  • 22.Zuther E., Schulz E., Childs L.H., Hincha D.K. Clinal variation in the non-acclimated and cold-acclimated freezing tolerance of Arabidopsis thaliana accessions. Plant Cell Environ. 2012;35:1860–1878. doi: 10.1111/j.1365-3040.2012.02522.x. [DOI] [PubMed] [Google Scholar]
  • 23.Taji T., Ohsumi C., Iuchi S., Seki M., Kasuga M., Kobayashi M., Yamaguchi-Shinozaki K., Shinozaki K. Important roles of drought- and cold-inducible genes for galactinol synthase in stress tolerance in Arabidopsis thaliana. Plant J. 2002;29:417–426. doi: 10.1046/j.0960-7412.2001.01227.x. [DOI] [PubMed] [Google Scholar]
  • 24.Bartels D., Sunkar R. Drought and Salt Tolerance in Plants. Crit. Rev. Plant Sci. 2005;24:23–58. doi: 10.1080/07352680590910410. [DOI] [Google Scholar]
  • 25.Elsayed A.I., Rafudeen M.S., Golldack D. Physiological aspects of raffinose family oligosaccharides in plants: Protection against abiotic stress. Plant Biol. 2014;16:1–8. doi: 10.1111/plb.12053. [DOI] [PubMed] [Google Scholar]
  • 26.Panikulangara T.J., Eggers-Schumacher G., Wunderlich M., Stransky H., Schöffl F. Galactinol synthase1. A Novel Heat Shock Factor Target Gene Responsible for Heat-Induced Synthesis of Raffinose Family Oligosaccharides in Arabidopsis. Plant Physiol. 2004;136:3148–3158. doi: 10.1104/pp.104.042606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Peters S.W. Ph.D. Thesis. University of Zurich; Zurich, Switzerland: 2010. Raffinose Family Oligaosaccharides (RFOs) are Putative Abiotic Stress Protectants: Case Studies on Frost Tolerance and Water Deficit in Ajuga reptans and Arabidopsis thaliana. [Google Scholar]
  • 28.Nishizawa-Yokoi A., Yabuta Y., Shigeoka S. Galactinol and Raffinose Constitute a Novel Function to Protect Plants from Oxidative Damage. Plant Physiol. 2008;147:1251–1263. doi: 10.1104/pp.108.122465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Seki M., Umezawa T., Urano K., Shinozaki K. Regulatory metabolic networks in drought stress responses. Curr. Opin. Plant Biol. 2007;10:296–302. doi: 10.1016/j.pbi.2007.04.014. [DOI] [PubMed] [Google Scholar]
  • 30.Hincha D.K., Zuther E., Heyer A.G. The preservation of liposomes by raffinose family oligosaccharides during drying is mediated by effects on fusion and lipid phase transitions. Biochim. Biophys. Acta BBA Biomembr. 2003;1612:172–177. doi: 10.1016/S0005-2736(03)00116-0. [DOI] [PubMed] [Google Scholar]
  • 31.Bailly C., Audigier C., Ladonne F., Wagner M.-H., Coste F., Corbineau F., Côme D. Changes in oligosaccharide content and antioxidant enzyme activities in developing bean seeds as related to acquisition of drying tolerance and seed quality. J. Exp. Bot. 2001;52:701–708. doi: 10.1093/jexbot/52.357.701. [DOI] [PubMed] [Google Scholar]
  • 32.Blackman S.A., Obendorf R.L., Leopold A.C. Maturation Proteins and Sugars in Desiccation Tolerance of Developing Soybean Seeds. Plant Physiol. 1992;100:225–230. doi: 10.1104/pp.100.1.225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Blöchl A., Peterbauer T., Richter A. Inhibition of raffinose oligosaccharide breakdown delays germination of pea seeds. J. Plant Physiol. 2007;164:1093–1096. doi: 10.1016/j.jplph.2006.10.010. [DOI] [PubMed] [Google Scholar]
  • 34.Gangola M., Jaiswal S., Kannan U., Gaur P., Båga M., Chibbar R.N. Galactinol synthase enzyme activity influences raffinose family oligosaccharides (RFO) accumulation in developing chickpea (Cicer arietinum L.) seeds. Phytochemistry. 2016;125:88–98. doi: 10.1016/j.phytochem.2016.02.009. [DOI] [PubMed] [Google Scholar]
  • 35.Kandler O., Hopf H. Occurrence, Metabolism, and Function of Oligosaccharides. Volume 3. Academic Press; Cambridge, MA, USA: 1980. [Google Scholar]
  • 36.Frias J., Bakhsh A., Jones D.A., Arthur A.E., Vidal-Valverde C., Rhodes M.J.C., Hedley C.L. Genetic analysis of the raffinose oligosaccharide pathway in lentil seeds. J. Exp. Bot. 1999;50:469–476. doi: 10.1093/jxb/50.333.469. [DOI] [Google Scholar]
  • 37.Peterbauer T., Richter A. Biochemistry and physiology of raffinose family oligosaccharides and galactosyl cyclitols in seeds. Seed Sci. Res. 2001;11:185–197. doi: 10.1079/SSR200175. [DOI] [Google Scholar]
  • 38.Blöchl A., Peterbauer T., Hofmann J., Richter A. Enzymatic breakdown of raffinose oligosaccharides in pea seeds. Planta. 2008;228:99–110. doi: 10.1007/s00425-008-0722-4. [DOI] [PubMed] [Google Scholar]
  • 39.Dierking E.C., Bilyeu K.D. Raffinose and stachyose metabolism are not required for efficient soybean seed germination. J. Plant Physiol. 2009;166:1329–1335. doi: 10.1016/j.jplph.2009.01.008. [DOI] [PubMed] [Google Scholar]
  • 40.Peterbauer T., Mach L., Mucha J., Richter A. Functional expression of a cDNA encoding pea (Pisum sativum L.) raffinose synthase, partial purification of the enzyme from maturing seeds, and steady-state kinetic analysis of raffinose synthesis. Planta. 2002;215:839–846. doi: 10.1007/s00425-002-0804-7. [DOI] [PubMed] [Google Scholar]
  • 41.Peterbauer T., Mucha J., Mach L., Richter A. Chain elongation of raffinose in pea seeds. Isolation, characterization, and molecular cloning of a multifunctional enzyme catalyzing the synthesis of stachyose and verbascose. J. Biol. Chem. 2002;277:194–200. doi: 10.1074/jbc.M109734200. [DOI] [PubMed] [Google Scholar]
  • 42.Lahuta L.B., Goszczyńska J. Inhibition of raffinose family oligosaccharides and galactosyl pinitols breakdown delays germination of winter vetch (Vicia villosa Roth.) seeds. Acta Soc. Bot. Pol. 2011;78:203–208. doi: 10.5586/asbp.2009.025. [DOI] [Google Scholar]
  • 43.Bachmann M., Keller F. Metabolism of the Raffinose Family Oligosaccharides in Leaves of Ajuga reptans L. (Inter- and Intracellular Compartmentation) Plant Physiol. 1995;109:991–998. doi: 10.1104/pp.109.3.991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gilbert G.A., Wilson C., Madore M.A. Root-Zone Salinity Alters Raffinose Oligosaccharide Metabolism and Transport in Coleus. Plant Physiol. 1997;115:1267–1276. doi: 10.1104/pp.115.3.1267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Le H., Nguyen N.H., Ta D.T., Le T.N.T., Bui T.P., Le N.T., Nguyen C.X., Rolletschek H., Stacey G., Stacey M.G., et al. CRISPR/Cas9-Mediated Knockout of Galactinol Synthase-Encoding Genes Reduces Raffinose Family Oligosaccharide Levels in Soybean Seeds. Front. Plant Sci. 2020;11:2033. doi: 10.3389/fpls.2020.612942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Saitou N., Nei M. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 1987;4:406–425. doi: 10.1093/oxfordjournals.molbev.a040454. [DOI] [PubMed] [Google Scholar]
  • 47.Calestani D.V. Contributo alla Sistemica: Ombrellifere D’Europa. Webbia. 1905;1:89–280. doi: 10.1080/00837792.1905.10669550. [DOI] [Google Scholar]
  • 48.Kumar S., Stecher G., Li M., Knyaz C., Tamura K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018;35:1547–1549. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Zuckerkandl E., Pauling L. Evolutionary Divergence and Convergence in Proteins. In: Bryson V., Vogel H.J., editors. Evolving Genes and Proteins. Academic Press; New York, NY, USA: 1965. pp. 97–166. [DOI] [Google Scholar]
  • 50.Hu B., Jin J., Guo A.-Y., Zhang H., Luo J., Gao G. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics. 2015;31:1296–1297. doi: 10.1093/bioinformatics/btu817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Li T., Zhang Y., Wang D., Liu Y., Dirk L.M., Goodman J., Downie A.B., Wang J., Wang G., Zhao T. Regulation of Seed Vigor by Manipulation of Raffinose Family Oligosaccharides in Maize and Arabidopsis thaliana. Mol. Plant. 2017;10:1540–1555. doi: 10.1016/j.molp.2017.10.014. [DOI] [PubMed] [Google Scholar]
  • 52.Peterbauer T., Mucha J., Mayer U., Popp M., Glossl J., Richter A. Stachyose synthesis in seeds of adzuki bean (Vigna angularis): Molecular cloning and functional expression of stachyose synthase. Plant J. 1999;20:509–518. doi: 10.1046/j.1365-313X.1999.00618.x. [DOI] [PubMed] [Google Scholar]
  • 53.Vlasova A., Capella-Gutiérrez S., Rendón-Anaya M., Oñate M., Ángel H., Minoche A.E., Erb I., Câmara F., Prieto-Barja P., Corvelo A., et al. Genome and transcriptome analysis of the Mesoamerican common bean and the role of gene duplications in establishing tissue and temporal specialization of genes. Genome Biol. 2016;17:32. doi: 10.1186/s13059-016-0883-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Shen Y., Zhou Z., Wang Z., Li W., Fang C., Wu M., Ma Y., Liu T., Kong L.-A., Peng D.-L., et al. Global Dissection of Alternative Splicing in Paleopolyploid Soybean. Plant Cell. 2014;26:996–1008. doi: 10.1105/tpc.114.122739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Fehr W.R., Caviness C.E. Stages of soybean development. Spec. Rep. 1977;87:1–12. [Google Scholar]
  • 56.Pereira W., Bassinello P.Z., Brondani C., Vianello R.P. An improved method for RNA extraction from common bean seeds and validation of reference genes for qPCR. Crop Breed. Appl. Biotechnol. 2017;17:150–158. doi: 10.1590/1984-70332017v17n2a22. [DOI] [Google Scholar]
  • 57.Qiu D., Vuong T., Valliyodan B., Shi H., Guo B., Shannon J.G., Nguyen H.T. Identification and characterization of a stachyose synthase gene controlling reduced stachyose content in soybean. Theor. Appl. Genet. 2015;128:2167–2176. doi: 10.1007/s00122-015-2575-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Liu Y., Zhang L., Chen L., Ma H., Ruan Y., Xu T., Xu C., He Y., Qi M. Molecular cloning and expression of an encoding galactinol synthase gene (AnGolS1) in seedling of Ammopiptanthus nanus. Sci. Rep. 2016;6:36113. doi: 10.1038/srep36113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Li R., Yuan S., He Y., Fan J., Zhou Y., Qiu T., Lin X., Yao Y., Liu J., Fu S., et al. Genome-Wide Identification and Expression Profiling Analysis of the Galactinol Synthase Gene Family in Cassava (Manihot esculenta Crantz) Agronomy. 2018;8:250. doi: 10.3390/agronomy8110250. [DOI] [Google Scholar]
  • 60.Salvi P., Kamble N.U., Majee M. Stress-Inducible Galactinol Synthase of Chickpea (CaGolS) is Implicated in Heat and Oxidative Stress Tolerance Through Reducing Stress-Induced Excessive Reactive Oxygen Species Accumulation. Plant Cell Physiol. 2018;59:155–166. doi: 10.1093/pcp/pcx170. [DOI] [PubMed] [Google Scholar]
  • 61.Peters S., Egert A., Stieger B., Keller F. Functional Identification of Arabidopsis ATSIP2 (At3g57520) as an Alkaline -Galactosidase with a Substrate Specificity for Raffinose and an Apparent Sink-Specific Expression Pattern. Plant Cell Physiol. 2010;51:1815–1819. doi: 10.1093/pcp/pcq127. [DOI] [PubMed] [Google Scholar]
  • 62.Li S., Li T., Kim W.-D., Kitaoka M., Yoshida S., Nakajima M., Kobayashi H. Characterization of raffinose synthase from rice (Oryza sativa L. var. Nipponbare) Biotechnol. Lett. 2007;29:635–640. doi: 10.1007/s10529-006-9268-3. [DOI] [PubMed] [Google Scholar]
  • 63.E McClean P., Mamidi S., McConnell M., Chikara S., Lee R. Synteny mapping between common bean and soybean reveals extensive blocks of shared loci. BMC Genom. 2010;11:1–10. doi: 10.1186/1471-2164-11-184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Lee C., Yu D., Choi H.-K., Kim R.W. Reconstruction of a composite comparative map composed of ten legume genomes. Genes Genom. 2017;39:111–119. doi: 10.1007/s13258-016-0481-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Schmutz J., Cannon S.B., Schlueter J.A., Ma J., Mitros T., Nelson W., Hyten D.L., Song Q., Thelen J.J., Cheng J., et al. Genome sequence of the palaeopolyploid soybean. Nature. 2010;463:178–183. doi: 10.1038/nature08670. [DOI] [PubMed] [Google Scholar]
  • 66.Santos T., Budzinski I.G., Marur C.J., Petkowicz C., Pereira L.F.P., Vieira L.G. Expression of three galactinol synthase isoforms in Coffea arabica L. and accumulation of raffinose and stachyose in response to abiotic stresses. Plant Physiol. Biochem. 2011;49:441–448. doi: 10.1016/j.plaphy.2011.01.023. [DOI] [PubMed] [Google Scholar]
  • 67.Zhao T.-Y., Thacker R., Corum J.W., Snyder J.C., Meeley R.B., Obendorf R.L., Downie B. Expression of the maize GALACTINOL SYNTHASE gene family: (I) Expression of two different genes during seed development and germination. Physiol. Plant. 2004;121:634–646. doi: 10.1111/j.1399-3054.2004.00367.x. [DOI] [Google Scholar]
  • 68.Lowell C.A., Kuo T.M. Oligosaccharide Metabolism and Accumulation in Developing Soybean Seeds. Crop. Sci. 1989;29:459–465. doi: 10.2135/cropsci1989.0011183X002900020044x. [DOI] [Google Scholar]
  • 69.Saldivar X., Wang Y.-J., Chen P., Hou A. Changes in chemical composition during soybean seed development. Food Chem. 2011;124:1369–1375. doi: 10.1016/j.foodchem.2010.07.091. [DOI] [Google Scholar]
  • 70.Tsaniklidis G., Benovias A., Delis C., Aivalakis G. Acidic alpha galactosidase during the maturation and cold storage of cherry tomatoes. Acta Physiol. Plant. 2016;38:1–9. doi: 10.1007/s11738-016-2075-0. [DOI] [Google Scholar]
  • 71.Egert A., Keller F., Peters S. Abiotic stress-induced accumulation of raffinose in Arabidopsis leaves is mediated by a single raffinose synthase (RS5, At5g40390) BMC Plant Biol. 2013;13:218. doi: 10.1186/1471-2229-13-218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Gangl R., Behmüller R., Tenhaken R. Molecular cloning of AtRS4, a seed specific multifunctional RFO synthase/galactosylhydrolase in Arabidopsis thaliana. Front. Plant Sci. 2015;6:789. doi: 10.3389/fpls.2015.00789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Sun Z., Qi X., Wang Z., Li P., Wu C., Zhang H., Zhao Y. Overexpression of TsGOLS2, a galactinol synthase, in Arabidopsis thaliana enhances tolerance to high salinity and osmotic stresses. Plant Physiol. Biochem. 2013;69:82–89. doi: 10.1016/j.plaphy.2013.04.009. [DOI] [PubMed] [Google Scholar]
  • 74.Goodstein D.M., Shu S., Howson R., Neupane R., Hayes R., Fazo J., Mitros T., Dirks W., Hellsten U., Putnam N., et al. Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res. 2011;40:D1178–D1186. doi: 10.1093/nar/gkr944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 76.Zhao T.-Y., Iii J.W.C., Mullen J., Meeley R.B., Helentjaris T., Martín D., Downie B. An alkaline α-galactosidase transcript is present in maize seeds and cultured embryo cells, and accumulates during stress. Seed Sci. Res. 2006;16:107–121. doi: 10.1079/SSR2006243. [DOI] [Google Scholar]
  • 77.Carmi N., Zhang G., Petreikov M., Gao Z., Eyal Y., Granot D., Schaffer A.A. Cloning and functional expression of alkaline alpha-galactosidase from melon fruit: Similarity to plant SIP proteins uncovers a novel family of plant glycosyl hydrolases. Plant J. 2003;33:97–106. doi: 10.1046/j.1365-313X.2003.01609.x. [DOI] [PubMed] [Google Scholar]
  • 78.Imaizumi C., Tomatsu H., Kitazawa K., Yoshimi Y., Shibano S., Kikuchi K., Yamaguchi M., Kaneko S., Tsumuraya Y., Kotake T. Heterologous expression and characterization of an Arabidopsis β-l-arabinopyranosidase and α-d-galactosidases acting on β-l-arabinopyranosyl residues. J. Exp. Bot. 2017;68:4651–4661. doi: 10.1093/jxb/erx279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Hillis D.M., Bull J.J. An Empirical Test of Bootstrapping as a Method for Assessing Confidence in Phylogenetic Analysis. Syst. Biol. 1993;42:182. doi: 10.1093/sysbio/42.2.182. [DOI] [Google Scholar]
  • 80.Kiekens R., de Koning R., Toili E., Angenon G. The Hidden Potential of High-Throughput RNA-Seq Re-Analysis, a Case Study for DHDPS, Key Enzyme of the Aspartate-Derived Lysine Biosynthesis Pathway and Its Role in Abiotic and Biotic Stress Responses in Soybean. 2021 doi: 10.3390/plants11131762. Manuscript submitted for publication. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Joshi N., Fass J. Sickle: A Sliding-Window, Adaptive, Quality-Based Trimming Tool for FastQ Files (Version 1.33) [Software] [(accessed on 15 May 2020)]; Available online: https://github.com/najoshi/sickle.
  • 82.Dobin A., Gingeras T.R., Spring C., Flores R., Sampson J., Knight R., Chia N., Technologies H.S. Mapping RNA-seq with STAR. Curr. Protoc. Bioinform. 2016;51:586–597. doi: 10.1002/0471250953.bi1114s51.Mapping. [DOI] [Google Scholar]
  • 83.Anders S., Pyl P.T., Huber W. HTSeq—A Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Wagner G.P., Kin K., Lynch V.J. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci. 2012;131:281–285. doi: 10.1007/s12064-012-0162-3. [DOI] [PubMed] [Google Scholar]
  • 85.Kolde R. Package “pheatmap”. R Packag. 2015;1:790 [Google Scholar]
  • 86.Aranda P.S., Lajoie D.M., Jorcyk C.L. Bleach gel: A simple agarose gel for analyzing RNA quality. Electrophoresis. 2012;33:366–369. doi: 10.1002/elps.201100335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Okonechnikov K., Conesa A., García-Alcalde F. Qualimap 2: Advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics. 2016;32:292–294. doi: 10.1093/bioinformatics/btv566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Toili M.E.M., de Koning R., Kiekens R., Wahome S., Githiri S.M., Angenon G. A comparative transcriptome analysis of fast and slow-cooking common beans (Phaseolus vulgaris L.) during seed development reveals differentially expressed genes involved in the hard-to-cook defect. 2021 Manuscript submitted for publication. [Google Scholar]
  • 89.Ye J., Coulouris G., Zaretskaya I., Cutcutache I., Rozen S., Madden T.L. Primer-BLAST: A tool to design target-specific primers for polymerase chain reaction. BMC Bioinform. 2012;13:134. doi: 10.1186/1471-2105-13-134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Schmittgen T.D., Livak K.J. Analyzing real-time PCR data by the comparative CT method. Nat. Protoc. 2008;3:1101–1108. doi: 10.1038/nprot.2008.73. [DOI] [PubMed] [Google Scholar]
  • 91.Livak K.J., Schmittgen T.D. Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2−ΔΔCT Method. Methods. 2001;25:402–408. doi: 10.1006/meth.2001.1262. [DOI] [PubMed] [Google Scholar]
  • 92.Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The data generated and analyzed in this study is available in this paper and the Supplementary Materials. The raw reads have been uploaded to the European Nucleotide Archive database (study PRJEB45523).


Articles from Plants are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES