Abstract
Tofu is a popular soybean (Glycine max (L.) Merr.) food with a long tradition in Asia and rising popularity worldwide, including Central Europe. Due to the labour-intensive phenotyping procedures, breeding for improved tofu quality is challenging. Therefore, our objective was to unravel the genetic architecture of traits relevant for tofu production in order to assess the potential of marker-assisted selection and genomic selection in breeding for these traits. To this end, we performed QTL mapping with 188 genotypes from a biparental mapping population. The population was evaluated in a two-location field trial, and tofu was produced in the laboratory to evaluate tofu quality. We identified QTL for all investigated agronomic and quality traits, each explaining between 6.40% and 27.55% of the genotypic variation, including the most important tofu quality traits, tofu yield and tofu hardness. Both traits showed a strong negative correlation (r = -0.65), and consequently a pleiotropic QTL on chromosome 10 was found with opposite effects on tofu hardness and tofu weight, highlighting the need to balance selection for both traits. Four QTL identified for tofu hardness jointly explained 68.7% of the genotypic variation and are possible targets for QTL stacking by marker-assisted selection. To exploit also small-effect QTL, genomic selection revealed moderate to high mean prediction accuracies for all traits, ranging from 0.47 to 0.78. In conclusion, inheritance of tofu quality traits is highly quantitative, and both marker-assisted selection and genomic selection present valuable tools to advance tofu quality by soybean breeding.
Supplementary Information
The online version contains supplementary material available at 10.1007/s11032-024-01529-x.
Keywords: Soybean, Tofu traits, QTL mapping, Genomic selection, Plant breeding
Introduction
Soybean (Glycine max (L.) Merr.) is a leguminous crop of high economic importance. It is primarily used as animal feed, however soybean is also widely used in human nutrition. Soybean products for human consumption include, for example, soybean oil, soy sauce, tempeh, and tofu, i.e. soybean curd. Soybean products are valued for their high protein content and the various health benefits associated with the consumption of soy foods (e.g., Zhang et al. 2003; Lima et al. 2017). With the rise in popularity of plant-based products high in protein, often used as meat alternatives, the demand for tofu and tofu products continues to increase (Statista 2020). Consequently, breeding soybean varieties with improved tofu quality is becoming increasingly important. Amongst other parameters, tofu quality is characterized by tofu hardness, protein content, amino acid profile, and taste (Kurasch et al. 2018a; Wang et al. 2020), making tofu quality evaluation complex. Especially for tofu hardness, the desired levels depend heavily on the intended use and the region of consumption. For example in Asia, a wide variety of tofu texture is offered, whereas the majority of tofu sold in German supermarkets tends to be very firm.
The procedure of evaluating tofu quality is laborious and time consuming, as tofu has to be produced for direct testing, which additionally requires adequate sample sizes of soybeans for destructive measurements. Therefore, selection on a large scale or in the early stages of the breeding cycle is not feasible. In addition to the effect of the genotype, tofu quality traits are influenced by the soybean growing environment (Aziadekey et al. 2002; Poysa and Woodrow 2002; Min et al. 2005; Kurasch et al. 2018a), seed storage conditions after harvest (Kong et al. 2008) and tofu production method (Kurasch et al. 2018a; Huang et al. 2022). Research on associations between tofu quality and soybean agronomic and seed quality traits is inconclusive, making indirect selection, for example by breeding for high protein content, unreliable. Genomics based methods like marker-assisted selection and genomic selection would therefore be valuable tools to drive soybean breeding for superior suitability for tofu production. However, only few QTL mapping studies have been reported. Kurasch et al. (2018b) performed QTL mapping of a range of tofu quality traits, but in each of the three populations identified at maximum one QTL per trait, whereas other studies only focused on mapping loci related to tofu and soymilk yield (e.g., Wang et al. 2008; Huang et al. 2023). Thus, little is known so far about the genetic control of tofu quality traits besides the quantitative inheritance of the different tofu yield and quality parameters (Kurasch et al. 2018b).
A prerequisite for marker-assisted selection is the availability of markers that are diagnostic for or at least closely linked to genetic loci associated with the trait of interest. This is feasible for monogenic traits and for quantitative traits for which individual quantitative trait loci (QTL) explain a large proportion of the genotypic variation of the trait. When, in contrast, many QTL of small effect are responsible for the expression of a complex, highly quantitative trait, marker-assisted selection is not feasible. An alternative approach to assist selection of such traits in breeding is genomic selection. This approach typically utilises a large number of markers covering the whole genome in order to calculate a genomic estimated breeding value that is then used to make selection decisions.
In Germany, production of soybean is on the rise (Eurostat 2024). Furthermore, tofu manufacturers have an interest in locally grown soybeans of high tofu quality. The adaptation to the Central European climate in combination with high tofu quality requires cultivar improvement by plant breeding. To advance soybean breeding with respect to tofu quality, the aims of this study were to investigate the genetic architecture of tofu quality traits by QTL mapping and based on these findings draw conclusions for the potential of marker-assisted and genomic selection as tools to improve tofu quality parameters in breeding.
Materials and methods
Plant material and assessment of agronomic traits
One hundred and ninety-eight recombinant inbred lines (RIL) in F4:6 generation (self-pollinated to F4 and then bulked until F6) from a biparental population were tested together with their parental lines, resulting in 200 genotypes in total. Both parental lines were breeding lines high in protein and suitable for tofu production. The field trials were conducted in 2021 in Eckartsweier (EWE) and Hohenheim (HOH) in Southwest Germany. The trials were designed as an alpha lattice design with two replications per location and 20 blocks of 10 genotypes per replication. The soybeans were grown in two-row micro plots of 1.1 m length and 0.32 m row spacing, resulting in a plot size of 0.352 m2. Before sowing, the soybean seeds were inoculated with Liquifix and Legumefix (Legume Technology LTD, East Bridgeford, UK). While measuring seed yield was not meaningful due to the small plot size, thousand-seed weight was measured as a yield parameter. Protein and oil content were estimated in the laboratory on seeds dried at 9–10% seed moisture content using near-infrared spectroscopy using a PSS-HOP spectrometer (Polytec GmbH, Waldenbronn, Germany).
Genotyping
For genotyping, the 198 RILs and their two parental lines were grown in 96-well seedling trays with 3.5 × 3.5 cm wells filled with a 2:1 mix of soil (Substrat 5, Klasmann-Deilmann GmbH, Geeste, Germany) and sand in a growth chamber (12 h light and 12 h darkness; 28°C). Young leaf material was harvested after 15 days and all samples were dried in silica gel for one week.
About 15 mg of dried leaf material was used for DNA extractions following a slightly modified plate-based protocol described by Fazekas et al. (2012). Final washing steps were done using first washing buffer 2 and then 70% ethanol. DNA quantity was measured using the QuantiFluor® dsDNA System kit (Promega, Madison, USA) and all samples were normalised to 10 ng/µl. Genotyping-by-Sequencing (GBS) libraries were constructed following a slightly modified protocol described by Elshire et al. (2011). In short, single i7 indexes were added to allow multiplexing of the libraries. Additionally, an ExoSAP digest using the GenUP™ Exo SAP Kit (biotechrabbit, Berlin, Germany) was added according to the manufacturer’s protocol to remove single stranded DNA and small dimers after the PCR. PCR products were then selected by size using the ProNex® Size-Selective Purification System kit (Promega, Madison, USA). The final quality of the libraries was assessed using the HS NGS Fragment Kit (1–6000 bp, Agilent, Santa Clara, USA). Libraries were sequenced on an Illumina Nova Seq machine (Macrogene and Azenta/Genewiz) generating 150 bp paired-end reads with an average of 4,280,863 reads per genotype (Table S1).
For bioinformatic processing of the GBS data, the GB-eaSy pipeline (Wickland et al. 2017) was followed. First, reads were demultiplexed based on the barcode sequences using the GBSX tool (Herten et al. 2015). Quality checks on the raw data were performed using FastQC and all further steps were done excluding the second read due to quality issues. Reads were cleaned using the program fastp (Chen et al. 2018) and reads were then mapped against the Wm82.a2.v1 soybean reference genome (Schmutz et al. 2010) using bwa mem (Li and Durbin 2009) with default settings. SNP calling was done by first generating the genotype likelihoods using bcftools mpileup (Li 2011) using the flags –skip-indels, –min-MQ 20, and –min-BQ 20 and then performing the SNP calling on the mpileup files using bcftools call (Li 2011) with the flags –multiallelic-caller and –variants-only by chromosome. Finally, SNP files were concatenated using bcftools concat (Li 2011). Raw SNP files were filtered with vcftools (Danecek et al. 2011) using the –max-alleles 2, –minDP 8 and –max-missing 0.8 to flags to retain a final SNP file with a total of 315,385 SNPs.
Preparation of test tofu and assessment of tofu quality traits
In order to assess tofu quality traits, test tofu was prepared using an adaptation of the bench scale method developed by Kurasch et al. (2018a). Seed moisture content of samples of 80 g were measured using a FarmPro moisture analyser (Horn GmbH, Bad Saulgau, Germany). The sample weights were then adjusted to a moisture content of 12% (corrected dry bean weight). Afterwards, the samples were soaked in tap water for 24 h at a temperature of about 8 °C. To prepare soymilk, the drained soybeans were ground with 480 ml of tap water for one minute at 10,000 rpm. The slurry was then filtered with a filter hose to separate the raw soymilk from the pulp. The soymilk was heated to 98.5 °C for 3 min with constant stirring at 150 rpm. An estimate of solid contents dissolved in the soymilk in °Brix was obtained with a PR-32α refractometer (ATAGO Co., LTD, Tokyo, Japan). A measured value was accepted once the value was constant for three consecutive measurement runs. The soymilk was then adjusted to 8.4°Brix by adding heated tap water. Of the adjusted soymilk, 500 ml were used to further process into tofu. As a coagulant, 0.45 g CaSO4 was dissolved in 20.55 ml of a 2.95% Nigari solution (MgCl2 6H2O) prepared with tap water. For that, the coagulant was added to the soymilk at a temperature between 84.5 and 85.5 °C. The mixture was stirred for six seconds at 250 rpm and then placed into a water bath at 84.5–85.5 °C for 20 min. Afterwards, the curd was broken up using a curd breaker and left in the water bath. After 10 min, the whey was drained from the curd with a sieve for 45 s. The curd was filled into moulds of 7.5 cm diameter and pressed with rising pressure from 0.1 to 1 to 2 to 3 to 4 Pa for 4 min each. The pressed tofu was stored in cold water at around 8 °C for 12 to 18 h overnight. After draining, the tofu was weighed to obtain tofu weight (TW).
For additional tofu quality parameters, the soaking factor (SF) was calculated as .
. Tofu hardness (TH) in N was obtained using a texturometer (Zwick AG, Ulm, Germany). Soymilk weight (SMW) in g refers to the weight of the soymilk adjusted to 8.4°Brix. Tofu yield (TY) was calculated as . The tofu value (TV) was calculated as . The number of hard beans was counted in each sample. Soybeans that did not take up water during the soaking process were considered as hard beans.
Phenotypic analysis
Best linear unbiased estimators (BLUE), best linear unbiased predictors (BLUP) and variance components were calculated for all traits using the model
where yijkm represents the phenotypic observation, gi the ith genotype, lj the jth test location, glij is the genotype-by-location interaction, rjk is the replication k in location j, bjkm is the mth block in replication k and location j, and εijkm is the corresponding error term. To calculate BLUE values, gi was considered as a fixed effect. To calculate BLUP values and variance components, gi was considered a random effect. The counts of hard beans were transformed by square root transformation prior to the estimation of BLUEs, BLUPs and variance components in order to fulfil the model assumptions. Thirteen small negative BLUE values were set to zero for biological plausibility. Heritability (h2) was calculated as proposed by Cullis et al. (2006) and described by Piepho and Möhring (2007) as follows:
where describes the mean variance of the difference between two BLUP values and describes the genotypic variance.
All analyses were conducted using the programming language R (R Core Team 2021) in the RStudio environment (Posit team 2023). BLUEs, BLUPs and variance components were estimated using the R package ASReml-R (Butler 2021).
Linkage map construction and QTL analysis
A linkage map was constructed for the QTL mapping using the R package ASMap (Taylor and Butler 2017). Prior to map construction, missing marker values were imputed and the marker data was corrected for miscalled alleles using the R package ABHgenotypeR (Reuscher and Furuta 2016). In addition, genotypes with heterozygosity above 30% and markers with excessive segregation distortion were removed. During linkage map construction, markers were additionally filtered for inflated double crossovers. Physical and genetic marker orders were plotted against each other and markers showing a strong deviation from the expected pattern were removed. After these filtering steps, the final linkage map was constructed with the remaining 188 genotypes and 2,452 SNP markers. The same data set was then also used for QTL mapping.
QTL analysis was conducted using a composite interval mapping (CIM) approach with an additive genetic model and the software PlabMQTL (Utz 2011). Empirical logarithm of odds (LOD) thresholds with a genome-wide significance level of were computed for each trait with permutation tests of 1000 runs (Churchill and Doerge 1994). The modified Bayesian information criterion (mBIC) was used for cofactor selection. The proportion of genotypic variance explained by each QTL (pG) was calculated as the value of normalised according to Zhu et al. (2004), where describes the proportion of the phenotypic variance explained by the respective QTL. QTL results were validated by 200 independent fivefold cross-validations (Utz et al. 2000).
Genomic prediction
Genomic prediction was conducted for all traits by ridge regression BLUP (RR-BLUP) using the rrBLUP package in R (Endelman 2011). The genotypes and SNPs from the linkage map were used and missing values were imputed using the software LinkImpute (Money et al. 2015). Using BLUEs across locations, 1000 runs of a fivefold cross-validation were applied, where the training set (TS) included a random set of 80% of the genotypes and the prediction set (PS) contained the remaining 20%. The prediction accuracy was calculated as , where rMP represents the correlation between genomic estimated breeding values (GEBV) and observed BLUE values and h represents the square root of the heritability (Lande and Thompson 1990; Dekkers 2007).
Results
Heritabilities for thousand-seed weight, protein content and oil content were medium to high at 0.65, 0.72 and 0.62, respectively (Table 1). For the tofu quality traits, heritabilities were generally lower ranging from 0.23 for tofu weight to 0.66 for the number of hard beans. The genotypic variance was highly significant (P < 0.001) for all traits except for tofu weight. The effect of the trial location was highly significant (P < 0.001) as well, for all traits except for the soaking factor. In contrast, the genotype-by-location interaction variance was significant only for thousand-seed weight, oil content and hard beans.
Table 1.
Summary statistics of the evaluated traits. Mean of BLUE values (), genotypic variance (), location variance (), the respective interaction variance (), error variance (), and heritability () for thousand-seed weight in gram (TSW), protein content in percent (PC), oil content in percent (OC), soaking factor (SF), soymilk weight in gram (SM), tofu weight in gram (TW), tofu yield (TY), tofu hardness in Newton (TH), tofu value (TV), and the square root of counted hard beans (HB)
| Trait | + | |||||
|---|---|---|---|---|---|---|
| TSW | 227.97 | 102.29*** | 66.78*** | 44.09*** | 123.44 | 0.65 |
| PC | 46.73 | 0.63*** | 0.11*** | 9.33e−8ns | 0.92 | 0.72 |
| OC | 15.91 | 0.10*** | 0.34*** | 0.04** | 0.16 | 0.62 |
| SF | 2.11 | 5.33e−4*** | 0.05e−4ns | 0.78e−4ns | 9.86e−4 | 0.64 |
| HB | 1.05 | 0.32*** | 0.11*** | 0.09** | 0.46 | 0.66 |
| SM | 571.55 | 61.82*** | 76.47*** | 6.73ns | 146.93 | 0.59 |
| TW | 139.29 | 5.08ns | 18.39*** | 2.98ns | 59.34 | 0.23 |
| TY | 1.97 | 2.55e−3*** | 4.07e−3*** | 0.47e−3ns | 10.00e−3 | 0.47 |
| TH | 78.96 | 33.05*** | 62.06*** | 1.04e−5ns | 196.42 | 0.39 |
| TV | 300.95 | 212.00*** | 101.83*** | 1.87e−4ns | 1848.20 | 0.30 |
*P < 0.05, **P < 0.01, ***P < 0.001, nsnot significant
+Mean values for HB were transformed back to represent the count of hard beans
BLUEs across locations varied substantially between the genotypes and approximately followed a normal distribution for all traits except for hard bean count (Fig. 1). On average, the genotypes showed high thousand-seed weight (227.97 g), very high protein content (46.73%) and lower oil content (15.91%) (Table 1). For tofu quality traits, tofu hardness ranged from 54.67 N to 108.11 N and the mean of 78.96 N is slightly above the producer’s threshold of 75 N. In contrast, ranging between 1.74 and 2.16, mean tofu yield of 1.97 lay slightly below the producer’s threshold of 2.00. The two parental genotypes showed very similar performance for all tofu quality traits except for the soymilk weight (Fig. 1). For thousand-seed weight, protein content and oil content, the parents differed more strongly, as parent 1 showed higher thousand-seed weight and protein content, whereas parent 2 was higher in oil content.
Fig. 1.
Trait distributions. Distribution of BLUE values across locations for the traits thousand-seed weight (TSW), protein content, oil content, soaking factor, soymilk weight, tofu weight, tofu yield, tofu hardness, tofu value, and number of hard beans. The performance of the parental genotypes is indicated by red triangles. Note, BLUE values for hard bean counts were estimated based on square root transformed values, the resulting BLUEs were transformed back for this figure
The yield parameter thousand-seed weight was significantly (P < 0.05) positively correlated with protein content and soymilk weight at a moderate level, whereas it was negatively correlated with oil content, tofu weight, hard bean count and soaking factor (Fig. 2). However, thousand-seed weight showed no significant correlation with tofu yield, tofu hardness and tofu value. Protein content was weakly to moderately positively correlated with soymilk weight, tofu value and tofu hardness and negatively correlated with tofu weight, soaking factor and count of hard beans. Tofu yield showed a strong positive correlation with tofu weight (r = 0.87) and moderate positive correlation with soymilk weight. In contrast, tofu yield was moderately negatively correlated with hard bean count (r = −0.34) and tofu hardness (r = −0.65). In addition, tofu hardness was strongly positively correlated with the tofu value (r = 0.80), whereas the correlation with tofu weight was strongly negative (r = −0.71). Tofu value was further moderately positively correlated with soymilk weight and negatively correlated with tofu weight and hard bean count. Hard bean count was also negatively correlated with soymilk weight at r = −0.60.
Fig. 2.
Correlation among the investigated traits. a Network plot and b matrix with the correlations among the traits thousand-seed weight (TSW), protein content (PC), oil content (OC), soaking factor (SF), soymilk weight (SM), tofu weight (TW), tofu yield (TY), tofu hardness (TH), tofu value (TV), and number of hard beans (HB). Asterisk indicates correlation coefficients significantly different from zero at the 5% significance level
QTL mapping identified at least one QTL for all traits (Tab. 2, Fig. S1). For thousand-seed weight, three QTL were found on chromosomes 7, 14 and 16, explaining between 9.41% and 12.33% of the genotypic variance. For protein content, we found five QTL on chromosomes 5, 11, 12, 14 and 16 explaining 6.40–13.96% of the genotypic variance and three different QTL were detected for oil content explaining 9.43–16.17%. On chromosome 4, a pleiotropic QTL was found for the number of hard beans and soymilk, explaining 13.59% and 16.15% of the genotypic variance, respectively, for which the favourable allele was associated with a reduction in hard beans of 0.06 and an increase of 3.49 g in soymilk weight. Another pleiotropic QTL was found on chromosome 10 for tofu weight, explaining 27.55% of genotypic variance, tofu hardness (20.65%) and tofu value (20.18%). For tofu weight, the favourable allele was contributed by parent 1, leading to an increase of 1.39 g in tofu weight, whereas tofu hardness and tofu value were increased by 3.44 N and 7.59, respectively, by the allele from parent 2. Additional major QTL that explained over 20% of the respective genotypic variance were found on chromosome 13 for tofu yield and on chromosome 5 for tofu value, where the allele of parent 2 was associated with an increase of 0.03 and 8.84, respectively. Overall, for the tofu quality traits, parent 2 contributed the majority of the favourable alleles. The major QTL showed substantial effects on the respective traits across, as well as within locations (Fig. S2).
Table 2.
Results from QTL mapping. Chromosome (Chr.) and position (Pos) of detected QTL and their support interval (SI) in centiMorgan (cM), flanking markers indicating their physical position, logarithm of odds (LOD), donor of the favourable allele, the additive effect (Add. eff.), proportion of genotypic variance explained by the QTL (pG), and the frequency in which the QTL was observed in 200 independent five-fold cross-validation runs (Freq. CV), shown for the traits thousand-seed weight (TSW), protein content (PC), oil content (OC), soaking factor (SF), count of hard beans (HB), soymilk weight (SM), tofu weight (TW), tofu yield (TY), tofu hardness (TH), and tofu value (TV)
| Trait | Chr. | Pos. (SI) (cM) |
Left marker | Right marker | LOD | Donor (fav. allele) | Add. eff.+ | pG (%) | Freq. CV |
|---|---|---|---|---|---|---|---|---|---|
| TSW | 7 | 57 (53–61) | Chr07_5770755 | Chr07_6463334 | 3.15 | Parent 1 | 3.34 | 9.41 | 0.26 |
| 14 | 61 (53–66) | Chr14_8232119 | Chr14_9109810 | 3.59 | Parent 1 | 3.54 | 10.67 | 0.46 | |
| 16 | 117 (106–130) | Chr16_28461478 | Chr16_35670141 | 4.18 | Parent 1 | 5.20 | 12.33 | 0.68 | |
| PC | 5 | 131 (128–133) | Chr05_41123747 | Chr05_41392595 | 2.88 | Parent 2 | 0.22 | 6.40 | 0.16 |
| 11 | 73 (58–79) | Chr11_25910388 | Chr11_18186161 | 4.51 | Parent 1 | 0.29 | 9.81 | 0.57 | |
| 12 | 145 (142–146) | Chr12_35603615 | Chr12_35692321 | 6.57 | Parent 1 | 0.35 | 13.96 | 0.42 | |
| 14 | 9 (5–24) | Chr14_1159327 | Chr14_1437584 | 4.13 | Parent 2 | 0.26 | 9.04 | 0.25 | |
| 16 | 144 (134–146) | Chr16_36955239 | Chr16_37051444 | 5.26 | Parent 1 | 0.30 | 11.36 | 0.33 | |
| OC | 9 | 0 (0–2) | Chr09_223061 | Chr09_305244 | 6.44 | Parent 2 | 0.15 | 16.17 | 0.79 |
| 12 | 187 (186–187) | Chr12_38815575 | Chr12_39085077 | 4.70 | Parent 2 | 0.13 | 12.05 | 0.26 | |
| 18 | 96 (75–104) | Chr18_10268740 | Chr18_12826385 | 3.63 | Parent 2 | 0.11 | 9.43 | 0.19 | |
| SF | 10 | 129 (120–139) | Chr10_43599219 | Chr10_48347975 | 3.41 | Parent 1 | 0.01 | 9.69 | 0.37 |
| 17 | 121 (118–122) | Chr17_40412430 | Chr17_40584687 | 4.17 | Parent 1 | 0.01 | 11.72 | 0.70 | |
| 18 | 105 (103–107) | Chr18_16111507 | Chr18_16686711 | 6.98 | Parent 2 | 0.01 | 19.01 | 0.76 | |
| HB | 4 | 59 (55–60) | Chr04_7990022 | Chr04_8395715 | 4.74 | Parent 2 | −0.06 | 13.59 | 0.53 |
| 14 | 24 (21–26) | Chr14_3546556 | Chr14_3734218 | 3.04 | Parent 2 | −0.04 | 8.92 | 0.26 | |
| SM | 2 | 120 (118–126) | Chr02_38862876 | Chr02_39191621 | 2.69 | Parent 2 | 2.55 | 9.53 | 0.12 |
| 4 | 59 (58–65) | Chr04_7990022 | Chr04_8395715 | 4.66 | Parent 2 | 3.49 | 16.15 | 0.58 | |
| 4 | 122 (114–134) | Chr04_46792598 | Chr04_48009592 | 3.02 | Parent 2 | 3.10 | 10.69 | 0.40 | |
| TW | 10 | 19 (17–22) | Chr10_2920019 | Chr10_3139957 | 3.09 | Parent 1 | 1.39 | 27.55 | 0.21 |
| TY | 13 | 120 (118–122) | Chr13_30645621 | Chr13_30673726 | 4.54 | Parent 2 | 0.03 | 20.39 | 0.40 |
| TH | 1 | 76 (72–84) | Chr01_47186726 | Chr01_48805860 | 4.10 | Parent 2 | 2.86 | 15.74 | 0.22 |
| 8 | 28 (24–32) | Chr08_4651791 | Chr08_6045309 | 3.69 | Parent 2 | 2.70 | 14.24 | 0.28 | |
| 10 | 14 (11–18) | Chr10_2348299 | Chr10_2906572 | 5.47 | Parent 2 | 3.44 | 20.65 | 0.52 | |
| 18 | 47 (34–61) | Chr18_2098093 | Chr18_3883865 | 4.75 | Parent 1 | 3.35 | 18.10 | 0.20 | |
| TV | 5 | 1 (0–3) | Chr05_1420686 | Chr05_1545718 | 4.35 | Parent 2 | 8.84 | 26.21 | 0.72 |
| 8 | 152 (137–156) | Chr08_44206039 | Chr08_45163890 | 2.69 | Parent 2 | 6.93 | 16.55 | 0.25 | |
| 10 | 16 (11–19) | Chr10_2348299 | Chr10_2906572 | 3.31 | Parent 2 | 7.59 | 20.18 | 0.40 |
+For QTL mapping HB counts transformed with square root transformation were used, here the values were transformed back to represent actual HB counts
We exemplarily used the QTL identified for tofu hardness to evaluate the effect of QTL stacking, which revealed a substantial effect on the target trait as several allelic combinations of at least three favourable alleles consistently performed above the desired threshold of 75 N (Fig. 3).
Fig. 3.
The effect of QTL stacking. The boxplot illustrates the effect of combining favourable alleles of the four identified QTL for tofu hardness on chromosomes 1, 8, 10 and 18 in 188 genotypes. + indicates the presence of the homozygous favourable allele, -indicates the homozygous unfavourable allele, heterozygous loci and missing values. The red dotted line represents the quality threshold of 75 N desired in German tofu production
Genomic prediction with the same 188 genotypes and the 2,452 genome-wide markers used for QTL mapping revealed moderate mean prediction accuracies for all traits, ranging from 0.45 for thousand-seed weight to 0.61 for protein content (Fig. 4). Five-fold cross-validation also revealed a large variation in prediction accuracy between the runs. For example, prediction accuracies for tofu hardness ranged from −0.20 to 0.76 with a mean of 0.49, which for the majority of the tofu quality traits was in a similar range.
Fig. 4.
The potential of genomic prediction. Genomic prediction accuracies from 1000 runs of cross-validation and their respective means are shown
Discussion
Breeding for soybean tofu quality
The increasing trend towards vegetarian or vegan nutrition moves traits related to tofu quality to the foreground in soybean breeding programs targeted at human consumption. For the biparental mapping population underlying this study, we specifically used two parental lines that both are suitable for tofu production. The reasoning behind this is that such crosses are typical for applied breeding programs and thus the population allows investigating the variation and its genetic control relevant for tofu quality breeding.
The tofu quality traits showed low to medium heritabilities between 0.23 for tofu value and 0.66 for the number of hard beans (Tab. 1). Notably, both parental lines showed similar performance in most tofu quality traits and consequently, the genotypic variance might be rather small in comparison to the error variance, resulting in a lower heritability (Fig. 1, Tab. 1). This conclusion is supported by the fact that the heritability was especially low for traits for which the parents performed most similarly and showed a relatively high error variance (i.e. tofu weight, tofu yield, tofu hardness and tofu value). The significant effect of both the soybean genotype and the growing environment, specifically on tofu yield and hardness, is in accordance with previous studies (Wang et al. 1983; Mullin et al. 2001; Aziadekey et al. 2002; Min et al. 2005; Kurasch et al. 2018b). A source of the rather large error effects likely lies in inhomogeneities in the test tofu laboratory. When establishing the bench-scale tofu testing method followed in this study, Kurasch et al. (2018a) found strong effects of the team preparing the test tofu as well as the day of tofu production. Consequently, several steps of the procedure were subsequently standardized or even automated. Nevertheless, the various steps of the tofu production process in combination with the small amount of seeds used for it, are still a source for confounding effects. However, despite the medium heritabilities found in this study, our results highlight the feasibility of breeding soybean with improved tofu quality.
The genotypes of the mapping population varied substantially for the two most important tofu quality traits, tofu yield (between 1.74 and 2.16) and especially for tofu hardness (ranging from 54.67 to 108.11 N) (Fig. 1). For tofu yield, the tofu manufacturer desires a minimum value of 2.0 with the optimum of 2.2. Many genotypes therefore showed too low tofu yield for production purposes, including the parental lines not reaching the cut-off level of 2.0. For tofu hardness, the desired range lies between 75 and 120 N. In addition to the rather low tofu yields, many genotypes also showed values below this desired minimum value. This may be due to the environment, which, as shown here, also has a substantial effect on these two traits. However, while the absolute trait level is relevant for tofu production, the present variation of the traits is the decisive factor for successful QTL mapping. Interestingly, we observed transgressive segregation for all traits, meaning that there were progeny with trait values outperforming the superior parent or lower than the inferior parent. This means that both parents carry favourable and unfavourable alleles for the target traits that are newly combined in the progeny. Consequently, the combination of favourable alleles from both parents results in genotypes with a performance superior to either parent, which are candidates for new varieties and illustrate how breeding progress for tofu quality can be achieved.
The relationship between agronomic traits and tofu quality is unclear. In our mapping population, no correlation was found between thousand-seed weight and tofu hardness or tofu yield. For thousand-seed weight and protein content or soymilk weight, however, moderate positive correlations of 0.30 and 0.28, respectively, could be observed. Regarding protein content, the literature on its influence on tofu quality is conflicting. Previously, a positive association between seed protein content and tofu yield has been reported (Bhardwaj et al. 1999). In other studies, however, this conclusion could not be corroborated (Wang et al. 1983; Lim et al. 1990; Kurasch et al. 2018a, 2018b). Non-significant or low correlation coefficients between protein content and tofu yield parameters and tofu hardness were found in our study (Fig. 2), supporting the conclusion that the protein content by itself has a low relevance and it is more likely the protein composition that determines tofu quality. Notably, however, this does not rule out an effect of protein content, for which a certain minimum may be required as a basis for tofu quality. In our soybean material, all genotypes had a high protein content between 44.61% and 49.25% and thus, in populations including variation in the lower protein content range, the effect of protein content on tofu quality traits might be larger. The analysis of trait correlations also revealed the well-known strong negative correlation between protein and oil content, explaining why, if present, they often have opposite effects on tofu traits.
The tofu quality traits showed substantial positive as well as negative correlations among each other. For example, a strong negative correlation between tofu yield and tofu hardness (r = −0.65; Fig. 2) was observed, that has also been reported in previous studies (Min et al. 2005; Kurasch et al. 2018a). Therefore, a simultaneous improvement of both traits is challenging since tofu yield is closely linked to higher water content, whereas firmer tofu tends to hold less water (Min et al. 2005). Consequently, a balance between both traits needs to be achieved to produce tofu with a texture suited to consumer preferences while providing an adequate tofu yield. Taken together, also in the cross of two tofu-type soybeans, there is substantial variation in the resulting progeny and the mapping population is consequently well suited to dissect the genetic architecture of tofu quality traits.
The genetic architecture of tofu quality traits
Characterization of the genetic architecture of target traits holds the potential to identify major QTL that could then be utilized in marker-assisted selection. In our study, we identified QTL for all evaluated traits, including the tofu quality traits.
We identified three QTL for thousand-seed weight on chromosomes 7, 14 and 16, for which the favourable allele was always contributed by parent 1, the parent with the higher seed weight. Several QTL have been reported for seed weight on chromosome 7 (Orf et al. 1999; Csanádi et al. 2001; Hoeck et al. 2003; Teng et al. 2009; Li et al. 2010; Yan et al. 2014), of which one overlaps with our QTL (Teng et al. 2009). Seed weight QTL have also been reported on chromosome 14 (Specht et al. 2001; Hoeck et al. 2003; Liu et al. 2011) and chromosome 16 (Maughan et al. 1996; Kim et al. 2010).
For the number of hard beans, which significantly impairs soymilk weight, tofu yield and tofu value (Fig. 2), we identified two QTL on chromosomes 4 and 14, where the allele from parent 2 was associated with fewer hard beans in both cases. For traits related to seed coat permeability, several QTL have been reported (e.g., Liu et al. 2007; Ren et al. 2024), but none on chromosome 4 or 14. However, Kurasch et al. (2018b) reported a QTL on chromosome 4 for the weight gain after 25 h of soaking that overlaps with the QTL identified here. Hard beans are characterised by not taking up water during the soaking process. Therefore, both findings indicate causal genetic variation in that region of chromosome 4. In our study, this QTL was pleiotropic for soymilk weight. Since hard beans cannot be processed into soymilk and are removed before blending the seeds, this QTL likely indirectly influences soymilk yield by affecting the soaking behaviour of the seeds.
There was no substantial correlation between protein content and tofu traits in our population and none of the QTL for protein content overlapped with those of the tofu traits. However, two QTL on chromosome 8 associated with tofu value in our population have also been reported for soluble protein content (Lu et al. 2013), indicating at least some association between (soluble) protein and tofu quality. Moreover, a QTL for tofu hardness has been identified in a similar region of chromosome 8 (Kurasch et al. 2018b). This supports a possible role of water-soluble protein in the expression of tofu quality traits (Lu et al. 2013). Further publications suggest that tofu yield parameters and hardness are linked to protein composition rather than protein content alone. Especially the ratio of the storage proteins glycinin (11S) to β-conglycinin (7S) may affect tofu quality, with a higher proportion of 11S protein being associated with firmer tofu in the majority of publications (e.g., Saio et al. 1971; Yagasaki et al. 2000; Renkema et al. 2001; Poysa et al. 2006; Stanojevic et al. 2011). Notably, the influence of the storage proteins and their subunit composition strongly relies on the tofu production method and the soybean variety, which may explain contrasting findings (Utsumi and Kinsella 1985; Cai and Chang 1999; Yang and James 2023). Interestingly, the gene Gy4, encoding the glycinin G4 subunit on chromosome 10 (SoyBase, www.soybase.org), is located within the confidence interval of the QTL we detected for tofu weight, which is pleiotropic for tofu hardness and tofu value (Table 2, Fig. S1). In the genotyping-by-sequencing data, the gene region was not covered. We therefore employed re-sequencing data of the parental lines (Zhu et al. 2023) to search for polymorphisms in Gy4. This revealed no polymorphisms in the Gy4 coding sequence, the introns, the promoter or the downstream region. This indicates that if Gy4 is the causal gene underlying this QTL, this may be due to expression differences caused by sequence polymorphisms farther away from the gene, or that despite being a very plausible candidate, the QTL is not caused by Gy4 but another gene in this chromosomal region, which requires further research on the molecular level. In addition, two of the QTL found for tofu hardness in this study overlap with previously reported protein content QTL on chromosome 1 (Pandurangan et al. 2012; Mao et al. 2013) and on chromosome 18 (Jun et al. 2008; Lu et al. 2013; Mao et al. 2013), indicating a possible influence of protein content on tofu hardness despite the observed relatively low phenotypic correlation.
Taken together, the tofu quality traits can be considered as complex traits but with some medium- to large-effect QTL being present. In addition, the substantial effects of the identified breeding-relevant major QTL on the respective traits within single locations indicate their effectiveness across various environments (Fig. S2). As the tofu traits are laborious to phenotype and in addition are strongly influenced by the growing environment and other non-genetic factors, being able to directly address and exploit such QTL, and with them larger proportions of the genotypic variance, would be a big step forward in targeted breeding for tofu quality.
Potential of marker-assisted and genomic selection to advance tofu quality breeding
Marker-assisted selection is a valuable tool to directly exploit causal genetic loci in advancing a trait of interest. However, when QTL explain only a small proportion of genotypic variance, this approach is not suitable to drive the improvement of the targeted traits. Together, the four QTL identified in our study for tofu hardness could explain 68.7% of the genotypic variance (Table 2). We therefore exemplarily used this trait to explore the potential of QTL stacking, i.e. the combination of several favourable QTL alleles in a genotype. This revealed that the combination of at least three QTL leads to tofu hardness values consistently above the manufacturer’s desired threshold of 75 N for most combinations (Fig. 3). The major QTL on chromosome 10 that shows the strongest effect on ensuring meeting this threshold, however, was pleiotropic for tofu weight and tofu value. In line with the negative correlation of tofu weight with the other two traits, the allele from parent 2 was found to increase tofu hardness and tofu value, but had a negative effect on tofu weight. In general, a pleiotropic QTL can be due to one gene affecting two or more traits or can also be caused by closely linked loci, each affecting only one trait. In the latter case, the causal alleles can be separated by recombination, even though in practice this may require substantial efforts. In case of a single locus with pleiotropic effect, a breeder must decide which allele to select for and consequently which trait to increase and which to decrease. The decrease could be better accepted for the trait for which a compensation through other loci is more easily achieved. Nonetheless, the result on QTL stacking indicates that a marker-assisted selection approach might be feasible to efficiently exploit major QTL and with them a substantial proportion of the genotypic variance. Notably, however, QTL always depend on the studied population and results are often not transferable to other populations. The loci need to segregate within the population and their effects might vary in different genetic backgrounds. In a similar study, Kurasch et al. (2018b) found a QTL for tofu hardness on chromosome 8 in only one of three evaluated biparental populations. Interestingly, the pleiotropic QTL they identified for tofu hardness and tofu value at 62 and 64 cM on chromosome 8, respectively, did not overlap with the tofu hardness QTL identified in our population on chromosome 8 at 28 cM. This further highlights the strong dependency of the QTL results on the studied population, which limits the transferability required for marker-assisted selection. Therefore, in order to make use of the identified QTL, their genomic location needs to be fine-mapped and their effect on the trait of interest needs to be verified in different populations.
A complementary approach to marker-assisted selection is genomic selection, that offers the ability to utilise genomic information for the improvement of highly quantitative traits. In this study, mean prediction accuracies were moderate to high, ranging between 0.47 for thousand-seed weight and 0.78 for protein content (Fig. 4). Given the mentioned complexity in phenotyping the tofu quality traits, this can be very valuable in breeding. However, the variation of prediction accuracies between the cross-validation runs was rather large for all traits. For example, prediction accuracies for tofu hardness ranged from −0.27 to 1.03 with a mean of 0.49. A variation this large indicates a strong dependency of the respective prediction accuracy on the composition of the training and prediction sets. In a similar study, evaluating a very similar set of tofu quality parameters in two RIL populations, Kurasch et al. (2018b) found in part substantially lower mean prediction accuracies for tofu weight, tofu yield, tofu hardness and tofu value but with a similarly large variation. Thus, the accuracy of genomic prediction of tofu quality traits needs to be increased further and made more reliable for application in practical soybean breeding. This can likely be achieved by larger training sets including different genetic material representative of the breeding program. Ideally, the phenotypic assessment of the traits can also be improved further, so that the heritability increases and the data underlying the training of the prediction model become more robust. A potential limitation of genomic selection are the associated costs, as the approach requires genome-wide marker data. Taken together, marker-assisted selection and genomic selection show potential to assist breeding of tofu quality traits, but both require further validation and optimization before they can be routinely employed in breeding programs.
Conclusion and outlook
Tofu quality traits are becoming increasingly important in soybean breeding programs targeted at the food market. In this study, we evaluated a biparental population and observed substantial variation and transgressive segregation for all investigated traits, illustrating that an improvement of tofu quality traits can also be achieved in best-by-best crosses that are typical in applied breeding programs. We also observed intricate interrelations among the tofu traits, including negative correlations, which need to be balanced during selection, potentially using index selection. QTL mapping revealed some moderate- to large-effect QTL for all tofu quality traits, but the results also illustrate an in general complex inheritance with many additional small-effect QTL. Consequently, both marker-assisted and genomic selection hold potential for breeding, or even their joint use or integration, but both require further research and optimization. In conclusion, soybean breeding for improved tofu quality traits is feasible to meet the increasing demand and diversity of vegetarian and vegan products.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
We would like to thank the technical staff of the State Plant Breeding Institute for their efforts in the fieldwork and data collection and the staff of the Taifun Tofu GmbH for producing test tofu.
Author contributions
Cleo A. Döttinger: Conceptualization, Methodology, Formal Analysis, Investigation, Data Curation, Writing – Original Draft, Writing – Review & Editing, Visualization. Kim A. Steige: Conceptualization, Methodology, Formal Analysis, Investigation, Data Curation, Writing – Original Draft, Writing – Review & Editing. Volker Hahn: Conceptualization, Methodology, Investigation, Data Curation, Writing – Review & Editing, Supervision, Project Administration, Funding Acquisition. Willmar L. Leiser: Conceptualization, Methodology, Writing – Review & Editing, Project Administration, Funding Acquisition. Kristina Bachteler: Conceptualization, Data Curation, Writing – Review & Editing, Funding Acquisition. Xintian Zhu: Investigation, Writing – Review & Editing. Tobias Würschum: Conceptualization, Methodology, Writing – Review & Editing, Supervision, Project Administration, Funding Acquisition.
Funding
Open Access funding enabled and organized by Projekt DEAL. This research was funded by the German Federal Ministry for Education and Research (BMBF) under grant number 031B1046A and handled by the project management organization Projektträger Jülich (PtJ).
Data availability
Data is contained in the supplementary material.
Declarations
Conflicts of interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Aziadekey M, Schapaugh WT, JR., Herald TJ, (2002) Genotype by environment interaction for soymilk and tofu quality characteristics. J Food Qual 25:243–259. 10.1111/j.1745-4557.2002.tb01022.x [Google Scholar]
- Bhardwaj HL, Bhagsari AS, Joshi JM, Rangappa M, Sapra VT, Rao MSS (1999) Yield and Quality of Soymilk and Tofu Made from Soybean Genotypes Grown at Four Locations. Crop Sci 39:401–405. 10.2135/cropsci1999.0011183X0039000200017x [Google Scholar]
- Butler D (2021) asreml: Fits the Linear Mixed Model. VSN International, Hemel Hempstead, United Kindom (www.vsni.co.uk) [Google Scholar]
- Cai T, Chang KC (1999) Processing effect on soybean storage proteins and their relationship with tofu quality. J Agric Food Chem 47:720–727. 10.1021/jf980571z [DOI] [PubMed] [Google Scholar]
- Chen S, Zhou Y, Chen Y, Gu J (2018) fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Churchill GA, Doerge RW (1994) Empirical threshold values for quantitative trait mapping. Genetics 138:963–971. 10.1093/genetics/138.3.963 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Csanádi G, Vollmann J, Stift G, Lelley T (2001) Seed quality QTLs identified in a molecular map of early maturing soybean. Theor Appl Genet 103:912–919. 10.1007/s001220100621 [Google Scholar]
- Cullis BR, Smith AB, Coombes NE (2006) On the design of early generation variety trials with correlated data. JABES 11:381–393. 10.1198/108571106X154443 [Google Scholar]
- Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R (2011) The variant call format and VCFtools. Bioinformatics 27:2156–2158. 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dekkers JCM (2007) Prediction of response to marker-assisted and genomic selection using selection index theory. J Anim Breed Genet 124:331–341. 10.1111/j.1439-0388.2007.00701.x [DOI] [PubMed] [Google Scholar]
- Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6:e19379. 10.1371/journal.pone.0019379 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Endelman JB (2011) Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4:250–255. 10.3835/plantgenome2011.08.0024 [Google Scholar]
- Eurostat (2024) Crop production in EU standard humidity. https://ec.europa.eu/eurostat/databrowser/view/APRO_CPSH1__custom_11679343/default/table?lang=en. Accessed 4 June 2024
- Fazekas AJ, Kuzmina ML, Newmaster SG, Hollingsworth PM (2012) DNA barcoding methods for land plants. DNA Barcodes 858:223–252. 10.1007/978-1-61779-591-6_11 [DOI] [PubMed] [Google Scholar]
- Herten K, Hestand MS, Vermeesch JR, van Houdt JKJ (2015) GBSX: a toolkit for experimental design and demultiplexing genotyping by sequencing experiments. BMC Bioinformatics 16:73. 10.1186/s12859-015-0514-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoeck JA, Fehr WR, Shoemaker RC, Welke GA, Johnson SL, Cianzio SR (2003) Molecular marker analysis of seed size in soybean. Crop Sci 43:68–74. 10.2135/cropsci2003.6800 [Google Scholar]
- Huang Z, Liu H, Zhao L, He W, Zhou X, Chen H, Zhou X, Zhou J, Liu Z (2022) Evaluating the effect of different processing methods on fermented soybean whey-based tofu quality, nutrition, and flavour. LWT 158:113139. 10.1016/j.lwt.2022.113139 [Google Scholar]
- Huang M, Chen S, Li W, Tian R, Zhang H, Shao Z, Li X, Zhang C (2023) Genetic loci and candidate genes with pleiotropic effects for tofu and soymilk weights across multiple environments in soybean (Glycine max). Plant Breed 142:149–169. 10.1111/pbr.13075 [Google Scholar]
- Jun T-H, Van K, Kim MY, Lee S-H, Walker DR (2008) Association analysis using SSR markers to find QTL for seed protein content in soybean. Euphytica 162:179–191. 10.1007/s10681-007-9491-6 [Google Scholar]
- Kim H-K, Kim Y-C, Kim S-T, Son B-G, Choi Y-W, Kang J-S, Park Y-H, Cho Y-S, Choi I-S (2010) Analysis of Quantitative Trait Loci (QTLs) for seed size and fatty acid composition using recombinant inbred lines in soybean. J Life Sci 20:1186–1192. 10.5352/JLS.2010.20.8.1186 [Google Scholar]
- Kong F, Chang SKC, Liu Z, Wilson LA (2008) Changes of soybean quality during storage as related to soymilk and tofu making. J Food Sci 73:S134–S144. 10.1111/j.1750-3841.2007.00652.x [DOI] [PubMed] [Google Scholar]
- Kurasch AK, Hahn V, Miersch M, Bachteler K, Würschum T (2018) Analysis of tofu-related traits by a bench-scale tofu production method and their relationship with agronomic traits in European soybean. Plant Breed 137:271–282. 10.1111/pbr.12581 [Google Scholar]
- Kurasch AK, Leiser WL, Bachteler K, Miersch M, Hahn V, Würschum T (2018) Evaluation of the genetic architecture of tofu traits in soybean towards genomics-assisted breeding. Plant Breed 137:873–882. 10.1111/pbr.12651 [Google Scholar]
- Lande R, Thompson R (1990) Efficiency of marker-assisted selection in the improvement of quantitative traits. Genetics 124:743–756. 10.1093/genetics/124.3.743 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H (2011) A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27:2987–2993. 10.1093/bioinformatics/btr509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li D, Sun M, Han Y, Teng W, Li W (2010) Identification of QTL underlying soluble pigment content in soybean stems related to resistance to soybean white mold (Sclerotinia sclerotiorum). Euphytica 172:49–57. 10.1007/s10681-009-0036-z [Google Scholar]
- Lim BT, deMan JM, deMan L, Buzzell RI (1990) Yield and quality of tofu as affected by soybean and soymilk characteristics. Calcium Sulfate Coagulant J Food Sci 55:1088–1092. 10.1111/j.1365-2621.1990.tb01605.x [Google Scholar]
- Lima A, Oliveira J, Saúde F, Mota J, Ferreira RB (2017) Proteins in soy might have a higher role in cancer prevention than previously expected: soybean protein fractions are more effective MMP-9 inhibitors than non-protein fractions, Even in cooked seeds. Nutrients 9:201. 10.3390/nu9030201 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu B, Fujita T, Yan Z-H, Sakamoto S, Xu D, Abe J (2007) QTL mapping of domestication-related traits in soybean (Glycine max). Ann Bot 100:1027–1038. 10.1093/aob/mcm149 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu W, Kim MY, Van K, Lee Y-H, Li H, Liu X, Lee S-H (2011) QTL identification of yield-related traits and their association with flowering and maturity in soybean. J Crop Sci Biotechnol 14:65–70. 10.1007/s12892-010-0115-7 [Google Scholar]
- Lu W, Wen Z, Li H, Yuan D, Li J, Zhang H, Huang Z, Cui S, Du W (2013) Identification of the quantitative trait loci (QTL) underlying water soluble protein content in soybean. Theor Appl Genet 126:425–433. 10.1007/s00122-012-1990-8 [DOI] [PubMed] [Google Scholar]
- Mao T, Jiang Z, Han Y, Teng W, Zhao X, Li W (2013) Identification of quantitative trait loci underlying seed protein and oil contents of soybean across multi-genetic backgrounds and environments. Plant Breed 132:630–641. 10.1111/pbr.12091 [Google Scholar]
- Maughan PJ, Maroof MA, Buss GR (1996) Molecular-marker analysis of seed-weight: genomic locations, gene action, and evidence for orthologous evolution among three legume species. Theor Appl Genet 93:574–579. 10.1007/BF00417950 [DOI] [PubMed] [Google Scholar]
- Min S, Yu Y, St. Martin S (2005) Effect of soybean varieties and growing locations on the physical and chemical properties of soymilk and tofu. J Food Sci 70:C8–C21. 10.1111/j.1365-2621.2005.tb09026.x [Google Scholar]
- Money D, Gardner K, Migicovsky Z, Schwaninger H, Zhong G-Y, Myles S (2015) LinkImpute fast and accurate genotype imputation for nonmodel organisms. G3 (Bethesda) 5:2383–2390 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mullin WJ, Fregeau-Reid JA, Butler M, Poysa V, Woodrow L, Jessop DB, Raymond D (2001) An interlaboratory test of a procedure to assess soybean quality for soymilk and tofu production. Food Res Int 34:669–677. 10.1016/S0963-9969(01)00087-4 [Google Scholar]
- Orf JH, Chase K, Jarvik T, Mansur LM, Cregan PB, Adler FR, Lark KG (1999) Genetics of soybean agronomic traits: I. comparison of three related recombinant inbred populations. Crop Sci 39:1642–1651. 10.2135/cropsci1999.3961642x [Google Scholar]
- Pandurangan S, Pajak A, Molnar SJ, Cober ER, Dhaubhadel S, Hernández-Sebastià C, Kaiser WM, Nelson RL, Huber SC, Marsolais F (2012) Relationship between asparagine metabolism and protein concentration in soybean seed. J Exp Bot 63:3173–3184. 10.1093/jxb/ers039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Piepho H-P, Möhring J (2007) Computing heritability and selection response from unbalanced plant breeding trials. Genetics 177:1881–1888. 10.1534/genetics.107.074229 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Posit team (2023) RStudio: Integrated Development Environment for R. Posit Software. PBC, Boston, MA [Google Scholar]
- Poysa V, Woodrow L (2002) Stability of soybean seed composition and its effect on soymilk and tofu yield and quality. Food Res Int 35:337–345. 10.1016/S0963-9969(01)00125-9 [Google Scholar]
- Poysa V, Woodrow L, Yu K (2006) Effect of soy protein subunit composition on tofu quality. Food Res Int 39:309–317. 10.1016/j.foodres.2005.08.003 [Google Scholar]
- R Core Team (2021) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (https://www.R-project.org/) [Google Scholar]
- Ren H, Zhao K, Zhang C, Lamlom SF, Liu X, Wang X, Zhang F, Yuan R, Gao Y, Cao B, Zhang B (2024) Genetic analysis and QTL mapping of seed hardness trait in a soybean (Glycine max) recombinant inbred line (RIL) population. Gene 905:148238. 10.1016/j.gene.2024.148238 [DOI] [PubMed] [Google Scholar]
- Renkema J, Knabben J, van Vliet T (2001) Gel formation by β-conglycinin and glycinin and their mixtures. Food Hydrocolloids 15:407–414. 10.1016/S0268-005X(01)00051-0 [Google Scholar]
- Reuscher S, Furuta T (2016) ABHgenotypeR. https://github.com/StefanReuscher/ABHgenotypeR
- Saio K, Kajikawa M, Watanabe T (1971) Food processing characteristics of soybean proteins. Agric Biol Chem 35:890–898. 10.1080/00021369.1971.10860005 [Google Scholar]
- Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, Gill N, Joshi T, Libault M, Sethuraman A, Zhang X-C, Shinozaki K, Nguyen HT, Wing RA, Cregan P, Specht J, Grimwood J, Rokhsar D, Stacey G, Shoemaker RC, Jackson SA (2010) Genome sequence of the palaeopolyploid soybean. Nature 463:178–183. 10.1038/nature08670 [DOI] [PubMed] [Google Scholar]
- Specht JE, Chase K, Macrander M, Graef GL, Chung J, Markwell JP, Germann M, Orf JH, Lark KG (2001) Soybean response to water: A QTL analysis of drought tolerance. Crop Sci 41:493–509. 10.2135/cropsci2001.412493x [Google Scholar]
- Stanojevic SP, Barac MB, Pesic MB, Vucelic-Radovic BV (2011) Assessment of soy genotype and processing method on quality of soybean tofu. J Agric Food Chem 59:7368–7376. 10.1021/jf2006672 [DOI] [PubMed] [Google Scholar]
- Statista (2020) Umsatzentwicklung mit Fleischersatzprodukten aus Tofu in Deutschland nach Produktformen in den Jahren 2018 bis 2022 (in Millionen Euro) [Graph]. https://de.statista.com/statistik/daten/studie/1100705/umfrage/marktentwicklung-von-fleischersatzprodukten-aus-tofu/. Accessed 26 April 2024
- Taylor J, Butler D (2017) R Package ASMap: Efficient Genetic Linkage Map Construction and Diagnosis. J Stat Soft 79:1–29. 10.18637/jss.v079.i06 [Google Scholar]
- Teng W, Han Y, Du Y, Sun D, Zhang Z, Qiu L, Sun G, Li W (2009) QTL analyses of seed weight during the development of soybean (Glycine max L. Merr.). Heredity (Edinb) 102:372–380. 10.1038/hdy.2008.108 [DOI] [PubMed] [Google Scholar]
- Utsumi S, Kinsella JE (1985) Forces involved in soy protein gelation: effects of various reagents on the formation, hardness and solubility of heat-induced gels made from 7S, 11S, and soy isolate. J Food Sci 50:1278–1282. 10.1111/j.1365-2621.1985.tb10461.x [Google Scholar]
- Utz HF (2011) PlabMQTL - Software for meta-QTL analysis with composite interval mapping. Institute of Plant Breeding, Seed Science, and Population Genetics. University of Hohenheim, Stuttgart, Germany [Google Scholar]
- Utz HF, Melchinger AE, Schön CC (2000) Bias and sampling error of the estimated proportion of genotypic variance explained by quantitative trait loci determined from experimental data in maize using cross validation and validation with independent samples. Genetics 154:1839–1849. 10.1093/genetics/154.4.1839 [PMC free article] [PubMed] [Google Scholar]
- Wang HL, Swain EW, Kwolek WF, Fehr WR (1983) Effect of soybean varieties on the yield and quality of tofu. Cereal Chem 60:245–248 [Google Scholar]
- Wang C, Gai J, Fu S, Yu D, Chen S (2008) Inheritance and QTL mapping of tofu and soymilk output in soybean. Scientia Agricultura Sinica 41:1274–1282 [Google Scholar]
- Wang F, Meng J, Sun L, Weng Z, Fang Y, Tang X, Zhao T, Shen X (2020) Study on the tofu quality evaluation method and the establishment of a model for suitable soybean varieties for Chinese traditional tofu processing. LWT 117:108441. 10.1016/j.lwt.2019.108441 [Google Scholar]
- Wickland DP, Battu G, Hudson KA, Diers BW, Hudson ME (2017) A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow. GB-eaSy BMC Bioinformatics 18:586. 10.1186/s12859-017-2000-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yagasaki K, Kousaka F, Kitamura K (2000) Potential improvement of soymilk gelation properties by using soybeans with modified protein subunit compositions. Breed Sci 50:101–107. 10.1270/jsbbs.50.101 [Google Scholar]
- Yan L, Li Y-H, Yang C-Y, Ren S-X, Chang R-Z, Zhang M-C, Qiu L-J (2014) Identification and validation of an over-dominant QTL controlling soybean seed weight using populations derived from Glycine max × Glycine soja. Plant Breed 133:632–637. 10.1111/pbr.12197 [Google Scholar]
- Yang A, James AT (2013) Effects of soybean protein composition and processing conditions on silken tofu properties. J Sci Food Agric 93:3065–3071. 10.1002/jsfa.6140 [DOI] [PubMed] [Google Scholar]
- Zhang X, Shu XO, Gao Y-T, Yang G, Li Q, Li H, Jin F, Zheng W (2003) Soy food consumption is associated with lower risk of coronary heart disease in Chinese women. J Nutr 133:2874–2878. 10.1093/jn/133.9.2874 [DOI] [PubMed] [Google Scholar]
- Zhu S, Rossnagel BG, Kaeppler HF (2004) Genetic analysis of quantitative trait loci for groat protein and oil content in oat. Crop Sci 44:254–260. 10.2135/cropsci2004.2540 [Google Scholar]
- Zhu X, Leiser WL, Hahn V, Würschum T (2023) The genetic architecture of soybean photothermal adaptation to high latitudes. J Exp Bot 74:2987–3002. 10.1093/jxb/erad064 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data is contained in the supplementary material.




