Skip to main content
Molecular Breeding : New Strategies in Plant Improvement logoLink to Molecular Breeding : New Strategies in Plant Improvement
. 2025 Jan 3;45(1):8. doi: 10.1007/s11032-024-01529-x

Unravelling the genetic architecture of soybean tofu quality traits

Cleo A Döttinger 1, Kim A Steige 2, Volker Hahn 2, Kristina Bachteler 3, Willmar L Leiser 2, Xintian Zhu 1, Tobias Würschum 1,
PMCID: PMC11699088  PMID: 39758754

Abstract 

Tofu is a popular soybean (Glycine max (L.) Merr.) food with a long tradition in Asia and rising popularity worldwide, including Central Europe. Due to the labour-intensive phenotyping procedures, breeding for improved tofu quality is challenging. Therefore, our objective was to unravel the genetic architecture of traits relevant for tofu production in order to assess the potential of marker-assisted selection and genomic selection in breeding for these traits. To this end, we performed QTL mapping with 188 genotypes from a biparental mapping population. The population was evaluated in a two-location field trial, and tofu was produced in the laboratory to evaluate tofu quality. We identified QTL for all investigated agronomic and quality traits, each explaining between 6.40% and 27.55% of the genotypic variation, including the most important tofu quality traits, tofu yield and tofu hardness. Both traits showed a strong negative correlation (r = -0.65), and consequently a pleiotropic QTL on chromosome 10 was found with opposite effects on tofu hardness and tofu weight, highlighting the need to balance selection for both traits. Four QTL identified for tofu hardness jointly explained 68.7% of the genotypic variation and are possible targets for QTL stacking by marker-assisted selection. To exploit also small-effect QTL, genomic selection revealed moderate to high mean prediction accuracies for all traits, ranging from 0.47 to 0.78. In conclusion, inheritance of tofu quality traits is highly quantitative, and both marker-assisted selection and genomic selection present valuable tools to advance tofu quality by soybean breeding.

Supplementary Information

The online version contains supplementary material available at 10.1007/s11032-024-01529-x.

Keywords: Soybean, Tofu traits, QTL mapping, Genomic selection, Plant breeding

Introduction

Soybean (Glycine max (L.) Merr.) is a leguminous crop of high economic importance. It is primarily used as animal feed, however soybean is also widely used in human nutrition. Soybean products for human consumption include, for example, soybean oil, soy sauce, tempeh, and tofu, i.e. soybean curd. Soybean products are valued for their high protein content and the various health benefits associated with the consumption of soy foods (e.g., Zhang et al. 2003; Lima et al. 2017). With the rise in popularity of plant-based products high in protein, often used as meat alternatives, the demand for tofu and tofu products continues to increase (Statista 2020). Consequently, breeding soybean varieties with improved tofu quality is becoming increasingly important. Amongst other parameters, tofu quality is characterized by tofu hardness, protein content, amino acid profile, and taste (Kurasch et al. 2018a; Wang et al. 2020), making tofu quality evaluation complex. Especially for tofu hardness, the desired levels depend heavily on the intended use and the region of consumption. For example in Asia, a wide variety of tofu texture is offered, whereas the majority of tofu sold in German supermarkets tends to be very firm.

The procedure of evaluating tofu quality is laborious and time consuming, as tofu has to be produced for direct testing, which additionally requires adequate sample sizes of soybeans for destructive measurements. Therefore, selection on a large scale or in the early stages of the breeding cycle is not feasible. In addition to the effect of the genotype, tofu quality traits are influenced by the soybean growing environment (Aziadekey et al. 2002; Poysa and Woodrow 2002; Min et al. 2005; Kurasch et al. 2018a), seed storage conditions after harvest (Kong et al. 2008) and tofu production method (Kurasch et al. 2018a; Huang et al. 2022). Research on associations between tofu quality and soybean agronomic and seed quality traits is inconclusive, making indirect selection, for example by breeding for high protein content, unreliable. Genomics based methods like marker-assisted selection and genomic selection would therefore be valuable tools to drive soybean breeding for superior suitability for tofu production. However, only few QTL mapping studies have been reported. Kurasch et al. (2018b) performed QTL mapping of a range of tofu quality traits, but in each of the three populations identified at maximum one QTL per trait, whereas other studies only focused on mapping loci related to tofu and soymilk yield (e.g., Wang et al. 2008; Huang et al. 2023). Thus, little is known so far about the genetic control of tofu quality traits besides the quantitative inheritance of the different tofu yield and quality parameters (Kurasch et al. 2018b).

A prerequisite for marker-assisted selection is the availability of markers that are diagnostic for or at least closely linked to genetic loci associated with the trait of interest. This is feasible for monogenic traits and for quantitative traits for which individual quantitative trait loci (QTL) explain a large proportion of the genotypic variation of the trait. When, in contrast, many QTL of small effect are responsible for the expression of a complex, highly quantitative trait, marker-assisted selection is not feasible. An alternative approach to assist selection of such traits in breeding is genomic selection. This approach typically utilises a large number of markers covering the whole genome in order to calculate a genomic estimated breeding value that is then used to make selection decisions.

In Germany, production of soybean is on the rise (Eurostat 2024). Furthermore, tofu manufacturers have an interest in locally grown soybeans of high tofu quality. The adaptation to the Central European climate in combination with high tofu quality requires cultivar improvement by plant breeding. To advance soybean breeding with respect to tofu quality, the aims of this study were to investigate the genetic architecture of tofu quality traits by QTL mapping and based on these findings draw conclusions for the potential of marker-assisted and genomic selection as tools to improve tofu quality parameters in breeding.

Materials and methods

Plant material and assessment of agronomic traits

One hundred and ninety-eight recombinant inbred lines (RIL) in F4:6 generation (self-pollinated to F4 and then bulked until F6) from a biparental population were tested together with their parental lines, resulting in 200 genotypes in total. Both parental lines were breeding lines high in protein and suitable for tofu production. The field trials were conducted in 2021 in Eckartsweier (EWE) and Hohenheim (HOH) in Southwest Germany. The trials were designed as an alpha lattice design with two replications per location and 20 blocks of 10 genotypes per replication. The soybeans were grown in two-row micro plots of 1.1 m length and 0.32 m row spacing, resulting in a plot size of 0.352 m2. Before sowing, the soybean seeds were inoculated with Liquifix and Legumefix (Legume Technology LTD, East Bridgeford, UK). While measuring seed yield was not meaningful due to the small plot size, thousand-seed weight was measured as a yield parameter. Protein and oil content were estimated in the laboratory on seeds dried at 9–10% seed moisture content using near-infrared spectroscopy using a PSS-HOP spectrometer (Polytec GmbH, Waldenbronn, Germany).

Genotyping

For genotyping, the 198 RILs and their two parental lines were grown in 96-well seedling trays with 3.5 × 3.5 cm wells filled with a 2:1 mix of soil (Substrat 5, Klasmann-Deilmann GmbH, Geeste, Germany) and sand in a growth chamber (12 h light and 12 h darkness; 28°C). Young leaf material was harvested after 15 days and all samples were dried in silica gel for one week.

About 15 mg of dried leaf material was used for DNA extractions following a slightly modified plate-based protocol described by Fazekas et al. (2012). Final washing steps were done using first washing buffer 2 and then 70% ethanol. DNA quantity was measured using the QuantiFluor® dsDNA System kit (Promega, Madison, USA) and all samples were normalised to 10 ng/µl. Genotyping-by-Sequencing (GBS) libraries were constructed following a slightly modified protocol described by Elshire et al. (2011). In short, single i7 indexes were added to allow multiplexing of the libraries. Additionally, an ExoSAP digest using the GenUP™ Exo SAP Kit (biotechrabbit, Berlin, Germany) was added according to the manufacturer’s protocol to remove single stranded DNA and small dimers after the PCR. PCR products were then selected by size using the ProNex® Size-Selective Purification System kit (Promega, Madison, USA). The final quality of the libraries was assessed using the HS NGS Fragment Kit (1–6000 bp, Agilent, Santa Clara, USA). Libraries were sequenced on an Illumina Nova Seq machine (Macrogene and Azenta/Genewiz) generating 150 bp paired-end reads with an average of 4,280,863 reads per genotype (Table S1).

For bioinformatic processing of the GBS data, the GB-eaSy pipeline (Wickland et al. 2017) was followed. First, reads were demultiplexed based on the barcode sequences using the GBSX tool (Herten et al. 2015). Quality checks on the raw data were performed using FastQC and all further steps were done excluding the second read due to quality issues. Reads were cleaned using the program fastp (Chen et al. 2018) and reads were then mapped against the Wm82.a2.v1 soybean reference genome (Schmutz et al. 2010) using bwa mem (Li and Durbin 2009) with default settings. SNP calling was done by first generating the genotype likelihoods using bcftools mpileup (Li 2011) using the flags –skip-indels, –min-MQ 20, and –min-BQ 20 and then performing the SNP calling on the mpileup files using bcftools call (Li 2011) with the flags –multiallelic-caller and –variants-only by chromosome. Finally, SNP files were concatenated using bcftools concat (Li 2011). Raw SNP files were filtered with vcftools (Danecek et al. 2011) using the –max-alleles 2, –minDP 8 and –max-missing 0.8 to flags to retain a final SNP file with a total of 315,385 SNPs.

Preparation of test tofu and assessment of tofu quality traits

In order to assess tofu quality traits, test tofu was prepared using an adaptation of the bench scale method developed by Kurasch et al. (2018a). Seed moisture content of samples of 80 g were measured using a FarmPro moisture analyser (Horn GmbH, Bad Saulgau, Germany). The sample weights were then adjusted to a moisture content of 12% (corrected dry bean weight). Afterwards, the samples were soaked in tap water for 24 h at a temperature of about 8 °C. To prepare soymilk, the drained soybeans were ground with 480 ml of tap water for one minute at 10,000 rpm. The slurry was then filtered with a filter hose to separate the raw soymilk from the pulp. The soymilk was heated to 98.5 °C for 3 min with constant stirring at 150 rpm. An estimate of solid contents dissolved in the soymilk in °Brix was obtained with a PR-32α refractometer (ATAGO Co., LTD, Tokyo, Japan). A measured value was accepted once the value was constant for three consecutive measurement runs. The soymilk was then adjusted to 8.4°Brix by adding heated tap water. Of the adjusted soymilk, 500 ml were used to further process into tofu. As a coagulant, 0.45 g CaSO4 was dissolved in 20.55 ml of a 2.95% Nigari solution (MgCl2 6H2O) prepared with tap water. For that, the coagulant was added to the soymilk at a temperature between 84.5 and 85.5 °C. The mixture was stirred for six seconds at 250 rpm and then placed into a water bath at 84.5–85.5 °C for 20 min. Afterwards, the curd was broken up using a curd breaker and left in the water bath. After 10 min, the whey was drained from the curd with a sieve for 45 s. The curd was filled into moulds of 7.5 cm diameter and pressed with rising pressure from 0.1 to 1 to 2 to 3 to 4 Pa for 4 min each. The pressed tofu was stored in cold water at around 8 °C for 12 to 18 h overnight. After draining, the tofu was weighed to obtain tofu weight (TW).

For additional tofu quality parameters, the soaking factor (SF) was calculated as .

SF=soakedbeanweightcorrecteddrybeanweight. Tofu hardness (TH) in N was obtained using a texturometer (Zwick AG, Ulm, Germany). Soymilk weight (SMW) in g refers to the weight of the soymilk adjusted to 8.4°Brix. Tofu yield (TY) was calculated as TY=TWcorrecteddrybeanweight×SMW500. The tofu value (TV) was calculated as TV=TY2×TH. The number of hard beans was counted in each sample. Soybeans that did not take up water during the soaking process were considered as hard beans.

Phenotypic analysis

Best linear unbiased estimators (BLUE), best linear unbiased predictors (BLUP) and variance components were calculated for all traits using the model

yijkm=μ+gi+lj+glij+rjk+bjkm+εijkm,

where yijkm represents the phenotypic observation, gi the ith genotype, lj the jth test location, glij is the genotype-by-location interaction, rjk is the replication k in location j, bjkm is the mth block in replication k and location j, and εijkm is the corresponding error term. To calculate BLUE values, gi was considered as a fixed effect. To calculate BLUP values and variance components, gi was considered a random effect. The counts of hard beans were transformed by square root transformation prior to the estimation of BLUEs, BLUPs and variance components in order to fulfil the model assumptions. Thirteen small negative BLUE values were set to zero for biological plausibility. Heritability (h2) was calculated as proposed by Cullis et al. (2006) and described by Piepho and Möhring (2007) as follows:

h2¯=1-ϑBLUP2σg2¯,

where ϑBLUP¯ describes the mean variance of the difference between two BLUP values and σg2 describes the genotypic variance.

All analyses were conducted using the programming language R (R Core Team 2021) in the RStudio environment (Posit team 2023). BLUEs, BLUPs and variance components were estimated using the R package ASReml-R (Butler 2021).

Linkage map construction and QTL analysis

A linkage map was constructed for the QTL mapping using the R package ASMap (Taylor and Butler 2017). Prior to map construction, missing marker values were imputed and the marker data was corrected for miscalled alleles using the R package ABHgenotypeR (Reuscher and Furuta 2016). In addition, genotypes with heterozygosity above 30% and markers with excessive segregation distortion were removed. During linkage map construction, markers were additionally filtered for inflated double crossovers. Physical and genetic marker orders were plotted against each other and markers showing a strong deviation from the expected pattern were removed. After these filtering steps, the final linkage map was constructed with the remaining 188 genotypes and 2,452 SNP markers. The same data set was then also used for QTL mapping.

QTL analysis was conducted using a composite interval mapping (CIM) approach with an additive genetic model and the software PlabMQTL (Utz 2011). Empirical logarithm of odds (LOD) thresholds with a genome-wide significance level of α0.10 were computed for each trait with permutation tests of 1000 runs (Churchill and Doerge 1994). The modified Bayesian information criterion (mBIC) was used for cofactor selection. The proportion of genotypic variance explained by each QTL (pG) was calculated as the value of pG=Rpart2h2 normalised according to Zhu et al. (2004), where Rpart2 describes the proportion of the phenotypic variance explained by the respective QTL. QTL results were validated by 200 independent fivefold cross-validations (Utz et al. 2000).

Genomic prediction

Genomic prediction was conducted for all traits by ridge regression BLUP (RR-BLUP) using the rrBLUP package in R (Endelman 2011). The genotypes and SNPs from the linkage map were used and missing values were imputed using the software LinkImpute (Money et al. 2015). Using BLUEs across locations, 1000 runs of a fivefold cross-validation were applied, where the training set (TS) included a random set of 80% of the genotypes and the prediction set (PS) contained the remaining 20%. The prediction accuracy was calculated as rGP=rMPh, where rMP represents the correlation between genomic estimated breeding values (GEBV) and observed BLUE values and h represents the square root of the heritability (Lande and Thompson 1990; Dekkers 2007).

Results

Heritabilities for thousand-seed weight, protein content and oil content were medium to high at 0.65, 0.72 and 0.62, respectively (Table 1). For the tofu quality traits, heritabilities were generally lower ranging from 0.23 for tofu weight to 0.66 for the number of hard beans. The genotypic variance was highly significant (P < 0.001) for all traits except for tofu weight. The effect of the trial location was highly significant (P < 0.001) as well, for all traits except for the soaking factor. In contrast, the genotype-by-location interaction variance was significant only for thousand-seed weight, oil content and hard beans.

Table 1.

Summary statistics of the evaluated traits. Mean of BLUE values (X¯), genotypic variance (σG2), location variance (σL2), the respective interaction variance (σGL2), error variance (σε2), and heritability (h2) for thousand-seed weight in gram (TSW), protein content in percent (PC), oil content in percent (OC), soaking factor (SF), soymilk weight in gram (SM), tofu weight in gram (TW), tofu yield (TY), tofu hardness in Newton (TH), tofu value (TV), and the square root of counted hard beans (HB)

Trait X¯+ σG2 σL2 σGL2 σε2 h2
TSW 227.97 102.29*** 66.78*** 44.09*** 123.44 0.65
PC 46.73 0.63*** 0.11*** 9.33e−8ns 0.92 0.72
OC 15.91 0.10*** 0.34*** 0.04** 0.16 0.62
SF 2.11 5.33e−4*** 0.05e−4ns 0.78e−4ns 9.86e−4 0.64
HB 1.05 0.32*** 0.11*** 0.09** 0.46 0.66
SM 571.55 61.82*** 76.47*** 6.73ns 146.93 0.59
TW 139.29 5.08ns 18.39*** 2.98ns 59.34 0.23
TY 1.97 2.55e−3*** 4.07e−3*** 0.47e−3ns 10.00e−3 0.47
TH 78.96 33.05*** 62.06*** 1.04e−5ns 196.42 0.39
TV 300.95 212.00*** 101.83*** 1.87e−4ns 1848.20 0.30

*P < 0.05, **P < 0.01, ***P < 0.001, nsnot significant

+Mean values for HB were transformed back to represent the count of hard beans

BLUEs across locations varied substantially between the genotypes and approximately followed a normal distribution for all traits except for hard bean count (Fig. 1). On average, the genotypes showed high thousand-seed weight (227.97 g), very high protein content (46.73%) and lower oil content (15.91%) (Table 1). For tofu quality traits, tofu hardness ranged from 54.67 N to 108.11 N and the mean of 78.96 N is slightly above the producer’s threshold of 75 N. In contrast, ranging between 1.74 and 2.16, mean tofu yield of 1.97 lay slightly below the producer’s threshold of 2.00. The two parental genotypes showed very similar performance for all tofu quality traits except for the soymilk weight (Fig. 1). For thousand-seed weight, protein content and oil content, the parents differed more strongly, as parent 1 showed higher thousand-seed weight and protein content, whereas parent 2 was higher in oil content.

Fig. 1.

Fig. 1

Trait distributions. Distribution of BLUE values across locations for the traits thousand-seed weight (TSW), protein content, oil content, soaking factor, soymilk weight, tofu weight, tofu yield, tofu hardness, tofu value, and number of hard beans. The performance of the parental genotypes is indicated by red triangles. Note, BLUE values for hard bean counts were estimated based on square root transformed values, the resulting BLUEs were transformed back for this figure

The yield parameter thousand-seed weight was significantly (P < 0.05) positively correlated with protein content and soymilk weight at a moderate level, whereas it was negatively correlated with oil content, tofu weight, hard bean count and soaking factor (Fig. 2). However, thousand-seed weight showed no significant correlation with tofu yield, tofu hardness and tofu value. Protein content was weakly to moderately positively correlated with soymilk weight, tofu value and tofu hardness and negatively correlated with tofu weight, soaking factor and count of hard beans. Tofu yield showed a strong positive correlation with tofu weight (r = 0.87) and moderate positive correlation with soymilk weight. In contrast, tofu yield was moderately negatively correlated with hard bean count (r = −0.34) and tofu hardness (r = −0.65). In addition, tofu hardness was strongly positively correlated with the tofu value (r = 0.80), whereas the correlation with tofu weight was strongly negative (r = −0.71). Tofu value was further moderately positively correlated with soymilk weight and negatively correlated with tofu weight and hard bean count. Hard bean count was also negatively correlated with soymilk weight at r = −0.60.

Fig. 2.

Fig. 2

Correlation among the investigated traits. a Network plot and b matrix with the correlations among the traits thousand-seed weight (TSW), protein content (PC), oil content (OC), soaking factor (SF), soymilk weight (SM), tofu weight (TW), tofu yield (TY), tofu hardness (TH), tofu value (TV), and number of hard beans (HB). Asterisk indicates correlation coefficients significantly different from zero at the 5% significance level

QTL mapping identified at least one QTL for all traits (Tab. 2, Fig. S1). For thousand-seed weight, three QTL were found on chromosomes 7, 14 and 16, explaining between 9.41% and 12.33% of the genotypic variance. For protein content, we found five QTL on chromosomes 5, 11, 12, 14 and 16 explaining 6.40–13.96% of the genotypic variance and three different QTL were detected for oil content explaining 9.43–16.17%. On chromosome 4, a pleiotropic QTL was found for the number of hard beans and soymilk, explaining 13.59% and 16.15% of the genotypic variance, respectively, for which the favourable allele was associated with a reduction in hard beans of 0.06 and an increase of 3.49 g in soymilk weight. Another pleiotropic QTL was found on chromosome 10 for tofu weight, explaining 27.55% of genotypic variance, tofu hardness (20.65%) and tofu value (20.18%). For tofu weight, the favourable allele was contributed by parent 1, leading to an increase of 1.39 g in tofu weight, whereas tofu hardness and tofu value were increased by 3.44 N and 7.59, respectively, by the allele from parent 2. Additional major QTL that explained over 20% of the respective genotypic variance were found on chromosome 13 for tofu yield and on chromosome 5 for tofu value, where the allele of parent 2 was associated with an increase of 0.03 and 8.84, respectively. Overall, for the tofu quality traits, parent 2 contributed the majority of the favourable alleles. The major QTL showed substantial effects on the respective traits across, as well as within locations (Fig. S2).

Table 2.

Results from QTL mapping. Chromosome (Chr.) and position (Pos) of detected QTL and their support interval (SI) in centiMorgan (cM), flanking markers indicating their physical position, logarithm of odds (LOD), donor of the favourable allele, the additive effect (Add. eff.), proportion of genotypic variance explained by the QTL (pG), and the frequency in which the QTL was observed in 200 independent five-fold cross-validation runs (Freq. CV), shown for the traits thousand-seed weight (TSW), protein content (PC), oil content (OC), soaking factor (SF), count of hard beans (HB), soymilk weight (SM), tofu weight (TW), tofu yield (TY), tofu hardness (TH), and tofu value (TV)

Trait Chr. Pos. (SI)
(cM)
Left marker Right marker LOD Donor (fav. allele) Add. eff.+ pG (%) Freq. CV
TSW 7 57 (53–61) Chr07_5770755 Chr07_6463334 3.15 Parent 1 3.34 9.41 0.26
14 61 (53–66) Chr14_8232119 Chr14_9109810 3.59 Parent 1 3.54 10.67 0.46
16 117 (106–130) Chr16_28461478 Chr16_35670141 4.18 Parent 1 5.20 12.33 0.68
PC 5 131 (128–133) Chr05_41123747 Chr05_41392595 2.88 Parent 2 0.22 6.40 0.16
11 73 (58–79) Chr11_25910388 Chr11_18186161 4.51 Parent 1 0.29 9.81 0.57
12 145 (142–146) Chr12_35603615 Chr12_35692321 6.57 Parent 1 0.35 13.96 0.42
14 9 (5–24) Chr14_1159327 Chr14_1437584 4.13 Parent 2 0.26 9.04 0.25
16 144 (134–146) Chr16_36955239 Chr16_37051444 5.26 Parent 1 0.30 11.36 0.33
OC 9 0 (0–2) Chr09_223061 Chr09_305244 6.44 Parent 2 0.15 16.17 0.79
12 187 (186–187) Chr12_38815575 Chr12_39085077 4.70 Parent 2 0.13 12.05 0.26
18 96 (75–104) Chr18_10268740 Chr18_12826385 3.63 Parent 2 0.11 9.43 0.19
SF 10 129 (120–139) Chr10_43599219 Chr10_48347975 3.41 Parent 1 0.01 9.69 0.37
17 121 (118–122) Chr17_40412430 Chr17_40584687 4.17 Parent 1 0.01 11.72 0.70
18 105 (103–107) Chr18_16111507 Chr18_16686711 6.98 Parent 2 0.01 19.01 0.76
HB 4 59 (55–60) Chr04_7990022 Chr04_8395715 4.74 Parent 2 −0.06 13.59 0.53
14 24 (21–26) Chr14_3546556 Chr14_3734218 3.04 Parent 2 −0.04 8.92 0.26
SM 2 120 (118–126) Chr02_38862876 Chr02_39191621 2.69 Parent 2 2.55 9.53 0.12
4 59 (58–65) Chr04_7990022 Chr04_8395715 4.66 Parent 2 3.49 16.15 0.58
4 122 (114–134) Chr04_46792598 Chr04_48009592 3.02 Parent 2 3.10 10.69 0.40
TW 10 19 (17–22) Chr10_2920019 Chr10_3139957 3.09 Parent 1 1.39 27.55 0.21
TY 13 120 (118–122) Chr13_30645621 Chr13_30673726 4.54 Parent 2 0.03 20.39 0.40
TH 1 76 (72–84) Chr01_47186726 Chr01_48805860 4.10 Parent 2 2.86 15.74 0.22
8 28 (24–32) Chr08_4651791 Chr08_6045309 3.69 Parent 2 2.70 14.24 0.28
10 14 (11–18) Chr10_2348299 Chr10_2906572 5.47 Parent 2 3.44 20.65 0.52
18 47 (34–61) Chr18_2098093 Chr18_3883865 4.75 Parent 1 3.35 18.10 0.20
TV 5 1 (0–3) Chr05_1420686 Chr05_1545718 4.35 Parent 2 8.84 26.21 0.72
8 152 (137–156) Chr08_44206039 Chr08_45163890 2.69 Parent 2 6.93 16.55 0.25
10 16 (11–19) Chr10_2348299 Chr10_2906572 3.31 Parent 2 7.59 20.18 0.40

+For QTL mapping HB counts transformed with square root transformation were used, here the values were transformed back to represent actual HB counts

We exemplarily used the QTL identified for tofu hardness to evaluate the effect of QTL stacking, which revealed a substantial effect on the target trait as several allelic combinations of at least three favourable alleles consistently performed above the desired threshold of 75 N (Fig. 3).

Fig. 3.

Fig. 3

The effect of QTL stacking. The boxplot illustrates the effect of combining favourable alleles of the four identified QTL for tofu hardness on chromosomes 1, 8, 10 and 18 in 188 genotypes. + indicates the presence of the homozygous favourable allele, -indicates the homozygous unfavourable allele, heterozygous loci and missing values. The red dotted line represents the quality threshold of 75 N desired in German tofu production

Genomic prediction with the same 188 genotypes and the 2,452 genome-wide markers used for QTL mapping revealed moderate mean prediction accuracies for all traits, ranging from 0.45 for thousand-seed weight to 0.61 for protein content (Fig. 4). Five-fold cross-validation also revealed a large variation in prediction accuracy between the runs. For example, prediction accuracies for tofu hardness ranged from −0.20 to 0.76 with a mean of 0.49, which for the majority of the tofu quality traits was in a similar range.

Fig. 4.

Fig. 4

The potential of genomic prediction. Genomic prediction accuracies from 1000 runs of cross-validation and their respective means are shown

Discussion

Breeding for soybean tofu quality

The increasing trend towards vegetarian or vegan nutrition moves traits related to tofu quality to the foreground in soybean breeding programs targeted at human consumption. For the biparental mapping population underlying this study, we specifically used two parental lines that both are suitable for tofu production. The reasoning behind this is that such crosses are typical for applied breeding programs and thus the population allows investigating the variation and its genetic control relevant for tofu quality breeding.

The tofu quality traits showed low to medium heritabilities between 0.23 for tofu value and 0.66 for the number of hard beans (Tab. 1). Notably, both parental lines showed similar performance in most tofu quality traits and consequently, the genotypic variance might be rather small in comparison to the error variance, resulting in a lower heritability (Fig. 1, Tab. 1). This conclusion is supported by the fact that the heritability was especially low for traits for which the parents performed most similarly and showed a relatively high error variance (i.e. tofu weight, tofu yield, tofu hardness and tofu value). The significant effect of both the soybean genotype and the growing environment, specifically on tofu yield and hardness, is in accordance with previous studies (Wang et al. 1983; Mullin et al. 2001; Aziadekey et al. 2002; Min et al. 2005; Kurasch et al. 2018b). A source of the rather large error effects likely lies in inhomogeneities in the test tofu laboratory. When establishing the bench-scale tofu testing method followed in this study, Kurasch et al. (2018a) found strong effects of the team preparing the test tofu as well as the day of tofu production. Consequently, several steps of the procedure were subsequently standardized or even automated. Nevertheless, the various steps of the tofu production process in combination with the small amount of seeds used for it, are still a source for confounding effects. However, despite the medium heritabilities found in this study, our results highlight the feasibility of breeding soybean with improved tofu quality.

The genotypes of the mapping population varied substantially for the two most important tofu quality traits, tofu yield (between 1.74 and 2.16) and especially for tofu hardness (ranging from 54.67 to 108.11 N) (Fig. 1). For tofu yield, the tofu manufacturer desires a minimum value of 2.0 with the optimum of 2.2. Many genotypes therefore showed too low tofu yield for production purposes, including the parental lines not reaching the cut-off level of 2.0. For tofu hardness, the desired range lies between 75 and 120 N. In addition to the rather low tofu yields, many genotypes also showed values below this desired minimum value. This may be due to the environment, which, as shown here, also has a substantial effect on these two traits. However, while the absolute trait level is relevant for tofu production, the present variation of the traits is the decisive factor for successful QTL mapping. Interestingly, we observed transgressive segregation for all traits, meaning that there were progeny with trait values outperforming the superior parent or lower than the inferior parent. This means that both parents carry favourable and unfavourable alleles for the target traits that are newly combined in the progeny. Consequently, the combination of favourable alleles from both parents results in genotypes with a performance superior to either parent, which are candidates for new varieties and illustrate how breeding progress for tofu quality can be achieved.

The relationship between agronomic traits and tofu quality is unclear. In our mapping population, no correlation was found between thousand-seed weight and tofu hardness or tofu yield. For thousand-seed weight and protein content or soymilk weight, however, moderate positive correlations of 0.30 and 0.28, respectively, could be observed. Regarding protein content, the literature on its influence on tofu quality is conflicting. Previously, a positive association between seed protein content and tofu yield has been reported (Bhardwaj et al. 1999). In other studies, however, this conclusion could not be corroborated (Wang et al. 1983; Lim et al. 1990; Kurasch et al. 2018a, 2018b). Non-significant or low correlation coefficients between protein content and tofu yield parameters and tofu hardness were found in our study (Fig. 2), supporting the conclusion that the protein content by itself has a low relevance and it is more likely the protein composition that determines tofu quality. Notably, however, this does not rule out an effect of protein content, for which a certain minimum may be required as a basis for tofu quality. In our soybean material, all genotypes had a high protein content between 44.61% and 49.25% and thus, in populations including variation in the lower protein content range, the effect of protein content on tofu quality traits might be larger. The analysis of trait correlations also revealed the well-known strong negative correlation between protein and oil content, explaining why, if present, they often have opposite effects on tofu traits.

The tofu quality traits showed substantial positive as well as negative correlations among each other. For example, a strong negative correlation between tofu yield and tofu hardness (r = −0.65; Fig. 2) was observed, that has also been reported in previous studies (Min et al. 2005; Kurasch et al. 2018a). Therefore, a simultaneous improvement of both traits is challenging since tofu yield is closely linked to higher water content, whereas firmer tofu tends to hold less water (Min et al. 2005). Consequently, a balance between both traits needs to be achieved to produce tofu with a texture suited to consumer preferences while providing an adequate tofu yield. Taken together, also in the cross of two tofu-type soybeans, there is substantial variation in the resulting progeny and the mapping population is consequently well suited to dissect the genetic architecture of tofu quality traits.

The genetic architecture of tofu quality traits

Characterization of the genetic architecture of target traits holds the potential to identify major QTL that could then be utilized in marker-assisted selection. In our study, we identified QTL for all evaluated traits, including the tofu quality traits.

We identified three QTL for thousand-seed weight on chromosomes 7, 14 and 16, for which the favourable allele was always contributed by parent 1, the parent with the higher seed weight. Several QTL have been reported for seed weight on chromosome 7 (Orf et al. 1999; Csanádi et al. 2001; Hoeck et al. 2003; Teng et al. 2009; Li et al. 2010; Yan et al. 2014), of which one overlaps with our QTL (Teng et al. 2009). Seed weight QTL have also been reported on chromosome 14 (Specht et al. 2001; Hoeck et al. 2003; Liu et al. 2011) and chromosome 16 (Maughan et al. 1996; Kim et al. 2010).

For the number of hard beans, which significantly impairs soymilk weight, tofu yield and tofu value (Fig. 2), we identified two QTL on chromosomes 4 and 14, where the allele from parent 2 was associated with fewer hard beans in both cases. For traits related to seed coat permeability, several QTL have been reported (e.g., Liu et al. 2007; Ren et al. 2024), but none on chromosome 4 or 14. However, Kurasch et al. (2018b) reported a QTL on chromosome 4 for the weight gain after 25 h of soaking that overlaps with the QTL identified here. Hard beans are characterised by not taking up water during the soaking process. Therefore, both findings indicate causal genetic variation in that region of chromosome 4. In our study, this QTL was pleiotropic for soymilk weight. Since hard beans cannot be processed into soymilk and are removed before blending the seeds, this QTL likely indirectly influences soymilk yield by affecting the soaking behaviour of the seeds.

There was no substantial correlation between protein content and tofu traits in our population and none of the QTL for protein content overlapped with those of the tofu traits. However, two QTL on chromosome 8 associated with tofu value in our population have also been reported for soluble protein content (Lu et al. 2013), indicating at least some association between (soluble) protein and tofu quality. Moreover, a QTL for tofu hardness has been identified in a similar region of chromosome 8 (Kurasch et al. 2018b). This supports a possible role of water-soluble protein in the expression of tofu quality traits (Lu et al. 2013). Further publications suggest that tofu yield parameters and hardness are linked to protein composition rather than protein content alone. Especially the ratio of the storage proteins glycinin (11S) to β-conglycinin (7S) may affect tofu quality, with a higher proportion of 11S protein being associated with firmer tofu in the majority of publications (e.g., Saio et al. 1971; Yagasaki et al. 2000; Renkema et al. 2001; Poysa et al. 2006; Stanojevic et al. 2011). Notably, the influence of the storage proteins and their subunit composition strongly relies on the tofu production method and the soybean variety, which may explain contrasting findings (Utsumi and Kinsella 1985; Cai and Chang 1999; Yang and James 2023). Interestingly, the gene Gy4, encoding the glycinin G4 subunit on chromosome 10 (SoyBase, www.soybase.org), is located within the confidence interval of the QTL we detected for tofu weight, which is pleiotropic for tofu hardness and tofu value (Table 2, Fig. S1). In the genotyping-by-sequencing data, the gene region was not covered. We therefore employed re-sequencing data of the parental lines (Zhu et al. 2023) to search for polymorphisms in Gy4. This revealed no polymorphisms in the Gy4 coding sequence, the introns, the promoter or the downstream region. This indicates that if Gy4 is the causal gene underlying this QTL, this may be due to expression differences caused by sequence polymorphisms farther away from the gene, or that despite being a very plausible candidate, the QTL is not caused by Gy4 but another gene in this chromosomal region, which requires further research on the molecular level. In addition, two of the QTL found for tofu hardness in this study overlap with previously reported protein content QTL on chromosome 1 (Pandurangan et al. 2012; Mao et al. 2013) and on chromosome 18 (Jun et al. 2008; Lu et al. 2013; Mao et al. 2013), indicating a possible influence of protein content on tofu hardness despite the observed relatively low phenotypic correlation.

Taken together, the tofu quality traits can be considered as complex traits but with some medium- to large-effect QTL being present. In addition, the substantial effects of the identified breeding-relevant major QTL on the respective traits within single locations indicate their effectiveness across various environments (Fig. S2). As the tofu traits are laborious to phenotype and in addition are strongly influenced by the growing environment and other non-genetic factors, being able to directly address and exploit such QTL, and with them larger proportions of the genotypic variance, would be a big step forward in targeted breeding for tofu quality.

Potential of marker-assisted and genomic selection to advance tofu quality breeding

Marker-assisted selection is a valuable tool to directly exploit causal genetic loci in advancing a trait of interest. However, when QTL explain only a small proportion of genotypic variance, this approach is not suitable to drive the improvement of the targeted traits. Together, the four QTL identified in our study for tofu hardness could explain 68.7% of the genotypic variance (Table 2). We therefore exemplarily used this trait to explore the potential of QTL stacking, i.e. the combination of several favourable QTL alleles in a genotype. This revealed that the combination of at least three QTL leads to tofu hardness values consistently above the manufacturer’s desired threshold of 75 N for most combinations (Fig. 3). The major QTL on chromosome 10 that shows the strongest effect on ensuring meeting this threshold, however, was pleiotropic for tofu weight and tofu value. In line with the negative correlation of tofu weight with the other two traits, the allele from parent 2 was found to increase tofu hardness and tofu value, but had a negative effect on tofu weight. In general, a pleiotropic QTL can be due to one gene affecting two or more traits or can also be caused by closely linked loci, each affecting only one trait. In the latter case, the causal alleles can be separated by recombination, even though in practice this may require substantial efforts. In case of a single locus with pleiotropic effect, a breeder must decide which allele to select for and consequently which trait to increase and which to decrease. The decrease could be better accepted for the trait for which a compensation through other loci is more easily achieved. Nonetheless, the result on QTL stacking indicates that a marker-assisted selection approach might be feasible to efficiently exploit major QTL and with them a substantial proportion of the genotypic variance. Notably, however, QTL always depend on the studied population and results are often not transferable to other populations. The loci need to segregate within the population and their effects might vary in different genetic backgrounds. In a similar study, Kurasch et al. (2018b) found a QTL for tofu hardness on chromosome 8 in only one of three evaluated biparental populations. Interestingly, the pleiotropic QTL they identified for tofu hardness and tofu value at 62 and 64 cM on chromosome 8, respectively, did not overlap with the tofu hardness QTL identified in our population on chromosome 8 at 28 cM. This further highlights the strong dependency of the QTL results on the studied population, which limits the transferability required for marker-assisted selection. Therefore, in order to make use of the identified QTL, their genomic location needs to be fine-mapped and their effect on the trait of interest needs to be verified in different populations.

A complementary approach to marker-assisted selection is genomic selection, that offers the ability to utilise genomic information for the improvement of highly quantitative traits. In this study, mean prediction accuracies were moderate to high, ranging between 0.47 for thousand-seed weight and 0.78 for protein content (Fig. 4). Given the mentioned complexity in phenotyping the tofu quality traits, this can be very valuable in breeding. However, the variation of prediction accuracies between the cross-validation runs was rather large for all traits. For example, prediction accuracies for tofu hardness ranged from −0.27 to 1.03 with a mean of 0.49. A variation this large indicates a strong dependency of the respective prediction accuracy on the composition of the training and prediction sets. In a similar study, evaluating a very similar set of tofu quality parameters in two RIL populations, Kurasch et al. (2018b) found in part substantially lower mean prediction accuracies for tofu weight, tofu yield, tofu hardness and tofu value but with a similarly large variation. Thus, the accuracy of genomic prediction of tofu quality traits needs to be increased further and made more reliable for application in practical soybean breeding. This can likely be achieved by larger training sets including different genetic material representative of the breeding program. Ideally, the phenotypic assessment of the traits can also be improved further, so that the heritability increases and the data underlying the training of the prediction model become more robust. A potential limitation of genomic selection are the associated costs, as the approach requires genome-wide marker data. Taken together, marker-assisted selection and genomic selection show potential to assist breeding of tofu quality traits, but both require further validation and optimization before they can be routinely employed in breeding programs.

Conclusion and outlook

Tofu quality traits are becoming increasingly important in soybean breeding programs targeted at the food market. In this study, we evaluated a biparental population and observed substantial variation and transgressive segregation for all investigated traits, illustrating that an improvement of tofu quality traits can also be achieved in best-by-best crosses that are typical in applied breeding programs. We also observed intricate interrelations among the tofu traits, including negative correlations, which need to be balanced during selection, potentially using index selection. QTL mapping revealed some moderate- to large-effect QTL for all tofu quality traits, but the results also illustrate an in general complex inheritance with many additional small-effect QTL. Consequently, both marker-assisted and genomic selection hold potential for breeding, or even their joint use or integration, but both require further research and optimization. In conclusion, soybean breeding for improved tofu quality traits is feasible to meet the increasing demand and diversity of vegetarian and vegan products.

Supplementary Information

Below is the link to the electronic supplementary material.

Acknowledgements

We would like to thank the technical staff of the State Plant Breeding Institute for their efforts in the fieldwork and data collection and the staff of the Taifun Tofu GmbH for producing test tofu.

Author contributions

Cleo A. Döttinger: Conceptualization, Methodology, Formal Analysis, Investigation, Data Curation, Writing – Original Draft, Writing – Review & Editing, Visualization. Kim A. Steige: Conceptualization, Methodology, Formal Analysis, Investigation, Data Curation, Writing – Original Draft, Writing – Review & Editing. Volker Hahn: Conceptualization, Methodology, Investigation, Data Curation, Writing – Review & Editing, Supervision, Project Administration, Funding Acquisition. Willmar L. Leiser: Conceptualization, Methodology, Writing – Review & Editing, Project Administration, Funding Acquisition. Kristina Bachteler: Conceptualization, Data Curation, Writing – Review & Editing, Funding Acquisition. Xintian Zhu: Investigation, Writing – Review & Editing. Tobias Würschum: Conceptualization, Methodology, Writing – Review & Editing, Supervision, Project Administration, Funding Acquisition.

Funding

Open Access funding enabled and organized by Projekt DEAL. This research was funded by the German Federal Ministry for Education and Research (BMBF) under grant number 031B1046A and handled by the project management organization Projektträger Jülich (PtJ).

Data availability

Data is contained in the supplementary material.

Declarations

Conflicts of interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Aziadekey M, Schapaugh WT, JR., Herald TJ, (2002) Genotype by environment interaction for soymilk and tofu quality characteristics. J Food Qual 25:243–259. 10.1111/j.1745-4557.2002.tb01022.x [Google Scholar]
  2. Bhardwaj HL, Bhagsari AS, Joshi JM, Rangappa M, Sapra VT, Rao MSS (1999) Yield and Quality of Soymilk and Tofu Made from Soybean Genotypes Grown at Four Locations. Crop Sci 39:401–405. 10.2135/cropsci1999.0011183X0039000200017x [Google Scholar]
  3. Butler D (2021) asreml: Fits the Linear Mixed Model. VSN International, Hemel Hempstead, United Kindom (www.vsni.co.uk) [Google Scholar]
  4. Cai T, Chang KC (1999) Processing effect on soybean storage proteins and their relationship with tofu quality. J Agric Food Chem 47:720–727. 10.1021/jf980571z [DOI] [PubMed] [Google Scholar]
  5. Chen S, Zhou Y, Chen Y, Gu J (2018) fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Churchill GA, Doerge RW (1994) Empirical threshold values for quantitative trait mapping. Genetics 138:963–971. 10.1093/genetics/138.3.963 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Csanádi G, Vollmann J, Stift G, Lelley T (2001) Seed quality QTLs identified in a molecular map of early maturing soybean. Theor Appl Genet 103:912–919. 10.1007/s001220100621 [Google Scholar]
  8. Cullis BR, Smith AB, Coombes NE (2006) On the design of early generation variety trials with correlated data. JABES 11:381–393. 10.1198/108571106X154443 [Google Scholar]
  9. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R (2011) The variant call format and VCFtools. Bioinformatics 27:2156–2158. 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Dekkers JCM (2007) Prediction of response to marker-assisted and genomic selection using selection index theory. J Anim Breed Genet 124:331–341. 10.1111/j.1439-0388.2007.00701.x [DOI] [PubMed] [Google Scholar]
  11. Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6:e19379. 10.1371/journal.pone.0019379 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Endelman JB (2011) Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4:250–255. 10.3835/plantgenome2011.08.0024 [Google Scholar]
  13. Eurostat (2024) Crop production in EU standard humidity. https://ec.europa.eu/eurostat/databrowser/view/APRO_CPSH1__custom_11679343/default/table?lang=en. Accessed 4 June 2024
  14. Fazekas AJ, Kuzmina ML, Newmaster SG, Hollingsworth PM (2012) DNA barcoding methods for land plants. DNA Barcodes 858:223–252. 10.1007/978-1-61779-591-6_11 [DOI] [PubMed] [Google Scholar]
  15. Herten K, Hestand MS, Vermeesch JR, van Houdt JKJ (2015) GBSX: a toolkit for experimental design and demultiplexing genotyping by sequencing experiments. BMC Bioinformatics 16:73. 10.1186/s12859-015-0514-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Hoeck JA, Fehr WR, Shoemaker RC, Welke GA, Johnson SL, Cianzio SR (2003) Molecular marker analysis of seed size in soybean. Crop Sci 43:68–74. 10.2135/cropsci2003.6800 [Google Scholar]
  17. Huang Z, Liu H, Zhao L, He W, Zhou X, Chen H, Zhou X, Zhou J, Liu Z (2022) Evaluating the effect of different processing methods on fermented soybean whey-based tofu quality, nutrition, and flavour. LWT 158:113139. 10.1016/j.lwt.2022.113139 [Google Scholar]
  18. Huang M, Chen S, Li W, Tian R, Zhang H, Shao Z, Li X, Zhang C (2023) Genetic loci and candidate genes with pleiotropic effects for tofu and soymilk weights across multiple environments in soybean (Glycine max). Plant Breed 142:149–169. 10.1111/pbr.13075 [Google Scholar]
  19. Jun T-H, Van K, Kim MY, Lee S-H, Walker DR (2008) Association analysis using SSR markers to find QTL for seed protein content in soybean. Euphytica 162:179–191. 10.1007/s10681-007-9491-6 [Google Scholar]
  20. Kim H-K, Kim Y-C, Kim S-T, Son B-G, Choi Y-W, Kang J-S, Park Y-H, Cho Y-S, Choi I-S (2010) Analysis of Quantitative Trait Loci (QTLs) for seed size and fatty acid composition using recombinant inbred lines in soybean. J Life Sci 20:1186–1192. 10.5352/JLS.2010.20.8.1186 [Google Scholar]
  21. Kong F, Chang SKC, Liu Z, Wilson LA (2008) Changes of soybean quality during storage as related to soymilk and tofu making. J Food Sci 73:S134–S144. 10.1111/j.1750-3841.2007.00652.x [DOI] [PubMed] [Google Scholar]
  22. Kurasch AK, Hahn V, Miersch M, Bachteler K, Würschum T (2018) Analysis of tofu-related traits by a bench-scale tofu production method and their relationship with agronomic traits in European soybean. Plant Breed 137:271–282. 10.1111/pbr.12581 [Google Scholar]
  23. Kurasch AK, Leiser WL, Bachteler K, Miersch M, Hahn V, Würschum T (2018) Evaluation of the genetic architecture of tofu traits in soybean towards genomics-assisted breeding. Plant Breed 137:873–882. 10.1111/pbr.12651 [Google Scholar]
  24. Lande R, Thompson R (1990) Efficiency of marker-assisted selection in the improvement of quantitative traits. Genetics 124:743–756. 10.1093/genetics/124.3.743 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Li H (2011) A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27:2987–2993. 10.1093/bioinformatics/btr509 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Li D, Sun M, Han Y, Teng W, Li W (2010) Identification of QTL underlying soluble pigment content in soybean stems related to resistance to soybean white mold (Sclerotinia sclerotiorum). Euphytica 172:49–57. 10.1007/s10681-009-0036-z [Google Scholar]
  28. Lim BT, deMan JM, deMan L, Buzzell RI (1990) Yield and quality of tofu as affected by soybean and soymilk characteristics. Calcium Sulfate Coagulant J Food Sci 55:1088–1092. 10.1111/j.1365-2621.1990.tb01605.x [Google Scholar]
  29. Lima A, Oliveira J, Saúde F, Mota J, Ferreira RB (2017) Proteins in soy might have a higher role in cancer prevention than previously expected: soybean protein fractions are more effective MMP-9 inhibitors than non-protein fractions, Even in cooked seeds. Nutrients 9:201. 10.3390/nu9030201 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Liu B, Fujita T, Yan Z-H, Sakamoto S, Xu D, Abe J (2007) QTL mapping of domestication-related traits in soybean (Glycine max). Ann Bot 100:1027–1038. 10.1093/aob/mcm149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Liu W, Kim MY, Van K, Lee Y-H, Li H, Liu X, Lee S-H (2011) QTL identification of yield-related traits and their association with flowering and maturity in soybean. J Crop Sci Biotechnol 14:65–70. 10.1007/s12892-010-0115-7 [Google Scholar]
  32. Lu W, Wen Z, Li H, Yuan D, Li J, Zhang H, Huang Z, Cui S, Du W (2013) Identification of the quantitative trait loci (QTL) underlying water soluble protein content in soybean. Theor Appl Genet 126:425–433. 10.1007/s00122-012-1990-8 [DOI] [PubMed] [Google Scholar]
  33. Mao T, Jiang Z, Han Y, Teng W, Zhao X, Li W (2013) Identification of quantitative trait loci underlying seed protein and oil contents of soybean across multi-genetic backgrounds and environments. Plant Breed 132:630–641. 10.1111/pbr.12091 [Google Scholar]
  34. Maughan PJ, Maroof MA, Buss GR (1996) Molecular-marker analysis of seed-weight: genomic locations, gene action, and evidence for orthologous evolution among three legume species. Theor Appl Genet 93:574–579. 10.1007/BF00417950 [DOI] [PubMed] [Google Scholar]
  35. Min S, Yu Y, St. Martin S (2005) Effect of soybean varieties and growing locations on the physical and chemical properties of soymilk and tofu. J Food Sci 70:C8–C21. 10.1111/j.1365-2621.2005.tb09026.x [Google Scholar]
  36. Money D, Gardner K, Migicovsky Z, Schwaninger H, Zhong G-Y, Myles S (2015) LinkImpute fast and accurate genotype imputation for nonmodel organisms. G3 (Bethesda) 5:2383–2390 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Mullin WJ, Fregeau-Reid JA, Butler M, Poysa V, Woodrow L, Jessop DB, Raymond D (2001) An interlaboratory test of a procedure to assess soybean quality for soymilk and tofu production. Food Res Int 34:669–677. 10.1016/S0963-9969(01)00087-4 [Google Scholar]
  38. Orf JH, Chase K, Jarvik T, Mansur LM, Cregan PB, Adler FR, Lark KG (1999) Genetics of soybean agronomic traits: I. comparison of three related recombinant inbred populations. Crop Sci 39:1642–1651. 10.2135/cropsci1999.3961642x [Google Scholar]
  39. Pandurangan S, Pajak A, Molnar SJ, Cober ER, Dhaubhadel S, Hernández-Sebastià C, Kaiser WM, Nelson RL, Huber SC, Marsolais F (2012) Relationship between asparagine metabolism and protein concentration in soybean seed. J Exp Bot 63:3173–3184. 10.1093/jxb/ers039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Piepho H-P, Möhring J (2007) Computing heritability and selection response from unbalanced plant breeding trials. Genetics 177:1881–1888. 10.1534/genetics.107.074229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Posit team (2023) RStudio: Integrated Development Environment for R. Posit Software. PBC, Boston, MA [Google Scholar]
  42. Poysa V, Woodrow L (2002) Stability of soybean seed composition and its effect on soymilk and tofu yield and quality. Food Res Int 35:337–345. 10.1016/S0963-9969(01)00125-9 [Google Scholar]
  43. Poysa V, Woodrow L, Yu K (2006) Effect of soy protein subunit composition on tofu quality. Food Res Int 39:309–317. 10.1016/j.foodres.2005.08.003 [Google Scholar]
  44. R Core Team (2021) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (https://www.R-project.org/) [Google Scholar]
  45. Ren H, Zhao K, Zhang C, Lamlom SF, Liu X, Wang X, Zhang F, Yuan R, Gao Y, Cao B, Zhang B (2024) Genetic analysis and QTL mapping of seed hardness trait in a soybean (Glycine max) recombinant inbred line (RIL) population. Gene 905:148238. 10.1016/j.gene.2024.148238 [DOI] [PubMed] [Google Scholar]
  46. Renkema J, Knabben J, van Vliet T (2001) Gel formation by β-conglycinin and glycinin and their mixtures. Food Hydrocolloids 15:407–414. 10.1016/S0268-005X(01)00051-0 [Google Scholar]
  47. Reuscher S, Furuta T (2016) ABHgenotypeR. https://github.com/StefanReuscher/ABHgenotypeR
  48. Saio K, Kajikawa M, Watanabe T (1971) Food processing characteristics of soybean proteins. Agric Biol Chem 35:890–898. 10.1080/00021369.1971.10860005 [Google Scholar]
  49. Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, Gill N, Joshi T, Libault M, Sethuraman A, Zhang X-C, Shinozaki K, Nguyen HT, Wing RA, Cregan P, Specht J, Grimwood J, Rokhsar D, Stacey G, Shoemaker RC, Jackson SA (2010) Genome sequence of the palaeopolyploid soybean. Nature 463:178–183. 10.1038/nature08670 [DOI] [PubMed] [Google Scholar]
  50. Specht JE, Chase K, Macrander M, Graef GL, Chung J, Markwell JP, Germann M, Orf JH, Lark KG (2001) Soybean response to water: A QTL analysis of drought tolerance. Crop Sci 41:493–509. 10.2135/cropsci2001.412493x [Google Scholar]
  51. Stanojevic SP, Barac MB, Pesic MB, Vucelic-Radovic BV (2011) Assessment of soy genotype and processing method on quality of soybean tofu. J Agric Food Chem 59:7368–7376. 10.1021/jf2006672 [DOI] [PubMed] [Google Scholar]
  52. Statista (2020) Umsatzentwicklung mit Fleischersatzprodukten aus Tofu in Deutschland nach Produktformen in den Jahren 2018 bis 2022 (in Millionen Euro) [Graph]. https://de.statista.com/statistik/daten/studie/1100705/umfrage/marktentwicklung-von-fleischersatzprodukten-aus-tofu/. Accessed 26 April 2024
  53. Taylor J, Butler D (2017) R Package ASMap: Efficient Genetic Linkage Map Construction and Diagnosis. J Stat Soft 79:1–29. 10.18637/jss.v079.i06 [Google Scholar]
  54. Teng W, Han Y, Du Y, Sun D, Zhang Z, Qiu L, Sun G, Li W (2009) QTL analyses of seed weight during the development of soybean (Glycine max L. Merr.). Heredity (Edinb) 102:372–380. 10.1038/hdy.2008.108 [DOI] [PubMed] [Google Scholar]
  55. Utsumi S, Kinsella JE (1985) Forces involved in soy protein gelation: effects of various reagents on the formation, hardness and solubility of heat-induced gels made from 7S, 11S, and soy isolate. J Food Sci 50:1278–1282. 10.1111/j.1365-2621.1985.tb10461.x [Google Scholar]
  56. Utz HF (2011) PlabMQTL - Software for meta-QTL analysis with composite interval mapping. Institute of Plant Breeding, Seed Science, and Population Genetics. University of Hohenheim, Stuttgart, Germany [Google Scholar]
  57. Utz HF, Melchinger AE, Schön CC (2000) Bias and sampling error of the estimated proportion of genotypic variance explained by quantitative trait loci determined from experimental data in maize using cross validation and validation with independent samples. Genetics 154:1839–1849. 10.1093/genetics/154.4.1839 [PMC free article] [PubMed] [Google Scholar]
  58. Wang HL, Swain EW, Kwolek WF, Fehr WR (1983) Effect of soybean varieties on the yield and quality of tofu. Cereal Chem 60:245–248 [Google Scholar]
  59. Wang C, Gai J, Fu S, Yu D, Chen S (2008) Inheritance and QTL mapping of tofu and soymilk output in soybean. Scientia Agricultura Sinica 41:1274–1282 [Google Scholar]
  60. Wang F, Meng J, Sun L, Weng Z, Fang Y, Tang X, Zhao T, Shen X (2020) Study on the tofu quality evaluation method and the establishment of a model for suitable soybean varieties for Chinese traditional tofu processing. LWT 117:108441. 10.1016/j.lwt.2019.108441 [Google Scholar]
  61. Wickland DP, Battu G, Hudson KA, Diers BW, Hudson ME (2017) A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow. GB-eaSy BMC Bioinformatics 18:586. 10.1186/s12859-017-2000-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Yagasaki K, Kousaka F, Kitamura K (2000) Potential improvement of soymilk gelation properties by using soybeans with modified protein subunit compositions. Breed Sci 50:101–107. 10.1270/jsbbs.50.101 [Google Scholar]
  63. Yan L, Li Y-H, Yang C-Y, Ren S-X, Chang R-Z, Zhang M-C, Qiu L-J (2014) Identification and validation of an over-dominant QTL controlling soybean seed weight using populations derived from Glycine max × Glycine soja. Plant Breed 133:632–637. 10.1111/pbr.12197 [Google Scholar]
  64. Yang A, James AT (2013) Effects of soybean protein composition and processing conditions on silken tofu properties. J Sci Food Agric 93:3065–3071. 10.1002/jsfa.6140 [DOI] [PubMed] [Google Scholar]
  65. Zhang X, Shu XO, Gao Y-T, Yang G, Li Q, Li H, Jin F, Zheng W (2003) Soy food consumption is associated with lower risk of coronary heart disease in Chinese women. J Nutr 133:2874–2878. 10.1093/jn/133.9.2874 [DOI] [PubMed] [Google Scholar]
  66. Zhu S, Rossnagel BG, Kaeppler HF (2004) Genetic analysis of quantitative trait loci for groat protein and oil content in oat. Crop Sci 44:254–260. 10.2135/cropsci2004.2540 [Google Scholar]
  67. Zhu X, Leiser WL, Hahn V, Würschum T (2023) The genetic architecture of soybean photothermal adaptation to high latitudes. J Exp Bot 74:2987–3002. 10.1093/jxb/erad064 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

Data is contained in the supplementary material.


Articles from Molecular Breeding : New Strategies in Plant Improvement are provided here courtesy of Springer

RESOURCES