Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2020 May 16;230(1):387–398. doi: 10.1111/nph.16413

A tetrasomic inheritance model and likelihood‐based method for mapping quantitative trait loci in autotetraploid species

Jing Chen 1, Lindsey Leach 1, Jixuan Yang 2, Fengjun Zhang 2,3, Qin Tao 2, Zhenyu Dang 2, Yue Chen 2, Zewei Luo 1,2,
PMCID: PMC7984458  PMID: 31913501

Summary

  • Dissecting the genetic architecture of quantitative traits in autotetraploid species is a methodologically challenging task, but a pivotally important goal for breeding globally important food crops, including potato and blueberry, and ornamental species such as rose. Mapping quantitative trait loci (QTLs) is now a routine practice in diploid species but is far less advanced in autotetraploids, largely due to a lack of analytical methods that account for the complexities of tetrasomic inheritance.

  • We present a novel likelihood‐based method for QTL mapping in outbred segregating populations of autotetraploid species. The method accounts properly for sophisticated features of gene segregation and recombination in an autotetraploid meiosis. It may model and analyse molecular marker data with or without allele dosage information, such as that from microarray or sequencing experiments.

  • The method developed outperforms existing bivalent‐based methods, which may fail to model and analyse the full spectrum of experimental data, in the statistical power of QTL detection, and accuracy of QTL location, as demonstrated by an intensive simulation study and analysis of data sets collected from a segregating population of potato (Solanum tuberosum).

  • The study enables QTL mapping analysis to be conducted in autotetraploid species under a rigorous tetrasomic inheritance model.

Keywords: autotetraploids, double reduction, mixed chromosome pairing, quantitative trait locus (QTL), Solanum tuberosum

Introduction

Most agronomic traits targeted in plant or animal breeding programmes are quantitative or complex traits, including yield and quality traits, and resistance to various biotic and abiotic stresses. Phenotypic variation of these traits is under polygenic control, and to a significant extent is also influenced by environmental factors. By using abundantly distributed genomic DNA polymorphisms, this variation can be mapped onto specific chromosomal regions known as quantitative trait loci (QTLs). Models and methods for QTL mapping have been well established in diploid species. However, the corresponding methods are far less advanced in polyploid species, particularly for autopolyploids, even though this group encompasses evolutionarily and economically important plants and aquaculture animals, including potato (Solanum tuberosum, the world's third most important food crop), leek, and horticultural crops such as blueberry and rose. This is largely attributed to the complexities in gene segregation and recombination under polysomic inheritance.

In an autotetraploid genome, such as cultivated potato, the four homologous chromosomes may pair in three possible ways and may show random or preferential bivalent formation in different species or genotypes (Bourke et al., 2017). Alternatively, the four chromosomes may form a quadrivalent, which may lead to the phenomenon of double reduction, where identical alleles carried on the sister chromatids enter into the same gamete, causing systematic allelic segregation distortion. This may occur with a frequency of up to 25% in autotetraploids, but it never occurs in a diploid or allopolyploid meiosis (Luo et al., 2006). We have also shown that recombination frequency between a pair of loci can be as high as 75% under a tetrasomic model, compared with 50% in diploids (Luo et al., 2006). These complexities highlight the substantial differences in the patterns of gene segregation and recombination in autopolyploids compared with diploid species. Additional complexities include a high level of heterozygosity stemming from the outbreeding nature of autotetraploids and a much wider spectrum of gene segregation compared with diploids (Bingham, 1980).

Much research has focused on developing theory and methods for QTL analysis in autotetraploids. Methods have been proposed for genetic linkage analysis and QTL mapping in experimental populations of autopolyploid species (Hackett et al., 2001, 2014, 2017; Cao et al., 2005; Xu et al., 2014), and widely practised in QTL mapping analyses in polyploid species (Massa et al., 2015; Da Silva et al., 2017; Massa et al., 2018; Mengist et al., 2018; Bourke et al., 2019; da Silva Pereira et al., 2019). However, these studies have been based on various assumptions that have substantially avoided some complexities of the analyses, but in doing so have ignored some essential features of autotetrasomic inheritance and practical data analysis. Specifically, these refer to different patterns of gene segregation and recombination due to different pairings of homologous chromosomes during meiosis of autotetraploids. Strictly speaking, none of the existing methods in the literature has thoroughly incorporated these into the development of a method for QTL analyses in autotetraploid species as detailed in the following. To fill this theoretical and methodological gap in the field of quantitative genetics, we developed a novel likelihood‐based method for mapping of QTLs in outbred segregating populations of autotetraploid species.

Description

In general, there are two key components involved in the development of methods for mapping QTLs (Sen & Churchill, 2001), as detailed in the following. The first component is development of a quantitative genetic model, which links the QTL genes to their phenotypic effects on the trait. The second key step is tetrasomic linkage analysis involving a tested QTL and its surrounding genetic markers.

An outbred autotetraploid mapping population and data notation

The mapping population is created from crossing two autotetraploid parents, P1 and P2, and presents the first generation of segregation and recombination of genes carried by the two parental individuals. We consider a linkage map of m molecular marker loci, M 1, M 2, …, Mm, at each of which there could be up to eight different alleles segregating within the full‐sib mapping population. Let rj (j=1,2,,m-1) be the recombination frequency in the jth marker interval flanked by markers Mj and Mj +1, and αj (j=1,2,,m) be the coefficient of double reduction at the jth marker. The parents together with n offspring individuals are scored at the marker loci. Let oi=(oi,j)j=1,,m be a vector of marker phenotype for the ith offspring individual at the m marker loci. Similarly, p1=(p1,j)j=1,,m or p2=(p2,j)j=1,m is the marker phenotype for P1 or P2 at the m marker loci. oi,j, p1,j, and p2,j are given by 1 × 8 vectors for the jth marker locus, where 1 (or 0) indicates the presence (or absence) of a particular allele. The marker data may be in a form with or without allele dosage information. The trait phenotypic data yiY and the offspring marker data oiO (i=1,,n) are modelled through the likelihood function of the model parameters Ω={Ω1,Ω2,Ω3}, as shown in the following:

L(Ω|=O,Y)=i=1nPr{oi,yi|Ω1,Ω2,Ω3}=i=1nPr{yi|oi,Ω1,Ω2,Ω3}Pr{oi|Ω1,Ω2,Ω3}i=1=zi,joi,jzi,j+1oi,j+1k=04f(yi|qik,Ω1)Pr{qik|zi,jzi,j+1,Ω2}Pr{zi,jzi,j+1|oi,Ω3} (Eqn 1)

The model parameters in the likelihood function are organized as follows:

Ω={Ω1,Ω2,Ω3}={(μ,θ1,θ2,θ3,θ4,σ2),(rj,g1,j,g2,j,g1,j+1,g2,j+1,qP1,qP2),(α,r,g1,g2)}

and will be explained in the following sections on formulation of the probability distributions involved in the aforementioned likelihood function.

Orthogonal quantitative genetic model in autotetraploids

In Eqn 1, f(yi|qik=Gk,Ω1)=exp[-(yi-Gk)2/2σ2]/(2πσ2)1/2, with Ω1=(μ,θ1,θ2,θ3,θ4,σ2). The genotypic value of the kth genotype (Qkq(4-k)) at a putative biallelic QTL to be mapped is modelled through the orthogonal model Gk=μ+wk1θ1+wk2θ2+wk3θ3+wk4θ4, for k=0,1,...,4, representing the number of trait‐phenotype‐increasing alleles. Here, µ and σ 2 are the population mean and residual variance, θi (i=1,,4) are the monogenic, digenic, trigenic, and quadrigenic genetic effects of the QTL, and wkj (k=0,1,,4;j=1,,4) are the corresponding orthogonal contrast scales for the genetic effects of genotype k for the jth contrast (j=1,2,...,4). The rationale, statistical properties, and parameter estimation of the orthogonal quantitative genetic model are detailed in our recent work (Chen et al., 2018).

Tetrasomic linkage analysis for quantitative trait locus mapping in autotetraploids

We previously worked out the probability distribution of 136 possible two‐locus gamete genotypes in the offspring from crossing two autotetraploid parental individuals (Luo et al., 2004). By incorporating a biallelic QTL into this two‐locus linkage analysis, we have worked out the three‐locus gamete genotype distribution, ρqm, of an autotetraploid individual under a quadrivalent pairing model (Table 1), which is fully characterized by the double reduction parameter α at the marker locus closest to the centromere, and the recombination frequency parameters. This probability distribution involves a total of 2080 (136 × 16 − 6 × 12 − 6 × 4) different gamete genotypes at the QTL and its flanking markers. This number reduces to 64 if the homologous chromosomes undergo bivalent pairing in meiosis, as summarized in Supporting Information Table S1. Accordingly, one can work out ρq|m, the conditional probability of a gamete QTL genotype given the genotype of the flanking markers. By assuming random union of gametes from the two parents, we worked out the joint marker–QTL–marker zygote genotype probability distribution involving a total of 4326 400 (2080 × 2080) or 4096 (64 × 64) possible zygote genotypes under quadrivalent or bivalent pairing.

Table 1.

Probability distribution of diploid gamete genotypes at a quantitative trait locus (QTL) and its flanking marker loci from a quadrivalent meiosis of an autotetraploid individual.

graphic file with name NPH-230-387-g002.jpg

where γij=r1i(1-r1)2-ir2j(1-r2)2-j, α is the coefficient of double reduction at locus A and α=1-α, r 12 is the recombination frequency between locus A and B, r12=1-r12, and r 1 (or r 2) is the recombination frequency between the QTL and its left (or right) flanking marker.

In practice, many offspring genotypes may be identical because there are a smaller number of segregating alleles at the marker loci and can thus be sorted computationally together with their corresponding probabilities. We have developed a computer‐based algorithm and program to handle any number of segregating alleles at the marker loci. These enable calculation of Pr{qik|zi,jzi,j+1,Ω2} in Eqn 1, which is the conditional probability of the QTL genotype of individual i given its flanking marker genotype and the model parameters Ω2=(rj,g1,j,g2,j,g1,j+1,g2,j+1,qP1,qP2). Here, zi,jzi,j+1 is the genotype configuration for offspring i in the marker interval j flanked by markers Mj and Mj +1, rj is the recombination frequency in the jth marker interval, gi,j is the genotype of parent i (i=1,2) at the jth marker locus, and qPi is the marker–QTL–marker genotype configuration of parent i (i.e. with known linkage phase of the marker and QTL alleles). Although the QTL alleles and linkage phase are unknown in practice, we may search all possible configurations over the likelihood function (Eqn 1) and determine the most likely QTL configuration. Moreover, we established r12=r1+r2-4r1r2/3 to relate the recombination frequencies in the three‐locus tetrasomic linkage analysis, as detailed in Notes S1.

Conditional probability of flanking marker genotype

Pr{zi,jzi,j+1|oi,Ω3} in Eqn 1 is the conditional probability of the zygote genotype of individual i at the flanking markers j and j + 1 given all the marker phenotypes on the linkage group and the parental marker genotypes g 1 and g 2 at the marker loci. Ω3={α,r,g1,g2}, with α being a vector of the coefficient of double reduction at the marker loci, r is a vector of the recombination frequencies between the adjacent marker loci, and g 1 and g 2 are the parental genotypes at the marker loci. We previously developed a model based on the hidden Markov method to calculate the probability (Leach et al., 2010), as detailed in the present notation in Methods S1.

Quantitative trait locus interval mapping under different patterns of homologous chromosome pairing

It has been established earlier herein that the conditional probability distribution of QTL genotypes given flanking marker genotypes in an autotetraploid segregating population depends on the pattern of pairing between homologous chromosomes in meiosis. The probability distribution has been developed for quadrivalent (Table 1) or bivalent (Table S1) pairing. These probability distributions may be plugged into a statistically appropriate method for the QTL mapping analysis if one knows which of the two chromosome pairings occurs in the species of interest.

An obvious question arises that chromosome pairing behaviour is usually unknown a priori, and the homologous chromosomes may show a mixture of bivalent and quadrivalent pairings, as observed in many autotetraploid species, including potato (Quiros, 1982; Jones et al., 1996; Bradshaw, 2007; Bourke et al., 2015). To tackle the problem, we have first shown in Table 2 that the gamete genotype distribution under mixed chromosomal pairing has an almost identical pattern to that under quadrivalent pairing, except for the difference in value of the coefficient of double reduction. We show that the coefficient of double reduction at a locus under the mixed pairing model (α′) can be related to the coefficient in the complete quadrivalent pairing model (α) through the simple relationship α′ = λα (Methods S2), where λ is the frequency of quadrivalent chromosome pairing in meiosis. The deviations between the true distribution gi under the mixed pairings (the fifth column in Table 2) and the approximate distribution ζi are highlighted in bold. Note that, for a given recombination frequency between the two loci, these deviations will be smaller when λ is larger (i.e. when there is a higher frequency of quadrivalent pairing), which will make estimation of gamete probabilities under mixed chromosome pairing more precise. Conversely, when λ takes its smallest value of zero (i.e. complete bivalent pairing), the loss of information will be greatest. The Kullback–Leibler divergence from ζi to gi is given in the present context by:

DKL(g||ζ)=-igilogζigi=-0.301(r-0.585)2+0.103 (Eqn 2)

Table 2.

Probability distribution of diploid gamete genotypes at two linked loci with homologous chromosomes showing quadrivalent, bivalent, or a mixture of the two pairing patterns in an autotetraploid meiosis from an individual with genotype AiBi/AjBj/AkBk/AlBl.

Gametes (1i,j,k,l4) Double reduction occurred at Probabilities (gi, i=1,,11) Gamete genotype probabilities in mixed pairing meiosis
Quadrivalent meiosis Bivalent meiosis True value (gi, i=1,,11) Approximation (‍ζi‍, i=1,,11‍)
AiBi/AiBi
A and B
λ(1-r)2
α(1-r)2
α(1-r)2
AiBj/AiBj
A and B
λαr2/3
αr2/3
αr2/3
AiBi/AiBj
A
2λαr(1-r)
2αr(1-r)
2αr(1-r)
AiBj/AiBk
A
2λαr2/3
2αr2/3
2αr2/3
AiBi/AjBi
B
2λ(1-α)r(1-r)/3
2(1-α)r(1-r)/3-2(1-λ)r(1-r)/3
2(1-α)r(1-r)/3
AiBj/AkBj
B
2λ(1-α)r2/9
2(1-α)r2/9-2(1-λ)r2/9
2(1-α)r2/9
AiBi/AjBj
λ(1-α)(1-r)2
(1-λ)(1-r)2
(1-α)(1-r)2
(1-α)(1-r)2
AiBi/AjBk
4λ(1-α)r(1-r)/3
2(1-λ)r(1-r)
4(1-α)r(1-r)/3+2(1-λ)r(1-r)/3
4(1-α)r(1-r)/3
AiBj/AjBi
λ(1-α)r2/9
(1-α)r2/9-(1-λ)r2/9
(1-α)r2/9
AiBj/AjBk
4λ(1-α)r2/9
4(1-α)r2/9-4(1-λ)r2/9
4(1-α)r2/9
AiBj/AkBl
2λ(1-α)r2/9
(1-λ)r2
2(1-α)r2/9+7(1-λ)r2/9
2(1-α)r2/9

The coefficient of double reduction in a population undergoing complete quadrivalent (or mixed bivalent and quadrivalent) chromosome pairing is given by α (or α′), where α=λα and λ is the proportion of quadrivalent pairing in a mixture of quadrivalent and bivalent pairings. Dashes denote genotypes incompatible with bivalent pairing. The terms in bold are those stemming from the mixed homologous chromosome pairings.

Note that if 0.00 < r < 0.05, D KL(g′||ζ) will vary between 0 and 0.017, reflecting that very little information will be lost by approximating gi with ζi. In particular, note from Table 2 that the proportion of gametes with genotypes involving double reduction is the same for the true distribution and the approximate distribution with mixed chromosome pairing (i.e. ζ1+ζ2+ζ3+ζ4=g1+g2+g3+g4). The methods we have previously developed may therefore be used to estimate the average coefficient of double reduction in autotetraploids undergoing a mixture of quadrivalent and bivalent pairings in meiosis (Luo et al., 2000, 2004). These results rationalize use of the QTL mapping method developed in the present study under the quadrivalent pairing model in the case where homologous chromosomes actually undergo a mixture of quadrivalent and bivalent pairing. This will be tested through an intensive simulation study in the following.

Model parameter estimation

We work out the maximum likelihood estimates (MLEs) of the QTL genotype means as defined in the above, G^k and σ^2, through the EM algorithm (Dempster et al., 1977) and, in turn, the MLEs of the model parameters Ω1=(μ,θ1,θ2,θ3,θ4,σ2) through iteratively calculating the following two steps.

The E step calculates the probability of the ith individual having the kth QTL genotype in the jth marker interval given its phenotype and marker data as given in the following:

ωijqk=Prqik|yi,oi,Ω1,Ω2,Ω3=zi,joi,jzi,j+1oi,j+1f(yi|qik,Ω1)Prqik|zi,jzi,j+1,Ω2Przi,jzi,j+1|oi,Ω3k=04f(yi|qik,Ω1)Prqik|zi,jzi,j+1,Ω2 (Eqn 3)

Derivation of this equation is detailed in Methods S3.

The M step then updates the estimates of the genetic parameters from:

Gk=i=1nωiqkyii=1nωiqk (Eqn 4)
σ2=i=1nk=04ωiqk(yi-Gk)2n (Eqn 5)

The iterative algorithm is initiated using the sample variance for σ 2 and by using K‐means clustering to derive initial genotypic values for Gk. As the E and M steps are repeated iteratively following Eqns (Eqn 3), (Eqn 4), (Eqn 5), the likelihood function will increase and the estimated parameters will converge to the MLEs, G^k and σ^2. The genetic effects at the QTL can be solved from G^k using:

μ^θ^1θ^2θ^3θ^4=1w41w42w43w441w31w32w33w341w21w22w23w241w11w12w13w141w01w02w03w04-1G^4G^3G^2G^1G^0 (Eqn 6)

Calculation of wij, the orthogonal scales for the genetic effects of QTL genotype i (i=0,1,...,4) for the jth contrast (j=1,2,...,4), depends on the probability distribution of the QTL genotypes, which can be obtained from the E step in Eqn 3 as detailed in our previous work (Chen et al., 2018). It should be noted that some of the QTL genetic effects may not be estimable for some parental QTL genotype configurations. For example, the trigenic and quadrigenic genetic effects will be indeterminable for the parental QTL genotype configuration QQqq × qqqq because no relevant offspring QTL genotypes would be generated from the parental QTL genotypes.

With the MLEs of the QTL genotype effects, the likelihood ratio statistic for testing the presence of a QTL at a location characterized by rj 1, the recombination frequency between the QTL and its left flanking marker, is calculated as:

LOD(rj1)=log[L(G^k,σ^2|O,Y)/L(G~,σ~2|O,Y)] (Eqn 7)

where LOD is the logarithm of the odds. G~ and σ~2 are the estimates of genotypic effects and variance under the no QTL model, given by the mean and variance of the trait calculated from all individuals, assuming the phenotypes to be independently identically normally distributed.

Any possible location within each marker interval may be tested for the presence of QTLs to generate a LOD score profile for each chromosome. Permutation of the offspring phenotypic trait values over the corresponding marker data was used to see how the LOD scores distribute under the null hypothesis that there is no QTL present (Churchill & Doerge, 1994).

The foregoing analysis relies on the availability of information of QTLs and marker genotype and linkage phase information, which is unknown in practice. Assuming parent P1 has the higher trait value implying a larger number of trait‐increasing alleles Q, there are nine possible genotype configurations for P1 and P2, listed as (1) QQQQ×QQQq, (2) QQQQ×QQqq, (3) QQQQ×Qqqq, (4) QQQq×QQqq, (5) QQQq×Qqqq, (6) QQQq×qqqq, (7) QQqq×Qqqq, (8) QQqq×qqqq, and (9) Qqqq×qqqq. Taking all possible marker–QTL linkage phases into consideration, there will be up to 92 possible combinations of parental QTL genotypes. QTL interval mapping is carried out for each possible parental QTL genotype configuration. The Bayesian information criterion (BIC) (Schwarz, 1978) is calculated at each location tested as follows:

BIC=loge(n)k-2loge(L^) (Eqn 8)

where L^ is the maximized value of the likelihood function of the model, n is the population size, and k is the number of parameters estimated by the model (i.e. the population mean, genetic effects and the residual variance). The most likely parental QTL configuration should have the lowest BIC value, and the estimated genotypic value of parent P1 must be higher than that of parent P2. Note that there are eight parental QTL configurations for which at least one parental genotype does not appear in the offspring, listed as (1) QQQQ×qqqQ, (2) QQQQ×qqQq, (3) QQQQ×qQqq, (4) QQQQ×Qqqq, (5) QQQq×qqqq, (6) QQqQ×qqqq, (7) QqQQ×qqqq, and (8) qQQQ×qqqq. Under these eight models, we expect to obtain an identical likelihood profile with one of the other 84 parental QTL configurations. For example, the same likelihood profile is expected for genotype configuration (1) QQQQ×qqqQ and an alternative configuration, QQQQ×QQQq. However, the genetic effects would differ in sign under these two models, allowing the correct model to be distinguished, given that the monogenic effect must be positive by definition (Chen et al., 2018).

Results

Simulation data analysis

A series of simulation data sets were generated under simulation models I and II, which simulated different pairings of homologous chromosomes during meiosis, marker densities on simulated chromosomes, and particularly generated marker data either with (model I) or without (model II) allele dosage information, as detailed in Notes S2. These simulation data sets were analysed using the QvMethod (the method developed in the present study for modelling quadrivalent chromosome pairing) and/or the BvMethod (the method formulated in the study for modelling bivalent pairing). The methods showed 100% power to detect QTL with a trait heritability of 5% or 10% in a first‐generation segregating (S1) population of 300 individuals from crossing two autotetraploid parental lines differing by one, two, or three trait‐increasing alleles under the corresponding chromosome pairing model. Prediction accuracy of the parental QTL genotype configuration depends heavily on the heritability of the simulated QTL and improves when the difference in the number of the trait‐increasing alleles between the parental QTL genotypes increases. Incorrect prediction of QTL genotype configuration may lead to bias in the estimation of higher order genic (e.g. tri or quadrigenic) effects at the QTL (Tables S2, S3). The inferred location of QTLs was within 10 cM of the simulated position in > 70% cases when the heritability was 10%, whereas an expected reduction in accuracy to 44–67% was observed when heritability was only 5%. Similar results were obtained for simulation model II where marker allele dosage information is unknown, as explained in Notes S2 (Tables S4, S5). These results indicate that both BvMethod and QvMethod enable powerful detection of QTLs in outbred autotetraploid populations of a reasonable size and can accurately estimate QTL location and genetic effect parameters when the pairing behaviour of homologous chromosomes is known a priori to involve exclusively bivalent or exclusively quadrivalent pairing during meiosis.

To explore the robustness and appropriateness of implementing BvMethod or QvMethod to analyse the data generated under mixed quadrivalent and bivalent chromosomal pairing, we conducted a series of simulations involving 20 biallelic markers on a 100 cM chromosome, with parental QTL genotype QQQq×Qqqq and a trait heritability of 10%. Each population was generated from a given frequency λ of quadrivalent chromosome pairing, ranging from λ = 0 (complete bivalent pairing) to λ = 1 (complete quadrivalent pairing). It must be noted that when BvMethod is implemented with the simulation data generated with λ > 0, a proportion of offspring marker genotypes will be incompatible (i.e. not expected to be observed) under the assumption of bivalent chromosome pairing. Incompatibilities may arise in two different ways. First, due to the occurrence of double reduction; for example, a one‐locus gamete with genotype A 1 A 1 must have resulted from double reduction in a parent with genotype A 1 A 2 A 3 A 4. Second, the offspring haplotype must come from no more than two parental chromosomes if bivalent pairing occurs during meiosis. For example, a three‐locus gamete genotype A 1 B 1 C 1/A 2 B 3 C 4 is incompatible with bivalent pairing.

Table 3 summarizes the means and SEs of the MLEs of genetic effects at the QTL, and the empirical power across varying frequencies of quadrivalent pairing from λ = 0 to λ = 1. Individuals with incompatible genotypes at any of the 20 markers were removed from the data set before analysis with BvMethod. As the frequency of quadrivalent chromosomal pairing increased, estimation of higher order (digenic and trigenic) QTL genetic effects from BvMethod became increasingly biased. Note that the offspring genotypes needed for estimation of the quadrigenic effect (θ 4) are not present under the bivalent chromosomal pairing model. QvMethod estimated the higher order genetic effects much more accurately than BvMethod did, though note that the variance is inherently higher for higher level genetic effects, as detailed in Notes S3. The proportion of correctly predicted parental QTL genotype configurations dropped dramatically using BvMethod, even when the proportion of quadrivalent pairing was low (λ = 0.25), with QvMethod correctly predicting parental QTL genotype 68% of the time, compared with only 49% using BvMethod.

Table 3.

Parameter estimation from BvMethod or QvMethod based on 200 repeated simulations under a mixed chromosome pairing model.

Parameter (=true value) λ = 0.00 λ = 0.25 λ = 0.50 λ = 0.75 λ = 1.00
BvMethod QvMethod BvMethod QvMethod BvMethod QvMethod BvMethod QvMethod BvMethod QvMethod
μ (=10) 10.12 (0.10) 10.14 (0.10) 9.93 (0.10) 9.56 (0.10) 10.15 (0.11) 9.78 (0.13) 10.35 (0.13) 9.39 (0.19) 10.52 (0.13) 10.03 (0.11)
θ 1 (=10) 8.76 (0.17) 8.55 (0.16) 8.69 (0.19) 8.90 (0.17) 7.98 (0.28) 9.06 (0.17) 7.98 (0.29) 9.25 (0.18) 7.99 (0.31) 9.37 (0.15)
θ 2 (=−6) −4.58 (0.40) −4.78 (0.40) −4.59 (0.44) −4.95 (0.44) −3.59 (0.53) −5.04 (0.38) −3.47 (0.62) −5.13 (0.35) −3.19 (0.66) −5.43 (0.30)
θ 3 (=3) −6.99 (1.79) −6.15 (1.84) −4.76 (1.40) −0.62 (1.01) −11.75 (1.75) 1.53 (0.80) −12.96 (2.02) 2.09 (0.76) −12.67 (1.98) 1.86 (0.74)
θ 4 (=−1) −1.18 (2.77) −3.62 (2.24) 0.87 (2.46) 0.38 (2.25)
σ^
21.62 (0.07) 21.64 (0.07) 22.60 (0.07) 22.61 (0.07) 23.63 (0.08) 23.63 (0.07) 24.58 (0.08) 24.69 (0.07) 25.13 (0.09) 25.25 (0.08)
σ true 21.69 22.67 23.60 24.50 25.35
Individuals removed 0.0828 (0.0009) 0.1670 (0.0015) 0.2515 (0.0017) 0.3344 (0.0019)
Empirical Power 1.00 1.00 1.00 1.00 1.00 1.00 0.99 1.00 0.98 1.00
Q genotype0 0.660 0.665 0.490 0.680 0.395 0.675 0.345 0.715 0.260 0.755
Q genotype1 0.980 0.980 0.965 0.990 0.86 0.975 0.835 0.980 0.815 0.995
Accuracy (cM) 1.66 (0.66) 1.75 (0.73) 3.51 (0.74) 2.75 (0.54) 5.68 (0.98) 2.17 (0.59) 8.395 (1.206) 3.05 (0.65) 10.145 (1.262) 2.56 (0.66)
Proportion in (± 10 cM) 0.845 0.830 0.820 0.860 0.720 0.840 0.600 0.820 0.575 0.855

The simulated parental quantitative trait locus (QTL) genotype was QQQq×Qqqq. The proportion of quadrivalent chromosome pairing is given by λ. Q genotype0 (Q genotype1) represents the proportion of predicted parental QTL genotype configurations exactly matching (differing by only a single allele) the simulated QTL genotypes. The mean and SEs of parameter estimates are given based on 200 replicate simulations. Individuals with at least one marker genotype that is unexpected from the theoretical genotype distribution under a model of bivalent chromosome pairing are removed from the simulated data sets before analysis with BvMethod. Accuracy (cM) is the distance between the true QTL location and its inferred location. Heritability was equal to 10%.

QvMethod is substantially more accurate than BvMethod in locating the QTLs when the homologous chromosomes show a mixture of both bivalent and quadrivalent pairings. As quadrivalent pairing increases in frequency, the accuracy of BvMethod drops markedly.

For example, at λ = 0.50, the QTL detected is 5.68 ± 0.98 cM away from the true QTL location using BvMethod, but this is narrowed down to only 2.17 ± 0.59 cM by using QvMethod. At higher values of λ, the improvement in accuracy through use of QvMethod becomes more pronounced. Note also that performance of QvMethod is comparable to that of BvMethod even when the homologous chromosomes undergo pure bivalent pairing (λ = 0). A similar pattern of results was obtained in the analyses where individual incompatible genotypes were treated as missing data, as others (Hackett et al., 2014, 2017) have done elsewhere (Tables S6, S7). The degree of improvement in accuracy through use of QvMethod will naturally depend on the particular parental genotype configuration (Tables S6, S7), and hence on how extreme the difference is between the true offspring genotype distribution and the genotype distribution obtained under the assumption of bivalent pairing.

Experimental data analysis

We demonstrated the utility of QvMethod in real experimental data analysis using single nucleotide polymorphism (SNP) and phenotype data sets collected by Bradshaw et al. (2008) and repeatedly analysed by Hackett et al. (2014, 2017). The data were collected from 190 offspring individuals generated from a cross between two autotetraploid parental potato lines, ‘Stirling’ and 12601ab1. As described elsewhere (Hackett et al., 2017), this outbred segregating population was scored for 12 quantitative traits. The methods described in Hackett et al. (2001, 2014) have recently been engineered into the Windows‐supported software tetraploidsnpmap, which enables genetic linkage map reconstruction and QTL mapping analysis in autotetraploids (Hackett et al., 2017). The software is illustrated by mapping QTLs for the 12 agronomic traits on linkage group V reconstructed with 119 SNP markers, which corresponds to potato (S. tuberosum) chromosome V. Individual marker genotypes incompatible with the assumption of bivalent pairing were removed from the data by the authors. To demonstrate the method we have developed (i.e. QvMethod) and enable a direct comparison with the method in TetraploidSNPMap (abbreviated here as H2017), we focused here on QTL mapping analysis with the 12 traits on linkage group V (though we have scanned QTLs for these traits over all 12 potato chromosomes). In the real data analysis, an empirical mapping resolution was defined as the size (in centimorgans) of the chromosomal interval over which the LOD score profile dropped by a value of 2.0 on both sides of the QTL peak.

Fig. 1 illustrates the LOD score profiles of QTL detection for QvMethod (red lines) and H2017 (green lines) for the 12 potato agronomic traits. The significance thresholds (dotted lines) were calculated from 100 permutation tests of the LOD score test statistic for the corresponding method. QvMethod clearly shows a markedly higher statistical power for detecting the QTLs than the bivalent‐pairing‐based H2017 method does. In particular, QvMethod detected QTLs for four of the 12 traits (tuber size, Fry10, Globodera cyst counts, and flower colour) that were missed by the H2017 method (Fig. 1; Table 2). Table 4 shows that QvMethod confers a substantially higher QTL mapping resolution than the H2017 method does. Table 2 also shows that the QTLs detected by QvMethod usually explain a larger proportion of genetic variance of the trait (Vg%) compared with the QTLs detected by H2017. These observations suggest that H2017, a method strictly based on a bivalent chromosomal pairing model, may fail to detect QTLs that have a small genetic effect on a quantitative trait. For a given Vg%, the estimated proportion of additive genetic variance of the QTLs detected, ρ^, indicates that the QTLs detected by both methods contribute largely additive genetic variation to the trait phenotype variation. The notable exception is tuber yield, which may reflect the particularly complex genetic architecture of this trait, both in potato (Schonhals et al., 2017) and other crops (Shi et al., 2009). Note that three of the four QTLs that were missed by H2017 (Fry10, Gtc, Fc) have a relatively small additive genetic variance, reflecting the fact that H2017 only models additive QTL genetic effects.

Figure 1.

Figure 1

Logarithm of odds (LOD) score profiles of quantitative trait loci (QTLs) mapped on potato chromosome V for 12 agronomic traits of autotetraploid potato (Solanum tuberosum). The LOD score profiles were derived from QvMethod (red lines) and H2017 (green lines). The dotted lines represent the corresponding significance thresholds from 100 permutation tests. The red bars indicate the most likely chromosomal locations of the QTLs detected by QvMethod, and listed on the chromosomal x‐axes are the markers closest to the detected QTLs. The 12 traits are yield (fresh tuber weight in kilograms per plot), Ht (canopy height), size (tuber), shape (tuber), DM (dry matter), Fry4 and Fry10 (frying colour at 4°C and 10°C), Mat (maturity), Fb4 (foliage blight), Tb% (tuber blight), Gtc (Globodera pallida cyst counts), and Fc (flower colour). H2017 represents the method developed in Hackett et al. (2017).

Table 4.

Empirical mapping resolution and estimates of genetic effects of quantitative trait loci (QTL) on chromosome V for 12 traits in potato (Solanum tuberosum).

Trait Resolution (cM) Vg (%) QvMethod parameter estimates
QvMethod H2017 QvMethod H2017 Parental QTL
μ^
θ^1
θ^2
θ^3
θ^4
σ^
ρ^ (%)
Yield 4 38 9.04 7.88
QqQQ×QqQq
7.98 0.16 −1.29 1.29 6.61 1.76 5.52
Ht 2 10 32.08 35.59
QQQq×qqqq
53.00 8.55 11.97 6.5 95.90
Size 3 ns 8.46 ns
qqQQ×qqQq
4.29 0.26 −0.18 −0.55 1.45 0.76 77.23
Shape 3 23 23.01 14.59
qqqQ×QqqQ
4.26 0.37 0.39 −0.22 −0.51 0.60 77.86
Dm 3 35 14.59 6.82
qqqQ×QqqQ
22.14 0.47 0.65 −1.54 −1.68 1.21 55.00
Fry4 3 37 11.58 6.03
qQqQ×Qqqq
4.08 0.28 −0.21 0.18 0.88 0.66 85.74
Fry10 11 ns 4.30 ns
qQqQ×QqQQ
6.34 0.13 −0.37 0.19 0.04 0.63 31.29
Mat 3 8 57.08 57.91
QQQq×qqqq
4.76 2.27 2.17 1.01 98.49
Fb4 1 14 19.78 17.89
QQQq×qqqq
4.87 1.77 −0.48 1.81 99.91
Tb% 3 13 26.48 20.70
qqqQ×qqqq
56.94 27.88 27.95 23.78 98.50
Gtc 4 ns 12.44 ns
QQqQ×Qqqq
4.26 0.33 1.27 −1.68 −4.96 1.34 22.47
Fc 5 ns 11.75 ns
qqQq×qQqQ
1.44 0.14 0.04 0.81 1.68 0.46 43.10

The empirical QTL mapping resolution is defined as the chromosomal region in centimorgans over which the logarithm of odds score drops by a value of 2.0 on each side of the QTL peak. Vg (%) is the proportion of genetic variance explained by the QTL detected. In the predicted parental QTL genotype, Q (or q) indicates the trait increasing (or decreasing) allele. μ^, θ^1, θ^2, θ^3, θ^4, and σ^ are the estimated population mean, monogenic, digenic, trigenic, quadrigenic effects, and residual error for the QTL detected based on the orthogonal contrast quantitative genetic model. ρ^ is the estimated proportion of genetic variance explained by additive QTL genetic effects. The 12 traits are yield (fresh tuber weight in kilograms per plot), Ht (canopy height), size (tuber), shape (tuber), DM (dry matter), Fry4 and Fry 10 (frying colour at 4°C and 10°C), Mat (maturity), Fb4 (foliage blight), Tb% (tuber blight), Gtc (Globodera pallida cyst counts), and Fc (flower colour). Dashes denote indeterminable parameters, because the relevant offspring genotypes are not observable for the given predicted parental QTL genotypes. ns denotes nonsignificant QTL. H2017 represents the method described in Hackett et al. (2017).

Significance of chromosomal pairing patterns in quantitative trait locus mapping efficiency

The previously mentioned simulation study and real data analysis demonstrated that QvMethod outperforms other methods that assume only bivalent chromosomal pairing in the meiosis of autotetraploids, both in terms of QTL detection power and mapping resolution. This thus urges the essentiality of modelling the complex features of tetrasomic gene segregation and recombination for QTL mapping in autotetraploid species and motivated us to explore the difference in statistical efficiency between the two types of method.

We explored how the different chromosomal pairings affect the genetic structure of mapping populations. It is clear from Table 2 that the genotypic distribution of the mapping population differs substantially according to the mode of chromosome pairing (bivalent or quadrivalent), not only in the probability of a genotype, but also in the number of possible genotypes. In particular, if homologous chromosomes undergo quadrivalent pairing, there could be a substantial proportion of individuals in the mapping population carrying gametes from double reduction events (hereafter referred to as double reduction gametes) and hence being incompatible with the assumption of bivalent pairing. Fig. 2(a) (red dotted line) shows that the expected proportion of individuals carrying double reduction gametes increases with distance from the centromere and could be as high as 40%, but only a small number of incompatible offspring genotypes are observable. Furthermore, for three of the 12 possible parental genotype configurations, no incompatible offspring genotypes can be observed. A substantial gap therefore exists between the incompatible genotypes that can be observed directly from the marker data (Fig. 2a, coloured solid lines) and the expected number of incompatible genotypes (Fig. 2a, red dotted line). We also investigated the influence of excluding individuals with observable incompatible marker data on estimation of the level of double reduction, based on 100 repeated simulations with 200 offspring individuals from crossing two autotetraploid parents. The distribution of double reduction estimates is far below the simulated value for each of nine possible parental genotype configurations when the incompatible data are removed (Fig. 2b). This shows how excluding only a small (i.e. the observable) fraction of incompatible individuals is very sensitive in weakening the signal of double reduction.

Figure 2.

Figure 2

Occurrence and estimation of double reduction at a single locus in an offspring population from crossing two outbred autotetraploids under a quadrivalent pairing model. An offspring population of 200 individuals was simulated for a single locus based on a cross between two outbred parents with nine possible parental quantitative trait locus (QTL) genotype configurations. (a) The expected (red dotted line) and observable (solid lines) proportion of individuals carrying double reduction gametes in the population from crossing different parental QTL genotypes. (b) Distribution of estimates for the coefficient of double reduction in the population from crossing nine possible parental QTL genotypes (labelled on horizontal axis) when offspring individuals carrying observable double reduction gametes were excluded from the data set before the analysis. The simulated value is indicated by the red dotted line. Each simulation was repeated 100 times.

These observations largely, if not fully, explain why the development of QvMethod in this study outperforms the bivalent‐pairing‐based methods when all or some of the homologous chromosomes undergo quadrivalent pairing during meiosis, and indicate that the bivalent‐pairing‐based methods, strictly speaking, cannot truly model and analyse the full spectrum of data from real QTL mapping experiments in autotetraploid species.

Discussion

Mapping of QTLs is a key initial step towards understanding the molecular mechanisms underpinning quantitative genetic variation. It has taken decades from the conception of an idea (Sax, 1923) to maturation of the methods (Lander & Botstein, 1989; Sen & Churchill, 2001) in diploid species, and QTL mapping is now a routine genetic analysis in almost all evolutionarily and economically important diploid species. However, linkage analysis with autotetraploids has remained a historical challenge since the work of prominent geneticists such as J. B. S. Haldane, K. Mather, and R. A. Fisher (Haldane, 1930; Mather, 1936; Fisher, 1947). Although a breakthrough on this topic was made in our previous work (Luo et al, 2004, 2006; Leach et al., 2010), linkage analysis involving QTLs under a tetrasomic inheritance model, the theoretical basis for mapping QTLs, has remained a theoretical bottleneck and a methodological gap in the field of quantitative genetic analysis with autotetraploid species.

Here, we have presented a theoretical and methodological breakthrough that enables QTL mapping to be carried out in autotetraploid species on the basis of a rigorous tetrasomic inheritance model. The methods account properly for the key features of gene segregation and recombination when homologous chromosomes undergo bivalent and quadrivalent pairing, as well as a mixture of the two modes of chromosome pairing in the meiosis of autotetraploid species. Hackett et al. (2001, 2014, 2017) pioneered the development of statistical methods for QTL mapping in autotetraploids that assume exclusively bivalent chromosomal pairing, and have been widely cited in the literature. This assumption may substantially simplify the statistical analyses so as to enable the methodology development, but it comes at the price of sacrificing the key features of tetrasomic inheritance in autotetraploid species.

Quadrivalent chromosomal pairing has been observed in various autotetraploid plants, with a frequency varying from up to 10% in kiwi (Wu et al., 2014), 20–30% in potato (Bourke et al., 2015), and up to 36% in Pennisetum orientale (Deniz & Dogru, 2006). In these cases, bivalent‐based methods cannot deal with the full spectrum of the data because there will be a large proportion of observed data that is incompatible with the expected distribution under the bivalent pairing assumption. In practice, it has been proposed to exclude the incompatible data from QTL mapping analysis when using bivalent‐based methods (Hackett et al., 2014, 2017; da Silva Pereira et al., 2019). We have shown that the chance to directly observe the incompatible data is very low or even impossible, because a substantial proportion of individuals carry gametes from double reduction events that are undetectable in the mapping population and therefore cannot be excluded from the data. This creates a serious problem, whereby the genetic structure of the mapping population deviates significantly from that expected under bivalent chromosome pairing, even when the observable incompatible data are removed. Our development of QvMethod accounts for quadrivalent chromosome pairing and therefore outperforms the bivalent‐based methods both in terms of statistical power of QTL detection and accuracy of locating QTLs, as demonstrated through simulation study and analysis with real data from a segregating population of autotetraploid potato (S. tuberosum). Recently, Bourke et al. (2019) explored factors affecting QTL analysis in polyploids by combining the computer software ‘tetraorigin’, which was designed by Zheng et al. (2016) to predict the parental origin of marker alleles in autotetraploid populations, and a simplified additive genetic model into QTL mapping analysis. However, the method described in Bourke et al. (2019) lacks an essential component; that is, genetic linkage analysis between QTLs and surrounding markers for QTL mapping in an outbred autotetraploid segregating population, as we have presented here.

The QTL mapping method presented here is based on a biallelic quantitative genetic model detailed in our recent work (Chen et al., 2018). A full multiple allele quantitative genetic model for a tetraploid individual, such as the one defined in Kempthorne (1957), is statistically intractable because it involves a total of 96 genetic parameters (Hackett et al., 2001). This number increases to 120 in a segregating population from crossing two parental lines divergent at all four alleles of a QTL (8 monogenic, 28 digenic, 48 trigenic, and 36 quadrigenic effects). A simplified version of this multiple allele model has been implemented by either setting linear constraints on the genetic effects, as in Hackett et al. (2001, 2014, 2017), or by modelling only some of the genetic effects of a full model (e.g. additive effects), as in Bourke et al. (2019). Using these simplified models, estimates of the genetic parameters may be biased and difficult to interpret because many other genetic effects (e.g. interactive effects between QTL alleles) are ignored. To solve the intractability problem and to adequately estimate the genetic parameters in a full model, we proposed an orthogonal contrast‐based model for modelling quantitative genetic effects in various autotetraploid populations, as detailed in Chen et al. (2018), and implement it in the present QTL mapping method. The biallelic model focuses on genetic effects of QTL alleles on trait phenotypes and models their increasing or decreasing effects on the traits. It is statistically tractable and completely models all genetic effects (additive and interactive) at a putative QTL. In particular, the model does not imply that there are only two alleles on the molecular level (i.e. sequence variants), but instead the two ‘alleles’ define the increasing or decreasing quantitative genetic effects of QTL alleles on the trait.

In addition to trait phenotype data, molecular marker data present the other essential component in any QTL analysis. New‐generation genomic DNA sequencing techniques enable discovery and generation of a large number of DNA sequence polymorphic markers in a tested genome. Genotyping by sequencing (GBS) is relatively straightforward in diploid species, though serious consideration must be given to several major sources of variation in collecting and processing the sequencing data for accurate identification of allele‐specific sequencing reads (Geijn et al., 2015). GBS in tetraploids is a much more challenging task and involves distinguishing the number of each constituent allele (i.e. the allele dosage) in a heterozygote genotype (e.g. Uitdewilligen et al., 2013). Though methods for diagnosis of allele dosage from DNA sequencing data in tetraploids are beginning to emerge (Margarido & Heckerman, 2015; Gerard et al., 2018), this task remains an incredible challenge and is ripe for further investigation. The QTL methods developed in this study can utilize marker data with or without known allele dosage information. Hackett et al. (2014) showed that use of marker dosage allele information may improve the efficiency of QTL mapping analysis. An open question arises as to what extent the biased diagnosis of allele dosage would affect the efficiency of mapping QTLs in autotetraploids.

Author contributions

ZL conceived of and designed the study. ZL and JC designed the theoretical model and statistical methods. JC and ZL implemented the statistical methods. JC analysed the data with inputs from ZL and LL. Contributions to data collection and preparation of the project were made by JY, FZ, QT, ZD and YC. The paper was written by ZL, JC and LL.

Supporting information

Methods S1 Conditional probability of flanking marker genotypes given their surrounding marker data.

Methods S2 Relationship between the coefficients of double reduction at linked loci under the mixed chromosome pairing model.

Methods S3 The EM algorithm for maximum likelihood estimation of QTL genetic effects.

Notes S1 Relationship between recombination frequencies of QTL and flanking markers under quadrivalent and bivalent chromosome pairing models.

Notes S2 Simulation model and parameters.

Notes S3 Variance in the estimation of genetic effect parameters.

Table S1 Probability distribution of diploid gamete genotypes at a QTL and its flanking marker loci from a bivalent meiosis of an autotetraploid individual.

Table S2 Parameter estimation from BvMethod based on 100 simulations with biallelic SNP markers under a bivalent chromosome pairing model.

Table S3 Parameter estimation from QvMethod based on 100 simulations with biallelic SNP markers under a quadrivalent chromosome pairing model.

Table S4 Parameter estimation from BvMethod based on 100 simulations with multi‐allelic markers under a bivalent chromosome pairing model.

Table S5 Parameter estimation from BvMethod or QvMethod based on 200 repeated simulations under a mixed chromosome pairing model.

Table S6 Parameter estimation from BvMethod or QvMethod based on 200 repeated simulations under a mixed chromosome pairing model with simulated parental QTL genotypes QQQq × Qqqq.

Table S7 Parameter estimation from BvMethod or QvMethod based on 200 repeated simulations under a mixed chromosome pairing model with simulated parental QTL genotypes QQqq × qqqq.

Please note: Wiley Blackwell are not responsible for the content or functionality of any Supporting Information supplied by the authors. Any queries (other than missing material) should be directed to the New Phytologist Central Office.

Acknowledgements

The study was supported by research grants from BBSRC (BB/N008952/1) in the UK, the National Nature Science Foundation of China (grant nos. 31671328 and 31871240). and the Leverhulme Trust. Two reviewers and editors are thanked for their critical comments and suggestions that have helped clarify several ambiguities and improve presentation of the paper. The authors declare no competing financial interests.

Data availability statement

The methods, BvMethod and QvMethod, developed in this paper have been built into an R package and can be downloaded together with the data sets analysed in the present study from Figshare with permanent https://doi.org/10.6084/m9.figshare.11298179.

References

  1. Bingham ET. 1980. Maximizing heterozygosity in autopolyploids. In: Lewis WH, ed. Polyploidy: biological relevance. Boston, MA, USA: Springer, 471–489. [Google Scholar]
  2. Bourke PM, Arens P, Voorrips RE, Esselink GD, Koning‐Boucoiran CF, van't Westende WPC, Santos Leonardo T, Wissink P, Zheng C, van Geest G et al. 2017. Partial preferential chromosome pairing is genotype dependent in tetraploid rose. The Plant Journal 90: 330–343. [DOI] [PubMed] [Google Scholar]
  3. Bourke PM, Hackett CA, Voorrips RE, Visser RGF, Maliepaard C. 2019. Quantifying the power and precision of QTL analysis in autopolyploids under bivalent and multivalent genetic models. G3: Genes, Genomes, Genetics 9: 2107–2122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bourke PM, Voorrips RE, Visser RG, Maliepaard C. 2015. The double‐reduction landscape in tetraploid potato as revealed by a high‐density linkage map. Genetics 201: 853–863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bradshaw JE. 2007. The canon of potato science: 4. Tetrasomic inheritance. Potato Research 50: 219–222. [Google Scholar]
  6. Bradshaw JE, Hackett CA, Pande B, Waugh R, Bryan B. 2008. QTL mapping of yield, agronomic and quality traits in tetraploid potato (Solanum tuberosum subsp. tuberosum). Theoretical and Applied Genetics 116: 193–211. [DOI] [PubMed] [Google Scholar]
  7. Cao D, Craig BA, Doerge RW. 2005. A model selection‐based interval‐mapping method for autopolyploids. Genetics 169: 2371–2382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chen J, Zhang F, Wang L, Leach LJ, Luo ZW. 2018. Orthogonal contrast based models for quantitative genetic analysis in autotetraploid species. New Phytologist 220: 332–346. [DOI] [PubMed] [Google Scholar]
  9. Churchill GA, Doerge RW. 1994. Empirical threshold values for quantitative trait mapping. Genetics 138: 963–971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Da Silva WL, Ingram J, Hackett CA, Coombs JJ, Douches D, Bryan GJ, De Jong W, Gray S. 2017. Mapping loci that control tuber and foliar symptoms caused by PVY in autotetraploid potato (Solanum tuberosum L.). G3: Genes, Genomes, Genetics 7: 3587–3595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Dempster AP, Laird NM, Rubin DB. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological) 39: 1–38. [Google Scholar]
  12. Deniz B, Dogru U. 2006. Meiotic behaviour in natural diploid, tetraploid, and commercial diploid crested wheatgrass. New Zealand Journal of Agricultural Research 49: 405–409. [Google Scholar]
  13. Fisher RA. 1947. The theory of linkage in polysomic inheritance. Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences 23: 55–87. [Google Scholar]
  14. Gerard D, Ferrao LFV, Garcia AAF, Stephens M. 2018. Genotyping polyploids from messy sequencing data. Genetics 210: 789–807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Hackett CA, Boskamp B, Vogogias A, Preedy KF, Milne I. 2017. TetraploidSNPMap: software for linkage analysis and QTL mapping in autotetraploid populations using SNP dosage data. Journal of Heredity 108: 438–442. [Google Scholar]
  16. Hackett CA, Bradshaw JE, Bryan GJ. 2014. QTL mapping in autotetraploids using SNP dosage information. Theoretical and Applied Genetics 127: 1885–1904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hackett CA, Bradshaw JE, McNicol JW. 2001. Interval mapping of quantitative trait loci in autotetraploid species. Genetics 159: 1819–1832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Haldane JBS. 1930. Theoretical genetics of autopolyploids. Journal of Genetics 22: 359–372. [Google Scholar]
  19. Jones GH, Khazanehdari KA, Ford‐Lloyd BV. 1996. Meiosis in the leek (Allium porrum L.) revisited. II. Metaphase I observations. Heredity 76: 186–191. [Google Scholar]
  20. Kempthorne O. 1957. An introduction to genetic statistics. New York, NY, USA: John Wiley & Sons. [Google Scholar]
  21. Lander ES, Botstein D. 1989. Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121: 185–199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Leach LJ, Wang L, Kearsey MJ, Luo ZW. 2010. Multilocus tetrasomic linkage analysis using hidden Markov chain model. Proceedings of the National Academy of Sciences, USA 107: 4270–4274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Luo ZW, Hackett CA, Bradshaw JE, McNicol JW, Milbourne D. 2000. Predicting parental genotypes and gene segregation for tetrasomic inheritance. Theoretical and Applied Genetics 100: 1067–1073. [Google Scholar]
  24. Luo ZW, Zhang RM, Kearsey MJ. 2004. Theoretical basis for genetic linkage analysis in autotetraploid species. Proceedings of the National Academy of Sciences, USA 101: 7040–7045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Luo ZW, Zhang Z, Leach L, Zhang RM, Bradshaw JE, Kearsey MJ. 2006. Constructing genetic linkage maps under a tetrasomic model. Genetics 172: 2634–2645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Margarido GRA, Heckerman D. 2015. ConPADE: genome assembly ploidy estimation from next‐generation sequencing data. PLoS Computational Biology 11: e1004229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Massa AN, Manrique‐Carpintero NC, Coombs JJ, Haynes KG, Bethke PC, Brandt TL, Gupta SK, Yencho GC, Novy RG, Douches DS. 2018. Linkage analysis and QTL mapping in a tetraploid russet mapping population of potato. BMC Genetics 19: e87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Massa AN, Manrique‐Carpintero NC, Coombs JJ, Zarka DG, Boone AE, Kirk WW, Hackett CA, Bryan GJ, Douches DS. 2015. Genetic linkage mapping of economically important traits in cultivated tetraploid potato (Solanum tuberosum L.). G3: Genes, Genomes, Genetics 5: 2357–2364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Mather K. 1936. Segregation and linkage in autotetraploids. Journal of Genetics 32: 287–314. [Google Scholar]
  30. Mengist MF, Alves S, Griffin D, Creedon J, McLaughlin MJ, Jones PW, Milbourne D. 2018. Genetic mapping of quantitative trait loci for tuber‐cadmium and zinc concentration in potato reveals associations with maturity and both overlapping and independent components of genetic control. Theoretical and Applied Genetics 131: 929–945. [DOI] [PubMed] [Google Scholar]
  31. Quiros CF. 1982. Tetrasomic segregation for multiple alleles in alfalfa. Genetics 101: 117–127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Sax K. 1923. The association of size difference with seed‐coat pattern and pigmentation in PHASEOLUS VULGARIS . Genetics 8: 552–560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Schonhals EM, Ding J, Ritter E, Paulo MJ, Cara N, Tacke E, Hofferbert HR, Lubeck J, Strahwald J, Gebhardt C. 2017. Physical mapping of QTL for tuber yield, starch content and starch yield in tetraploid potato (Solanum tuberosum L.) by means of genome wide genotyping by sequencing and the 8.3 K SolCAP SNP array. BMC Genomics 18: e642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Schwarz G. 1978. Estimating the dimension of a model. Annals of Statistics 6: 461–464. [Google Scholar]
  35. Sen S, Churchill GA. 2001. A statistical framework for quantitative trait mapping. Genetics 159: 371–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Shi JQ, Li RY, Qiu D, Jiang CC, Long Y, Morgan C, Bancroft I, Zhao JY, Meng JL. 2009. Unraveling the complex trait of crop yield with quantitative trait loci mapping in Brassica napus . Genetics 182: 851–861. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. da Silva Pereira G, Gemenet DC, Mollinari M, Olukolu BA, Wood JC. 2019. Multiple QTL mapping in autopolyploids: a random‐effect model approach with application in a hexaploid sweetpotato full‐sib population. bioRxiv . 10.1101/622951 [DOI] [PMC free article] [PubMed]
  38. Uitdewilligen JGAM , Wolters AMA, D'hoop BB, Borm TJA, Visser RGF, van Eck HJ. 2013. A next‐generation sequencing method for genotyping‐by‐sequencing of highly heterozygous autotetraploid potato. PLoS ONE 8: e62355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. van de Geijn B, McVicker G, Gilad Y, Pritchard JK. 2015. Wasp: allele‐specific software for robust molecular quantitative trait locus discovery. Nature Methods 12: 1061–1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Wu JH, Datson PM, Manako KI, Murray BG. 2014. Meiotic chromosome pairing behaviour of natural tetraploids and induced autotetraploids of Actinidia chinensis . Theoretical and Applied Genetics 127: 549–557. [DOI] [PubMed] [Google Scholar]
  41. Xu F, Lyu Y, Tong C, Wu W, Zhu X, Yin D, Yan Q, Zhang J, Pang X, Tobias CM et al. 2014. A statistical model for QTL mapping in polysomic autotetraploids underlying double reduction. Briefings in Bioinformatics 15: 1044–1056. [DOI] [PubMed] [Google Scholar]
  42. Zheng C, Voorrips RE, Jansen J, Hackett CA, Ho J, Bink MC. 2016. Probablistic multilocus haplotype reconstruction in outcrossing tetraploids. Genetics 203: 119–131. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Methods S1 Conditional probability of flanking marker genotypes given their surrounding marker data.

Methods S2 Relationship between the coefficients of double reduction at linked loci under the mixed chromosome pairing model.

Methods S3 The EM algorithm for maximum likelihood estimation of QTL genetic effects.

Notes S1 Relationship between recombination frequencies of QTL and flanking markers under quadrivalent and bivalent chromosome pairing models.

Notes S2 Simulation model and parameters.

Notes S3 Variance in the estimation of genetic effect parameters.

Table S1 Probability distribution of diploid gamete genotypes at a QTL and its flanking marker loci from a bivalent meiosis of an autotetraploid individual.

Table S2 Parameter estimation from BvMethod based on 100 simulations with biallelic SNP markers under a bivalent chromosome pairing model.

Table S3 Parameter estimation from QvMethod based on 100 simulations with biallelic SNP markers under a quadrivalent chromosome pairing model.

Table S4 Parameter estimation from BvMethod based on 100 simulations with multi‐allelic markers under a bivalent chromosome pairing model.

Table S5 Parameter estimation from BvMethod or QvMethod based on 200 repeated simulations under a mixed chromosome pairing model.

Table S6 Parameter estimation from BvMethod or QvMethod based on 200 repeated simulations under a mixed chromosome pairing model with simulated parental QTL genotypes QQQq × Qqqq.

Table S7 Parameter estimation from BvMethod or QvMethod based on 200 repeated simulations under a mixed chromosome pairing model with simulated parental QTL genotypes QQqq × qqqq.

Please note: Wiley Blackwell are not responsible for the content or functionality of any Supporting Information supplied by the authors. Any queries (other than missing material) should be directed to the New Phytologist Central Office.

Data Availability Statement

The methods, BvMethod and QvMethod, developed in this paper have been built into an R package and can be downloaded together with the data sets analysed in the present study from Figshare with permanent https://doi.org/10.6084/m9.figshare.11298179.


Articles from The New Phytologist are provided here courtesy of Wiley

RESOURCES