Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Nov 7.
Published in final edited form as: Theor Popul Biol. 2003 Jun;63(4):281–293. doi: 10.1016/s0040-5809(03)00020-0

Genealogy-dependent variation in viability among self-incompatibility genotypes

Marcy K Uyenoyama 1,*
PMCID: PMC6836294  NIHMSID: NIHMS1009244  PMID: 12742174

Abstract

Many hermaphroditic plants avoid self-fertilization by rejecting pollen that express genetically determined specificities in common with the pistil. The S-locus, comprising the determinants of pistil and pollen specificity, typically shows extremely high polymorphism, with dozens to hundreds of specificities maintained within species. This article explores a conjecture, motivated by empirical findings, that the expression of recessive deleterious factors at sites closely linked to the S-locus may cause greater declines in the viability of zygotes constituted from more closely related S-alleles. Diffusion approximation models incorporating variation in viability among S-locus genotypes and antagonistic interactions between a new specificity and its immediate parent specificity are constructed and analyzed. Results indicate that variation in viability tends to reduce the number of specificities maintained in a population at stochastic steady state, and that genealogy-based antagonism reduces the rate of bifurcation of S-allele lineages. These effects may account for some of the unusual features observed in empirical studies of S-allele genealogies.

1. Introduction

Homomorphic self-incompatibility (SI) in many flowering plants excludes pollen that express genetically determined specificities in common with the pistil (see de Nettancourt, 1977). Under gametophytic SI (GSI), pollen specificity is determined in the gametophyte stage, by the haploid genome of the pollen itself, and under sporophytic SI (SSI), in the sporophyte stage, by the genome of the plant that produced the pollen. Because pollen that express rarer specificities induce the rejection reaction in fewer pistils, SI engenders intense frequency-dependent balancing selection, promoting the invasion and maintenance of rare specificities. Self incompatible plants typically show very high levels of polymorphism at the locus (S-locus) which determines specificity, with up to hundreds of S-alleles maintained within species (Lawrence, 2000).

To account for the extraordinary allelic diversity observed by Emerson (1938) in natural populations, Wright (1939) developed a diffusion approximation model of GSI. While this model predicts high S-allele polymorphism, the dozens of alleles observed by Emerson appeared incompatible with the low estimated population size. Later work by Fisher (1958) and Wright (1960, 1964) led to refinements of the model and its approximations, but the rate of mutation to new S-alleles required to account for the number of segregating S-alleles remained rather beyond the maximum considered likely on the basis of empirical studies of spontaneous and artificial mutations at the S-locus (Wright, 1960).

1.1. Genetic structure

Functional studies of induced mutations (reviewed by Golz et al., 2000; Nasrallah et al., 2000) established that the S-locus comprises at least two distinct genes, the determinants of pistil and pollen specificity. Absolute linkage between these genes is considered essential to SI function, as crossing-over would permit self-fertilization by generating a haplotype that encodes different pistil and pollen specificities.

Genetic analysis of the S-locus has revealed that recombination is greatly reduced or suppressed, not only between the determinants of pistil and pollen specificity, but across large genomic tracts surrounding the S-locus. In the system of SSI expressed in species of Brassica, segregating haplotypes encoding different S-locus specificities show extensive structural differences, suggesting that conventional crossing-over within the S-locus would generate only nonfunctional gametes (Boyes et al., 1997; Suzuki et al., 1999; Nasrallah, 2000). Comprising perhaps hundreds of kilobases (KB), the S-locus region in Brassica is gene-rich, containing approximately one expressed gene per 5:4 KB in B. rapa (Suzuki et al., 1999). In solanaceous species expressing GSI, the Slocus lies in a centromeric region, and the region of greatly reduced recombination extends over at least 1 MB (McCubbin and Kao, 1999; McCubbin et al., 2000).

1.2. Evolutionary analysis

Characterization of the genes that control pistil expression in model systems of SSI (Nasrallah et al., 1985) and GSI (Anderson et al., 1986) revolutionized the evolutionary, as well as genetic and physiological, investigation of SI. By expanding the empirical basis from number and frequency of alleles to include genealogical relationships among alleles, the advent of molecular population genetics has provided much new information relating to the classical conundrum of S-locus diversity. However, resolution of genealogical relationships among segregating S-allele lineages has only deepened the mystery. Very high rates of origin of new S-alleles would imply rapid turnover of S-allele lineages, in apparent contradiction with the ancient divergence of lineages segregating within species (Ioerger et al., 1990; Dwyer et al., 1991; Uyenoyama, 1995). Further, analyses of genealogical structure among segregating S-alleles sampled from natural populations have revealed significantly long terminal branches relative to expectation based on SI alone (Uyenoyama, 1997). Other systems of balancing selection, including the evolutionarily-independent form of SI expressed in Brassica (Schierup et al., 1998) and the vertebrate major histocompatibility complex (Richman and Kohn, 1999), also exhibit this pattern.

Further, significant departures from equal frequencies among S-alleles controlling GSI in Papaver rhoeas (field poppy) may reflect linkage of the S-locus to factors affecting seed dormancy and albinism (Lawrence and Franklin-Tong, 1994). All structural information available to date in model systems of both SSI and GSI indicates tight linkage between the S-locus and perhaps hundreds of expressed genes.

A factor that may account for both the unusual genealogical structure of S-allele genealogies and the apparent pleiotropy of S-alleles is the progressive accumulation of S-allele-specific load due to recessive deleterious mutations at tightly linked sites (Uyenoyama, 1997, 2000). The very low rates of recombination in the vicinity of the S-locus may promote the development of S-allele-specific load. Formation of a zygote comprising recently diverged specificities would result in homozygosis and expression of recessive deleterious mutations in regions flanking the S-locus. Zygote viability or fertility would decline with increasing genealogical relationship between the constituent S-alleles.

1.3. Overview of the analysis

Here, I extend the Wright (1939) model of GSI to accommodate genealogy-dependent viability selection, denoting an association between zygote viability and the time since divergence of the constituent alleles. Such dependence may derive from S-allele-specific load, disequilibrium between S-alleles and deleterious mutations at loci closely linked to the S-locus. In order to accommodate S-allele-specific load, the models relax some of the symmetry assumptions of the Wright model.

Introduction of variation in zygote viability alone is expected to reduce the probability of invasion of new specificities (see, for example, Karlin and Levikson, 1974). In order to provide a means of distinguishing the general effects of variation in viability from the specific effects of genealogy-dependent viability selection, I first consider Model 1, which incorporates a positive variance, but without mean differences, in viability among S-locus genotypes.

Model 2 addresses the primary issue of interest, genealogy-dependent viability selection. In lieu of an explicit representation of the accumulation of deleterious mutations at multiple sites within the S-locus region, I assume that the relative time since divergence between a pair of alleles determines the viability of the zygote they would form. The distribution of divergence time among common S-alleles depends on the process of origin and extinction of specificities.

A rare, newly arisen S-specificity (O) may wander in the boundary layer close to extinction for an extended period before either exiting the population or jumping into the interior (cf. Gillespie, 1994). The probability of invasion of O depends on the distribution of its divergence times from other alleles. In particular, it may have diverged perhaps orders of magnitude more recently from its immediate parent specificity (P) than from other common S-alleles. Model 2 assumes that the identity of the first common S-allele to become extinct during the period in which O resides in the boundary layer is independent of genealogical relationship to O: In the event that it is P that becomes extinct during this period, the distribution of coalescence times between O and the remaining common S-alleles would be comparable to that between any pair of common S-alleles. Alternatively, the presence of P in the population throughout the period in which O is rare is expected to reduce the probability that O will rise to common frequencies.

Variation in zygote viability in combination with antagonism between closely related S-alleles tends to reduce both S-allele number and the rate of bifurcation of S-allele lineages. Consequently, while allele-specific load may account for some of the features of observed S-allele genealogies, it would exacerbate the discrepancy between the number of alleles observed in natural populations and current theoretical expectations.

2. Method

In order to describe the analytical approach, I begin with an exposition of the Wright (1939) model of GSI, together with its key approximations and results (see also Uyenoyama and Takebayashi, 2003).

2.1. Wright model

The intense frequency-dependent balancing selection engendered by GSI derives entirely from prezygotic interactions between seed parent and pollen tube, and not at all from variation in postzygotic components of fitness, including fertility or viability. Accordingly, Wright’s (1939) model prescribes uniform seed set and viability among genotypes:

Pij'=qiki,jPjk1qjqk+qjki,jPik1qiqk/2, (1)

in which Pij represents the frequency of SiSj zygotes, the prime indicates the offspring generation, qk is the frequency of S-allele Sk in the pollen pool, and the denominators ensure equal seed set among genotypes. Because GSI prevents the formation of homozygotes, indices I, j, and k are necessarily distinct.

In general, this recursion system is of very high dimension, reflecting the maintenance of many specificities under GSI. Using an inspired series of approximations, which were later shown to be robust, Wright (1939) succeeded in capturing the joint evolution of genetic drift, mutation, and GSI selection among arbitrary numbers of specificities in a one-dimensional diffusion model. The effective number of alleles (n) corresponds to the inverse of steady-state homozygosity:

F=qi2=1/n, (2)

for qi the frequency of the ith allele (Kimura and Crow, 1964). Wright’s approach entails first treating n as a parameter and later determining its relationship to population size and the rate of mutation to new S-alleles.

Because mating compatibility depends only on identity or nonidentity between pistil and pollen specificities, S-alleles are exchangeable: the probability distributions of allele frequencies are unchanged under permutation of allele names. Consider a particular S-allele, Si, with frequency q,and assume equal frequencies among the n − 1 alternative S-alleles segregating in the population:

qj=1q/n1, (3)

for j different from i: Similarly, assume equal frequencies among all genotypes that bear Si and equal frequencies among all other genotypes:

Pij=P, (4)
Pjk=1Pn12n1, (5)
q=Pn1/2, (6)

for i; j, and k denoting distinct specificities.

These approximations and substitution of homozygosity for the inverse of allele number (F = 1/n) permit reduction of the model to a single recursion:

ΔqqFq13F+2qF. (7)

Wright (1964) further replaced q in the denominator by F,

Δq=qFq1F12F, (8)

to determine the now canonical diffusion equation coefficients:

μx=axxFux, (9)
σ2x=x12x/2N, (10)

in which N is the effective population size; u the rate of mutation to new S-alleles; and

a=11F12F (11)

(see Wright, 1969, pp. 405–406). The form of the diffusion coefficient (10) reflects drift due to sampling of zygotes rather than genes (Fisher, 1958, Chapter IV; Wright, 1960). Further, because SI prevents the formation of homozygotes, reproduction requires at least three S-alleles, ensuring nonnegative σ2(x) for any reproducing population.

Wright’s diffusion approximation model of GSI contains only two parameters, effective population size (N) and the rate of mutation to new S-specificities (u), with no parameter representing the intensity of the dominant selective force of prezygotic mating exclusion. In (9), GSI selection generates change at a rate proportional to x(x − F) Unless this quantity is comparable in magnitude to 1=2N, the force of GSI overwhelms drift, causing very rapid changes in frequency. In essence, the diffusion approximation holds only for x near zero or near the deterministic equilibrium F = 1/n, with the process jumping instantaneously between these regions (see Sasaki, 1989, 1992; Gillespie, 1983, 1994).

2.2. Determination of homozygosity

Wright’s model addresses the change in frequency of a focal S-allele, against a background of n − 1 alternative S-alleles, for n corresponding to the inverse of homozygosity (F) Clearly, the value of n is not in fact a parameter, but is assumed to follow a stationary distribution at stochastic steady state. Because extinction is the only boundary that a given S-allele can reach in finite time, a stationary distribution for its frequency does not exist. However, the property of exchangeability of S-alleles permits the number of S-alleles with frequency in a small interval Φxdx to be obtained from the drift and diffusion coefficients describing changes in the frequency of a particular S-allele. Yokoyama and Nei (1979) assumed that Φxdx is proportional to the Wright–Fisher formula:

Φxdx=Climnne2ημξ/σ2ξdξ/σ2xdx, (12)

with proportionality constant

C=4Nu. (13)

From the condition that allele frequencies (x) sum to unity,

01/2xΦxdx=1, (14)

Yokoyama and Hetherington (1982) derived an implicit solution for F(=1/n):

u8Nπ1F12FN/1Fe2FN/1F12F=1. (15)

These derivations are somewhat heuristic because the notions of stochastic equilibrium and stationary distribution are not well-defined for SI systems: in particular, the ultimate state is extinction, upon the eventual decline of the number of S-alleles in the population to two, at which point GSI would preclude all fertilization (Ewens, 1964).

An alternative derivation of the number of S-alleles involves equating the rate of incorporation and extinction of common specificities (compare analyses of overdominant viability selection by Sasaki, 1989, 1992; Slatkin and Muirhead, 1999). Each of the 2N genes in the population gives rise to a new S-allele at rate u, with the rate of origin of new common S-alleles corresponding to

λ=2Nuv1/2N, (16)

in which v(x) represents the probability that an allele that appears with initial frequency x increases to frequency 1/n before extinction. A fundamental result from diffusion theory (see Karlin and Taylor, 1981, Chapter 15) gives this probability as

vx=SxS0S1/nS0, (17)

in which

Sy=ysηdη, (18)
sη=e2ημξ/σ2ξdξ. (19)

S(y); called the scale function, expresses the scaling of distance under which the distance of the process from a point is proportional to hitting probability (Karlin and Taylor, 1981, p. 196). Substitution of drift (9) and diffusion (10) coefficients of the Wright model into (17) produces

λ=2Nua2b=4NuF1F12F (20)

for a given in (11) and

b=121F+u. (21)

As noted by Gillespie (1994) for overdominant viability selection, the probability that a rare allele will escape the boundary layer and jump into the interior is independent of its initial frequency within the boundary layer.

Karlin and Taylor (1981, Chapter 15) show that the expected time for an allele initiated at frequency x to reach either endpoint of the interval (α, β) is

2SxSαSβSαxβSβSξmξdξ+2SβSxSβSααxSξSαmξdξ, (22)

in which

mξ=1σ2ξsξ. (23)

Reproduction under GSI requires at least three S-alleles, implying

limβ1/2Sβ=. (24)

This expression indicates that the upper boundary is in effect infinitely distant from the interior. Taking the limit (α→0, β→1/2) produces t(x) the expected time for a common S-allele initiated at frequency x to decline to extinction. For the Wright model,

tx=2aa2b2eNa2b2ba2NbπNb. (25)

The rate of extinction of any particular S-allele corresponds to the reciprocal of t(x), giving a total rateof extinction among the n common S-alleles maintained in the population of

μ=n/tx. (26)

Equating the rates of origin and extinction (λ= μ)determines the number of common alleles (n = 1/F). This manipulation, using the drift and diffusion coefficients for the Wright model, confirms the solution of Yokoyama and Hetherington (1982), obtained by a different approach. The proportionality constant C (13) of the stationary distribution of allele numbers may then be determined from the condition that allele frequencies sum to unity (14). This determination completes the Wright model by relating the number of common alleles (n) to the parameters of the model (N and u).

The expressions derived here differ from those given by Vekemans and Slatkin (1994) and Muirhead (2001). In adapting Takahata’s (1990) analysis of symmetric overdominance to GSI, Vekemans and Slatkin (1994) followed Takahata in substituting x/2N for the diffusion coefficient σ2(x)=x(12x)/2N (10). This substitution introduces a factor of 2, later considered equivocal by Slatkin and Muirhead (1999). By modifying classical results on the probability of fixation of alleles under viability selection, Muirhead (2001) proposed an expression for the probability of invasion of new S-alleles that differs from (20). She then used it to infer the rate of extinction from the Yokoyama– Hetherington solution (15) for the number of common alleles (n), obtaining an expression different from (25). The results presented here derive from a self-contained analysis of the Wright model.

2.3. Scaling factor

Takahata (1990) suggested that the process of coalescence among symmetrically overdominant alleles in a population of size 2N resembles that among neutral alleles in a population of size 2Nf, for f a scaling factor summarizing the expansion of coalescence times under balancing selection. Coalescence among j neutral lineages in a population of size 2Nf occurs at rate

j22Nf, (27)

and among lineages under balancing selection at rate

λ11/nj2n2=λjj1n2. (28)

The expression for balancing selection reflects the rise of a new specificity to common frequencies (λ) without the extinction of its parent specificity 11/n and coalescence between a pair of the focal j lineages (see Hey, 1992; Takahata et al., 1992). Equating these expressions determines the scaling factor:

f=n24Nλ=ntx4N, (29)

in which the latter equality follows from (26) under the assumption of stochastic equilibrium (λ= μ).

For the Wright (1939) model,

f=116N2uaF3, (30)

in which F is determined by the Yokoyama and Hetherington (1982) solution (15). This expression differs from the quantity given in Vekemans and Slatkin (1994) by the equivocal 2 indicated earlier.

3. Model 1: variation in viability

Model 1 addresses variation in viability among zygotes, independent of genealogical relationship between alleles. The change in frequency of a given allele Si depends on the distribution of viabilities of carriers of Si relative to noncarriers.

3.1. Drift and diffusion coefficients

Selection due to both prezygotic incompatibility and postzygotic variation in viability determine the frequency of a genotype bearing S-specificities Si and Sj:

TPij'=τj2qki,jPjk1qjqk+qjki,jPik1qqk (31)

in which Pij denotes the frequency of zygotes with genotype SiSj; q and qj the frequencies in the pollen pool of specificities Si and Sj, respectively, τj the viability in the present generation of genotype SiSj; the prime the frequency among offspring at the time of reproduction; and T the normalizer that ensures that genotypic frequencies sum to unity. Following Wright (1939), I consider changes in the frequency of Si and assume symmetry among the frequencies of the n − 1 remaining common specificities in the population:

qj=1qn1, (32)
Pij=P, (33)
Pjk=1Pn1n12, (34)

for i, j, and k distinct indices. This approximation assumes that the departures from symmetry generated by variation in viability have little effect on the average viability of carriers of Si. Viability selection does not affect pollen production, implying

q=Pn1/2. (35)

Incorporation of these assumptions and summation over all alternative specificities Sj permit reduction of the recursion to

TP'=T2q'/n1=τn1q12qn3+2q+P/2, (36)

in which τ represents the average viability among genotypes that bear specificity Si.

Determination of the normalizer T entails construction of a similar recursion for the frequency of genotype SjSk, in which both specificities differ from Si. Summing over all such genotypes as before produces

T12q'=τ1q12qn3n3+2q+Pn1/2, (37)

in which τ* denotes the expected viability among genotypes that do not bear Si. Summing these recursions produces

T=τττ12q+2qnq1n3+2q. (38)

An expression for the change in frequency of Si over one generation follows from (36) and (38):

TΔq=τqFq13F+2qF+qττ12q+2qqF13F+2qF. (39)

Passing to the diffusion limit requires that genetic changes induced by mating incompatibility, viability selection, and drift occur at comparable rates. As in the Wright model, changes due to incompatibility and drift occur on similar time scales if qFq and 1/2N are of comparable orders. Terms in (39) which involve differentials in viability ττ represent the intensity of viability selection. I assume that

Eτττ=0, (40)
Eτττ2=D2N, (41)

in which the first equation indicates equal expected viabilities among genotypes formed by randomly pairing S-specificities. Selection intensities of greater order than 1/2N would overwhelm the process, while those of lesser order would have negligible effect. Expanding T under the assumption of small selection differentials due to both incompatibility and viability selection produces

Δq=qFq13F+2qF+τττ12q×1+τττ12q. (42)

This expression determines the mean and variance coefficients of the diffusion model:

EΔx=xxFa+x12x2D/2N, (43)
EΔx2=x12x2D/2N, (44)

in which x represents the frequency of the focal S-specificity and a corresponds to (11). The form of a (11) reflects Wright’s substitution of F for q in the denominator of the first term of (42), an approximation that permits considerable simplification at the cost of little error for large n (see Wright, 1964). Combining these expressions with terms representing the loss of the focal S-specificity through mutation to new S-specificities (rate u) and drift produces the drift and diffusion coefficients for this first model:

μ1x=xxFaxu+x12x2D/2N, (45)
σ2x=x12xdx/2N, (46)

in which

dx=1+Dx12x. (47)

As Karlin and Levikson (1974) found for general diffusion models involving stochastic fluctuations in selection intensity, the variance in the intensity of viability selection (D/2N) contributes to the mean displacement μ1x as well as to the variance coefficient σ2x.

3.2. Rate of origin of new specificities

Substitution of (45) and (46) into the expression for the scale function (19) gives

sη=BηeNAη, (48)

for

Aη=2blog12η+aDr1r2r1logr1η+logηr2+2br1r2r1logr1η+r2logηr2, (49)
Bη=1dηr1ηηr21/2r1r2, (50)

in which r1 and r2 represent the roots of d(η):

1+Dξ12ξ=2Dr1ξξr2=0 (51)

(cf. Karlin and Levikson, 1974). Integration of s(η) using Laplace’s asymptotic expansion method for large N (see Olver, 1974, Chapter 3) provides the probability of invasion of a new specificity,

v1x=2/A'0=a2b, (52)

for b (21) as defined by Yokoyama and Hetherington (1982).

These expressions indicate that rate of origin of new specificities under the influence of incompatibility and viability selection is

λ1=2Nua2b=4NuaF+ou. (53)

Variation in viability (D) affects the rate of origin of new specificities only through its influence on the number of common specificities maintained in the population (n) Examination of the drift coefficient μ1(x) (45) indicates that for small x, the term representing GSI dominates the term representing viability selection.

3.3. Rate of extinction of common specificities

Substitution of the drift (45) and diffusion (46) coefficients into the expression for the time to extinction (22) produces

tx=πdx^Nb12x^2Nb2ax^2L1x^NaL2x^L3x^2Nb, (54)

in which

L1x^=r1x^r2r2r1x^1/Dr1r2, (55)
L2x^=r1x^r2r2r1x^1/2r1r2, (56)
L3x^=r1r1x^r1r2x^r2r21/r1r2 (57)

for

x^=a2b2a=Fua. (58)

As is the case for the rate of origin (53), the time to extinction of a common S-specificity (segregating at frequencies near 1/n) is, to the order of the approximation of the diffusion model, independent of the initial frequency x.

Application of l’Hôpital’s rule indicates that as variation in the intensity of viability selection diminishes to zero,

limD0L1=e2x^, (59)

with L2 and L3 converging to unity. In the limit of negligible D, the time to extinction reduces to (25), corresponding to the Wright model.

3.4. Determination of homozygosity

The number of common S-alleles (n = 1/F) is determined as a balance between the origin and extinction of common specificities. Setting λ1 (53) equal to n/t1(x) produces

u8NπdF1F12FN/1FL1FN/1F12F×L2FL3FN/1F=1 (60)

(to order u). In the limit of negligible variation in viability (D = 0), this equation reduces to the Yokoyama-Hetherington (1982) solution (15).

4. Model 2: genealogy-dependent selection

Under genealogy-dependent viability selection, zygotes constituted from more recently diverged S-specificities have lower viability. A rare specificity newly formed by mutation from an existing common specificity diverged from its parent much more recently than from any other common specificity. Model 2 addresses this exceptional relationship between a newly arisen rare specificity and its parent. A subsequent section examines the relationship between zygote viability and the distribution of coalescence times between two randomly chosen common specificities.

4.1. Drift and diffusion coefficients

Let Si in recursion (31) for SiSj represent the newly arisen specificity and Sj a common specificity different from the parent, Sp. The symmetry assumptions now distinguish Sp from the other common specificities:

qj=1qn1,ji, (61)
Pij=P,ji,p, (62)
Pip=Q, (63)
Pjl=1Pn2+Q(n12),ij,l. (64)

The frequency of Si corresponds to

q=Pn2+Q/2. (65)

Summation over all alternative specificities Sj (ji,p) produces

TP'=τ22q12qn3+2q+Pn3+Qn2, (66)

in which τ now represents the average viability of zygotes bearing Si together with a common specificity other than Sp. The recursion for the frequency of the zygote formed from the parent and offspring specificities is

TQ'=τ^22q12qn3+2q+P, (67)

in which τ^ represents the viability of SiSp zygotes. These expressions determine the recursion for the frequency of Si:

Tq'=τ42q12qn2n3+2q+Pn3+Q+τ^42q12qn3+2q+P. (68)

Only a fraction 1/n of the zygotes that bear the rare specificity also bear its parent; further, these genotypes have much lower viability than all other genotypes. To obtain a recursion in a single variable, I ignore terms involving Q or τ^:

Tq'=τ2q12qn2n3+2q+qn3n2, (69)

in which the normalizer T corresponds to (38), reflecting the rarity of Si.

The expression for the change in frequency of Si over one generation becomes

TΔq=τqFq13F+2qF2n52qn3n213F+2qF+qττ12q+2qqF13F+2qF. (70)

This expression describes the change in frequency of specificity Si only before it becomes common and in the presence of its parent specificity; after that point, the recursion for common specificities (39) governs its evolution. A weighted average, which reduces to the rare specificity recursion (∆qr) for small values of q and to the common specificity recursion (∆qc) for q near F, approximates the transition between these two selective regimes:

Δqw=FqΔqr+qΔqcF. (71)

Substitution of (70) for ∆qr and (39) for ∆qc produces

TΔqw=τqFqF+2q13F213F+2qF12F+qττ12q+2qqF13F+2qF. (72)

In both ∆qr and ∆qc, n represents the number of common specificities, a set which includes the focal specificity only when it is common (∆qc) and not when it is rare (∆qr) Assuming as before that terms involving q(F − q) and the viability selection differential (ττ*) are of the order of the inverse of the population size, the mean displacement coefficient for Model 2 becomes

μ2x=xxFaF+2q13F212Fxu+x12x2D2N. (73)

Expression (46) gives the variance coefficient for this model.

4.2. Rate of origin of new specificities

Drift and diffusion coefficients (73) and (46) imply an invasion probability for a new specificity in the presence of its parent specificity of

v2x=aF212F2u. (74)

Comparison of (52) and (74) confirms that antagonistic interactions between a new specificity and its parent discourage invasion.

Whether the invasion probability of a particular novel specificity (O) corresponds to (52) or (74) depends on the presence or absence of its immediate parent specificity (P) during the interval before it either jumps into the interior or becomes extinct. Consider the first common specificity to become extinct after the formation of O,. while it remains in the boundary layer between extinction and invasion. Under the assumption that the existence of O in very low frequency has negligible effect on the fate of P, the probability that it is P that becomes extinct is 1/n; in this event, O invades with probability v1(x) Alternatively, with probability (1 − 1/n) P persists in the population, implying an invasion probability of v2(x) Over the long-term, new specificities become common in the population at rate

λ=2Nuv1x/n+v2x11/n=2NuF235F/1F12F2+ou. (75)

Bifurcation of lineages entails the coexistence of a new specificity together with its parent. In the absence of antagonism between P and O, bifurcation occurs at rate λ11/n, declining in its presence to

λ1r/n=2Nuv2x11/n=2NuF2/12F2+ou. (76)

This expression implies that genealogy-dependent viability selection increases the probability of exclusion of P:

r=v1xv1x/n+v2x11/n=212FF35F+Ou>1. (77)

Given the successful invasion of O, P is more likely to have been absent (r>1 for v1(x)>v2(x)) Even though the existence of O has no effect on the unconditional probability of extinction of P, antagonism between O and P increases the probability of parental extinction conditional on O having invaded.

4.3. Determination of homozygosity

To determine the equilibrium number of common specificities maintained in the population, I set the rate of incorporation of newly derived specificities in the face of possible parental interference (75) equal to the rate of extinction of common specificities, obtained from (54). This formulation reflects that once the new specificity becomes common, the time since divergence from its parent follows the same distribution of divergence times as between any pair of common specificities. To order u, F is implicitly defined by

uF35F2NπdF1F×12FN/1F1L=1, (78)

for

L=L1FN/1F12FL2FL3FN/1F.

In the limit of negligible variation in viability among zygotes expressing two common specificities (D = 0), this expression reduces to

uF35F2Nπ1F12FN/1F1×e2NF/1F12F=1. (79)

4.4. Scaling factor

Under genealogy-dependent viability selection, bifurcation of lineages occurs at rate λ(1-r/n) With this modification, equating (27) and (28) produces

4Nf=n2n1λnr,f=12F8N2uaF4, (80)

in which (78) determines F.

Expression (30), in which F is determined by the Yokoyama and Hetherington (1982) solution (15), gives the scaling factor for the Wright (1939) model (r= 1; D = 0). In Model 1, which incorporates variation in viability (D>0) but not parental exclusion (r = 1), the scaling factor corresponds to (30), but with F determined by (60).

5. Variation in selection intensity

Expression of recessive deleterious mutations in regions cosegregating with the S-locus may induce a relationship between the time since divergence of a pair of specificities and the viability of the zygote formed from that pair. In a given zygote, mutations that arose prior to the divergence of the constituent haplotypes occur in homozygous form and those that arose subsequently in heterozygous form (Fig. 1). Viability declines as genealogical distance declines, reflecting greater proportions of mutations expressed in homozygous form

Fig. 1.

Fig. 1.

Deleterious mutations in a region flanking the S-locus mapped on the allelic genealogy of S-specificities S1,S2, and S3: Zygotes constituted from less closely related haplotypes bear fewer mutations in homozygous form: 1 mutation in homozygous form and 3 mutations in heterozygous form for S1=S2; and 1 homozygous and 4 heterozygous for S1=S3, Specificities S2 and S3, which diverged from their common ancestor more recently, share 2 mutations.

I assign the viability of a zygote formed by the fusion of a random pair of common specificities as

τ=1s1As2B, (81)

in which s1 and s2 denote selection coefficients representing reductions in viability due to the expression of deleterious mutations in heterozygous and homozygous form, respectively; A the number of deleterious mutations that arose in either lineage since their common ancestor; and B the number that arose before their divergence but after the most recent common ancestor (MRCA) of all specificities currently segregating in the population. Because all individuals in the population carry the mutations borne by the MRCA in homozygous form, those mutations do not affect relative fitness among zygotes. Parameter D, representing variation in viability selection (41), depends upon the distribution of coalescence times of random pairs of specificities.

For τ and τ* viabilities of zygotes formed from random pairs of specificities, the expected square of the relative difference is close to but somewhat greater than the ratio of squares:

Eτττ2Eττ2Eτ2. (82)

Because the numbers of mutations occurring in disjunct time periods are independent,

Eττ2=Es1AA+s2BB2=2s12VarA+s22VarB. (83)

The existence of the limiting diffusion requires that the square of the selection coefficients (s12 and s22) be of the order of the effect due to drift (1/2N) Because the expectation of τ2 differs from unity by the square of the selection intensities,

D2N=2s12VarA+s22VarB. (84)

Assumption of a Poisson distribution for mutation number given divergence time implies

VarA=2vETc+2v2VarTc, (85)
VarB=vET2Tc+v2VarT2Tc, (86)

(see Hudson, 1990), in which v denotes the per-haplotype rate of deleterious mutation, Tc the time since divergence of a random pair of specificities, and T2 the time since the most recent common ancestor of all specificities segregating in the population (Fig. 2).

Fig. 2.

Fig. 2.

Times to coalescence between S1 and S2 (Tc) and among all n S-specificities currently segregating in the population (T2) Mutations that appeared in specificities S1 or S2 since their divergence occur in heterozygous form in S1S2 zygotes, while those that appeared before divergence occur in homozygous form.

The focal pair of specificities coalesce in a given generation only if a lineage bifurcated and the specificities represent the two descendant lineages. For a pergeneration probability of coalescence of

P=λ1r/nn2, (87)

the probability generating function (pgf) for the pairwise coalescence time (Tc) is

gz=k=1zkQk1P=zPPz1Qʼ (88)

in which P and Q sum to unity. This function yields

ETc=n2n12λnr=2Nf, (89)
VarTc=2Nf21. (90)

The expression for E[Tc] confirms a well-known result, with f the scaling factor given by (80); the expression for Var[Tc] reflects that the pgf incorporates a geometric distribution rather than the exponential distribution customarily used in coalescence theory (see Hudson, 1990).

Once the focal pair of specificities coalesce, the remaining time to the MRCA depends on the number of lineages remaining at that point. Fig. 3 depicts a genealogy among the n distinct S-specificities segregating in the population. State i corresponds to the section of the genealogy in which i ancestral lineages remain. The probability that the focal pair coalesce at the boundary between states i and i – 1 is

PrX=i=2ii1j=i1n12jj1=2n+1ii+1n1=21+Fii+11F. (91)

Given coalescence at the boundary between states i and i – 1; the remaining time to the MRCA T2Tc corresponds to the sum of the lengths of the states from i – 1 to 2. Expression (28), with the rate of bifurcation λ11/n replaced by λ1r/n, provides the rate of termination of state j:

Pj=λnrjj1n2n1=jj14Nf (92)

(using (30)). The pgf for the length of state j corresponds to

hjz=k=1zkQjk1Pj=zPjPjz1Qj, (93)

in which Pj and Qj sum to unity. Independence among states permits formation of the pgf of the time between coalescence and the MRCA T2Tc as the product over the states. Taking the expectation over the state of coalescence produces

hz=i=2nPrX=ij=2i1hjz. (94)

This pgf provides

ET2Tc=2Nf12/n=2Nf12F, (95)
VarT2Tc=4Nf221+F1Fi=3n1ii+1j=2i11j2ET2Tc1+ET2Tc. (96)

The expression for ET2Tc confirms expectation from standard coalescence theory of the difference between the expected times to the MRCA of n lineages 4N11/n and of a random pair of lineages (2N).

Fig. 3.

Fig. 3.

Genealogy traced backward in time from the n distinct S-specificities currently segregating in the population. State j corresponds to the interval in which the current n lineages have j ancestral lineages.

6. Discussion

Patterns observed in genealogies of S-alleles derived from natural populations motivated this exploration of variation in viability among S-locus genotypes and allele-specific load. Low rates of crossing-over between haplotypes encoding distinct S-specificities may permit the accumulation of recessive deleterious mutations at sites linked to the S-locus. Under full expression of GSI, mutations restricted to a single specificity class never occur in homozygous form. Screening of the effects of homozygous expression of such mutations is postponed until the formation of zygotes comprising haplotypes encoding distinct specificities that descend from the same ancestral specificity. The models introduced here address the effects of variation in viability among zygotes and greater antagonism between more closely related haplotypes on the number of S-alleles maintained in a population and their genealogical relationships. Numerical simulation studies designed to test the analytical results derived from these models are currently underway.

6.1. Variation in zygote viability

Variation in viability among S-locus genotypes, independent of genealogical relationships among alleles, tends to reduce the invasion probability of new specificities. In order to distinguish the effects of variation itself from those due to allele-specific load, I first studied Model 1, in which zygote viability is independent of genealogical relationship between constituent alleles.

Results confirm that variation in viability reduces the number of S-alleles maintained in population (60). This reduction in allele number increases the probability of invasion of new specificities (53). Consequently, variation in viability decreases the period over which a given specificity is common within the population.

6.2. Genealogy-dependent selection

Genealogy-dependent viability selection denotes a positive association between viability of a given zygoteand the time since divergence of its constituent haplotypes. S-allele-specific load may contribute to such associations. In a given zygote, mutations that arose after the divergence of the constituent haplotypes occur in heterozygous form, while those that arose prior to divergence occur in homozygous form. Model 2 addresses the effects of S-allele-specific load by expressing the distribution of viabilities in terms of the distribution of coalescence times between specificities.

Immediately after the appearance of novel specificity (O); before it rises to common frequencies or becomes extinct, the time since divergence from its immediate parent specificity (P) is negligible relative to divergence times among common S-alleles. The presence of P decreases the probability of invasion of O, implying that antagonistic interactions between closely related specificities reduces the rate of bifurcation of S-allele lineages (77). In contrast, the presence of O in very low frequencies is assumed to have no effect on the probability or rate of extinction of P: If P becomes extinct soon after giving rise to O, O is more likely to invade. Over the long-term, the evolutionary dynamics of new specificities reflect a mixture of the two cases, corresponding to the early extinction or persistence of the immediate parental specificity.

6.3. Empirical observations

Model 2 suggests that reductions in viability due to Sallele-specific load are of the order of the product of the square of selection coefficients associated with heterozygous (s1) and homozygous (s2) expression of deleterious mutations, the per-haplotype rate of deleterious mutation (v), and coalescence times among common specificities (see (84) and (86)). Available empirical information is consistent with substantial reductions of this kind. For example, the cosegregation of the S-locus with genomic tracts of considerable size (Boyes et al., 1997; McCubbin and Kao, 1999) suggests high perhaplotype rates of mutation. Further, analyses of Sallele genealogies indicate coalescence times between specificities of the order of tens of millions of years (Ioerger et al., 1990; Dwyer et al., 1991; Uyenoyama, 1995).

Genealogy-dependent viability selection may increase variation in allele frequency (D in (46) and (73)) and reduce both the number of common S-alleles maintained in population (78) and the rate of bifurcation of S-allele lineages (77). The S-allele genealogy of Solanum carolinense, which shows the greatest deviations from expectation based on GSI selection, exhibits an apparent reduction in the rate of bifurcation of lineages (Uyenoyama, 1997). Further, this species maintains the smallest number of S-alleles known (Lawrence, 2000).

6.4. Other factors affecting evolutionary dynamics

Both the number of segregating S-specificities and their genealogical relationships to one other depend critically on the mechanism through which new specificities arise. While the Wright model and the extensions considered here portray the origin of a new specificity as a single mutational event, recent experimental studies suggest passage through intermediate states, possibly with altered SI expression (Matton et al., 1999).

That the determinants of pistil and pollen specificity are genetically and physiologically distinct (Golz et al., 2000; Nasrallah et al., 2000) itself suggests that the formation of a new S-specificity requires at least two mutations. Genetic conflicts between the evolutionary interests of the genes that control pistil and pollen function are expected to arise, even though absolute linkage consigns them to a common fate (Uyenoyama and Newbigin, 2000; Uyenoyama et al., 2001). Pollen that express a novel specificity would induce the rejection reaction in few pistils, promoting the partial breakdown of SI and the appearance of nonreciprocal pollen exclusion among genotypes. Mutations in the pistil gene that restore reciprocal incompatibility by permitting recognition of the new pollen specificity would be favored. The introduction of a new specificity through this evolutionary pathway almost always entails the replacement of the ancestral specificity by the singlemutant, expressing the new pollen specificity alone, and of this intermediate by the double-mutant, expressing a new, full-function specificity (Uyenoyama et al., 2001). S-allele turnover through this mechanism would rarely permit bifurcation of lineages within populations, with population structure perhaps determining the rate and pattern of branching in S-allele genealogies.

Observation of shared characteristics in patterns of S-allele diversity, including exceedingly high numbers of specificities and long terminal branches in gene genealogies, suggests the operation of shared processes. Accumulation of recessive deleterious mutations in regions tightly linked to genes that determine specificity represents a general process likely operating across evolutionarily independent SI systems.

Acknowledgments

I thank two anonymous reviewers for their comments. US Public Health Service grant GM 37841 provided support for this study.

References

  1. Anderson MA, Cornish EC, Mau S-L, Williams EG, Hoggart R, et al. , 1986. Cloning of cDNA for a stylar glycoprotein associated with expression of self-incompatibility in (Nicotiana alata). Nature 321, 44–48. [Google Scholar]
  2. Boyes DC, Nasrallah ME, Vrebalov J, Nasrallah JB, 1997. The self-incompatibility (S) haplotypes of Brassica contain highly divergent and rearranged sequences of ancient origin. Plant Cell 9, 237–247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Dwyer KG, Balent MA, Nasrallah JB, Nasrallah ME, 1991. DNA sequences of self-incompatibility genes from Brassica campestris and B. oleracea: polymorphism predating speciation. Plant Mol. Biol 16, 481–486. [DOI] [PubMed] [Google Scholar]
  4. Emerson S, 1938. The genetics of self-incompatibility in Oenothera organensis. Genetics 23, 190–202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ewens WJ, 1964. On the problem of self-sterility alleles. Genetics 50, 1433–1438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Fisher RA, 1958. The Genetical Theory of Natural Selection, 2nd Revised Edition. Dover, New York. [Google Scholar]
  7. Gillespie J, 1983. Some properties of finite populations experiencing strong selection and weak mutation. Amer. Nat 121, 691–708. [Google Scholar]
  8. Gillespie J, 1994. Substitution processes in molecular evolution. II. Exchangeable models from population genetics. Evolution 48, 1101–1113. [DOI] [PubMed] [Google Scholar]
  9. Golz JF, Clarke AE, Newbigin E, 2000. Mutational approaches to the study of self-incompatibility: revisiting the pollen-part mutants. Ann. Bot 85 (Suppl. A), 95–103. [Google Scholar]
  10. Hey J, 1992. Using phylogenetic trees to study speciation and extinction. Evolution 46, 627–640. [DOI] [PubMed] [Google Scholar]
  11. Hudson RR, 1990. Gene genealogies and the coalescent process. In: Futuyma D, Antonovics J (Eds.), Oxford Surveys in Evolutionary Biology, Vol. 7 Oxford University Press, New York, pp. 1–44. [Google Scholar]
  12. Ioerger TR, Clark AG, Kao T.-h., 1990. Polymorphism at the self- incompatibility locus in Solanaceae predates speciation. Proc. Natl Acad. Sci. USA 87, 9732–9735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Karlin S, Levikson B, 1974. Temporal fluctuations in selection intensities: case of small population size. Theor. Popul. Biol 6, 383–412. [DOI] [PubMed] [Google Scholar]
  14. Karlin S, Taylor HM, 1981. A Second Course in Stochastic Processes Academic Press, New York. [Google Scholar]
  15. Kimura M, Crow JF, 1964. The number of alleles that can be maintained in a finite population. Genetics 49, 725–738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Lawrence MJ, 2000. Population genetics of the homomorphic self- incompatibility polymorphisms in flowering plants. Ann. Bot 85 (Suppl. A), 221–226. [Google Scholar]
  17. Lawrence MJ, Franklin-Tong VE, 1994. The population genetics of the self-incompatibility polymorphism in Papaver rhoeas. IX. Evidence of an extra effect of selection acting on the S-locus. Heredity 72, 353–364. [Google Scholar]
  18. Matton DP, Luu DT, Xike Q, Laublin G, O’Brien M, et al. , 1999. Production of an S RNase with dual specificity suggests a novel hypothesis for the generation of new S alleles. Plant Cell 11, 2087–2097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. McCubbin AG, Kao T.-h., 1999. The emerging complexity of self-incompatibility (S-) loci. Sex. Plant Reprod 12, 1–5. [Google Scholar]
  20. McCubbin AG, Wang X, Kao T.-h., 2000. Identification of self- incompatibility (S-) locus linked pollen cDNA markers in Petunia inflata. Genome 43, 619–627. [PubMed] [Google Scholar]
  21. Muirhead CA, 2001. Consequences of population structure on genes under balancing selection. Evolution 55, 1532–1541. [DOI] [PubMed] [Google Scholar]
  22. Nasrallah JB, 2000. Cell-cell signaling in the self-incompatibility response. Curr. Opin. Plant Biol 3, 368–373. [DOI] [PubMed] [Google Scholar]
  23. Nasrallah JB, Kao T.-h., Chen C-H, Goldberg ML, Nasrallah ME, 1985. A cDNA clone encoding an S-locus-specific glycopro- tein from Brassica oleracea. Nature 326, 617–619. [Google Scholar]
  24. Nasrallah ME, Kandasamy MK, Chang M-C, Stadler Z, Lim S, et al. , 2000. Identifying genes for pollen-stigma recognition in crucifers. Ann. Bot 85 (Suppl. A), 125–132. [Google Scholar]
  25. de Nettancourt D, 1977. Incompatibility in Angiosperms Springer, Berlin. [Google Scholar]
  26. Olver FWJ, 1974. Asymptotics and Special Functions Academic Press, New York. [Google Scholar]
  27. Richman AD, Kohn JR, 1999. Self-incompatibility alleles from Physalis: implications for historical inference from balanced genetic polymorphisms. Proc. Natl Acad. Sci. USA 96, 168–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Sasaki A, 1989. Evolution of pathogen strategies Ph.D. Thesis, Kyushu University, Kyushu, Japan. [Google Scholar]
  29. Sasaki A, 1992. The evolution of host and pathogen genes under epidemiological interaction. In: Takahata N (Ed.), Population Paleo-Genetics Japan Scientific Societies Press, Tokyo, pp. 247–263. [Google Scholar]
  30. Schierup MH, Vekemans X, Christiansen FB, 1998. Allelic genealogies in sporophytic self-incompatibility systems in plants. Genetics 150, 1187–1198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Slatkin M, Muirhead CA, 1999. Overdominant alleles in a population of variable size. Genetics 152, 775–781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Suzuki G, Taki N, Hirose T, Fukui K, Nishio T, et al. , 1999. Genomic organization of the S locus: identification and characterization of genes in SLG=SRK region of S9 haplotype of Brassica campestris (syn. rapa). Genetics 153, 391–400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Takahata N, 1990. A simple genealogical structure of strongly balanced allelic lines and trans-species evolution of polymorphism. Proc. Natl Acad. Sci. USA 87, 2419–2423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Takahata N, Satta Y, Klein J, 1992. Polymorphism and balancing selection at major histocompatibility complex loci. Genetics 130, 925–938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Uyenoyama MK, 1995. A generalized least-squares estimate for the origin of sporophytic self-incompatibility. Genetics 139, 975–992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Uyenoyama MK, 1997. Genealogical structure among alleles regulating self-incompatibility in natural populations of flowering plants. Genetics 147, 1389–1400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Uyenoyama MK, 2000. The evolution of breeding systems. In: Singh RS, Krimbas C (Eds.), Evolutionary Genetics from Molecules to Morphology Cambridge University Press, New York, pp. 300–316. [Google Scholar]
  38. Uyenoyama MK, Newbigin E, 2000. Evolutionary dynamics of dual-specificity self-incompatibility alleles. Plant Cell 12, 310–312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Uyenoyama MK, Takebayashi N, 2003. Genus-specific diversification of mating types. In: Singh RS, Uyenoyama MK (Eds.), The Evolution of Population Biology Cambridge University Press, New York: (Chapter 13). [Google Scholar]
  40. Uyenoyama MK, Zhang Y, Newbigin E, 2001. On the origin of self-incompatibility haplotypes: transition through self-compatible intermediates. Genetics 157, 1805–1817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Vekemans X, Slatkin M, 1994. Gene and allelic genealogies at a gametophytic self-incompatibility locus. Genetics 137, 1157–1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Wright S, 1939. The distribution of self-sterility alleles in populations. Genetics 24, 538–552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Wright S, 1960. On the number of self-incompatibility alleles maintained in equilibrium by a given mutation rate in a population of a given size: a reexamination. Biometrics 16, 61–85. [Google Scholar]
  44. Wright S, 1964. The distribution of self-incompatibility alleles in populations. Evolution 18, 609–619. [Google Scholar]
  45. Wright S, 1969. Evolution and the Genetics of Populations, Volume 2: The Theory of Gene Frequencies University of Chicago Press, Chicago. [Google Scholar]
  46. Yokoyama S, Hetherington LE, 1982. The expected number of self- incompatibility alleles in finite plant populations. Heredity 48, 299–303. [Google Scholar]
  47. Yokoyama S, Nei M, 1979. Population dynamics of sex-determining alleles in honey bees and self-incompatibility alleles in plants. Genetics 91, 609–626. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES