Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Aug 1.
Published in final edited form as: Theor Popul Biol. 2020 Apr 7;134:129–146. doi: 10.1016/j.tpb.2020.02.004

Assortative mating by population of origin in a mechanistic model of admixture

Amy Goldberg 1,2,*, Ananya Rastogi 3, Noah A Rosenberg 2
PMCID: PMC7387155  NIHMSID: NIHMS1582622  PMID: 32275920

Abstract

Populations whose mating pairs have levels of similarity in phenotypes or genotypes that differ systematically from the level expected under random mating are described as experiencing assortative mating. Excess similarity in mating pairs is termed positive assortative mating, and excess dissimilarity is negative assortative mating. In humans, empirical studies suggest that mating pairs from various admixed populations—whose ancestry derives from two or more source populations—possess correlated ancestry components that indicate the occurrence of positive assortative mating on the basis of ancestry. Generalizing a two-sex mechanistic admixture model, we devise a model of one form of ancestry-assortative mating that occurs through preferential mating based on source population. Under the model, we study the moments of the admixture fraction distribution for different assumptions about mating preferences, including both positive and negative assortative mating by population. We demonstrate that whereas the mean admixture under assortative mating is equivalent to that of a corresponding randomly mating population, the variance of admixture depends on the level and direction of assortative mating. We consider two special cases of assortative mating by population: first, a single admixture event, and second, constant contributions to the admixed population over time. In contrast to standard settings in which positive assortment increases variation within a population, certain assortative mating scenarios allow the variance of admixture to decrease relative to a corresponding randomly mating population: with the three populations we consider, the variance-increasing effect of positive assortative mating within a population might be overwhelmed by a variance-decreasing effect emerging from mating preferences involving other pairs of populations. The effect of assortative mating is smaller on the X chromosome than on the autosomes because inheritance of the X in males depends only on the mother’s ancestry, not on the mating pair. Because the variance of admixture is informative about the timing of admixture and possibly about sex-biased admixture contributions, the effects of assortative mating are important to consider in inferring features of population history from distributions of admixture values. Our model provides a framework to quantitatively study assortative mating under flexible scenarios of admixture over time.

1. Introduction

Mechanistic models describing the dynamics of admixture among two or more populations have proven informative for understanding the processes that underlie patterns of admixture in admixed populations. Such models have examined a variety of phenomena, including sex-biased admixture (Goldberg et al., 2014; Goldberg and Rosenberg, 2015), the interaction of admixture with migration and population size (Long, 1991; Pool and Nielsen, 2009), the consequences of spatial structure for ancestry patterns (Wang et al., 2011; Sedghifar et al., 2015), hybrid incompatibilities and epistasis (Lindtke and Buerkle, 2015; Schumer et al., 2015; Schumer and Brandvain, 2016; Sedghifar et al., 2016), and the effect of admixture on linkage disequilibrium (Chakraborty and Weiss, 1988; Pfaff et al., 2001; Gravel, 2012; Loh et al., 2013; Liang and Nielsen, 2014; Ni et al., 2016; Zaitlen et al., 2017).

Recently, a general family of admixture models has introduced a framework for examining the effects of an ongoing admixture process on the properties of admixture levels in populations. Verdu and Rosenberg (2011) devised a mechanistic model of admixture allowing for flexible admixture contributions over time, studying the dynamics of the distribution of ancestry within an admixed population. Goldberg et al. (2014) and Goldberg and Rosenberg (2015) extended this model to allow for varying contributions from males and females, evaluating the consequences of sex-biased admixture for autosomal and X-chromosomal ancestry. Gravel (2012) derived the distribution of autosomal ancestry and ancestry-tract lengths under a related model allowing for multiple waves of migration. These studies demonstrate the potential of mechanistic admixture models for accommodating complex admixture histories.

One phenomenon of recent interest in the study of admixed populations is assortative mating, in which admixture levels of individuals in a population system that includes an admixed population and its source populations influence the formation of mating pairs. Empirical analyses of mating pairs in admixed populations have documented nonrandom pairings with respect to admixture levels (Risch et al., 2009; Sebro et al., 2010; Zaitlen et al., 2017). In particular, Zou et al. (2015) found that in multiple Latino populations, spouse pairs are correlated in their genomic ancestry patterns. Assortative mating by admixture has also been suggested as a cause of geographic structure in patterns of genetic and phenotypic variation in admixed populations (Ruiz-Linares et al., 2014), and as an explanation for empirically observed differences between phenotype-based ancestry classifications and genomic ancestry (Parra et al., 2003).

A particular form of assortative mating by admixture can be more specifically characterized as assortative mating by source population (Ritchie et al., 1989; Howard, 1993; Duenez-Guzman et al., 2009; Melo et al., 2009; Ruiz-Linares et al., 2014; Schumer et al., 2017). In this type of assortative mating by admixture, different groups—source populations and admixed populations, for example—come to have distinct geographic locations, trait preferences, host preferences, or in the case of human populations, social identities (Qian, 1997; Jacobson et al., 2004). Individual ancestry levels within the admixed group are less important for mating choices than the group membership itself. Such a pattern can occur in hybrid zones; for example, Melo et al. (2009) found that certain Heliconius hybrids show distinct wing color patterns, and during mating, hybrids prefer the hybrid patterns over the patterns displayed in the parental populations.

Positive assortative mating by source population can take place, in which individuals from the source populations preferentially mate within the source groups, while individuals from the admixed group preferentially mate within the admixed group (Figure 1b). Alternatively, negative assortative mating by source population can also take place—for example, during major migration events—in which pairings might be more likely to involve individuals from different source populations (Figure 1c).

Figure 1:

Figure 1:

Schematic of the mechanistic model of assortative mating by population. Two source populations, S1 and S2, contribute females and males to the next generation of the hybrid population H, potentially with time-varying proportions. The fractional contributions of the source populations and the hybrid population to the next generation g are s1,g, s2,g, and hg, respectively. Sex-specific contributions from the populations are s1,gf, s2,qf and hgf, and S1,gm, S2,gm and hgm, for females and males, respectively. Hα,g,δγ represents the fraction of admixture from source population α ∈ {1, 2} in generation g for a random individual of sex δ ∈ {f, m} in population H for chromosomal type γ ∈ {A, X}. Within the admixed population, at every generation, parents from generation g – 1 pair according to one of three mating models. Individuals from S1 are represented by triangles, S2 by pentagons, and H by squares. (A) Random mating. The probability of a pairing is given by the product of the proportional contributions of the two populations. (B) Positive assortative mating. Individuals are more likely to mate with individuals from their own population. (C) Negative assortative mating. Individuals are more likely to mate with individuals from a different population. In each panel, a mating pair is indicated by a pair of adjacent symbols. Each panel considers the same values for the contributions from the three populations to generation g + 1.

Here, we extend mechanistic models describing the evolution of admixture levels in an admixed population to accommodate assortative mating by source population. In Section 2, we develop the mechanistic admixture model, allowing mating pairs to vary in their probability of occurrence based on individual populations of origin. In Sections 3 and 4, we derive recursive expressions for the moments of autosomal and X-chromosomal admixture, respectively, as functions of sex-specific contributions and the properties of assortative mating. Next, in Sections 5 and 6, we analyze the behavior of the moments of the admixture fraction distribution for specific cases of the admixture model with constant contributions over time. We conclude with a discussion in Section 7.

2. Model

We follow the notation of Goldberg and Rosenberg (2015), extending earlier mechanistic admixture models to consider an admixed population that preferentially mates based on population of origin. That is, individuals from each of the populations—source populations S1 and S2, and the hybrid population, H—preferentially mate with individuals of the other sex based on population of origin. We consider the case of two source populations, so that individuals come from one of three populations: S1, H, or S2. Individuals from the source populations may contribute to pool of possible parents for the next generation of H, but individuals from H cannot contribute to the source populations, and each source population does not contribute to the other source population.

Following Goldberg and Rosenberg (2015), we define the parameters S1,gδ and S2,gδ as the fractions of individuals of sex δ at generation g that trace to source populations S1 and S2, respectively, in the previous generation. For the admixed population H, hgδ is the corresponding parameter. We maintain relations between parameter sets from Goldberg et al. (2014, eqs. 16). Specifically, female (superscript f) and male (superscript m) contributions must each sum to 1,

s1,gf+hgf+s2,gf=s1,gm+hgm+s2,gm=1. (1)

Additionally, the total contribution from a source population is the average of the sex-specific contributions from that source,

s1,g=(s1,gf+s1,gm)/2 (2)
s2,g=(s2,gf+s2,gm)/2 (3)
hg=(hgf+hgm)/2 (4)

For a randomly mating population, the probability that an individual in the admixed population has a specific parental pairing is the product of the associated female and male contribution parameters. For example, the probability that an individual in the admixed population in generation g has a female parent from S1 and a male parent from H is s1,g1fhg1m. To consider deviations from random mating, we define a new parameter cij,g as the difference between the probability that a mating pair in generation g contains a female from population i and a male from population j (Table 1) and the corresponding probability in a randomly mating population with the same contribution parameters. Here, i, j ϵ {1, h, 2} for populations S1, H, and S2, respectively.

Table 1:

Recursions for autosomal and X-chromosomal admixture. The table shows the probability of parental pairings and the admixture fractions for each of the nine possible pairings for a randomly chosen female and a randomly chosen male from the admixed population at generation g.

Case Female parent’s population Male parent’s population Probability Admixture in males Admixture in females
H1,g,mA H1,g,mX H1,g,fA H1,g.fX
1 S1 S1 s1,g1fs1,g1m+c11,g 1 1 1 1
2 S1 H s1,g1fhg1m+c1h,g 1+H1,g1,mA2 1 1+H1,g1,mA2 1+H1,g1,mX2
3 S1 S2 s1,g1fs2,g1m+c12,g ½ 1 ½ ½
4 H S1 hg1fs1,g1m+ch1,g 1+H1,g1,fA2 H1,g1,fX 1+H1,g1,fA2 1+H1,g1,fX2
5 H H hg1fhg1m+chh,g H1,g1,fA+H1,g1,mA2 H1,g1,fX H1,g1,fA+H1,g1,mA2 H1,g1,fX+H1,g1,mX2
6 H S2 hg1fs2,g1m+ch2,g H1,9A1,f2 H1,g1,fX H1,g1,fA2 H1,g1,fX2
7 S2 S1 s2,g1fs1,g1m+c21,g ½ 0 ½ ½
8 S2 H s2,g1fhg1m+c2h,g H1,g1,mA2 0 H1,g1,mA2 H1,g1,mX2
9 S2 S2 s2,g1fs2,g1m+c22,g 0 0 0 0

The parameters cij,g govern the strength and direction of assortative mating. We assume that the assortative mating preference is constant over time after the founding of the admixed population. Therefore, we have two sets of parameters: cij,0 for the founding generation, and cij for all further generations. Because the sum of the probabilities for all parental pairings must be 1, we have

0=c11,0+c12,0+c21,0+c22,0 (5)
0=c11+c1h+c12=ch1+chh+ch2=c21+c2h+c22 (6)
0=c11+ch1+c21=c1h+chh+c2h=c12+ch2+c22. (7)

The values of the cij are bounded such that the probability of each given parental pairing (Table 1) takes its values in the interval [0, 1], and such that each probability is no greater than the probability of one of its constituent components. For example, if c11 > 0, then c11min(s1,g1f,s1,g1m)s1,g1fs1,g1m. If c11 < 0, then |c11|s1,g1fs1,g1m. The value of cij is necessarily zero if the female contribution from population i or the male contribution from population j is zero.

For positive assortative mating in population i, with i ∈ {S1, H, S2}, individuals in population i are more likely to mate with individuals from their own population than from other populations, so that cii > 0 and cii > cij and cii > cji for each ji. If individuals from a source population are least likely to mate with individuals from the other source population, then we also have c12 < 0 and c21 < 0. Similarly, for negative assortative mating, individuals in each population are less likely to mate with individuals from their own population than with individuals from other populations, so that cii < 0 for all i and cii < cij and cii < cji for all i and ji. If individuals from a source population are most likely to mate with individuals from the other source, then c12 > 0 and c21 > 0.

We study the random variable Hα,g,δγ describing the ancestry proportion or admixture fraction from source population α at generation g for a chromosomal type γ in a random individual of sex δ in the admixed population. The chromosomal type is indicated by γ, and it can be either autosomal, A, or X-chromosomal, X. For the autosomes, Hα,g,fA and Hα,g,mA are identically distributed (Goldberg et al., 2014). For the X chromosome, the distribution of Hα,g,fX depends on both female and male admixture distributions in generation g – 1, as females inherit one X chromosome from each parent. For Hα,g,mX, the distribution depends only on the female contributions, as males inherit a single X chromosome from their mothers. Note that it is permitted for an individual in the admixed population to have admixture fraction 0 or 1 if all its ancestors trace to a single source population.

Based on these inheritance patterns, we can write the values for the autosomal and X-chromosomal admixture fractions of an individual randomly chosen from the admixed population given one of nine possible sets of parents, L, along with the probability that an individual has that set of parents (Table 1). The probability of a parental pairing is a function of the sex-specific contributions from the populations and the assortative mating parameters cij. If cij,0 = 0 and cij = 0 for all parental pairings, with j not necessarily distinct from i, this model reduces to the random-mating model of Goldberg and Rosenberg (2015).

3. Moments of the autosomal admixture fraction

3.1. Expectation

Under the model, following Goldberg et al. (2014, eq. 12), we can use the law of total expectation to write a recursion for the expected value of the admixture fraction from source population 1 for a random individual of sex δ sampled from the admixed population in generation g as a function of conditional expectations for all possible parental pairs L. As H1,g,fA and H1,g,mA are identically distributed, we write expressions for sex δ when considering autosomal admixture, understanding that δ takes on the same value, f or m, throughout Section 3. Using the values from Table 1, for the first generation, in which neither parent is from population H, we have

E[H1,1,δA]=(s1,0fs1,0m+c11,0)E[1]+(s1,0fs2,0m+s2,0fs1,0m+c12,0+c21,0)E[12]+(s2,0fs2,0m+c22,0)E[0]. (8)

For all subsequent generations, g ≥ 2, we have

E[H1,g,δA]=(s1,g1fs1,g1m+c11)E[1]+(s1,g1fhg1m+c1h)E[1+H1,g1,mA2]+(hg1fs1,g1m+ch1)E[1+H1,g1,fA2]+(s1,g1fs2,g1m+s2,g1fs1,g1m+c12+c21)E[12]+(hg1fhg1m+chh)E[H1,g1,fA+H1,g1,mA2]+(hg1fs2,g1m+ch2)E[H1,g1,fA2]+(s2,g1fhg1m+c2h)E[H1,g1,mA2]+(s2,g1fs2,g1m+c22)E[0]. (9)

Using eqs. (1)–(4), we can simplify eq. (8) to give, for g = 1,

E[H1,1,δA]=s1,0+c11,0+c12,0+c21,02.

For g ≥ 2, recalling that H1,g,fA and H1,g,mA are identically distributed, eq. (9) becomes

E[H1,g,δA]=s1,g1+c11+c12+c1h+ch1+c212+(hg1+c1h2+ch12+chh+ch22+c2h2)E[H1,g1,δA].

Applying eqs. (5)–(7), for the expected value of autosomal admixture in a random individual of sex δ from the admixed population sampled at g = 1, we have

E[H1,1,δA]=s1,0. (10)

For subsequent generations g ≥ 2, we have

E[H1,g,δA]=s1,g1+hg1E[H1,g1,δA]. (11)

Eqs. (10) and (11) are the same as were found under a randomly mating population by Goldberg et al. (2014, eqs. 17 and 19). That is, positive or negative assortative mating according to population of origin in an admixed population does not affect the expectation of autosomal admixture. The concordance of mean ancestry under assortative and random mating makes sense because the conditions in eqs. (5)–(7) maintain the ancestry proportions from generation to generation in a manner that does not permit natural selection favoring one or another ancestral membership. The ancestries of the parents contributing to the next generation are the same in both scenarios; assortative mating only changes the probability distribution of the parental pairing.

3.2. Higher moments

Next, we write a general recursion for the higher moments of the autosomal admixture fraction from population S1 in a randomly chosen individual of sex δ from the admixed population. For moments k ≥ 1, similar to eqs. 20 and 21 of Goldberg et al. (2014) in generation g = 1, we obtain

(H1,1,δA)k={1k if L=S1S1, with P[L=S1S1]=s1,0fs1,0m+c11,0(12)k if L=S1S2, with P[L=S1S2]=s1,0fs2,0m+c12,0(12)k if L=S2S1, with P[L=S2S1]=s2,0fs1,0m+c21,00k if L=S2S2, with P[L=S2S2]=s2,0fs2,0m+c22,0. (12)

For g ≥ 2, we have

H1,g,δAk=1kif L=S1S1, with PL=S1S1=s1,g1fs1,g1m+c111+H1,g1,mA2kif L=S1H, with PL=S1H=s1,g1fhg1m+c1h12kif L=S1S2, with PL=S1S2=s1,g1fs2,g1m+c121+H1,g1,fA2kif L=HS1, with PL=HS1=hg1fs1,g1m+ch1H1,g1,fA+H1,g1,mA2kif L=HH, with PL=HH=hg1fhg1m+chhH1,g1,fA2kif L=HS2, with PL=HS2=hg1fs2,g1m+ch212kif L=S2S1, with PL=S2S1=s2,g1fs1,g1m+c21H1,g1,mA2kif L=S2H, with PL=S2H=s2,g1fhg1m+c2h0kif L=S2S2, with PL=S2S2=s2,g1fs2,g1m+c22. (13)

For k = 1, eqs. (12) and (13) simplify to produce the recursion in Table 1. Using the law of total expectation, and following our calculation for the expected value of autosomal admixture, we can write an expression for the higher moments of autosomal admixture by summing the conditional values for autosomal admixture given the parental pairings over all possible sets of parental source populations. For the first generation, g = 1, we have

E[(H1,1,δA)k]=(s1,0fs1,0m+c11,0)E[1k]+(s1,0fs2,0m+s2,0fs1,0m+c12,0+c21,0)E[(12)k]+(s2,0fs2,0m+c22,0)E[0k]. (14)

For g ≥ 2, we have

E[(H1,g,δA)k]=(s1,g1fs1,g1m+c11)E[1k]+(s1,g1fhg1m+c1h)E[(1+H1,g1,mA2)k]+(hg1fs1,g1m+ch1)E[(1+H1,g1,fA2)k]+(s1,g1fs2,g1m+s2,g1fs1,g1m+c12+c21)E[(12)k]+(hg1fhg1m+chh)E[(H1,g1,fA+H1,g1,mA2)k]+(hg1fs2,g1m+ch2)E[(H1,g1,fA2)k]+(s2,g1fhg1m+c2h)E[(H1,g1,mA2)k]+(s2,g1fs2,g1m+c22)E[0k]. (15)

Recalling that H1,g1,fA and H1,g1,mA are conditionally independent given H1,g2,fA and H1,g2,mA (Goldberg et al., 2014), we use the binomial theorem to simplify eqs. (14) and (15). For g = 1, we have

E[(H1,1,δA)k]=s1,0fs1,0m+c11,0+s1,0fs2,0m+s2,0fs1,0m+c12,0+c21,02k. (16)

For g ≥ 2, we have

E[(H1,g,δA)k]=s1,g1fs1,g1m+c11+s1,g1fs2,g1m+s2,g1fs1,g1m+c12+c21+c1h+ch12k+s1,g1fhg1m+hg1fs1,g1m+c1h+ch12k(i=0k(ki)E[(H1,g1,δA)i])+hg1fhg1m+chh2k(i=0k(ki)E[(H1,g1,δA)ki]E[(H1,g1,δA)i])+s2,g1fhg1m+hg1fs2,g1m+ch2+c2h2kE[(H1,g1,δA)k]. (17)

For k = 1, eqs. (16) and (17) reduce to eqs. (10) and (11).

The recursions for the higher moments of the autosomal admixture fraction distribution follow the corresponding expressions for a randomly mating population, but with additional terms that are linear in the coefficients for the mating preferences. Setting the assortative mating parameters cij,0 and cij equal to 0 for all i and j, eqs. (16) and (17) reduce to the recursion for the moments of admixture for a corresponding randomly mating population, eqs. 24 and 26 from Goldberg et al. (2014).

3.3. Variance

Using eqs. (16) and (17) for k = 2, we can write expressions for the second moment of autosomal admixture. Recalling eqs. (1)–(4), for g = 1, we have

E[(H1,1,δA)2]=s1,0f(1+s1,0m)+s1,0m(1+s1,0f)+2c11,04. (18)

For g ≥ 2, because H1,g1,fA and H1,g1,mA are identically distributed, we have

E[(H1,g,δA)2]=s1,g1f(1+s1,g1m)+s1,g1m(1+s1,g1f)+2c114+s1,g1fhg1m+hg1fs1,g1m+c1h+ch12E[H1,g1,δA]+hg1fhg1m+chh2(E[H1,g1,δA])2+hg1f+hg1m4E[(H1,g1,δA)2]. (19)

Using the definition of the variance V[H1,g,δA]=E[(H1,g,δA)2](E[H1,g,δA])2, with the expressions for the expected value of autosomal admixture, eqs. (10) and (11), and the second moment, eqs. (18) and (19), we can write an expression for the variance of autosomal admixture. For g = 1, we have

V[H1,1,δA]=s1,0f(1s1,0f)+s1,0m(1s1,0m)+2c11,04. (20)

For all subsequent generations, g ≥ 2, we have

V[H1,g,δA]=s1,g1f(1s1,g1f)+s1,g1m(1s1,g1m)+2c114+c1h+ch1s1,g1fhg1fs1,g1mhg1m2E[H1,g1,δA]+hg1f(1hg1f)+hg1m(1hg1m)+2chh4(E[H1,g1,δA])2+hg1f+hg1m4V[H1,g1,δA]. (21)

For a randomly mating population, with cij,0 = 0 and cij,g = 0 for all i and j, eqs. (20) and (21) reduce to eqs. 32 and 33 from Goldberg et al. (2014). Defining the variance of autosomal admixture in a randomly mating population at g = 1 and g ≥ 2, eqs. 32 and 33 from Goldberg et al. (2014), as VRM[H1,1,δA] and VRM[H1,g,δA], respectively, we can rewrite eqs. (20) and (21). For g = 1, we have

V[H1,1,δA]=c11,02+VRM[H1,1,δA]. (22)

For g ≥ 2, recalling eqs. (6) and (7), we have

V[H1,g,δA]=c112+(c1h+ch12)E[H1,g1,δA]+chh2(E[H1,g1,δA])2+VRM[H1,g,δA]. (23)

Eqs. (22) and (23) indicate that, in a single generation, assortative mating changes the variance of admixture compared to a randomly mating population. The variance of autosomal admixture contains information about assortative mating because it depends on the assortative mating parameters cij and it differs from the expression for a randomly mating population with the same contributions for the source populations.

4. Moments of the X-chromosomal admixture fraction

4.1. Expectation

Following the same approach as in the corresponding derivation for the autosomal admixture fraction, we can use the law of total expectation to write recursions for the moments of X-chromosomal admixture. Because the distribution of admixture differs for females and males on the X chromosome (Table 1), we follow Goldberg and Rosenberg (2015) and write coupled expressions for the moments of female and male X-chromosomal admixture fractions.

For X-chromosomal admixture in a randomly chosen female from the admixed population, the recursive expressions for the moments of admixture match the expressions for autosomal admixture presented in Section 3, exchanging γ = A for γ = X in the superscript. However, X-chromosomal admixture is not identically distributed in females and in males from the admixed population. Therefore, certain expressions for the moments of admixture cannot be reduced as they were for the autosomes.

Using eqs. (1)–(7), we can use the values from Table 1 to write expressions for the expectation of X-chromosomal admixture. For X-chromosomal admixture in a female from the admixed population, the recursive expressions for the expectation of admixture are the same as for autosomal admixture (eqs. (10) and (11)), exchanging γ = A for γ = X in the superscript. For X-chromosomal admixture in males, for g = 1, we have

E[H1,1,mX]=S1,0f+c11,0+c12,0=S1,0f. (24)

For g ≥ 2, we have

E[H1,g,mX]=s1,g1f+c11+c1h+c12+(hg1f+ch1+chh+ch2)E[H1,g1,fX]=s1,g1f+hg1fE[H1,g1,fX]. (25)

As was the case for the autosomes, the expectation of X-chromosomal admixture in eqs. (24) and (25) is the same under assortative mating as the expected value for a randomly mating population (Goldberg and Rosenberg, 2015, eqs. 34).

4.2. Higher moments

Following the derivation for the autosomes and using Table 1, we can write general coupled recursions for the higher moments of the X-chromosomal admixture fraction from population S1 in a randomly chosen female and male from the admixed population. As was true for the expectation, the recursion for the X-chromosomal admixture fraction in a female from the admixed population for k ≥ 1 is the same as the recursion for autosomal admixture in eqs. (12) and (13), exchanging the superscript A for X. For X-chromosomal admixture in a male, we have, for g = 1,

H1,1,mXk=1k if L=S1S1, with PL=S1S1=s1,0fs1,0m+c11,01k if L=S1S2, with PL=S1S2=s1,0s2,0m+c12,00k if L=S2S1, with PL=S2S1=s2,0s1,0m+c21,00k if L=S2S2, with PL=S2S2=s2,02s2,0+c22,0

For g ≥ 2, we have

H1,g,δXk=1k if L=S1S1, with PL=S1S1=s1,g1fs1,g1m+c11,g1k if L=S1H, with PL=S1H=s1,g1fhg1m+c1h,g1k if L=S1S2, with PL=S1S2=s1,g1fs2,g1m+c12,gH1,g1,fXk if L=HS1, with PL=HS1=hg1fs1,g1m+ch1,gH1,g1,fXk if L=HH, with PL=HH=hg1fhg1m+chh,gH1,g1,fXk if L=HS2, with PL=HS2=hg1fs2,g1m+ch2,g0k if L=S2S1, with PL=S2S1=s2,g1fs1,g1m+c21,g0k if L=S2H, with PL=S2H=s2,g1fhg1m+c2h,g0k if L=S2S2, with PL=S2S2=s2,g1fs2,g1m+c22,g.

Using the law of total expectation and the binomial theorem, and following the derivation of the autosomal admixture moments, we can write simplified coupled expressions for the moments of X-chromosomal admixture, separately considering a female and a male from the admixed population for k ≥ 1. For g = 1, the recursion for the kth moment of X-chromosomal admixture in a female is equal to the corresponding recursion in the case of autosomal admixture, eq. (16). For a male in g = 1, we have

E[(H1,1,mX)k]=s1,0f. (26)

For g ≥ 2, using the conditional independence of H1,g1,fX and H1,g1,mX given H1,g2,fX and H1,g2,mX, we have

E[(H1,g,fX)k]=s1,g1fs1,g1m+c11+s1,g1fs2,g1m+s2,g1fs1,g1m+c12+c21+c1h+ch12k+hg1fs1,g1m+ch12k(i=0k(ki)E[(H1,g1,fX)i])+s1,g1fhg1m+c1h2k(i=0k(ki)E[(H1,g1,mX)i])+hg1fhg1m+chh2k(i=0k(ki)E[(H1,g1,fX)ki]E[(H1,g1,mX)i])+hg1fs2,g1m+ch22kE[(H1,g1,fX)k]+s2,g1fhg1m+c2h2kE[(H1,g1,mX)k], (27)
E[(H1,g,mX)k]=s1,g1f+hg1fE[(H1,g1,fX)k]. (28)

Unlike for the autosomes, the female and male admixture fractions are not identically distributed or conditionally independent, so the dependence on both H1,g1,fX and H1,g1,mX in eq. (27) cannot be further reduced. For k ≥ 2, moments of the X-chromosomal admixture fraction depend on the assortative mating parameters, cij,0 and cij. However, conditional on H1,g1,fX, moments of the X-chromosomal admixture fraction sampled in a male from the admixed population do not depend on the assortative mating parameters. Because a single copy of the X chromosome is inherited from mother to son, the distribution of admixture for male X chromosomes in a given generation is affected only by the origin of the mother and not by the probabilities of parental pairings in Table 1.

4.3. Variance

For k = 2, we can use eqs. (26)–(28) to write coupled expressions for the second moment of X-chromosomal admixture in a randomly selected female and male from the admixed population. Recalling eqs. (1)–(4), for g = 1, the second moment of X-chromosomal admixture in a female is the same as the expression for autosomal admixture in eq. (18), substituting superscript X in place of A. For the second moment of X-chromosomal admixture in males in g = 1, we have

E[(H1,1,mX)2]=s1,0f.

For g ≥ 2, we have

E[(H1,g,fX)2]=s1,g1f(1+s1,g1m)+s1,g1m(1+s1,g1f)+2c114+hg1fs1,g1m+ch12E[H1,g1,fX]+s1,g1fhg1m+c1h2E[H1,g1,mX]+hg1fhg1m+chh2E[H1,g1,fX]E[H1,g1,mX]+hg1f4E[(H1,g1,fX)2]+hg1m4E[(H1,g1,mX)2],
EH1,g,mX2=s1,g1f+hg1fEH1,g1,fX2.

Following our derivation of the variance of autosomal admixture in Section 3.3, we can write the variance of X-chromosomal admixture in a female and in a male from the admixed population using the expected values and second moments of X-chromosomal admixture. For g = 1, we have

V[H1,1,fX]=s1,0f(1s1,0f)+s1,0m(1s1,0m)+2c11,04, (29)
V[H1,1,mX]=s1,0f(1s1,0f). (30)

For g ≥ 2, we have

V[H1,g,fX]=s1,g1f(1s1,g1f)+s1,g1m(1s1,g1m)+2c114+ch1s1,g1fhg1f2E[H1,g1,fX]+c1hs1,g1mhg1m2E[H1,g1,mX]+chh2E[H1,g1,fX]E[H1,g1,mX]+hg1f(1hg1f)4(E[H1,g1,fX])2+hg1m(1hg1m)4(E[H1,g1,mX])2+hg1f4V[H1,g1,fX]+hg1m4V[H1,g1,mX], (31)
V[H1,g,mX]=s1,g1f(1s1,g1f)2s1,g1fhg1fE[H1,g1,fX]+hg1f(1hg1f)(E[H1,g1,fX])2+hg1fV[H1,g1,fX]. (32)

We denote the variance of X-chromosomal admixture under random mating, in a random female and male from the admixed population, by VRM[H1,g,fX] and VRM[H1,g,mX], respectively. Setting all cij,0 and cij parameters to zero, for g = 1, we have

VRM[H1,1,fX]=s1,0f(1s1,0f)+s1,0m(1s1,0m)4, (33)
VRM[H1,1,mX]=s1,0f(1s1,0f). (34)

For g ≥ 2, we have

VRM[H1,g,fX]=s1,g1f(1s1,g1f)+s1,g1m(1s1,g1m)4s1,g1fhg1f2E[H1,g1,fX]s1,g1mhg1m2E[H1,g1,mX]+hg1f(1hg1f)4(E[H1,g1,fX])2+hg1m(1hg1m)4(E[H1,g1,mX])2+hg1f4V[H1,g1,fX]+hg1m4V[H1,g1,mX], (35)
VRM[H1,g,mX]=s1,g1f(1s1,g1f)2s1,g1fhg1fE[H1,g1,fX]+hg1f(1hg1f)(E[H1,g1,fX])2+hg1fV[H1,g1,fX]. (36)

Using eqs. (33)–(36), we can rewrite the variances in eqs. (29)–(32) as functions of the variance under a similar randomly mating population. For g = 1, we have

V[H1,1,fX]=c11,02+VRM[H1,1,fX], (37)
V[H1,1,mX]=VRM[H1,1,mX]. (38)

For g ≥ 2, conditional on H1,g1,fX and H1,g1,mX, and recalling eqs. (6) and (7), we have

V[H1,g,fX]=c112+ch12E[H1,g1,fX]+c1h2E[H1,g1,mX]+chh2E[H1,g1,fX]E[H1,g1,mX]+VRM[H1,g,fX], (39)
V[H1,g,mX]=VRM[H1,g,mX]. (40)

In contrast to the expectation, the variance of X-chromosomal admixture depends on the assortative mating parameters c11, ch1, c1h, and chh. Conditional on the female and male variances of X-chromosomal admixture in generation g – 1, the variance of X-chromosomal admixture in a male sampled in generation g is equivalent to that in a corresponding randomly mating population.

5. Special case: a single admixture event

To analyze the behavior of the model, we study the moments of the distribution of autosomal and X-chromosomal admixture for two special cases of constant admixture processes over time. First, in Section 5, we consider a case with no contributions from the sources after the initial founding. That is, for g ≥ 2, we set s1,g1f=s1,g1m=s2,g1f=s2,g1m=0 and hg1f=hg1m=1. Because only the admixed population contributes after the first generation, all assortative mating parameters for further generations equal 0, cij = 0.

Next, in Section 6, we examine the special case of constant non-zero contributions over time. In Sections 5 and 6, we assume that the assortative mating process is the same for both sexes. That is, cij,0 = cji,0 and cij = cji. The behavior of the expectation of autosomal and X-chromosomal admixture is equivalent to that seen in a randomly mating population, as studied by Goldberg et al. (2014) and Goldberg and Rosenberg (2015). Therefore, as in Goldberg et al. (2014), we focus on the variance, on which the assortative mating parameters have an impact. In Section 5, we consider the variances of autosomal and X-chromosomal admixture as functions of s1,0f,s1,0m,c11,0, and g. We focus on the dependence of the variance of admixture on the assortative mating parameter c11,0. Note that the variance of admixture under random mating (c11,0 = 0) in a single-admixture scenario was studied in detail for autosomes by Goldberg et al. (2014); for completeness, we include analogous calculations for the X chromosome in Appendix A.

5.1. Autosomes

Under this model of a single admixture event, we can write an exact expression for the variance of autosomal admixture. Using the equation for the variance of autosomal admixture in eq. (20), we observe that the variance for a single admixture event can be written as a geometric sequence with ratio 12. For g ≥ 1, we have

VH1,g,δA=s1,0f1s1,0f+s1,0m1s1,0m+2c11,02g+1. (41)

Recalling the variance of autosomal admixture for a randomly mating population produced by a single admixture event from Goldberg et al. (2014, eq. 35), we rewrite eq. (41) as a function of VRM[H1,g,δA],

V[H1,g,δA]=VRM[H1,g,δA]+c11,02g. (42)

As is true in a randomly-mating population, the long-term limit of the variance over time is zero, with the effect of assortative mating being multiplied by 12 each generation, and with the distribution of admixture narrowing around the same mean value as in a randomly mating population, s1,0 (Goldberg et al., 2014, eq. 35).

For positive assortative mating, with c11,0 > 0, the variance (eq. (42)) is larger than in a corresponding randomly mating population (Goldberg et al., 2014, eq. 35). For negative assortative mating, c11,0 < 0, the variance is smaller (Figure 2a). The effect of the initial non-random mating on the population decreases monotonically each generation, as all individuals mate randomly within the admixed population, with no further contributions from the sources. In a given generation and given the contributions s1,0f and s1,0m, the variance is directly related to c11,0. Therefore, the maximal and minimal variance occur when c11,0 is maximized and minimized, respectively.

Figure 2:

Figure 2:

Variance of ancestry under a single-admixture scenario with assortative mating in the founding generation. (A) Autosomes, V[H1,g,δA]. (B) X chromosomes in a female, V[H1,g,fX]. (C) X chromosomes in a male, V[H1,g,mX]. Each panel considers positive assortative mating (green), c11,0 = 0.1, negative assortative mating (purple), c11,0 = −0.1, and random mating (black), c11,0 = 0. The plots use eqs. (41), (43), and (44) with s1,0f=s1,0m=0.5. For positive assortment, the variance is higher than for random mating. For negative assortative mating, the variance is lower.

Negative assortative mating also introduces a minimum in the variance of autosomal admixture, with V[H1,g,δA]=0, when the numerator of eq. (41) is zero, at c11,0=s1,0f(1s1,0f)/2s1,0m(1s1,0m)/2. Under this scenario of complete negative assortative mating, individuals mate only with individuals from the other source population.

5.2. X chromosome

Similarly to the case of autosomal admixture, we can write an exact solution for the variance of X-chromosomal admixture for a scenario with a single admixture event and no subsequent admixture. We follow the derivation of the expectation of X-chromosomal admixture in Goldberg and Rosenberg (2015, eqs. 611). Recalling eqs. (29) and (30), we rewrite the variance of X-chromosomal admixture under a scenario of a single admixture event as a coupled pair of recursions for V[H1,g,fX] and V[H1,g,mX].

For g = 1,

V[H1,1,fX]=s1,0f(1s1,0f)+s1,0m(1s1,0m)+2c11,04,
V[H1,1,mX]=s1,0f(1s1,0f).

For g ≥ 2,

V[H1,g,fX]=14(V[H1,g1,fX]+V[H1,g1,mX]),
V[H1,g,mX]=V[H1,g1,fX]. (43)

Next, for g ≥ 3, we can rewrite the variance of X-chromosomal admixture in a female as a two-generation recursion,

V[H1,g,fX]=V[H1,g1,fX]+V[H1,g2,fX]4.

The variance of X-chromosomal admixture in a female is similar in form to the expectation seen in Goldberg and Rosenberg (2015, eq. 8), and we can take an analogous approach to solving the recursion. The recursion for the variance includes a factor of 4 in each generation; therefore, the closed-form expression contains a factor of 4g. We define yg=4gV[H1,g,fX], and for g ≥ 3, we have

yg4g=yg1/4g1+yg2/4g24.

Multiplying both sides by 4g, we have a recursion yg=yg1+4yg2, with

y1=V[H1,1,fX]=s1,0f(1s1,0f)+s1,0m(1s1,0m)+2c11,04
y2=V[H1,1,fX]+V[H1,1,mX]4=5s1,0f(1s1,0f)+s1,0m(1s1,0m)+2c11,016.

Next, for g ≥ 3, we write yg as Ags1,0f1s1,0f+Bgs1,0m1s1,0m+2Bgc11,0, where

Ag=Ag1+4Ag2
Bg=Bg1+4Bg2

and A1 = 1, A2 = 5, B1 = 1, and B2 = 1. Calculating further values of Bg, we note that Bg = Ag–1. We can therefore rewrite yg=Ags1,0f(1s1,0f)+Ag1s1,0m(1s1,0m)+2Ag1c11,0, leading to

V[H1,g,fX]=Ags1,0f(1s1,0f)+Ag1[s1,0m(1s1,0m)+2c11,0]4g. (44)

The sequence Ag is reminiscent of a similar recursively defined sequence that appears in the X-chromosomal expectation for a single admixture event in Goldberg and Rosenberg (2015). Ag is a Lucas sequence (OEIS A006131), which can be written in closed form by using its generating function, a(x) = 1/(1 − x − 4x2). For g ≥ 1, we have

Ag=(1+172)g+1(1172)g+117.

For a single admixture event, the variance of the admixture fraction in both males and females (eqs. (43) and (44)) decays to zero over time, as 4g grows faster than Ag. The effect of assortative mating on the X-chromosomal variance decreases at a slower rate—c11,0 accumulates a factor of 1+178 each generation—than for the autosomal variance, in which c11,0 is multiplied by a factor of 12 each generation (eq. (42)).

For specified contribution parameters S1,0f and s1,0m, given g, the maximum and minimum of the variance of X-chromosomal admixture as functions of c11,0 occur at the same values of c11,0 as the maximum and minimum of the autosomal variance: the maximum and minimum of c11,0.

Figures 24 analyze the behavior of the variances of autosomal and X-chromosomal admixture for the case of a single admixture event. Figure 2 plots V[H1,g,δA], V[H1,g,fX], and V[H1,g,mX] in relation to the time since admixture. For positive assortative mating, we observe that for both the autosomes (Fig. 2a) and the X chromosome (Figs. 2b and 2c), the variance of admixture is greater relative to corresponding randomly mating populations with the same contribution parameters. For negative assortative mating, the variance is smaller.

Figure 4:

Figure 4:

Variance of ancestry under a single-admixture scenario as a function of the assortative mating parameter c11,0. (A) Autosomes, V[H1,g,δA]. (B) X chromosomes in a female, V[H1,g,fX]. (C) X chromosomes in a male, V[H1,g,mX]. In each panel, the variance is plotted over the range of c11,0, [−0.25, 0.25], for values of g between 1 and 6 (from darkest to lightest gray), with s1,0f=s1,0m=0.5, using eqs. (41), (43), and (44).

Figure 3 notes a special case of the minimum of the autosomal variance at constant s1,0f=s1,0m=0.5 and c11,0=s1,0f(1s1,0f)/2s1,0m(1s1,0m)/2=0.25. Using eq. (41), V[H1,g,δA]=0, even though this set of contributions (s1,0f=s1,0m=0.5) is a maximum of the variance for a randomly mating population (Goldberg et al., 2014).

Figure 3:

Figure 3:

Special case of the variance of ancestry under a single-admixture scenario with negative assortative mating in the founding generation. (A) Autosomes V[H1,g,δA]. (B) X chromosomes in a female, V[H1,g,fX]. (C) X chromosomes in a male, V[H1,g,mX]. In each panel, the variance is plotted for negative assortative mating, c11,0 = −0.25, using eqs. (41), (43), and (44) with s1,0f=s1,0m=0.5. The plot highlights a special case in which the variance of autosomal admixture is zero because the numerator of eq. (41) is zero, even though s1,0f=s1,0m=0.5 is a maximum of the variance with respect to the sex-specific contributions in a randomly mating population. The c11,0 = 0 case is copied from Figure 2.

Figure 4 plots the variance of admixture over the permissible range of c11,0, [−0.25, 0.25], for each value of g in [1, 6] when s1,0f=s1,0m=0.5. The variance of admixture increases linearly with c11,0. That is, positive values of c11,0 produce a higher variance of admixture than that of a randomly mating population (c11,0 = 0), and negative values of c11,0 produce a lower variance. The autosomal variance decreases monotonically with g for each value of c11,0. The X-chromosomal variance is not monotonic as g increases for a specific value of c11,0, as can be seen from the existence of points of intersection of the lines (Fig. 4b), and as is described for the expectation when c11,0 = 0 by Goldberg and Rosenberg (2015). For the autosomes, the minimum observed in Figure 4a across values of c11,0 with S1,0f and S1,0m held constant occurs at c11,0 = −0.25, the minimum depicted in Figure 3, with V[H1,g,δA]=0 for all g. The male X-chromosomal admixture in g = 1 is 0 and does not depend on c11,0 (Fig. 4c).

For all initial contributions S1,0f and S1,0m, and assortative mating parameter values c11,0, the limit of the autosomal and X-chromosomal variances as g → ∞ equals zero. That is, for admixture processes with no contributions after the founding, in both assortative and randomly mating populations, the distribution of admixture narrows around the mean.

6. Special case: constant contributions over time

Next, we consider the special case of constant, non-zero contributions from the source populations to the admixed population over time, after its initial founding. As in Goldberg et al. (2014) and Goldberg and Rosenberg (2015), we choose the sex-specific parameters to have values that are constant in time, (s1,gf,s1,gm,s2,gf,s2,gm)=(s1f,s1m,s2f,s2m) for all g ≥ 1. Sex-specific contributions take their range in [0, 1]; however, the total contributions from each source, s1 and s2, take their range in (0, 1). We maintain separate parameters for the founding contributions, s1,0f,s1,0m,s2,0f,s2,0m. Here, we analyze the limiting behavior of the variance of autosomal and X-chromosomal admixture in an assortatively mating population under this scenario of constant contributions from the source populations to the admixed population in each generation after founding.

6.1. Limit of the variance of admixture over time

Appendix B uses the closed-form expressions for the variance of autosomal admixture in a randomly mating population, derived by Goldberg et al. (2014), to write a closed-form expression for the variance of autosomal admixture in eqs. (20) and (21). Taking the limit as g → ∞ of the expression in eq. (57) and assuming cij = cji, we have

limgV[H1,g,δA]=c112+c1h(s11h)+chh2(s11h)2+limgVRM[H1,g,δA], (45)

where limgVRM[H1,g,δA] is given by Goldberg et al. (2014, eq. 53) and is reproduced in Appendix B, eq. (58).

Next, we consider the variance of X-chromosomal admixture. Using a generating function approach, Appendix C calculates the limits of VRM[H1,g,fX] and of VRM[H1,g,mX] (eqs. (71) and (72)) as g → ∞. Recalling eqs. (37)–(40) and using eq. (73) for the definition of P3, we have

limgV[H1,g,fX]=c112+c1h2(P3+s1f+hfP3)+chh2P3(s1f+hfP3)+limgVRM[H1,g,fX] (46)
limgV[H1,g,mX]=s1f(1s1f)2s1fhfP3+hf(1hf)P32+hflimgV[H1,g,fX]. (47)

Figure 5 plots the variance of autosomal and X-chromosomal admixture over time for various values of the assortative mating parameters, both positive and negative, for constant contributions from the source populations over time. In these plots of the variance of admixture over time, we hold c11 = c22 and c1h = 0, and by consequence, ch1 = ch2 = c2h = 0. This set of values for the cij can be interpreted as representing (1) equal levels of within-population preference for the two source populations, and (2) for each source, the mating probability between hybrid and source individuals is the same as in a randomly mating population.

Figure 5:

Figure 5:

Variance of ancestry under a constant-admixture scenario with assortative mating in producing g ≥ 2. (A) Autosomes, V[H1,g,δA]. (B) X chromosomes in a female, V[H1,g,fX]. (C) X chromosomes in a male, V[H1,g,mX]. Each panel considers positive assortative mating (greens), negative assortative mating (purples), and random mating (black dashed). The plots use eqs. (20), (21), and (29)–(32). We fix c11 = c22 and c1h = 0, with s1f=s1m=s2f=s2m=0.2. For initial conditions, s1,0f=s1,0m=s2,0f=s2,0m=0.5 and c11,0 = 0. Positive assortative mating increases the variance relative to a randomly mating population, whereas negative assortative mating decreases it.

These limiting variances often differ from those of corresponding randomly mating populations with the same contributions. For the autosomes (Fig. 5a) and the X chromosome (Figs. 5b,c), the variance has a linear relationship with cii, with positive values increasing the variance and negative values decreasing it. The limits of the autosomal and X-chromosomal variances in admixture depend on the ongoing sex-specific contributions from the source populations, s1f,s1m,s2f,s2m, and on the assortative mating parameters cij. The limits do not depend on the contributions or assortative mating parameters in the founding generation, s1,0f,s1,0m,s2,0f,s2,0m,c11,0.

6.2. Positive assortative mating can increase or decrease the variance relative to a corresponding randomly mating population

We next consider the effect of assortative mating on the variance of admixture compared to a corresponding randomly mating population with the same contribution parameters. We demonstrate that the behavior of the variance depends on the nature of the assortative mating parameters. That is, positive assortative mating can either increase or decrease the variance relative to a corresponding randomly mating population.

First, we consider assortative mating in which preference for one’s own population is the largest of the three preferences, and, for source populations, preference for the other source population is the smallest of the three. Additionally, all three populations, S1, H, S2, experience the same direction of mating preference. For positive assortative mating, this scenario amounts to c11 > c1h > c12, c22 > ch2 > c12 and chh > 0. For negative assortative mating, we have c11 < c1h < c12, c22 < ch2 < c12, and chh < 0. The variance of X-chromosomal admixture in males depends on the contribution parameters and the limiting admixture in females, but not on the assortative mating parameters beyond their role in female admixture (eq. (47)). Therefore, we consider the variances only of autosomal admixture and X-chromosomal admixture in females.

6.2.1. Positive assortative mating for each population

Here, we show that for positive assortative mating, eq. (45) can be written as the sum of a positive quantity and the limit of the variance of admixture for autosomes under random mating. That is,

limgVH1,g,δAVRMH1,g,δA=c112+c1hs11h+chh2s11h20.

We recall that the quantity s11h is the limit of the expectation of autosomal ancestry, and therefore takes its value in (0, 1) (Verdu and Rosenberg, 2011, eq. 31). We can rewrite the right-hand side of eq. (48) as

fA(c11,c1h,chh)=c112+c1hD1+chh2D12.

where D1 is a given constant in (0, 1). Under this assortative mating scenario, we can rewrite eq. (6) to give a range for c1h of (c112,c11) and, by our definition, we have c11, chh > 0 for positive assortment. Because f is linear in c1h, its minimum in terms of c1h occurs at the boundary of the range of c1h. Substituting for c1h its lower bound, c112, we have

fAc11,c1h,chh=c112c112D1+chh2D12=c1121D1+chh2D12. (48)

This quantity is positive because D1 takes its values in (0, 1), and c11, chh > 0. Therefore, we have demonstrated that this scenario of positive assortative mating increases the variance of autosomal admixture relative to a randomly mating population with the same contribution parameters.

Similarly, for X-chromosomal ancestry in females, we can show that for positive assortative mating, eq. (46) can be written as the sum of a positive quantity and the limit of the variance of the female X-chromosomal admixture under random mating. That is,

limg(V[H1,g,fX]VRM[H1,g,fX])=c112+c1h2(P3+s1f+hfP3)+chh2P3(s1f+hfP3)0. (49)

We recall that the quantities P3 and s1f+hfP3 are the limits of the expectations of female X-chromosomal, and male X-chromosomal ancestry, respectively, and therefore take their values in (0, 1) (Goldberg and Rosenberg, 2015, Appendix). We can rewrite eq. (49) as

fX(c11,c1h,chh)=c112+c1h2D2+chh2D3,

with D2 ∈ (0, 2) and D3 ∈ (0, 1). These upper bounds arise from the fact that 0 < P3 < 1 and 0<s1f+hfP3<1 (Goldberg and Rosenberg, 2015, Appendix). As in the cause for autosomal admixture above, f is linear in c1h. Therefore, its minimum in terms of c1h occurs at the boundary of the range of c1h. Substituting for c1h its lower bound, c112, we have

fX(c11,c1h,chh)=c112c114D2+chh2D3=c112(1D22)+chh2D3. (50)

For this scenario of positive assortative mating, we can see that f is always positive because c11, chh > 0 by definition, and D2 ∈ (0, 2) and D3 ∈ (0, 1).

6.2.2. Negative assortative mating for each population

A similar argument can be made for negative assortative mating. Under a scenario of intermediate preference for hybrids, we can rewrite eq. (6) to give a range for c1h(c11,c112) and c11,chh < 0 for negative assortment. To show that the maximum value fA can take is less than zero, we use the upper bound of c1h, which is equal to c112, and is a positive value because c11 < 0. Using this bound gives the same quantity seen in eq. (48); under negative assortative mating, fA is negative because D takes its values in (0, 1), and c11,chh < 0. Next, considering female X-chromosomal admixture, we can see that fX is negative for all parameter values under this scenario of negative assortative mating. Substituting the lower bound of c1h=c112 into (49), we have the same quantity seen in (50). For negative assortative mating, we have c11,chh < 0, and D2 ∈ (0, 2), D3 ∈ (0, 1). Therefore, fX is negative for all parameter values in this negative assortative mating scenario.

In sum, when all populations experience positive assortative mating and the preference for one’s own population is the strongest preference, we see that the variances of autosomal and X-chromsomal ancestry can be written as a positive term plus the corresponding variance for a randomly mating population. Similarly, for negative assortment, the variances in equations (45) and (46) can be written as a negative term plus the variance in a corresponding randomly mating population. That is, holding the contributions (s1f,s1m,s2f,s2m) constant, positive assortative mating increases the variance of admixture and negative assortative mating decreases the variance relative to random mating.

6.2.3. Positive mating with respect to S1

However, we can think of a more general type of assortative mating instead, allowing assortative mating to be positive in one population and random or negative in the other two populations. That is, we consider positive assortative mating with respect to population i as cii > cij, cji for ij, and negative assortative mating with respect to population i as cii < cij, cji for ij. Notably, these conditions are not mutually exclusive; positive assortative mating might occur in one population at the same time as negative assortative mating in another population. That is, in contrast to the assortative mating scheme above, we allow cases in which c11 > 0 while c22 < 0 within the definition of positive assortative mating with respect to S1.

Under this type of positive assortative mating, we allow cases in which c11 > c12 > c1h, and expand the range of c1h to include c1h(2c11,c112) with c11 > 0. Therefore, eqs. (48) and (50) are no longer strictly positive, and they depend on the relative values of the assortative mating parameters as well as on the contribution parameters producing the expectation of admixture. Similarly, for negative assortative mating, c1h(c112,2c11) with c11, chh < 0, and eqs. (48) and (50) are not strictly negative.

Figure 6 plots an example of positive assortative mating for individuals from S1 and H, but negative assortative mating in S2. First, in green, we see an example with positive assortative mating in all populations, with c11 = chh = 0.02 and c22 = 0.06. In this scenario, the variance of admixture is higher than in the corresponding randomly mating population (black dashed line). Maintaining c11 = chh = 0.02, but now setting c22 = −0.04, the purple line plots a case in which positive assortative mating in S1 is overwhelmed by negative assortative mating in S2, producing a variance of admixture that is less than in the randomly mating population with the same contribution parameters.

Figure 6:

Figure 6:

Example of the variance of ancestry under a constant-admixture scenario with assortative mating in producing g ≥ 2 when populations are allowed to differ in the direction of their mating preference. (A) Autosomes, V[H1,g,δA]. (B) X chromosomes in a female, V[H1,g,fX]. (C) X chromosomes in a male, V[H1,g,mX]. In all scenarios, populations S1 and H experience positive assortative mating, c11,chh = 0.02. Green curves show a scenario in which S2 also experiences positive assortative mating, c22 = 0.06; purple curves show a scenario in which S2 experiences negative assortative mating, c22 = −0.04. The scenario of random mating with the same contribution parameters is the black dashed line. The plots use eqs. (20), (21), and (29)–(32). We set s1f=s1m=s2f=s2m=0.2, and use initial conditions s1,0f=s1,0m=s2,0f=s2,0m=0.5 and c11,0 = 0. In this example, negative assortative mating in S2 offsets the positive assortative mating in S1 and H, producing lower variances of admixture in S1, V[H1,g,δγ], in the assortatively mating population than in the randomly mating population.

7. Discussion

7.1. Summary of results

A recent series of papers has devised mechanistic models to consider admixture over time (Verdu and Rosenberg, 2011; Goldberg et al., 2014; Goldberg and Rosenberg, 2015), assuming that the parents of a randomly chosen individual from the admixed population are chosen independently. We have extended these models to consider the effect on the distribution of admixture of preferential mating based on population of origin. Although the mean autosomal and X-chromosomal admixture fractions depend only on the sex-specific contributions from the source populations (eqs. (11) and (25)), we have found that the variances of admixture contain signatures of assortative mating (eqs. (21), (31), and (32)).

For both autosomes and X chromosomes, when all three populations experience positive assortative mating, the variance of admixture always increases relative to a randomly mating population with the same contributions (Fig. 5). Similarly, when all three populations experience negative assortative mating, the variance of admixture always decreases. Because an individual’s autosomal admixture fraction is the average of its parents’ admixture fractions, in the absence of new contributions from the source populations, random pairings of different admixture fractions decrease the variance of admixture every generation (Goldberg and Rosenberg, 2015). However, positive assortative mating decreases the proportion of matings that occur between different admixture fractions, with more low-low and high-high pairings maintaining a wide spread in the distribution of admixture. Similarly, negative assortative mating increases the proportion of matings that occur between pairs with different admixture fractions, thereby decreasing the spread of the admixture fraction distribution.

Assortative mating has a smaller effect on the variance of X-chromosomal admixture than on the variance of autosomal admixture. Conditional on the distribution of ancestry in the previous generation, the variance of X-chromosomal admixture in a male chosen at random from the admixed population is equivalent for randomly mating and assortatively mating populations (eq. (40)) because it is determined by the female X-chromosomal ancestry only, not by the ancestries of both members of a mating pair. Hence, the effect of assortative mating on the variance of X-chromosomal admixture is seen only in females (eq. (39)).

Our model is flexible, allowing assortative mating to occur in different directions and strengths in the three populations. That is, positive assortative mating with respect to S1 can occur at the same time as random mating or negative assortative mating in H and S2. We found that the effect of assortative mating on the variance of admixture depends on trade-offs in the preferences among the three populations. For both autosomes and the X chromosome, certain scenarios of assortative mating with different directions of mating preference in different populations change the direction of the effect of assortative mating on the variance of admixture (Fig. 6).

7.2. Comparisons to previous models of assortative mating

Focusing on admixed populations, we chose to model mating preferences by population of origin. This type of assortative mating captures scenarios in which the admixed population is a distinct group, and preferences follow population membership rather than any specific trait, ancestry, or genotype. Different populations might have different geographical locations, trait preferences, or host preferences. For example, scenarios of preference for or against new migrants compared to other admixed individuals have been hypothesized in swordfish, butterflies, and grasshoppers (Ritchie et al., 1989; Howard, 1993; Duenez-Guzman et al., 2009; Melo et al., 2009; Schumer et al., 2017). Similarly, in admixed human populations, language barriers or social identities may produce population-based preferences (Qian, 1997; Jacobson et al., 2004; Ruiz-Linares et al., 2014).

Our model of assortative mating by population recapitulates patterns seen in other assortative mating models in settings that do not consider admixture. Increased variance in a phenotype, with no change in the mean, has been observed in classic models of positive assortative mating by a single-locus genotype or a quantitative trait value without admixture (Jennings 1916, pgs. 66–68; Fisher 1918, pgs. 410–414; Wright 1921, pgs. 153–155; Crow and Kimura 1970, ch. 4). These studies demonstrate that under a model of assortative mating by genotype, positive assortative mating produces an increase in the proportion of homozygotes compared to heterozygotes, analogous to the excess proportion of admixture fractions equaling zero and one seen in our model. For the X chromosome, studies of the distribution of traits and the correlation of relatives under assortative mating by trait loci also find that assortative mating influences X chromosomes in females more than in males (Risch, 1979; Yengo and Visscher, 2018).

Empirical examples of the correlation in ancestry between members of mating pairs in admixed populations motivated Zaitlen et al. (2017) to model mating preferences as a fixed correlation in admixture fraction between mates in the admixed population. Extending a Wright-Fisher model, Zaitlen et al. (2017) found a similar inflation of the variance of admixture under positive assortative mating and a decrease in the variance of admixture with negative assortment. In their model, mating preferences have the same direction and strength for all individuals; therefore, their model does not allow for the cases we examine in which the variance of admixture decreases under positive assortative mating with respect to some populations and negative assortative mating in others.

7.3. Sex-specific assortative mating

For most of our analysis, we have assumed that mating preferences were symmetric with respect to sex, cij = cji. However, our general expressions, eqs. (23) and (39), for the autosomal and X-chromosomal variances, respectively, permit this assumption to be relaxed. One observation is that in eq. (23), the autosomal variance depends on the mean of c1h and ch1. In contrast, for the X chromosome in eq. (39), the variance is a function of the more complex ch1E[H1,g1,fX] and c1hE[H1,g1,mX]. The impact of the sex-specific mating preferences is weighted by the sex-specific contributions in the previous population. The effect of relaxing the symmetry could potentially be investigated using similar numerical studies to those we have used for the symmetric case.

7.4. Sex-biased admixture and assortative mating

We also assumed equal female and male contributions from each of the two source populations, s1,gf=s1,gm and s2,gf=s2,gm, for some of our analyses. However, sex bias and assortative mating may occur in the same population (Bryc et al., 2015; Zou et al., 2015). Our work does allow consideration of different sex-specific contributions from the sources. Goldberg and Rosenberg (2015) examined the effect of sex-biased admixture on the mean X-chromosomal and autosomal admixture in the case of random mating, cij = 0 for all i and j. When estimating sex bias levels from data, because assortative mating does not influence the mean admixture levels, inclusion of assortative mating would not change estimates of sex bias obtained from mean X-chromosomal and autosomal admixture levels.

7.5. Assortative mating can bias inference of the timing of admixture

The variance of admixture is informative about the timing of admixture and has been used to study the timing of admixture in human populations (Verdu and Rosenberg, 2011; Gravel, 2012; Goldberg et al., 2014; Liang and Nielsen, 2014; Verdu et al., 2014; Zaitlen et al., 2017). Because assortative mating often changes the variance of admixture, estimating the timing of admixture without accounting for assortative mating may lead to bias in estimates of the timing of admixture.

For example, considering autosomal admixture under the case of a single admixture event, we can rewrite eq. (41) to consider how assortative mating affects this inference in the case of a single admixture event. For an observed variance of autosomal admixture V A, we have

g=ln s1,0f1s1,0f+s1,0m1s1,0m+2c11,0ln VAln 21. (51)

The estimated timing of admixture, g, is directly related to the assortative mating constant c11,0. Therefore, fixing s1,0f,s1,0m, and V A, positive assortative mating increases g relative to random mating. That is, failing to account for positive assortative mating makes admixture appear more recent than the true admixture event. This observation follows from the increase in variance under positive assortative mating compared to random mating: because the variance decreases over time, a fixed variance persisting due to positive assortative mating might be misinterpreted as a more recent admixture under random mating. Negative assortative mating has the opposite effect, in that failing to account for negative assortative mating produces an overestimate of g relative to the true level.

Figure 7 plots estimates of g according to eq. (51) for varying levels of c11,0, considering both positive and negative assortative mating, specifying V A = 0.0625, and s1,0f=s1,0m=0.5 under a single admixture event. For a randomly mating population, an autosomal variance of 0.0625 implies an admixture event 2 generations ago under a single-admixture model with s1,0f=s1,0m=0.5. However, when assortative mating is allowed, the admixture event might have taken place in the current generation or up to 3 generations ago, with the estimated g monotonically increasing in c11,0. When c11,0 ≥ 0, the age of admixture is greater than in a randomly mating population with the same variance and contributions. Conversely, when c11,0 ≤ 0, estimates of g are smaller than in a corresponding randomly mating population.

Figure 7:

Figure 7:

Timing of admixture under assortative mating. The number of generations since admixture, g, for a single admixture event is plotted as a function of c11,0 using the variance in autosomal admixture, eq. (51), with V A = 0.0625 and s1,0s=s1,0m=0.5. The plot traverses the range of possible c11,0 values for g ≥ 0. Recalling that the values of the cij,0 are bounded such that the probability of each given parental pairing takes its values in the interval [0, 1], and such that each probability is no greater than the probability of one of its constituent components, for c11,0 > 0, we have c11,0min(s1,0f,s1,0m)s1,0fs1,0m. In this case, c11,0 ≤ 0.25. For c11,0 < 0, we have c11,0s1,0fs1,0m. In this case, we have c11,0 ≥ −0.25. However, g cannot be negative; therefore, in this case, we truncate the domain at the value of c11,0 that produces g = 0. The value of g for a randomly mating population with the same contributions and variance is shown in the dashed line.

The variance-inflating effect of positive assortative mating on the admixture level is similar to that described by Zaitlen et al. (2017) for the magnitude of ancestry linkage disequilibrium. However, because different scenarios of assortative mating affect the variance of admixture in different directions, under our model, we show that failing to account for assortative mating can either underestimate or overestimate the timing of admixture, depending on whether assortative mating is positive or negative.

Acknowledgments

We acknowledge support from a Miller Fellowship and NIH R35 GM133481 to AG, and NIH R01 HG005855 and NSF BCS-1515127 to NAR.

Appendices

A. Extrema of V[H1,g,δX] for a single admixture event in a randomly mating population

This appendix derives the extrema of V[H1,g,fX] and V[H1,g,mX] as functions of the contribution parameters S1,0f and S1,0m for a randomly mating population founded in a single admixture event with no further contributions. The corresponding extrema for the variance of autosomal admixture were previously analyzed by Goldberg et al. (2014). Here, we use the model of Goldberg and Rosenberg (2015) to study the variance of X-chromosomal admixture in a randomly mating population (c11,0 = 0). We perform this computation for completeness, to provide parallel results for the X-chromosomal case to those derived by Goldberg et al. (2014) for the autosomes.

Figure A1 plots the variance of X-chromosomal admixture in females using eq. (44) with c11,0 = 0, for multiple values of g. As g increases, the admixed population mixes with no further contributions from the source populations, and the variance decreases for all values of S1,0f and s1,0m.

From eq. (44), at c11,0 = 0, the maximum of V[H1,g,fX] in terms for S1,0f and s1,0m occurs when s1,0f(1s1,0f) and s1,0m(1s1,0m) are separately maximized. That is, the maximum occurs when s1,0f=s1,0m=s1,0=12.

For the minimal variance, which equals 0, equation (44) equals 0 at four points: (s1,0f,s1,0m)=(0,0),(0,1),(1,0),(1,1). The locations of the global maximum and the four minima are the same as those seen for the autosomal variance by Goldberg et al. (2014). If only a single population contributes to the hybrid population, (s1,0f,s1,0m)=(0,0) or (1, 1), then no variation exists in the population because all individuals have the same admixture fraction of 0 or 1. Similarly, when all males are from one source population and all females are from the other, so that (s1,0f,s1,0m)=(0,1) or (1, 0), no variation exists because all individuals have an admixture fraction of 12.

Next, we can find the critical points of the X-chromosomal variance for a given value of s1,0, permitting S1,0f and s1,0m to vary. Recalling eq. (2), we rewrite the variance of X-chromosomal admixture (eq. (44), with c11,0 = 0) as a function of S1,0f and s1,0. We have

V[H1,g,fX]=Ags1,0f(1s1,0f)+Ag1[(2s1,0s1,0f)(12s1,0+s1,0f)]4g (52)

Setting the first derivative of eq. (52) with respect to s1,0f equal to 0, we find that the critical point of the variance of X-chromosomal admixture occurs at

s1,0f=Ag+Ag1(4s1,01)2Ag+2Ag1. (53)

This critical point is a maximum because the second derivative of eq. (52) is negative, −2(Ag +Ag−1).

Following the same procedure, we can write the variance of X-chromosomal admixture in females as a function of S1,0m and s1,0. We have

V[H1,g,fX]=Ag(2s1,0s1,0m)(12s1,0+s1,0m)+Ag1s1,0m(1s1,0m)4g,

producing a maximum at

s1,0m=Ag1+Ag(4s1,01)2Ag+2Ag1, (54)

We also have the constraints that S1,0f and S1,0m take their values in [0, 1]. Eqs. (53) and (54) lie within this interval when both of the following pairs of inequalities hold:

Ag1Ag4Ag1s1,0fAg+3Ag14Ag1 (55)
AgAg14Ags1,0m3Ag+Ag14Ag. (56)

Eq. (55) is always satisfied, as Ag > Ag−1, so that the left hand side of the inequality is negative and the right hand side exceeds 1. Therefore, the maximum always occurs when S1,0f takes the value in eq. (53). Wheneq. (56) is satisfied, the maximum occurs when s1,0m follows eq. (54), and when s1,0m is outside the bounds in eq. (56), the maximum occurs at s1,0m equal to 0 or 1.

Next, we find the minima of the variance of the female X-chromosomal admixture fraction with respect to s1,0f and s1,0m for specified s1,0. Eqs. (53) and (54) are quadratic with a single critical point that is a maximum. Therefore, the minima of the variance of X-chromosomal admixture must lie along the boundary of the interval in which s1,0f and s1,0m take their values, [0, 1].

For s1,0 ∉ { 0, 1}, the minimum variance is on the boundary of parameter values, 0s1,0f,s1,0mmin(1,2s1,0). Specifically, for 0s1,012, the minimum variance occurs when (s1,0f,s1,0m)=(0,2s1,0). Using eq. (44), we see that because Ag > Ag−1, the variance of the female X-chromosomal admixture fraction is smaller when (s1,0f,s1,0m)=(0,2s1,0) than when (s1,0f,s1,0m)=(2s1,0,0). Therefore, unlike for the autosomes, the minima are not symmetric for the X chromosome (Figure A1, for g > 1). Specifically, because Ag is monotonically increasing, variance depends to a greater extent on s1,0f than on S1,0m, and the minimum variance for a specific s1,0 occurs when s1,0 = 0, but not s1,0m=0. Similarly, for 12s1,01, the minimum variance occurs when (s1,0f,s1,0m)=(1,12s1,0). Note that for s1,0=12, both (s1,0f,s1,0m)=(1,0) and (s1,0f,s1,0m)=(0,1) are minima.

Figure A1:

Figure A1:

The variance of ancestry as a function of the sex-specific contributions for a single admixture event in a randomly mating population. (A) g = 1. (B) g = 2. (C) g = 3. (D) g = 8. In each panel, V[H1,g,fX] is plotted over the range of permissible values of s1,0f and S1,0m, using eq. (44).

B. Closed-form expression and temporal limit for V[H1,g,δA] under constant contributions over time in an assortatively mating population

In this appendix, we use the closed form of the variance of autosomal admixture for a randomly mating population, derived by Goldberg et al. (2014), to write an expression for limgV[H1,g,δA] under assortative mating.

Goldberg et al. (2014) derived expressions for E[H1,g1,δA] and VRM[H1,g,δA] (eq. 37 and eqs. 5152, respectively). We substitute these equations in eqs. (22) and (23) to produce a closed form for the variance under assortative mating by population. We have

VH1,g,δA=c11,02+VRMH1,g,δA,g=1c112+c1h+ch12s1,0hg1+s11hg11h+chh2s1,0hg1+s11hg11h2+VRMH1,g,δA,g1, (57)

where VRM[H1,g,δA] is given by eqs. 51 and 52 from Goldberg et al. (2014).

For the limit as g → ∞, we recall eq. 53 from Goldberg et al. (2014),

limgVRM[H1,g,δA]=s1+s1fs1m2h+s1(s1fhm+hfs1m)(1h)(2h)+s12hfhm(1h)2(2h)(s11h)2.

Therefore, taking the limit as g → ∞ of the variance of autosomal admixture in eq. (57), we have

limgV[H1,g,δA]=c112+c1h+ch12(s11h)+chh2(s11h)2+limgVRM[H1,g,δA], (58)

Finally, assuming assortative mating is the same for the two sexes, cij = cji, we have eq. (45).

C. Closed form expression and temporal limit for V[H1,g,δX] under constant contributions over time in a randomly mating population

Here, we find an expression for the limit of the variance of X-chromosomal admixture as g → ∞ in a randomly mating population. We follow the structure of Appendix 1 from Goldberg and Rosenberg (2015). Using eqs. 34 from Goldberg and Rosenberg (2015) and eqs. (33)–(35) above, we can rewrite the variance of the X-chromosomal admixture fraction in females for the case of constant admixture. For g = 1, we have equation (33). For g = 2, we have

VRMH1,2,fX=hms1,0f+s1m1hms1,0fs1m+s1f4+hfs1,0+s1,0fs1,0m8s1m+hfs1,022. (59)

For g ≥ 3, we rewrite the female X-chromosomal variance as a second order recursion using equations (35) and (36). We have

VRM[H1,g,fX]=s1f(1s1f)(1+hm)+s1m(1s1m)4s1fhf2E[H1,g1,fX]s1mhm2E[H1,g1,mX]+hf(1hf)4(E[H1,g1,fX])2+hm(1hm)4(E[H1,g1,mX])2+hm4[2s1fhfE[H1,g2,fX]+hf(1hf)(E[H1,g2,fX])2]+hf4V[H1,g1,fX]+hfhm4V[H1,g2,fX].

We simplify the notation by defining zg=V[H1,g,fX]. For g ≥ 3, we have

zg=d1,g+d2zg1+d3zg2, (60)

with

d1,g=s1f(1s1f)(1+hm)+s1m(1s1m)4s1fhf2E[H1,g1,fX]s1mhm2E[H1,g1,mX]+hf(1hf)4(E[H1,g1,fX])2+hm(1hm)4(E[H1,g1,mX])2+hm4[2s1fhfE[H1,g2,fX]+hf(1hf)(E[H1,g2,fX])2], (61)
d2=hf4, (62)
d3=hfhm4, (63)

In eq. (61), d1,g depends on g through the expectation of X-chromosomal admixture. However, we can use closed-form solutions for E[H1,g,fX] and E[H1,g,mX] (Goldberg and Rosenberg, 2015, eqs. 17 and 18).

From eqs. (33) and (59), respectively, we have

z1=s1,0f(1s1,0f)+s1,0m(1s1,0m)+2c11,04
z2=(hms1,0f+s1m)(1hms1,0fs1m)+s1f4+hf(s1,0+s1,0fs1,0m)8(s1f+hfs1,02)2.

We define a generating function Z(x)=g=1zgxg whose coefficients zg represent the values of V[H1,g,fX] in each generation. As the admixed population does not yet exist in generation 0, V[H1,g,fX] and Z(x) are undefined for g = 0. For convenience, we therefore work with W(x) = Z(x)/x, setting wg = zg+1 for g ≥ 0. We then have

W(x)=g=0zg+1xg=g=0wgxg, (64)

and wg = d1,g + d2wg−1 + d3wg−2 for g ≥ 2 by eq. (60). Using eq. (64), it follows that

W(x)=w0+w1x+g=2(d1,g+d2wg1+d3wg2)xg=w0+w1x+d1,gg=2xg+d2xg=2wg1xg1+d3x2g=2wg2xg2=w0+w1x+d1,g(x21x)+d2x[W(x)w0]+d3x2W(x).

Solving for W(x), we have

W(x)=(w0+w1xd2w0x)(1x)+d1,gx2(1x)(1d2xd3x2). (65)

We can decompose the expression in eq. (65), producing

W(x)=Q1,g1r1x+Q2,g1r2x+Q3,g1x, (66)

where r1 and r2 are reciprocals of the two roots of 1 − d2xd3×2,

r1=2d3d2+d22+4d3=hf+(hf)2+16hfhm8 (67)
r2=2d3d2d22+4d3=hf(hf)2+16hfhm8. (68)

Setting eq. (65) equal to eq. (66), we have

Q1,g=(w0r1+w1d2w0)(r11)+d1,g(1r1)(r2r1)Q2,g=(w0r2+w1d2w0)(r21)+d1,g(1r2)(r1r2)Q3,g=d1,g(1r1)(1r2). (69)

The Taylor expansion of eq. (66) around x = 0 then gives

W(x)=Q1,gg=0r1gxg+Q2,gg=0r2gxg+Q3,gg=0xg,=q=0(Q1,gr1g+Q2,gr2g+Q3,g)xg.

Therefore, for g0,wg=Q1,gr1g+Q2,gr2g+Q3,g, and the closed-form expression for the X-chromosomal female mean admixture fraction in generation g1,V[H1,g,fX], is

zg=Q1,gr1g+1+Q2,gr2g+1+Q3,g. (70)

Because for hf, hm ∈ [0, 1], r1 monotonically increases in hf and hm and r2 monotonically decreases, the maxima and minima of r1 and r2 over permissible values of hf and hm occur at the boundaries of the closed interval [0, 1]. Using eqs. (67) and (68), we have r1[0,1+178] and r2[1178,0]; we exclude (r1,r2)=(1+178,1178), as hf and hm cannot both be 1. Because |r1|,|r2| < 1, the variance of the female X-chromosomal admixture fraction in eq. (70) approaches a limit as g → ∞. Using eqs. (70) and (36), we can find expressions for the limit of the variance of the X-chromosomal admixture fractions,

limgVRM[H1,g,fX]=Q3 (71)
limgVRM[H1,g,mX]=s1,g1f(1s1,g1f)2s1,g1fhg1fP3+hg1f(1hg1f)P32+hg1fQ3, (72)

where P3 is the limit of the mean female X-chromosomal admixture fraction (Goldberg and Rosenberg, 2015, eq. 19), and Q3 = limg→∞ Q3,g. Rewriting P3 here, we have

P3=s1f+s1m+s1fhms1f+s1m+s2f+s2m+hm(s1f+s2f). (73)

Q3 is obtained by substituting E[H1,g,fX] and E[H1,g,mX] with their limits, P3 and s1f+hfP3, respectively, into eq. (69). Using eqs. (61)–(63) to simplify, we have

Q3=s1f(1s1f)+s1m(1s1m)+s1fhm(1s1fhm)2s1fhf(1+hm+(hm)2)P3+[hf(1hf)+hfhm(1hfhm)]P324(1r1)(1r2).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Bryc K, Durand EY, Macpherson JM, Reich D, and Mountain JL, 2015. The genetic ancestry of African Americans, Latinos, and European Americans across the United States. American Journal of Human Genetics 96: 37–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Chakraborty R and Weiss KM, 1988. Admixture as a tool for finding linked genes and detecting that difference from allelic association between loci. Proceedings of the National Academy of Sciences 85: 9119–9123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Crow JF and Kimura M, 1970. An Introduction to Population Genetics Theory. Minneapolis: Burgess. [Google Scholar]
  4. Duenez-Guzman EA, Mavárez J, Vose MD, and Gavrilets S, 2009. Case studies and mathematical models of ecological speciation. 4. Hybrid speciation in butterflies in a jungle. Evolution 63: 2611–2626. [DOI] [PubMed] [Google Scholar]
  5. Fisher RA, 1918. The correlation between relatives on the supposition of Mendelian inheritance. Trans. R. Soc. Edinb pp. 399–433. [Google Scholar]
  6. Goldberg A and Rosenberg NA, 2015. Beyond 2/3 and 1/3: the complex signatures of sex-biased admixture on the X chromosome. Genetics 201: 263–279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Goldberg A, Verdu P, and Rosenberg NA, 2014. Autosomal admixture levels are informative about sex bias in admixed populations. Genetics 198: 1209–1229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Gravel S, 2012. Population genetics models of local ancestry. Genetics 191: 607–619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Howard DJ, 1993. Reinforcement: origin, dynamics, and fate of an evolutionary hypothesis In: Hybrid Zones and the Evolutionary Process, Edited by Harrison R. Oxford University Press, New York. [Google Scholar]
  10. Jacobson CK, Amoateng AY, and Heaton TB, 2004. Inter-racial marriages in South Africa. Journal of Comparative Family Studies 35: 443–458. [Google Scholar]
  11. Jennings HS, 1916. The numerical results of diverse systems of breeding. Genetics 1: 53–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Liang M and Nielsen R, 2014. The lengths of admixture tracts. Genetics 197: 953–967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Liang M and Nielsen R, 2014. Understanding admixture fractions. bioRxiv: 008078. [Google Scholar]
  14. Lindtke D and Buerkle CA, 2015. The genetic architecture of hybrid incompatibilities and their effect on barriers to introgression in secondary contact. Evolution 69: 1987–2004. [DOI] [PubMed] [Google Scholar]
  15. Loh P-R, Lipson M, Patterson N, Moorjani P, Pickrell JK, Reich D, and Berger B, 2013. Inferring admixture histories of human populations using linkage disequilibrium. Genetics 193: 1233–1254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Long JC, 1991. The genetic structure of admixed populations. Genetics 127: 417–428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Melo MC, Salazar C, Jiggins CD, and Linares M, 2009. Assortative mating preferences among hybrids offers a route to hybrid speciation. Evolution 63: 1660–1665. [DOI] [PubMed] [Google Scholar]
  18. Ni X, Yang X, Guo W, Yuan K, Zhou Y, Ma Z, and Xu S, 2016. Length distribution of ancestral tracks under a general admixture model and its applications in population history inference. Scientific Reports 6: 20048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Parra FC, Amado RC, Lambertucci JR, Rocha J, Antunes CM, and Pena SDJ, 2003. Color and genomic ancestry in brazilians. Proceedings of the National Academy of Sciences 100: 177–182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Pfaff CL, Parra EJ, Bonilla C, Hiester K, McKeigue PM, Kamboh MI, Hutchinson RG, Ferrell RE, Boerwinkle E, and Shriver MD, 2001. Population structure in admixed populations: effect of admixture dynamics on the pattern of linkage disequilibrium. American Journal of Human Genetics 68: 198–207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Pool JE and Nielsen R, 2009. Inference of historical changes in migration rate from the lengths of migrant tracts. Genetics 181: 711–719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Qian Z, 1997. Breaking the racial barriers: Variations in interracial marriage between 1980 and 1990. Demography 34: 263–276. [PubMed] [Google Scholar]
  23. Risch H, 1979. The correlation between relatives under assortative malting for an X-linked and autosomal trait. Annals of Human Genetics 43: 151–165. [PubMed] [Google Scholar]
  24. Risch N, Choudhry S, Via M, Basu A, Sebro R, Eng C, Beckman K, Thyne S, Chapela R, Rodriguez-Santana JR, et al. , 2009. Ancestry-related assortative mating in Latino populations. Genome Biology 10: R132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Ritchie MG, Butlin RK, and Hewitt GM, 1989. Assortative mating across a hybrid zone in Chorthippus parallelus (Orthoptera: Acrididae). Journal of Evolutionary Biology 2: 339–352. [Google Scholar]
  26. Ruiz-Linares A, Adhikari K, Acuña-Alonzo V, Quinto-Sanchez M, Jaramillo C, Arias W, Fuentes M, Pizarro M, Everardo P, de Avila F, et al. , 2014. Admixture in Latin America: geographic structure, phenotypic diversity and self-perception of ancestry based on 7,342 individuals. PLoS Genetics 10: e1004572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Schumer M and Brandvain Y, 2016. Determining epistatic selection in admixed populations. Molecular Ecology 25: 2577–2591. [DOI] [PubMed] [Google Scholar]
  28. Schumer M, Cui R, Rosenthal GG, and Andolfatto P, 2015. Reproductive isolation of hybrid populations driven by genetic incompatibilities. PLoS Genetics 11: e1005041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Schumer M, Powell DL, Delclós PJ, Squire M, Cui R, Andolfatto P, and Rosenthal GG, 2017. Assortative mating and persistent reproductive isolation in hybrids. Proceedings of the National Academy of Sciences 114: 10936–10941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Sebro R, Hoffman TJ, Lange C, Rogus JJ, and Risch NJ, 2010. Testing for non-random mating: evidence for ancestry-related assortative mating in the Framingham heart study. Genetic Epidemiology 34: 674–679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Sedghifar A, Brandvain Y, and Ralph P, 2016. Beyond clines: lineages and haplotype blocks in hybrid zones. Molecular Ecology 25: 2559–2576. [DOI] [PubMed] [Google Scholar]
  32. Sedghifar A, Brandvain Y, Ralph P, and Coop G, 2015. The spatial mixing of genomes in secondary contact zones. Genetics 201: 243–261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Verdu P, Pemberton TJ, Laurent R, Kemp BM, Gonzalez-Oliver A, Gorodezky C, Hughes CE, Shattuck MR, Petzelt B, Mitchell J, et al. , 2014. Patterns of admixture and population structure in native populations of northwest north america. PLoS Genetics 10: e1004530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Verdu P and Rosenberg NA, 2011. A general mechanistic model for admixture histories of hybrid populations. Genetics 189: 1413–1426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Wang L, Luzynski K, Pool JE, Janoušek V, Dufková P, Vyskočilová MM, Teeter KC, Nachman MW, Munclinger P, Macholán M, et al. , 2011. Measures of linkage disequilibrium among neighbouring SNPs indicate asymmetries across the house mouse hybrid zone. Molecular Ecology 20: 2985–3000. [DOI] [PubMed] [Google Scholar]
  36. Wright S, 1921. Assortative mating based on somatic resemblance. Genetics 6: 144–161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Yengo L and Visscher PM, 2018. Assortative mating on complex traits revisited: Double first cousins and the X-chromosome. Theoretical Population Biology 124: 51–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Zaitlen N, Huntsman S, Hu D, Spear M, Eng C, Oh SS, White MJ, Mak A, Davis A, Meade K, et al. , 2017. The effects of migration and assortative mating on admixture linkage disequilibrium. Genetics 205: 375–383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Zou JY, Park DS, Burchard EG, Torgerson DG, Pino-Yanes M, Song YS, Sankararaman S, Halperin E, and Zaitlen N, 2015. Genetic and socioeconomic study of mate choice in Latinos reveals novel assortment patterns. Proceedings of the National Academy of Sciences 112: 13621–13626. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES