Skip to main content
Genetics logoLink to Genetics
. 2015 Jul 24;201(1):263–279. doi: 10.1534/genetics.115.178509

Beyond 2/3 and 1/3: The Complex Signatures of Sex-Biased Admixture on the X Chromosome

Amy Goldberg 1,1, Noah A Rosenberg 1
PMCID: PMC4566268  PMID: 26209245

Abstract

Sex-biased demography, in which parameters governing migration and population size differ between females and males, has been studied through comparisons of X chromosomes, which are inherited sex-specifically, and autosomes, which are not. A common form of sex bias in humans is sex-biased admixture, in which at least one of the source populations differs in its proportions of females and males contributing to an admixed population. Studies of sex-biased admixture often examine the mean ancestry for markers on the X chromosome in relation to the autosomes. A simple framework noting that in a population with equally many females and males, two-thirds of X chromosomes appear in females, suggests that the mean X-chromosomal admixture fraction is a linear combination of female and male admixture parameters, with coefficients 2/3 and 1/3, respectively. Extending a mechanistic admixture model to accommodate the X chromosome, we demonstrate that this prediction is not generally true in admixture models, although it holds in the limit for an admixture process occurring as a single event. For a model with constant ongoing admixture, we determine the mean X-chromosomal admixture, comparing admixture on female and male X chromosomes to corresponding autosomal values. Surprisingly, in reanalyzing African-American genetic data to estimate sex-specific contributions from African and European sources, we find that the range of contributions compatible with the excess African ancestry on the X chromosome compared to autosomes has a wide spread, permitting scenarios either without male-biased contributions from Europe or without female-biased contributions from Africa.

Keywords: admixture, African-American genetics, mechanistic model, sex bias, X chromosome


COMPARISONS of the X chromosome and the autosomes provide a strategy for understanding the history of sex-biased demography (Ramachandran et al. 2004, 2008; Wilkins and Marlowe 2006; Hammer et al. 2008, 2010; Bustamante and Ramachandran 2009; Keinan et al. 2009; Casto et al. 2010; Emery et al. 2010; Labuda et al. 2010; Lambert et al. 2010; Gottipati et al. 2011; Heyer et al. 2012; Arbiza et al. 2014). Unlike the autosomes, the X chromosome follows a sex-specific inheritance pattern, with females inheriting two copies, one from the mother and one from the father, and males inheriting only a single copy from the mother. As a consequence, demographic differences between females and males—in such phenomena as the breeding population size, the variance of reproductive success, and migration rates—can be studied by examining differences in patterns of genetic variation between X chromosomes and autosomes.

Many of the best-known cases of sex-biased patterns in human demography involve recently admixed populations (Parra et al. 1998, 2001; Bedoya et al. 2006; Wilkins 2006; Lind et al. 2007; Berniell-Lee et al. 2008; Wang et al. 2008; Stefflova et al. 2009; Bryc et al. 2010a,b, 2015; Quintana-Murci et al. 2010; Beleza et al. 2012; Moreno-Estrada et al. 2013; Verdu et al. 2013; Marcheco-Teruel et al. 2014; Salzano and Sans 2014). During the formation of such populations, sex-biased admixture occurs if one or more of the source populations contributes different fractions of the females and males to the admixed group. What patterns are expected for X chromosomes and autosomes in an admixed population formed through a sex-biased admixture process? An initial hypothesis, reflecting the fact that in a population with equally many females and males, two-thirds of the X chromosomes are in females and one-third are in males, is that if s1f is the fraction of females originating from population 1 and s1m is the fraction of males originating from population 1, then the fraction of ancestry from population 1 for a site on the X chromosome is

H1X=23s1f+13s1m (1)

(Lind et al. 2007). This simple linear combination has been used to estimate the sex-specific contributions from females and males to African-American and Latino populations (Lind et al. 2007; Bryc et al. 2015). As we will show, however, it presumes a very specific history for the admixture process, a history often not viewed as reasonable for practical admixture scenarios.

Here, we extend a two-sex mechanistic admixture model (Goldberg et al. 2014) to incorporate the genetic signatures of sex-specific admixture patterns on the X chromosome. We derive a recursive expression for the expectation of the X-chromosomal admixture fraction as a function of sex-specific admixture parameters, demonstrating that the X-chromosomal admixture is obtained from a more complex formula than in the simple 2/3 and 1/3 weighting (Equation 1). The limiting mean X-chromosomal admixture is a predictable function of female and male contributions from the source populations, but among cases we consider, has the 2/3 and 1/3 weighting only for a single admixture event that takes place at a single point in time. During the approach to this limit, the behavior of the mean X-chromosomal admixture is dependent on the time since admixture. For a single admixture event and for constant ongoing admixture, we characterize the difference between the limit and the mean X-chromosomal admixture under the model as a function of time. We reinterpret admixture patterns in recently admixed African-Americans, demonstrating that consideration of both the admixture model and the time since admixture is important for estimating sex-specific admixture contributions.

A Mechanistic, Sex-Specific Model for Admixture Histories

We follow the discrete-time model and notation of Verdu and Rosenberg (2011) and Goldberg et al. (2014), in which two source populations, S1 and S2, contribute to an admixed population, H (Figure 1). For each population, female and male contributions are considered separately. At generation g, the contribution of sex δ, with δ{f,m}, from source Sα, with α{1,2}, is sα,gδ. Corresponding contributions from H are denoted hgδ. Thus, for example, sα,gδ denotes the fraction in generation g of individuals of sex δ originating in the previous generation in population Sα. The female and male contributions from S1, H, and S2 at generation g are s1,gf, s1,gm, hgf, hgm, s2,gf, and s2,gm.

Figure 1.

Figure 1

Sex-specific mechanistic model of admixture. At each generation g, females and males from each of two source populations, S1 and S2, contribute to the admixed population, H. Contributions can vary in time. The fraction of admixture from source population α{1,2} for chromosomal type γ{A,X} in an admixed individual of sex δ{f,m} is Hα,g,δγ. This model, by considering different chromosomal types, generalizes the model of Goldberg et al. (2014).

We recall key relations among the sex-specific parameters (Goldberg et al. 2014, equations 16). First, the total contribution from each population is the mean of female and male contributions: sα,g=(sα,gf+sα,gm)/2 and hg=(hgf+hgm)/2. Also, as each parameter is a probability, the total female and male contributions separately sum to one: s1,gf+hgf+s2,gf=1 and s1,gm+hgm+s2,gm=1. For g=0, the admixed population does not yet exist, so that h0=h0f=h0m=0 and s1,0f+s2,0f=s1,0m+s2,0m=1.

Under this general framework, we define sex bias as a difference in the contributions from one or more source populations to the admixed population. That is, an admixture process is considered sex-biased if s1,gfs1,gm or s2,gfs2,gm or both. Thus, sex bias involves differences between females and males entering from a specified source population, rather than a comparison of the sex-specific parameters between the two source populations; after the founding, it is possible that sex bias can occur in one source but not in the other.

Considering the three populations (S1,H,S2) from which the parents of an individual from the admixed population can originate, we have nine possible ordered parental pairings (Table 1). As in Goldberg et al. (2014), we study the random variable representing the admixture fraction of a random individual in the admixed population. Whereas Goldberg et al. (2014) examined admixture only for the autosomes, here we also study the X chromosome. We define H1,g,δγ as the admixture fraction of chromosomal type γ sampled in individuals of sex δ from the admixed population H at generation g. Thus, H1,g,δγ is the probability that a random site on an autosome (γ=A) or the X chromosome (γ=X) in a random individual of sex δ in population H in generation g ultimately traces to S1. H1,g,δγ tracks the admixture levels present after the entry of the admixture contributions involved in producing the admixed population at generation g.

Table 1. Recursion for the X-chromosomal and autosomal admixture fractions of a randomly chosen female or male from an admixed population at generation g, given a set of parents.

Case Female parent’s population Male parent’s population Probability Admixture in males Admixture in females
H1,g,mA H1,g,mX H1,g,fA H1,g,fX
1 S1 S1 s1,g1fs1,g1m 1 1 1 1
2 S1 H s1,g1fhg1m 1+H1,g1,mA2 1 1+H1,g1,mA2 1+H1,g1,mX2
3 S1 S2 s1,g1fs2,g1m 12 1 12 12
4 H S1 hg1fs1,g1m 1+H1,g1,fA2 H1,g1,fX 1+H1,g1,fA2 1+H1,g1,fX2
5 H H hg1fhg1m H1,g1,fA+H1,g1,mA2 H1,g1,fX H1,g1,fA+H1,g1,mA2 H1,g1,fX+H1,g1,mX2
6 H S2 hg1fs2,g1m H1,g1,fA2 H1,g1,fX H1,g1,fA2 H1,g1,fX2
7 S2 S1 s2,g1fs1,g1m 12 0 12 12
8 S2 H s2,g1fhg1m H1,g1,mA2 0 H1,g1,mA2 H1,g1,mX2
9 S2 S2 s2,g1fs2,g1m 0 0 0 0

With two source populations and the contributions from the admixed population to itself, a random individual has nine possible parental pairings, for each of which the probability is listed.

In contrast to autosomal DNA, for which both female and male offspring receive a single copy of each chromosome from each parent, the X chromosome is inherited sex-specifically. That is, female offspring inherit one copy of the X chromosome from the mother and one from the father, but males inherit only a maternal copy. Therefore, whereas the autosomal admixture fractions sampled in females and males from the admixed population, H1,g,fA and H1,g,mA, are identically distributed, the female and male X-chromosomal admixture fractions, H1,g,fX and H1,g,mX, have different distributions. As a result, whereas Goldberg et al. (2014) could uncouple H1,g,fA and H1,g,mA and consider recursions for these two quantities separately (both were identical and only one needed to be studied), here we examine the distributions of H1,g,fX and H1,g,mX as a coupled pair of recursions.

Table 1 reports the probabilities that a random individual of a given sex from the admixed population has each possible set of parents, ℓ, as well as the admixture fraction of the individual for source population 1 conditional on the parental pairing. It provides a basis for recursively computing the fraction of admixture from S1 for a random individual of sex δ from the admixed population at generation g, H1,g,δX. For g=1, the founding event of the admixed population, the admixed population does not yet exist, so only cases 1, 3, 7, and 9, the four cases in which both parents are from the source populations, are considered.

Conditional on the previous generation, H1,g,fX and H1,g,mX are independent random variables. The female X-chromosomal admixture fraction depends on both the female and male admixture fractions in the previous generation, H1,g1,fX and H1,g1,mX, but the male X-chromosomal admixture fraction depends only on H1,g1,fX. Therefore, the female X-chromosomal admixture fraction can equivalently be written as a function of the female X-chromosomal admixture fractions in the previous generation, H1,g1,fX, and from two generations ago, H1,g2,fX.

Expectation of the X-Chromosomal Fraction of Admixture

Following Verdu and Rosenberg (2011) and Goldberg et al. (2014), we can find the expectation of the X-chromosomal admixture fraction of an individual of a specified sex randomly chosen from the admixed population. Using the law of total expectation to consider the random parental pairing L, we sum over the nine possible pairings ℓ (Table 1). For the mean X-chromosomal admixture fraction of a randomly chosen admixed individual of sex δ sampled at generation g, we have

E[H1,g,δX]=EL[E[H1,g,δX|L]]=ϵ{S1S1S1HS1S2HS1HHHS2S2S1S2HS2S2}P(L=)E[H1,g,δX|L=]. (2)

Applying Equation 2, using Table 1 to consider X chromosomes sampled in females and males from the admixed population, for the first generation, the initial condition is

[E[H1,1,fX]E[H1,1,mX]]=[s1,0s1,0f]. (3)

For g2, we have

[E[H1,g,fX]E[H1,g,mX]]=[s1,g1s1,g1f]+[hg1f2hg1m2hg1f0][E[H1,g1,fX]E[H1,g1,mX]]. (4)

For the autosomes, for each sex δ, the expectation of the admixture fraction depends only on the total (non-sex-specific) contributions from the source populations and the corresponding expectation in the previous generation (Goldberg et al. 2014, equation 19),

E[H1,g,δA]={s1,0g=1s1,g1+hg1E[H1,g1,δA]g2. (5)

By contrast, the expectations of the female and male X-chromosomal admixture fractions depend also on the sex-specific contributions in the previous generation (Equation 4).

Because the inheritance pattern is identical for X chromosomes and autosomes in females, the mean X-chromosomal admixture in females in Equation 4 matches that of the autosomes before equality of the female and male autosomal means is applied (Goldberg et al. 2014, equation 18). Whereas female and male autosomal admixture random variables are identically distributed and their expectations can be written with a generic δ, the corresponding random variables differ for X chromosomes, and the coupled recursion in Equations 3 and 4 cannot be quickly reduced to one equation. The mean X-chromosomal admixture in males (Equation 4), however, has a similar form to the autosomal mean (Equation 5).

We use the general mechanistic model presented here to derive closed-form expressions for the expected X-chromosomal admixture under two specific models of admixture, a single-admixture event and constant admixture over time.

Analyzing the Model in Special Cases

To analyze the properties of X-chromosomal admixture in relation to the female and male contributions of two source populations, we consider two specific cases of our general admixture model.

A single admixture event

First, we study a special case in which no further contributions from the source population occur after the admixed population is founded in generation g=1. In this scenario of a single admixture event (Long 1991), s1,gf=s1,gm=s2,gf=s2,gm=0, and hgf=hgm=hg=1, for all g1.

Applying Equations 3 and 4, we have for g=1,

[E[H1,1,fX]E[H1,1,mX]]=[s1,0s1,0f]. (6)

For g2,

[E[H1,g,fX]E[H1,g,mX]]=[121210][E[H1,g1,fX]E[H1,g1,mX]]. (7)

For g2, the expected admixture fraction for a male X chromosome is simply the expected admixture fraction for a female X chromosome from the previous generation, or E[H1,g,mX]=E[H1,g1,fX].. We can use this identity between the expected female admixture in generation g1 and male admixture in generation g to simplify the system of equations. For g3, we then have

E[H1,g,fX]=E[H1,g1,fX]+E[H1,g2,fX]2. (8)

In each generation, E[H1,g,fX] decreases by a factor of 2, suggesting that a closed-form solution might include a factor of 2g. Therefore, denoting by yg the quantity 2g1E[H1,g,fX], using Equation 8, for g3, we have

yg2g1=yg1/2g2+yg2/2g32.

Multiplying both sides by 2g1, we have a recursion yg=yg1+2yg2, with y1=s1,0 and y2=s1,0+s1,0f. Then for g3, yg can be written Ags1,0+Bgs1,0f, where Ag and Bg satisfy

Ag=Ag1+2Ag2 (9)
Bg=Bg1+2Bg2, (10)

with A1=1, A2=1, B1=0, and B2=1. Noting that B3=1, we immediately observe that for g3, Bg=Ag1, so that yg=Ags1,0+Ag1s1,0f. Then

E[H1,g,fX]=Ags1,0+Ag1s1,0f2g1. (11)

The recursion for Ag in Equation 9 with the initial conditions A0=0 and A1=1 gives the recursion for the Jacobsthal numbers, for which the closed form is Ag=[2g(1)g]/3 (Sloane 2015, entry A001045). We then have for g1,

E[H1,g,fX]=[2+(12)g]s1,0f3+[1(12)g]s1,0m3. (12)

Using Equation 7, for g2,

E[H1,g,mX]=[2+(12)g1]s1,0f3+[1(12)g1]s1,0m3. (13)

For the case of a single admixture event, with no further gene flow from the source populations to the admixed population, we use Equations 12 and 13 to understand the behavior over time of the mean admixture on the X chromosome. Notably, unlike for the autosomes under a single-admixture scenario (Goldberg et al. 2014), the mean X-chromosomal admixture depends on the sex-specific contributions from the source populations, as well as on the time since admixture. With no sex bias (s1,0f=s1,0m=s1,0), the mean X-chromosomal admixture is the same as the autosomal admixture, which is constant in time, equaling simply s1,0, the total contribution from S1.

Equations 12 and 13 oscillate over time (Figure 2), as they incorporate a negative fraction raised to a power. The mean X-chromosomal admixture does not immediately reach a long-term limit, as the difference in initial admixture values between females and males in the admixed population, followed by a transmission process in which females and males draw in different proportions from females and males in the previous generation, attenuates the homogenization of the female and male gene pools.

Figure 2.

Figure 2

The mean of the X-chromosomal admixture fraction over time in females (red) and males (blue) from the admixed population, for a single admixture event (Equations 12 and 13). The mean X-chromosomal admixture (from source population 1) oscillates in approaching its limit (Equation 14). The limiting mean, shown in black, is the same for X chromosomes in both females and males. (A) s1,0f=0.25, s1,0m=0.5. (B) s1,0f=0.5, s1,0m=0.25.

The long-term limit of the X-chromosomal admixture fraction is

limgE[H1,g,fX]=limgE[H1,g,mX]=23s1,0f+13s1,0m. (14)

Because E[H1,g,mX]=E[H1,g1,fX] for all g2, the expected admixture fraction sampled in males approaches the same limit over time as that sampled in females. The limit of the mean X-chromosomal admixture fraction fits the 2:1 ratio by which the X chromosome is inherited, following Equation 1, with the variables in Equation 1 viewed as the initial admixture values s1,0f and s1,0m. Thus, the limiting admixture, but not the transient admixture, matches the simple linear combination.

For recent admixture, we can calculate the difference between the expected admixture under the model and the limiting value. For g1, we have

E[H1,g,fX]limgE[H1,g,fX]=s1,0fs1,0m3(12)g, (15)

and for g2,

E[H1,g,mX]limgE[H1,g,mX]=s1,0fs1,0m3(12)g1. (16)

The differences in Equations 15 and 16 provide a measure of the difference of a transient single-admixture model from the simpler linear combination in Equation 1, which is constant in time and does not consider differences between the mean admixture in females and males from the admixed population. The g limit in Equation 14 agrees with Equation 1, but for small g, the differences in Equations 15 and 16 can be as large as ±1/3, decreasing by a factor of 2 each generation. For a fixed g, the maximal absolute difference occurs when the sex bias is largest, that is, when s1,0fs1,0m is 1 or 1. At the other extreme, with no sex bias and s1,0f=s1,0m, the simple linear combination exactly describes the X-chromosomal admixture throughout the history of the admixed population.

For six values of g, Figure 3 plots Equation 15 as a function of the difference between female and male contributions from S1, s1,0fs1,0m. With no sex bias, s1,0f=s1,0m, Equation 15 is zero, and our model follows Equation 1. Additionally, as g increases, the difference between our time-dependent model and Equation 1 becomes smaller, as Equation 1 gives the limiting behavior of the single-admixture model.

Figure 3.

Figure 3

The difference between the expectation of the female X-chromosomal admixture fraction and its limit for g[1,6] as a function of the difference between female and male contributions from S1 for a single admixture event (Equation 15). As g, this quantity approaches zero. For small g, however, the limit (2/3)s1,0f+(1/3)s1,0m (Equation 14) is a poor approximation. The difference oscillates so that the slope of the line is negative when g is odd and positive when g is even.

Constant, nonzero admixture over time

Next, we consider the special case of constant, nonzero contributions from the source populations to the admixed population over time. As in Goldberg et al. (2014), we can rewrite the sex-specific parameters as constants, (s1,gf,s1,gm,s2,gf,s2,gm)=(s1f,s1m,s2f,s2m), for all g1. We maintain separate parameters for the founding contributions, s1,0f, s1,0m, s2,0f, and s2,0m. In this setting, hf and hm cannot both be 1, and at least one among s1f, s1m, s2f, and s2m must be nonzero.

Using a generating function approach, we derive a closed-form solution for the mean X-chromosomal admixture fractions, E[H1,g,fX] and E[H1,g,mX].. The mean depends on the number of generations of constant admixture, g; the initial conditions s1,0f and s1,0m; and the sex-specific contributions from the two source populations, s1f, s1m, s2f, and s2m. In the Appendix, we show that for each g1,

E[H1,g,fX]=P1r1g+1+P2r2g+1+P3, (17)

where we have abbreviated a rather complex expression by using the quantities P1, P2, P3, r1, and r2, functions that depend only on the admixture parameters and not on g. These quantities are defined in Equations A9–A13, with r1 and r2 <1 in absolute value. Using Equations A2 and 17, we can write an expression for the male X-chromosomal admixture fraction,

E[H1,g,mX]={s1,0fg=1s1f+hf(P1r1g+P2r2g+P3)g2. (18)

The limits of the mean X-chromosomal admixture fractions in Equations 17 and 18 are

limgE[H1,g,fX]=s1f+s1m+s1fhms1f+s1m+s2f+s2m+hm(s1f+s2f) (19)
limgE[H1,g,mX]=2s1f+hfs1ms1f+s1m+s2f+s2m+hm(s1f+s2f). (20)

These limits can also be obtained by an equilibrium approach. Denote Lf=limgE[H1,g,fX] and Lm=limgE[H1,g,mX].. At equilibrium, an X chromosome in a male can ultimately trace to S1 either through a mother from S1 in the most recent generation or because it originates in an admixed female. Combining these terms, Lm=s1f+hfLf. An X chromosome in a female traces to S1 either through the maternal X chromosome or through the paternal X chromosome: Lf=(1/2)(s1f+hfLf)+(1/2)(s1m+hmLm). The solution to this pair of equations for Lf and Lm accords with Equations 19 and 20.

Unlike in the case of a single admixture event, the limit over time of the mean fraction of admixture from S1 for a constant-admixture process is not a simple 2:1 weighting of the female and male contributions from S1. Notably, recalling s1f+hf+s2f=s1m+hm+s2m=1, the limiting admixture fraction from S1 depends on the contributions both from S1 and from S2. The mean X-chromosomal admixture depends on the sex-specific contributions and cannot be reduced in terms of s1 and s2 only, as was possible for the autosomes (Goldberg et al. 2014, equation 37).

For the autosomes, the limiting mean is the ratio of contributions from S1 to the total contributions from S1 and S2 combined (Goldberg et al. 2014, equation 37), E[H1,g,δA]=s1/(s1+s2). As was observed for the autosomes, the limiting mean X-chromosomal admixture in females can be viewed as the fractional contribution of X chromosomes from S1 in relation to the total number of X chromosomes from both source populations. Therefore, in Equation 19, the numerator has the same form as the denominator, incorporating only contributions from S1. X chromosomes from S1 present in a population of females from the admixed population come from one of three origins in the limit, identified by the three terms of the numerator. They can be directly contributed from S1 females or males, giving the terms s1f and s1m. Alternatively, they can be contributed from admixed males who in turn received them from S1 females, producing the term s1fhm; viewed in the limit, contributions from admixed females in the previous generation need not be considered, as an admixed female in the previous generation has the same equilibrium behavior as one in the current generation. Similar reasoning can be used to understand the limiting male admixture fraction, noting that the limiting mean for males is the sum of two quantities, the product of the limiting female mean and the fraction of females from the admixed population, hf, and the new influx of female contributions from S1, or s1f (Equation A2).

The limits over time of the means in Equations 19 and 20 depend only on the continuing contributions, not on the founding parameters. With no sex bias, s1f=s1m=s1 and hf=hm=h, and we can simplify the limiting mean female and male X-chromosomal admixture fractions:

limgE[H1,g,fX]=limgE[H1,g,mX]=2s1+hs12hh2=s11h=s1s1+s2. (21)

This limit is equivalent to the limiting mean of autosomal admixture from Verdu and Rosenberg (2011, equation 31). That is, with no sex bias, the limiting X-chromosomal mean matches the limiting autosomal mean, limgE[H1,g,δA]=limgE[H1,g,fX]=limgE[H1,g,mX].

Figure 4 plots the expectations of the X-chromosomal and autosomal admixture fractions over time for two scenarios with the same difference in female and male contributions, but different directions of sex bias. In Figure 4A, more males enter from S1 and more females from S2, leading to a lower mean admixture on the X chromosome than on the autosomes, with the mean in males smaller than the mean in females. Conversely, in Figure 4B, more females enter from S1 and more males from S2, leading to a larger mean for X chromosomes than for autosomes. The parameters are set so that for Figure 4, A and B, autosomal admixture is constant in time at 1/2 (Goldberg et al. 2014, equation 37). Both panels illustrate a modified signal of the oscillation seen under a single admixture event, dampened by the ongoing contributions from the source populations. In both cases, the limiting mean X-chromosomal admixture in females lies intermediate between the mean autosomal admixture and the mean X-chromosomal admixture in males.

Figure 4.

Figure 4

The expectation of the mean X-chromosomal and autosomal admixture fractions over time, with their associated limits, for constant ongoing admixture. (A) Male-biased admixture from population S1. (B) Female-biased admixture from population S1. The initial condition is (s1,0f,s1,0m,s2,0f,s2,0m)=(0.5,0.5,0.5,0.5). The autosomal admixture is constant over time because s1,0=s2,0=1/2 and s1=s2; it is the same in both A and B because it does not depend on the sex-specific contributions. The X-chromosomal admixture is different in females and males; it is smaller than the autosomal mean for male-biased admixture from population 1 and larger for female-biased admixture. The X-chromosomal mean is plotted using Equations 17–20. The autosomal mean uses equation 37 from Goldberg et al. (2014).

Figure 5 plots the mean X-chromosomal admixture fraction for females and males (Equations 17 and 18) with the mean autosomal admixture fraction when both source populations have a female bias in contributions, s1f>s1m and s2f>s2m. Although S1 has an excess of females, the mean X-chromosomal admixture is less than the autosomal admixture. Whereas an excess of females from a given source population might be viewed as generating elevated X-chromosomal admixture, when both populations have a sex bias in the same direction, the source population with the larger sex bias dominates the signal. It is tempting to interpret lower X-chromosomal than autosomal admixture as a signal of male-biased admixture from S1 and female-biased admixture from S2, but Figure 5 plots an example where admixture is female biased in both source populations, and the simple interpretation of opposite biases in the two source populations is incorrect. A similar example could also have been produced with male bias in both source populations.

Figure 5.

Figure 5

The expectation of the mean X-chromosomal and autosomal admixture fractions over time for constant admixture, with female-biased contributions from both source populations. The initial condition is (s1,0f,s1,0m,s2,0f,s2,0m)=(0.5,0.5,0.5,0.5). The mean X-chromosomal admixture from S1, E[H1,g,δX], is smaller than the mean autosomal admixture, E[H1,g,δA], even though S1 has an excess of females, s1f>s1m. The expectation of X-chromosomal admixture is plotted using Equations 17 and 18. The autosomal mean uses equation 37 from Goldberg et al. (2014).

We note that in the limit as the parameters s1f, s1m, s2f, and s2m simultaneously approach zero, the constant-admixture model approaches the single-event model. Thus, we expect the limiting mean female and male X-chromosomal admixture fractions in the constant-admixture model to approach the corresponding limits in the single-event model. Indeed, by taking the limits of Equations 17 and 18 as s1f, s1m, s2f, and s2m approach 0, we obtain Equations 12 and 13, respectively. Interestingly, as we expect the trajectories of the X-chromosomal admixture fractions under constant admixture to approach corresponding single-event trajectories, we also expect the oscillatory pattern of the single-event model to occur in the constant-admixture model for sufficiently small s1f, s1m, s2f, and s2m. Indeed, we can find instances of oscillation around the limit in the constant-admixture model as well as in the single-admixture model, for example, for s1f=s1m=s2f=s2m=0.05 and initial condition (s1,0f,s1,0m,s2,0f,s2,0m)=(0.25,0.5,0.75,0.5) (Figure 6). As the continuing contributions in this case have no sex bias, the mean female and male X-chromosomal admixture and the mean autosomal admixture tend to the same limit.

Figure 6.

Figure 6

The expectation of the mean X-chromosomal and autosomal admixture fractions over time for constant admixture, in a case with small continuing contributions. The initial condition is (s1,0f,s1,0m,s2,0f,s2,0m)=(0.25,0.5,0.75,0.5). With sex bias in the founding generation and small continuing contributions, the mean X-chromosomal admixture has a pattern resembling the oscillating behavior seen for a single admixture event. The expectation of X-chromosomal admixture is plotted using Equations 17 and 18. The autosomal mean uses equation 37 from Goldberg et al. (2014).

Sex-Biased Admixture in African-Americans

Studies of the genetic admixture history of African-American populations have consistently reported evidence for male-biased gene flow from Europe (Parra et al. 1998, 2001; Lind et al. 2007; Stefflova et al. 2009; Bryc et al. 2010a, 2015). Lind et al. (2007) and Bryc et al. (2015) estimated female and male contributions from Africa and Europe, using mean admixture estimates for the X chromosome and autosomes. Both analyses followed the linear combination method of Equation 1, in which the mean autosomal admixture averages the female and male contributions, while the mean X-chromosomal admixture weights the female and male contributions by 2/3 and 1/3, respectively. In our notation, the framework can be written

E[H1,g,δA]=12s1f+12s1m (22)
E[H1,g,δX]=23s1f+13s1m, (23)

with E[H1,g,δA] and E[H1,g,δX] estimated from data.

We have demonstrated, however, that in autosomal admixture models, Equation 22 suggests a single admixture event (Goldberg et al. 2014), and in X-chromosomal admixture models, Equation 23 suggests the limit of a single admixture process (replacing parameters for the continuing contributions in these equations by corresponding parameters for admixture in the initial generation, s1,0f and s1,0m, as in Equation 14). For recent admixture in the single-admixture model, or in a model with continuing admixture after the founding of the admixed population, the mean admixture fractions from S1 for the X chromosome depend on the sex-specific contributions in a more complex way, incorporating the sex-specific contributions from S2 (Equations 12, 13, and 17–20). Use of Equations 22 and 23 implies consideration of a single-admixture model in its temporal limit or otherwise is unsuited to the single-admixture and continuing-admixture scenarios.

We use our refined predictions about E[H1,g,δX] to estimate the sex-specific contributions for an African-American population under two different models, a single-admixture event with various times since admixture and a constant-admixture process. Although both models underestimate the spatial and temporal complexity of the true admixture history of African-Americans, these approximations enable us to illustrate the way in which estimates of the sex bias depend on assumptions about the admixture model.

Genetic samples

Many admixture studies have reported X-chromosomal and autosomal admixture estimates for African-Americans (Table 2). For illustration, we focus on data from Cheng et al. (2009), who provided one of the largest samples, reporting estimates of quantities close to those that appear in our model. Cheng et al. (2009) estimated admixture fractions of the X chromosome and autosomes for 15,280 African-Americans from 14 studies.

Table 2. Studies that published estimates of the X-chromosomal and autosomal admixture fractions of an African-American population.

Mean ancestry estimates (%)
Reference African autosome (E[H1,gA]) European autosome (E[H2,gA]) African X chromosome (E[H1,gX]) European X chromosome (E[H2,gX]) % female
Cheng et al. (2009) 80.7 19.3 84.5 15.5 55.7
Bryc et al. (2015) 73.2 24.0 76.9 19.8 52.3
Bryc et al. (2010a) 77 23 81 19 66.3
Lind et al. (2007) 80 20 87.9 12.1 0
Tian et al. (2006) 81.8 18.2 87.1 12.9 Not reported

As the X chromosomes of females and males were considered jointly in the studies, we list the fraction of the sample that was female. In each study, the X chromosome shows elevated African ancestry compared to the autosomes. Values were compiled from quantities reported in the studies or by taking 1 minus an African estimate (“population 1”) to obtain a European estimate (“population 2”) or vice versa, with the following exceptions: For Bryc et al. (2015), the “autosome” estimates are genome-wide, including the X chromosome; the percentage of females was kindly provided by K. Bryc (personal communication). For Bryc et al. (2010a), we obtained an African X-chromosomal estimate by visual inspection of their figure 4g, and we counted the number of females from their supplementary material. For Tian et al. (2006), we computed the values from information reported in their figure 5 legend.

The mean admixture on the X chromosome reported by Cheng et al. (2009) combines females and males into a single estimate, whereas we consider separate quantities for X-chromosomal admixture in females and males. In our notation, the mean X-chromosomal admixture across all individuals, E[H1,gX], is a weighted average of admixture in females and males that takes into account the fraction of the sample that is female. Thus, for a sample divided into females and males in proportions pf and pm=1pf, respectively, we can write

E[H1,gX]=pfE[H1,g,fX]+pmE[H1,g,mX]. (24)

To compare admixture estimates from data to the mean admixture across individuals produced under mechanistic models, we compute the Euclidean distance between predictions for E[H1,gA], E[H1,g,fX], and E[H1,g,mX] and observed values in the data, weighting the predicted female and male X-chromosomal admixture by Equation 24. Denoting by qA and qX the observed mean admixture in a specific source group for autosomes and the X chromosome, we evaluate

D=[qAE[H1,gA]]2+[qX(pfE[H1,g,fX]+pmE[H1,g,mX])]2. (25)

This quantity D measures the fit of a prediction under a model to the actual estimates of X-chromosomal and autosomal admixture for a data set that reports three quantities: estimates of admixture for a specific source population for autosomes and for the X chromosome and the fraction of the admixed sample consisting of females.

The exact timing of the onset of significant admixture in the African-American admixture process is unknown, but assuming a generation time of 20–27 years, the trans-Atlantic slave trade to North America can be regarded as having begun 14–20 generations ago (Eltis and Richardson 2010). As the mean X-chromosomal admixture fraction is generally near its limit for g in this range, both in the single-admixture model and in the constant-admixture model, the specific g[14,20] only minimally changes the results. For our constant-admixture analysis, we chose a single value g=15 for consistency.

Single-admixture event

Using the mean X-chromosomal and autosomal admixture reported by Cheng et al. (2009) (Table 2), we calculated the sex-specific contributions s1,0f and s1,0m for specified values of g[2,5] and in the g limit. As Cheng et al. (2009) reported admixture in a combined sample of females and males, we used Equation 24 to calculate the X-chromosomal admixture fraction E[H1,gX] as a function of pf, pm, E[H1,g,fX], and E[H1,g,mX].. Writing E[H1,g,fX] and E[H1,g,mX] as functions of g, s1,0f, and s1,0m (Equations 12 and 13), we then generated an equation for E[H1,gX] in terms of pf, pm, g, s1,0f, and s1,0m, into which we inserted the reported values qX=0.845 for E[H1,gX] and (pf,pm)=(0.557,0.443). We obtained a second equation for the autosomal admixture fraction from Goldberg et al. (2014, equation 34), E[H1,gA]=s1,0=(s1,0f+s1,0m)/2, using qA=0.807 for the empirical observation of E[H1,gA].. We then solved the pair of equations for s1,0f and s1,0m, with g fixed.

For small g (2, 3, 4, 5) as well as in the g limit, s1,0f, representing the contribution of females from the African source population, exceeds 0.9, and s2,0f, representing the female European contribution, is <0.1 (Table 3). For males, the African contribution s1,0m ∼ 0.7, whereas the European contribution s2,0m ∼ 0.3. Because of the oscillation of the mean X-chromosomal admixture (Figure 2), the estimated ratio of male to female contributions oscillates around the limiting values. For contributions from Europe, in the g limit we estimate s2,0m/s2,0f3.89. That is, in the temporal limit of a model of a single-admixture event, for each female from Europe, ∼3.89 males contributed to the gene pool of African-Americans. The direction of this ratio is reversed for African contributions, with ∼1.33 females for every African male. If we assume that the admixture was recent, for example g5, then these values do vary substantially, with larger deviations from the limiting value of 3.89 occurring under more recent admixture.

Table 3. Estimates of the sex-specific contributions from Africa and Europe to African-Americans, based on the Cheng et al. (2009) data, inferred under our model for a single-admixture event.

African contributions European contributions
Generations since admixture s1,0f s1,0m s1,0f/s1,0m s2,0f s2,0m s2,0m/s2,0f
2 0.943 0.671 1.40 0.057 0.329 5.77
3 0.912 0.702 1.30 0.088 0.298 3.39
4 0.926 0.688 1.35 0.074 0.312 4.22
5 0.919 0.695 1.32 0.081 0.305 3.77
0.921 0.693 1.33 0.079 0.307 3.89

The estimates depend on the time since admixture. All estimates show male-biased contributions from Europe and female-based contributions from Africa.

Constant admixture over time

Assuming g=15, we computed the mean female and male X-chromosomal admixture (Equations 17 and 18) and mean autosomal admixture given the four sex-specific contributions (Equation 5), on a grid of possible parameter values, s1f,s1m,s2f,s2m[0,1], using 0.01 increments. We fixed the initial values s1,0f=s1,0m=0.5, although (except at small values of the continuing contributions) by g=15 the mean nears its limit, erasing the signal of the initial conditions (Equations 19 and 20). We calculated the distance D between the Cheng et al. (2009) data and the computed values (Equation 25).

The parameters s1f, s1m, s2f, and s2m are not uniquely identifiable from the data, as we continue to have two equations, one describing autosomal admixture and one for X-chromosomal admixture, but now we consider four unknowns. We therefore examined sets of parameter values that produced D0.01. Distance cutoffs of 0.001 and 0.1 gave rise to similar ranges for each parameter. Thus, our analysis is based on the following algorithm:

  1. Choose fixed values of g, s1,0f, and s1,0m. We fixed g=15 and s1,0f=s1,0m=0.5.

  2. Choose s1f,s1m,s2f,s2m [0,1] on a four-dimensional grid, using 0.01 increments.

  3. Plug the parameter values from steps 1 and 2 into Equations 5, 17, and 18 to calculate E[H1,g,δA], E[H1,g,fX], and E[H1,g,mX], respectively.

  4. Plug the values for E[H1,g,δA], E[H1,g,fX], and E[H1,g,mX] from step 3 into Equation 25.

  5. Choose a cutoff on D in Equation 25 that indicates a good fit of the parameters to the data. We use D0.01.

Figure 7 plots the set of parameter values that generate D0.01. Because the female and male contributions separately sum to one, the possible range of each set of contributions is represented by a unit simplex. Figure 7A plots the marginal distributions of the set of parameter values with D0.01. Contributions from S1 (Africans) vary over most of the permissible parameter space, but the S2 contributions (Europeans) take their values from a narrower range. Figure 7B plots D on the space of female contributions, (s1f,hf,s2f), for values of (s1m,hm,s2m) fixed at the median of the set of parameter vectors that produce D0.01, s1m=0.47, hm=1s1ms2m=0.34, and s2m=0.19. Figure 7C plots D as a function of the male contributions, (s1m,hm,s2m), with female contributions fixed at the median from the distribution of parameter values for which D0.01, (s1f,hf,s2f)=(0.64,0.29,0.07). In Figure 7, B and C, the parameter sets closest to the data in terms of D permit a large range of possible contributions from S1, Africans, but only a smaller range of possible contributions, both female and male, from Europeans, S2.

Figure 7.

Figure 7

Sex-specific contributions estimated from the data of Cheng et al. (2009). (A) The range, median, and 25th and 75th percentiles of the sets of sex-specific contributions for which the Euclidean distance D (Equation 25) between model-predicted admixture and observed admixture from Cheng et al. (2009) was at most 0.01. The range of values for s2f and s2m, the contributions representing Europeans (S2), is smaller than that representing Africans (S1). (B) Female contributions as a function of D. (C) Male contributions as a function of D. For B, the male contributions are fixed at their median values producing D0.01, (s1m,hm,s2m)=(0.47,0.34,0.19). For C, the female contributions are fixed in a corresponding manner, at (s1f,hf,s2f)=(0.64,0.29,0.07). Each side of the triangles in B and C represents one of three parameters that sum to 1, (s1m,hm,s2m) and (s1f,hf,s2f) in B and C, respectively. Parameter values were tested at increments of 0.01 for each quantity.

Figure 8A plots the natural logarithm of the ratio of male to female contributions for parameter sets with D0.01. With no sex bias, this quantity is zero; with more males than females, it is positive; and with more females than males, it is negative. Most points plotted from S1, Africans, are female biased, whereas contributions from S2, Europeans, are predominantly male biased. The median ratio of males to females from Europe is 2.67 males per female, compared to a median 1.32 females per male from Africa. Note, however, that these patterns do not hold for all parameter sets. For 26.15% of values with D0.01, more male than female contributions occur from Africa. For 9.25% of values with D0.01, more female than male contributions occur from Europe.

Figure 8.

Figure 8

The natural logarithm of the ratio of male to female contributions in African-Americans, as inferred from the data of Cheng et al. (2009). (A) The range (excluding infinity, produced when a parameter value is zero), median, and 25th and 75th percentiles of the natural logarithm of the ratio of male to female contributions from S1 (Africans) and S2 (Europeans) separately for the sex-specific contributions that produced D0.01 (Equation 25). Values from Africans, S1, are largely negative, or female biased, whereas contributions from Europeans, S2, are mostly positive and male biased. Approximately 26.15% of the contributions from Africans are male biased and 9.25% from Europeans are female biased. This pattern is typically observed when a still larger sex bias occurs in the other population. (B) The logarithm of the ratio of male and female contributions from S2 on the y-axis and the corresponding ratio for S1 on the x-axis, plotted by the density of points in 0.05 square bins. For the cases with male bias in Africa (ln(s1m/s1f)>0), the level of male bias in Europe is also positive; for the cases with female bias in Europe (ln(s2m/s2f)<0), the level of female bias from Africa is also negative. Parameter sets in which at least one parameter is 0, and therefore we have values of +∞ or −∞ for ln(s1m/s1f) or ln(s2m/s2f), appear in bins on the edge of the plot for convenience. These bins contain a substantial number of parameter sets.

Although the pattern is initially surprising, scenarios of female bias from Europe or male bias from Africa accord with the phenomenon depicted in Figure 5. In the former scenario, both populations have an excess of females, and the strong female bias in the contributions from Africa can overwhelm the number of European X chromosomes in the gene pool of African-Americans, producing greater African admixture on the X chromosome than on autosomes. This effect is demonstrated in Figure 8B, which plots the logarithm of the ratio of male to female contributions from S2 against the corresponding quantity from S1. The ratios of male to female contributions in S2 on the y-axis and S1 on the x-axis are correlated. The values that are negative on the y-axis, indicating a female bias from S2, also show the strongest female bias from S1, that is, the most negative ln(s1m/s1f) on the x-axis [an additional 2.34% of parameter sets show no sex bias in the European population; the female and male contributions are equal and ln(s1m/s1f)=0]. Analogously, the strongest male bias from Africa is also observed when the excess of males from Europe is largest (Figure 8B).

Discussion

Under a two-sex mechanistic admixture model with sex-biased admixture, we have demonstrated that the relationship between X-chromosomal and autosomal admixture fractions depends both on the time since admixture and on the model of admixture and does not simply follow a prediction from the fractions of X chromosomes and autosomes present in females. Using the mechanistic framework, we have reinterpreted African-American admixture values computed from a nonmechanistic perspective, estimating sex-specific parameters and levels of sex bias in African and European source populations. This analysis uncovers a counterintuitive case in which female-biased or male-biased contributions in the same direction occur both from Africans and from Europeans in a manner consistent with estimated mean X-chromosomal and autosomal admixture fractions in African-Americans, rather than an African female bias and a European male bias.

Estimating sex bias using X-chromosomal and autosomal admixture

Differences between X-chromosomal and autosomal admixture estimates are sometimes used to demonstrate the occurrence of sex bias, even without estimating sex-specific contributions (Wang et al. 2008; Bryc et al. 2010a,b; Cox et al. 2010; Beleza et al. 2012; Moreno-Estrada et al. 2013). Higher estimated admixture from a population S1 for X chromosomes than for autosomes is taken as evidence of a female bias from S1 and a male bias from a second population S2. We have found, however, that the pattern is in principle compatible with two additional possibilities (Figure 8): a female bias from both populations, with a larger female bias from S1, or a male bias from both populations, with a larger male bias from S2.

We considered a general mechanistic model of admixture, focusing specifically on models of a single admixture event and constant admixture. For a single admixture event, mean autosomal admixture over time is constant, depending only on the total contributions from the source populations and not on the sex-specific contributions or time (Goldberg et al. 2014). Mean X-chromosomal admixture, on the other hand, is variable over time—in such a way that given the number of generations g and observed levels of X-chromosomal and autosomal admixture, the initial sex-specific contributions from the two source populations are identifiable. Because of the oscillation of the mean X-chromosomal admixture over time, depending on the time since admixture, the sex bias in admixture contributions can be either overestimated or underestimated using the 2/3 and 1/3 linear combination (Equation 1).

For constant admixture, estimated mean X-chromosomal and autosomal admixture values do not uniquely identify the female and male contributions over time. Therefore, instead of point estimates for the sex-specific contributions and their ratio, we have reported a measure of the compatibility of parameter values and data over a range of values of the parameters (Figure 7 and Figure 8). It is possible, however, that by examining higher moments of the distribution of admixture estimates across individuals (Goldberg et al. 2014) or by simultaneously also considering the Y chromosome and mitochondrial DNA, the sex-specific contributions might become identifiable.

Theoretical population genetics of the X chromosome

The complex signature of admixture we have observed for the X chromosome is reminiscent of other X-chromosomal phenomena in theoretical population genetics, including results related to effective population size, allele-frequency dynamics, and numbers of ancestors. For effective population size, Ne, when female and male population sizes are equal, the ratio of X-chromosomal to autosomal values is 3/4 (Wright 1933; Caballero 1995; Hartl and Clark 2007). In the same way that various forms of sex difference between females and males—in such parameters as the number of individuals, the variance of reproductive success, and migration rates—cause the basic X-to-autosomal Ne ratio to differ from 3/4 (Caballero 1995; Charlesworth 2001; Ramachandran et al. 2004, 2008; Hammer et al. 2008; Keinan et al. 2009; Emery et al. 2010; Gottipati et al. 2011; Heyer et al. 2012; Arbiza et al. 2014), transient dynamics and ongoing admixture cause the X-chromosomal admixture fraction to differ from a linear combination of female and male contributions with coefficients 2/3 and 1/3.

The X-chromosomal admixture fraction in our single-admixture model also has similarities to the allele-frequency trajectory in the approach to Hardy–Weinberg equilibrium in a one-locus X-chromosomal model. In that model (Jennings 1916a; Wright 1969; Crow and Kimura 1970), the frequency of an allele on female X chromosomes depends on the frequency in both females and males in the previous generation, whereas the frequency on male X chromosomes depends only on the frequency in females. If an allele frequency differs between females and males at the first generation, then during the approach to equilibrium, the frequency in males matches the corresponding frequency in females of the previous generation; both the female and male frequencies oscillate around the same limit. The equilibrium frequency in turn has a 2/3 contribution from the initial female frequency and 1/3 from the male frequency.

All these phenomena from the one-locus model—female values dependent on both female and male values from the previous generation, male values dependent only on the female value, males lagging one generation behind females, oscillations around the same limit, and a limiting linear combination of 2/3 and 1/3—appear in our X-chromosomal admixture model with a single admixture event. An additional parallel lies in the difference between the X chromosome and the autosomes: In the one-locus model for the X chromosome, Hardy–Weinberg equilibrium is achieved in the limit, and for the autosomes, it is achieved in one generation. For admixture on the autosomes, the mean admixture is constant after the founding of the admixed population, and for the X chromosome, the mean X-chromosomal admixture approaches a limit rather than remaining constant in time.

Yet another connection to theoretical population genetics of the X chromosome comes from the form of the recursive sequence in Equation 11. The Jacobsthal sequence in the single-admixture model is obtained from a generalization of the Fibonacci sequence (Koshy 2001). Both the Jacobsthal and Fibonacci sequences appeared in breeding system models as early as a century ago (Jennings 1914, 1916a,b; Wright 1921), with the Jacobsthal sequence providing the numerators of allele frequencies at a sex-linked locus in a model that amounts mathematically to a special case of our single-event admixture model (Jennings 1916a,b); the Fibonacci sequence is well known as the number of genealogical ancestors of a haplodiploid system g generations ago (Basin 1963). Specifically, for the pair of X chromosomes in a female, the number of ancestors is entry g+2 in the Fibonacci sequence Fn or, equivalently, the sum of the numbers of maternal and paternal ancestors, Fg+1 and Fg, respectively. The number of genealogical ancestors for a male X chromosome in generation g is Fg+1. Our recursion An=An1+2An2 (Equation 9) generates a different sequence, but its form is similar to the Fibonacci recursion Fn=Fn1+Fn2.

Sex-biased admixture in African-Americans

Our analysis of sex-biased admixture in African-Americans has similarities but a number of noteworthy differences from earlier nonmechanistic analyses. Under our single-admixture model, the estimate of ∼4 European males for every European female accords with estimated ratios of 3–4 from previous studies (Lind et al. 2007; Bryc et al. 2015). For even g, more African females and European males contribute than in the g limit; odd g values produce the opposite pattern.

For the constant-admixture model, the median estimated ratios of male to female contributions from Europe and Africa are ∼2.67 males per female from Europe and ∼1.32 females per male from Africa (Figure 7 and Figure 8). The estimated male bias in contributions from Europe is lower for a constant-admixture history than for a single-admixture event.

For both a single-admixture event and constant admixture over time, the point estimates of the ratio of males to females have a larger male contribution from Europe and female contribution from Africa. In the single-admixture model, the excess African ancestry on the X chromosome compared to the autosomes implies a female-biased contribution from Africa and a male-biased contribution from Europe. For constant admixture, surprisingly, a sex bias in one source alone can produce the observed pattern without a sex bias in the other (Figure 8). In fact, ∼27% of the parameter sets most similar to the data (D0.01) show no sex bias or have larger male than female contributions from Africa and ∼12% show no sex bias or have larger female than male contributions from Europe. Note that in obtaining this result, we have given equal weight to all values of D below a cutoff, rather than giving more weight to parameter choices producing lower D; however, owing to the existence of ridges in the parameter space that have similarly small values of D, for different choices of the cutoff, similar results are produced.

An excess of African mitochondrial and European Y-chromosomal haplotypes has been described in African-Americans (Parra et al. 1998, 2001; McLean et al. 2005; Lind et al. 2007; Stefflova et al. 2009). Similar phenomena to those we observed for X chromosomes and autosomes could affect the Y–mitochondrial DNA (Y–mtDNA) comparisons: in other words, a strong excess of males from Europe compared to European females, even if more males than females contribute from African populations, could give rise to a larger fraction of European Y chromosomes in the African-American gene pool without a female bias for the African source population. Such a process could also explain the excess African ancestry of mtDNA.

Conclusions

We have presented a model to describe the effect of admixture on the X chromosome, deriving a theoretical framework that considers the impact of sex bias during the admixture process. The model can be used to understand the sex-specific contributions from source populations to an admixed population. We have found that because of model dependence, time dependence, and a lack of identifiability of admixture parameters from mean admixture alone, a variety of admixture processes and parameter values might be compatible with estimates of the mean admixture on X chromosomes and the autosomes. We have considered only ratios of female and male contributions rather than absolute numbers of female and male contributors, ignoring genetic drift. Both drift and other factors we have not studied—including mutation, recombination, and selection—can differentially affect the X chromosome and autosomes (Charlesworth 2001; Hedrick 2007; Pool and Nielsen 2007; Ellegren 2009; Labuda et al. 2010; Heyer et al. 2012; Ségurel et al. 2014; Veeramah et al. 2014), potentially further complicating the estimation of admixture parameters. Examination in the context of sex-biased models of more detailed summaries of admixture patterns, including higher moments of the admixture distributions (Verdu and Rosenberg 2011; Gravel 2012; Goldberg et al. 2014) and distributions of the lengths of admixture tracts (Pool and Nielsen 2009; Gravel 2012; Liang and Nielsen 2014), will assist in refining the estimation of sex-biased admixture histories.

We demonstrated that female and male X-chromosomal admixtures have different expectations under mechanistic admixture models. With the exception of studies restricted to males (Lind et al. 2007), however, past studies of sex-biased admixture have generally reported composite X-chromosomal admixture estimates only in pooled collections of females and males. We recommend that such studies report estimates separately in females and males. Even if the values are similar, the difference between X-chromosomal admixture estimates in females and males contains information about parameters of the admixture process. Autosomal admixture estimates obtained only in females and only in males are identically distributed (Goldberg et al. 2014), so that a pooled estimate is more sensible than in the X-chromosomal case; nevertheless, autosomal admixture estimates reported separately in females and males can enable an informative comparison with a corresponding pair of X-chromosomal values.

Curiously, we have found that African-American admixture patterns do not necessarily imply a simultaneous African female bias and European male bias in the genetic ancestry of the population and that in both source populations, it is in principle possible on the basis of genetic admixture patterns that both source populations had female bias or that both had male bias, albeit at different magnitudes. The latter interpretation, of male biases both in Europeans and in Africans is plausible in light of historical scholarship on demographic contributions to African-Americans (Fredrickson 1982; Painter 2006; Davis 2010; Eltis and Richardson 2010), documenting both the well-known contributions of male European slave owners and asymmetric mating practices involving African females and European males, as well as an overrepresentation of males among African slaves arriving in North America. We caution, however, that our focus has been on single-event and constant-admixture models, both of which approximate African-American admixture crudely, as neither one captures the time-varying nature of the contributions from the source groups. Exploration of additional scenarios, including models intermediate between these extreme cases, will be helpful for empirical problems. Nevertheless, both for African-Americans and for less well-documented cases, our model highlights that if a difference in X-chromosomal and autosomal admixture is observed, it is important to consider the possibility that rather than opposite sex biases in the two populations, both populations might have the same type of sex bias.

Acknowledgments

We thank Carlos Bustamante, Filippo Disanto, Michael D. Edge, Marc Feldman, Ethan Jewett, Hua Tang, and Paul Verdu for discussions and two reviewers for helpful comments on the manuscript. We acknowledge support from National Institutes of Health grant R01 HG005855 and a National Science Foundation Graduate Research Fellowship.

Appendix: Solving for E[H1,g,δX] Under Constant Admixture

Here, we obtain the closed form for the mean X-chromosomal admixture fraction in a random female and a random male from the admixed population in the special case of constant admixture over time. Using the constant sex-specific parameters, s1f, s1m, s2f, and s2m, we rewrite the expectations in Equation 4. For g=1, E[H1,1,fX]=s1,0, and for g2, we have

E[H1,g,fX]=s1+12(hfE[H1,g1,fX]+hmE[H1,g1,mX]). (A1)

Similarly, sampling males in the admixed population, for g=1, E[H1,1,mX]=s1,0f. For g2,

E[H1,g,mX]=s1f+hfE[H1,g1,fX]. (A2)

Using Equations A1 and A2, we derive a generating function for the mean female X-chromosomal admixture fraction, which we then use to find closed-form expressions for E[H1,g,fX] and E[H1,g,mX]. Because E[H1,g,mX] depends only on constants and E[H1,g1,fX], we first find an expression for E[H1,g,fX], which we then use to report E[H1,g,mX].

First, using Equation A2, we can rewrite Equation A1 as a second-order recursion of a single variable,

E[H1,g,fX]={s1,0g=1s1+12(s1,0hf+s1,0fhm)g=2s1+hf2E[H1,g1,fX]+hm2(s1f+hfE[H1,g2,fX])g3. (A3)

We simplify the notation by defining zg=E[H1,g,fX]. For g3, we have

zg=c1+c2zg1+c3zg2, (A4)

with c1=s1+s1fhm/2, c2=hf/2, and c3=hfhm/2. Equation A3 gives z1=s1,0 and z2=s1+12(s1,0hf+s1,0fhm).

Define a generating function Z(x)=g=1zgxg whose coefficients zg represent the values of E[H1,g,fX] in each generation. As the admixed population does not yet exist in generation 0, E[H1,g,fX] and Z(x) are undefined for g=0. For convenience, we therefore work with W(x)=Z(x)/x, setting wg=zg+1 for g0. We then have

W(x)=g=0zg+1xg=g=0wgxg, (A5)

and wg=c1+c2wg1+c3wg2 for g2. Using Equation A5, it follows that

W(x)=w0+w1x+g=2(c1+c2wg1+c3wg2)xg=w0+w1x+c1g=2xg+c2xg=2wg1xg1+c3x2g=2wg2xg2=w0+w1x+c1(x21x)+c2x[W(x)w0]+c3x2W(x). (A6)

Solving for W(x), we have

W(x)=(w0+w1xc2w0x)(1x)+c1x2(1x)(1c2xc3x2). (A7)

We can decompose the expression in Equation A7, producing

W(x)=P11r1x+P21r2x+P31x, (A8)

where r1 and r2 are reciprocals of the two roots of 1c2xc3x2,

r1=c2+c22+4c32c3=hf+(hf)2+8hfhm4 (A9)
r2=c2c22+4c32c3=hf(hf)2+8hfhm4. (A10)

Setting Equation A7 equal to Equation A8, we have

P1=(w0r1+w1c2w0)(r11)+c1(r1r2)(r11) (A11)
P2=(w0r2+w1c2w0)(r21)+c1(r2r1)(r21) (A12)
P3=c1(1r1)(1r2). (A13)

The Taylor expansion of Equation A8 around x=0 then gives

W(x)=P1g=0r1gxg+P2g=0r2gxg+P3g=0xg,=g=0(P1r1g+P2r2g+P3)xg.

Therefore, for g0, wg=P1r1g+P2r2g+P3, and the closed-form expression for the X-chromosomal female mean admixture fraction in generation g1, E[H1,g,fX], is zg=P1r1g+1+P2r2g+1+P3. We report this result in the main text as Equation 17, using it to obtain E[H1,g,mX] in Equation 18.

Because for hf,hm[0,1], r1 monotonically increases in hf and hm and r2 monotonically decreases, the maxima and minima of r1 and r2 occur at the boundaries of the closed interval [0,1]. Using Equations A9 and A10, we have r1[0,1) and r2(1/2,0]; we exclude r1=1 and r2=1/2, as hf and hm cannot both be 1. Because |r1|,|r2|<1, the mean X-chromosomal admixture fractions in Equations 17 and 18 approach limits as g. Using Equations 17 and 18, we can find expressions for the limits of the mean of the X-chromosomal admixture fractions,

limgE[H1,g,fX]=P3
limgE[H1,g,mX]=s1f+hfP3,

which can be simplified to give Equations 19 and 20.

Footnotes

Communicating editor: J. D. Wall

Literature Cited

  1. Arbiza L., Gottipati S., Siepel A., Keinan A., 2014.  Contrasting X-linked and autosomal diversity across 14 human populations. Am. J. Hum. Genet. 94: 827–844. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Basin S., 1963.  The Fibonacci sequence as it appears in nature. Fibonacci Q 1: 53–56. [Google Scholar]
  3. Bedoya G., Montoya P., García J., Soto I., Bourgeois S., et al. , 2006.  Admixture dynamics in Hispanics: a shift in the nuclear genetic ancestry of a South American population isolate. Proc. Natl. Acad. Sci. USA 103: 7234–7239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Beleza S., Campos J., Lopes J., Araújo I. I., Almada A. H., et al. , 2012.  The admixture structure and genetic variation of the archipelago of Cape Verde and its implications for admixture mapping studies. PLoS ONE 7: e51103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Berniell-Lee G., Plaza S., Bosch E., Calafell F., Jourdan E., et al. , 2008.  Admixture and sexual bias in the population settlement of La Réunion Island (Indian Ocean). Am. J. Phys. Anthropol. 136: 100–107. [DOI] [PubMed] [Google Scholar]
  6. Bryc K., Auton A., Nelson M. R., Oksenberg J. R., Hauser S. L., et al. , 2010a Genome-wide patterns of population structure and admixture in West Africans and African Americans. Proc. Natl. Acad. Sci. USA 107: 786–791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bryc K., Velez C., Karafet T., Moreno-Estrada A., Reynolds A., et al. , 2010b Genome-wide patterns of population structure and admixture among Hispanic/Latino populations. Proc. Natl. Acad. Sci. USA 107: 8954–8961. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bryc K., Durand E. Y., Macpherson J. M., Reich D., Mountain J. L., 2015.  The genetic ancestry of African Americans, Latinos, and European Americans across the United States. Am. J. Hum. Genet. 96: 37–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bustamante C. D., Ramachandran S., 2009.  Evaluating signatures of sex-specific processes in the human genome. Nat. Genet. 41: 8–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Caballero A., 1995.  On the effective size of populations with separate sexes, with particular reference to sex-linked genes. Genetics 139: 1007–1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Casto A. M., Li J. Z., Absher D., Myers R., Ramachandran S., et al. , 2010.  Characterization of X-linked SNP genotypic variation in globally distributed human populations. Genome Biol. 11: R10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Charlesworth B., 2001.  The effect of life-history and mode of inheritance on neutral genetic variability. Genet. Res. 77: 153–166. [DOI] [PubMed] [Google Scholar]
  13. Cheng C.-Y., Kao W. H. L., Patterson N., Tandon A., Haiman C. A., et al. , 2009.  Admixture mapping of 15,280 African Americans identifies obesity susceptibility loci on chromosomes 5 and X. PLoS Genet. 5: e1000490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cox M. P., Karafet T. M., Lansing J. S., Sudoyo H., Hammer M. F., 2010.  Autosomal and X-linked single nucleotide polymorphisms reveal a steep Asian–Melanesian ancestry cline in eastern Indonesia and a sex bias in admixture rates. Proc. R. Soc. Lond. B Biol. Sci. 277: 1589–1596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Crow J. F., Kimura M., 1970.  An Introduction to Population Genetics Theory. Burgess, Minneapolis. [Google Scholar]
  16. Davis F. J., 2010.  Who Is Black? One Nation’s Definition. Penn State Press, University Park, PA. [Google Scholar]
  17. Ellegren H., 2009.  The different levels of genetic diversity in sex chromosomes and autosomes. Trends Genet. 25: 278–284. [DOI] [PubMed] [Google Scholar]
  18. Eltis D., Richardson D., 2010.  Atlas of the Transatlantic Slave Trade. Yale University Press, New Haven, CT. [Google Scholar]
  19. Emery L. S., Felsenstein J., Akey J. M., 2010.  Estimators of the human effective sex ratio detect sex biases on different timescales. Am. J. Hum. Genet. 87: 848–856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fredrickson G. M., 1982.  White Supremacy: A Comparative Study of American and South African History. Oxford University Press, Oxford. [Google Scholar]
  21. Goldberg A., Verdu P., Rosenberg N. A., 2014.  Autosomal admixture levels are informative about sex bias in admixed populations. Genetics 198: 1209–1229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gottipati S., Arbiza L., Siepel A., Clark A. G., Keinan A., 2011.  Analyses of X-linked and autosomal genetic variation in population-scale whole genome sequencing. Nat. Genet. 43: 741–743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Gravel S., 2012.  Population genetics models of local ancestry. Genetics 191: 607–619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hammer M. F., Mendez F. L., Cox M. P., Woerner A. E., Wall J. D., 2008.  Sex-biased evolutionary forces shape genomic patterns of human diversity. PLoS Genet. 4: e1000202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hammer M. F., Woerner A. E., Mendez F. L., Watkins J. C., Cox M. P., et al. , 2010.  The ratio of human X chromosome to autosome diversity is positively correlated with genetic distance from genes. Nat. Genet. 42: 830–831. [DOI] [PubMed] [Google Scholar]
  26. Hartl D. L., Clark A. G., 2007.  Principles of Population Genetics, Ed. 4 Sinauer Associates, Sunderland, MA. [Google Scholar]
  27. Hedrick P. W., 2007.  Sex: differences in mutation, recombination, selection, gene flow, and genetic drift. Evolution 61: 2750–2771. [DOI] [PubMed] [Google Scholar]
  28. Heyer E., Chaix R., Pavard S., Austerlitz F., 2012.  Sex-specific demographic behaviours that shape human genomic variation. Mol. Ecol. 21: 597–612. [DOI] [PubMed] [Google Scholar]
  29. Jennings H. S., 1914.  Formulae for the results of inbreeding. Am. Nat. 48: 693–696. [Google Scholar]
  30. Jennings H. S., 1916a The numerical results of diverse systems of breeding. Genetics 1: 53–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Jennings H. S., 1916b The numerical results of diverse systems of breeding. Proc. Natl. Acad. Sci. USA 2: 45–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Keinan A., Mullikin J. C., Patterson N., Reich D., 2009.  Accelerated genetic drift on chromosome X during the human dispersal out of Africa. Nat. Genet. 41: 66–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Koshy T., 2001.  Fibonacci and Lucas Numbers with Applications. John Wiley & Sons, Hoboken, NJ. [Google Scholar]
  34. Labuda D., Lefebvre J.-F., Nadeau P., Roy-Gagnon M.-H., 2010.  Female-to-male breeding ratio in modern humans—an analysis based on historical recombinations. Am. J. Hum. Genet. 86: 353–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Lambert C. A., Connelly C. F., Madeoy J., Qiu R., Olson M. V., et al. , 2010.  Highly punctuated patterns of population structure on the X chromosome and implications for African evolutionary history. Am. J. Hum. Genet. 86: 34–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Liang M., Nielsen R., 2014.  The lengths of admixture tracts. Genetics 197: 953–967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Lind J. M., Hutcheson-Dilks H. B., Williams S. M., Moore J. H., Essex M., et al. , 2007.  Elevated male European and female African contributions to the genomes of African American individuals. Hum. Genet. 120: 713–722. [DOI] [PubMed] [Google Scholar]
  38. Long J. C., 1991.  The genetic structure of admixed populations. Genetics 127: 417–428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Marcheco-Teruel B., Parra E. J., Fuentes-Smith E., Salas A., Buttenschøn H. N., et al. , 2014.  Cuba: exploring the history of admixture and the genetic basis of pigmentation using autosomal and uniparental markers. PLoS Genet. 10: e1004488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. McLean D. C., Jr, Spruill I., Argyropoulos G., Page G. P., Shriver M. D., et al. , 2005.  Mitochondrial DNA (mtDNA) haplotypes reveal maternal population genetic affinities of Sea Island Gullah-speaking African Americans. Am. J. Phys. Anthropol. 127: 427–438. [DOI] [PubMed] [Google Scholar]
  41. Moreno-Estrada A., Gravel S., Zakharia F., McCauley J. L., Byrnes J. K., et al. , 2013.  Reconstructing the population genetic history of the Caribbean. PLoS Genet. 9: e1003925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Painter N. I., 2006.  Creating Black Americans: African-American History and Its Meanings, 1619 to the Present. Oxford University Press, Oxford. [Google Scholar]
  43. Parra E. J., Marcini A., Akey J., Martinson J., Batzer M. A., et al. , 1998.  Estimating African American admixture proportions by use of population-specific alleles. Am. J. Hum. Genet. 63: 1839–1851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Parra E. J., Kittles R. A., Argyropoulos G., Pfaff C. L., Hiester K., et al. , 2001.  Ancestral proportions and admixture dynamics in geographically defined African Americans living in South Carolina. Am. J. Phys. Anthropol. 114: 18–29. [DOI] [PubMed] [Google Scholar]
  45. Pool J. E., Nielsen R., 2007.  Population size changes reshape genomic patterns of diversity. Evolution 61: 3001–3006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Pool J. E., Nielsen R., 2009.  Inference of historical changes in migration rate from the lengths of migrant tracts. Genetics 181: 711–719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Quintana-Murci L., Harmant C., Quach H., Balanovsky O., Zaporozhchenko V., et al. , 2010.  Strong maternal Khoisan contribution to the South African coloured population: a case of gender-biased admixture. Am. J. Hum. Genet. 86: 611–620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Ramachandran S., Rosenberg N. A., Zhivotovsky L. A., Feldman M. W., 2004.  Robustness of the inference of human population structure: a comparison of X–chromosomal and autosomal microsatellites. Hum. Genomics 1: 87–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Ramachandran S., Rosenberg N. A., Feldman M. W., Wakeley J., 2008.  Population differentiation and migration: coalescence times in a two-sex island model for autosomal and X–linked loci. Theor. Popul. Biol. 74: 291–301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Salzano F. M., Sans M., 2014.  Interethnic admixture and the evolution of Latin American populations. Genet. Mol. Biol. 37: 151–170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ségurel L., Wyman M. J., Przeworski M., 2014.  Determinants of mutation rate variation in the human germline. Annu. Rev. Genomics Hum. Genet. 15: 47–70. [DOI] [PubMed] [Google Scholar]
  52. Sloane N. J. A ., 2015 The Online Encyclopedia of Integer Sequences. Available at: https://oeis.org/. [Google Scholar]
  53. Stefflova K., Dulik M. C., Pai A. A., Walker A. H., Zeigler-Johnson C. M., et al. , 2009.  Evaluation of group genetic ancestry of populations from Philadelphia and Dakar in the context of sex-biased admixture in the Americas. PLoS ONE 4: e7842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Tian C., Hinds D. A., Shigeta R., Kittles R., Ballinger D. G., et al. , 2006.  A genomewide single-nucleotide-polymorphism panel with high ancestry information for African American admixture mapping. Am. J. Hum. Genet. 79: 640–649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Veeramah K. R., Gutenkunst R. N., Woerner A. E., Watkins J. C., Hammer M. F., 2014.  Evidence for increased levels of positive and negative selection on the X chromosome versus autosomes in humans. Mol. Biol. Evol. 31: 2267–2282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Verdu P., Rosenberg N. A., 2011.  A general mechanistic model for admixture histories of hybrid populations. Genetics 189: 1413–1426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Verdu P., Becker N. S., Froment A., Georges M., Grugni V., et al. , 2013.  Sociocultural behavior, sex-biased admixture, and effective population sizes in Central African Pygmies and non-Pygmies. Mol. Biol. Evol. 30: 918–937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Wang S., Ray N., Rojas W., Parra M. V., Bedoya G., et al. , 2008.  Geographic patterns of genome admixture in Latin American Mestizos. PLoS Genet. 4: e1000037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Wilkins J. F., 2006.  Unraveling male and female histories from human genetic data. Curr. Opin. Genet. Dev. 16: 611–617. [DOI] [PubMed] [Google Scholar]
  60. Wilkins J. F., Marlowe F. W., 2006.  Sex-biased migration in humans: What should we expect from genetic data? BioEssays 28: 290–300. [DOI] [PubMed] [Google Scholar]
  61. Wright S., 1921.  Systems of mating II. The effects of inbreeding on the genetic composition of a population. Genetics 6: 124–143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Wright S., 1933.  Inbreeding and homozygosis. Proc. Natl. Acad. Sci. USA 19: 411–420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Wright S., 1969.  The Theory of Gene Frequencies (Evolution and the Genetics of Populations), Vol. 2 University of Chicago Press, Chicago. [Google Scholar]

Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES