Population differentiation and migration: coalescence times in a two-sex island model for autosomal and X-linked loci

Sohini Ramachandran; Noah A Rosenberg; Marcus W Feldman; John Wakeley

doi:10.1016/j.tpb.2008.08.003

Author manuscript; available in PMC: 2009 Dec 1.

Published in final edited form as: Theor Popul Biol. 2008 Sep 4;74(4):291–301. doi: 10.1016/j.tpb.2008.08.003

PMCID: PMC2630000 NIHMSID: NIHMS83227 PMID: 18817799

Abstract

Evolutionists have debated whether population-genetic parameters, such as effective population size and migration rate, differ between males and females. In humans, most analysis of this problem has focused on the Y chromosome and the mitochondrial genome, while the X chromosome has largely been omitted from the discussion. Past studies have compared F_ST values for the Y and mitochondrion under a model with migration rates that differ between the sexes but with equal male and female population sizes. In this study we investigate rates of coalescence for X-linked and autosomal lineages in an island model with different population sizes and migration rates for males and females, obtaining the mean time to coalescence for pairs of lineages from the same deme and for pairs of lineages from different demes. We apply our results to microsatellite data from the Human Genome Diversity Panel, and we examine the male and female migration rates implied by observed F_ST values.

Keywords: X chromosome, F_ST, separation of time scales, coalescent

Introduction

Evolutionists have long been interested in how demographic and population structural variables differ between males and females, and sex-biased dispersal processes are common in a variety of species (Lawson Handley and Perrin, 2007).

Differences between human males and females in parameters such as migration rate and effective population size have generally been investigated using the uniparentally-inherited Y chromosome and mitochondrial genome. Past studies have observed differences in autosomal, Y-chromosomal and mitochondrial variation, and have typically explained these differences based on matrilocality or patrilocality (Wilkins and Marlowe, 2006; Wilkins, 2006).

In a patrilocal society, we expect to see more genetic differentiation across Y-chromosomal lineages than across mitochondrial lineages; such a pattern was observed using globally-distributed samples by Seielstad et al. (1998), while patterns consistent with matrilocality have been observed in Thailand (Oota et al., 2001) and Melanesia (Kayser et al., 2008). Recent studies have questioned the spatial scale at which one can expect to infer a genetic signature of patrilocality or matrilocality, arguing that this signal may be observable within geographic regions, but likely not at a global level (Wilder et al., 2004a; Wilkins and Marlowe, 2006).

The X chromosome has contributed comparatively little to the inference of sex-specific human migration rates. Garrigan et al. (2007) compared genetic variation using resequence data at 2 X-linked loci totaling 8486 bp, 6650 bp encompassing 13 Alu elements on the Y chromosome, and 780 bp of the cytochrome oxidase subunit III on the mitochondrion. Their inference of migration rates among 10 human populations did not produce a consistent pattern of sex-biased gene flow across all the loci investigated, though different rates of male and female migration were inferred for many pairs of populations.

Although variation in the Y chromosome and the mitochondrion has generally been used in studies of sex-specific differences in human dispersal, comparisons between variation observed on the X chromosome and on autosomes also have the potential to shed light on evolutionarily interesting differences between males and females (Schaffner, 2004). In contrast with the Y chromosome and the mitochondrial genome, each of which is effectively a single absolutely-linked locus, the X chromosome and autosomes offer numerous independent markers. The availability of multiple markers potentially adds power to the analysis, although recombination and the movement of the autosomes and X chromosome between males and females are expected to complicate the elucidation of sex-specific histories (Ramachandran et al., 2004; Wilkins and Marlowe, 2006).

Using 17 X-linked and 377 autosomal microsatellites genotyped in 52 globally-distributed populations in the Human Genome Diversity Panel (HGDP), Ramachandran et al. (2004) investigated differences in patterns of X-chromosomal and autosomal geographical variation around the world, as measured by F_ST among populations. These differences were studied by considering the different numbers of copies of X-linked and autosomal loci in a population, for a given female fraction of the total population size, and by deriving a formula for F_ST using a model of divergence from an ancestral population with subsequent isolation of descendant populations. Male and female effective population sizes were allowed to vary, but the model did not involve migration among subpopulations. Ramachandran et al. (2004) found that a ratio of the number of females to the total population size of 0.5 was sufficient to explain global differences in genetic variation between X-linked and autosomal microsatellites. However, the study could not explain differences in F_ST in some of the continental regions of the dataset where the divergence model might be less representative of population history (for example, Europe, where gene flow among populations post-divergence is likely to have been high).

Here we investigate the rates of coalescence for X-linked and autosomal loci in an island migration model with sex-specific population sizes and migration rates. Past theoretical studies have examined the effect of sex-specific gene flow and genetic drift on times to coalescence and F-statistics (Wang, 1997; Rousset, 1999; Wang, 1999; Laporte and Charlesworth, 2002; Vitalis, 2002; Hedrick, 2007). We consider these issues from a coalescent perspective. We start with an exact discrete island model with migrating adults, and use a result due to Möhle (1998) to explicitly take the limit of the coalescent process as population size goes to infinity. We obtain simple expressions for F_ST at X-linked and autosomal loci in our model under the usual assumptions of the structured coalescent.

Applying the analytical results to the X-linked and autosomal microsatellite data from the HGDP (Cann et al., 2002; Ramachandran et al., 2004; Ramachandran et al., 2005; Rosenberg et al., 2005), we find that global patterns of population differentiation as measured by F_ST can be explained without requiring different migration rates for males and females. Within geographic regions, however, the inferred sex-specific migration rates differ substantially, although the direction of the deviation is not always the same.

The migration model

Consider an island model with D demes and four sex-specific parameters, each of which has the same value for all demes: fixed numbers of males and females (N_m and N_f, respectively), and fixed numbers of male and female migrants per generation (M_m and M_f, respectively). The total population size is DN = D(N_m + N_f) (each deme has the same number of individuals). Here we can write N_f = Nr, where r is the female fraction of the population size, assumed to be the same for each deme. It follows that N_m = N (1 − r). Denote by m_f the backwards migration rate for females; that is, the probability that a female sampled from deme i has just migrated from some other deme in the generation during which sampling took place. The corresponding rate for males is m_m. Since M_m and M_f are fixed, m_f = M_f/N_f and m_m = M_m/N_m. We shall assume throughout that m_f and m_m are of the order 1/N. Migration takes place after reproduction within demes, and the probability that a male (for example) migrates to a specific deme is m_m/(D − 1).

We consider a single genetic locus. The resulting single-generation transition matrix for a sample of two autosomal lineages in this model has 10 states. For a sample of two X-linked lineages the model has 9 states, as listed in Table 1.

Table 1. States in the migration model.

Possible states in which two sampled lineages can be found in the island model with two sexes, and the columns of the autosomal and X-linked single-generation transition matrices that correspond to each state. Note that two sampled X-linked lineages cannot be found in the same male unless they have already coalesced.

Autosomal column	X-linked column	Definition
1	1	In one female individual, not coalesced
2	(none)	In one male individual, not coalesced
3	2	In two female individuals, same deme
4	3	In two male individuals, same deme
5	4	In one male and one female, same deme
6	5	In two female individuals, different demes
7	6	In two male individuals, different demes
8	7	In one male and one female, different demes
9	8	In one female, coalesced
10	9	In one male, coalesced


Let P_A be the 10 × 10 single-generation transition matrix for two lineages sampled from an autosomal locus, and let (P_A)_ij refer to the entry in the ith row and jth column of the matrix. Each matrix entry is the product of two terms: (a) a term involving migration or lack of migration among demes, and (b) a term describing inheritance.

For example, (P_A)₅₆, according to Table 1, is the entry describing the probability that two lineages sampled from one male and one female in the same deme came from female parents in different demes in the previous generation. (P_A)₅₆ is the product of (a) the probability that one male and one female lineage currently in the same deme were in different demes in the previous generation (either because one lineage was in a migrant or because both lineages were in migrants that arrived in the same deme), and (b) the probability that two autosomal lineages (one from a male and one from a female) both came from female parents. The latter probability is 1/4, since for each sampled individual we choose the maternal autosome with probability 1/2.

P_X denotes the 9 × 9 single-generation transition matrix for two lineages sampled from an X-linked locus. (P_X)₄₅ is the probability that two X-linked lineages sampled from one male and one female in the same deme came from female parents in different demes in the previous generation (Table 1). The probability (a) above, that the lineages were in different demes in the previous generation, will not differ between an X-linked and autosomal locus. However, the analog to (b) above, the probability that two X-linked lineages (one from a male and one from a female) came from two female parents is 1/2. This is because the male allele would have had to come from the female parent in the previous generation, while we choose the female’s allele from her maternal X with probability 1/2.
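The inheritance terms of type (b) in the two examples above can be checked by direct enumeration. The following is a minimal sketch rather than part of the original analysis; the function names `p_from_mother` and `p_both_from_mothers` are hypothetical, and the two lineages are assumed to sit in different individuals so that their parental origins are independent.

```python
from fractions import Fraction

def p_from_mother(carrier_sex: str, locus: str) -> Fraction:
    """Probability that a lineage carried by an individual of the given sex
    was inherited from that individual's mother."""
    if locus == "autosomal":
        # Either parental copy is sampled with equal probability.
        return Fraction(1, 2)
    if locus == "X":
        # A male's single X always comes from his mother;
        # a female's sampled X is maternal half of the time.
        return Fraction(1) if carrier_sex == "M" else Fraction(1, 2)
    raise ValueError(f"unknown locus type: {locus}")

def p_both_from_mothers(sex1: str, sex2: str, locus: str) -> Fraction:
    """Probability that two lineages in two different individuals
    both trace back to female parents."""
    return p_from_mother(sex1, locus) * p_from_mother(sex2, locus)

# Inheritance factor in (P_A)_56: one male and one female, autosomal locus.
print(p_both_from_mothers("M", "F", "autosomal"))
# Inheritance factor in (P_X)_45: one male and one female, X-linked locus.
print(p_both_from_mothers("M", "F", "X"))
```

Exact fractions are used so that the values can be compared directly with the factors 1/4 and 1/2 quoted in the text.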

The matrices P_A and P_X are rather cumbersome due to their size. Since the terms describing migration among demes do not depend on whether the sampled locus is X-linked or autosomal, the matrices’ entries can be written more simply by using the notation $g_{i, j}^{k, l}$ for terms of type (a) above in the following manner. Let us denote the state in which two lineages, regardless of sex, are in the same deme as state I; state II represents two lineages being from different demes. Then $g_{I, I I}^{M, F}$ is the probability that a sample of one male and one female now in state I was in state II in the previous generation, which corresponds to (a) in the previous paragraph. The probabilities $g_{i, j}^{k, l}$ for all types of samples are given in Appendix 1.

Using this notation, for example, (P_A)₃₉ is equal to the product of (a) $g_{I,I}^{F,F}$ (the probability that two females currently in the same deme were in the same deme in the previous generation) and (b) 1/(8N_f) (the probability that two sampled lineages, one from each sampled female, coalesce in a female in the previous generation). The factor 1/(8N_f) is the probability that the maternal autosome is selected in both females (= 1/2 × 1/2) times the probability that the two lineages were inherited from the same maternal chromosome (= 1/(2N_f)). Similarly, (P_X)₆₂ is equal to (a) $g_{II,I}^{M,M}$ (the probability that two sampled males currently in different demes were in the same deme in the previous generation) times (b) 1 − 1/N_f (the probability that the sampled lineages come from two different females). Since a male’s X chromosome must come from his mother, the probability that two male X chromosomes are found in two different females is simply the probability that the chromosomes do not come from the same female.

Suppose the sampled lineages are currently in the same individual but that the lineages have not coalesced (columns 1 and 2 in P_A and column 1 in P_X). Because migration occurs after reproduction within demes, the lineages had to be in a male and female (the individual’s parents) in the same deme in the previous generation, regardless of whether or not the individual from whom the lineages were sampled had migrated (see rows 1 and 2 of matrix (1) and row 1 of matrix (2)).

Thus we can write down both the autosomal and X-linked single generation transition matrices, P_A and P_X, as matrices (1) and (2). Above both matrices, we indicate the sex structure of the sample for each column (e.g., ℳ, ℳ denotes lineages sampled from two males), and the physical locations associated with states (e.g., in the same individual but not coalesced, or from different demes).

P_{A} = [\begin{matrix} \overset{Same individual}{\overset{︷}{\begin{matrix} F & M \\ 0 & 0 \\ 0 & 0 \\ \frac{g_{I, I}^{F, F}}{8 N_{f}} & \frac{g_{I, I}^{F, F}}{8 N_{m}} \\ \frac{g_{I, I}^{M, M}}{8 N_{f}} & \frac{g_{I, I}^{M, M}}{8 N_{m}} \\ \frac{g_{I, I}^{M, F}}{8 N_{f}} & \frac{g_{I, I}^{M, F}}{8 N_{m}} \\ \frac{g_{I I, I}^{F, F}}{8 N_{f}} & \frac{g_{I I, I}^{F, F}}{8 N_{m}} \\ \frac{g_{I I, I}^{M, M}}{8 N_{f}} & \frac{g_{I I, I}^{M, M}}{8 N_{m}} \\ \frac{g_{I I, I}^{M, F}}{8 N_{f}} & \frac{g_{I I, I}^{M, F}}{8 N_{m}} \\ 0 & 0 \\ 0 & 0 \end{matrix}}} & \overset{Same deme}{\overset{︷}{\begin{matrix} F, F & M, M & M, F \\ 0 & 0 & 1 \\ 0 & 0 & 1 \\ \frac{g_{I, I}^{F, F}}{4} (1 - \frac{1}{N_{f}}) & \frac{g_{I, I}^{F, F}}{4} (1 - \frac{1}{N_{m}}) & \frac{g_{I, I}^{F, F}}{2} \\ \frac{g_{I, I}^{M, M}}{4} (1 - \frac{1}{N_{f}}) & \frac{g_{I, I}^{M, M}}{4} (1 - \frac{1}{N_{m}}) & \frac{g_{I, I}^{M, M}}{2} \\ \frac{g_{I, I}^{M, F}}{4} (1 - \frac{1}{N_{f}}) & \frac{g_{I, I}^{M, F}}{4} (1 - \frac{1}{N_{m}}) & \frac{g_{I, I}^{M, F}}{2} \\ \frac{g_{I I, I}^{F, F}}{4} (1 - \frac{1}{N_{f}}) & \frac{g_{I I, I}^{F, F}}{4} (1 - \frac{1}{N_{m}}) & \frac{g_{I I, I}^{F, F}}{2} \\ \frac{g_{I I, I}^{M, M}}{4} (1 - \frac{1}{N_{f}}) & \frac{g_{I I, I}^{M, M}}{4} (1 - \frac{1}{N_{m}}) & \frac{g_{I I, I}^{M, M}}{2} \\ \frac{g_{I I, I}^{M, F}}{4} (1 - \frac{1}{N_{f}}) & \frac{g_{I I, I}^{M, F}}{4} (1 - \frac{1}{N_{m}}) & \frac{g_{I I, I}^{M, F}}{2} \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{matrix}}} & \overset{Different demes}{\overset{︷}{\begin{matrix} F, F & M, M & M, F \\ 0 & 0 & 0 \\ 0 & 0 & 0 \\ \frac{g_{I, I I}^{F, F}}{4} & \frac{g_{I, I I}^{F, F}}{4} & \frac{g_{I, I I}^{F, F}}{2} \\ \frac{g_{I, I I}^{M, M}}{4} & \frac{g_{I, I I}^{M, M}}{4} & \frac{g_{I, I I}^{M, M}}{2} \\ \frac{g_{I, I I}^{M, F}}{4} & \frac{g_{I, I I}^{M, F}}{4} & \frac{g_{I, I I}^{M, F}}{2} \\ \frac{g_{I I, I I}^{F, F}}{4} & \frac{g_{I I, I I}^{F, F}}{4} & \frac{g_{I I, I I}^{F, F}}{2} \\ \frac{g_{I I, I I}^{M, M}}{4} & 
\frac{g_{I I, I I}^{M, M}}{4} & \frac{g_{I I, I I}^{M, M}}{2} \\ \frac{g_{I I, I I}^{M, F}}{4} & \frac{g_{I I, I I}^{M, F}}{4} & \frac{g_{I I, I I}^{M, F}}{2} \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{matrix}}} & \overset{Coalesced}{\overset{︷}{\begin{matrix} F & M \\ 0 & 0 \\ 0 & 0 \\ \frac{g_{I, I}^{F, F}}{8 N_{f}} & \frac{g_{I, I}^{F, F}}{8 N_{m}} \\ \frac{g_{I, I}^{M, M}}{8 N_{f}} & \frac{g_{I, I}^{M, M}}{8 N_{m}} \\ \frac{g_{I, I}^{M, F}}{8 N_{f}} & \frac{g_{I, I}^{M, F}}{8 N_{m}} \\ \frac{g_{I I, I}^{F, F}}{8 N_{f}} & \frac{g_{I I, I}^{F, F}}{8 N_{m}} \\ \frac{g_{I I, I}^{M, M}}{8 N_{f}} & \frac{g_{I I, I}^{M, M}}{8 N_{m}} \\ \frac{g_{I I, I}^{M, F}}{8 N_{f}} & \frac{g_{I I, I}^{M, F}}{8 N_{m}} \\ \frac{1}{2} & \frac{1}{2} \\ \frac{1}{2} & \frac{1}{2} \end{matrix}}} \end{matrix}] .

(1)

P_{X} = \left[\begin{matrix} \overbrace{\begin{matrix} F \\ 0 \\ \frac{g_{I,I}^{F,F}}{8N_{f}} \\ \frac{g_{I,I}^{M,M}}{2N_{f}} \\ \frac{g_{I,I}^{M,F}}{4N_{f}} \\ \frac{g_{II,I}^{F,F}}{8N_{f}} \\ \frac{g_{II,I}^{M,M}}{2N_{f}} \\ \frac{g_{II,I}^{M,F}}{4N_{f}} \\ 0 \\ 0 \end{matrix}}^{\text{Same female}} & \overbrace{\begin{matrix} F, F & M, M & M, F \\ 0 & 0 & 1 \\ \frac{g_{I,I}^{F,F}}{4} \left(1 - \frac{1}{N_{f}}\right) & \frac{g_{I,I}^{F,F}}{4} \left(1 - \frac{1}{N_{m}}\right) & \frac{g_{I,I}^{F,F}}{2} \\ g_{I,I}^{M,M} \left(1 - \frac{1}{N_{f}}\right) & 0 & 0 \\ \frac{g_{I,I}^{M,F}}{2} \left(1 - \frac{1}{N_{f}}\right) & 0 & \frac{g_{I,I}^{M,F}}{2} \\ \frac{g_{II,I}^{F,F}}{4} \left(1 - \frac{1}{N_{f}}\right) & \frac{g_{II,I}^{F,F}}{4} \left(1 - \frac{1}{N_{m}}\right) & \frac{g_{II,I}^{F,F}}{2} \\ g_{II,I}^{M,M} \left(1 - \frac{1}{N_{f}}\right) & 0 & 0 \\ \frac{g_{II,I}^{M,F}}{2} \left(1 - \frac{1}{N_{f}}\right) & 0 & \frac{g_{II,I}^{M,F}}{2} \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{matrix}}^{\text{Same deme}} & \overbrace{\begin{matrix} F, F & M, M & M, F \\ 0 & 0 & 0 \\ \frac{g_{I,II}^{F,F}}{4} & \frac{g_{I,II}^{F,F}}{4} & \frac{g_{I,II}^{F,F}}{2} \\ g_{I,II}^{M,M} & 0 & 0 \\ \frac{g_{I,II}^{M,F}}{2} & 0 & \frac{g_{I,II}^{M,F}}{2} \\ \frac{g_{II,II}^{F,F}}{4} & \frac{g_{II,II}^{F,F}}{4} & \frac{g_{II,II}^{F,F}}{2} \\ g_{II,II}^{M,M} & 0 & 0 \\ \frac{g_{II,II}^{M,F}}{2} & 0 & \frac{g_{II,II}^{M,F}}{2} \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{matrix}}^{\text{Different demes}} & \overbrace{\begin{matrix} F & M \\ 0 & 0 \\ \frac{g_{I,I}^{F,F}}{8N_{f}} & \frac{g_{I,I}^{F,F}}{4N_{m}} \\ \frac{g_{I,I}^{M,M}}{2N_{f}} & 0 \\ \frac{g_{I,I}^{M,F}}{4N_{f}} & 0 \\ \frac{g_{II,I}^{F,F}}{8N_{f}} & \frac{g_{II,I}^{F,F}}{4N_{m}} \\ \frac{g_{II,I}^{M,M}}{2N_{f}} & 0 \\ \frac{g_{II,I}^{M,F}}{4N_{f}} & 0 \\ \frac{1}{2} & \frac{1}{2} \\ 1 & 0 \end{matrix}}^{\text{Coalesced}} \end{matrix}\right] .

(2)

Results

We can rewrite both transition matrices in equations (1) and (2) in the form

P = D + B / N + E_{N} .

(3)

Assuming that M_f and M_m do not depend on N (i.e., as N approaches infinity, the numbers of migrants per generation converge to limiting constants, which are again denoted by M_f and M_m for convenience), we have D = lim_{N→∞} P and B = lim_{N→∞} N(P − D), neither of which depends on N. Note that, in equation (3), E_N = P − D − B/N denotes an error matrix with terms of the order of m², 1/N², and m/N. See Appendix 1 for an example of this decomposition.

The entries in D represent a fast process, namely the movement of lineages between males and females according to Mendelian inheritance, while the entries in B represent rare processes of migration and coalescence, which are assumed to occur once over a period on the order of N generations. Möhle’s theorem (1998) states that if R = lim_{t→∞} D^t exists (letting the fast process run to its conclusion), then the rates of coalescence and migration among demes when time is scaled by N generations are given by the product matrix G = RBR. Specifically, lim_{N→∞} P^{Nt} = R e^{tG} (Möhle, 1998).
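The separation of time scales can be illustrated numerically on a small example. The sketch below uses a toy three-state chain (two transient states that mix in a single step, plus an absorbing state) that is an assumption made for illustration and is not one of the matrices of this paper; the names `transition`, `absorbed_exact` and `absorbed_limit` are hypothetical. It compares a large power of P = D + B/N with the limit R e^{tG} for the probability of absorption.

```python
import math
import numpy as np

# Toy two-time-scale chain on states (a, b, absorbed); NOT a matrix from the paper.
D = np.array([[0.5, 0.5, 0.0],
              [0.5, 0.5, 0.0],
              [0.0, 0.0, 1.0]])   # fast process: a and b mix in one step
B = np.array([[-1.0, 0.0, 1.0],
              [0.0, 0.0, 0.0],
              [0.0, 0.0, 0.0]])   # slow process: absorption from a with probability 1/N

def transition(N: int) -> np.ndarray:
    """Single-step matrix P = D + B/N (the error term E_N is zero in this toy)."""
    return D + B / N

# R = lim D^t; D is idempotent here, so any positive power equals the limit.
R = np.linalg.matrix_power(D, 50)
G = R @ B @ R                     # rates of the slow process on the N-step time scale

def absorbed_exact(N: int, t: float) -> float:
    """Probability of being absorbed after int(N*t) steps, starting from state a."""
    return float(np.linalg.matrix_power(transition(N), int(N * t))[0, 2])

def absorbed_limit(t: float) -> float:
    """Value implied by R e^{tG}: absorption at the lumped rate G[0, 2]."""
    return 1.0 - math.exp(-G[0, 2] * t)

print(G[0, 2], absorbed_exact(10_000, 1.0), absorbed_limit(1.0))
```

In this toy chain the lineage spends half of its time in state a, so the absorption rate on the slow time scale is half the rate that applies while in a; the product RBR performs exactly this averaging over the fast process.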

We show D_X (= lim_{N→∞} P_X) and R_X in (4) and (5) below, while the detailed derivations of the corresponding autosomal matrices and of G_X and G_A appear in Appendix 2. In the case of P_X given by matrix (2), D_X = lim_{N→∞} P_X is

D_{X} = [\begin{matrix} 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 / 2 & 0 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 / 2 & 0 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 / 2 & 1 / 2 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 \end{matrix}] .

(4)

The columns in matrix (4) can be interpreted using the definitions in Table 1. The terms in D_X are familiar terms based on the inheritance of X chromosomes, as are the entries of R_X = lim_{t→∞} (D_X)^t:

R_{X} = [\begin{matrix} 0 & 4 / 9 & 1 / 9 & 4 / 9 & 0 & 0 & 0 & 0 & 0 \\ 0 & 4 / 9 & 1 / 9 & 4 / 9 & 0 & 0 & 0 & 0 & 0 \\ 0 & 4 / 9 & 1 / 9 & 4 / 9 & 0 & 0 & 0 & 0 & 0 \\ 0 & 4 / 9 & 1 / 9 & 4 / 9 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 4 / 9 & 1 / 9 & 4 / 9 & 0 & 0 \\ 0 & 0 & 0 & 0 & 4 / 9 & 1 / 9 & 4 / 9 & 0 & 0 \\ 0 & 0 & 0 & 0 & 4 / 9 & 1 / 9 & 4 / 9 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 / 3 & 1 / 3 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 / 3 & 1 / 3 \end{matrix}] .

(5)
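Matrix (5) can be reproduced numerically from matrix (4). The sketch below is an illustrative check, assuming that a large finite power of D_X is an adequate stand-in for the limit; the variable names are hypothetical.

```python
import numpy as np

# D_X copied from matrix (4); rows and columns follow the X-linked states of Table 1.
D_X = np.array([
    [0, 0,   0,   1,   0,   0,   0,   0,   0],
    [0, 1/4, 1/4, 1/2, 0,   0,   0,   0,   0],
    [0, 1,   0,   0,   0,   0,   0,   0,   0],
    [0, 1/2, 0,   1/2, 0,   0,   0,   0,   0],
    [0, 0,   0,   0,   1/4, 1/4, 1/2, 0,   0],
    [0, 0,   0,   0,   1,   0,   0,   0,   0],
    [0, 0,   0,   0,   1/2, 0,   1/2, 0,   0],
    [0, 0,   0,   0,   0,   0,   0,   1/2, 1/2],
    [0, 0,   0,   0,   0,   0,   0,   1,   0],
])

# Every row of a transition matrix must be a probability distribution.
row_sums = D_X.sum(axis=1)

# Approximate R_X = lim (D_X)^t by a large finite power.
R_X = np.linalg.matrix_power(D_X, 200)

# Multiplying by 9 makes the entries 4/9, 1/9, 6/9 and 3/9 easy to read.
print(np.round(9 * R_X, 6))
```

Within each block the rows of R_X are identical: they are the stationary distribution of the fast inheritance process, (4/9, 1/9, 4/9) over the states two females, two males, and one male and one female, and (2/3, 1/3) over a coalesced lineage being in a female or a male. The non-unit eigenvalues of the blocks have modulus at most 1/2, so the power converges quickly.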

When applying Möhle’s result to P_A and P_X, a block structure emerges in the R and G matrices for both X-linked and autosomal loci, exemplified by the blocks seen in matrix (5). We can collapse some states together by summing the entries in their columns and by collapsing some rows, reducing the analysis to 3 × 3 matrices. For example, we can sum the entries $\sum_{j = 1}^{4} {(G_{X})}_{1 j}$ (see Appendix 2) and get a single rate of staying in the same deme for two lineages sampled in the same female individual, but not coalesced (the state described by row and column 1 of P_X). The sum $\sum_{j = 1}^{4} {(G_{X})}_{i j}$ has the same value for each i = 1, 2, 3, 4. This is because, in the fast process occurring according to D_X and D_A, lineages move quickly between males and females, so the current sex structure of the sample becomes unimportant and instead we need only follow whether sampled lineages are in the same deme or in different demes. Thus, the product matrices G_X and G_A for X-linked and autosomal lineages in this process simplify to the 3 × 3 matrices ℊ_X and ℊ_A (equations (6) and (7), respectively; see Appendix 2 for derivation).

\mathcal{G}_{X} = \left[\begin{matrix} \overbrace{\begin{matrix} - \frac{2}{3} \left(\frac{2 M_{f}}{r} + \frac{M_{m}}{1 - r}\right) - \frac{2 - r}{9 r (1 - r)} \\ \frac{2}{3 (D - 1)} \left(\frac{2 M_{f}}{r} + \frac{M_{m}}{1 - r}\right) \\ 0 \end{matrix}}^{\text{Same deme}} & \overbrace{\begin{matrix} \frac{2}{3} \left(\frac{2 M_{f}}{r} + \frac{M_{m}}{1 - r}\right) \\ - \frac{2}{3 (D - 1)} \left(\frac{2 M_{f}}{r} + \frac{M_{m}}{1 - r}\right) \\ 0 \end{matrix}}^{\text{Different demes}} & \overbrace{\begin{matrix} \frac{2 - r}{9 r (1 - r)} \\ 0 \\ 0 \end{matrix}}^{\text{Coalesced}} \end{matrix}\right] .

(6)

\mathcal{G}_{A} = \left[\begin{matrix} \overbrace{\begin{matrix} - \left(\frac{M_{f}}{r} + \frac{M_{m}}{1 - r}\right) - \frac{1}{8 r (1 - r)} \\ \frac{1}{D - 1} \left(\frac{M_{f}}{r} + \frac{M_{m}}{1 - r}\right) \\ 0 \end{matrix}}^{\text{Same deme}} & \overbrace{\begin{matrix} \frac{M_{f}}{r} + \frac{M_{m}}{1 - r} \\ - \frac{1}{D - 1} \left(\frac{M_{f}}{r} + \frac{M_{m}}{1 - r}\right) \\ 0 \end{matrix}}^{\text{Different demes}} & \overbrace{\begin{matrix} \frac{1}{8 r (1 - r)} \\ 0 \\ 0 \end{matrix}}^{\text{Coalesced}} \end{matrix}\right] .

(7)

Using first-step analysis, we can calculate the expected times to coalescence for a pair of lineages, sampled from the same deme (E[T^same]) or sampled from different demes (E[T^diff]). In the discrete time processes studied here, the expected time to arrive in state j given that the current state is i equals the time to make the jump from state i to another state plus the expected time it takes to reach state j after the jump is made. We are interested in time to coalescence, so we need to solve the following equations to get, for example, $E [T_{A}^{same}]$ and $E [T_{A}^{diff}]$ :

\begin{array}{l} E [T_{A}^{same}] = \frac{1}{{(\mathcal{G}_{A})}_{12} + {(\mathcal{G}_{A})}_{13}} + \frac{{(\mathcal{G}_{A})}_{12}}{{(\mathcal{G}_{A})}_{12} + {(\mathcal{G}_{A})}_{13}} E [T_{A}^{diff}] \\ E [T_{A}^{diff}] = \frac{1}{{(\mathcal{G}_{A})}_{21}} + E [T_{A}^{same}] . \end{array}

Solving these and the analogous equations for X-linked loci gives equations (8 – 11), measured in units of N generations.

E [T_{A}^{same}] = 8 D r (1 - r)

(8)

E [T_{A}^{diff}] = 8 D r (1 - r) + \frac{(D - 1) (1 - r) r}{M_{f} (1 - r) + M_{m} r}

(9)

E [T_{X}^{same}] = \frac{9 D r (1 - r)}{2 - r}

(10)

E [T_{X}^{diff}] = \frac{9 D r (1 - r)}{2 - r} + \frac{3}{2} (\frac{(D - 1) (1 - r) r}{2 M_{f} (1 - r) + M_{m} r}) .

(11)
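Equations (8) to (11) can be checked against a direct numerical solution of the first-step equations. The sketch below builds the 3 × 3 rate matrices of equations (6) and (7) and solves the linear system for the two transient states; the function names and the parameter values used at the end are illustrative assumptions, not estimates from data.

```python
import numpy as np

def rate_matrix(locus: str, M_f: float, M_m: float, r: float, D: int) -> np.ndarray:
    """3 x 3 rate matrix of equation (6) (locus='X') or equation (7) (locus='A').
    States: same deme, different demes, coalesced."""
    if locus == "A":
        mig = M_f / r + M_m / (1 - r)
        coal = 1 / (8 * r * (1 - r))
    elif locus == "X":
        mig = (2 / 3) * (2 * M_f / r + M_m / (1 - r))
        coal = (2 - r) / (9 * r * (1 - r))
    else:
        raise ValueError("locus must be 'A' or 'X'")
    return np.array([
        [-(mig + coal), mig, coal],
        [mig / (D - 1), -mig / (D - 1), 0.0],
        [0.0, 0.0, 0.0],
    ])

def expected_times(G: np.ndarray) -> np.ndarray:
    """Solve the first-step equations: with Q the transient block of G,
    the expected absorption times T satisfy -Q T = 1."""
    Q = G[:2, :2]
    return np.linalg.solve(-Q, np.ones(2))

def closed_form_times(locus: str, M_f: float, M_m: float, r: float, D: int) -> np.ndarray:
    """Equations (8)-(9) for autosomal loci and (10)-(11) for X-linked loci."""
    if locus == "A":
        same = 8 * D * r * (1 - r)
        diff = same + (D - 1) * (1 - r) * r / (M_f * (1 - r) + M_m * r)
    else:
        same = 9 * D * r * (1 - r) / (2 - r)
        diff = same + 1.5 * (D - 1) * (1 - r) * r / (2 * M_f * (1 - r) + M_m * r)
    return np.array([same, diff])

# Arbitrary illustrative parameter values (assumed, not taken from the data).
params = dict(M_f=2.0, M_m=1.0, r=0.4, D=10)
for locus in ("A", "X"):
    numeric = expected_times(rate_matrix(locus, **params))
    print(locus, numeric, closed_form_times(locus, **params))
```

Writing the two first-step equations as a linear system makes the check independent of the algebra used to reach the closed forms. As expected for an island model, the within-deme coalescence time depends on D and r but not on the migration parameters.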

Using our notation, Slatkin’s (1991) formulation of F_ST at an autosomal locus in a set of D demes is $F_{ST,A} = 1 - E [T_{A}^{same}] / \{E [T_{A}^{same}] / D + (D - 1) E [T_{A}^{diff}] / D\}$, where the term in braces is the expected coalescence time for two lineages sampled at random from the whole population. The relationship between coalescence times and F_ST in this formulation depends on the mutation rate being very small. As D approaches infinity, we get

F_{S T, A} = \frac{1}{1 + 8 [M_{f} (1 - r) + M_{m} r]}

(12)

F_{S T, X} = \frac{1}{1 + \frac{6}{2 - r} [2 M_{f} (1 - r) + M_{m} r]} .

(13)
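The passage to the limit can be illustrated by evaluating Slatkin’s expression at finite D with the coalescence times of equations (8) to (11) and comparing it with equations (12) and (13). This is a minimal sketch; the function names and parameter values are assumptions chosen for illustration.

```python
def fst_slatkin(t_same: float, t_diff: float, D: int) -> float:
    """Slatkin's (1991) F_ST from mean coalescence times in D demes."""
    t_bar = t_same / D + (D - 1) * t_diff / D
    return 1 - t_same / t_bar

def fst_finite(locus: str, M_f: float, M_m: float, r: float, D: int) -> float:
    """F_ST at finite D, using equations (8)-(9) or (10)-(11)."""
    if locus == "A":
        t_same = 8 * D * r * (1 - r)
        t_diff = t_same + (D - 1) * (1 - r) * r / (M_f * (1 - r) + M_m * r)
    else:
        t_same = 9 * D * r * (1 - r) / (2 - r)
        t_diff = t_same + 1.5 * (D - 1) * (1 - r) * r / (2 * M_f * (1 - r) + M_m * r)
    return fst_slatkin(t_same, t_diff, D)

def fst_limit(locus: str, M_f: float, M_m: float, r: float) -> float:
    """Large-D limits: equation (12) for autosomal and (13) for X-linked loci."""
    if locus == "A":
        return 1 / (1 + 8 * (M_f * (1 - r) + M_m * r))
    return 1 / (1 + 6 / (2 - r) * (2 * M_f * (1 - r) + M_m * r))

# Illustrative values only (assumed): convergence as the number of demes grows.
for D in (5, 50, 500, 5000):
    print(D, fst_finite("A", 2.0, 1.0, 0.5, D), fst_finite("X", 2.0, 1.0, 0.5, D))
print("limit", fst_limit("A", 2.0, 1.0, 0.5), fst_limit("X", 2.0, 1.0, 0.5))
```

The finite-D values approach the limits from below, with a discrepancy that shrinks in proportion to 1/D.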

Given estimates of F_ST at X-linked and autosomal loci, and assuming some value on the interval (0,1) for r, we can estimate M_f and M_m from (12) and (13) as:

M_{f} = \frac{1}{(1 - r)} [\frac{2 - r}{6 F_{S T, X}} - \frac{1}{8} (\frac{1}{F_{S T, A}} + \frac{1}{3}) - \frac{1 - r}{6}]

(14)

M_{m} = \frac{1}{r} [- \frac{2 - r}{6 F_{S T, X}} + \frac{1}{4} (\frac{1}{F_{S T, A}} + \frac{1}{3}) - \frac{r}{6}] .

(15)
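Equations (14) and (15) translate directly into code. The sketch below implements them and checks that they invert equations (12) and (13); the function names are hypothetical. The example call uses the rounded worldwide F_ST values from Table 2.

```python
def fst_from_migrants(M_f: float, M_m: float, r: float) -> tuple:
    """Equations (12) and (13): returns (F_ST,X, F_ST,A)."""
    fst_a = 1 / (1 + 8 * (M_f * (1 - r) + M_m * r))
    fst_x = 1 / (1 + 6 / (2 - r) * (2 * M_f * (1 - r) + M_m * r))
    return fst_x, fst_a

def migrants_from_fst(fst_x: float, fst_a: float, r: float) -> tuple:
    """Equations (14) and (15): returns (M_f, M_m) for a given female fraction r."""
    m_f = ((2 - r) / (6 * fst_x) - (1 / fst_a + 1 / 3) / 8 - (1 - r) / 6) / (1 - r)
    m_m = (-(2 - r) / (6 * fst_x) + (1 / fst_a + 1 / 3) / 4 - r / 6) / r
    return m_f, m_m

# Round trip: arbitrary migrant numbers -> F_ST -> migrant numbers.
fx, fa = fst_from_migrants(M_f=1.5, M_m=0.7, r=0.3)
print(migrants_from_fst(fx, fa, r=0.3))

# Worldwide point estimates from Table 2 (as rounded there), with r = 0.5.
m_f, m_m = migrants_from_fst(fst_x=0.0718, fst_a=0.0561, r=0.5)
print(m_f, m_m, m_f / m_m)
```

With the rounded inputs the ratio comes out at roughly 1.16, close to but not identical to the tabulated 1.1524; the small difference is presumably due to rounding of the F_ST values in the table.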

Application to HGDP-CEPH data

A total of 783 autosomal microsatellites from Marshfield Screening Sets #10 and #52 have been reported in the HGDP individuals from 52 populations. Screening Set #10 also contained the 17 non-pseudoautosomal X-linked microsatellites studied by Ramachandran et al. (2004), and Screening Set #52 provided 19 additional non-pseudoautosomal X-linked microsatellites studied here. The data files used in this analysis are available from the authors.

We inferred the sex of individuals from their X-chromosomal genotypes at the 36 loci examined, and verified the inferences against the corresponding inferences made using the X-chromosomal data of Conrad et al. (2006). With one exception, individuals treated as males in our analysis all had <15% heterozygous loci and females all had >19% heterozygous loci on the X chromosome, among loci with no missing data. The exception, individual #139, was verified to be male on the basis of the data of Conrad et al. (2006), which included a larger number of X-chromosomal loci. Males were treated as hemizygous for calculations. Some males were reported as heterozygous at non-pseudoautosomal X-linked loci; in such cases males were coded as having missing data at these loci.

We calculated F_ST based on the 36 X-linked and 783 autosomal microsatellites typed in the Human Genome Diversity Panel, using Weir’s estimator (Weir, 1996) for the proportion of genetic variation distributed among populations. F_ST was calculated among all populations, as well as among populations within the same continental region, as defined previously by Rosenberg et al. (2002); the estimator was obtained separately for X-linked loci and for autosomal loci, following equation (5.3) on page 174 of Weir (1996). For the computation we grouped all Bantu individuals into one population with a sample size of 20 individuals. We obtained confidence intervals for X-linked and autosomal F_ST values by bootstrapping separately over each set of loci 1000 times (see intervals in Tables 2 and 3).

Table 2. Estimated ratio of M_f/M_m, using data from 1048 individuals.

Estimates of the among-population component of genetic variation based on 783 autosomal and 36 X-linked microsatellites in the HGDP-CEPH individuals, for global and various regional subsets of the data. Also reported are the estimated ratios of M_f/M_m when r = 0.5, calculated using 1048 individuals (Rosenberg, 2006). “C/S Asia” refers to Central/South Asian populations from the panel (see Rosenberg et al., 2002). Confidence intervals for F_ST were obtained by bootstrapping over loci 1000 times. The number of values out of 10⁶ used to generate intervals for M_f/M_m, after the exclusion of negative estimates, is also given.

Sample	Number of populations	F_ST, autosomal (95% C.I.)		F_ST, X-linked (95% C.I.)		M_f/M_m at r = 0.5 (95% C.I.)		Number of ratios ≥ 0
World	52	0.0561	(0.0543, 0.0579)	0.0718	(0.0620, 0.0839)	1.1524	(0.4149, 4.4257)	999535
Africa	6	0.0300	(0.0286, 0.0314)	0.0539	(0.0401, 0.0691)	0.0936	(0.0078, 1.1324)	716775
Eurasia	21	0.0158	(0.0152, 0.0165)	0.0226	(0.0183, 0.0276)	0.6345	(0.1519, 2.7184)	998081
Europe	8	0.0079	(0.0071, 0.0087)	0.0122	(0.0069, 0.0180)	0.4127	(0.0208, 9.1672)	818855
Middle East	4	0.0137	(0.0130, 0.0145)	0.0162	(0.0121, 0.0208)	2.1807	(0.4004, 32.2857)	852251
C/S Asia	9	0.0137	(0.0130, 0.0145)	0.0149	(0.0095, 0.0208)	4.9120	(0.3772, 59.5790)	636089
East Asia	18	0.0125	(0.0117, 0.0134)	0.0156	(0.0102, 0.0215)	1.4720	(0.1563, 23.0857)	865762
Oceania	2	0.0635	(0.0577, 0.0692)	0.0847	(0.0544, 0.1220)	0.8746	(0.05421, 18.3075)	863700
America	5	0.1174	(0.1127, 0.1219)	0.1367	(0.1166, 0.1567)	2.1349	(0.7401, 19.5850)	964990


Table 3. Estimated ratio of M_f/M_m, using data from 952 individuals.

This table reports an analysis similar to that reported in Table 2, but using 952 individuals (Rosenberg, 2006). This set of HGDP individuals contains no two individuals with a second-degree relationship (half siblings, avuncular, or grandparent/grandchild). Confidence intervals for F_ST were obtained by bootstrapping over loci 1000 times. The number of values out of 10⁶ used to generate intervals for M_f/M_m, after the exclusion of negative estimates, is also given.

Sample	Number of populations	F_ST, autosomal (95% C.I.)		F_ST, X-linked (95% C.I.)		M_f/M_m at r = 0.5 (95% C.I.)		Number of ratios ≥ 0
World	52	0.0455	(0.0438, 0.0472)	0.0586	(0.0491, 0.0702)	1.1354	(0.3530, 5.2649)	992554
Africa	6	0.0260	(0.0245, 0.0274)	0.0465	(0.0338, 0.0611)	0.1026	(0.0087, 1.2628)	720323
Eurasia	21	0.0150	(0.0144, 0.0156)	0.0218	(0.0172, 0.0266)	0.5896	(0.1265, 2.8729)	994778
Europe	8	0.0076	(0.0069, 0.0084)	0.0112	(0.0061, 0.0167)	0.5722	(0.0262, 15.7893)	808661
Middle East	4	0.0130	(0.0121, 0.0137)	0.0150	(0.0111, 0.0194)	2.5593	(0.4367, 40.5666)	817864
C/S Asia	9	0.0127	(0.0119, 0.0134)	0.0146	(0.0089, 0.0205)	2.7676	(0.2370, 34.5307)	719969
East Asia	18	0.0113	(0.0105, 0.0121)	0.0134	(0.0081, 0.0190)	2.1811	(0.1713, 35.2105)	745435
Oceania	2	0.0552	(0.0493, 0.0616)	0.0753	(0.0410, 0.1155)	0.7702	(0.0320, 19.1807)	766059
America	5	0.0836	(0.0799, 0.0876)	0.0942	(0.0789, 0.1087)	3.0884	(0.9013, 32.3565)	909730


We use equations (14) and (15) to estimate the ratio of female migrants to male migrants using observed F_ST values from the data, for a given assumed proportion of females in the population. Note that in order for M_f and M_m to be interpretable they must be positive, which may not be the case for certain combinations of F_ST and r values. Setting the right-hand sides of equations (15) and (14) greater than zero and solving for F_ST,X shows that, in order for both M_f and M_m to be greater than zero, the condition 2F_ST,A(2 − r)/[3 + F_ST,A(1 − 2r)] < F_ST,X < 4F_ST,A(2 − r)/[3 + F_ST,A(5 − 4r)] must be satisfied. The region in which M_f/M_m is positive for various fixed values of r, as F_ST,X and F_ST,A vary on the interval [0,1], is shown in Figure 1.

Figure 1. The region in which the ratio M_f/M_m is positive, as computed from equations (14) and (15) for fixed values of r, with F_ST,X and F_ST,A varying on the interval [0,1]. The region is shaded in grey. The solid line is 2F_ST,A(2 − r)/[3 + F_ST,A(1 − 2r)], which F_ST,X must exceed for M_m to be greater than zero. The dashed line is 4F_ST,A(2 − r)/[3 + F_ST,A(5 − 4r)], below which F_ST,X must lie for M_f to be greater than zero.
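The boundaries of the admissible region follow from setting equations (15) and (14) equal to zero and solving for F_ST,X. The sketch below, with hypothetical function names, computes the two bounds in this way and verifies that M_m and M_f vanish on them.

```python
def migrants_from_fst(fst_x: float, fst_a: float, r: float) -> tuple:
    """Equations (14) and (15): returns (M_f, M_m)."""
    m_f = ((2 - r) / (6 * fst_x) - (1 / fst_a + 1 / 3) / 8 - (1 - r) / 6) / (1 - r)
    m_m = (-(2 - r) / (6 * fst_x) + (1 / fst_a + 1 / 3) / 4 - r / 6) / r
    return m_f, m_m

def fst_x_bounds(fst_a: float, r: float) -> tuple:
    """Open interval of F_ST,X values for which both M_f and M_m are positive.
    lower: solution of M_m = 0 in equation (15);
    upper: solution of M_f = 0 in equation (14)."""
    lower = 2 * fst_a * (2 - r) / (3 + fst_a * (1 - 2 * r))
    upper = 4 * fst_a * (2 - r) / (3 + fst_a * (5 - 4 * r))
    return lower, upper

def both_positive(fst_x: float, fst_a: float, r: float) -> bool:
    """True when the pair of F_ST values implies positive M_f and M_m."""
    lower, upper = fst_x_bounds(fst_a, r)
    return lower < fst_x < upper

# Worldwide point estimates from Table 2 with r = 0.5.
print(fst_x_bounds(0.0561, 0.5), both_positive(0.0718, 0.0561, 0.5))
```

At r = 0.5 the lower bound reduces to F_ST,A itself, so a positive M_m requires the X-linked F_ST to exceed the autosomal F_ST.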

We obtained intervals for M_f/M_m (Tables 2 and 3) by taking the 1000 bootstrapped F_ST,X and 1000 bootstrapped F_ST,A values, and computing M_f/M_m for all 10⁶ possible pairs of bootstrapped F_ST values. We disregarded those estimates of M_f/M_m which were negative, choosing to interpret negative estimates of M_f and M_m as providing little support for r = 0.5 or for our migration model. The number of values used to generate the intervals in Tables 2 and 3 after the exclusion of negative estimates is also given.
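The pairing of bootstrap replicates can be sketched with array broadcasting. The code below is an illustration under stated assumptions: the function name `ratio_interval` is hypothetical, the interval is taken as the central percentile range of the retained ratios (the text does not specify how the interval was formed from the retained values), and the short input arrays are made-up stand-ins rather than actual bootstrap replicates.

```python
import numpy as np

def ratio_interval(boot_fst_x, boot_fst_a, r=0.5, level=0.95):
    """Compute M_f/M_m for every (F_ST,X, F_ST,A) pair via equations (14) and (15),
    drop negative ratios, and return (lower, upper, number of ratios kept)."""
    fx = np.asarray(boot_fst_x, dtype=float)[:, None]   # column vector
    fa = np.asarray(boot_fst_a, dtype=float)[None, :]   # row vector
    m_f = ((2 - r) / (6 * fx) - (1 / fa + 1 / 3) / 8 - (1 - r) / 6) / (1 - r)
    m_m = (-(2 - r) / (6 * fx) + (1 / fa + 1 / 3) / 4 - r / 6) / r
    ratio = (m_f / m_m).ravel()                         # all pairwise ratios
    kept = ratio[ratio >= 0]                            # exclude negative estimates
    tail = (1 - level) / 2
    lower, upper = np.quantile(kept, [tail, 1 - tail])  # assumed percentile interval
    return float(lower), float(upper), int(kept.size)

# Made-up placeholder replicates (NOT the HGDP bootstrap values).
boot_x = [0.050, 0.062, 0.070, 0.072, 0.084]
boot_a = [0.0543, 0.0561, 0.0579]
print(ratio_interval(boot_x, boot_a))
```

With 1000 replicates in each input the broadcast produces the full 1000 × 1000 grid of 10⁶ ratios, and the third returned value corresponds to the count of non-negative ratios.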

Since the initial announcement of the Human Genome Diversity Panel (Cann et al., 2002), analyses have called attention to individuals who appear to be duplicated or closely related. Here we calculate F_ST for two sets of HGDP individuals (Tables 2 and 3): 1048 individuals, where one individual from each pair of putatively duplicated individuals (Mountain and Ramakrishnan, 2005; Rosenberg, 2006) is excluded; and 952 individuals, a proper subset of the set of 1048, where individuals with first- and second-degree relationships are excluded (Rosenberg, 2006).

Discussion

In this paper, we apply Möhle’s theorem (1998) to transition matrices for X-linked and autosomal loci sampled in an island model of D demes with sex-specific population sizes and migration rates, and we obtain simple expressions under the model for expected times to coalescence for two sampled alleles and for F_ST at X-linked and autosomal loci. Möhle’s result is useful because it gives us a continuous-time limit of a discrete-time process in which events occur on two time scales: in this case, the fast process of movement of lineages between males and females, and the slow processes of movement of individuals among demes and of coalescence.

The entries in matrices (6) and (7) give us the rates at which, when time is measured in units of N generations, two sampled lineages move among three states: being in the same deme, being in different demes, or being “coalesced”. For autosomal lineages, entry (1,2) of matrix (7) gives the rate (over N generations) at which lineages move out of the same deme into different demes, entry (2,1) gives the rate of movement of lineages into the same deme from different demes, and entry (1,3) gives the rate of coalescence, which can only happen in the model when lineages are in the same deme. In both matrix (6) and matrix (7) the last row contains only zeros because coalescence is an absorbing state.

The rates of coalescence given by entry (1,3) of matrices (7) and (6) are familiar: they are half the reciprocals of the variance effective population sizes of autosomal and X-linked genes, respectively, in a sexual population with an unequal sex ratio (e.g., Nordborg and Krone, 2002; Hartl and Clark, 2007). The expected times to coalescence given in equations (8)–(11) also reflect that two lineages sampled from different demes must enter the same deme to coalesce, and then coalesce at the rate expected for an X-linked or autosomal locus in a population with an unequal sex ratio.

Using the expected times to coalescence for loci sampled in the same deme or in different demes, we can calculate F_ST at autosomal and X-linked loci in our model, as in equations (12) and (13). The forms of (12) and (13) are 1/(1 + 4N_em_e), where N_e = 4N_mN_f/(N_m + N_f) = 4Nr(1 − r) and m_e = (m_m + m_f)/2 for autosomal loci, and N_e = 9N_mN_f/(4N_m + 2N_f) = 9Nr(1 − r)/[2(2 − r)] and m_e = (2m_f + m_m)/3 for X-linked loci.
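As a worked check under the definitions above, substituting N_f = Nr, N_m = N(1 − r), M_f = N_f m_f and M_m = N_m m_m expresses the product 4N_e m_e in terms of the numbers of migrants of each sex. This is only a restatement of the forms just given, not an additional result:

\begin{array}{l}
\text{autosomal: } 4 N_{e} m_{e} = 4 \cdot 4 N r (1 - r) \cdot \frac{m_{m} + m_{f}}{2} = 8 [M_{f} (1 - r) + M_{m} r] , \\
\text{X-linked: } 4 N_{e} m_{e} = 4 \cdot \frac{9 N r (1 - r)}{2 (2 - r)} \cdot \frac{2 m_{f} + m_{m}}{3} = \frac{6 [2 M_{f} (1 - r) + M_{m} r]}{2 - r} .
\end{array}

Setting M_m = M_f = M reduces the two expressions to 8M and 6M, respectively.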

When M_m = M_f = M, then F_ST,A = 1/(1 + 8M) and F_ST,X = 1/(1 + 6M), with F_ST,X being greater than F_ST,A. When the number of female migrants per generation is greater than the number of male migrants, and these values are less than 1, then F_ST,X can become less than F_ST,A for some values of r on (0,1), as shown in Figure 2A, where F_ST,X crosses F_ST,A at r = 0.5076 and r = 0.9949. If the number of male migrants exceeds the number of female migrants per generation, then F_ST,X > F_ST,A; in Figure 2B these values become closer for larger values of r.
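The crossing points quoted for Figure 2A can be checked numerically. The sketch below assumes the F_ST forms 1/(1 + 8[M_f(1 − r) + M_m r]) and 1/(1 + 6[2M_f(1 − r) + M_m r]/(2 − r)) obtained from the effective sizes and migration rates above; the function names and the bisection brackets are illustrative choices.

```python
def fst_autosomal(m_f, m_m, r):
    """F_ST,A = 1 / (1 + 8 [M_f (1 - r) + M_m r])."""
    return 1.0 / (1.0 + 8.0 * (m_f * (1.0 - r) + m_m * r))


def fst_x_linked(m_f, m_m, r):
    """F_ST,X = 1 / (1 + 6 [2 M_f (1 - r) + M_m r] / (2 - r))."""
    return 1.0 / (1.0 + 6.0 * (2.0 * m_f * (1.0 - r) + m_m * r) / (2.0 - r))


def bisect_root(f, lo, hi, tol=1e-12):
    """Bisection for a root of f on [lo, hi]; assumes f changes sign on the bracket."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def gap(r):
    """F_ST,X - F_ST,A for the Figure 2A setting M_f = 1, M_m = 0.01."""
    return fst_x_linked(1.0, 0.01, r) - fst_autosomal(1.0, 0.01, r)


if __name__ == "__main__":
    print(bisect_root(gap, 0.3, 0.7), bisect_root(gap, 0.7, 0.999))
```

Under these assumptions the two roots fall within about 10⁻⁴ of the values r = 0.5076 and r = 0.9949 reported in the text, and F_ST,X lies below F_ST,A between them.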

Figure 2. F_ST at X-linked and autosomal loci (equations (12) and (13)) as r, the female fraction of the population, varies on [0,1]. The dashed line is F_ST,X and the solid line is F_ST,A. A: M_f = N_f m_f = 1 migrant per generation, while the number of male migrants per generation is M_m = 0.01. B: M_m, the number of male migrants per generation, is equal to 1, while M_f = 0.01.

Using observed values of F_ST from the Human Genome Diversity Panel at X-linked and autosomal loci, we then use equations (14) and (15) to estimate the ratio of female to male migrants M_f/M_m. We assume equal numbers of males and females when calculating the estimates in Tables 2 and 3. In this model, there are no differences between the rates of reproductive success in males and females. However, the consequence of differences between reproductive success in males and females is an important question for further investigation (see, for example, Helgason et al., 2003, and Wilder et al., 2004b).

When r = 0.5, global F_ST values across HGDP populations can be explained by requiring a ratio of female to male migrants only slightly larger than 1. Regional values vary a great deal, both when r = 0.5 and when r varies over [0.4, 0.6] (Figure 3). In the Middle East, Central/South Asia, East Asia, and the Americas, assuming r = 0.5, observed F_ST values require a greater number of female migrants than male migrants to be explained in our model, while in Africa, Europe, and Oceania, the analysis finds support for more male migrants than female migrants. This could be due to differences in reproductive success for males and females in these regions, or to some other assumption made in our model.

Figure 3. Global and regional estimates of the ratio of female to male migrants (the ratio of equation (14) to equation (15)) as r varies over [0.4, 0.6], based on F_ST values calculated using 952 individuals from the Human Genome Diversity Panel.

Although we are not able to draw strong empirical conclusions from these data, we have rigorously derived F_ST and expected times to coalescence under this new model, making explicit how population differentiation, as measured by F_ST, depends on the number of males and females in a population and on the migration rates of the sexes. A scenario with males migrating more than females (Figure 2B) creates a bigger discrepancy between F_ST,A and F_ST,X than the reverse situation, producing differences that are much larger than those observed between F_ST values for the autosomes and for the X chromosome in the HGDP dataset. In combination with other tools, our results may assist in further investigations of the contributions of males and females to the history of human migration.

Acknowledgments

We thank Jeremy Van Cleve and Daniel Garrigan for helpful discussions, and Jon Wilkins and an anonymous reviewer for comments on earlier versions of this manuscript. This work was supported by the William F. Milton Fund of Harvard University, NSF grant DEB-0609760, and NIH grants GM-28016 and GM-081441.

Appendix 1: The migration components of transition matrix entries

Recall that $g_{i, j}^{M, M}$ is the probability that two lineages sampled from two males and currently in state i (i = I, II) were in state j (j = I, II) in the previous generation; states I and II indicate that the sampled lineages were in the same deme (I) or in different demes (II). For any sample, g_i,j will depend on the backwards migration rate and the population sizes of males (and/or females, depending on the individuals from which lineages were sampled), but will not depend on whether the sampled locus is X-linked or autosomal. Note that sampling of individuals is done without replacement. Thus, g_i,j for a sample of two males is given below:

\begin{array}{l}
g_{I, I}^{M, M} = \overbrace{\frac{N_{m} (1 - m_{m})}{N_{m}} \left( \frac{N_{m} (1 - m_{m}) - 1}{N_{m} - 1} \right)}^{\text{neither lineage was in a migrant}} + \overbrace{\left( \frac{N_{m} m_{m}}{N_{m}} \right) \frac{N_{m} m_{m} - 1}{N_{m} - 1} \left( \frac{1}{D - 1} \right)}^{\text{both lineages were in migrants from the same deme}} \\
g_{I, I I}^{M, M} = \overbrace{\frac{2 N_{m} (1 - m_{m})}{N_{m}} \left( \frac{N_{m} m_{m}}{N_{m} - 1} \right)}^{\text{exactly one lineage was in a migrant}} + \overbrace{\left( \frac{N_{m} m_{m}}{N_{m}} \right) \frac{N_{m} m_{m} - 1}{N_{m} - 1} \left( \frac{D - 2}{D - 1} \right)}^{\text{both lineages were in migrants from different demes}} \\
g_{I I, I}^{M, M} = \frac{2 m_{m} (1 - m_{m})}{D - 1} + \frac{m_{m}^{2} (D - 2)}{{(D - 1)}^{2}} \\
g_{I I, I I}^{M, M} = {(1 - m_{m})}^{2} + \frac{2 m_{m} (1 - m_{m}) (D - 2)}{D - 1} + m_{m}^{2} \left[ \frac{1}{D - 1} + {\left( \frac{D - 2}{D - 1} \right)}^{2} \right] .
\end{array}

The corresponding probabilities for a sample of two females are obtained by substituting m_f and N_f for m_m and N_m, respectively, in the equations above.

For a sample with one male lineage and one female lineage:

\begin{array}{l} g_{I, I}^{M, F} = (1 - m_{m}) (1 - m_{f}) + \frac{m_{f} m_{m}}{D - 1} \\ g_{I, I I}^{M, F} = m_{f} (1 - m_{m}) + m_{m} (1 - m_{f}) + m_{f} m_{m} \frac{D - 2}{D - 1} \\ g_{I I, I}^{M, F} = \frac{1}{D - 1} [m_{f} (1 - m_{m}) + m_{m} (1 - m_{f})] + \frac{m_{f} m_{m}}{D - 1} \frac{D - 2}{D - 1} \\ g_{I I, I I}^{M, F} = (1 - m_{m}) (1 - m_{f}) + \frac{D - 2}{D - 1} [m_{f} (1 - m_{m}) + m_{m} (1 - m_{f})] + \\ m_{f} m_{m} [\frac{1}{D - 1} + {(\frac{D - 2}{D - 1})}^{2}] . \end{array}

Note that $g_{i, I}^{k, l} + g_{i, I I}^{k, l}$ (the probability alleles sampled from two individuals with sexes k and l in state i in the present were in individuals in either the same or different demes in the previous generation) is 1 for all i, k, l.
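This normalization can be verified with exact rational arithmetic. The sketch below transcribes the expressions for same-sex and mixed-sex samples given above; the function names and the particular values of N, m and D are illustrative assumptions.

```python
from fractions import Fraction as F


def g_same_sex(N, m, D):
    """g_{i,j} for two lineages sampled from the same sex (size N, backward migration rate m)."""
    return {
        ("I", "I"): (1 - m) * (N * (1 - m) - 1) / (N - 1)
                    + m * (N * m - 1) / (N - 1) * F(1, D - 1),
        ("I", "II"): 2 * (1 - m) * (N * m) / (N - 1)
                     + m * (N * m - 1) / (N - 1) * F(D - 2, D - 1),
        ("II", "I"): 2 * m * (1 - m) / (D - 1) + m * m * F(D - 2, (D - 1) ** 2),
        ("II", "II"): (1 - m) ** 2 + 2 * m * (1 - m) * F(D - 2, D - 1)
                      + m * m * (F(1, D - 1) + F(D - 2, D - 1) ** 2),
    }


def g_mixed(m_m, m_f, D):
    """g_{i,j} for one male lineage and one female lineage."""
    one = m_f * (1 - m_m) + m_m * (1 - m_f)   # exactly one lineage was in a migrant
    both = m_f * m_m                          # both lineages were in migrants
    stay = (1 - m_m) * (1 - m_f)              # neither lineage was in a migrant
    return {
        ("I", "I"): stay + both / (D - 1),
        ("I", "II"): one + both * F(D - 2, D - 1),
        ("II", "I"): one / (D - 1) + both / (D - 1) * F(D - 2, D - 1),
        ("II", "II"): stay + F(D - 2, D - 1) * one
                      + both * (F(1, D - 1) + F(D - 2, D - 1) ** 2),
    }


def row_sums(g):
    """Sum over the previous-generation state j for each current state i."""
    return {i: g[(i, "I")] + g[(i, "II")] for i in ("I", "II")}


if __name__ == "__main__":
    print(row_sums(g_same_sex(50, F(1, 10), 5)))
    print(row_sums(g_mixed(F(1, 10), F(1, 4), 5)))
```

Because `Fraction` arithmetic is exact, both row sums equal 1 identically rather than up to rounding error.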

To give an example of the decomposition of terms in matrices (1) and (2) according to equation (3), let us examine (P_A)₃₅ more closely. (P_A)₃₅ is the probability that two alleles sampled from two females in the same deme in the present were in one male and one female in the same deme one generation ago.

\begin{array}{l} {(P_{A})}_{35} = \frac{g_{I, I}^{F, F}}{2} \\ = \frac{1}{2} [\frac{N_{f} (1 - m_{f})}{N_{f}} (\frac{N_{f} (1 - m_{f}) - 1}{N_{f} - 1}) + (\frac{N_{f} m_{f}}{N_{f}}) \frac{N_{f} m_{f} - 1}{N_{f} - 1} (\frac{1}{D - 1})] . \end{array}

Substituting m_f = M_f/N_f = M_f/(Nr),

\begin{array}{l}
{(D_{A})}_{35} = \lim_{N \to \infty} {(P_{A})}_{35} \\
= \lim_{N \to \infty} \frac{g_{I, I}^{F, F}}{2} \\
= \lim_{N \to \infty} \frac{1}{2} \left[ \left( 1 - \frac{M_{f}}{N r} \right) \left( \frac{N r [1 - \frac{M_{f}}{N r}] - 1}{N r - 1} \right) \right] + \lim_{N \to \infty} \overbrace{\frac{M_{f}}{2 N r} \left( \frac{M_{f} - 1}{N r - 1} \right) \frac{1}{D - 1}}^{\text{goes to 0 as } N \to \infty} \\
= \lim_{N \to \infty} \frac{1}{2} \left[ \left( 1 - \frac{M_{f}}{N r} \right) \left( \frac{N r - M_{f} - 1}{N r - 1} \right) \right] = \frac{1}{2} \quad \text{(see } {(D_{A})}_{35} \text{ in Appendix 2)} .
\end{array}

Using the definition of B_A from equation (3), we get

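{(B_{A})}_{35} = \lim_{N \to \infty} N \left[ {(P_{A})}_{35} - {(D_{A})}_{35} \right] = \lim_{N \to \infty} \frac{N}{2} \left[ \left( 1 - \frac{M_{f}}{N r} \right) \left( \frac{N r - M_{f} - 1}{N r - 1} \right) - 1 \right] + \lim_{N \to \infty} \frac{M_{f}}{2 r} \left( \frac{M_{f} - 1}{N r - 1} \right) \frac{1}{D - 1} . \qquad (16)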

The second term on the right-hand side of equation (16) goes to 0 as N → ∞; applying L’Hospital’s rule to the first term gives

{(B_{A})}_{35} = \lim_{N \to \infty} \frac{- 2 M_{f} N r}{2 r (N r - 1)} = \lim_{N \to \infty} \frac{- 2 M_{f} r}{2 r^{2}} = \frac{- M_{f}}{r} \quad \text{(see } {(B_{A})}_{35} \text{ in Appendix 2)} .

Let E_N,A denote the error matrix E_N from equation (3) of the autosomal transition matrix P_A. Then

\begin{array}{l} {(E_{N, A})}_{35} = {(P_{A})}_{35} - {(D_{A})}_{35} - \frac{{(B_{A})}_{35}}{N} \\ = \frac{g_{I, I}^{F, F}}{2} - \frac{1}{2} + \frac{M_{f}}{N r} \\ = \frac{D (M_{f} - 1) M_{f}}{2 (D - 1) N r (N r - 1)} \underset{N \to \infty}{\to} 0. \end{array}
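The decomposition of (P_A)₃₅ into (D_A)₃₅ + (B_A)₃₅/N + (E_N,A)₃₅ can be checked exactly for finite N. The following sketch uses rational arithmetic; the function names and the chosen parameter values (r = 1/2, M_f = 3, D = 5) are illustrative assumptions.

```python
from fractions import Fraction as F


def p35(N, r, M_f, D):
    """(P_A)_35 = g^{F,F}_{I,I} / 2 with N_f = N r and m_f = M_f / N_f."""
    n_f = N * r
    m_f = M_f / n_f
    stay = (1 - m_f) * (n_f * (1 - m_f) - 1) / (n_f - 1)
    both_migrate = m_f * (n_f * m_f - 1) / (n_f - 1) / (D - 1)
    return (stay + both_migrate) / 2


def e35(N, r, M_f, D):
    """(E_{N,A})_35 = (P_A)_35 - 1/2 - (-M_f / r) / N."""
    return p35(N, r, M_f, D) - F(1, 2) + M_f / (N * r)


def e35_closed(N, r, M_f, D):
    """Closed form D (M_f - 1) M_f / [2 (D - 1) N r (N r - 1)]."""
    return D * (M_f - 1) * M_f / (2 * (D - 1) * N * r * (N * r - 1))


if __name__ == "__main__":
    r, M_f, D = F(1, 2), F(3), 5
    for N in (10, 100, 1000, 10**6):
        scaled = N * (p35(N, r, M_f, D) - F(1, 2))   # approaches (B_A)_35 = -M_f / r
        print(N, float(scaled), float(e35(N, r, M_f, D)))
```

The error term matches the closed form exactly for every N tried, and N[(P_A)₃₅ − 1/2] approaches −M_f/r at rate 1/N, as the limit requires.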

Appendix 2: The derivation of G_X and G_A using Möhle’s (1998) result

From equation (3), as N approaches infinity,

\begin{array}{l} B_{X} = \lim_{N \to \infty} N (P_{X} - D_{X}) = \\
[\begin{matrix}
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
\frac{1}{8 r} & - \frac{1 + 2 M_{f}}{4 r} & - \frac{1}{4} (\frac{1}{1 - r} + \frac{2 M_{f}}{r}) & - \frac{M_{f}}{r} & \frac{M_{f}}{2 r} & \frac{M_{f}}{2 r} & \frac{M_{f}}{r} & \frac{1}{8 r} & \frac{1}{4 (1 - r)} \\
\frac{1}{2 r} & - \frac{2 M_{m}}{1 - r} - \frac{1}{r} & 0 & 0 & \frac{2 M_{m}}{1 - r} & 0 & 0 & \frac{1}{2 r} & 0 \\
\frac{1}{4 r} & - \frac{1}{2} (\frac{M_{m}}{1 - r} + \frac{1 + M_{f}}{r}) & 0 & - \frac{1}{2} (\frac{M_{m}}{1 - r} + \frac{M_{f}}{r}) & \frac{1}{2} (\frac{M_{m}}{1 - r} + \frac{M_{f}}{r}) & 0 & \frac{1}{2} (\frac{M_{m}}{1 - r} + \frac{M_{f}}{r}) & \frac{1}{4 r} & 0 \\
0 & \frac{M_{f}}{2 (D - 1) r} & \frac{M_{f}}{2 (D - 1) r} & \frac{M_{f}}{(D - 1) r} & - \frac{M_{f}}{2 (D - 1) r} & - \frac{M_{f}}{2 (D - 1) r} & - \frac{M_{f}}{(D - 1) r} & 0 & 0 \\
0 & \frac{2 M_{m}}{(D - 1) (1 - r)} & 0 & 0 & - \frac{2 M_{m}}{(D - 1) (1 - r)} & 0 & 0 & 0 & 0 \\
0 & \frac{1}{2 (D - 1)} (\frac{M_{m}}{1 - r} + \frac{M_{f}}{r}) & 0 & \frac{1}{2 (D - 1)} (\frac{M_{m}}{1 - r} + \frac{M_{f}}{r}) & - \frac{1}{2 (D - 1)} (\frac{M_{m}}{1 - r} + \frac{M_{f}}{r}) & 0 & - \frac{1}{2 (D - 1)} (\frac{M_{m}}{1 - r} + \frac{M_{f}}{r}) & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0
\end{matrix}] . \end{array}

Using the above and R_X given in matrix (5), the product matrix

\begin{array}{l} G_{X} = R_{X} B_{X} R_{X} = \\
[\begin{matrix}
0 & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{1}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & \frac{8}{27} g_{X}^{*} & \frac{2}{27} g_{X}^{*} & \frac{8}{27} g_{X}^{*} & \frac{2 (2 - r)}{27 (1 - r) r} & \frac{2 - r}{27 (1 - r) r} \\
0 & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{1}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & \frac{8}{27} g_{X}^{*} & \frac{2}{27} g_{X}^{*} & \frac{8}{27} g_{X}^{*} & \frac{2 (2 - r)}{27 (1 - r) r} & \frac{2 - r}{27 (1 - r) r} \\
0 & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{1}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & \frac{8}{27} g_{X}^{*} & \frac{2}{27} g_{X}^{*} & \frac{8}{27} g_{X}^{*} & \frac{2 (2 - r)}{27 (1 - r) r} & \frac{2 - r}{27 (1 - r) r} \\
0 & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{1}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & - \frac{4}{81} (\frac{1 + 6 M_{m}}{1 - r} + \frac{2 (1 + 6 M_{f})}{r}) & \frac{8}{27} g_{X}^{*} & \frac{2}{27} g_{X}^{*} & \frac{8}{27} g_{X}^{*} & \frac{2 (2 - r)}{27 (1 - r) r} & \frac{2 - r}{27 (1 - r) r} \\
0 & \frac{8}{27 (D - 1)} g_{X}^{*} & \frac{2}{27 (D - 1)} g_{X}^{*} & \frac{8}{27 (D - 1)} g_{X}^{*} & - \frac{8}{27 (D - 1)} g_{X}^{*} & - \frac{2}{27 (D - 1)} g_{X}^{*} & - \frac{8}{27 (D - 1)} g_{X}^{*} & 0 & 0 \\
0 & \frac{8}{27 (D - 1)} g_{X}^{*} & \frac{2}{27 (D - 1)} g_{X}^{*} & \frac{8}{27 (D - 1)} g_{X}^{*} & - \frac{8}{27 (D - 1)} g_{X}^{*} & - \frac{2}{27 (D - 1)} g_{X}^{*} & - \frac{8}{27 (D - 1)} g_{X}^{*} & 0 & 0 \\
0 & \frac{8}{27 (D - 1)} g_{X}^{*} & \frac{2}{27 (D - 1)} g_{X}^{*} & \frac{8}{27 (D - 1)} g_{X}^{*} & - \frac{8}{27 (D - 1)} g_{X}^{*} & - \frac{2}{27 (D - 1)} g_{X}^{*} & - \frac{8}{27 (D - 1)} g_{X}^{*} & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0
\end{matrix}], \end{array}

where $g_{X}^{*} = M_{m} / (1 - r) + 2 M_{f} / r$ .

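To obtain the terms given in matrix (6) in the main text (left-hand sides), we sum the entries of the 9 × 9 product matrix G_X (right-hand sides) over the columns belonging to each aggregated state: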

\begin{array}{l} {(G_{X})}_{11} = \sum_{j = 1}^{4} {(G_{X})}_{1 j}; {(G_{X})}_{12} = \sum_{j = 5}^{7} {(G_{X})}_{1 j}; {(G_{X})}_{13} = \sum_{j = 8}^{9} {(G_{X})}_{1 j}; \\ {(G_{X})}_{21} = \sum_{j = 1}^{4} {(G_{X})}_{5 j}; {(G_{X})}_{22} = \sum_{j = 5}^{7} {(G_{X})}_{5 j}; {(G_{X})}_{23} = \sum_{j = 8}^{9} {(G_{X})}_{5 j} . \end{array}

The autosomal matrices D_A, R_A, B_A, and G_A are all 10 × 10 matrices, with states as in Table 1. Using P_A given by matrix (1), from equation (3) D_A = lim_{N → ∞} P_A is

D_{A} = [\begin{matrix} 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 / 2 & 1 / 2 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 / 2 & 1 / 2 \end{matrix}] .

R_{A} = lim_{t \to \infty} {(D_{A})}^{t} = [\begin{matrix} 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 / 4 & 1 / 4 & 1 / 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 / 2 & 1 / 2 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 / 2 & 1 / 2 \end{matrix}] .
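For these particular matrices the limit defining R_A is reached after only two steps, because every row of D_A places all of its mass on a block of states whose rows are identical. A small check in exact arithmetic (a sketch; the variable names and the pure-Python `matmul` helper are illustrative, not part of the original analysis):

```python
from fractions import Fraction as F

q, h = F(1, 4), F(1, 2)
to_mf = [0, 0, 0, 0, 1, 0, 0, 0, 0, 0]   # rows 1-2 of D_A: move to state 5 with probability 1
same = [0, 0, q, q, h, 0, 0, 0, 0, 0]    # stationary weights within the same-deme block
diff = [0, 0, 0, 0, 0, q, q, h, 0, 0]    # stationary weights within the different-demes block
coal = [0] * 8 + [h, h]                  # stationary weights within the coalesced block

D_A = [to_mf, to_mf, same, same, same, diff, diff, diff, coal, coal]
R_A = [same] * 5 + [diff] * 3 + [coal] * 2


def matmul(A, B):
    """Plain matrix product for lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


if __name__ == "__main__":
    D2 = matmul(D_A, D_A)
    print(D2 == R_A, matmul(R_A, R_A) == R_A)
```

Both comparisons hold exactly, so (D_A)^t = R_A for all t ≥ 2 and R_A is a projection, as required of the fast-time-scale limit.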

\begin{array}{l} B_{A} = {lim}_{N \to \infty} N (P_{A} - D_{A}) = \\ [\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ \frac{1}{8 r} & \frac{1}{8 (1 - r)} & - \frac{1 + 2 M_{f}}{4 r} & - \frac{1}{2} (\frac{M_{f}}{r} + \frac{1}{2 (1 - r)}) & - \frac{M_{f}}{r} & \frac{M_{f}}{2 r} & \frac{M_{f}}{2 r} & \frac{M_{f}}{r} & \frac{1}{8 r} & \frac{1}{8 (1 - r)} \\ \frac{1}{8 r} & \frac{1}{8 (1 - r)} & - \frac{M_{m}}{2 (1 - r)} - \frac{1}{4 r} & - \frac{1 + 2 M_{m}}{4 (1 - r)} & - \frac{M_{m}}{1 - r} & \frac{M_{m}}{2 (1 - r)} & \frac{M_{m}}{2 (1 - r)} & \frac{M_{m}}{1 - r} & \frac{1}{8 r} & \frac{1}{8 (1 - r)} \\ \frac{1}{8 r} & \frac{1}{8 (1 - r)} & - \frac{1}{4} (\frac{M_{m}}{1 - r} + \frac{1 + M_{f}}{r}) & - \frac{1}{4} (\frac{1 + M_{m}}{1 - r} + \frac{M_{f}}{r}) & - \frac{1}{2} b_{A}^{*} & \frac{1}{4} b_{A}^{*} & \frac{1}{4} b_{A}^{*} & \frac{1}{2} b_{A}^{*} & \frac{1}{8 r} & \frac{1}{8 (1 - r)} \\ 0 & 0 & \frac{M_{f}}{2 (D - 1) r} & \frac{M_{f}}{2 (D - 1) r} & \frac{M_{f}}{(D - 1) r} & - \frac{M_{f}}{2 (D - 1) r} & - \frac{M_{f}}{2 (D - 1) r} & - \frac{M_{f}}{(D - 1) r} & 0 & 0 \\ 0 & 0 & \frac{M_{m}}{2 (D - 1) (1 - r)} & \frac{M_{m}}{2 (D - 1) (1 - r)} & \frac{M_{m}}{(D - 1) (1 - r)} & - \frac{M_{m}}{2 (D - 1) (1 - r)} & - \frac{M_{m}}{2 (D - 1) (1 - r)} & - \frac{M_{m}}{(D - 1) (1 - r)} & 0 & 0 \\ 0 & 0 & \frac{1}{4 (D - 1)} b_{A}^{*} & \frac{1}{4 (D - 1)} b_{A}^{*} & \frac{1}{2 (D - 1)} b_{A}^{*} & - \frac{1}{4 (D - 1)} b_{A}^{*} & - \frac{1}{4 (D - 1)} b_{A}^{*} & - \frac{1}{2 (D - 1)} b_{A}^{*} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}], \end{array}

where $b_{A}^{*} = M_{m} / (1 - r) + M_{f} / r$ .

The product matrix

\begin{array}{l} G_{A} = R_{A} B_{A} R_{A} = \\
[\begin{matrix}
0 & 0 & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{16 (1 - r) r} & \frac{1}{4} g_{A}^{*} & \frac{1}{4} g_{A}^{*} & \frac{1}{2} g_{A}^{*} & \frac{1}{16 (1 - r) r} & \frac{1}{16 (1 - r) r} \\
0 & 0 & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{16 (1 - r) r} & \frac{1}{4} g_{A}^{*} & \frac{1}{4} g_{A}^{*} & \frac{1}{2} g_{A}^{*} & \frac{1}{16 (1 - r) r} & \frac{1}{16 (1 - r) r} \\
0 & 0 & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{16 (1 - r) r} & \frac{1}{4} g_{A}^{*} & \frac{1}{4} g_{A}^{*} & \frac{1}{2} g_{A}^{*} & \frac{1}{16 (1 - r) r} & \frac{1}{16 (1 - r) r} \\
0 & 0 & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{16 (1 - r) r} & \frac{1}{4} g_{A}^{*} & \frac{1}{4} g_{A}^{*} & \frac{1}{2} g_{A}^{*} & \frac{1}{16 (1 - r) r} & \frac{1}{16 (1 - r) r} \\
0 & 0 & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{32 (1 - r) r} & - \frac{1 + 8 [M_{f} (1 - r) + M_{m} r]}{16 (1 - r) r} & \frac{1}{4} g_{A}^{*} & \frac{1}{4} g_{A}^{*} & \frac{1}{2} g_{A}^{*} & \frac{1}{16 (1 - r) r} & \frac{1}{16 (1 - r) r} \\
0 & 0 & \frac{1}{4 (D - 1)} g_{A}^{*} & \frac{1}{4 (D - 1)} g_{A}^{*} & \frac{1}{2 (D - 1)} g_{A}^{*} & - \frac{1}{4 (D - 1)} g_{A}^{*} & - \frac{1}{4 (D - 1)} g_{A}^{*} & - \frac{1}{2 (D - 1)} g_{A}^{*} & 0 & 0 \\
0 & 0 & \frac{1}{4 (D - 1)} g_{A}^{*} & \frac{1}{4 (D - 1)} g_{A}^{*} & \frac{1}{2 (D - 1)} g_{A}^{*} & - \frac{1}{4 (D - 1)} g_{A}^{*} & - \frac{1}{4 (D - 1)} g_{A}^{*} & - \frac{1}{2 (D - 1)} g_{A}^{*} & 0 & 0 \\
0 & 0 & \frac{1}{4 (D - 1)} g_{A}^{*} & \frac{1}{4 (D - 1)} g_{A}^{*} & \frac{1}{2 (D - 1)} g_{A}^{*} & - \frac{1}{4 (D - 1)} g_{A}^{*} & - \frac{1}{4 (D - 1)} g_{A}^{*} & - \frac{1}{2 (D - 1)} g_{A}^{*} & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0
\end{matrix}], \end{array}

where $g_{A}^{*} = b_{A}^{*} = M_{m} / (1 - r) + M_{f} / r$ (see matrix (17)).

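To obtain the terms given in matrix (7) in the main text (left-hand sides), we sum the entries of the 10 × 10 product matrix G_A (right-hand sides) over the columns belonging to each aggregated state: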

\begin{array}{l} {(G_{A})}_{11} = \sum_{j = 1}^{5} {(G_{A})}_{1 j}; {(G_{A})}_{12} = \sum_{j = 6}^{8} {(G_{A})}_{1 j}; {(G_{A})}_{13} = \sum_{j = 9}^{10} {(G_{A})}_{1 j} \\ {(G_{A})}_{21} = \sum_{j = 1}^{5} {(G_{A})}_{6 j}; {(G_{A})}_{22} = \sum_{j = 6}^{8} {(G_{A})}_{6 j}; {(G_{A})}_{23} = \sum_{j = 9}^{10} {(G_{A})}_{6 j} . \end{array}
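The whole autosomal calculation (form R_A B_A R_A, then sum over column blocks) can be reproduced numerically. The sketch below transcribes B_A using the shorthand x = M_f/r, y = M_m/(1 − r), b = x + y, c_f = 1/(8r) and c_m = 1/(8(1 − r)); these abbreviations, the function names and the test values are illustrative assumptions, not notation from the paper.

```python
from fractions import Fraction as F


def matmul(A, B):
    """Plain matrix product for lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def collapse_autosomal(r, M_f, M_m, D):
    """Rows 1 and 2 of the collapsed 3-state rate matrix obtained from G_A = R_A B_A R_A."""
    q, h = F(1, 4), F(1, 2)
    same = [0, 0, q, q, h, 0, 0, 0, 0, 0]
    diff = [0, 0, 0, 0, 0, q, q, h, 0, 0]
    coal = [0] * 8 + [h, h]
    R = [same] * 5 + [diff] * 3 + [coal] * 2

    x, y = M_f / r, M_m / (1 - r)
    b = x + y
    cf, cm = 1 / (8 * r), 1 / (8 * (1 - r))
    zero = [0] * 10

    def leave(v):
        """Row of B_A for lineages currently in different demes, total rate v / (D - 1)."""
        s = v / (D - 1)
        return [0, 0, s / 2, s / 2, s, -s / 2, -s / 2, -s, 0, 0]

    B = [
        zero, zero,
        [cf, cm, -2 * cf - x / 2, -x / 2 - 2 * cm, -x, x / 2, x / 2, x, cf, cm],
        [cf, cm, -y / 2 - 2 * cf, -2 * cm - y / 2, -y, y / 2, y / 2, y, cf, cm],
        [cf, cm, -b / 4 - 2 * cf, -b / 4 - 2 * cm, -b / 2, b / 4, b / 4, b / 2, cf, cm],
        leave(x), leave(y), leave(b / 2),
        zero, zero,
    ]
    G = matmul(matmul(R, B), R)
    blocks = [(0, 5), (5, 8), (8, 10)]   # same deme, different demes, coalesced
    return [[sum(G[row][lo:hi]) for lo, hi in blocks] for row in (0, 5)]


if __name__ == "__main__":
    for row in collapse_autosomal(F(2, 5), F(3), F(1, 2), 4):
        print([str(v) for v in row])
```

For any admissible parameter values, the first row returns the migration rate out of the same deme, M_m/(1 − r) + M_f/r, and the coalescence rate 1/[8r(1 − r)], with a diagonal entry equal to minus their sum; the second row returns the rate of entering the same deme, [M_m/(1 − r) + M_f/r]/(D − 1), and a zero coalescence rate.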

References

Cann HM, de Toma C, Cazes L, Legrand MF, Morel V, Piouffre L, Bodmer J, Bodmer WF, Bonne-Tamir B, Cambon-Thomsen A, et al. A human genome diversity cell line panel. Science. 2002;296:261–262. doi: 10.1126/science.296.5566.261b.
Garrigan D, Kingan SB, Pilkington MM, Wilder JA, Cox MP, Soodyall H, Strassmann B, Destro-Bisol G, de Knijff P, Novelletto A, Friedlaender J, Hammer MF. Inferring human population sizes, divergence times and rates of gene flow from mitochondrial, X and Y chromosome resequencing data. Genetics. 2007;177:2195–2207. doi: 10.1534/genetics.107.077495.
Hartl DL, Clark AG. Principles of Population Genetics. Sinauer; Sunderland, MA: 2007. pp. 124–125.
Hedrick PW. Sex: differences in mutation, recombination, selection, gene flow, and genetic drift. Evolution. 2007;61:2750–2771. doi: 10.1111/j.1558-5646.2007.00250.x.
Helgason A, Hrafnkelsson B, Gulcher JR, Ward R, Stefánsson K. A populationwide coalescent analysis of Icelandic matrilineal and patrilineal genealogies: evidence for a faster evolutionary rate of mtDNA lineages than Y chromosomes. Am J Hum Genet. 2003;72:1370–1388. doi: 10.1086/375453.
Kayser M, Choi Y, van Oven M, Mona S, Brauer S, Trent RJ, Suarkia D, Schiefenhövel W, Stoneking M. The impact of the Austronesian expansion: evidence from mtDNA and Y-chromosome diversity in the Admiralty Islands of Melanesia. Mol Biol Evol. 2008;25:1362–1374. doi: 10.1093/molbev/msn078.
Laporte V, Charlesworth B. Effective population size and population subdivision in demographically structured populations. Genetics. 2002;162:501–519. doi: 10.1093/genetics/162.1.501.
Lawson Handley LJ, Perrin N. Advances in our understanding of mammalian sex-biased dispersal. Mol Ecol. 2007;16:1559–1578. doi: 10.1111/j.1365-294X.2006.03152.x.
Möhle M. A convergence theorem for Markov chains arising in population genetics and the coalescent with partial selfing. Adv Appl Prob. 1998;30:493–512.
Mountain JL, Ramakrishnan U. Impact of human population history on distributions of individual-level genetic distance. Hum Genomics. 2005;2:4–19. doi: 10.1186/1479-7364-2-1-4.
Nordborg M, Krone SM. Separation of time scales and convergence to the coalescent in structured populations. In: Slatkin M, Veuille M, editors. Modern Developments in Theoretical Population Genetics. Oxford University Press; Oxford: 2002. pp. 194–232.
Oota H, Settheetham-Ishida W, Tiwawech D, Ishida T, Stoneking M. Human mtDNA and Y-chromosome variation is correlated with matrilocal versus patrilocal resident. Nat Genet. 2001;29:20–21. doi: 10.1038/ng711.
Ramachandran S, Rosenberg NA, Zhivotovsky LA, Feldman MW. Robustness of the inference of human population structure: a comparison of X-chromosomal and autosomal microsatellites. Hum Genomics. 2004;1:87–97. doi: 10.1186/1479-7364-1-2-87.
Ramachandran S, Deshpande O, Roseman CC, Rosenberg NA, Feldman MW, Cavalli-Sforza LL. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc Natl Acad Sci USA. 2005;102:15942–15947. doi: 10.1073/pnas.0507611102.
Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, Feldman MW. Genetic structure of human populations. Science. 2002;298:2381–2385. doi: 10.1126/science.1078311.
Rosenberg NA, Mahajan S, Ramachandran S, Zhao C, Pritchard JK, Feldman MW. Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet. 2005;1:e70. doi: 10.1371/journal.pgen.0010070.
Rosenberg NA. Standardized subsets of the HGDP-CEPH Human Genome Diversity Cell Line Panel, accounting for atypical and duplicated samples and pairs of close relatives. Ann Hum Genet. 2006;70:841–847. doi: 10.1111/j.1469-1809.2006.00285.x.
Rousset F. Genetic differentiation in populations with different classes of individuals. Theor Popul Biol. 1999;55:297–308. doi: 10.1006/tpbi.1998.1406.
Schaffner SF. The X chromosome in population genetics. Nat Rev Genet. 2004;5:43–51. doi: 10.1038/nrg1247.
Seielstad MT, Minch E, Cavalli-Sforza LL. Genetic evidence for a higher female migration rate in humans. Nat Genet. 1998;20:278–280. doi: 10.1038/3088.
Slatkin M. Inbreeding coefficients and coalescence times. Genet Res. 1991;58:167–175. doi: 10.1017/s0016672300029827.
Vitalis R. Sex-specific genetic differentiation and coalescence times: estimating sex-biased dispersal rates. Mol Ecol. 2002;11:125–138. doi: 10.1046/j.0962-1083.2001.01414.x.
Wang J. Effective size and F-statistics of subdivided populations. II. dioecious species. Genetics. 1997;146:1465–1474. doi: 10.1093/genetics/146.4.1465.
Wang J. Effective size and F-statistics of subdivided populations for sex-linked loci. Theor Popul Biol. 1999;55:176–188. doi: 10.1006/tpbi.1998.1398.
Conrad DF, Jakobsson M, Coop G, Wen X, Wall JD, Rosenberg NA, Pritchard JK. A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nature Genet. 2006;38:1251–1260. doi: 10.1038/ng1911.
Weir BS. Genetic Data Analysis II. Sinauer; Sunderland, MA: 1996.
Wilder JA, Kingan SB, Mobasher Z, Pilkington MM, Hammer MF. Global patterns of human mitochondrial DNA and Y-chromosome structure are not influenced by higher migration rates of females versus males. Nat Genet. 2004a;36:1122–1125. doi: 10.1038/ng1428.
Wilder JA, Mobasher Z, Hammer MF. Genetic evidence for unequal effective population sizes of human females and males. Mol Biol Evol. 2004b;21:2047–2057. doi: 10.1093/molbev/msh214.
Wilkins JF. Unraveling male and female histories from human genetic data. Curr Opin Genet Dev. 2006;16:611–617. doi: 10.1016/j.gde.2006.10.004.
Wilkins JF, Marlowe FW. Sex-biased migration in humans: what should we expect from genetic data? BioEssays. 2006;28:290–300. doi: 10.1002/bies.20378.

[R1] Cann HM, de Toma C, Cazes L, Legrand MF, Morel V, Piouffre L, Bodmer J, Bodmer WF, Bonne-Tamir B, Cambon-Thomsen A, et al. A human genome diversity cell line panel. Science. 2002;296:261–262. doi: 10.1126/science.296.5566.261b. [DOI] [PubMed] [Google Scholar]

[R2] Garrigan D, Kingan SB, Pilkington MM, Wilder JA, Cox MP, Soodyall H, Strassmann B, Destro-Bisol G, de Knijff P, Novelletto A, Friedlaender J, Hammer MF. Inferring human population sizes, divergence times and rates of gene flow from mitochondrial, X and Y chromosome resequencing data. Genetics. 2007;177:2195–2207. doi: 10.1534/genetics.107.077495. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] Hartl DL, Clark AG. Principles of Population Genetics. Sinauer; Sunderland, MA: 2007. pp. 124–125. [Google Scholar]

[R4] Hedrick PW. Sex: differences in mutation, recombination, selection, gene flow, and genetic drift. Evolution. 2007;61:2750–2771. doi: 10.1111/j.1558-5646.2007.00250.x. [DOI] [PubMed] [Google Scholar]

[R5] Helgason A, Hrafnkelsson B, Gulcher JR, Ward R, Stefánsson K. A populationwide coalescent analysis of Icelandic matrilineal and patrilineal genealogies: evidence for a faster evolutionary rate of mtDNA lineages than Y chromosomes. Am J Hum Genet. 2003;72:1370–1388. doi: 10.1086/375453. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Kayser M, Choi Y, van Oven M, Mona S, Brauer S, Trent RJ, Suarkia D, Schiefenhövel W, Stoneking M. The impact of the Austronesian expansion: evidence from mtDNA and Y-chromosome diversity in the Admiralty Islands of Melanesia. Mol Biol Evol. 2008;25:1362–1374. doi: 10.1093/molbev/msn078. [DOI] [PubMed] [Google Scholar]

[R7] Laporte V, Charlesworth B. Effective population size and population subdivision in demographically structured populations. Genetics. 2002;162:501–519. doi: 10.1093/genetics/162.1.501. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] Lawson Handley LJ, Perrin N. Advances in our understanding of mammalian sex-biased dispersal. Mol Ecol. 2007;16:1559–1578. doi: 10.1111/j.1365-294X.2006.03152.x. [DOI] [PubMed] [Google Scholar]

[R9] Möhle M. A convergence theorem for Markov chains arising in population genetics and the coalescent with partial selfing. Adv Appl Prob. 1998;30:493–512. [Google Scholar]

[R10] Mountain JL, Ramakrishnan U. Impact of human population history on distributions of individual-level genetic distance. Hum Genomics. 2005;2:4–19. doi: 10.1186/1479-7364-2-1-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Nordborg M, Krone SM. Separation of time scales and convergence to the coalescent in structured populations. In: Slatkin M, Veuille M, editors. Modern Developments in Theoretical Population Genetics. Oxford University Press; Oxford: 2002. pp. 194–232. [Google Scholar]

[R12] Oota H, Settheetham-Ishida W, Tiwawech D, Ishida T, Stoneking M. Human mtDNA and Y-chromosome variation is correlated with matrilocal versus patrilocal resident. Nat Genet. 2001;29:20–21. doi: 10.1038/ng711. [DOI] [PubMed] [Google Scholar]

[R13] Ramachandran S, Rosenberg NA, Zhivotovsky LA, Feldman MW. Robustness of the inference of human population structure: a comparison of X-chromosomal and autosomal microsatellites. Hum Genomics. 2004;1:87–97. doi: 10.1186/1479-7364-1-2-87.

[R14] Ramachandran S, Deshpande O, Roseman CC, Rosenberg NA, Feldman MW, Cavalli-Sforza LL. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc Natl Acad Sci USA. 2005;102:15942–15947. doi: 10.1073/pnas.0507611102.

[R15] Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, Feldman MW. Genetic structure of human populations. Science. 2002;298:2381–2385. doi: 10.1126/science.1078311.

[R16] Rosenberg NA, Mahajan S, Ramachandran S, Zhao C, Pritchard JK, Feldman MW. Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet. 2005;1:e70. doi: 10.1371/journal.pgen.0010070.

[R17] Rosenberg NA. Standardized subsets of the HGDP-CEPH Human Genome Diversity Cell Line Panel, accounting for atypical and duplicated samples and pairs of close relatives. Ann Hum Genet. 2006;70:841–847. doi: 10.1111/j.1469-1809.2006.00285.x.

[R18] Rousset F. Genetic differentiation in populations with different classes of individuals. Theor Popul Biol. 1999;55:297–308. doi: 10.1006/tpbi.1998.1406.

[R19] Schaffner SF. The X chromosome in population genetics. Nat Rev Genet. 2004;5:43–51. doi: 10.1038/nrg1247.

[R20] Seielstad MT, Minch E, Cavalli-Sforza LL. Genetic evidence for a higher female migration rate in humans. Nat Genet. 1998;20:278–280. doi: 10.1038/3088.

[R21] Slatkin M. Inbreeding coefficients and coalescence times. Genet Res. 1991;58:167–175. doi: 10.1017/s0016672300029827.

[R22] Vitalis R. Sex-specific genetic differentiation and coalescence times: estimating sex-biased dispersal rates. Mol Ecol. 2002;11:125–138. doi: 10.1046/j.0962-1083.2001.01414.x.

[R23] Wang J. Effective size and F-statistics of subdivided populations. II. Dioecious species. Genetics. 1997;146:1465–1474. doi: 10.1093/genetics/146.4.1465.

[R24] Wang J. Effective size and F-statistics of subdivided populations for sex-linked loci. Theor Popul Biol. 1999;55:176–188. doi: 10.1006/tpbi.1998.1398.

[R25] Conrad DF, Jakobsson M, Coop G, Wen X, Wall JD, Rosenberg NA, Pritchard JK. A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat Genet. 2006;38:1251–1260. doi: 10.1038/ng1911.

[R26] Weir BS. Genetic Data Analysis II. Sinauer; Sunderland, MA: 1996.

[R27] Wilder JA, Kingan SB, Mobasher Z, Pilkington MM, Hammer MF. Global patterns of human mitochondrial DNA and Y-chromosome structure are not influenced by higher migration rates of females versus males. Nat Genet. 2004a;36:1122–1125. doi: 10.1038/ng1428.

[R28] Wilder JA, Mobasher Z, Hammer MF. Genetic evidence for unequal effective population sizes of human females and males. Mol Biol Evol. 2004b;21:2047–2057. doi: 10.1093/molbev/msh214.

[R29] Wilkins JF. Unraveling male and female histories from human genetic data. Curr Opin Genet Dev. 2006;16:611–617. doi: 10.1016/j.gde.2006.10.004.

[R30] Wilkins JF, Marlowe FW. Sex-biased migration in humans: what should we expect from genetic data? BioEssays. 2006;28:290–300. doi: 10.1002/bies.20378.
