Modeling X-Linked Ancestral Origins in Multiparental Populations

Chaozhi Zheng

doi:10.1534/g3.114.016154

. 2015 Mar 4;5(5):777–801. doi: 10.1534/g3.114.016154

Modeling X-Linked Ancestral Origins in Multiparental Populations

Chaozhi Zheng ^1,¹

PMCID: PMC4426366 PMID: 25740936

Abstract

The models for the mosaic structure of an individual’s genome from multiparental populations have been developed primarily for autosomes, whereas X chromosomes receive very little attention. In this paper, we extend our previous approach to model ancestral origin processes along two X chromosomes in a mapping population, which is necessary for developing hidden Markov models in the reconstruction of ancestry blocks for X-linked quantitative trait locus mapping. The model accounts for the joint recombination pattern, the asymmetry between maternally and paternally derived X chromosomes, and the finiteness of population size. The model can be applied to various mapping populations such as the advanced intercross lines (AIL), the Collaborative Cross (CC), the heterogeneous stock (HS), the Diversity Outcross (DO), and the Drosophila synthetic population resource (DSPR). We further derive the map expansion, density (per Morgan) of recombination breakpoints, in advanced intercross populations with L inbred founders under the limit of an infinitely large population size. The analytic results show that for X chromosomes the genetic map expands linearly at a rate (per generation) of two-thirds times 1 – 10/(9L) for the AIL, and at a rate of two-thirds times 1 – 1/L for the DO and the HS, whereas for autosomes the map expands at a rate of 1 – 1/L for the AIL, the DO, and the HS.

Keywords: advanced intercross lines (AIL), Collaborative Cross (CC), Diversity Outcross (DO), Drosophila synthetic population resource (DSPR), map expansion, MPP, multiparental populations, Multiparent Advanced Generation Inter-Cross (MAGIC)

There have been recently designed quantitative trait locus (QTL) mapping populations with either multiple parents to increase the genetic diversity of the founder population, or many intercross generations to improve the mapping resolution by accumulating historical recombination events. Some examples include the Collaborative Cross (CC) (Churchill et al. 2004), the advanced intercross lines (AIL) (Darvasi and Soller 1995), the heterogeneous stock (HS) (Mott et al. 2000), the diversity outcross (DO) (Svenson et al. 2012), and the Drosophila synthetic population resource (DSPR) (King et al. 2012). The CC can be regarded as a set of eight-way recombinant inbred lines (RIL) by sibling mating, where eight founders of each line are permuted.

The genomes of individuals in QTL mapping populations are random mosaics of the founders’ genomes. The QTL mapping generally necessitates the reconstruction of these genome blocks along two homologous chromosomes of a sampled individual from available genotype data. Such reconstruction is often performed under a hidden Markov model (HMM) with the latent state being the pair of ancestral origins at a locus, where the transition probability of ancestral origins between two loci, or the two-locus diplotype (two-haplotype) probabilities are required.

Modeling ancestral origins along a pair of autosomal chromosomes has been well developed recently. Broman (2012a) extended the approach of Haldane and Waddington (1931) from the two-way to four- and eight-way RIL by sibling mating and provided recipes for calculating autosomal two-locus diplotype probabilities numerically. Johannes and Colome-Tatche (2011) derived autosomal two-locus diplotype probabilities for the two-way RIL by selfing. Zheng et al. (2014) described a general modeling framework for ancestral origins that can be applied to autosomes in various mapping populations such as the RIL by selfing or sibling mating and the AIL.

A special treatment is required for modeling ancestral origins along a pair of X chromosomes. Haldane and Waddington (1931) derived the recurrence relations of the X-linked two-locus diplotype probabilities for the two-way RIL by sibling mating and the bi-parental repeated parent-offspring mating, and their closed form solutions for the final homozygous lines. Broman (2005) extended the solutions to the two- and three-locus haplotype probabilities for the two, four, or eight-way RIL by sibling mating. Broman (2012b) derived the X-linked two-locus haplotype probabilities in advanced intercross populations including the AIL, the HS, and the DO, assuming an infinitely large population size.

In this paper, we extend our previous work (Zheng et al. 2014) to model the ancestral origins along a pair of X chromosomes in a finite mapping population. This extension also builds on the theory of junctions in inbreeding (Fisher 1949, 1954). A junction is defined as a boundary point of genome blocks on chromosomes where two distinct ancestral origins meet, and the boundary points that occur at the same location along multiple chromosomes are counted as a single junction. The map expansion is the expected junction density (per Morgan) on a maternally or paternally derived X chromosome, denoted by $R^{m}$ or $R^{p}$ , respectively. We denote by $ρ^{m p}$ the overall junction density along the XX chromosomes of a female, and it can be used as a measure of X-linked QTL mapping resolution (Darvasi and Soller 1995; Weller and Soller 2004).

The key feature of this extension is to account for the asymmetry between maternally and paternally derived X chromosomes because the latter did not experience any crossover events with Y chromosomes. We first present a model framework for X-linked ancestral origins, where the recurrent relations are derived for various junction densities including the map expansions $R^{m}$ and $R^{p}$ . Then, we derive the closed form solutions for these expected densities in mapping populations including the RIL by sibling mating, the AIL, the HS, the DO, and the DSPR; they are evaluated by forward simulation studies. Lastly, we discuss the model assumptions and the implications of the analytic results on haplotype reconstructions and breeding designs.

A model for X-linked ancestral origins

Assumptions and notation

Consider a dioecious population with two separate sexes: homogametic females with sex chromosomes XX and heterogametic males with sex chromosomes XY. There are no recombination events between X and Y, and thus we ignore the pseudoautosomal regions on the XY chromosomes. As in most mammals and some insects (Drosophila), some flowering species, such as white campion (Silene latifolia), papaya (Carica papaya), and asparagus (Asparagus officianalis), have the XY sex determination system (Ming and Moore 2007). The dioecious population was founded in generation 0, and it has nonoverlapping generations. There are no natural or artificial selections since the founder population. The mating schemes of producing the next generation are random, and they may vary from one generation to the next. The assignments of offspring genders are assumed to be independent of mating schemes.

The ancestral origins along two homologous autosomes have been modeled as a continuous time Markov chain (CTMC) (Zheng et al. 2014). We extend the approach to account for the asymmetry of XX chromosomes, using superscript m (p) for maternally (paternally) derived genes or chromosomes. See Supporting Information, Table S1 for a list of symbols used in this paper. Let $O (x) = (O^{m} (x), O^{p} (x))$ be the ordered pair of the ancestral origins at location x along the two X chromosomes of a randomly sampled female. The ancestral origin process $O (x)$ is assumed to follow a CTMC, where x is the time parameter in unit of Morgan. We assign a unique ancestral origin to the X chromosomes of each inbred founder, or to each X chromosome of each outbred founder. Multiple genes, within or between loci, are identical by descent (IBD) if they have the same ancestral origins. Let L be the number of possible ancestral origins that $O^{m} (x)$ or $O^{p} (x)$ may take. L may be less than the number of inbred founders if some male founders did not produce daughters to pass down their X chromosomes. For example, $L = 3$ for the four-way RIL by sibling mating since one of the founder mating pairs produces only one son (Figure 1A).

The continuous time Markov chain (CTMC) of X-linked ancestry blocks in the four-way recombinant inbred lines (RILs) by sibling mating. (A) One realization of ancestry blocks in the four-way RIL with generation up to F₃. The sex chromosomes of the four inbred founders are represented by different colors and labeled as A, B, C, and D. The short bars denote Y chromosomes. The ancestral origin D is impossible in the X chromosomes of generation t ≥ 1. (B) Evaluation of the exchangeability assumption by one-locus genotype probabilities. The gray dashed line refers to the average genotype probability for one particular non-IBD genotype AB, AC, or BC; the black dashed line is for one particular IBD genotype AA, BB, or CC. Note that the ancestral origins A and B are exchangeable, but the ancestral origin C is not exchangeable with either A or B. (C) Schematics of the seven junction types along the maternally (left) and paternally (right) derived X chromosomes. (D) The rate matrix of the CTMC for the four RILs in (A). The diagonal elements are given so that row sums are zero. The rate matrix is determined by the seven basic rates, each corresponding to one of the seven junction types. The subscripts of the basic rates denote the IBD (1) or non-IBD (0) states on the left- and right-hand sides of the junctions, and the rates with superscript * refer to the transitions on the paternally derived chromosome. (E) The general relationships between the basic rates and the expected densities for the seven types of junctions, with $L = 3$ for the four-way RIL in (A).

The L possible ancestral origins are assumed to be exchangeable, so that we focus on the changes of ancestral origins. See Figure 1B and the relevant part of Discussion on the exchangeability assumption. The initial distribution of $O (0)$ at the leftmost locus $x = 0$ is specified by $α^{m p} (11)$ , a probability that the two ancestral origins are the same (IBD) at a locus. Let $α^{m p} (12) = 1 - α^{m p} (11)$ be the non-IBD probability. Given either IBD or non-IBD at the locus, the ancestral origin pair $O (0)$ takes each of the possible combinations with equal probability.

The transition rate matrix of the CTMC can be constructed from the expected densities $J^{m p} (a b c d)$ of all the junction types $(a b c d)$ along the two X chromosomes of a female. The junction type $(a b c d)$ denotes the four-gene IBD configuration $(a b c d)$ on both sides of a junction, where $a b$ ( $c d$ ) is on the left-hand (right-hand) side, haplotype $a c$ ( $b d$ ) is on the first (second) chromosome, and the same integers denote IBD. Figure 1C illustrates the seven types of junctions: $(1112)$ , $(1121)$ , $(1122)$ , $(1211)$ , $(1213)$ , $(1222)$ , and $(1232)$ for $L \geq 3$ , where the two types $(1213)$ and $(1232)$ do not exist for $L = 2$ . We do not define junction types for the eight two-locus configurations $(1111)$ , $(1123)$ , $(1212)$ , $(1221)$ , $(1223)$ , $(1231)$ , $(1233)$ , and $(1234)$ , because there are either zero or no less than two junctions between the two loci. Figure 1D shows the transition rate matrix of the CTMC in the four-way RIL by sibling mating. Figure 1E shows the relationships between the expected densities $J^{m p} (a b c d)$ and the transition rates, and they are derived based on the interpretation that $J^{m p} (a b c d) Δ d$ is the two-locus diplotype probability, in the limit that the genetic distance $Δ d$ (in Morgan) between two loci goes to zero.

The map expansions $R^{m}$ and $R^{p}$ and the overall expected junction density $ρ^{m p}$ are given by

R^{m} = J^{m p} (1121) + J^{m p} (1122) + J^{m p} (1222) + J^{m p} (1232),

(1)

R^{p} = J^{m p} (1112) + J^{m p} (1122) + J^{m p} (1211) + J^{m p} (1213),

(2)

ρ^{m p} = R^{m} + R^{p} - J^{m p} (1122),

(3)

similar to those for autosomes (Zheng et al. 2014) except that $R^{m} \neq R^{p}$ for X chromosomes. We have $J^{m p} (1112) = J^{m p} (1211)$ and $J^{m p} (1121) = J^{m p} (1222)$ , since the junction densities do not depend on the direction of chromosomes. In contrast to the single-locus two-gene non-IBD probability $α^{m p} (12)$ , the ordering of the superscripts in $J^{m p} (a b c d)$ generally does matter, that is, $J^{m p} (a b c d) \neq J^{p m} (a b c d)$ except for the junction type $(1122)$ . In addition, we have $J^{m p} (1213) = J^{p m} (1232)$ (see Figure 1C). Thus, the CTMC of X-linked ancestral origins can be described by one non-IBD probability $α^{m p} (12)$ and the five expected junction densities $R^{m}$ , $R^{p}$ , $J^{m p} (1122)$ , $J^{m p} (1232)$ , and $J^{p m} (1232)$ , under the exchangeability assumption of the L possible ancestral origins.

Single-locus non-IBD probabilities

The calculation of the expected junction densities necessitates the introduction of the probabilities for the two- and three-gene IBD configurations at a single locus. All the following derivations of the recurrence relations for these probabilities are based on the Mendelian inheritance of X-linked genes: a paternally derived gene must be a copy of the maternally derived gene in a male of the previous generation, and a maternally derived gene has equal probability of being a copy of either the maternally derived gene or the paternally derived gene in a female of the previous generation.

In a dioecious mapping population, the single-locus two-gene probabilities of IBD configuration $(a b)$ depend on whether or not the two homologous genes are in a single individual. Thus, we denote by $β^{m m} (a b)$ , $β^{m p} (a b)$ , and $β^{p p} (a b)$ the two-gene probability of IBD configuration $(a b)$ , given that the two homologous genes are in two distinct individuals in generation t and have parental origins $m m$ , $m p$ , and $p p$ , respectively (Figure 2A); it holds that $β^{p m} (a b) = β^{m p} (a b)$ .

Schematics of (A) the probabilities of the two-gene IBD configurations, (B) the probabilities of the three-gene IBD configurations, and (C) the expected junction densities. Circles denote females, and dashed rectangles for males or females. Black vertical lines denote the maternally derived X chromosomes, and gray vertical lines for the paternally derived. Dots denote genes on chromosomes.

The recurrence relations of the two-gene non-IBD probabilities are derived by tracing the parental origins of two homologous genes from generation $t \geq 1$ into the previous generation, and they are given by

β_{t}^{m m} (12) = s_{t}^{m} \frac{1}{2} α_{t - 1}^{m p} (12) + (1 - s_{t}^{m}) [\frac{1}{4} β_{t - 1}^{m m} (12) + \frac{1}{2} β_{t - 1}^{m p} + \frac{1}{4} β_{t - 1}^{p p}]

(4a)

β_{t}^{m p} (12) = \frac{1}{2} β_{t - 1}^{m m} (12) + \frac{1}{2} β_{t - 1}^{m p} (12)

(4b)

β_{t}^{p p} (12) = (1 - s_{t}^{p}) β_{t - 1}^{m m} (12)

(4c)

α_{t}^{m p} (12) = β_{t}^{m p} (12)

(4d)

where equation (4d) holds immediately after one generation of random mating, although it may not hold in the founder population at $t = 0$ . In equation (4a), the first term on the right-hand side refers to the scenario that the two genes with parent origins $m m$ in generation t come from a single female of the previous generation with the probability $s_{t}^{m}$ , and with probability $1 / 2$ that they come from different genes of the female. In equation (4b), the two genes with parental origins $m p$ cannot merge because they must come from one male and one female of the previous generation. In equation (4c), the two genes with parental origins $p p$ in generation t come from a single male of the previous generation with the probability $s_{t}^{p}$ ; if so, they must merge because there is only one X chromosome in a male.

We introduce the single-locus three-gene probabilities of IBD configuration $(a b c)$ . Let $β^{m m m} (a b c)$ , $β^{m m p} (a b c)$ , $β^{m p p} (a b c)$ , and $β^{p p p} (a b c)$ be the probabilities of IBD configuration $(a b c)$ , given that the three homologous genes are in three distinct individuals in generation t and have parental origins $m m m$ , $m m p$ , $m p p$ , and $p p p$ , respectively (Figure 2B). Similarly, we define $α^{m m p} (a b c)$ and $α^{m p p} (a b c)$ for three homologous genes in two distinct individuals. The ordering of the superscripts does not matter for these three-gene probabilities, for example, $β^{m m p} (a b c) = β^{m p m} (a b c) = β^{p m m} (a b c)$ .

The recurrence relations of the three-gene non-IBD probabilities are derived by tracing the parental origins of three homologous genes from generation t ≥ 1 into the previous generation, and they are given by

\begin{matrix} β_{t}^{m m m} (123) = 3 q_{t}^{m} \frac{1}{2} [\frac{1}{2} α_{t - 1}^{m m p} (123) + \frac{1}{2} α_{t - 1}^{m p p} (123)] + (1 - s_{t}^{m} - 2 q_{t}^{m}) \\ \times [\frac{1}{8} β_{t - 1}^{m m m} (123) + \frac{3}{8} β_{t - 1}^{m m p} (123) + \frac{3}{8} β_{t - 1}^{m p p} (123) + \frac{1}{8} β_{t - 1}^{p p p} (123)] \end{matrix}

(5a)

\begin{matrix} β_{t}^{m m p} (123) = s_{t}^{m} \frac{1}{2} α_{t - 1}^{m m p} (123) + (1 - s_{t}^{m}) \\ \times [\frac{1}{4} β_{t - 1}^{m m m} (123) + \frac{1}{2} β_{t - 1}^{m m p} (123) + \frac{1}{4} β_{t - 1}^{m p p} (123)] \end{matrix}

(5b)

β_{t}^{m p p} (123) = (1 - s_{t}^{p}) [\frac{1}{2} β_{t - 1}^{m m m} (123) + \frac{1}{2} β_{t - 1}^{m m p} (123)]

(5c)

β_{t}^{p p p} (123) = (1 - s_{t}^{p} - 2 q_{t}^{p}) β_{t - 1}^{m m m} (123)

(5d)

α_{t}^{m m p} (123) = β_{t}^{m m p} (123)

(5e)

α_{t}^{m p p} (123) = β_{t}^{m p p} (123)

(5f)

where $q_{t}^{m}$ is the coalescence probability of three maternally derived genes in generation t that a particular pair of genes come from a single female of the previous generation and the third comes from another female of the previous generation, and similarly $q_{t}^{p}$ for three paternally derived genes. The equations (5e, 5f) hold immediately after one generation of random mating, although they may not hold in the founder population at $t = 0$ .

The derivations of the recurrence equations (5a–5d) for the three-gene non-IBD probabilities are similar to equations (4a–4c) for the two-gene non-IBD probabilities. In equation (5a), the pre-factor 3 denotes that each of the three possible pairs of genes may come from a single female of the previous generation; the term $(1 - s_{t}^{m} - 2 q_{t}^{m})$ is the probability that the three maternally derived genes in generation t come from three distinct females of the previous generation, and it is obtained by the probability $1 - s_{t}^{m}$ that one pair of genes come from two distinct females minus the probability $2 q_{t}^{m}$ that the third gene and either gene of the pair come from a single female of the previous generation. Similarly, the term $(1 - s_{t}^{p} - 2 q_{t}^{p})$ in equation (5d) is the probability that the three paternally derived genes in generation t come from three distinct males of the previous generation.

Expected junction densities

We derive the recurrence relations for $R^{m}$ , $R^{p}$ , $J^{m p} (1122)$ , $J^{m p} (1232)$ , and $J^{p m} (1232)$ . The recurrence relation for $R^{m}$ follows from the theory of junctions (Fisher 1954): a new junction is formed whenever a recombination event occurs between two X chromosomes that are non-IBD at the location of a crossover. The recurrence relations for the map expansions $R^{m}$ and $R^{p}$ are given by

R_{t}^{m} = \frac{1}{2} R_{t - 1}^{m} + \frac{1}{2} R_{t - 1}^{p} + α_{t - 1}^{m p} (12),

(6a)

R_{t}^{p} = R_{t - 1}^{m},

(6b)

where equation (6b) follows directly from no recombination events occurring between the XY chromosomes in a male of the previous generation.

To measure differential map expansions between maternally and paternally derived chromosomes, we define $R_{t}^{X} = (2 R_{t}^{m} + R_{t}^{p}) / 3$ and $R_{t}^{-} = (R_{t}^{m} - R_{t}^{p}) / 2$ , and their recurrence relations are given by

R_{t}^{X} = R_{t - 1}^{X} + \frac{2}{3} α_{t - 1}^{m p} (12),

(7a)

R_{t}^{-} = - \frac{1}{2} R_{t - 1}^{-} + \frac{1}{2} α_{t - 1}^{m p} (12),

(7b)

according to the recurrence equations (6a, 6b). If there are equal numbers of males and females in the population, a randomly chosen X chromosome is maternally derived with probability $2 / 3$ , and it is paternally derived with probability $1 / 3$ . Thus $R_{t}^{X}$ can be interpreted as the map expansion on a randomly chosen X chromosome.

For comparisons, we denote by $R_{t}^{A}$ the map expansion on a random chosen autosome, and and its recurrence relation is given by (MacLeod et al. 2005; Zheng et al. 2014)

R_{t}^{A} = R_{t - 1}^{A} + α_{t - 1}^{A A} (12),

(8)

where $α_{t}^{A A} (12)$ refers to the non-IBD probability between two homologous autosomal genes in an individual. The equations (7a, 8) show that the map expansion $R_{t}^{X}$ for an X chromosome is two-thirds $R_{t}^{A}$ for an autosome if the non-IBD probability $α_{t}^{A A} (12)$ for autosomes is the same as $α_{t}^{m p} (12)$ for XX chromosomes, and the sex ratio is 1.

In addition to $J_{t}^{m p} (a b c d)$ and $J_{t}^{p m} (a b c d)$ , we define $K_{t}^{m m} (a b c d)$ , $K_{t}^{m p} (a b c d)$ , $K_{t}^{p m} (a b c d)$ , and $K_{t}^{p p} (a b c d)$ for haplotypes $a c$ and $b d$ that are in two distinct individuals and have parental origins $m m$ , $m p$ , $p m$ , and $p p$ , respectively (Figure 2C). The contributions to the junctions in the current generation come from either the existing junctions at the previous generation, or a new junction via a crossover event. In the following, we focus on the formation of a new junction, because the contributions of the existing junctions in the previous generation are similar to those for the two-gene non-IBD probabilities in the recurrence equations (4a–4c).

The schematics of the recurrence relations for junction types $(1232)$ and $(1122)$ are shown in Figure S1. The ancestry transitions of type $(1122)$ occur on both haplotypes $a c$ and $b d$ at exactly the same location, and thus a new junction of type $(1122)$ can be formed only by duplicating a chromosome segment. It holds that $J_{t}^{m p} (1122) = J_{t}^{p m} (1122)$ and $K_{t}^{m p} (1122) = K_{t}^{p m} (1122)$ because of the symmetry of type $(1122)$ . We have

\begin{matrix} K_{t}^{m m} (1122) = s_{t}^{m} [\frac{1}{2} J_{t - 1}^{m p} (1122) + \frac{1}{4} R_{t - 1}^{m} + \frac{1}{4} R_{t - 1}^{p}] \\ + (1 - s_{t}^{m}) [\frac{1}{4} K_{t - 1}^{m m} (1122) + \frac{1}{2} K_{t - 1}^{m p} (1122) + \frac{1}{4} K_{t - 1}^{p p} (1122)], \end{matrix}

(9a)

K_{t}^{m p} (1122) = \frac{1}{2} K_{t - 1}^{m m} (1122) + \frac{1}{2} K_{t - 1}^{m p} (1122),

(9b)

K_{t}^{p p} (1122) = s_{t}^{p} R_{t - 1}^{m} + (1 - s_{t}^{p}) K_{t - 1}^{m m} (1122),

(9c)

J_{t}^{m p} (1122) = K_{t}^{m p} (1122),

(9d)

for t ≥ 1, where equation (9d) may not hold in the founder population at $t = 0$ , the first term on the right-hand side of equation (9a) refers to the scenario that both haplotypes $a c$ and $b d$ come from a single female of the previous generation, and the first term on the right-hand side of equation (9c) refers to the scenario that both haplotypes are the duplicated copies of the maternally derived X chromosome in a male of the previous generation (Figure S1A). According to equations (6a, 6b) and equations (9a–9d), the overall expected density $ρ^{m p}$ in equation (3) does not depend on the three-gene non-IBD probabilities.

The ancestry transition of type $(1232)$ occurs on haplotype $a c$ . A new junction of type $(1232)$ is formed whenever the two parental chromosomes of haplotype $a c$ and the parental chromosome of haplotype $b d$ are distinct and have the IBD configuration $(123)$ at the location of the crossover. We have

\begin{matrix} K_{t}^{m m} (1232) = s_{t}^{m} [\frac{1}{4} J_{t - 1}^{m p} (1232) + \frac{1}{4} J_{t - 1}^{p m} (1232)] \\ + (1 - s_{t}^{m}) {\frac{1}{4} [K_{t - 1}^{m m} (1232) + α_{t - 1}^{m m p} (123)] + \frac{1}{4} [K_{t - 1}^{m p} (1232) + α_{t - 1}^{m p p} (123)]} \\ + (1 - s_{t}^{m}) {\frac{1}{4} [K_{t - 1}^{p m} (1232) + α_{t - 1}^{m m p} (123)] + \frac{1}{4} [K_{t - 1}^{p p} (1232) + α_{t - 1}^{m p p} (123)]}, \end{matrix}

(10a)

K_{t}^{m p} (1232) = \frac{1}{2} [K_{t - 1}^{m m} (1232) + α_{t - 1}^{m m p} (123)] + \frac{1}{2} [K_{t - 1}^{p m} (1232) + α_{t - 1}^{m m p} (123)],

(10b)

K_{t}^{p m} (1232) = \frac{1}{2} K_{t - 1}^{m m} (1232) + \frac{1}{2} K_{t - 1}^{m p} (1232),

(10c)

K_{t}^{p p} (1232) = (1 - s_{t}^{p}) K_{t - 1}^{m m} (1232),

(10d)

J_{t}^{m p} (1232) = K_{t}^{m p} (1232),

(10e)

J_{t}^{p m} (1232) = K_{t}^{p m} (1232),

(10f)

for $t \geq 1$ , where equations (10e, 10f) may not hold in the founder population at $t = 0$ . A new junction is formed at the rate $α_{t - 1}^{m m p} (123)$ ( $α_{t - 1}^{m p p} (123)$ ), given that the parental chromosome of haplotype $b d$ is maternally (paternally) derived. The density $K_{t}^{p m} (1232)$ in equation (10c) has no contributions of a new junction because there are no crossover events occurring between the XY chromosomes in the father of haplotype $a c$ (Figure S1B). We denote by $K_{t}^{m p +} (1232) = [K_{t}^{m p} (1232) + K_{t}^{p m} (1232)] / 2$ , and $K_{t}^{m p -} (1232) = [K_{t}^{m p} (1232) - K_{t}^{p m} (1232)] / 2$ , and their recurrence relations are given in Appendix A. Both $K_{t}^{m p -} (1232)$ and $R_{t}^{-}$ measure the asymmetry between maternally and paternally derived X chromosomes.

Model evaluation by simulations

To evaluate the theoretical predications of non-IBD probabilities and expected junction densities, we perform simulation studies with the same model assumptions: random mating with discrete generations, no natural selections, and no genetic interferences, except that the ancestral origins along chromosomes do not follow Marker assumptions. Instead, the genome ancestral origins are simulated forwardly by first generating a pedigree according to a given breeding design, and then dropping on the pedigree the distinct founder genome labels (ancestral origins) that are assigned to the whole X chromosomes of each complete inbred founder or to each X chromosome of each outbred founder. The X chromosomes of each descendant gamete are specified as a list of the labeled segments determined by chromosomal crossovers.

For a mapping population with the particular breeding design, the realized junction densities and IBD probabilities are saved for all individuals in each generation in each simulation replicate, and they are averaged over in total $2 \times 10^{4}$ replicates. Various mating schemes are used in simulating breeding pedigrees. We denote by RM1 the random mating where each sampling of two randomly chosen individuals with opposite genders produces one offspring, and RM2 the random mating where each sampling of two randomly chosen individuals with opposite genders produces two offspring. We combine these mating schemes with -NE if each parent contributes a Poisson distributed number of gametes to the next generation, and -E if each parent contributes exactly two gametes. Thus, we have four random mating schemes, RM1-NE, RM1-E, RM2-NE, and RM2-E. The sibling mating belongs to RM2-E with population size 2, and the exclusively pairing in $2^{n}$ -way (n ≥ 1) crosses can be regarded as a special case of random mating without inbreeding. The genders are assigned randomly, independent of mating schemes.

Application to QTL mapping populations

Multistage populations

For mapping populations with stage-wise constant mating schemes, we derive analytic expressions of the non-IBD probabilities and the expected junction densities for constructing CTMC of X-linked ancestral origins, according to the recurrence relations. The closed form solutions are obtained by linking results of each subsequent stage via the initial conditions. The general results for a population with constant random mating are derived in Appendix A, where three scenarios are considered: finite population of size ≥6, sibling-mating population of size 2, and large population of size »6. Table S2 gives the coalescence probabilities of X chromosomes for various mating schemes, similar to Table 1 of Zheng et al. (2014) for autosomes. Table S3 summarizes the results for X chromosomes in a sibling-mating population, and Table S4 for autosomes; they are necessary for dioecious breeding populations with a stage of inbreeding by sibling mating such as the CC and the DSPR. We use the superscripts of A denoting the quantities for autosomes.

Table 1. Results for X chromosomes in the $2^{n}$ -way RIL by sibling mating in the last generation $g = U + V + 1$ , where $U = 0$ for $n = 1$ and $U = n - 2$ for $n \geq 2$ .

Quantity	Theoretical Prediction
(A) 2 ways sibling
$α_{g}^{m p} (12)$	$\frac{5 + \sqrt{5}}{10} {(λ_{1})}^{V} + c o n j u g a t e$
$R_{g}^{X}$	$\frac{8}{3} - \frac{20 + 8 \sqrt{5}}{15} {(λ_{1})}^{V} + c o n j u g a t e$
$R_{g}^{m}$	$\frac{8}{3} - \frac{2}{3} {(- \frac{1}{2})}^{V} - \frac{5 + 3 \sqrt{5}}{5} {(λ_{1})}^{V} + c o n j u g a t e$
$J_{g}^{m p} (1122)$	$\frac{8}{3} + \frac{1}{3} {(- \frac{1}{2})}^{V} - (\frac{3 + \sqrt{5}}{2} + \frac{5 + \sqrt{5}}{10} V) {(λ_{1})}^{V} + c o n j u g a t e$
$J_{g}^{m p +} (1232)$	$0$
$J_{g}^{m p -} (1232)$	0
(B) $2^{n} (n \geq 2)$ ways sibling
$α_{g}^{m p} (12)$	$\frac{5 + 3 \sqrt{5}}{10} {(λ_{1})}^{V} + c o n j u g a t e$
$R_{g}^{X}$	$\frac{2}{3} (U + 6) - \frac{30 + 14 \sqrt{5}}{15} {(λ_{1})}^{V} + c o n j u g a t e$
$R_{g}^{m}$	$\frac{2}{3} (U + 6) + \frac{1}{3} (R_{U + 1}^{m} - R_{U + 1}^{p}) {(- \frac{1}{2})}^{V} - \frac{10 + 4 \sqrt{5}}{5} {(λ_{1})}^{V} + c o n j u g a t e$
$J_{g}^{m p} (1122)$	$\begin{array}{l} \frac{2}{3} (U + 6) - \frac{1}{6} (R_{U + 1}^{m} - R_{U + 1}^{p}) {(- \frac{1}{2})}^{V} \\ - [\frac{10 + 4 \sqrt{5}}{5} + \frac{1 + \sqrt{5}}{4} R_{U + 1}^{m} + \frac{5 + \sqrt{5}}{20} R_{U + 1}^{p} + \frac{5 + 3 \sqrt{5}}{10} V] {(λ_{1})}^{V} + c o n j u g a t e \end{array}$
$J_{g}^{m p +} (1232)$	$- {(\frac{1}{2})}^{V} + [\frac{5 + 3 \sqrt{5}}{10} + \frac{1 + \sqrt{5}}{4} R_{U + 1}^{m} + \frac{5 + \sqrt{5}}{20} R_{U + 1}^{p}] {(λ_{1})}^{V} + c o n j u g a t e$
$J_{g}^{m p -} (1232)$	${(\frac{1}{2})}^{V + 1} + (R_{U + 1}^{p} - R_{U + 1}^{m} + 1) {(- \frac{1}{2})}^{V + 1}$

Open in a new tab

The eigenvalues $λ_{1} = (1 + \sqrt{5}) / 4$ and $λ_{2} = (1 - \sqrt{5}) / 4$ . The map expansions $R_{U + 1}^{p}$ and $R_{U + 1}^{m}$ are given by equations (11a, 11b). The conjugate is given by replacing $\sqrt{5}$ with $- \sqrt{5}$ from the terms involving $λ_{1}$ . For example, the conjugate term for $R_{g}^{X}$ in (A) is given by $- (20 - 8 \sqrt 5) {(λ_{2})}^{V} / 15$ . RIL, recombinant inbred line.

We derive the analytic expressions of $α_{t}^{m p} (12)$ , $R_{t}^{X}$ , $R_{t}^{m}$ , $J_{t}^{m p} (1122)$ , $J_{t}^{m p +} (1232)$ , and $J_{t}^{m p -} (1232)$ in the mapping populations of the RIL, the AIL, and the DO, and they are given in Table 1, Table 2, and Table 3, respectively. These results are necessary for constructing the CTMC of ancestral origins along the XX chromosomes of a female; only the expression of $R_{t}^{m}$ is needed for the maternal derived X chromosome of a male. For comparisons, the autosomal results for $α_{t}^{A A} (12)$ , $R_{t}^{A}$ , $J_{t}^{A A} (1122)$ , and $J_{t}^{A A} (1232)$ are included. The results for the AIL, the DO, and the DSPR are derived under the assumption of a large population size in the intercross stage. We evaluate this assumption in the DSPR, because the evaluation results hold similarly for the AIL and the DO. In addition, the map expansions $R_{t}^{X}$ and $R_{t}^{A}$ are given explicitly under the assumption of an infinitely large intercross population size, which may be used as a simple measure of QTL mapping resolution.

Table 2. Results for the AIL in the last generation $g = U + 1$ .

Quantity	Theoretical Prediction
(A) X chromosomes
$α_{g}^{m p} (12)$	$(1 - \frac{10}{9 L}) {(λ_{1})}^{U} + \frac{2}{9 L} {(- \frac{1}{2})}^{U} + \frac{8}{9 L} {(\frac{1}{4})}^{U}$
$R_{g}^{X}$	$\frac{8}{9 L} + \frac{2}{3} (1 - \frac{10}{9 L}) \frac{1 - {(λ_{1})}^{U}}{1 - λ_{1}} - \frac{8}{81 L} {(- \frac{1}{2})}^{U} - \frac{64}{81 L} {(\frac{1}{4})}^{U}$
$R_{g}^{m}$	$\frac{2}{9} + \frac{52}{81 L} + \frac{2}{3} (1 - \frac{10}{9 L}) \frac{1 - {(λ_{1})}^{U}}{1 - λ_{1}} - (\frac{2}{9} + \frac{20}{81 L} + \frac{4}{27 L} U) {(- \frac{1}{2})}^{U} - \frac{32}{81 L} {(\frac{1}{4})}^{U}$
$J_{g}^{m p} (1122)$	$\frac{2}{9} + \frac{52}{81 L} + \frac{2}{3} (1 - \frac{10}{9 L}) \frac{1 - {(λ_{1})}^{U}}{1 - λ_{1}} - [\frac{2}{9} + \frac{52}{81 L} + (\frac{2}{3} + \frac{16}{27} s) (1 - \frac{10}{9 L}) U] {(λ_{1})}^{U}$
$J_{g}^{m p +} (1232)$	$(1 - \frac{2}{L}) (1 - \frac{4}{3 L}) \frac{{(λ_{1})}^{U} - {(λ_{4})}^{U}}{s} + (1 - \frac{2}{L}) (- \frac{1}{9} + \frac{76}{81 L}) {(λ_{1})}^{U} + (1 - \frac{2}{L}) \frac{16}{81 L} {(λ_{4})}^{U}$ $+ (1 - \frac{2}{L}) (\frac{1}{9} - \frac{4}{81 L} + \frac{8}{27 L} U) {(- \frac{1}{2})}^{U} + (1 - \frac{2}{L}) (- \frac{88}{81 L} + \frac{16}{27 L} U) {(\frac{1}{4})}^{U}$
$J_{g}^{m p -} (1232)$	$(1 - \frac{2}{L}) (\frac{1}{3} - \frac{4}{9 L}) {(λ_{4})}^{U} - (1 - \frac{2}{L}) (\frac{1}{3} + \frac{4}{9 L}) {(- \frac{1}{2})}^{U} + (1 - \frac{2}{L}) \frac{8}{9 L} {(\frac{1}{4})}^{U}$
(B) Autosomes
$α_{g}^{A A} (12)$	$(1 - \frac{1}{L}) {(λ_{1}^{A})}^{U - 1}$
$R_{g}^{A}$	$1 + (1 - \frac{1}{L}) \frac{1 - {(λ_{1}^{A})}^{U - 1}}{1 - λ_{1}^{A}}$
$J_{g}^{A A} (1122)$	$[1 - {(λ_{1}^{A})}^{U - 1}] + (1 - \frac{1}{L}) [\frac{1 - {(λ_{1}^{A})}^{U - 1}}{1 - λ_{1}^{A}} - (U - 1) {(λ_{1}^{A})}^{U - 2}]$
$J_{g}^{A A} (1232)$	$(1 - \frac{2}{L}) {(λ_{1}^{A})}^{U - 1} + (1 - \frac{1}{L}) (1 - \frac{2}{L}) \frac{{(λ_{1}^{A})}^{U - 1} - {(λ_{4}^{A})}^{U - 1}}{s^{A}}$

Open in a new tab

The eigenvalues $λ_{1} = 1 - s / 3$ and $λ_{4} = 1 - s$ for X chromosomes, and for autosomes $λ_{1}^{A} = 1 - s^{A} / 2$ and $λ_{4}^{A} = 1 - 3 s^{A} / 2$ . AIL, advanced inter-cross lines.

Table 3. Results for the DO in the last generation $g = U + 1$ .

Quantity	Theoretical Prediction
(A) X chromosomes
$α_{g}^{m p} (12)$	$(1 - \frac{1}{L}) {(λ_{1})}^{U}$
$R_{g}^{X}$	$R_{0}^{X} + \frac{2}{3} α_{0}^{m p} (12) + \frac{2}{3} (1 - \frac{1}{L}) \frac{1 - {(λ_{1})}^{U}}{1 - λ_{1}}$
$R_{g}^{m}$	$\begin{array}{l} R_{0}^{X} + \frac{2}{3} α_{0}^{m p} (12) + \frac{2}{9} (1 - \frac{1}{L}) + \frac{2}{3} (1 - \frac{1}{L}) \frac{1 - {(λ_{1})}^{U}}{1 - λ_{1}} \\ - [\frac{2}{9} (1 - \frac{1}{L}) + \frac{1}{6} (R_{0}^{m} - R_{0}^{p}) - \frac{1}{3} α_{0}^{m p} (12)] {(- \frac{1}{2})}^{U} \end{array}$
$J_{g}^{m p} (1122)$	$\begin{array}{l} R_{0}^{X} + \frac{2}{3} α_{0}^{m p} (12) + \frac{2}{9} (1 - \frac{1}{L}) + \frac{2}{3} (1 - \frac{1}{L}) \frac{1 - {(λ_{1})}^{U}}{1 - λ_{1}} \\ - [R_{0}^{X} + \frac{2}{3} α_{0}^{m p} (12) + \frac{2}{9} (1 - \frac{1}{L}) + (\frac{2}{3} + \frac{16}{27} s) (1 - \frac{1}{L}) U] {(λ_{1})}^{U} \end{array}$
$J_{g}^{m p +} (1232)$	$\begin{array}{l} (1 - \frac{1}{L}) (1 - \frac{2}{L}) \frac{{(λ_{1})}^{U} - {(λ_{4})}^{U}}{s} + (1 - \frac{2}{L}) [- \frac{1}{9} (1 - \frac{1}{L}) + R_{0}^{X} + \frac{2}{3} α_{0}^{m p} (12)] {(λ_{1})}^{U} \\ + (1 - \frac{2}{L}) [\frac{1}{9} (1 - \frac{1}{L}) + \frac{1}{12} (R_{0}^{m} - R_{0}^{p}) - \frac{1}{6} α_{0}^{m p} (12)] {(- \frac{1}{2})}^{U} \end{array}$
$J_{g}^{m p -} (1232)$	$\frac{1}{3} (1 - \frac{1}{L}) (1 - \frac{2}{L}) {(λ_{4})}^{U} + (1 - \frac{2}{L}) [- \frac{1}{3} (1 - \frac{1}{L}) - \frac{1}{4} (R_{0}^{m} - R_{0}^{p}) + \frac{1}{2} α_{0}^{m p} (12)] {(- \frac{1}{2})}^{U}$
(B) Autosomes
$α_{g}^{A A} (12)$	$(1 - \frac{1}{L}) {(λ_{1}^{A})}^{U}$
$R_{g}^{A}$	$R_{0}^{A} + α_{0}^{A A} (12) + (1 - \frac{1}{L}) \frac{1 - {(λ_{1}^{A})}^{U}}{1 - λ_{1}^{A}}$
$J_{g}^{A A} (1122)$	$[R_{0}^{A} + α_{0}^{A A} (12)] [1 - {(λ_{1}^{A})}^{U}] + (1 - \frac{1}{L}) [\frac{1 - {(λ_{1}^{A})}^{U}}{1 - λ_{1}^{A}} - U {(λ_{1}^{A})}^{U - 1}]$
$J_{g}^{A A} (1232)$	$[R_{0}^{A} + α_{0}^{A A} (12)] (1 - \frac{2}{L}) {(λ_{1}^{A})}^{U} + (1 - \frac{1}{L}) (1 - \frac{2}{L}) \frac{{(λ_{1}^{A})}^{U} - {(λ_{4}^{A})}^{U}}{s^{A}}$

Open in a new tab

The eigenvalues $λ_{1} = 1 - s / 3$ and $λ_{4} = 1 - s$ for X chromosomes, and for autosomes $λ_{1}^{A} = 1 - s^{A} / 2$ and $λ_{4}^{A} = 1 - 3 s^{A} / 2$ . DO, diversity outcross.

Many breeding populations can be divided into three stages: mixing, intercross, and inbreeding, such as the RIL by sibling mating, the CC, and the DSPR. There is no inbreeding stage for the AIL, the HS, and the DO. We denote by U the number of intercross generations, V the number of inbreeding generations, and N the intercross population size. Let $ℳ_{F}$ and $ℳ_{I}$ denote the random mating schemes for mixing and intercross stages, respectively. We choose the mixing stage to consist of one generation of random mating, so that the non-IBD probabilities and the expected junction densities in the $F_{1}$ population do not depend on whether genes or haplotypes are in distinct individuals.

The general derivation procedure is as follows. First, we derive the initial conditions in the $F_{1}$ population for the intercross stage, according to the genetic compositions of the founder population $F_{0}$ . Second, we substitute the obtained initial conditions into the theorems of Appendix A3 under the assumption of a large intercross population size. Alternatively, the theorems of Appendix A1 may be used for a finite intercross population. Lastly, if there is a stage of inbreeding by sibling mating, we substitute analytic expressions in the $F_{U + 1}$ population into the theorems of Appendix A2 to obtain the results in the last generation $g = U + V + 1$ .

RIL

The $2^{n}$ -way RIL by sibling mating can be regarded as a three-stage mapping population without the intercross stage for $n \leq 2$ . All the founders are fully inbred, and the intercross mating scheme is exclusively pairing so that inbreeding is completely avoided. Thus $R_{1}^{m} = R_{1}^{p} = 0$ , and the non-IBD probability $α_{t}^{m p} (12) = 1$ during the intercross stage $1 \leq t \leq U + 1$ , where $U = 0$ for $n = 1$ and $U = n - 2$ for $n \geq 2$ . According to the recurrence equations (6a, 6b), it holds

R_{U + 1}^{m} = \frac{2}{9} [1 + 3 U - {(- \frac{1}{2})}^{U}],

(11a)

R_{U + 1}^{p} = R_{U}^{m},

(11b)

and $R_{U + 1}^{X} = 2 U / 3$ . Furthermore, it is straightforward to obtain $β_{U + 1}^{m p} (12) = 1$ , $β_{U + 1}^{m m} (12) = α_{U + 1}^{m m p} (123) = δ_{n \geq 2}$ , $K_{U + 1}^{m m} (1122) = K_{U + 1}^{m p} (1122) = 0$ , $K_{U + 1}^{m m} (1232) = K_{U + 1}^{m p} (1232) = R_{U + 1}^{m}$ , and $K_{U + 1}^{p m} (1232) = R_{U + 1}^{p}$ , where the indicator $δ_{n \geq 2} = 1$ if $n \geq 2$ and 0 otherwise, since the two maternally derived genes at $t = 1$ must come from the inbred female founder for the two-way RIL.

Substituting the initial conditions in the $F_{U + 1}$ population into Table S3, we obtain the results for the RIL in the last generation $t = U + V + 1$ shown in Table 1. The non-IBD probabilities $α_{t}^{m p} (12)$ for X chromosomes are the same as those for autosomes (Table 2 of Zheng et al. 2014). Thus, we show analytically that the map expansion $R^{X}$ for the X chromosome is two-thirds that of the autosome for the $2^{n}$ -way (n ≥ 1) RIL, according to equations (7a, 8). Broman (2012a) has verified this two-thirds rule via Maxima for the $2^{n}$ -way RIL up to $n = 98$ .

Figure 3 shows that these theoretical predictions fit very well with the forward simulation results for the two- and eight-way RIL by sibling mating. The differential densities $R_{t}^{-}$ and $J_{t}^{m p -} (1232)$ decay very fast with generation t and show some oscillations in the beginning generations. The overall expected junction density $ρ_{t}^{m p}$ reaches the maximum in the same generation for autosomes.

Results of the $2^{n}$ -way recombinant inbred lines (RILs) with by sibling mating for $n = 1$ (left panels) and $n = 3$ (right panels). The filled symbols refer to the results for X chromosomes, the empty symbols for autosomes, and lines for the theoretical predictions in Table 1. The non-IBD probabilities $α_{t}^{m p} (12)$ for X chromosomes and autosomes are overlapped with each other. The brown filled diamonds refer to $J_{t}^{m p -} (1232)$ in (C) and (D) and $ρ_{t}^{m p}$ in (E) and (F).

AIL

We consider a multiparental AIL population that is founded by $L / 2$ inbred females and $L / 2$ inbred males. A unique ancestral origin is assigned to each inbred founder’s genomes so that the two-gene non-IBD probabilities $α_{0}^{m p} (12) = 0$ and $β_{0}^{m m} (12) = β_{0}^{m p} (12) = β_{0}^{p p} (12) = 1$ , and similarly for the three-gene non-IBD probabilities $α_{0}^{m m p} (123) = α_{0}^{m p p} (123) = 0$ , $β_{0}^{m m m} (123) = β_{0}^{m m p} (123) = β_{0}^{m p p} (123) = β_{0}^{p p p} (123) = 1$ if they exist.

The $F_{1}$ population of size N is produced by mating scheme $ℳ_{F} =$ RM1-NE or RM2-NE. According to Table S2, the coalescence probabilities $s_{1}^{m} = s_{1}^{p} = 2 / L$ and $q_{1}^{m} = q_{1}^{p} = (2 / L) (1 - 2 / L)$ for mating scheme RM1-NE, and they hold approximately for RM2-NE with large population size N » 6. Thus, the two-gene non-IBD probabilities at $t = 1$ are given by $β_{1}^{m m} (12) = β_{1}^{p p} (12) = 1 - 2 / L$ and $α_{1}^{m p} (12) = β_{1}^{m p} (12) = 1$ according to the recurrence equations (4a–4d), and the three-gene non-IBD probabilites at $t = 1$ are given by $β_{1}^{m m m} (123) = β_{1}^{p p p} (123) = (1 - 2 / L) (1 - 4 / L)$ and $α_{1}^{m m p} (123) = α_{1}^{m p p} (123) = β_{1}^{m m p} (123) = β_{1}^{m p p} (123) = 1 - 2 / L$ according to the recurrence equations (5a–5f). In addition, no junctions can be formed from inbred founders so that it holds that $R_{1}^{m} = R_{1}^{p} = 0$ , $K_{1}^{m m} (1122) = K_{1}^{m p} (1122) = K_{1}^{p p} (1122) = 0$ , and $K_{1}^{m m} (1232) = K_{1}^{m p} (1232) = K_{1}^{p m} (1232) = K_{1}^{p p} (1232) = 0$ .

The $F_{1}$ population is maintained for U generations with constant size N and sex ratio 1. Assuming that the intercross population size is large (N » 6), all the two- and three-gene coalescence probabilities at $t \geq 2$ are approximately equal and are denoted by s, and they are determined by the intercross mating scheme $ℳ_{I}$ according to Table S2. Substituting the initial conditions in the $F_{1}$ population into the theorems of Appendix A3, we obtain in Table 2 the results for X chromosomes in the AIL in the last generation $t = U + 1$ . Table 2 also shows the results for autosomes, which are derived according to Zheng et al. (2014).

As shown in Table 2, the non-IBD probabilities $α_{t}^{m p} (12)$ for X-chromosomes are unequal to those for autosomes, and thus the map expansions generally do not satisfy the two-thirds rule. According to the map expansions $R_{t}^{X}$ and $R_{t}^{A}$ in Table 2, we derive their approximations under the limit of an infinitely large population size (N →∞) so that the coalescence probability goes to zero (s →0),

R_{U + 1}^{X} \approx \frac{8}{9 L} + \frac{2}{3} (1 - \frac{10}{9 L}) U,

(12a)

R_{U + 1}^{A} \approx 1 + (1 - \frac{1}{L}) (U - 1),

(12b)

where the last two terms for $R_{t}^{X}$ in Table 2 are small and thus ignored. The equations (12a, 12b) show that the two-thirds rule is approximately valid for a large number L of founder lines. The map expansion of equation (12b) for $L = 2$ is consistent with the previous results (Darvasi and Soller 1995; Liu et al. 1996; Winkler et al. 2003; Broman 2012b).

The left panels of Figure 4 show for the AIL that the theoretical predictions fit very well with the forward simulation results, where $ℳ_{F} =$ RM1-NE, $ℳ_{I} =$ RM1-E, $L = 8$ , and $N = 100$ . Within $U = 20$ intercross generations, the non-IBD probability $α_{t}^{m p} (12)$ decreases slowly with generation t, the differential map expansion $R_{t}^{-}$ remains almost constant after a few generations of oscillations, and the map expansions in equations (12a, 12b), shown as thick red lines in Figure 4, are very good approximations.

Results of the AIL (left panels) and the HS (right panels) with L = 8 and N = 100. The random mating schemes *M_F* = RM1-NE for the AIL and RM1-E for the HS, and *M_I* = RM1-E for both populations. The symbols and lines are the same as those in Figure 3. The theoretical predictions refer to Table 2 for the AIL and Table 3 for the DO. The additional red lines denote the map expansions under the large size assumption, given by equations (12a, 12b) for the AIL and equations (13a, 13b) for the HS.

HS and DO

The HS and the DO differ from the AIL only in the genetic compositions of the founder population. The N progenitors of the DO at $t = 0$ were sampled independently from pre-CC lines at a variety of different generations. Each pre-CC line is produced by the RIL by sibling mating from $L = 8$ randomly permuted founder strains. Let $q_{k}$ denote the proportion of the pre-CC progenitors that were in generation k. Thus, for a random progenitor, it holds $α_{0}^{m p} (12) = \sum_{k} q_{k} α_{k}^{m p} (12 | pre ‐ CC)$ and $R_{0}^{O} = \sum_{k} q_{k} R_{k}^{O} (pre ‐ CC)$ , where $α_{k}^{m p} (12 | pre ‐ CC)$ and $R_{k}^{O} (pre ‐ CC)$ for $O = m, p$ can be obtained from Table 1. Because the founder stains are exchangeable, we obtain $β_{0}^{m m} (12) = β_{0}^{m p} (12) = β_{0}^{p p} (12) = 1 - 1 / L$ , $α_{0}^{m m p} (123) = α_{0}^{m p p} (123) = α_{0}^{m p} (12) (1 - 2 / L)$ , and $β_{0}^{m m m} (123) = β_{0}^{m m p} (123) = β_{0}^{m p p} (123) = β_{0}^{p p p} (123) = (1 - 1 / L) (1 - 2 / L)$ , and because recombination crossovers are independent among different pre-CC lines, the between-individual expected junction densities at $t = 0$ are given by $K_{0}^{m m} (1122) = K_{0}^{m p} (1122) = K_{0}^{p p} (1122) = 0$ , $K_{0}^{m m} (1232) = K_{0}^{m p} (1232) = R_{0}^{m} (1 - 2 / L)$ , and $K_{0}^{p p} (1232) = K_{0}^{p m} (1232) = R_{0}^{p} (1 - 2 / L)$ , where $1 - 2 / L$ refers to the probability that the third ancestral origin on haplotype $b d$ is different from the two ancestral origins on haplotype $a c$ where the ancestry transition occurs. The within-individual expected junction densities at $t = 0$ are not required in the following derivations.

The $F_{1}$ population of size N is produced by random mating with equal sex ratio. Assuming that the population size N » 6, the coalescence probabilities at $t = 1$ are approximated to be zero. According to the recurrence equations for the two- and three-gene non-IBD probabilities, the between-individual probabilities did not change and the within-individual non-IBD probabilities at $t = 1$ equal to the corresponding between-individual probabilities. In addition, we have $R_{1}^{m} = R_{0}^{m} / 2 + R_{0}^{p} / 2 + α_{0}^{m p} (12)$ , $R_{1}^{p} = R_{0}^{m}$ , $K_{1}^{m m} (1122) = K_{1}^{m p} (1122) = K_{1}^{p p} (1122) = 0$ , $K_{1}^{m m} (1232) = K_{1}^{m p} (1232) = [R_{0}^{m} / 2 + R_{0}^{p} / 2 + α_{0}^{m p} (12)] (1 - 2 / L)$ , $K_{1}^{p p} (1232) = K_{1}^{p m} (1232) = R_{0}^{m} (1 - 2 / L)$ , according to the recurrence equations for the expected junction densities.

Similar to the intercross stage of the AIL, we obtain in Table 3 the results for X chromosomes in the DO in the last generation $t = U + 1$ by substituting the initial conditions in the $F_{1}$ population into the theorems of Appendix A3. Table 3 also shows the results for autosomes, which are derived according to Zheng et al. (2014). Under the limit of an infinitely large population size (N→∞), we obtain from Table 3

R_{U + 1}^{X} \approx R_{0}^{X} + \frac{2}{3} α_{0}^{m p} (12) + \frac{2}{3} (1 - \frac{1}{L}) U,

(13a)

R_{U + 1}^{A} \approx R_{0}^{A} + α_{0}^{A A} (12) + (1 - \frac{1}{L}) U,

(13b)

showing that the two-thirds rule is valid under such an approximation since $α_{0}^{m p} (12) = α_{0}^{A A} (12)$ and $R_{0}^{X} = 2 R_{0}^{A} / 3$ for progenitors drawn from the RIL (Table 1). The map expansion in equation (13b) for $L = 8$ is the same as the one obtained by Broman (2012b).

The right panels of Figure 4 show for the HS that the theoretical predictions fit very well with the forward simulation results, where $ℳ_{F} = ℳ_{I} =$ RM1-E, the $N =$ 100 individuals in the $F_{0}$ population were sampled independently from CC funnels at the same generation $t = 3$ . The results are similar to those for the AIL with the same L shown in the left panels of Figure 4. For X chromosomes, the non-IBD probabilities in the DO are larger than those in the AIL, and thus in the DO the map expands at a higher rate than that for the AIL, see equations (12a, 13a).

DSPR

The DSPR RILs were derived from two synthetic populations, each created independently by adding the multiparental AIL with an inbreeding stage by sibling mating (King et al. 2012). For example, we derive the analytic expressions of the map expansions in one synthetic population with L founder strains. We assume that $β_{U + 1}^{m m} (12) = β_{U + 1}^{m p} (12)$ , which holds in a non-inbreeding population and approximately in a large population (e.g., $N \geq 100$ ) with a large number of intercross generations (e.g., $U \geq 6$ ). According to the map expansions in Table S3, we have

R_{U + V + 1}^{X} \approx R_{U + 1}^{X} + α_{U + 1}^{m p} (12) [4 - \frac{30 + 14 \sqrt{5}}{15} {(λ_{1})}^{V} - \frac{30 - 14 \sqrt{5}}{15} {(λ_{2})}^{V}],

(14a)

R_{U + V + 1}^{A} \approx R_{U + 1}^{A} + α_{U + 1}^{A A} (12) [6 - \frac{15 + 7 \sqrt{5}}{5} {(λ_{1})}^{V} - \frac{15 - 7 \sqrt{5}}{5} {(λ_{2})}^{V}],

(14b)

where $λ_{1} = (1 + \sqrt{5}) / 4$ and $λ_{2} = (1 - \sqrt{5}) / 4$ , and $R_{U + 1}^{X}$ and $α_{U + 1}^{m p} (12)$ are given in Table 1, Table 2, or Table 3 if the $F_{U + 1}$ population is the last generation of the RIL, the AIL, or the DO, respectively.

We evaluate the large size assumption for various random mating schemes by simulation studies of the DSPR. Figure 5 shows the fitting of the theoretical predictions with the forward simulation results for the intercross size $N =$ 20, 50, and 100, where the mating schemes $ℳ_{F} =$ RM1-NE and $ℳ_{I} =$ RM1-E (RM1-NE) for the left (right) panels. The theoretical predictions are obtained by combining the results for the AIL (Table 2) with those for the sibling-mating population (Table S3), assuming the large size (N » 6). The relative worse fitting for the differential densities $R_{t}^{-}$ and $J_{t}^{m p -} (1232)$ is probably attributable to the limited number (2 × 10⁴) of simulation replicates. The theoretical fitting becomes improved with increased size N, and it is very good for N = 100 within the range of U = 20 intercross generations. The fitting for RM1-E is better than RM1-NE because in the former case the two-gene coalescence probabilities are always equal to the three-gene probabilities (Table S2), independent of the size N. Figure S2 shows similar results for the random mating scheme RM2, except that the expected junction densities are slightly smaller. Figure S3 and Figure S4 show that the large size assumption is less sensitive for autosomes, and the fittings are very good even for N = 20.

Results of the DSPR for X chromosomes with $L = 8$ and $N =$ 20 (cyan), 50 (brown), and 100 (blue). The random mating schemes *M_F* = RM1-NE for all panels and *M_I* = RM1-E (RM1-NE) for the left (right) panels. The lines denote the theoretical predictions under the large size assumption. The filled circles refer to $R_{t}^{-}$ in panels C and D, and $R_{t}^{X}$ in panels E and F; the filled diamonds refer to $J_{t}^{m p -} (1232)$ in (C) and (D) and $ρ_{t}^{m p}$ in (E) and (F).

Discussion

We have extended our previous framework of modeling ancestral origin processes from autosomes to X chromosomes, and thus the same assumptions such as exchangeability of ancestral origins, Markov properties and random mating also apply (Zheng et al. 2014). The deviations from Markov properties result in larger variances in the IBD-tract length and the junction densities, which have been shown to be acceptable (Chapman and Thompson 2003; Martin and Hospital 2011). The random mating indicates that our approach does not apply to breeding populations with marker-assisted selections.

In contrast to the previous approaches (Haldane and Waddington 1931; Broman 2012a), the exchangeability assumption of ancestral origins greatly reduces model complexity, because the number of possible junction types does not depend on the number of founders for L ≥ 3 whereas the number of diplotype states increases very fast with L. The assumption affects the rate matrix of the Markov model, but not the expected junction densities where only changes of ancestral origins matter. The exchangeability is a good approximation for the AIL- or the multiparent advanced generation inter-cross (i.e., MAGIC)-type populations with random mating, but it does not hold for the multiway RIL by sibling mating.

However, the exchangeability assumption is not critical for the application of our results to haplotype reconstructions from genotype data. The genomes of the individuals collected in the last generation have been well mixed by random chromosomal segregations over many generations. This is demonstrated in Figure 1A for the four-way RIL by sibling mating, where a female A and a male B was crossed, and a female C and a male D was crossed, and then a daughter from A × B and a son from C × D was crossed. The X chromosome of the founder D is lost in $F_{1}$ . The genotype probabilities for AB and AC are different and given in the Table 2 of Broman (2012a), although the sum of the genotype probabilities for AB, AC, and BC is equal to $α_{t}^{m p} (12)$ in Table 1. Figure 1B shows that the genotype probability for AB or AC becomes close to the average probability $α_{t}^{m p} (12) / 3$ as generation t increases. Furthermore, in the beginning generations when the asymmetry among ancestral origins is large, there are fewer number of recombination breakpoints, and thus more marker data per genome block are available to estimate ancestral origins. As a result, a priori equal weights of ancestral origins have little effects.

An HMM is under development for reconstructing ancestral origins for both autosomes and X chromosomes from marker data, using the present model and the previous one (Zheng et al. 2014) as the prior distribution. The previously implemented HMM methods, such as GAIN (Liu et al. 2010) and HAPPY (Mott et al. 2000), were developed for autosomes, and they do not account for the asymmetry between maternally and paternally derived X chromosomes.

The closed form expressions for non-IBD probabilities and various expected junction densities have been derived for stage-wise mapping populations. They provide the complete information for constructing the CTMC along two X chromosomes but also the guides for designing a new population in terms of X-linked QTL mapping resolutions. For advanced intercross populations such as the AIL, the HS, and the DO under the assumption of a large intercross size, the map expands linearly at a rate proportional to the inverse of the number L of inbred founders, which is robust to intercross mating schemes. For the RIL and the inbreeding stage of the DSPR, the map expansion slows down with increasing level of inbreeding. The overall junction density $ρ^{m p}$ for the DSPR decreases after one generation of the inbreeding stage by sibling mating, whereas for the RIL it reaches the maximum in the middle of inbreeding by sibling mating. These conclusions can also be applied to autosomes. Thus the most effective way of improving mapping resolutions is to increase the number U of intercross generations in a large population (N ≥ 5U, empirically).

Acknowledgments

I thank George O. Agogo, Rianne Jacobs, Martin P. Boer, Fred A. van Eeuwijk, and the two anonymous reviewers for their helpful comments. This research was supported by the Stichting Technische Wetenschappen (STW) - Technology Foundation, which is part of the Nederlandse Organisatie voor Wetenschappelijk Onderzoek - Netherlands Organization for Scientific Research, and which is partly funded by the Ministry of Economic Affairs. The specific grant number was STW-Rijk Zwaan project 12425.

Appendix A

Results for constant random mating populations

We introduce some matrix-vector notations to facilitate the derivations. Denote by $A ⊙ B$ the element-by-element multiplication of the two matrices $A$ and $B$ , and by $A ⊘ B$ the element-by-element division of the two matrices. Denote by $x_{i - j}^{t} = (x_{i}^{t}, x_{i + 1}^{t}, \dots, x_{j}^{t})$ the element-wise power where the subscripts of the natural numbers $i \leq j$ , and by default $x_{i - j} = (x_{i}, x_{i + 1}, \dots, x_{j})$ . Let 1 be a vector with appropriate length and all the elements being 1. Let $Λ (x)$ be the diagonal matrix with the diagonal elements being the vector $x$ . Denote by $[x, \dots, y]$ the matrix with row vectors $x$ , …, $y$ of equal length. Denote by superscript T the transpose of a vector or matrix.

The closed form expressions for the two- and three-gene non-IBD probabilities and the expected junction densities are derived for populations with constant size and random mating schemes. The coalescence probabilities are thus constant, and set $s_{t}^{m} = s^{m}$ , $s_{t}^{p} = s^{p}$ , $q_{t}^{m} = q^{m}$ , and $q_{t}^{p} = q^{p}$ . We first consider a finite population, number of males $N_{m} \geq 1$ and number of females $N_{f} \geq 3$ , so that all the two- and three-gene non-IBD probabilities exist. Then consider an example of small population size, a sibling-mating population with one male and one female ( $N_{f} = N_{m} = 1$ ), where the non-IBD probabilities $β^{p p} (12)$ , $β^{m m m} (123)$ , $β^{m m p} (123)$ , $β^{m p p} (123)$ , and $β^{p p p} (123)$ do not exist. Lastly, we consider a large population under the limit that the size $N_{f} = N_{m} ≫ 3$ .

A1 Finite population

Definition A1.1. The finite population refers to a population of constant number $N_{m} \geq 1$ of males and number $N_{f} \geq 3$ of females, maintained by random mating, and the initial population satisfies $α_{0}^{m p} (12) = β_{0}^{m p} (12)$ , $α_{0}^{m m p} (123) = β_{0}^{m m p} (123)$ , and $α_{0}^{m p p} (123) = β_{0}^{m p p} (123)$ .

Proposition A1.2. Denote by $β_{t} (12) = {(β_{t}^{m m} (12), β_{t}^{m p} (12), β_{t}^{p p} (12))}^{T}$ the two-gene non-IBD probability in a finite population. According to equations (4a–4c), it holds

β_{t} (12) = T_{12} β_{t - 1} (12)

(A1.1)

where

T_{12} = (\begin{matrix} \frac{1}{4} (1 - s^{m}) & \frac{1}{2} & \frac{1}{4} (1 - s^{m}) \\ \frac{1}{2} & \frac{1}{2} & 0 \\ 1 - s^{p} & 0 & 0 \end{matrix}) .

(A1.2)

Premise A1.3. The eigenvalues of $T_{12}$ in a finite population, denoted by $λ_{k}$ , $k \in {1, 2, 3}$ in the decreasing order of their absolute values, are distinct with multiplicities 1, and none of them is 0, 1, or $- 1 / 2$ .

Theorem A1.4. The two-gene non-IBD probability $β_{t} (12)$ in a finite population is given by

β_{t} (12) = P Λ (λ_{1 - 3}^{t}) {(C_{1 - 3})}^{T},

(A1.3)

where

P = [2 λ_{1 - 3} - 1, 1, (1 - s^{p}) (2 λ_{1 - 3} - 1) ⊘ λ_{1 - 3}],

(A1.4)

{(C_{1 - 3})}^{T} = P^{- 1} β_{0} (12) .

(A1.5)

Proof. It holds

β_{t}^{m m} (12) = B_{1 - 3} {(λ_{1 - 3}^{t})}^{T},

(A1.6a)

β_{t}^{m p} (12) = C_{1 - 3} {(λ_{1 - 3}^{t})}^{T},

(A1.6b)

β_{t}^{p p} (12) = D_{1 - 3} {(λ_{1 - 3}^{t})}^{T},

(A1.6c)

where the constant coefficients $B_{1 - 3}$ , $C_{1 - 3}$ , and $D_{1 - 3}$ are to be solved. Substituting $β_{t}^{m m} (12)$ and $β_{t}^{m p} (12)$ of equations (A1.6a, A1.6b) into the recurrence equation (4b), we obtain

B_{k} = (2 λ_{k} - 1) C_{k}, k \in {1, 2, 3} .

(A1.7)

Substituting $β_{t}^{m m} (12)$ and $β_{t}^{p p} (12)$ of equations (A1.6a, A1.6c) into the recurrence equation (4c), we obtain

D_{k} = (1 - s^{p}) \frac{B_{k}}{λ_{k}}, k \in {1, 2, 3},

(A1.8)

Substituting $B_{k}$ and $D_{k}$ of equations (A1.7, A1.8) into equation (A1.6a–A1.6c), we obtain

β_{t} (12) = P [C_{1 - 3} {(λ_{1 - 3}^{t})}^{T}],

(A1.9)

where $P$ is given by in equation (A1.4), and the constant coefficient $C_{1 - 3}$ is determined by the initial condition $β_{0} (12)$ and it is given by equation (A1.5).

Proposition A1.5. Denote by $β_{t} (123) = {(β_{t}^{m m m} (123), β_{t}^{m m p} (123), β_{t}^{m p p} (123), β_{t}^{p p p} (123))}^{T}$ the three-gene non-IBD probability in a finite population. According to equations (5a–5d), it holds

β_{t} (123) = T_{123} β_{t - 1} (123)

(A1.10)

where

T_{123} = (\begin{matrix} \frac{1}{8} (1 - s^{m} - 2 q^{m}) & \frac{3}{8} (1 - s^{m}) & \frac{3}{8} (1 - s^{m}) & \frac{1}{8} (1 - s^{m} - 2 q^{m}) \\ \frac{1}{4} (1 - s^{m}) & \frac{1}{2} & \frac{1}{4} (1 - s^{m}) & 0 \\ \frac{1}{2} (1 - s^{p}) & \frac{1}{2} (1 - s^{p}) & 0 & 0 \\ 1 - s^{p} - 2 q^{p} & 0 & 0 & 0 \end{matrix}) .

(A1.11)

Premise A1.6. The eigenvalues of $T_{123}$ in a finite population, denoted by $λ_{k}, k \in {4, 5, 6, 7}$ in the decreasing order of their absolute values, are distinct with multiplicities 1, and none of them is 0, 1, $- 1 / 2$ , or $λ_{k}, k \in {1, 2, 3}$ .

Theorem A1.7. The three-gene non-IBD probability $β_{t} (123)$ in a finite population is given by

β_{t} (123) = Q Λ (λ_{4 - 7}^{t}) {(C_{4 - 7})}^{T},

(A1.12)

where

Q = [a_{4 - 7}, 1, (1 - s^{p}) (a_{4 - 7} + 1) ⊘ (2 λ_{4 - 7}), (1 - s^{p} - 2 q^{p}) a_{4 - 7} ⊘ λ_{4 - 7}],

(A1.13)

{(C_{4 - 7})}^{T} = Q^{- 1} β_{0} (123) .

(A1.14)

and for $k \in {4, 5, 6, 7}$

a_{k} = \frac{8 {(λ_{k})}^{2} - 4 λ_{k} - (1 - s^{m}) (1 - s^{p})}{(1 - s^{m}) (1 - s^{p} + 2 λ_{k})} .

(A1.15)

Proof. It holds

β_{t}^{m m m} (123) = A_{4 - 7} {(λ_{4 - 7}^{t})}^{T},

(A1.16a)

β_{t}^{m m p} (123) = C_{4 - 7} {(λ_{4 - 7}^{t})}^{T},

(A1.16b)

β_{t}^{m p p} (123) = B_{4 - 7} {(λ_{4 - 7}^{t})}^{T},

(A1.16c)

β_{t}^{p p p} (123) = D_{4 - 7} {(λ_{4 - 7}^{t})}^{T},

(A1.16d)

where the constant coefficients $A_{4 - 7}$ , $B_{4 - 7}$ , $C_{4 - 7}$ , and $D_{4 - 7}$ are to be solved. Substituting $β_{t}^{m m m} (123)$ and $β_{t}^{p p p} (123)$ of equations (A1.16a, A1.16d) into the recurrence equation (5d), we obtain

D_{k} = (1 - s^{p} - 2 q^{p}) \frac{A_{k}}{λ_{k}}, k \in {4, 5, 6, 7} .

(A1.17)

Substituting $β_{t}^{m m m} (123)$ , $β_{t}^{m m p} (123)$ , and $β_{t}^{m p p} (123)$ of equations (A1.16a–A1.16c) into the recurrence equation (5c), we obtain

B_{k} = \frac{(1 - s^{p}) (A_{k} + C_{k})}{2 λ_{k}}, k \in {4, 5, 6, 7} .

(A1.18)

Substituting $β_{t}^{m m m} (123)$ , $β_{t}^{m m p} (123)$ , and $β_{t}^{m p p} (123)$ of equations (A1.16a–A1.16c) into the recurrence equation (5b), and substituting $B_{k}$ of equation (A1.18), we obtain

A_{k} = a_{k} C_{k}, k \in {4, 5, 6, 7},

(A1.19)

where $a_{k}$ is given by equation (A1.15). Substituting $A_{k}$ , $B_{k}$ , and $D_{k}$ of equations (A1.17–A1.19) into equations (A1.16a–A1.16d), we obtain

β_{t} (123) = Q [C_{4 - 7} {(λ_{4 - 7}^{t})}^{T}],

(A1.20)

where $Q$ is given by equation (A1.13), and the constant coefficient $C_{4 - 7}$ is determined by the initial condition $β_{0} (123)$ and it is given by equation (A1.14).

Theorem A1.8. The map expansions in a finite population are given by

R_{t}^{X} = C_{8} - \frac{2}{3} [C_{1 - 3} ⊘ (1 - λ_{1 - 3})] {(λ_{1 - 3}^{t})}^{T},

(A1.21)

R_{t}^{m} = C_{8} + C_{9} {(- \frac{1}{2})}^{t} + C_{10 - 12} {(λ_{1 - 3}^{t})}^{T},

(A1.22)

R_{t}^{p} = C_{8} - 2 C_{9} {(- \frac{1}{2})}^{t} + C_{10 - 12} {(λ_{1 - 3}^{t - 1})}^{T},

(A1.23)

where $C_{1 - 3}$ is given by equation (A1.5), and

C_{8} = R_{0}^{X} + \frac{2}{3} [C_{1 - 3} ⊘ (1 - λ_{1 - 3})],

(A1.24)

C_{9} = R_{0}^{m} - C_{8} - C_{10 - 12} 1^{T},

(A1.25)

C_{10 - 12} = - 2 (C_{1 - 3} ⊙ λ_{1 - 3}) ⊘ [(1 - λ_{1 - 3}) ⊙ (2 λ_{1 - 3} + 1)] .

(A1.26)

Proof. According to equation (7a), $R_{t}^{X}$ can be obtained from the accumulative summation of the non-IBD probability $β_{t}^{m p}$ of equation (A1.6b), and we have

R_{t}^{X} = R_{0}^{X} + \frac{2}{3} \sum_{k = 1}^{3} C_{k} \frac{1 - {(λ_{k})}^{t}}{1 - λ_{k}},

(A1.27)

which is equivalent to equation (A1.21) with the stationary map expansion $C_{8}$ being given by equation (A1.24). The eigenvalues for the transition matrix of the linear recurrence equations (4a–4c) and equations (6a, 6b) are 1, $- 1 / 2$ , $λ_{1 - 3}$ , and thus the map expansions $R^{m}$ and $R^{p}$ can be expressed in the forms of equations (A1.22, A1.23), where the constant coefficients $C_{10 - 12}$ are determined by calculating $R_{t}^{X} = 2 R_{t}^{m} / 3 + R_{t}^{p} / 3$ from equations (A1.22, A1.23) and comparing the result with equation (A1.21), and $C_{9}$ is determined by the initial condition $R_{0}^{m}$ . □

Theorem A1.9. Denote by $K_{t} (1122) = {(K_{t}^{m m} (1122), K_{t}^{m p} (1122), K_{t}^{p p} (1122))}^{T}$ the expected density of junction type $(1122)$ in a finite population, and it holds

\begin{matrix} K_{t} (1122) = C_{8} + {(1, - 1 / 2, - 2)}^{T} C_{9} {(- \frac{1}{2})}^{t} + W Λ (λ_{1 - 3}^{t}) {(C_{18 - 20})}^{T} \\ + P Λ (λ_{1 - 3}^{t}) [{(C_{15 - 17})}^{T} + {(C_{18 - 20})}^{T} t] \end{matrix}

(A1.28)

where $C_{8 - 12}$ are given by equations (A1.24–A1.26), $P$ is given by equation (A1.4),

W = [2 λ_{1 - 3}, 0, (s^{p} Φ_{1 - 3}^{- 1} + (1 - s^{p})) ⊘ λ_{1 - 3}],

(A1.29)

{(C_{15 - 17})}^{T} = P^{- 1} [K_{0} (1122) - C_{8} - {(1, - 1 / 2, - 2)}^{T} C_{9} - W {(C_{18 - 20})}^{T}],

(A1.30)

C_{18 - 20} = Φ_{1 - 3} ⊙ C_{10 - 12},

(A1.31)

and for $k \in {1, 2, 3}$

Φ_{k} = \frac{s^{m} (λ_{k} + 1) + (1 - s^{m}) s^{p}}{(6 - 2 s^{m}) λ_{k}^{2} + (6 - 2 s^{m} - 4 s^{p} + 4 s^{m} s^{p}) λ_{k} - 3 (1 - s^{m}) (1 - s^{p})} .

(A1.32)

Proof. The eigenvalues for the transition matrix of the linear recurrence equations (4a–4c), equations (6a, 6b), and equations (9a–9c) are 1, $- 1 / 2$ , and duplicated $λ_{1 - 3}$ . It holds

K_{t}^{m m} (1122) = B_{13} + B_{14} {(- \frac{1}{2})}^{t} + B_{15 - 17} {(λ_{1 - 3}^{t})}^{T} + B_{18 - 20} t {(λ_{1 - 3}^{t})}^{T},

(A1.33a)

K_{t}^{m p} (1122) = C_{13} + C_{14} {(- \frac{1}{2})}^{t} + C_{15 - 17} {(λ_{1 - 3}^{t})}^{T} + C_{18 - 20} t {(λ_{1 - 3}^{t})}^{T},

(A1.33b)

K_{t}^{p p} (1122) = D_{13} + D_{14} {(- \frac{1}{2})}^{t} + D_{15 - 17} {(λ_{1 - 3}^{t})}^{T} + D_{18 - 20} t {(λ_{1 - 3}^{t})}^{T},

(A1.33c)

where the constant coefficients $B_{13 - 20}$ , $C_{13 - 20}$ , and $D_{13 - 20}$ are to be solved. Substituting $K_{t}^{m m} (1122)$ and $K_{t}^{m p} (1122)$ of equations (A1.33a, A1.33b) into the recurrence equation (9b), we obtain

\begin{matrix} B_{13} = C_{13}, \\ B_{14} = - 2 C_{14}, \\ B_{14 + k} = (2 λ_{k} - 1) C_{14 + k} + 2 λ_{k} C_{17 + k}, k \in {1, 2, 3}, \\ B_{17 + k} = (2 λ_{k} - 1) C_{17 + k}, k \in {1, 2, 3} . \end{matrix}

(A1.34)

Substituting $K_{t}^{m m} (1122)$ and $K_{t}^{p p} (1122)$ of equations (A1.33a, A1.33c) and $R_{t}^{m}$ of equation (A1.22) into the recurrence equation (9c), and substituting $B_{13 - 20}$ of equation (A1.34), we obtain

\begin{matrix} D_{13} = C_{13} + s^{p} (C_{8} - C_{13}), \\ D_{14} = - 2 s^{p} C_{9} + 4 (1 - s^{p}) C_{14}, \\ D_{14 + k} = (1 - s^{p}) \frac{2 λ_{k} - 1}{λ_{k}} C_{14 + k} + s^{p} \frac{C_{9 + k}}{λ_{k}} + (1 - s^{p}) \frac{C_{17 + k}}{λ_{k}}, k \in {1, 2, 3}, \\ D_{17 + k} = (1 - s^{p}) \frac{2 λ_{k} - 1}{λ_{k}} C_{17 + k}, k \in {1, 2, 3}, \end{matrix}

(A1.35)

Substituting $K_{t}^{m p} (1122)$ , $K_{t}^{m m} (1122)$ and $K_{t}^{p p} (1122)$ of equations (A1.33a–A1.33c) and $R_{t}^{m}$ and $R_{t}^{p}$ of equations (A1.22, A1.23) into the recurrence equation (9a), and substituting $B_{13 - 20}$ and $D_{13 - 20}$ of equations (A1.34, A1.35), we obtain $C_{18 - 20}$ in equation (A1.31) and

C_{13} = C_{8},

(A1.36a)

C_{14} = - \frac{1}{2} C_{9} .

(A1.36b)

The constant coefficient $C_{15 - 17}$ is determined by the initial condition $K_{0} (1122)$ .

Definition A1.10. Denote by

K_{t}^{m p +} (1232) = \frac{K_{t}^{m p} (1232) + K_{t}^{p m} (1232)}{2},

(A1.37)

K_{t}^{m p -} (1232) = \frac{K_{t}^{m p} (1232) - K_{t}^{p m} (1232)}{2},

(A1.38)

and thus

K_{t}^{m p} (1232) = K_{t}^{m p +} (1232) + K_{t}^{m p -} (1232),

(A1.39)

K_{t}^{p m} (1232) = K_{t}^{m p +} (1232) - K_{t}^{m p -} (1232) .

(A1.40)

Proposition A1.11. According to Definition A1.10, the recurrence equations (10a–10d) in a finite population are transformed into

\begin{matrix} K_{t}^{m m} (1232) = (1 - s^{m}) \frac{1}{4} K_{t - 1}^{m m} (1232) + \frac{1}{2} K_{t - 1}^{m p +} (1232) + (1 - s^{m}) \frac{1}{4} K_{t - 1}^{p p} (1232) \\ + (1 - s^{m}) \frac{1}{2} [β_{t - 1}^{m m p} (123) + β_{t - 1}^{m p p} (123)], \end{matrix}

(A1.41a)

K_{t}^{m p +} (1232) = \frac{1}{2} K_{t - 1}^{m m} (1232) + \frac{1}{2} K_{t - 1}^{m p +} (1232) + \frac{1}{2} β_{t - 1}^{m m p} (123),

(A1.41b)

K_{t}^{p p} (1232) = (1 - s^{p}) K_{t - 1}^{m m} (1232),

(A1.41c)

K_{t}^{m p -} (1232) = - \frac{1}{2} K_{t - 1}^{m p -} (1232) + \frac{1}{2} β_{t - 1}^{m m p} (123) .

(A1.41d)

Theorem A1.12. Denote by $K_{t} (1232) = {(K_{t}^{m m} (1232), K_{t}^{m p +} (1232), K_{t}^{p p} (1232))}^{T}$ the expected density of junction type $(1232)$ in a finite population, and it holds

K_{t} (1232) = P Λ (λ_{1 - 3}^{t}) {(C_{21 - 23})}^{T} + W Λ (λ_{4 - 7}^{t}) {(C_{24 - 27})}^{T},

(A1.42)

where $P$ is given by equation (A1.4),

W = [- Φ_{4 - 7}^{- 1} + (2 λ_{4 - 7} - 1), 1, (1 - s^{p}) (- Φ_{4 - 7}^{- 1} + (2 λ_{4 - 7} - 1)) ⊘ λ_{4 - 7}],

(A1.43)

{(C_{21 - 23})}^{T} = P^{- 1} [K_{0} (1232) - W {(C_{24 - 27})}^{T}],

(A1.44)

C_{24 - 27} = Φ_{4 - 7} ⊙ C_{4 - 7},

(A1.45)

and for $k \in {4, 5, 6, 7}$

Φ_{k} = \frac{\frac{1}{2} {(λ_{k})}^{2} + \frac{1}{8} [(1 - s^{p}) (a_{k} + 1) / λ_{k} + 1] (1 - s^{m}) λ_{k} - \frac{1}{8} (1 - s^{m}) (1 - s^{p})}{f_{12} (λ_{k})},

(A1.46)

where $f_{12} (x)$ is the characteristic polynomial of the transition matrix $T_{12}$ of equation (A1.2).

Proof. The eigenvalues for the transition matrix of the linear recurrence equations (5a–5d) and equations (A1.41a–A1.41c) are $λ_{k}$ , $1 \leq k \leq 7$ . It holds

K_{t}^{m m} (1232) = B_{21 - 27} {(λ_{1 - 7}^{t})}^{T},

(A1.47a)

K_{t}^{m p +} (1232) = C_{21 - 27} {(λ_{1 - 7}^{t})}^{T},

(A1.47b)

K_{t}^{p p} (1232) = D_{21 - 27} {(λ_{1 - 7}^{t})}^{T},

(A1.47c)

where the constant coefficients $B_{21 - 27}$ , $C_{21 - 27}$ , and $D_{21 - 27}$ are to be solved. Substituting $K_{t}^{m m} (1232)$ and $K_{t}^{p p} (1232)$ of equations (A1.47a, A1.47c) into the recurrence equation (A1.41c), we obtain

D_{20 + k} = (1 - s^{p}) \frac{B_{20 + k}}{λ_{k}}, 1 \leq k \leq 7,

(A1.48)

Substituting $K_{t}^{m p +} (1232)$ and $K_{t}^{m m} (1232)$ of equations (A1.47a, A1.47b) and $β_{t}^{m m p}$ of equation (A1.16b) into the recurrence equation (A1.41b), we obtain

B_{20 + k} = {\begin{array}{l} (2 λ_{k} - 1) C_{20 + k} & k \in {1, 2, 3}, \\ - C_{k} + (2 λ_{k} - 1) C_{20 + k} & k \in {4, 5, 6, 7} . \end{array}

(A1.49)

The theorem follows by substituting $K_{t}^{m m} (1232)$ , $K_{t}^{m p +} (1232)$ and $K_{t}^{p p} (1232)$ of equations (A1.47a–A1.47c) and $β_{t}^{m m p}$ and $β_{t}^{m p p}$ of equations (A1.16b, A1.16c) into the recurrence equation (A1.41a), and substituting $B_{21 - 27}$ and $D_{21 - 27}$ of equations (A1.48, A1.49), where the constant coefficient $C_{21 - 23}$ in equation (A1.44) is determined by the initial condition $K_{0} (1232)$ . □

Theorem A1.13. The expected density $K_{t}^{m p -} (1232)$ is given by

K_{t}^{m p -} (1232) = A_{28} {(- \frac{1}{2})}^{t} + A_{24 - 27} {(λ_{4 - 7}^{t})}^{T},

(A1.50)

where

A_{24 - 27} = C_{4 - 7} ⊘ (2 λ_{4 - 7} + 1),

(A1.51)

A_{28} = K_{0}^{m p -} (1232) - A_{24 - 27} 1^{T} .

(A1.52)

Proof. The eigenvalues of the transition matrix of the linear recurrence equations (5a–5d) and equation (A1.41d): $- 1 / 2$ , $λ_{k}$ , $k \in {4, 5, 6, 7}$ . Thus equation (A1.50) holds, where $A_{24 - 27}$ is determined by putting equations (A1.50, A1.16b) into equation (A1.41d), and the constant coefficient $A_{28}$ is determined by the initial condition $K_{0}^{m p -} (1232)$ . □

Corollary A1.14. The overall expected junction density $ρ_{t}^{m p}$ of a female in a finite population is given by

ρ_{t}^{m p} = C_{8} - \frac{1}{2} C_{9} {(- \frac{1}{2})}^{t} + [C_{10 - 12} ⊙ (1 + λ_{1 - 3}^{- 1}) - C_{15 - 17} - C_{18 - 20} t] {(λ_{1 - 3}^{t})}^{T},

(A1.53)

where $C_{8 - 12}$ are given by equations (A1.24–A1.26), and $C_{15 - 20}$ by equations (A1.30, A1.31).

Proof. The corollary follows by substituting $R_{t}^{m}$ and $R_{t}^{p}$ of equations (A1.22, A1.23) and $K_{t}^{m p} (1122)$ of equation (A1.33b) into equation (3).

A2 Sibling-mating population

Definition A2.1. The sibling-mating population refers to a population of constant size 2, one male and one female, maintained by sibling mating, and the initial population satisfies $α_{0}^{m p} (12) = β_{0}^{m p} (12)$ . In such a population, the coalescence probability $s^{m} = 1$ , and the coalescence probabilities $s^{p}$ , $q^{m}$ , $q^{p}$ are set to zero.

Definition A2.2. In a sibling-mating population, the non-IBD probabilities $β_{t}^{p p} (12)$ , $β_{t}^{m m m} (123)$ , $β_{t}^{m m p} (123)$ , $β_{t}^{m p p} (123)$ , and $β_{t}^{p p p} (123)$ are set to zero since they do not exist. Similarly the expected junction densities $K^{p p} (1122)$ and $K^{p p} (1232)$ are set to zero.

Proposition A2.3. Denote by $β_{t} (12) = (β_{t}^{m m} (12), β_{t}^{m p} (12)$ the two-gene non-IBD probability in a sibling-mating population. According to equations (4a, 4b), it holds

T_{12} = (\begin{matrix} 0 & \frac{1}{2} \\ \frac{1}{2} & \frac{1}{2} \end{matrix}),

(A2.1)

and its eigenvalues are given by

λ_{1} = \frac{1 + \sqrt{5}}{4}, λ_{2} = \frac{1 - \sqrt{5}}{4} .

(A2.2)

Definition A2.4. In a sibling-mating population, the conjugate is obtained by replacing $\sqrt{5}$ with $- \sqrt{5}$ from the terms involving $λ_{1}$ .

Theorem A2.5. The two-gene non-IBD probability $β_{t} (12)$ in a sibling-mating population is given by

β_{t}^{m m} (12) = [\frac{5 - \sqrt{5}}{10} β_{0}^{m m} (12) + \frac{\sqrt{5}}{5} β_{0}^{m p} (12)] {(λ_{1})}^{t} + c o n j u g a t e

(A2.3)

β_{t}^{m p} (12) = [\frac{\sqrt{5}}{5} β_{0}^{m m} (12) + \frac{5 + \sqrt{5}}{10} β_{0}^{m p} (12)] {(λ_{1})}^{t} + c o n j u g a t e

(A2.4)

Proof. Similar to Theorem A1.4, it holds

β_{t} (12) = P Λ (λ_{1 - 2}^{t}) {(C_{1 - 2})}^{T},

(A2.5)

where

P = [2 λ_{1 - 2} - 1, 1],

(A2.6)

{(C_{1 - 2})}^{T} = P^{- 1} β_{0} (12) .

(A2.7)

The theorem follows by substituting $P$ an $C_{1 - 2}$ into equation (A2.5). □

Theorem A2.6. The three-gene non-IBD probability in a sibling-mating population is given by

α_{t}^{m m p} (123) = α_{0}^{m m p} (123) {(\frac{1}{2})}^{t} .

(A2.8)

Proof. According to the recurrence equations (5b, 5e),

α_{t}^{m m p} (123) = \frac{1}{2} α_{t - 1}^{m m p} (123) .

(A2.9)

Thus it holds

α_{t}^{m m p} (123) = C_{4} {(λ_{4})}^{t},

(A2.10)

where

λ_{4} = 1 / 2,

(A2.11)

C_{4} = α_{0}^{m m p} (123) .

(A2.12)

Theorem A2.7. The map expansions in a sibling-mating population are given by

R_{t}^{X} = C_{8} - [\frac{10 + 6 \sqrt{5}}{15} β_{0}^{m m} (12) + \frac{20 + 8 \sqrt{5}}{15} β_{0}^{m p} (12)] {(λ_{1})}^{t} + c o n j u g a t e,

(A2.13)

R_{t}^{m} = C_{8} + C_{9} {(- \frac{1}{2})}^{t} - [\frac{5 + \sqrt{5}}{5} β_{0}^{m m} (12) + \frac{5 + 3 \sqrt{5}}{5} β_{0}^{m p} (12)] {(λ_{1})}^{t} + c o n j u g a t e

(A2.14)

R_{t}^{p} = C_{8} - 2 C_{9} {(- \frac{1}{2})}^{t} - [\frac{4 \sqrt{5}}{5} β_{0}^{m m} (12) + \frac{10 + 2 \sqrt{5}}{5} β_{0}^{m p} (12)] {(λ_{1})}^{t} + c o n j u g a t e

(A2.15)

where $C_{8}$ and $C_{9}$ are given by

C_{8} = \frac{1}{3} (2 R_{0}^{m} + R_{0}^{p}) + \frac{4}{3} (β_{0}^{m m} (12) + 2 β_{0}^{m p} (12)),

(A2.16)

C_{9} = \frac{1}{3} (R_{0}^{m} - R_{0}^{p}) + \frac{2}{3} (β_{0}^{m m} (12) - β_{0}^{m p} (12)),

(A2.17)

Proof. Similar to Theorem A1.8, the map expansions in a sibling-mating population are given by

R_{t}^{X} = C_{8} - \frac{2}{3} [C_{1 - 2} ⊘ (1 - λ_{1 - 2})] {(λ_{1 - 2}^{t})}^{T},

(A2.18)

R_{t}^{m} = C_{8} + C_{9} {(- \frac{1}{2})}^{t} + C_{10 - 11} {(λ_{1 - 2}^{t})}^{T},

(A2.19)

R_{t}^{p} = C_{8} - 2 C_{9} {(- \frac{1}{2})}^{t} + C_{10 - 11} {(λ_{1 - 2}^{t - 1})}^{T},

(A2.20)

where

C_{8} = R_{0}^{X} + \frac{2}{3} [C_{1 - 2} ⊘ (1 - λ_{1 - 2})],

(A2.21)

C_{9} = R_{0}^{m} - C_{8} - C_{10 - 11} 1^{T},

(A2.22)

C_{10 - 11} = - 2 (C_{1 - 2} ⊙ λ_{1 - 2}) ⊘ [(1 - λ_{1 - 2}) ⊙ (2 λ_{1 - 2} + 1)] .

(A2.23)

The theorem follows by substituting $λ_{1 - 2}$ of equation (A2.2) and $C_{1 - 2}$ of equation (A2.7) into the aforementioned equations.

Theorem A2.8. The expected density $K_{t} (1122) = {(K_{t}^{m m} (1122), K_{t}^{m p} (1122))}^{T}$ in a sibling-mating population is given by

\begin{matrix} K_{t}^{m m} (1122) = C_{8} + C_{9} {(- \frac{1}{2})}^{t} \\ + [- \frac{5 + \sqrt{5}}{5} β_{0}^{m m} (12) - \frac{5 + 4 \sqrt{5}}{5} β_{0}^{m p} (12)] {(λ_{1})}^{t} \\ + [\frac{5 - \sqrt{5}}{10} K_{0}^{m m} (1122) + \frac{\sqrt{5}}{5} K_{0}^{m p} (1122) - \frac{1}{2} R_{0}^{m} - \frac{\sqrt{5}}{10} R_{0}^{p}] {(λ_{1})}^{t} \\ + [- \frac{5 - \sqrt{5}}{10} β_{0}^{m m} (12) - \frac{\sqrt{5}}{5} β_{0}^{m p} (12)] t {(λ_{1})}^{t} + c o n j u g a t e \end{matrix}

(A2.24)

\begin{matrix} K_{t}^{m p} (1122) = C_{8} - \frac{1}{2} C_{9} {(- \frac{1}{2})}^{t} \\ + [- \frac{5 + 3 \sqrt{5}}{10} β_{0}^{m m} (12) - \frac{3 + \sqrt{5}}{2} β_{0}^{m p} (12)] {(λ_{1})}^{t} \\ + [\frac{\sqrt{5}}{5} K_{0}^{m m} (1122) + \frac{5 + \sqrt{5}}{10} K_{0}^{m p} (1122) - \frac{1 + \sqrt{5}}{4} R_{0}^{m} - \frac{5 + \sqrt{5}}{20} R_{0}^{p}] {(λ_{1})}^{t} \\ + [- \frac{\sqrt{5}}{5} β_{0}^{m m} (12) - \frac{5 + \sqrt{5}}{10} β_{0}^{m p} (12)] t {(λ_{1})}^{t} + c o n j u g a t e \end{matrix}

(A2.25)

where $C_{8}$ and $C_{9}$ are given by equations (A2.16, A2.17), respectively.

Proof. Similar to Theorem A1.9, the expected density $K_{t} (1122)$ in a sibling-mating population is given by

\begin{matrix} K_{t} (1122) = C_{8} + {(1, - 1 / 2)}^{T} C_{9} {(- \frac{1}{2})}^{t} + W Λ (λ_{1 - 2}^{t}) {(C_{18 - 19})}^{T} \\ + P Λ (λ_{1 - 2}^{t}) [{(C_{15 - 16})}^{T} + {(C_{18 - 19})}^{T} t] \end{matrix}

(A2.26)

where $C_{8}$ and $C_{9}$ are given by equations (A2.16, A2.17), respectively, and $P$ is given by equation (A2.6), and

W = [2 λ_{1 - 2}, 0],

(A2.27)

{(C_{15 - 16})}^{T} = P^{- 1} [K_{0} (1122) - C_{8} - {(1, - 1 / 2)}^{T} C_{9} - W {(C_{18 - 19})}^{T}],

(A2.28)

C_{18 - 19} = Φ_{1 - 2} ⊙ C_{10 - 11},

(A2.29)

and for $k \in {1, 2}$

Φ_{k} = \frac{1}{4 λ_{k}} .

(A2.30)

The theorem follows by substituting $λ_{1 - 2}$ of equation (A2.2) and $C_{10 - 11}$ of equation (A2.23).

Theorem A2.9. The expected densities of type (1232) in a sibling-mating population are given by

\begin{matrix} K_{t}^{m m} (1232) = [\frac{5 - \sqrt{5}}{10} K_{0}^{m m} (1232) + \frac{\sqrt{5}}{5} K_{0}^{m p +} (1232) + \frac{5 + \sqrt{5}}{10} α_{0}^{m m p} (123)] {(λ_{1})}^{t} \\ - α_{0}^{m m p} (123) {(\frac{1}{2})}^{t} + c o n j u t a t e \end{matrix}

(A2.31)

\begin{matrix} K_{t}^{m p +} (1232) = [\frac{\sqrt{5}}{5} K_{0}^{m m} (1232) + \frac{5 + \sqrt{5}}{10} K_{0}^{m p +} (1232) + \frac{5 + 3 \sqrt{5}}{10} α_{0}^{m m p} (123)] {(λ_{1})}^{t} \\ - α_{0}^{m m p} (123) {(\frac{1}{2})}^{t} + c o n j u t a t e, \end{matrix}

(A2.32)

K_{t}^{m p -} (1232) = [K_{0}^{m p -} (1232) - \frac{1}{2} α_{0}^{m m p} (123)] {(- \frac{1}{2})}^{t} + \frac{1}{2} α_{0}^{m m p} (123) {(\frac{1}{2})}^{t}

(A2.33)

Proof. Denote by $K_{t} (1232) = {(K_{t}^{m m} (1232), K_{t}^{m p +} (1232))}^{T}$ . Similar to Theorem A1.12, the expected density $K_{t} (1232)$ in a sibling-mating population is given by

K_{t} (1232) = P Λ (λ_{1 - 2}^{t}) {(C_{21 - 22})}^{T} + W Λ (λ_{4 - 4}^{t}) {(C_{24 - 24})}^{T},

(A2.34)

where $P$ is given by equation (A2.6), and

W = [- Φ_{4 - 4}^{- 1} + (2 λ_{4 - 4} - 1), 1],

(A2.35)

{(C_{21 - 22})}^{T} = P^{- 1} [K_{0} (1232) - W {(C_{24 - 24})}^{T}],

(A2.36)

C_{24 - 24} = Φ_{4 - 4} ⊙ C_{4 - 4},

(A2.37)

and for $k = 4$

Φ_{k} = \frac{\frac{1}{2} λ_{k}}{f_{12} (λ_{k})},

(A2.38)

where $f_{12} (x)$ is the characteristic polynomial of the transition matrix $T_{12}$ of equation (A2.1). The equations (A2.31, A2.32) follow by substituting $λ_{1 - 2}$ of equation (A2.2), $λ_{4}$ of equation (A2.11), and $C_{4}$ of equation (A2.12).

Similar to Theorem A1.13, the expected density $K_{t}^{m p -} (1232)$ in a sibling-mating population is given by

K_{t}^{m p -} (1232) = A_{28} {(- \frac{1}{2})}^{t} + A_{24 - 24} {(λ_{4 - 4}^{t})}^{T}

(A2.39)

where

A_{24 - 24} = C_{4 - 4} ⊘ (2 λ_{4 - 4} + 1),

(A2.40)

A_{28} = K_{0}^{m p -} (1232) - A_{24 - 24} .

(A2.41)

Corollary A2.10. The overall expected junction density $ρ_{t}^{m p}$ in a sibling-mating population is given by

\begin{matrix} ρ_{t}^{m p} = C_{8} - \frac{1}{2} C_{9} {(- \frac{1}{2})}^{t} \\ + [- \frac{5 + 7 \sqrt{5}}{10} β_{0}^{m m} (12) - \frac{3 + \sqrt{5}}{2} β_{0}^{m p} (12)] {(λ_{1})}^{t} \\ + [- \frac{\sqrt{5}}{5} K_{0}^{m m} (1122) - \frac{5 + \sqrt{5}}{10} K_{0}^{m p} (1122) + \frac{1 + \sqrt{5}}{4} R_{0}^{m} + \frac{5 + \sqrt{5}}{20} R_{0}^{p}] {(λ_{1})}^{t} \\ + [\frac{\sqrt{5}}{5} β_{0}^{m m} (12) + \frac{5 + \sqrt{5}}{10} β_{0}^{m p} (12)] t {(λ_{1})}^{t} + c o n j u t a t e, \end{matrix}

(A2.42)

where $C_{8}$ and $C_{9}$ are given by equations (A2.16, A2.17), respectively.

Proof. The corollary follows by substituting $R_{t}^{m}$ and $R_{t}^{p}$ of equations (A2.14, A2.15) and $K_{t}^{m p} (1122)$ of equation (A2.25) into equation (3).

A3 Large population

Definition A3.1. The large population refers to a population with large and equal number $N_{m} = N_{f} ≫ 3$ of males and females, maintained by random mating, and the initial population satisfies $α_{0}^{m p} (12) = β_{0}^{m p} (12)$ , $α_{0}^{m m p} (123) = β_{0}^{m m p} (123)$ , and $α_{0}^{m p p} (123) = β_{0}^{m p p} (123)$ . In such a large population, the coalescence probabilities are set to $s^{m} = s^{q} = q^{m} = q^{p} = s$ .

Proposition A3.2. Denote by $β_{t} (12) = (β_{t}^{m m} (12), β_{t}^{m p} (12), β_{t}^{p p} (12))$ the two-gene non-IBD probability in a large population, and the eigenvalues of the transition matrix $T_{12}$ in equation (A1.2) are given by

λ_{1} = 1 - \frac{s}{3}, λ_{2} = - \frac{1}{2}, λ_{3} = \frac{1}{4} .

(A3.1)

where $λ_{1}$ is approximated to the first order of s to keep it smaller than 1, and $λ_{2}$ and $λ_{3}$ are approximated to zero order of s.

Theorem A3.3. The two-gene non-IBD probabilities in a large population are given by

β_{t} (12) = P Λ (λ_{1 - 3}^{t}) {(C_{1 - 3})}^{T},

(A3.2)

where

P = [\begin{matrix} 1 & - 2 & - 1 / 2 \\ 1 & 1 & 1 \\ 1 & 4 & - 2 \end{matrix}],

(A3.3)

{(C_{1 - 3})}^{T} = \frac{1}{9} [\begin{matrix} 4 & 4 & 1 \\ - 2 & 1 & 1 \\ - 2 & 4 & - 2 \end{matrix}] β_{0} (12) .

(A3.4)

(A3.5)

Proof. The theorem follows directly from Theorem A1.4, where

P = [2 λ_{1 - 3} - 1, 1, (1 - s) (2 λ_{1 - 3} - 1) ⊘ λ_{1 - 3}],

(A3.6)

{(C_{1 - 3})}^{T} = P^{- 1} β_{0} (12) .

(A3.7)

The matrix $P$ is approximated to the zero order of s. □

Proposition A3.4. Denote by $β_{t} (123) = {(β_{t}^{m m m} (123), β_{t}^{m m p} (123), β_{t}^{m p p} (123), β_{t}^{p p p} (123))}^{T}$ the three-gene non-IBD probability in a large population, and the eigenvalues of the transition matrix $T_{123}$ of equation (A1.11) are given by

λ_{4} = 1 - s, λ_{5} = - \frac{1}{2}, λ_{6} = \frac{1}{4}, λ_{7} = - \frac{1}{8},

(A3.8)

where $λ_{4}$ is approximated to the first order of s to keep it smaller than 1, and $λ_{5 - 7}$ is approximated to zero order of s.

Theorem A3.5. The three-gene non-IBD probability $β_{t} (123)$ in a large population is given by

β_{t} (123) = Q Λ (λ_{4 - 7}^{t}) {(C_{4 - 7})}^{T},

(A3.9)

where

Q = [\begin{matrix} 1 & - \frac{3}{s} & - 1 & - \frac{1}{2} \\ 1 & 1 & 1 & 1 \\ 1 & \frac{3}{s} & 0 & - 2 \\ 1 & \frac{6}{s} & - 4 & 4 \end{matrix}],

(A3.10)

{(C_{4 - 7})}^{T} = Q^{- 1} β_{0} (123) .

(A3.11)

Proof. The theorem follows directly from Theorem A1.7, where

a_{k} = \frac{8 {(λ_{k})}^{2} - 4 λ_{k} - {(1 - s)}^{2}}{(1 - s) (1 - s + 2 λ_{k})}, k \in {4, 5, 6, 7},

Q = [a_{4 - 7}, 1, (1 - s) (a_{4 - 7} + 1) ⊘ (2 λ_{4 - 7}), (1 - 3 s) a_{4 - 7} ⊘ λ_{4 - 7}],

where the elements of $Q$ are approximated to be the leading order of s and up to the zero order of s. □

Theorem A3.6. The map expansions in a large population are given by

R_{t}^{X} = R_{0}^{X} + \frac{2}{3} C_{1} \frac{[1 - {(λ_{1})}^{t}]}{1 - λ_{1}} + \frac{4}{9} C_{2} [1 - {(λ_{2})}^{t}] + \frac{8}{9} C_{3} [1 - {(λ_{3})}^{t}],

(A3.12)

R_{t}^{m} = C_{8} + C_{10} \frac{1 - {(λ_{1})}^{t}}{1 - λ_{1}} + (C_{9} + C_{11} t) {(λ_{2})}^{t} + C_{12} {(λ_{3})}^{t},

(A3.13)

R_{t}^{p} = C_{8} + C_{10} \frac{1 - {(λ_{1})}^{t - 1}}{1 - λ_{1}} + [C_{9} + C_{11} (t - 1)] {(λ_{2})}^{t - 1} + C_{12} {(λ_{3})}^{t - 1},

(A3.14)

where $C_{1 - 3}$ is given by equation (A3.4), and

C_{8} = R_{0}^{X} + \frac{2}{3 (2 λ_{1} + 1)} C_{1} + \frac{4}{9} C_{2} + \frac{8}{9} C_{3},

(A3.15)

C_{9} = R_{0}^{m} - C_{8} - C_{12},

(A3.16)

C_{10} = \frac{2 λ_{1}}{2 λ_{1} + 1} C_{1}, C_{11} = - \frac{2}{3} C_{2}, C_{12} = - \frac{4}{9} C_{3} .

(A3.17)

Remark. As $s \to 0$ given t, $[1 - {(λ_{1})}^{t}] / (1 - λ_{1}) \to t$ , that is, the map expansions are dominant by the linear term proportional to t; as $t \to ∞$ given s, the map expansions asymptotically go to the constant $C_{8} + C_{10} / (1 - λ_{1})$ .

Proof. The proof is similar to Theorem A1.8, except that the eigenvalue $λ_{2}$ or $- 1 / 2$ has multiplicity 2. □

Theorem A3.7. The expected density $K_{t} (1122) = (K_{t}^{m m} (1122), K_{t}^{m p} (1122), K_{t}^{p p} (1122))$ , is given by

K_{t} (1122) = C_{8} + C_{10} \frac{1 - λ_{1}^{t}}{1 - λ_{1}} - [1 + (\frac{2}{9}, \frac{8}{9}, - \frac{4}{9}) s] C_{10} t λ_{1}^{t} + P Λ (λ_{1 - 3}^{t}) {(C_{15 - 17})}^{T},

(A3.18)

where $C_{8}$ , $C_{10}$ , and $P$ are given by equation (A3.15), equation (A3.17), and equation (A3.3), respectively, and

C_{15 - 17} = P^{- 1} [K_{0} (1122) - C_{8}] .

(A3.19)

Proof. The eigenvalues for the transition matrix of the linear recurrence equations (4a–4c), equations (6a, 6b), and equations (9a–9c) are 1, triplicated $λ_{2} = - 1 / 2$ , and duplicated $λ_{1}$ and $λ_{3}$ , under the limit $s \to 0$ . It holds

K_{t}^{m m} (1122) = B_{13} + (B_{15} - \frac{C_{10}}{1 - λ_{1}}) {(λ_{1})}^{t} + B_{16 - 17} {(λ_{2 - 3}^{t})}^{T} + B_{18 - 20} t {(λ_{1 - 3}^{t})}^{T} + B_{14} t^{2} {(λ_{2})}^{t},

(A3.20a)

K_{t}^{m p} (1122) = C_{13} + (C_{15} - \frac{C_{10}}{1 - λ_{1}}) {(λ_{1})}^{t} + C_{16 - 17} {(λ_{2 - 3}^{t})}^{T} + C_{18 - 20} t {(λ_{1 - 3}^{t})}^{T} + C_{14} t^{2} {(λ_{2})}^{t},

(A3.20b)

K_{t}^{p p} (1122) = D_{13} + (D_{15} - \frac{C_{10}}{1 - λ_{1}}) {(λ_{1})}^{t} + D_{16 - 17} {(λ_{2 - 3}^{t})}^{T} + D_{18 - 20} t {(λ_{1 - 3}^{t})}^{T} + D_{14} t^{2} {(λ_{2})}^{t},

(A3.20c)

where $C_{10}$ is given by equation (A3.17), and the constant coefficients $B_{13 - 20}$ , $C_{13 - 20}$ , and $D_{13 - 20}$ are to be solved. Substituting $K_{t}^{m m} (1122)$ and $K_{t}^{m p} (1122)$ of equations (A3.20a, A3.20b) into the recurrence equation (9b), we obtain

\begin{matrix} B_{13} = C_{13}, \\ B_{14} = (2 λ_{2} - 1) C_{14}, \\ B_{15 - 17} = (2 λ_{1 - 3} - 1) ⊙ C_{15 - 17} + 2 λ_{1 - 3} ⊙ C_{18 - 20} + (2 C_{10}, 2 λ_{2} C_{14}, 0), \\ B_{18 - 20} = (2 λ_{1 - 3} - 1) ⊙ C_{18 - 20} + (0, 4 λ_{2} C_{14}, 0) . \end{matrix}

(A3.21)

Substituting $K_{t}^{m m} (1122)$ and $K_{t}^{p p} (1122)$ of equations (A3.20a, A3.20c) and $R_{t}^{m}$ of equation (A3.13) into the recurrence equation (9c), and substituting $B_{13 - 20}$ of equation (A3.21), we obtain

\begin{matrix} D_{13} = C_{13} + s [(C_{8} + \frac{C_{10}}{1 - λ_{1}} - C_{13})], \\ D_{14} = \frac{(1 - s) (2 λ_{2} - 1)}{λ_{2}} C_{14}, \\ D_{15 - 17} = [(1 - s) (2 λ_{1 - 3} - 1) ⊘ λ_{1 - 3}] ⊙ C_{15 - 17} + (1 - s) C_{18 - 20} ⊘ λ_{1 - 3} \\ + (\frac{1 - 2 s}{λ_{1}} C_{10}, \frac{s}{λ_{2}} C_{9} - \frac{s}{λ_{2}} C_{11} - \frac{1 - s}{λ_{2}} C_{14}, \frac{s}{λ_{3}} C_{12}), \\ D_{18 - 20} = [(1 - s) (2 λ_{1 - 3} - 1) ⊘ λ_{1 - 3}] ⊙ C_{18 - 20} + (0, \frac{s}{λ_{2}} C_{11} + 2 \frac{1 - s}{λ_{2}} C_{14}, 0) . \end{matrix}

(A3.22)

Substituting $K_{t}^{m p} (1122)$ , $K_{t}^{m m} (1122)$ and $K_{t}^{p p} (1122)$ of equations (A3.20a–A3.20c) and $R_{t}^{m}$ of equation (A3.13) into the recurrence equation (9a), substituting $B_{13} - B_{20}$ of equation (A3.21), and substituting the exact $D_{13} - D_{20}$ of equation (A3.22), we obtain

C_{13} = C_{8} + \frac{C_{10}}{1 - λ_{1}}, C_{18} = - (1 + \frac{8}{9} s) C_{10}, C_{14} = C_{19} = C_{20} = 0,

(A3.23)

where the constant coefficients $C_{14}$ , $C_{19}$ , and $C_{20}$ are approximated by keeping up to the zero order of s, while $C_{18}$ is approximated by keeping up to the first order of s. The theorem follows by substituting $B_{13 - 20}$ , $C_{13 - 14}$ , $C_{18 - 20}$ , and $D_{13 - 20}$ of equations (A3.21–A3.23) into equations (A3.20a–A3.20c), where the constant coefficient $C_{15 - 17}$ of equation (A3.19) is determined by the initial conditions $K_{0} (1122)$ . □

Theorem A3.8. Denote by $K_{t} (1232) = {(K_{t}^{m m} (1232), K_{t}^{m p +} (1232), K_{t}^{p p} (1232))}^{T}$ the expected density of junction type $(1232)$ in a large population, and it holds

K_{t} (1232) = C_{4} \frac{λ_{1}^{t} - λ_{4}^{t}}{s} + P Λ (λ_{1 - 3}^{t}) {(C_{21 - 23})}^{T} + W_{t},

(A3.24)

where $P$ is given by equation (A3.3), $C_{4 - 7}$ is given by equation (A3.11), and

{(C_{21 - 23})}^{T} = P^{- 1} [K_{0} (1232) - W_{0}],

(A3.25)

\begin{matrix} W_{t} = [\begin{matrix} \frac{1}{3} & - \frac{2}{3 s} & - \frac{7}{9} & - \frac{4}{9} \\ 0 & 0 & 0 & - \frac{4}{9} \\ - \frac{1}{3} & - \frac{4}{3 s} & - \frac{20}{9} & \frac{32}{9} \end{matrix}] {[(λ_{1}^{t}, λ_{2}^{t}, λ_{3}^{t}, λ_{7}^{t}) ⊙ C_{4 - 7}]}^{T} \\ + [\begin{matrix} - \frac{4}{3 s} & - \frac{2}{9} \\ \frac{2}{3 s} & \frac{4}{9} \\ \frac{8}{3 s} & - \frac{8}{9} \end{matrix}] {[(t λ_{2}^{t}, t λ_{3}^{t}) ⊙ C_{5 - 6}]}^{T} . \end{matrix}

(A3.26)

Remark. In equation (A3.24), $(λ_{1}^{t} - λ_{4}^{t}) / s \to 2 t / 3$ as $s \to 0$ ; the term $W_{t}$ of equation (A3.26) is independent of s since $1 / s$ in the matrices is canceled with s of $C_{5}$ (see equation (A3.11)).

Proof. In a large population, the seven eigenvalues for the transition matrix of the linear recurrence equations (5a–5d) and equations (A1.41a–A1.41c) are $λ_{1}$ , $λ_{4}$ , $λ_{2} = λ_{5} = - 1 / 2$ , $λ_{3} = λ_{6} = 1 / 4$ , and $λ_{7}$ . It holds

K_{t}^{m m} (1232) = B_{21 - 23} {(λ_{1 - 3}^{t})}^{T} + B_{24} [{(λ_{4})}^{t} - {(λ_{1})}^{t}] + B_{25 - 26} t {(λ_{2 - 3}^{t})}^{T} + B_{27} {(λ_{7})}^{t},

(A3.27a)

K_{t}^{m p +} (1232) = C_{21 - 23} {(λ_{1 - 3}^{t})}^{T} + C_{24} [{(λ_{4})}^{t} - {(λ_{1})}^{t}] + C_{25 - 26} t {(λ_{2 - 3}^{t})}^{T} + C_{27} {(λ_{7})}^{t},

(A3.27b)

K_{t}^{p p} (1232) = D_{21 - 23} {(λ_{1 - 3}^{t})}^{T} + D_{24} [{(λ_{4})}^{t} - {(λ_{1})}^{t}] + D_{25 - 26} t {(λ_{2 - 3}^{t})}^{T} + D_{27} {(λ_{7})}^{t},

(A3.27c)

where the constant coefficients $B_{21 - 27}$ , $C_{21 - 27}$ , and $D_{21 - 27}$ are to be solved. Substituting $K_{t}^{m p +} (1232)$ and $K_{t}^{m m} (1232)$ of equations (A3.27a, A3.27b) and $β_{t}^{m m p}$ of equation (A1.16b) into the recurrence equation (A1.41b), we obtain

\begin{matrix} B_{21} = (2 λ_{1} - 1) C_{21} + 2 (λ_{4} - λ_{1}) C_{24} - C_{4}, \\ B_{22 - 23} = [2 λ_{2 - 3} - 1] ⊙ C_{22 - 23} + 2 λ_{2 - 3} ⊙ C_{25 - 26} - C_{5 - 6}, \\ B_{24 - 27} = [2 λ_{4 - 7} - 1] ⊙ C_{24 - 27} + (- C_{4}, 0, 0, - C_{7}) . \end{matrix}

(A3.28)

Substituting $K_{t}^{m m} (1232)$ and $K_{t}^{p p} (1232)$ of equations (A3.27a, A3.27c) into the recurrence equation (A1.41c), we obtain

\begin{matrix} D_{21} = \frac{(1 - s) (2 λ_{1} - 1)}{λ_{1}} C_{21} - \frac{(1 - s) (λ_{1} - λ_{4})}{λ_{1} λ_{4}} C_{24} - \frac{1 - s}{λ_{4}} C_{4}, \\ D_{22 - 23} = [(1 - s) (2 λ_{2 - 3} - 1) ⊘ λ_{2 - 3}] ⊙ C_{22 - 23} + (1 - s) (C_{25 - 26} - C_{5 - 6}) ⊘ λ_{2 - 3}, \\ D_{24 - 27} = [(1 - s) (2 λ_{4 - 7} - 1) ⊘ λ_{4 - 7}] ⊙ C_{24 - 27} + (- \frac{1 - s}{λ_{4}} C_{4}, 0, 0, - \frac{1 - s}{λ_{7}} C_{7}), \end{matrix}

(A3.29)

Substituting $K_{t}^{m m} (1232)$ , $K_{t}^{m p +} (1232)$ and $K_{t}^{p p} (1232)$ of equations (A3.27a–A3.27c) and $β_{t}^{m m p}$ and $β_{t}^{m p p}$ of equations (A1.16b–A1.16c) into the recurrence equation (A1.41a), and substituting $B_{21 - 27}$ and $D_{21 - 27}$ of equations (A3.28, A3.29) we obtain

C_{24} = - \frac{B_{4} + 2 C_{4}}{3 s}, C_{25} = \frac{1}{9} (2 B_{5} + C_{5}), C_{26} = - \frac{4}{9} (B_{6} - C_{6}), C_{27} = - \frac{4}{81} (4 B_{7} + 17 C_{7}) .

(A3.30)

According to equation (A3.10), we have

B_{4 - 7} = (1, \frac{3}{s}, 0, - 2) ⊙ C_{4 - 7},

(A3.31)

and thus

C_{24 - 27} = (- \frac{1}{s}, \frac{2}{3 s}, \frac{4}{9}, - \frac{4}{9}) ⊙ C_{4 - 7},

(A3.32)

where $C_{4 - 7}$ is given by equation (A3.11). The constant coefficient $C_{21 - 23}$ is determined by the initial condition $K_{0} (1232)$ , and it is given by equation (A3.25). The theorem follows by writing equations (A3.27a–A3.27c) in the matrix form while keeping only the leading order of s.

Theorem A3.9. The expected density $K_{t}^{m p -} (1232)$ in a large population is given by

K_{t}^{m p -} (1232) = \frac{1}{3} C_{4} {(λ_{4})}^{t} + A_{28} {(λ_{5})}^{t} + \frac{2}{3} C_{6} {(λ_{6})}^{t} + \frac{4}{3} C_{7} {(λ_{7})}^{t},

(A3.33)

where $C_{4}$ , $C_{6}$ , and $C_{7}$ are given by equation (A3.11), and

A_{28} = K_{0}^{m p -} (1232) - \frac{1}{3} C_{4} - \frac{2}{3} C_{6} - \frac{4}{3} C_{7} .

(A3.34)

Proof. In a large population, the eigenvalues of the transition matrix of the linear recurrence equations (5a–5d) and equation (A1.41d) are $λ_{4}$ , duplicated $λ_{5} = - 1 / 2$ , $λ_{6}$ and $λ_{7}$ . It holds

K_{t}^{m p -} (1232) = A_{24} {(λ_{4})}^{t} + (A_{28} + A_{25} t) {(λ_{5})}^{t} + A_{26} {(λ_{6})}^{t} + A_{27} {(λ_{7})}^{t},

(A3.35)

where $A_{24 - 28}$ is to be solved. Substituting $K_{t}^{m p -} (1232)$ of equation (A3.35) and $β_{t}^{m m p}$ of equation (A3.9) into the recurrence equation (A1.41d), and we obtain

A_{24} = \frac{1}{3} C_{4}, A_{25} = - C_{5}, A_{26} = \frac{2}{3} C_{6}, A_{27} = \frac{4}{3} C_{7},

(A3.36)

and $A_{28}$ is determined by the initial condition $K_{0}^{m p -} (1232)$ and it is given in equation (A3.34). The theorem follows by substituting equation (A3.36) into equation (A3.35), and approximating $A_{25}$ to zero since the leading term of $C_{5}$ is the first order of s, see equation (A3.11).

Corollary A3.10. The overall expected unction density $ρ_{t}^{m p}$ in a large population is given by

\begin{matrix} ρ_{t}^{m p} = C_{8} + C_{10} \frac{1 - λ_{1}^{t - 1}}{1 - λ_{1}} + [- C_{15} + (1 + \frac{8}{9} s) C_{10} t] λ_{1}^{t} \\ + (- C_{16} - C_{9} - \frac{4}{3} C_{2} + \frac{2}{3} C_{2} t) λ_{2}^{t} + (- C_{17} - \frac{20}{9} C_{3}) λ_{3}^{t}, \end{matrix}

(A3.37)

where $C_{1 - 3}$ , $C_{8 - 10}$ , and $C_{15 - 17}$ are given by equation (A3.4), equations (A3.15–A3.17), and equation (A3.19), respectively.

Proof. The corollary follows by substituting $R_{t}^{m}$ and $R_{t}^{p}$ of equations (A3.13, A3.14) and $K_{t}^{m p} (1122)$ of equation (A3.20b) into equation (3).

Footnotes

Supporting information is available online at http://www.g3journal.org/lookup/suppl/doi:10.1534/g3.114.016154/-/DC1

Communicating editor: E. Huang

Literature Cited

Broman K., 2005. The genomes of recombinant inbred lines. Genetics 169: 1133–1146. [DOI] [PMC free article] [PubMed] [Google Scholar]
Broman K. W., 2012a Genotype probabilities at intermediate generations in the construction of recombinant inbred lines. Genetics 190: 403–412. [DOI] [PMC free article] [PubMed] [Google Scholar]
Broman K. W., 2012b Haplotype probabilities in advanced intercross populations. G3 (Bethesda) 2: 199–202. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chapman N., Thompson E., 2003. A model for the length of tracts of identity by descent in finite random mating populations. Theor. Popul. Biol. 64: 141–150. [DOI] [PubMed] [Google Scholar]
Churchill G., Airey D., Allayee H., Angel J., Attie A., et al. , 2004. The collaborative cross, a community resource for the genetic analysis of complex traits. Nat. Genet. 36: 1133–1137. [DOI] [PubMed] [Google Scholar]
Darvasi A., Soller M., 1995. Advanced intercross lines, an experimental population for fine genetic-mapping. Genetics 141: 1199–1207. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fisher R., 1949. The Theory of Inbreeding. Oliver and Boyd, London. [Google Scholar]
Fisher R., 1954. A fuller theory of junctions in inbreeding. Heredity 8: 187–197. [Google Scholar]
Haldane J., Waddington C., 1931. Inbreeding and linkage. Genetics 16: 357–374. [DOI] [PMC free article] [PubMed] [Google Scholar]
Johannes F., Colome-Tatche M., 2011. Quantitative epigenetics through epigenomic perturbation of isogenic lines. Genetics 188: 215–227. [DOI] [PMC free article] [PubMed] [Google Scholar]
King E. G., Macdonald S. J., Long A. D., 2012. Properties and power of the Drosophila Synthetic Population Resource for the routine dissection of complex traits. Genetics 191: 935–949. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu E. Y., Zhang Q., McMillan L., Pardo-Manuel de Villena F., Wang W., 2010. Efficient genome ancestry inference in complex pedigrees with inbreeding. Bioinformatics 26: i199–i207. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu S., Kowalski S., Lan T., Feldmann I., Paterson A., 1996. Genome-wide high-resolution mapping by recurrent intermating using Arabidopsis thaliana as a model. Genetics 142: 247–258. [DOI] [PMC free article] [PubMed] [Google Scholar]
MacLeod A. K., Haley C. S., Woolliams J. A., 2005. Marker densities and the mapping of ancestral junctions. Genet. Res. 85: 69–79. [DOI] [PubMed] [Google Scholar]
Martin O. C., Hospital F., 2011. Distribution of parental genome blocks in recombinant inbred lines. Genetics 189: 645–654. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ming R., Moore P. H., 2007. Genomics of sex chromosomes. Curr. Opin. Plant Biol. 10: 123–130. [DOI] [PubMed] [Google Scholar]
Mott R., Talbot C., Turri M., Collins A., Flint J., 2000. A method for fine mapping quantitative trait loci in outbred animal stocks. Proc. Natl. Acad. Sci. USA 97: 12649–12654. [DOI] [PMC free article] [PubMed] [Google Scholar]
Svenson K. L., Gatti D. M., Valdar W., Welsh C. E., Cheng R. Y., et al. , 2012. High-resolution genetic mapping using the mouse diversity outbred population. Genetics 190: 437–447. [DOI] [PMC free article] [PubMed] [Google Scholar]
Weller J., Soller M., 2004. An analytical formula to estimate confidence interval of qtl location with a saturated genetic map as a function of experimental design. Theor. Appl. Genet. 109: 1224–1229. [DOI] [PubMed] [Google Scholar]
Winkler C., Jensen N., Cooper M., Podlich D., Smith O., 2003. On the determination of recombination rates in intermated recombinant inbred populations. Genetics 164: 741–745. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zheng C. Z., Boer M. P., van Eeuwijk F. A., 2014. A general modeling framework for genome ancestral origins in multiparental populations. Genetics 198: 87–101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib1] Broman K., 2005. The genomes of recombinant inbred lines. Genetics 169: 1133–1146. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] Broman K. W., 2012a Genotype probabilities at intermediate generations in the construction of recombinant inbred lines. Genetics 190: 403–412. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] Broman K. W., 2012b Haplotype probabilities in advanced intercross populations. G3 (Bethesda) 2: 199–202. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Chapman N., Thompson E., 2003. A model for the length of tracts of identity by descent in finite random mating populations. Theor. Popul. Biol. 64: 141–150. [DOI] [PubMed] [Google Scholar]

[bib5] Churchill G., Airey D., Allayee H., Angel J., Attie A., et al. , 2004. The collaborative cross, a community resource for the genetic analysis of complex traits. Nat. Genet. 36: 1133–1137. [DOI] [PubMed] [Google Scholar]

[bib6] Darvasi A., Soller M., 1995. Advanced intercross lines, an experimental population for fine genetic-mapping. Genetics 141: 1199–1207. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] Fisher R., 1949. The Theory of Inbreeding. Oliver and Boyd, London. [Google Scholar]

[bib8] Fisher R., 1954. A fuller theory of junctions in inbreeding. Heredity 8: 187–197. [Google Scholar]

[bib9] Haldane J., Waddington C., 1931. Inbreeding and linkage. Genetics 16: 357–374. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] Johannes F., Colome-Tatche M., 2011. Quantitative epigenetics through epigenomic perturbation of isogenic lines. Genetics 188: 215–227. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] King E. G., Macdonald S. J., Long A. D., 2012. Properties and power of the Drosophila Synthetic Population Resource for the routine dissection of complex traits. Genetics 191: 935–949. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] Liu E. Y., Zhang Q., McMillan L., Pardo-Manuel de Villena F., Wang W., 2010. Efficient genome ancestry inference in complex pedigrees with inbreeding. Bioinformatics 26: i199–i207. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Liu S., Kowalski S., Lan T., Feldmann I., Paterson A., 1996. Genome-wide high-resolution mapping by recurrent intermating using Arabidopsis thaliana as a model. Genetics 142: 247–258. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] MacLeod A. K., Haley C. S., Woolliams J. A., 2005. Marker densities and the mapping of ancestral junctions. Genet. Res. 85: 69–79. [DOI] [PubMed] [Google Scholar]

[bib15] Martin O. C., Hospital F., 2011. Distribution of parental genome blocks in recombinant inbred lines. Genetics 189: 645–654. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Ming R., Moore P. H., 2007. Genomics of sex chromosomes. Curr. Opin. Plant Biol. 10: 123–130. [DOI] [PubMed] [Google Scholar]

[bib17] Mott R., Talbot C., Turri M., Collins A., Flint J., 2000. A method for fine mapping quantitative trait loci in outbred animal stocks. Proc. Natl. Acad. Sci. USA 97: 12649–12654. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib18] Svenson K. L., Gatti D. M., Valdar W., Welsh C. E., Cheng R. Y., et al. , 2012. High-resolution genetic mapping using the mouse diversity outbred population. Genetics 190: 437–447. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] Weller J., Soller M., 2004. An analytical formula to estimate confidence interval of qtl location with a saturated genetic map as a function of experimental design. Theor. Appl. Genet. 109: 1224–1229. [DOI] [PubMed] [Google Scholar]

[bib20] Winkler C., Jensen N., Cooper M., Podlich D., Smith O., 2003. On the determination of recombination rates in intermated recombinant inbred populations. Genetics 164: 741–745. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Zheng C. Z., Boer M. P., van Eeuwijk F. A., 2014. A general modeling framework for genome ancestral origins in multiparental populations. Genetics 198: 87–101. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Modeling X-Linked Ancestral Origins in Multiparental Populations

Chaozhi Zheng

Abstract

A model for X-linked ancestral origins

Assumptions and notation

Figure 1.

Single-locus non-IBD probabilities

Figure 2.

Expected junction densities

Model evaluation by simulations

Application to QTL mapping populations

Multistage populations

Table 1. Results for X chromosomes in the $2^{n}$ -way RIL by sibling mating in the last generation $g = U + V + 1$ , where $U = 0$ for $n = 1$ and $U = n - 2$ for $n \geq 2$ .

Table 2. Results for the AIL in the last generation $g = U + 1$ .

Table 3. Results for the DO in the last generation $g = U + 1$ .

RIL

Figure 3.

AIL

Figure 4.

HS and DO

DSPR

Figure 5.

Discussion

Acknowledgments

Appendix A

Results for constant random mating populations

A1 Finite population

A2 Sibling-mating population

A3 Large population

Footnotes

Literature Cited

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Modeling X-Linked Ancestral Origins in Multiparental Populations

Chaozhi Zheng

Abstract

A model for X-linked ancestral origins

Assumptions and notation

Figure 1.

Single-locus non-IBD probabilities

Figure 2.

Expected junction densities

Model evaluation by simulations

Application to QTL mapping populations

Multistage populations

Table 1. Results for X chromosomes in the 2n-way RIL by sibling mating in the last generation g=U+V+1, where U=0 for n=1 and U=n−2 for n≥2.

Table 2. Results for the AIL in the last generation g=U+1.

Table 3. Results for the DO in the last generation g=U+1.

RIL

Figure 3.

AIL

Figure 4.

HS and DO

DSPR

Figure 5.

Discussion

Acknowledgments

Appendix A

Results for constant random mating populations

A1 Finite population

A2 Sibling-mating population

A3 Large population

Footnotes

Literature Cited

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Table 1. Results for X chromosomes in the $2^{n}$ -way RIL by sibling mating in the last generation $g = U + V + 1$ , where $U = 0$ for $n = 1$ and $U = n - 2$ for $n \geq 2$ .

Table 2. Results for the AIL in the last generation $g = U + 1$ .

Table 3. Results for the DO in the last generation $g = U + 1$ .