Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 Dec 19:2024.06.06.597727. Originally published 2024 Jun 6. [Version 2] doi: 10.1101/2024.06.06.597727

Differentiating mechanism from outcome for ancestry-assortative mating in admixed human populations

Dashiell J Massey 1, Zachary A Szpiech 2,3, Amy Goldberg 1,4
PMCID: PMC11185628  PMID: 38895317

Abstract

Population genetic theory, and the empirical methods built upon it, often assume that individuals pair randomly for reproduction. However, natural populations frequently violate this assumption, which may potentially confound genome-wide association studies, selection scans, and demographic inference. Within several recently admixed human populations, empirical genetic studies have reported a correlation in global ancestry proportion between spouses, referred to as ancestry-assortative mating. Here, we use forward genomic simulations to link correlations in global ancestry proportion between mates to the underlying mechanistic mate-choice process. We consider the impacts of two types of mate-choice model, using either ancestry-based preferences or social groups as the basis for mate pairing. We find that multiple mate-choice models can produce the same correlations in global ancestry proportion between spouses; however, we also highlight alternative analytic approaches and circumstances in which these models may be distinguished. With this work, we seek to highlight potential pitfalls when interpreting correlations in empirical data as evidence for a particular model of human mating practices, as well as to offer suggestions toward development of new best practices for analysis of human ancestry-assortative mating.

Introduction

Non-random mating has long been appreciated as an important source of genetic structure in natural populations (Fisher 1918; Wright 1921; Wright 1950; Nagylaki 1978). Positive assortative mating (hereafter, assortative mating), wherein genotypic or phenotypic trait values are positively correlated between mates, has been empirically observed across animal species (Jiang et al. 2013). Theoretical and empirical studies have demonstrated the consequences of assortative mating for speciation and hybridization (Kondrashov 1983; Otto et al. 2008; Tung et al. 2012; Schumer et al. 2017; Kopp et al. 2018; Powell et al. 2021; Muralidhar et al. 2022; Natola et al. 2022; Smadja et al. 2022; Robinson et al. 2023), and for the distributions of traits within populations (Wright 1921; Norris et al. 2019; Kim et al. 2021; Border et al. 2022; Muralidhar et al. 2022; Horwitz et al. 2023).

In non-human primates that live in complex and structured social environments, sociodemographic factors have also been demonstrated to influence mate pair formation (Keddy-Hector 1992; Klinkova et al. 2005; Setchell And Wickings 2006; Van Belle et al. 2009; Tung et al. 2012; Fogel et al. 2021). As such, when it comes to humans, assortative mating has been the purview not only of biologists, but also social scientists (e.g., (Buss And Barnes 1986; Mare 1991; Kalmijn 1998; Luo And Klohnen 2005; Blossfeld 2009; Torche 2010; Schwartz 2013; Greenwood et al. 2014; Henz And Mills 2017; Smieja And Stolarski 2018; Chiappori 2020; De La Mare And Lee 2023)). Positive correlations have been reported between human spouses for a diverse array of phenotypes, including morphometric measurements, health outcomes, personality traits, lifestyle factors, age, socioeconomic status, educational attainment, religious affiliation, and language (Nagoshi et al. 1990; Robinson et al. 2017; Fibla et al. 2022; Horwitz et al. 2023; Yamamoto et al. 2023). These associations are likely driven by multiple generative processes, including individual mate-choice preferences, phenotypic convergence over time facilitated by cohabitation, and social structures that restrict or promote particular pairings. These processes may act individually or in concert. For instance, sociological literature on assortative mating by educational attainment has revealed the influence of both a preference for mate similarity (Kalmijn 1998) and social barriers to marriage across socioeconomic class (Torche 2010), and that the strength of assortment by education over time is sensitive to the degree of temporal overlap between the end of schooling and average age at the time of marriage (Mare 1991), suggesting that social milieu is likely also an important factor in structuring mate pair outcomes.

Within the field of population genetics, relatively less attention has been paid to understanding the contributions of these generative processes to producing the observed correlations between mating pairs. Indeed, mechanism and outcome are often conflated or not clearly delineated when discussing assortative mating, which obscures these multiple mechanisms that may contribute to mate similarity. When studies do consider mechanism, they typically draw on foundational work in the sexual selection literature modeling the effects of female choice on male phenotype (Lande 1981; Kirkpatrick 1982; Seger 1985), in which the mechanism driving assortative mating is assumed to be mate choice. To extend this framework beyond sexual dimorphism, the mate-choice mechanism is modeled as a preference for mates that “match” an individual’s own phenotype (Kopp et al. 2018; Goldberg et al. 2020; Kim et al. 2021; Muralidhar et al. 2022). However, some studies have suggested that temporal structure in the mating process (Xie et al. 2015; Woodman et al. 2023) or non-uniform ability to attract mates (Burley 1983) can generate phenotypic correlations between mates without an explicit preference for phenotypic similarity. In this paper, we use the term “assortative mating” to refer specifically to an empirical observation of greater resemblance between mates than expected by chance, agnostic to mechanism.

Reports of assortative mating by ancestry in humans provide a particularly intriguing example for exploring mechanism because genetic ancestry is a complex quantitative “phenotype” that involves every locus in the genome and that can only be ascertained by sequencing. In recently admixed human populations, in which individuals derive ancestry from multiple source populations, a positive correlation in global ancestry proportion between spouses has been observed. This phenomenon, referred to as ancestry-assortative mating, has been reported in multiple Latino populations (Risch et al. 2009; Zou et al. 2015; Spear et al. 2020; Mas Sandoval et al. 2023), as well as in African-Americans (Zaitlen et al. 2017; Avadhanam And Williams 2022), Cabo Verdeans (Korunes et al. 2022), and ni-Vanuatu (Arauna et al. 2022). Ancestry-associated mating patterns, inferred from excess sharing of single-nucleotide variants, have also been reported for non-Hispanic white spouse pairs in the United States (Sebro et al. 2010; Domingue et al. 2014; Sebro et al. 2017). Empirical studies commonly use such correlations in global ancestry proportion between spouses to suggest that ancestry shapes mate choice, although this interaction would necessarily be mediated by a proxy phenotype of some type, not by quantitative ancestry proportion. However, the relative contributions of individual preferences and broader social mechanisms to generating these correlations are unclear.

Accounting for assortative mating is critical for accurate statistical and population genetic analysis in admixed human populations: it has been shown that assortative mating confounds genome-wide association studies (Howe et al. 2021; Border et al. 2022; Veller And Coop 2024) and heritability estimation (Sebro And Risch 2012; Tenesa et al. 2016; Huang et al. 2024). In addition, ancestry-assortative mating specifically has been shown to produce estimates for the timing of admixture that are too recent (Zaitlen et al. 2017; Goldberg et al. 2020; Korunes et al. 2022). However, it remains challenging to establish a null expectation in the presence of assortative mating without understanding the underlying mechanism at play.

To date, statistical methods to correct for ancestry-assortative mating in empirical data have tended to generate null models by simulating data to match the observed empirical correlation in global ancestry proportion between mating pairs, often without a specific mechanistic model (Zaitlen et al. 2017; Spear et al. 2020; Pfennig And Lachance 2023; Huang et al. 2024). This approach makes two important assumptions: first, that the correlation coefficient for a given population has remained constant over time; and second, that matching the correlation coefficient is sufficient to recapitulate the dynamics of global and local ancestry. In contrast, theoretical population genetic studies have examined the behavior of these same parameters using mechanistic models of biased mate choice (Goldberg et al. 2020; Kim et al. 2021; Muralidhar et al. 2022). This approach assumes that ancestry-based preference is the primary mechanism driving non-random mating. These studies typically do not directly consider whether the model used results in an observed correlation in global ancestry proportion between mates, which leaves open the question of how applicable their conclusions are to the context of empirical studies of human ancestry-assortative mating.

Here, we used forward-in-time simulations to probe the relationship between mate-choice mechanism and observed correlation in global ancestry proportion, comparing variants of two classes of mate-choice model. The first model class considers only individual mate-choice preferences for similarity in global ancestry proportion, treating genetic ancestry as a continuous quantitative trait. The second class considers only discrete social groups, assigning each individual to one of two groups and limiting cross-group mating events. (Importantly, group membership is “inherited” from one parent and group identities are initially associated with source populations; see Models.) We found that ancestry-similarity and social group models can produce similar correlations in global ancestry proportion between mates, suggesting that either or both mechanisms could be relevant in interpreting empirical data. Furthermore, the correlation coefficient was not constant over time in any of our simulations, and not all variants of the ancestry-similarity class or all parameter values of the social-group model maintained sufficient population variance to sustain correlations in global ancestry proportion over tens of generations. Our results highlight important caveats about the assumptions behind both existing approaches and suggest additional analyses of empirical data that may help reveal the underlying mechanism of ancestry-assortative mating in human populations.

Models

Simulation framework

We performed forward-in-time simulations of admixture with equal contributions from two source populations using SLiM 4.0.1 (Haller et al. 2019; Haller And Messer 2023). We modeled sexually reproducing diploid individuals (census population size N=10,000/generation) with 22 independently segregating chromosomes, representing the size distribution of human autosomes (total genome size L=2.88Gb), with a uniform recombination rate r=1×10-8. Unless otherwise noted, there was a single pulse of admixture and no additional contributions from the source populations introduced at later generations. We did not model any type of one- or two-dimensional spatial relationships between individuals, nor did we simulate genotypes.

For each of 50 generations post-admixture, we extracted the position and source of local-ancestry tracts from the tree-sequence (Haller et al. 2019). From these data, we calculated the total proportion of the genome that each individual derived from source population 1 (hereafter, global ancestry proportion, x[0,1]. Separately, we tracked the social group s{A,B} to which each individual belonged: offspring were assigned to the same social group as their first parent – i.e., without reference to their global ancestry proportion. We focus primarily on results from the first 20 generations post-admixture, approximately corresponding to the timing of African-European admixture initiated by the trans-Atlantic slave trade (Zaitlen et al. 2017; Hamid et al. 2021; Korunes et al. 2022; Mas Sandoval et al. 2023; Mooney et al. 2023), with a subset of results over longer time periods in the Supplement.

We considered two broad classes of mate-choice models: (1) ancestry-similarity, wherein the probability of an individual with global ancestry proportion xi mating with an individual with global ancestry proportion xj is defined in terms of xi-xj; and (2) social group, wherein the probability of an individual belonging to social group si mating with an individual belonging to social group sj is defined by whether or not si=sj

To implement non-random mating in SLiM, we started from the default Wright-Fisher framework, which is designed to allow females to be selective if a mateChoice() callback is used. In this work, we did not distinguish males and females, instead allowing each individual to potentially serve as both parent 1 (selector) and parent 2 (selected) in sequential mating events. For each mating event, parent 1 was uniformly sampled with replacement from the population, while parent 2 was sampled proportional to mating weight ψi,j, calculated according to the specified mate-choice function (defined below) evaluated for fxi,xj or fsi,sj, under the ancestry-similarity and social group models, respectively (Figure 1). Incidental selfing was explicitly prohibited and each mating event generated a single child.

Figure 1.

Figure 1.

Four functions for defining the sampling weight, ψi,j, assigned to individual j as a potential mate for a focal individual i, based on global ancestry proportion xj(a-c) or social group sj(d). For each mating event, mate 1 was uniformly sampled from the population, and mate 2 was sampled proportional to ψi,j. For the sake of visualization, values of ψi,j have been scaled such that ψi,j=1 means that the probability of individual j being selected as a potential mate for individual i is 1/n (n: number of potential mates). All mate-choice functions are re-parameterized in terms of α. (a) The stationary-preference variant of the ancestry-similarity model (Equation 2) defines ψi,j as a function of the difference in global ancestry proportion between individuals, xi-xj. (b) Under the increasing-preference variant of the mate-preference model (Equation 3), the value of ψi,j is scaled to the variance in global ancestry proportion, σx2(t), across all potential mates, resulting in increasing choosiness over time. ψi,j values are shown for t{1,2,3} and α=4, assuming σx2(t)=2-t2 (Liang et al. 2024). (c) Under the broad-preference variant of the ancestry-similarity model (Equation 4), ψi,j follows the probability density function of a normal distribution with mean xi. (d) Under the social group model (Equation 6), ψi,j can take one of two discrete values, determined by whether a potential mate belongs to the same social group as the focal individual (si=sj) or to the other social group (sisj).

Defining a common parameter for model comparison

The mate-choice functions we considered (Equations 3–6, below) differ in how they parameterize the strength of mating bias; to aid in comparison across models, we will refer throughout to α0,, the frequency of mating events between individuals from opposite source populations. Specifically, we define α to be inversely related to the proportion of admixed offspring in the initial generation post-contact, A. We have,

α=1A-1. (1)

That is, at time of population contact, individuals are α times more likely choose a mate from within their own source population than from the opposite source population. Increasing values of α representing an increasing bias toward endogamy, with α=0 corresponding to exclusive exogamy, α(0,1) to negative assortative mating, α=1 to random mating, and α>1 to positive assortative mating. Given our focus on mechanisms that might explain empirically observed positive correlations in global ancestry proportion between mates, we consider only α1. While our models differ in their dynamics over time, simulations with the same α have the same correlation in global ancestry proportion between mates, rxi,xj[0,1], for the first generation post-admixture under all models.

Ancestry-similarity models

Individual-based mate preference models originate in the sexual selection literature, where they were developed for studying the co-evolutionary dynamics of male secondary sex characteristics and female mate choice (Lande 1981; Kirkpatrick 1982; Seger 1985). Generally, under these models, male phenotype and female preference are controlled by distinct genetic loci (or sets of loci). In the speciation literature, an alternative family of models are concerned with assortative mating as a mechanism of sympatric speciation and model mate preference based on phenotypic similarity between mates for some ecologically relevant trait (Dieckmann And Doebeli 1999; Burger And Schneider 2006; Pennings et al. 2008; Rettelbach et al. 2013). These “matching rule” models (Kopp et al. 2018) are the basis for our ancestry-similarity model class, wherein individuals preferentially choose mates to maximize similarity in phenotype (i.e., global ancestry proportion). Importantly for the models we consider here, these are fixed relative-preference models: a focal individual i assigns each potential mate a preference value ψi,j that depends only on the potential mate’s phenotype and is density independent (Seger 1985).

Ancestry-similarity models define a mate-choice function ψi,j=fxi,xj for calculating the sampling weight assigned to individual j with global ancestry proportion xj as a potential mate for a focal individual i with global ancestry proportion xi. In the present study, we considered three variants of the ancestry-similarity model identified in a literature search, which differ in the precise definition of the mate-choice function f.

Models of ancestry-based assortative mating typically model fx1,xi as an exponential function, such that ψi,j decays exponentially at rate c=ln(α) as xi-xj increases. We define,

ψi,j=e-cxi-xj. (2)

We refer to this as the stationary-preference model (“like-with-like” model in (Muralidhar et al. 2022)): for a given absolute difference in global ancestry proportion, xi-xj, the associated preference ψi,j does not change over time. It is important to note, however, that the range of ψi,j is constrained by xi: an individual i with global ancestry proportion xi=0.5 is more similar to all potential mates than an individual with xi=0, and is therefore less selective in choosing a mate (Figure 1a). Furthermore, without additional migration from the source populations, variance in global ancestry proportion across individuals decreases over time (Figure S2), meaning that the pool of potential mates is becoming increasingly homogenous and the ratio of maxψi,j/minψi,j approaches 1 for all values of xi (Figure S1a). In other words, all individuals become less selective in choosing mates over successive generations.

One approach to account for the decreased variance in global ancestry proportion over time, and thus preserve mate-selectiveness, is to decrease c, rather that holding it constant. Following (Kim et al. 2021), we re-scale c in each generation t using the variance in global ancestry proportion observed in that generation, σx2(t):

ψi,j=e-cσx2(t)xi-xj. (3)

We refer to this as the increasing-preference model (Equation A2 in (Kim et al. 2021)). Under this model, as under the stationary-preference model, an individual i with global ancestry proportion xi=0.5 is less selective than one with xi=0 (Figure 1b). However, because σx2(t) decreases over time, there is an increase in mate-selectiveness over successive generations under this model for a given xi (Figure S1b; Figure S3).

We also considered a third ancestry-similarity model, employing a Gaussian – rather than exponential – mate-choice function. This approach is commonly used in models of assortative mating based on a quantitative trait (e.g., (Dieckmann And Doebeli 1999; Pennings et al. 2008; Funk et al. 2021)), although it has not to our knowledge been used to model assortative mating based on global ancestry proportion. Under this model, ψi,j~Nxi,σ2 where xi is the global ancestry proportion of the focal individual and the constant σ2=1-2lnα-1 determines the strength of mate-choice preference. Again, ψi,j decreases as the difference in global ancestry proportion between potential mates increases:

ψi,j=1σ2πe-xi-xj22σ2. (4)

We refer to this as the broad-preference model because fxi,xj decays more gradually with increasing differences in global ancestry proportion, relative to the stationary-preference and increasing-preference models (Figure 1c). Thus, a large difference in global ancestry proportion from a potential mate is strongly disfavored, while smaller differences within the pool of “similar” mates are not weighed strongly. Compared to the other two ancestry-similarity models, individuals are less selective in choosing mates under the broad-preference model; however, as under the stationary-preference model, they also become less selective over time (Figure S1c; Figure S4).

Social group model

An alternative class of models considers preferential mating based on a categorical, rather than quantitative, trait. This type of model is often used in the context of interspecific hybrid zones, wherein non-random mating can maintain species boundaries and/or promote hybrid speciation (Melo et al. 2009; Schumer et al. 2017; Powell et al. 2021; Natola et al. 2022; Smadja et al. 2022). Mechanistically, this can be construed as a model of biased mating by species identity or source population (Goldberg et al. 2020). However, application of this type of model in humans is not straightforward: the source populations in question are not biologically distinct species, but rather labels describing socially and geographically defined boundaries that have changed over time. Furthermore, it is unclear how we ought to model the mating behavior of admixed individuals: neither random mating by admixed individuals nor a simple preference of admixed individuals to mate with one another seem likely to produce the observed correlation in global ancestry proportion between mates. Yet, the strongest evidence for ancestry-assortative mating in humans is observed in contexts where both spouses have admixed ancestry (Risch et al. 2009; Zou et al. 2015; Zaitlen et al. 2017; Spear et al. 2020; Avadhanam And Williams 2022; Korunes et al. 2022; Mas Sandoval et al. 2023).

Social identity is an important organizer of pair formation in humans, as evidenced by widespread cultural practices of endogamy based on race, ethnicity, religion, socioeconomic status, and caste. Although distinct from genetic ancestry, social categorizations often interact with ancestry. For instance, skin pigmentation, a prominent contributor to how an individual is racialized, is correlated with West African-related global ancestry proportion in multiple populations with recent admixture history (Parra et al. 2003; Shriver et al. 2003; Bonilla et al. 2004; Beleza et al. 2012). Race has also served as the basis for numerous sociolegal barriers to mating across contexts, including anti-miscegenation laws (Browning 1951). This type of interaction opens the possibility that social barriers to mating might be “recorded” in patterns of ancestry-assortative mating detectable by genomic analyses.

We considered a simple social group model with two groups, with sampling weight ψi,j determined by the social group identity of the focal individual and potential mate, si and sj, respectively, with preference strength α (Figure 1d; Figure S5). Rearranging Equation 1:

ψi,j=11α+1,si=sj1α+1,sisj. (5)

To generate an association between social group membership and global ancestry proportion, we assigned individuals in each source population to the same social group prior to admixture. However, from the onset of population contact, individuals “inherited” their social group membership from the first parent in the mating pair. Thus, individuals whose parents belonged to opposite social groups were equally likely to be assigned to either social group.

Several features distinguish the social group model from the ancestry-similarity models described above, or from one in which individuals use an observable quantitative trait (e.g., skin pigmentation) as a proxy for an unobserved trait (global ancestry proportion). First, social group membership is correlated with – but not determined by – genotype. This matters because it has previously been demonstrated that recombination is expected to decouple the relationship between global ancestry proportion and a proxy quantitative trait over time, leading to only a transient correlation between the two phenotypes (Kim et al. 2021). Because social group membership is not genetic, it is not impacted by recombination. In addition, individuals are uniformly likely to cross the social group barrier in choosing a mate and do not express relative preferences among potential mates within a social group. Thus, in contrast to an ancestry-similarity model, individuals with intermediate ancestry are not less selective than individuals with more source-like ancestry in choosing a mate.

However, these features of the social group model also mean that the association between global ancestry proportion and social group membership is expected to decay over successive generations, and we explored this relationship quantitatively over time.

Results

Not all simulations of non-random mating show a positive correlation in global ancestry proportion between mates 20 generations post-admixture

Ancestry-assortative mating is often inferred from a positive correlation in global ancestry proportion between spouses at single point in time, and statistical methods to account for non-random mating typically generate null models using simulations where potential mate pairs are permuted until the empirically observed correlation is achieved (Zaitlen et al. 2017; Pfennig And Lachance 2023; Huang et al. 2024). This approach assumes that the correlation observed in the present-day samples has remained constant since the start of admixture. Figure 2 plots the Pearson correlation coefficient for global ancestry proportion between mates, rxi,xj, over time under three variants of the ancestry-similarity model or the social group model, for multiple strengths of mating bias, α, and following a single pulse of admixture. In all scenarios of non-random mating that we considered, we observed positive correlations in global ancestry proportion between mates (Figure 2; Figure S6). However, contrary to the assumption that the correlation observed in a contemporary sample should be modeled as a fixed value, the observed correlation coefficients in our simulations decayed over time. In simulations performed under the increasing-preference model, rxi,xj reached a stable plateau within the first 10 generations post-admixture across values of α (Figure S7a). In contrast, simulations performed under the other two variants of the ancestry-similarity model (stationary-preference or broad-preference) approached rxi,xj=0 within 20 generations post-admixture, unless α was sufficiently large to disrupt the admixture process altogether (Figure S8; Figure S9). Under the social group model, positive values of rxi,xj could be observed 20 generations post-admixture for α>2, although these models also approached rxi,xj=0 on longer time scales (Figure S7b).

Figure 2.

Figure 2.

Correlation in global ancestry proportion between mates, rxi,xj, was not constant over time and decayed to near-zero within 20 generations post-admixture in some simulations. Simulations under the stationary-preference (blue; Equation 2) and broad-preference (green; Equation 4) models did not simultaneously maintain rxi,xj0 while also allowing for a large population with intermediate global ancestry proportion values, whereas those under the increasing-preference (pink; Equation 3) and social group (yellow; Equation 5) models did for some α. By definition, rxi,xj in the first generation after admixture is equal across all simulations performed with the same value of α, regardless of mate-choice model. For the sake of comparison, the same replicate random-mating simulations (α=1; gray) are reproduced in each subplot. See Figure S6.

To formally test whether the decay in rxi,xj meant that mating was effectively random at t=20 generations post-admixture under the stationary-preference and broad-preference models, we permuted the mating pairs and re-calculated rxi,xj for 1,000 permutations of the simulated data. For the stationary-preference model, we found examples in which rxi,xj was close to – but still significantly different from – zero by a permutation test. For instance, for the five replicate simulations with α=10,rxi,xj[0.0182,0.0444] for t=20 generations post-admixture. In two of these simulations, the observed rxi,xj{0.0399,0.0444} never overlapped the permuted distribution (empirical p<0.001). In contrast, rxi,xj always overlapped the permuted distribution after t=11 generations under the broad-preference model (α=10) (Figure S10).

Thus, not all variants of the ancestry-similarity model generate a positive correlation in global ancestry proportion between mates that is observable 20 generations post-admixture while also allowing for a large population of individuals within intermediate global ancestry proportion. Conversely, results for the social group model demonstrate that even relatively weak social barriers can produce a positive correlation in global ancestry proportion between mates on the timescale of tens of generations, even as the association between social group membership and global ancestry proportion diminishes over time (Figure S11).

Relationship between correlation in global ancestry proportion between mates and variance in global ancestry proportion across individuals differs between models

To understand why the stationary-preference and broad-preference models did not produce a sustained correlation in global ancestry proportion between mates, rxi,xj0, over time while the increasing-preference and social group models sometimes did (Figure 2), we next considered the variance in global ancestry proportion across individuals in generation t,σx2(t). Intuitively, non-random mating requires that individuals in the population vary for the trait that serves as the basis for mate choice: if all potential mates are identical for the relevant phenotype, mating pairs will form randomly regardless of the strength of mate-choice bias. Thus, we hypothesized that our simulations of biased mating would become indistinguishable from random mating when σx2(t) became sufficiently small.

Consistent with prior analytic models, σx2(t) was initially greater under non-random relative to random mating, regardless of mate-choice model (Wright 1921; Verdu And Rosenberg 2011; Zaitlen et al. 2017; Goldberg et al. 2020; Liang et al. 2024), and decayed more slowly for larger α (Figure S12). However, for simulations under the stationary-preference and broad-preference models, σx2(t) approached zero within approximately t=15 generations post-admixture under our set of parameters. This timing corresponds to when rxi,xj0 under these two models, as observed in Figure 2. In contrast, under the increasing-preference and social group models, which maintained a positive rxi,xj at t=20 generations, σx2(t) was greater than for random mating for any generation t (all α for the increasing-preference model; α>2 for the social group model).

As predicted, higher σx2(t) corresponded to higher rxi,xj for all four models, at least for early generations post-admixture (Figure S13). However, this relationship was not linear and differed across both models and ranges of σx2(t). In early generations, when σx2(t) is large, rxi,xj scales approximately linearly with σx2(t) under all models (Figure 3a). As the admixture process continues, however, we observed that for intermediate values of σx2(t), the same σx2(t) corresponded to a lower rxi,xj under the stationary-preference and broad-preference models relative to the other two models (Figure 3b, c). In other words, simulations under these two models required greater variance in global ancestry proportion across individuals in order to observe the same correlation in global ancestry proportion between mates. By t=20 generations post-admixture, σx2(t) is small and we observed that the same σx2(t) corresponded to a higher rxi,xj under the increasingpreference model than under the social group model (Figure 3d). This emphasizes a key distinction between these two models: the social group model sustains a positive rxi,xj over tens of generations by maintaining higher σx2(t), whereas the increasing-preference model instead compensates for very low σx2(t) by scaling the mate-choice parameter c by σx2(t), increasing the strength of preference in each generation.

Figure 3.

Figure 3.

The relationship between variance in global ancestry proportion across individuals, σx2(t), and correlation in global ancestry between mates, rxi,xj, differs between models and changes as σx2(t) approaches zero. (a) In early generations, when σx2(t) is larger, there is a quasi-linear relationship between rxi,xj and σx2(t). (b, c) As the value of σx2(t) decreases, differences between the two models that sustain high rxi,xj and the two model that do not. (d) At generation t=20, a given value of rxi,xj corresponds to a lower σx2(t) under the increasing-preference model relative to the social group model. In other words, the increasing-preference model maintains a high rxi,xj at t=20 generations post-admixture despite low differentiability of potential mates. Each dot represents one simulation (5 replicates each for α{1,2,4,6,8,10}).

Given our broader objective of understanding mechanisms that could potentially explain the correlation in global ancestry proportion between spouses that has been reported in empirical data, in the following sections we focus on the two models that produce rxi,xj0 for tens of generations post-admixture: the increasing-preference model and the social group model. Specifically, we compare pairs of simulations performed under these models that produced the same value of rxi,xj at t=20 generations post-admixture, noting that we did not observe evidence for any effect of global ancestry proportion on reproductive success in these simulations (Figure S14; Figure S15).

The same correlation in global ancestry proportion between mates can be explained by multiple patterns of non-random mating

Though commonly used for empirical genetic studies, the correlation in global ancestry proportion between mates is not particularly informative about the underlying distribution of global ancestry proportion, x, within the population. As a general principle, the admixture process ultimately results in a shift in the distribution of x over time from bimodal to unimodal, as two discrete source populations with x{0,1} converge to a continuous distribution of intermediate x values. This shift to a unimodal distribution occurs within a single generation under random mating; under ancestry-biased mating, we expect the shift to be delayed, as individuals with more extreme x preferentially mate with one another.

We observed that the timing of this shift is delayed even more under the social group model than under the increasing-preference model, whether comparing pairs of simulations matched for mating bias strength, α (Figure S16) or for correlation in the global ancestry proportion, rxi,xj. Specifically, we considered the distributions of x at t=20 generations post-admixture, approximating the timing of the onset of admixture for many human populations with African and European ancestry (Zaitlen et al. 2017; Hamid et al. 2021; Korunes et al. 2022; Mas Sandoval et al. 2023; Mooney et al. 2023). At t=20, we found a unimodal distribution of ancestry proportion for simulations under the increasing-preference model for all α, whereas the social group model produced a bimodal distribution for simulations with α7. For simulations under the social group model with α<7, ancestry proportion was unimodal but with higher variance than simulations under the increasing-preference model with the same correlation between mates (Figure S17).

Differences in the distribution of global ancestry proportion, x, within the population between the increasing-preference and social group models are also reflected in the distribution of mating pairs, xi,xj at t=20 generations post-admixture. Under the increasing-preference model, the mating pairs form a single cluster with highest density around (0.5, 0.5), reflecting the original equal contributions from the two source populations (Figure 4a). In contrast, for a simulation under the social group model with the same correlation in global ancestry proportion between mates, rxi,xj, there were two distinct clusters of mating pairs, reflecting preferential mating within each of two subpopulations (Figure 4b). These clusters were identifiable by eye when rxi,xj0.4 (Figure S18). Thus, although the same correlation in global ancestry proportion between mates can be produced by both the increasing-preference and social group models, the underlying structure of non-random mating is not equivalent, and may be distinguishable when rxi,xj is large.

Figure 4.

Figure 4.

Simulations under the increasing-preference (α=5) and social group (α=7) models produced the same correlation in global ancestry proportion between mates, rxi,xj, at t=20 generations post-admixture but with different underlying mating structure. Hexagonal bin plots represent the correlation in global ancestry proportion between the two parents of individuals in generation t=20. Each hexagon corresponds to a bin of 0.025 global ancestry proportion units, with color encoding the scaled density (max=1) of mating pairs in each bin. (a) Under the increasing-preference model, mating pairs cluster around a single bin of maximum density. (b) In contrast, under the social group model, mating pairs form two clusters, representing preferential mating within each of two social groups. All 10,000 individuals are shown. A y=x line is shown for reference. (c, d) The same trends are observed when considering a subsample of n = 100 individuals. See Figure S18, Figure S19, and Figure S20.

Given that evidence for ancestry-assortative mating is often assessed in substantially smaller empirical datasets, we also examined whether these differences in mating structure could be observed in a subsample (n=100 individuals) of our simulated population. The correlation in global ancestry proportion between mates is typically visualized as a dot-plot (e.g., (Zou et al. 2015; Korunes et al. 2022)); we found that the two models looked similar when taking this approach (Figure S19). However, even with only 100 individuals, hexagonal bin plots (Carr et al. 1987) suggested the presence of two discrete clusters of mating pairs in simulations under the social group model with high α (and thus high rxi,xj) (Figure 4cd; Figure S20).

These results, along with similar results using the distribution of the absolute difference in global ancestry proportion between mate pairs, Δx=xi-xj, rather than summarizing it as a correlation coefficient (Figure S21), underscore that multiple mating structures can produce very similar correlation in global ancestry proportion between mates and highlight the utility of alternative visualizations for understanding the mating structure in a population of interest.

Non-random mating produced an excess of long local-ancestry tracts relative to random mating, leading to underestimates for the timing of admixture

Thus far, we have focused on the relationship between mate-choice model and the distribution of global ancestry proportion. We next considered the impact of model choice on the length distribution of local-ancestry tracts, which is used directly or indirectly (e.g., using admixture linkage disequilibrium as a proxy) to infer demographic parameters, including the time since the onset of admixture (Moorjani et al. 2011; Gravel 2012; Loh et al. 2013; Hellenthal et al. 2014). Prior work has shown that non-random mating disrupts the decay in local-ancestry tract length due to recombination, resulting in long local-ancestry tracts consistent with more recent admixture than truly occurred (Zaitlen et al. 2017; Korunes et al. 2022). Thus, some methods to correct for the systematic underestimation of the time since admixture due to assortative mating attempt to recapitulate local-ancestry tract length dynamics by matching the empirical correlation between spouses (Zaitlen et al. 2017).

We first compared the median local-ancestry tract length at t=20 generations post-admixture in simulations under the increasing-preference and social group models, matched for correlation in global ancestry proportion between mates, rxi,xj. Differences in median local-ancestry tract length were modest when comparing across α values within a single model, between these two mate-choice models, and between both models and random mating (Figure 5a). However, we did observe across all four mate-choice models that larger values of α were associated with longer median local-ancestry tract lengths at generation t=20 (Figure S22). Furthermore, under the increasing-preference and social group models — the two models under which rxi,xj0 at generation t=20 — greater rxi,xj was also associated with longer median local-ancestry tract length. Additionally, for a given value of rxi,xj, median local-ancestry tract length tended to be longest under the social group model (Figure 5b).

Figure 5.

Figure 5.

Local-ancestry tract length distributions were similar for simulations under the increasing-preference and social group models, when comparing simulations with similar correlation in global ancestry proportion between mates, rxi,xj. (a) The distribution of local-ancestry tract lengths at t=20 generations post-admixture is shown for three representative pairs of simulations, matched for rxi,xj. A random mating control simulation is shown in gray. Because the local-ancestry tract length distribution has a very long tail, the y-axis is truncated at the top whisker (75th percentile + 1.5 × the inter-quartile range). (b) Median local-ancestry tract length was similar across models and values of α. However, for the same rxi,xj, median local-ancestry tract length tended to be longer for simulations under the social group model relative to those under the increasing-preference model. Each dot represents one simulation (5 replicates each for α{1,2,4,6,8,10}).

To directly test the effects of the increasing-preference and social group models on downstream inference of admixture timing, we fit an exponential to the local-ancestry tract length distribution, with decay rate λ=(t+1)×m, where t represents generations post-admixture and m represents the proportion of individuals from source population 1 contributing to the founding of the admixed populations. From the fit, we then inferred that a single-pulse admixture event occurred t generations before the time of sampling (Equation 1 from (GRAVEL 2012)). We observed that the simulated distribution of local-ancestry tracts had an excess of long local-ancestry tracts at t=20 generations post-admixture relative the exponential fit, particularly for simulations with greater rxi,xj (Figure 6a, b). Although slight, this mismatch in distribution shape may prove useful as an indication in empirical data that the effects of non-random mating should be taken into consideration, although it may also be confused for evidence of continuous migration.

Figure 6.

Figure 6.

Non-random mating led to underestimation of the time since admixture. For each simulation, an exponential model was fit to the local-ancestry tract length distribution, with decay rate λ. (a) For the mean local-ancestry tract length, observed values (solid lines) were similar to those expected under the exponential fit (1/λ, dotted lines), indicating a good fit to the data. The dashed line at y=9.52 indicates the expected mean local-ancestry tract length for the true values of t=20 and m=0.5. (b) However, for the 95th percentile of local-ancestry tract length, observed values (solid lines) were larger than those expected under the exponential fit (-ln(1-0.95)/λ dotted lines), particularly when rxi,xj is high. The dashed line at y=28.53 indicates the expected 95th percentile of local-ancestry tract length for the true values of t=20 and m=0.5. (c-e) Estimated time since admixture was similar between the increasing-preference and social group models. As expected, inference was accurate under random mating (gray) and underestimated under both models of non-random mating, with increasing discrepancy between true and inferred values for larger rxi,xj.

As expected based on prior empirical work (Zaitlen et al. 2017; Korunes et al. 2022), we underestimated the true time since admixture, and the discrepancy between the truth and our inferred time increased as rxi,xj increased (Figure 6c). Intriguingly, we found that this discrepancy was established in the first few generations and then grew relatively slowly over time, suggesting that there might be a plateauing of the effect on longer timescales (Figure S23). For instance, time since admixture was underestimated by an average of 2.25 generations for t=20 and 4.01 generations for t=50 generations post-admixture. As a result, we observe a bias of similar magnitude under all four model at t=20 generations post-admixture, although rxi,xj0 under the stationary-preference and broad-preference models (Figure S24).

The effects of continuous migration on the correlation in global ancestry proportion between mates differed between models and values of α

Realistic models of human admixture likely include more complex dynamics than a single pulse of admixture followed by complete isolation from the source populations, as we have modeled above. To begin to explore the behavior of the increasing-preference and social group models under more complex demographic scenarios, we examined the trajectory of the correlation in global ancestry proportion between mates, rxi,xj, over time in a scenario with continuous migration, wherein 1% of the population in each generation was replaced with migrants from the two source populations. Relative to the single-pulse admixture scenario, we might expect rxi,xj to be smaller in the continuous-migration scenario for the same generation t and mating bias strength α because new migrants have fewer potential mates with similar global ancestry proportion to choose from compared to individuals born in the admixed population. Furthermore, variance in global ancestry proportion across individuals in the admixed population, σx2(t), decreases in each successive generation (Figure S12), meaning that new migrants have increasingly dissimilar global ancestry proportion to the average potential mate. On the other hand, mating events between two migrants from the same source population will increase rxi,xj.

We found that the balance between these countervailing effects on rxi,xj differed between models. Under the increasing-preference model, rxi,xj initially decreased over time, similar to the scenario with single-pulse admixture. However, for simulations with α4, there was a subsequent increase over time, resulting in a greater rxi,xj at t=20 generations post-admixture in the scenario with migration than the one without migration (Figure 7a). As noted above, the increasing-preference model compensates for the decrease in σx2(t) by increasing mate selectiveness over time (Figure S12). Consistent with this explanation for the difference between the single-pulse admixture and continuous migration simulations, we observed that the proportion of mating events between migrants increased over time in these simulations (Figure 7c). Additionally, under the other two ancestry-similarity models, which do not compensate for the decreased σx2(t) over time, we observed that the prevalence of mating events between migrants was constant and that the value of rxi,xj was similar with or without migration for a given generation t and α (Figure S25; Figure S26; Figure S27).

Figure 7.

Figure 7.

The effects of continuous migration differed between mate-choice models, driven by the percent of mating events between new migrants. (a, b) Correlation in global ancestry proportion between mates, rxi,xj increased over time in simulations under the increasing-preference model with continuous migration, for some α. This behavior was not observed in any simulations without migration (see Figure 2) or under the social group model with continuous migration. (c) Under the increasing-preference model with continuous migration, mating between migrants was increasingly prevalent over time, and far more frequent than expected under random mating. (d) Mating between migrants under the social group model with continuous migration did not occur more often than expected by chance.

In contrast, the social group model neither takes into consideration similarity in global ancestry proportion, nor increases the strength of mating bias over time. Thus, for simulations under this model, new migrants are equally likely to mate with anyone within their social group regardless of global ancestry proportion; as expected, we observed a constant prevalence in mating events between migrants (Figure 7d). Additionally, because σx2(t) decreases over time, migrants become increasingly dissimilar from potential mates. As a result, continuous migration always decreased rxi,xj under this model relative to the single-pulse admixture scenario, controlling for generation and α (Figure 7c).

Discussion

While it is appreciated that humans, like individuals in many other natural populations, do not choose mates at random, population-genetic theory and methods to account for assortative mating in empirical data remain largely underdeveloped. In the context of ancestry-assortative mating, extension of existing theory on how assortative mating shapes the expected distribution of a trait in a population (Wright 1921; Norris et al. 2019; Kim et al. 2021; Border et al. 2022; Muralidhar et al. 2022; Horwitz et al. 2023) is made difficult by ambiguity about what the relevant phenotype is. Furthermore, global ancestry proportion is an unusual quantitative phenotype, in that its trait value is integrated across every locus in the genome. How this genome-wide involvement might impact analyses, such as selection scans and association studies, that attempt to distinguish implicated loci from neutral loci remains unclear. Simulations of ancestry-assortative mating that accurately recapitulate key summaries of empirical data are crucial to unraveling these impacts. Here, we considered two related prerequisite questions: first, are there multiple mechanisms of mate choice compatible with the observed correlation in global ancestry proportion between spouses in human populations? Second, does the choice of a particular mathematical function for defining biased mating meaningfully impact the conclusions drawn from the resulting simulations? We compared four models, including one that considers social groups rather than quantitative similarity in global ancestry proportion as the mechanism of mate choice, to better understand how we ought to think about modeling and correcting for ancestry-assortative mating going forward.

We turn first to assumptions commonly made by statistical genetics methods (Zaitlen et al. 2017; Pfennig And Lachance 2023; Huang et al. 2024), which focus on the observed empirical correlation in global ancestry between mates: namely, that this correlation is constant over time and that parameters of interest are comparable between empirical data and simulated data with the same correlation coefficient. In our simulations, we find that rxi,xj is not stable over time and decays toward zero under most models (Figure 2). Thus, assuming that bias in mating is constant over time, the correlation in global ancestry proportion between spouses observed in a contemporary sample is likely smaller than it was in previous generations. In addition, we find that multiple models can produce the same rxi,xj while differing in the underlying mating structure (Figure 4) and the distribution of local-ancestry tract lengths (Figure 5). However, we find that the effects of these differences between models on estimated time since admixture are likely to be small, likely because the effects on mean local-ancestry tract length are similar (Figure 6).

Prior studies that have modeled biased mate choice to develop theory about assortative mating have either not considered how the decrease in variance in global ancestry proportion across individuals over time impacts the correlation in global ancestry proportion between mates (Kim et al. 2021; Muralidhar et al. 2022) or have explicitly modeled a constant correlation coefficient in the face of decreasing variance (Huang et al. 2024). Our work highlights that variance in global ancestry proportion across individuals plays an essential role in determining whether a positive correlation can be observed and, furthermore, whether mating is effectively random (Figure 3; Figure S12). Prior theory has also prioritized the role of individual mate-choice preference for similarity in global ancestry proportion (xi-xj). In humans particularly, there is a wide array of social science research to support the role of sociological factors in mating outcomes. To wit, we found that a simplistic model imposing a barrier to mating between two social groups was sufficient to generate signatures of ancestry-assortative mating that could be observed for 20–50 generations post-admixture. To our knowledge, this type of model has not been used before to model ancestry-assortative mating in admixed populations but may be representative of how social categories like race, ethnicity, and socioeconomic status influence spouse choice.

The four models that we consider in the present study are not the only possible models, but rather represent two classes of model worthy of further theoretical exploration: ancestry-similarity and social group models. For ancestry-similarity models, we have demonstrated that not all variants of this model type can sustain sufficiently high levels of variance in global ancestry proportion across individuals to continue to observe a positive correlation in global ancestry proportion between mates (at least, without additional in-migration events). The increasing-preference model presents a mathematically simple strategy to compensate for this decay in σx2(t): increasing the choosiness of individuals. While this is effective in maintaining a high rxi,xj, it is likely unrealistic in practice, making individuals too attuned to small differences between mates (Figure S2b). This overcompensation is heightened in the continuous-migration scenario, leading to unexpected (and undesirable) behavior (Figure 7). Future development of ancestry-similarity models should consider alternative approaches to tune the selectiveness of individuals over time. For instance, we might want to model a constant degree of bias (i.e., maxψi,jminψi,j) over time.

In including the social group model in this study, we aimed only to demonstrate that categorical social barriers can mediate ancestry-assortative mating, even when individuals do not directly use ancestry information to make mating decisions. To that end, we designed a proof-of-concept version of this model class, which is likely overly simplistic for drawing conclusions about real-world human populations. Future development of this class could include greater social complexity; for instance, more than two social groups, alternative rules for how individuals “inherit” their social group membership, and asymmetric barriers between groups. Each of these added layers could be implemented in different ways. For example, in a simulation with three groups, barriers might be more permeable between some pairs of groups than others.

A growing body of research suggests that patterns of assortative mating are not stable over time (e.g., (Mare 1991; Sunde et al. 2024)). Under an ancestry-similarity model, this could reflect changes in preference over time, potentially to account for decreasing differences between potential mates (as discussed above). Under a social group model, this could reflect changes in social mobility or acceptance of inter-group mating. Thus, modeling this type of change over time would be an interesting future direction for both classes of model, although it remains outside the scope of the present work.

Here, we focus exclusively on biased mate choice as a mechanism for generating a positive correlation in global ancestry proportion between mates, comparing potential models for implementing that mechanism. Future work should also consider other mechanisms that require non-Wright-Fisher frameworks to incorporate additional parameters (e.g., birth and death rates, dispersal rates, etc.). For instance, in real-world populations, geographic structure (i.e., isolation-by-distance over continuous space) is also a major driver of non-random mating with respect to ancestry. It remains an open question whether geography alone could produce patterns of assortative mating that resemble empirical observations, as well as how geography might interact with social barriers to shape the opportunities for individuals to encounter one another and act on biased mate preferences.

Taken together, our results emphasize that theory incorporating non-random mating must carefully consider how mate-choice is conceptualized and modeled. While our focus is primarily on how to better model ancestry-assortative mating, we also make a few recommendations for future empirical studies of ancestry-assortative mating in humans. First, observing a correlation in global ancestry proportion between spouses can reflect multiple mechanisms of mate choice and should not be interpreted as unequivocal evidence of a preference for mates with similar global ancestry proportion. Second, the magnitude of the correlation coefficient is limited by variance in global ancestry proportion across individuals. Thus, a positive correlation in global ancestry proportion in a real-world population should be interpreted as specific to the time point sampled. Third, there are multiple mating structures that can give rise to the same Pearson correlation coefficient, and visualization of these patterns (e.g., with a hexagonal bin plot or hurricane plot of Δx) may help to disambiguate these possibilities.

As with any simulation study, our results are limited by what we have elected to model or not model. We highlight two important caveats. First, we do not model genetic variation within and between source populations. Empirical analyses rely on these data to first infer global and local ancestry, a process that is sensitive to how researchers define discrete source populations from continuous human genetic variation. In bypassing this step, we cannot comment on how estimation error might interact with our results. Second, we have focused on a single pulse of admixture, turning to a continuous-migration scenario only to highlight an unexpected result from the increasing-preference model. More realistic models of human populations almost certainly involve complex migration dynamics, including multiple pulses of migration; differing contributions from source populations; changes in population size; asymmetries in mate preferences and in the ability of individuals to enact their mate choices; monogamous mating; and multi-way admixture. Inclusion of these additional factors into future models is likely to impact the results. However, each additional factor drastically expands the range of possible implementations and space of parameter values. As such, these more complex models are likely to be most useful when targeted to matching the parameters of a population of interest whose history is well understood and less well suited to a general exploration of parameter space.

Supplementary Material

Supplement 1
media-1.pdf (6.2MB, pdf)

Acknowledgments

We thank Joshua Schraiber (ORCiD: 0000-0002-7912-2195), Michael M. Hoffman (ORCiD: 0000-0002-4517-1562), John Barton (ORCiD: 0000-0003-1467-421X), Shyamalika Gopalan (ORCiD: 0000-0002-2608-8472), and members of the Goldberg lab for helpful discussions.

Funding

This work was supported by the National Institute of General Medical Sciences of the National Institutes of Health (R35 GM133481 to A.G. and R35 GM146926 to Z.A.S.).

Funding Statement

This work was supported by the National Institute of General Medical Sciences of the National Institutes of Health (R35 GM133481 to A.G. and R35 GM146926 to Z.A.S.).

Data Availability

All scripts used for simulations, analyses, and figures is available on GitHub at: https://github.com/agoldberglab/ancestry-assortative-mating-simulation.

References

  1. Arauna L. R., Bergstedt J., Choin J., Mendoza-Revilla J., Harmant C. et al. , 2022. The genomic landscape of contemporary western Remote Oceanians. Curr Biol 32: 4565–4575 e4566. [DOI] [PubMed] [Google Scholar]
  2. Avadhanam S., and Williams A. L., 2022. Simultaneous inference of parental admixture proportions and admixture times from unphased local ancestry calls. Am J Hum Genet 109: 1405–1420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Beleza S., Campos J., Lopes J., Araujo II, Hoppfer Almada A. et al. , 2012. The admixture structure and genetic variation of the archipelago of Cape Verde and its implications for admixture mapping studies. PLoS One 7: e51103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Blossfeld H.-P., 2009. Educational assortative marriage in comparative perspective. Annual Review of Sociology 35: 513–530. [Google Scholar]
  5. Bonilla C., Shriver M. D., Parra E. J., Jones A. and Fernandez J. R., 2004. Ancestral proportions and their association with skin pigmentation and bone mineral density in Puerto Rican women from New York city. Hum Genet 115: 57–68. [DOI] [PubMed] [Google Scholar]
  6. Border R., Athanasiadis G., Buil A., Schork A. J., Cai N. et al. , 2022. Cross-trait assortative mating is widespread and inflates genetic correlation estimates. Science 378: 754–761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Browning J. R., 1951. Anti-miscegenation laws in the united states. Duke Bar Journal 1: 26–41. [Google Scholar]
  8. Burger R., and Schneider K. A., 2006. Intraspecific competitive divergence and convergence under assortative mating. Am Nat 167: 190–205. [DOI] [PubMed] [Google Scholar]
  9. Burley N., 1983. The meaning of assortative mating. Ethology and Sociobiology 4: 191–203. [Google Scholar]
  10. Buss D. M., and Barnes M., 1986. Preferences in human mate selection. Journal of Personality and Social Psychology 50: 559–570. [Google Scholar]
  11. Carr D. B., Littlefield R. J., Nicholson W. L. and Littlefield J. S., 1987. Scatterplot matrix techniques for large n. Journal of the American Statistical Association 82: 424–436. [Google Scholar]
  12. Chiappori P.-A., 2020. The theory and empirics of the marriage market. Annual Review of Economics 12: 547–578. [Google Scholar]
  13. De La Mare J. K., and Lee A. J., 2023. Assortative preferences for personality and online dating apps: Individuals prefer profiles similar to themselves on agreeableness, openness, and extraversion. Personality and Individual Differences 208: 112185. [Google Scholar]
  14. Dieckmann U., and Doebeli M., 1999. On the origin of species by sympatric speciation. Nature 400: 354–357. [DOI] [PubMed] [Google Scholar]
  15. Domingue B. W., Fletcher J., Conley D. and Boardman J. D., 2014. Genetic and educational assortative mating among US adults. Proc Natl Acad Sci U S A 111: 7996–8000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fibla J., Maceda I., Laplana M., Guerrero M., Alvarez M. M. et al. , 2022. The power of geohistorical boundaries for modeling the genetic background of human populations: The case of the rural catalan Pyrenees. Front Genet 13: 1100440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fisher R. A., 1918. The correlation between relatives on the supposition of Mendelian inheritance. Transactions of the Royal Society of Edinburgh 52: 399–433. [Google Scholar]
  18. Fogel A. S., McLean E. M., Gordon J. B., Archie E. A., Tung J. et al. , 2021. Genetic ancestry predicts male-female affiliation in a natural baboon hybrid zone. Anim Behav 180: 249–268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Funk E. R., Mason N. A., Palsson S., Albrecht T., Johnson J. A. et al. , 2021. A supergene underlies linked variation in color and morphology in a holarctic songbird. Nat Commun 12: 6833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Goldberg A., Rastogi A. and Rosenberg N. A., 2020. Assortative mating by population of origin in a mechanistic model of admixture. Theor Popul Biol 134: 129–146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gravel S., 2012. Population genetics models of local ancestry. Genetics 191: 607–619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Greenwood J., Guner N., Kocharkov G. and Santos C., 2014. Marry your like: Assortative mating and income inequality. American Economic Review 104: 348–353. [Google Scholar]
  23. Haller B. C., Galloway J., Kelleher J., Messer P. W. and Ralph P. L., 2019. Tree-sequence recording in SLiM opens new horizons for forward-time simulation of whole genomes. Mol Ecol Resour 19: 552–566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Haller B. C., and Messer P. W., 2023. SLiM 4: Multispecies eco-evolutionary modeling. Am Nat 201: E127–E139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hamid I., Korunes K. L., Beleza S. and Goldberg A., 2021. Rapid adaptation to malaria facilitated by admixture in the human population of cabo verde. eLife 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hellenthal G., Busby G. B. J., Band G., Wilson J. F., Capelli C. et al. , 2014. A genetic atlas of human admixture history. Science 343: 747–751. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Henz U., and Mills C., 2017. Social class origin and assortative mating in Britain, 1949–2010. Sociology 52: 1217–1236. [Google Scholar]
  28. Horwitz T. B., Balbona J. V., Paulich K. N. and Keller M. C., 2023. Evidence of correlations between human partners based on systematic reviews and meta-analyses of 22 traits and UK biobank analysis of 133 traits. Nat Hum Behav 7: 1568–1583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Howe L. J., Battram T., Morris T. T., Hartwig F. P., Hemani G. et al. , 2021. Assortative mating and within-spouse pair comparisons. PLoS Genet 17: e1009883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Huang J., Kleman N., Basu S., Shriver M. D. and Zaidi A. A., 2024. Interpreting SNP heritability in admixed populations. bioRxiv. [Google Scholar]
  31. Jiang Y., Bolnick D. I. and Kirkpatrick M., 2013. Assortative mating in animals. Am Nat 181: E125–138. [DOI] [PubMed] [Google Scholar]
  32. Kalmijn M., 1998. Intermarriage and homogamy: Causes, patterns, trends. Annu Rev Sociol 24: 395–421. [DOI] [PubMed] [Google Scholar]
  33. Keddy-Hector A. C., 1992. Mate choice in non-human primates. American Zoologist 32: 62–70. [Google Scholar]
  34. Kim J., Edge M. D., Goldberg A. and Rosenberg N. A., 2021. Skin deep: The decoupling of genetic admixture levels from phenotypes that differed between source populations. Am J Phys Anthropol 175: 406–421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kirkpatrick M., 1982. Sexual selection and the evolution of female choice. Evolution 36: 1–12. [DOI] [PubMed] [Google Scholar]
  36. Klinkova E., Hodges J. K., Fuhrmann K., de Jong T. and Heistermann M., 2005. Male dominance rank, female mate choice and male mating and reproductive success in captive chimpanzees. International Journal of Primatology 26: 357–484. [Google Scholar]
  37. Kondrashov A. S., 1983. Multilocus model of sympatric speciation I. One character. Theoretical Population Biology 24: 121–135. [DOI] [PubMed] [Google Scholar]
  38. Kopp M., Servedio M. R., Mendelson T. C., Safran R. J., Rodriguez R. L. et al. , 2018. Mechanisms of assortative mating in speciation with gene flow: Connecting theory and empirical research. Am Nat 191: 1–20. [DOI] [PubMed] [Google Scholar]
  39. Korunes K. L., Soares-Souza G. B., Bobrek K., Tang H., Araujo II et al. , 2022. Sex-biased admixture and assortative mating shape genetic variation and influence demographic inference in admixed Cabo Verdeans. G3 (Bethesda) 12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lande R., 1981. Models of speciation by sexual selection on polygenic traits. Proc Natl Acad Sci U S A 78: 3721–3725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Liang M., Shishkin M., Shchur V. and Nielsen R., 2024. Understanding admixture fractions: Theory and estimation of gene-flow. J Math Biol 89: 47. [DOI] [PubMed] [Google Scholar]
  42. Loh P. R., Lipson M., Patterson N., Moorjani P., Pickrell J. K. et al. , 2013. Inferring admixture histories of human populations using linkage disequilibrium. Genetics 193: 1233–1254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Luo S., and Klohnen E. C., 2005. Assortative mating and marital quality in newlyweds: A couple-centered approach. J Pers Soc Psychol 88: 304–326. [DOI] [PubMed] [Google Scholar]
  44. Mare R. D., 1991. Five decades of educational assortative mating. American Sociological Review 56: 15–32. [Google Scholar]
  45. Mas Sandoval A., Mathieson S. and Fumagalli M., 2023. The genomic footprint of social stratification in admixing American populations. eLife 12: 2022.2011.2016.516754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Melo M. C., Salazar C., Jiggins C. D. and Linares M., 2009. Assortative mating preferences among hybrids offers a route to hybrid speciation. Evolution 63: 1660–1665. [DOI] [PubMed] [Google Scholar]
  47. Mooney J. A., Agranat-Tamir L., Pritchard J. K. and Rosenberg N. A., 2023. On the number of genealogical ancestors tracing to the source groups of an admixed population. Genetics 224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Moorjani P., Patterson N., Hirschhorn J. N., Keinan A., Hao L. et al. , 2011. The history of african gene flow into southern europeans, levantines, and jews. PLoS Genet 7: e1001373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Muralidhar P., Coop G. and Veller C., 2022. Assortative mating enhances postzygotic barriers to gene flow via ancestry bundling. Proc Natl Acad Sci U S A 119: e2122179119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Nagoshi C. T., Johnson R. C. and Danko G. P., 1990. Assortative mating for cultural identification as indicated by language use. Behav Genet 20: 23–31. [DOI] [PubMed] [Google Scholar]
  51. Nagylaki T., 1978. The correlation between relatives with assortative mating. Ann Hum Genet 42: 131–137. [DOI] [PubMed] [Google Scholar]
  52. Natola L., Seneviratne S. S. and Irwin D., 2022. Population genomics of an emergent tri-species hybrid zone. Mol Ecol 31: 5356–5367. [DOI] [PubMed] [Google Scholar]
  53. Norris E. T., Rishishwar L., Wang L., Conley A. B., Chande A. T. et al. , 2019. Assortative mating on ancestry-variant traits in admixed latin American populations. Front Genet 10: 359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Otto S. P., Servedio M. R. and Nuismer S. L., 2008. Frequency-dependent selection and the evolution of assortative mating. Genetics 179: 2091–2112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Parra F. C., Amado R. C., Lambertucci J. R., Rocha J., Antunes C. M. et al. , 2003. Color and genomic ancestry in Brazilians. Proc Natl Acad Sci U S A 100: 177–182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Pennings P. S., Kopp M., Meszena G., Dieckmann U. and Hermisson J., 2008. An analytically tractable model for competitive speciation. Am Nat 171: E44–71. [DOI] [PubMed] [Google Scholar]
  57. Pfennig A., and Lachance J., 2023. Challenges of accurately estimating sex-biased admixture from X chromosomal and autosomal ancestry proportions. Am J Hum Genet 110: 359–367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Powell D. L., Moran B. M., Kim B. Y., Banerjee S. M., Aguillon S. M. et al. , 2021. Two new hybrid populations expand the swordtail hybridization model system. Evolution 75: 2524–2539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Rettelbach A., Kopp M., Dieckmann U. and Hermisson J., 2013. Three modes of adaptive speciation in spatially structured populations. Am Nat 182: E215–234. [DOI] [PubMed] [Google Scholar]
  60. Risch N., Choudhry S., Via M., Basu A., Sebro R. et al. , 2009. Ancestry-related assortative mating in Latino populations. Genome Biol 10: R132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Robinson C. E., Thyagarajan H. and Chippindale A. K., 2023. Evolution of reproductive isolation in a long-term evolution experiment with Drosophila melanogaster: 30 years of divergent life-history selection. Evolution 77: 1756–1768. [DOI] [PubMed] [Google Scholar]
  62. Robinson M. R., Kleinman A., Graff M., Vinkhuyzen A. A. E., Couper D. et al. , 2017. Genetic evidence of assortative mating in humans. Nature Human Behaviour 1: 0016. [Google Scholar]
  63. Schumer M., Powell D. L., Delclos P. J., Squire M., Cui R. et al. , 2017. Assortative mating and persistent reproductive isolation in hybrids. Proc Natl Acad Sci U S A 114: 10936–10941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Schwartz C. R., 2013. Trends and variation in assortative mating: Causes and consequences. Annual Review of Sociology 39: 451–470. [Google Scholar]
  65. Sebro R., Hoffman T. J., Lange C., Rogus J. J. and Risch N. J., 2010. Testing for non-random mating: Evidence for ancestry-related assortative mating in the Framingham heart study. Genet Epidemiol 34: 674–679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Sebro R., Peloso G. M., Dupuis J. and Risch N. J., 2017. Structured mating: Patterns and implications. PLoS Genet 13: e1006655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Sebro R., and Risch N. J., 2012. A brief note on the resemblance between relatives in the presence of population stratification. Heredity (Edinb) 108: 563–568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Seger J., 1985. Unifying genetic models for the evolution of female choice. Evolution 39: 1185–1193. [DOI] [PubMed] [Google Scholar]
  69. Setchell J. M., and Wickings E. J., 2006. Mate choice in male mandrills (mandrillus sphinx). Ethology 112: 91–99. [Google Scholar]
  70. Shriver M. D., Parra E. J., Dios S., Bonilla C., Norton H. et al. , 2003. Skin pigmentation, biogeographical ancestry and admixture mapping. Hum Genet 112: 387–399. [DOI] [PubMed] [Google Scholar]
  71. Smadja C. M., Loire E., Caminade P., Severac D., Gautier M. et al. , 2022. Divergence of olfactory receptors associated with the evolution of assortative mating and reproductive isolation in mice. Peer Community Journal 2. [Google Scholar]
  72. Smieja M., and Stolarski M., 2018. Assortative mating for emotional intelligence. Curr Psychol 37: 180–187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Spear M. L., Diaz-Papkovich A., Ziv E., Yracheta J. M., Gravel S. et al. , 2020. Recent shifts in the genomic ancestry of Mexican Americans may alter the genetic architecture of biomedical traits. eLife 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Sunde H. F., Eftedal N. H., Cheesman R., Corfield E. C., Kleppesto T. H. et al. , 2024. Genetic similarity between relatives provides evidence on the presence and history of assortative mating. Nat Commun 15: 2641. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Tenesa A., Rawlik K., Navarro P. and Canela-Xandri O., 2016. Genetic determination of height-mediated mate choice. Genome Biol 16: 269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Torche F., 2010. Educational assortative mating and economic inequality: A comparative analysis of three latin American countries. Demography 47: 481–502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Tung J., Charpentier M. J., Mukherjee S., Altmann J. and Alberts S. C., 2012. Genetic effects on mating success and partner choice in a social mammal. Am Nat 180: 113–129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Van Belle S., Estrada A., Ziegler T. E. and Strier K. B., 2009. Sexual behavior across ovarian cycles in wild black howler monkeys (alouatta pigra): Male mate guarding and female mate choice. Am J Primatol 71: 153–164. [DOI] [PubMed] [Google Scholar]
  79. Veller C., and Coop G. M., 2024. Interpreting population- and family-based genome-wide association studies in the presence of confounding. PLoS Biol 22: e3002511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Verdu P., and Rosenberg N. A., 2011. A general mechanistic model for admixture histories of hybrid populations. Genetics 189: 1413–1426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Woodman J. P., Cole E. F., Firth J. A., Perrins C. M. and Sheldon B. C., 2023. Disentangling the causes of age-assortative mating in bird populations with contrasting life-history strategies. J Anim Ecol 92: 979–990. [DOI] [PubMed] [Google Scholar]
  82. Wright S., 1921. Systems of mating. III. Assortative mating based on somatic resemblance. Genetics 6: 144–161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Wright S., 1950. Genetical structure of populations. Nature 166: 247–249. [DOI] [PubMed] [Google Scholar]
  84. Xie Y., Cheng S. and Zhou X., 2015. Assortative mating without assortative preference. Proc Natl Acad Sci U S A 112: 5974–5978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Yamamoto K., Sonehara K., Namba S., Konuma T., Masuko H. et al. , 2023. Genetic footprints of assortative mating in the Japanese population. Nat Hum Behav 7: 65–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Zaitlen N., Huntsman S., Hu D., Spear M., Eng C. et al. , 2017. The effects of migration and assortative mating on admixture linkage disequilibrium. Genetics 205: 375–383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Zou J. Y., Park D. S., Burchard E. G., Torgerson D. G., Pino-Yanes M. et al. , 2015. Genetic and socioeconomic study of mate choice in Latinos reveals novel assortment patterns. Proc Natl Acad Sci U S A 112: 13621–13626. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.pdf (6.2MB, pdf)

Data Availability Statement

All scripts used for simulations, analyses, and figures is available on GitHub at: https://github.com/agoldberglab/ancestry-assortative-mating-simulation.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES