Skip to main content
Genetics logoLink to Genetics
. 2025 Feb 7;229(4):iyaf022. doi: 10.1093/genetics/iyaf022

Differentiating mechanism from outcome for ancestry-assortative mating in admixed human populations

Dashiell J Massey 1, Zachary A Szpiech 2, Amy Goldberg 3,✉,2
Editor: S Gravel
PMCID: PMC12005266  PMID: 39919029

Abstract

Population genetic theory, and the empirical methods built upon it, often assumes that individuals pair randomly for reproduction. However, natural populations frequently violate this assumption, which may potentially confound genome-wide association studies, selection scans, and demographic inference. Within several recently admixed human populations, empirical genetic studies have reported a correlation in global ancestry proportion between spouses, referred to as ancestry-assortative mating. Here, we use forward genomic simulations to link correlations in global ancestry proportion between mates to the underlying mechanistic mate choice process. We consider the impacts of 2 types of mate choice model, using either ancestry-based preferences or social groups as the basis for mate pairing. We find that multiple mate choice models can produce the same correlations in global ancestry proportion between spouses; however, we also highlight alternative analytic approaches and circumstances in which these models may be distinguished. With this work, we seek to highlight potential pitfalls when interpreting correlations in empirical data as evidence for a particular model of human mating practices and to offer suggestions toward development of new best practices for analysis of human ancestry-assortative mating.

Keywords: assortative mating, human admixture, genetic ancestry, global ancestry proportion, local ancestry tracts, forward genetic simulation, individual-based simulations, SLiM

Introduction

Nonrandom mating has long been appreciated as an important source of genetic structure in natural populations (Fisher 1918; Wright 1921, 1950; Nagylaki 1978). Positive assortative mating (hereafter, assortative mating), wherein genotypic or phenotypic trait values are positively correlated between mates, has been empirically observed across animal species (Jiang et al. 2013). Theoretical and empirical studies have demonstrated the consequences of assortative mating for speciation and hybridization (Kondrashov 1983; Otto et al. 2008; Tung et al. 2012; Schumer et al. 2017; Kopp et al. 2018; Powell et al. 2021; Muralidhar et al. 2022; Natola et al. 2022; Smadja et al. 2022; Robinson et al. 2023) and for the distributions of traits within populations (Wright 1921; Norris et al. 2019; Kim et al. 2021; Border et al. 2022; Muralidhar et al. 2022; Horwitz et al. 2023).

In nonhuman primates that live in complex and structured social environments, sociodemographic factors have also been demonstrated to influence mate pair formation (Keddy-Hector 1992; Klinkova et al. 2005; Setchell and Wickings 2006; Van Belle et al. 2009; Tung et al. 2012; Fogel et al. 2021). As such, when it comes to humans, assortative mating has been the purview not only of biologists but also of social scientists (e.g. Buss and Barnes 1986; Mare 1991; Kalmijn 1998; Luo and Klohnen 2005; Blossfeld 2009; Torche 2010; Schwartz 2013; Greenwood et al. 2014; Henz and Mills 2017; Smieja and Stolarski 2018; Chiappori 2020; De La Mare and Lee 2023). Positive correlations have been reported between human spouses for a diverse array of phenotypes, including morphometric measurements, health outcomes, personality traits, lifestyle factors, age, socioeconomic status, educational attainment, religious affiliation, and language (Nagoshi et al. 1990; Robinson et al. 2017; Fibla et al. 2022; Horwitz et al. 2023; Yamamoto et al. 2023). These associations are likely driven by multiple generative processes, including individual mate choice preferences, phenotypic convergence over time facilitated by cohabitation, and social structures that restrict or promote particular pairings. These processes may act individually or in concert. For instance, sociological literature on assortative mating by educational attainment has revealed the influence of both a preference for mate similarity (Kalmijn 1998) and social barriers to marriage across socioeconomic class (Torche 2010) and that the strength of assortment by education over time is sensitive to the degree of temporal overlap between the end of schooling and average age at the time of marriage (Mare 1991), suggesting that social milieu is likely also an important factor in structuring mate pair outcomes.

Within the field of population genetics, relatively less attention has been paid to understanding the contributions of these generative processes to producing the observed correlations between mating pairs. Indeed, mechanism and outcome are often conflated or not clearly delineated when discussing assortative mating, which obscures these multiple mechanisms that may contribute to mate similarity. When studies do consider mechanism, they typically draw on foundational work in the sexual selection literature modeling the effects of female choice on male phenotype (Lande 1981; Kirkpatrick 1982; Seger 1985), in which the mechanism driving assortative mating is assumed to be mate choice. To extend this framework beyond sexual dimorphism, the mate choice mechanism is modeled as a preference for mates that “match” an individual's own phenotype (Kopp et al. 2018; Goldberg et al. 2020; Kim et al. 2021; Muralidhar et al. 2022). However, some studies have suggested that temporal structure in the mating process (Xie et al. 2015; Woodman et al. 2023) or nonuniform ability to attract mates (Burley 1983) can generate phenotypic correlations between mates without an explicit preference for phenotypic similarity. In this paper, we use the term “assortative mating” to refer specifically to an empirical observation of greater resemblance between mates than expected by chance, agnostic to mechanism.

Reports of assortative mating by ancestry in humans provide a particularly intriguing example for exploring mechanism because genetic ancestry is a complex quantitative “phenotype” that involves every locus in the genome and that can only be ascertained by sequencing. In recently admixed human populations, in which individuals derive ancestry from multiple source populations, a positive correlation in global ancestry proportion between spouses has been observed. This phenomenon, referred to as ancestry-assortative mating, has been reported in multiple Latino populations (Risch et al. 2009; Zou et al. 2015; Spear et al. 2020; Mas Sandoval et al. 2023), as well as in African-Americans (Zaitlen et al. 2017; Avadhanam and Williams 2022), Cabo Verdeans (Korunes et al. 2022), and Ni-Vanuatu (Arauna et al. 2022). Ancestry-associated mating patterns, inferred from excess sharing of single-nucleotide variants, have also been reported for non-Hispanic white spouse pairs in the United States (Sebro et al. 2010, 2017; Domingue et al. 2014). Empirical studies commonly use such correlations in global ancestry proportion between spouses to suggest that ancestry shapes mate choice, although this interaction would necessarily be mediated by a proxy phenotype of some type not by quantitative ancestry proportion. However, the relative contributions of individual preferences and broader social mechanisms to generating these correlations are unclear.

Accounting for assortative mating is critical for accurate statistical and population genetic analysis in admixed human populations: it has been shown that assortative mating confounds genome-wide association studies (Howe et al. 2021; Border et al. 2022; Veller and Coop 2024) and heritability estimation (Sebro and Risch 2012; Tenesa et al. 2016; Huang et al. 2024). In addition, ancestry-assortative mating specifically has been shown to produce estimates for the timing of admixture that are too recent (Zaitlen et al. 2017; Goldberg et al. 2020; Korunes et al. 2022). However, it remains challenging to establish a null expectation in the presence of assortative mating without understanding the underlying mechanism at play.

To date, statistical methods to correct for ancestry-assortative mating in empirical data have tended to generate null models by simulating data to match the observed empirical correlation in global ancestry proportion between mating pairs, often without a specific mechanistic model (Zaitlen et al. 2017; Spear et al. 2020; Pfennig and Lachance 2023; Huang et al. 2024). This approach makes 2 important assumptions: first, the correlation coefficient for a given population has remained constant over time and second, matching the correlation coefficient is sufficient to recapitulate the dynamics of global and local ancestry. In contrast, theoretical population genetic studies have examined the behavior of these same parameters using mechanistic models of biased mate choice (Goldberg et al. 2020; Kim et al. 2021; Muralidhar et al. 2022). This approach assumes that ancestry-based preference is the primary mechanism driving nonrandom mating. These studies typically do not directly consider whether the model used results in an observed correlation in global ancestry proportion between mates, which leaves open the question of how applicable their conclusions are to the context of empirical studies of human ancestry-assortative mating.

Here, we used forward-in-time simulations to probe the relationship between mate choice mechanism and observed correlation in global ancestry proportion, comparing variants of 2 classes of mate choice model. The first model class considers only individual mate choice preferences for similarity in global ancestry proportion, treating genetic ancestry as a continuous quantitative trait. The second class considers only discrete social groups, assigning each individual to 1 of 2 groups and limiting cross-group mating events. (Importantly, group membership is “inherited” from one parent, and group identities are initially associated with source populations; see Methods.) We found that ancestry-similarity and social group models can produce similar correlations in global ancestry proportion between mates, suggesting that either or both mechanisms could be relevant in interpreting empirical data. Furthermore, the correlation coefficient was not constant over time in any of our simulations, and not all variants of the ancestry-similarity class or all parameter values of the social group model maintained sufficient population variance to sustain correlations in global ancestry proportion over tens of generations. Our results highlight important caveats about the assumptions behind both existing approaches and suggest additional analyses of empirical data that may help reveal the underlying mechanism of ancestry-assortative mating in human populations.

Methods

Simulation framework

We performed forward-in-time simulations of admixture with equal contributions from 2 source populations using SLiM 4.0.1 (Haller et al. 2019; Haller and Messer 2023). We modeled sexually reproducing diploid individuals (census population size N=10,000/generation) with 22 independently segregating chromosomes, representing the size distribution of human autosomes (total genome size L=2.88 Gb), with a uniform recombination rate r=1×108. Unless otherwise noted, there was a single pulse of admixture and no additional contributions from the source populations were introduced at later generations. We did not model any type of 1D or 2D spatial relationships between individuals, nor did we simulate genotypes.

For each of 50 generations post-admixture, we extracted the position and source of local ancestry tracts from the tree sequence (Haller et al. 2019). From these data, we calculated the total proportion of the genome that each individual derived from source population 1 (hereafter, global ancestry proportion, x[0,1]). Separately, we tracked the social group s{A,B} to which each individual belonged: offsprings were assigned to the same social group as their first parent—i.e. without reference to their global ancestry proportion. We focus primarily on results from the first 20 generations post-admixture, approximately corresponding to the timing of African-European admixture initiated by the trans-Atlantic slave trade (Zaitlen et al. 2017; Hamid et al. 2021; Korunes et al. 2022; Mas Sandoval et al. 2023; Mooney et al. 2023), with a subset of results over longer time periods in the supplementary material.

We considered 2 broad classes of mate choice models: (1) ancestry-similarity, wherein the probability of an individual with global ancestry proportion xi mating with an individual with global ancestry proportion xj is defined in terms of |xixj| and (2) social group, wherein the probability of an individual belonging to social group si mating with an individual belonging to social group sj is defined by whether or not si=sj.

To implement nonrandom mating in SLiM, we started from the default Wright–Fisher framework, which is designed to allow females to be selective if a mateChoice() callback is used. In this work, we did not distinguish males and females, instead allowing each individual to potentially serve as both parent 1 (selector) and parent 2 (selected) in sequential mating events. For each mating event, parent 1 was uniformly sampled with replacement from the population, while parent 2 was sampled proportional to mating weight ψi,j, calculated according to the specified mate choice function (defined below) evaluated for f(xi,xj) or f(si,sj), under the ancestry-similarity and social group models, respectively (Fig. 1). Incidental selfing was explicitly prohibited, and each mating event generated a single child.

Fig. 1.

Fig. 1.

Four functions for defining the sampling weight, ψi,j, assigned to individual j as a potential mate for a focal individual i, based on global ancestry proportion xj a–c) or social group sj d). For each mating event, mate 1 was uniformly sampled from the population, and mate 2 was sampled proportional to ψi,j. For the sake of visualization, values of ψi,j have been scaled such that ψi,j=1 means that the probability of individual j being selected as a potential mate for individual i is 1/n (n, number of potential mates). All mate choice functions are reparameterized in terms of α. a) The stationary preference variant of the ancestry-similarity model [Equation (2)] defines ψi,j as a function of the difference in global ancestry proportion between individuals, |xixj|. b) Under the increasing preference variant of the mate preference model [Equation (3)], the value of ψi,j is scaled to the variance in global ancestry proportion, σx2(t), across all potential mates, resulting in increasing choosiness over time. ψi,j values are shown for t{1,2,3} and α=4, assuming σx2(t)=2t2 (Liang et al. 2024). c) Under the broad preference variant of the ancestry-similarity model [Equation (4)], ψi,j follows the probability density function of a normal distribution with mean xi. d) Under the social group model (Equation 5), ψi,j can take 1 of 2 discrete values, determined by whether a potential mate belongs to the same social group as the focal individual (si=sj) or to the other social group (sisj).

Defining a common parameter for model comparison

The mate choice functions we considered (Equations 25, below) differ in how they parameterize the strength of mating bias; to aid in comparison across models, we will refer throughout to α[0,), the frequency of mating events between individuals from opposite source populations. Specifically, we define α to be inversely related to the proportion of admixed offspring in the initial generation post-contact, A, assuming that the 2 source populations admix in equal proportions. We have:

α=1A1. (1)

That is, at time of population contact, individuals are α times more likely to choose a mate from within their own source population than from the opposite source population. Increasing values of α represents an increasing bias toward endogamy, with α=0 corresponding to exclusive exogamy, α(0,1) to negative assortative mating, α=1 to random mating, and α>1 to positive assortative mating. Given our focus on mechanisms that might explain empirically observed positive correlations in global ancestry proportion between mates, we consider only α1. While our models differ in their dynamics over time, simulations with the same α have the same correlation in global ancestry proportion between mates, r(xi,xj)[0,1], for the first generation post-admixture under all models.

Ancestry-similarity models

Individual-based mate preference models originate in the sexual selection literature, where they were developed for studying the coevolutionary dynamics of male secondary sex characteristics and female mate choice (Lande 1981; Kirkpatrick 1982; Seger 1985). Generally, under these models, male phenotype and female preference are controlled by distinct genetic loci (or sets of loci). In the speciation literature, an alternative family of models is concerned with assortative mating as a mechanism of sympatric speciation and model mate preference based on phenotypic similarity between mates for some ecologically relevant trait (Dieckmann and Doebeli 1999; Burger and Schneider 2006; Pennings et al. 2008; Rettelbach et al. 2013). These “matching rule” models (Kopp et al. 2018) are the basis for our ancestry-similarity model class, wherein individuals preferentially choose mates to maximize similarity in phenotype (i.e. global ancestry proportion). Importantly for the models we consider here, these are fixed relative preference models: a focal individual i assigns each potential mate a preference value ψi,j that depends only on the potential mate's phenotype and is density independent (Seger 1985).

Ancestry-similarity models define a mate choice function ψi,j=f(xi,xj) for calculating the sampling weight assigned to individual j with global ancestry proportion xj as a potential mate for a focal individual i with global ancestry proportion xi. In the present study, we considered 3 variants of the ancestry-similarity model identified in a literature search, which differ in the precise definition of the mate choice function f.

Models of ancestry-based assortative mating typically model f(x1,xi) as an exponential function, such that ψi,j decays exponentially at rate c=ln(α) as |xixj| increases. We define:

ψi,j=ec|xixj|. (2)

We refer to this as the stationary preference model [“like-with-like” model in (Muralidhar et al. 2022)]: for a given absolute difference in global ancestry proportion, |xixj|, the associated preference ψi,j does not change over time. It is important to note, however, that the range of ψi,j is constrained by xi: an individual i with global ancestry proportion xi=0.5 is more similar to all potential mates than an individual with xi=0 and is therefore less selective in choosing a mate (Fig. 1a). Furthermore, without additional migration from the source populations, variance in global ancestry proportion across individuals decreases over time (Supplementary Fig. 2), meaning that the pool of potential mates is becoming increasingly homogenous and the ratio of max(ψi,j)/min(ψi,j) approaches 1 for all values of xi (Supplementary Fig. 1a). In other words, all individuals become less selective in choosing mates over successive generations.

One approach to account for the decreased variance in global ancestry proportion over time, and thus preserve mate selectiveness, is to decrease c, rather that holding it constant. Following (Kim et al. 2021), we rescale c in each generation t using the variance in global ancestry proportion observed in that generation, σx2(t):

ψi,j=ecσx2(t)|xixj|. (3)

We refer to this as the increasing preference model [equation A2 in the study by Kim et al. (2021)]. Under this model, as under the stationary preference model, an individual i with global ancestry proportion xi=0.5 is less selective than one with xi=0 (Fig. 1b). However, because σx2(t) decreases over time, there is an increase in mate selectiveness over successive generations under this model for a given xi (Supplementary Figs. 1b and 3).

We also considered a third ancestry-similarity model, employing a Gaussian—rather than exponential—mate choice function. This approach is commonly used in models of assortative mating based on a quantitative trait (e.g. Dieckmann and Doebeli 1999; Pennings et al. 2008; Funk et al. 2021), although it has not to our knowledge been used to model assortative mating based on global ancestry proportion. Under this model, ψi,jN(xi,σ2) where xi is the global ancestry proportion of the focal individual and the constant σ2=12ln(α1) determines the strength of mate choice preference. Again, ψi,j decreases as the difference in global ancestry proportion between potential mates increases:

ψi,j=1σ2πe(xixj)22σ2. (4)

We refer to this as the broad preference model because f(xi,xj) decays more gradually with increasing differences in global ancestry proportion, relative to the stationary preference and increasing preference models (Fig. 1c). Thus, a large difference in global ancestry proportion from a potential mate is strongly disfavored, while smaller differences within the pool of “similar” mates are not weighed strongly. Compared with the other ancestry-similarity models, individuals are less selective in choosing mates under the broad preference model; however, as under the stationary preference model, they also become less selective over time (Supplementary Figs. 1c and 4).

Social group model

An alternative class of models considers preferential mating based on a categorical, rather than quantitative, trait. This type of model is often used in the context of interspecific hybrid zones, wherein nonrandom mating can maintain species boundaries and/or promote hybrid speciation (Melo et al. 2009; Schumer et al. 2017; Powell et al. 2021; Natola et al. 2022; Smadja et al. 2022). Mechanistically, this can be construed as a model of biased mating by species identity or source population (Goldberg et al. 2020). However, application of this type of model in humans is not straightforward: the source populations in question are not biologically distinct species, but rather labels describing socially and geographically defined boundaries that have changed over time. Furthermore, it is unclear how we ought to model the mating behavior of admixed individuals: neither random mating by admixed individuals nor a simple preference of admixed individuals to mate with one another seems likely to produce the observed correlation in global ancestry proportion between mates. Yet, the strongest evidence for ancestry-assortative mating in humans is observed in contexts where both spouses have admixed ancestry (Risch et al. 2009; Zou et al. 2015; Zaitlen et al. 2017; Spear et al. 2020; Avadhanam and Williams 2022; Korunes et al. 2022; Mas Sandoval et al. 2023).

Social identity is an important organizer of pair formation in humans, as evidenced by widespread cultural practices of endogamy based on race, ethnicity, religion, socioeconomic status, and caste. Although distinct from genetic ancestry, social categorizations often interact with ancestry. For instance, skin pigmentation, a prominent contributor to how an individual is racialized, is correlated with West African-related global ancestry proportion in multiple populations with recent admixture history (Parra et al. 2003; Shriver et al. 2003; Bonilla et al. 2004; Beleza et al. 2012). Race has also served as the basis for numerous sociolegal barriers to mating across contexts, including anti-miscegenation laws (Browning 1951). This type of interaction opens the possibility that social barriers to mating might be “recorded” in patterns of ancestry-assortative mating detectable by genomic analyses.

We considered a simple social group model with 2 groups, with sampling weight ψi,j determined by the social group identity of the focal individual and potential mate, si and sj, respectively, with preference strength α (Fig. 1d; Supplementary Fig. 5). Rearranging Equation (1):

ψi,j={11α+1,|si=sj1α+1,|sisj. (5)

To generate an association between social group membership and global ancestry proportion, we assigned individuals in each source population to the same social group prior to admixture. However, from the onset of population contact, individuals “inherited” their social group membership from the first parent in the mating pair. Thus, individuals whose parents belonged to opposite social groups were equally likely to be assigned to either social group.

Several features distinguish the social group model from the ancestry-similarity models described above or from one in which individuals use an observable quantitative trait (e.g. skin pigmentation) as a proxy for an unobserved trait (global ancestry proportion). First, social group membership is correlated with—but not determined by—genotype. This matters because it has previously been demonstrated that recombination is expected to decouple the relationship between global ancestry proportion and a proxy quantitative trait over time, leading to only a transient correlation between the 2 phenotypes (Kim et al. 2021). Because social group membership is not genetic, it is not impacted by recombination. In addition, individuals are uniformly likely to cross the social group barrier in choosing a mate and do not express relative preferences among potential mates within a social group. Thus, in contrast to an ancestry-similarity model, individuals with intermediate ancestry are not less selective than individuals with more source-like ancestry in choosing a mate.

However, these features of the social group model also mean that the association between global ancestry proportion and social group membership is expected to decay over successive generations, and we explored this relationship quantitatively over time.

Results

Not all simulations of nonrandom mating show a positive correlation in global ancestry proportion between mates 20 generations post-admixture

Ancestry-assortative mating is often inferred from a positive correlation in global ancestry proportion between spouses at single point in time and statistical methods to account for nonrandom mating typically generate null models using simulations where potential mate pairs are permuted until the empirically observed correlation is achieved (Zaitlen et al. 2017; Pfennig and Lachance 2023; Huang et al. 2024). This approach assumes that the correlation observed in the present-day samples has remained constant since the start of admixture. Figure 2 plots the Pearson correlation coefficient for global ancestry proportion between mates, r(xi,xj), over time under 3 variants of the ancestry-similarity model or the social group model, for multiple strengths of mating bias, α, and following a single pulse of admixture. In all scenarios of nonrandom mating that we considered, we observed positive correlations in global ancestry proportion between mates (Fig. 2; Supplementary Fig. 6). However, contrary to the assumption that the correlation observed in a contemporary sample should be modeled as a fixed value, the observed correlation coefficients in our simulations decayed over time. In simulations performed under the increasing preference model, r(xi,xj), reached a stable plateau within the first 10 generations post-admixture across values of α (Supplementary Fig. 7a). In contrast, simulations performed under the other 2 variants of the ancestry-similarity model (stationary preference or broad preference) approached r(xi,xj)=0 within 20 generations post-admixture, unless α was sufficiently large to disrupt the admixture process altogether (Supplementary Figs. 8 and 9). Under the social group model, positive values of r(xi,xj) could be observed 20 generations post-admixture for α>2, although these models also approached r(xi,xj)=0 on longer time scales (Supplementary Fig. 7b).

Fig. 2.

Fig. 2.

Correlation in global ancestry proportion between mates, r(xi,xj), was not constant over time and decayed to near-zero within 20 generations post-admixture in some simulations. Simulations under the stationary preference [Equation (2)] and broad preference [Equation (4)] models did not simultaneously maintain r(xi,xj)0 while also allowing for a large population with intermediate global ancestry proportion values, whereas those under the increasing preference [Equation (3)] and social group [Equation (5)] models did for some α. By definition, r(xi,xj) in the first generation after admixture is the same across all simulations performed with the same value of α, regardless of mate choice model. For the sake of comparison, the same replicate random mating simulations (α=1) are reproduced in each subplot. See Supplementary Fig. 6.

To formally test whether the decay in r(xi,xj) meant that mating was effectively random at t=20 generations postadmixture under the stationary preference and broad preference models, we permuted the mating pairs and recalculated r(xi,xj) for 1,000 permutations of the simulated data. For the stationary preference model, we found examples in which r(xi,xj) was close to—but still significantly different from—zero by a permutation test. For instance, for the 5 replicate simulations with α=10, r(xi,xj)[0.0182,0.0444] for t=20 generations post-admixture. In 2 of these simulations, the observed r(xi,xj){0.0399,0.0444} never overlapped the permuted distribution (empirical P<0.001). In contrast, r(xi,xj) always overlapped the permuted distribution after t=11 generations under the broad preference model (α=10) (Supplementary Fig. 10).

Thus, not all variants of the ancestry-similarity model generate a positive correlation in global ancestry proportion between mates that is observable 20 generations post-admixture while also allowing for a large population of individuals with intermediate global ancestry proportion. Conversely, results for the social group model demonstrate that even relatively weak social barriers can produce a positive correlation in global ancestry proportion between mates on the timescale of tens of generations, even as the association between social group membership and global ancestry proportion diminishes over time (Supplementary Fig. 11).

Relationship between correlation in global ancestry proportion between mates and variance in global ancestry proportion across individuals differs between models

To understand why the stationary preference and broad preference models did not produce a sustained correlation in global ancestry proportion between mates, r(xi,xj)0, over time, while the increasing preference and social group models sometimes did (Fig. 2), we next considered the variance in global ancestry proportion across individuals in generation t, σx2(t). Intuitively, nonrandom mating requires that individuals in the population vary for the trait that serves as the basis for mate choice: if all potential mates are identical for the relevant phenotype, mating pairs will form randomly regardless of the strength of mate choice bias. Thus, we hypothesized that our simulations of biased mating would become indistinguishable from random mating when σx2(t) became sufficiently small.

Consistent with prior analytic models, σx2(t) was initially greater under nonrandom relative to random mating, regardless of mate choice model (Wright 1921; Verdu and Rosenberg 2011; Zaitlen et al. 2017; Goldberg et al. 2020; Liang et al. 2024) and decayed more slowly for larger α (Supplementary Fig. 12). However, for simulations under the stationary preference and broad preference models, σx2(t) approached zero within approximately t=15 generations post-admixture under our set of parameters. This timing corresponds to when r(xi,xj)0 under these 2 models, as observed in Fig. 2. In contrast, under the increasing preference and social group models, which maintained a positive r(xi,xj) at t=20 generations, σx2(t) was greater than for random mating for any generation t (all α for the increasing preference model; α>2 for the social group model).

As predicted, higher σx2(t) corresponded to higher r(xi,xj) for all 4 models, at least for early generations post-admixture (Supplementary Fig. 13). However, this relationship was not linear and differed across both models and ranges of σx2(t). In early generations, when σx2(t) is large, r(xi,xj) scales approximately linearly with σx2(t) under all models (Fig. 3a). As the admixture process continues, however, we observed that for intermediate values of σx2(t), the same σx2(t) corresponded to a lower r(xi,xj) under the stationary preference and broad preference models relative to the other 2 models (Fig. 3, b and c). In other words, simulations under these 2 models required greater variance in global ancestry proportion across individuals in order to observe the same correlation in global ancestry proportion between mates. By t=20 generations post-admixture, σx2(t) is small, and we observed that the same σx2(t) corresponded to a higher r(xi,xj) under the increasing preference model than under the social group model (Fig. 3d). This emphasizes a key distinction between these 2 models: the social group model sustains a positive r(xi,xj) over tens of generations by maintaining higher σx2(t), whereas the increasing preference model instead compensates for very low σx2(t) by scaling the mate choice parameter c by σx2(t), increasing the strength of preference in each generation.

Fig. 3.

Fig. 3.

The relationship between variance in global ancestry proportion across individuals, σx2(t), and correlation in global ancestry between mates, r(xi,xj), differs between models and changes as σx2(t) approaches zero. a) In early generations, when σx2(t) is larger, there is a quasi-linear relationship between r(xi,xj) and σx2(t). b and c) As the value of σx2(t) decreases, differences appear between the 2 models that sustain high r(xi,xj) and the 2 model that do not. d) At generation t=20, a given value of r(xi,xj) corresponds to a lower σx2(t) under the increasing preference model relative to the social group model. In other words, the increasing preference model maintains a high r(xi,xj) at t=20 generations post-admixture despite low differentiability of potential mates. Each dot represents 1 simulation (5 replicates each for α{1,2,4,6,8,10}).

Given our broader objective of understanding mechanisms that could potentially explain the correlation in global ancestry proportion between spouses that has been reported in empirical data, in the following sections we focus on the 2 models that produce r(xi,xj)0 for tens of generations post-admixture: the increasing preference model and the social group model. Specifically, we compare pairs of simulations performed under these models that produced the same value of r(xi,xj) at t=20 generations post-admixture, noting that we did not observe evidence for any effect of global ancestry proportion on reproductive success in these simulations (Supplementary Figs. 14 and 15).

The same correlation in global ancestry proportion between mates can be explained by multiple patterns of nonrandom mating

Though commonly used for empirical genetic studies, the correlation in global ancestry proportion between mates is not particularly informative about the underlying distribution of global ancestry proportion, x, within the population. As a general principle, the admixture process ultimately results in a shift in the distribution of x over time from bimodal to unimodal, as 2 discrete source populations with x{0,1} converge to a continuous distribution of intermediate x values. This shift to a unimodal distribution occurs within a single generation under random mating; under ancestry-biased mating, we expect the shift to be delayed, as individuals with more extreme x preferentially mate with one another.

We observed that the timing of this shift is delayed even more under the social group model than under the increasing preference model, whether comparing pairs of simulations matched for mating bias strength, α (Supplementary Fig. 16) or for correlation in the global ancestry proportion, r(xi,xj). Specifically, we considered the distributions of x at t=20 generations post-admixture, approximating the timing of the onset of admixture for many human populations with African and European ancestry (Zaitlen et al. 2017; Hamid et al. 2021; Korunes et al. 2022; Mas Sandoval et al. 2023; Mooney et al. 2023). At t=20, we found a unimodal distribution of ancestry proportion for simulations under the increasing preference model for all α, whereas the social group model produced a bimodal distribution for simulations with α7. For simulations under the social group model with α<7, ancestry proportion was unimodal but with higher variance than simulations under the increasing preference model with the same correlation between mates (Supplementary Fig. 17).

Differences in the distribution of global ancestry proportion, x, within the population between the increasing preference and social group models are also reflected in the distribution of mating pairs, (xi,xj) at t=20 generations post-admixture. Under the increasing preference model, the mating pairs form a single cluster with highest density around (0.5,0.5), reflecting the original equal contributions from the 2 source populations (Fig. 4a). In contrast, for a simulation under the social group model with the same correlation in global ancestry proportion between mates, r(xi,xj), there were 2 distinct clusters of mating pairs, reflecting preferential mating within each of 2 subpopulations (Fig. 4b). These clusters were identifiable by eye when r(xi,xj)0.4 (Supplementary Fig. 18). Thus, although the same correlation in global ancestry proportion between mates can be produced by both the increasing preference and social group models, the underlying structure of nonrandom mating is not equivalent and may be distinguishable when r(xi,xj) is large.

Fig. 4.

Fig. 4.

Simulations under the increasing preference (α=5) and social group (α=7) models produced the same correlation in global ancestry proportion between mates, r(xi,xj), at t=20 generations post-admixture but with different underlying mating structure. Hexagonal bin plots represent the correlation in global ancestry proportion between the 2 parents of individuals in generation t=20. Each hexagon corresponds to a bin of 0.025 global ancestry proportion units, with color encoding the scaled density (max=1) of mating pairs in each bin. a) Under the increasing preference model, mating pairs cluster around a single bin of maximum density. b) In contrast, under the social group model, mating pairs form 2 clusters, representing preferential mating within each of 2 social groups. All 10,000 individuals are shown. A y=x line is shown for reference. c and d) The same trends are observed when considering a subsample of n=100 individuals. See Supplementary Figs. 18–20.

Given that evidence for ancestry-assortative mating is often assessed in substantially smaller empirical datasets, we also examined whether these differences in mating structure could be observed in a subsample (n=100 individuals) of our simulated population. The correlation in global ancestry proportion between mates is typically visualized as a dot-plot (e.g. Zou et al. 2015; Korunes et al. 2022); we found that the 2 models looked similar when taking this approach (Supplementary Fig. 19). However, even with only 100 individuals, hexagonal bin plots (Carr et al. 1987) suggested the presence of 2 discrete clusters of mating pairs in simulations under the social group model with high α [and thus high r(xi,xj)] (Fig. 4, c and d; Supplementary Fig. 20).

These results, along with similar results using the distribution of the absolute difference in global ancestry proportion between mate pairs, Δx=|xixj|, rather than summarizing it as a correlation coefficient (Supplementary Fig. 21), underscore that multiple mating structures can produce very similar correlation in global ancestry proportion between mates and highlight the utility of alternative visualizations for understanding the mating structure in a population of interest.

Nonrandom mating produced an excess of long local ancestry tracts relative to random mating, leading to underestimates for the timing of admixture

Thus far, we have focused on the relationship between mate choice model and the distribution of global ancestry proportion. We next considered the impact of model choice on the length distribution of local ancestry tracts, which is used directly or indirectly (e.g. using admixture linkage disequilibrium as a proxy) to infer demographic parameters, including the time since the onset of admixture (Moorjani et al. 2011; Gravel 2012; Loh et al. 2013; Hellenthal et al. 2014). Prior work has shown that nonrandom mating disrupts the decay in local ancestry tract length due to recombination, resulting in long local ancestry tracts consistent with more recent admixture than truly occurred (Zaitlen et al. 2017; Korunes et al. 2022). Thus, some methods to correct for the systematic underestimation of the time since admixture due to assortative mating attempt to recapitulate local ancestry tract length dynamics by matching the empirical correlation between spouses (Zaitlen et al. 2017).

We first compared the median local ancestry tract length at t=20 generations post-admixture in simulations under the increasing preference and social group models, matched for correlation in global ancestry proportion between mates, r(xi,xj). Differences in median local ancestry tract length were modest when comparing across α values within a single model, between these 2 mate choice models, and between both models and random mating (Fig. 5a). However, we did observe across all 4 mate choice models that larger values of α were associated with longer median local ancestry tract lengths at generation t=20 (Supplementary Fig. 22). Furthermore, under the increasing preference and social group models—the 2 models under which r(xi,xj)0 at generation t=20—greater r(xi,xj) was also associated with longer median local ancestry tract length. Additionally, for a given value of r(xi,xj), median local ancestry tract length tended to be longest under the social group model (Fig. 5b).

Fig. 5.

Fig. 5.

Local ancestry tract length distributions were similar for simulations under the increasing preference and social group models, when comparing simulations with similar correlation in global ancestry proportion between mates, r(xi,xj). a) The distribution of local ancestry tract lengths at t=20 generations post-admixture is shown for 3 representative pairs of simulations, matched for r(xi,xj). A random mating control simulation is shown for reference. Because the local ancestry tract length distribution has a very long tail, the y-axis is truncated at the top whisker (75th percentile +1.5× the interquartile range). b) Median local ancestry tract length was similar across models and values of α. However, for the same r(xi,xj), median local ancestry tract length tended to be longer for simulations under the social group model relative to those under the increasing preference model. Each dot represents 1 simulation (5 replicates each for α{1,2,4,6,8,10}).

To directly test the effects of the increasing preference and social group models on downstream inference of admixture timing, we fit an exponential to the local ancestry tract length distribution, with decay rate λ=(t+1)×m, where t represents generations postadmixture and m represents the proportion of individuals from source population 1 contributing to the founding of the admixed populations. From the fit, we then inferred that a single-pulse admixture event occurred t generations before the time of sampling [equation 1 from the study by Gravel (2012)]. We observed that the simulated distribution of local ancestry tracts had an excess of long local ancestry tracts at t=20 generations post-admixture relative to the exponential fit, particularly for simulations with greater r(xi,xj) (Fig. 6, a and b). Although slight, this mismatch in distribution shape may prove useful as an indication in empirical data that the effects of nonrandom mating should be taken into consideration, although it may also be confused for evidence of continuous migration.

Fig. 6.

Fig. 6.

Nonrandom mating led to underestimation of the time since admixture. For each simulation, an exponential model was fit to the local ancestry tract length distribution, with decay rate λ. a) For the mean local ancestry tract length, observed values (solid lines) were similar to those expected under the exponential fit (1/λ,dotted lines), indicating a good fit to the data. The dashed line at y=9.52 indicates the expected mean local ancestry tract length for the true values of t=20 and m=0.5. b) However, for the 95th percentile of local ancestry tract length, observed values (solid lines) were larger than those expected under the exponential fit [ln(10.95)/λ, dotted lines], particularly when r(xi,xj) is high. The dashed line at y=28.53 indicates the expected 95th percentile of local ancestry tract length for the true values of t=20 and m=0.5. c–e) Estimated time since admixture was similar between the increasing preference and social group models. As expected, inference was accurate under random mating and underestimated under both models of nonrandom mating, with increasing discrepancy between true and inferred values for larger r(xi,xj).

As expected based on prior empirical work (Zaitlen et al. 2017; Korunes et al. 2022), we underestimated the true time since admixture, and the discrepancy between the truth and our inferred time increased as r(xi,xj) increased (Fig. 6c). Intriguingly, we found that this discrepancy was established in the first few generations and then grew relatively slowly over time, suggesting that there might be a plateauing of the effect on longer timescales (Supplementary Fig. 23). For instance, time since admixture was underestimated by an average of 2.25 generations for t=20 and 4.01 generations for t=50 generations post-admixture. As a result, we observe a bias of similar magnitude under all 4 models at t=20 generations post-admixture, although r(xi,xj)0 under the stationary preference and broad preference models (Supplementary Fig. 24).

The effects of continuous migration on the correlation in global ancestry proportion between mates differed between models and values of α

Realistic models of human admixture likely include more complex dynamics than a single pulse of admixture followed by complete isolation from the source populations, as we have modeled above. To begin to explore the behavior of the increasing preference and social group models under more complex demographic scenarios, we examined the trajectory of the correlation in global ancestry proportion between mates, r(xi,xj), over time in a scenario with continuous migration, wherein 1% of the population in each generation was replaced with migrants from the 2 source populations. Relative to the single-pulse admixture scenario, we might expect r(xi,xj) to be smaller in the continuous migration scenario for the same generation t and mating bias strength α because new migrants have fewer potential mates with similar global ancestry proportion to choose from compared to individuals born in the admixed population. Furthermore, variance in global ancestry proportion across individuals in the admixed population, σx2(t), decreases in each successive generation (Supplementary Fig. 12), meaning that new migrants have increasingly dissimilar global ancestry proportion to the average potential mate. On the other hand, mating events between 2 migrants from the same source population will increase r(xi,xj).

We found that the balance between these countervailing effects on r(xi,xj) differed between models. Under the increasing preference model, r(xi,xj) initially decreased over time, similar to the scenario with single-pulse admixture. However, for simulations with α4, there was a subsequent increase over time, resulting in a greater r(xi,xj) at t=20 generations post-admixture in the scenario with migration than the one without migration (Fig. 7a). As noted above, the increasing preference model compensates for the decrease in σx2(t) by increasing mate selectiveness over time (Supplementary Fig. 12). Consistent with this explanation for the difference between the single-pulse admixture and continuous migration simulations, we observed that the proportion of mating events between migrants increased over time in these simulations (Fig. 7c). Additionally, under the other 2 ancestry-similarity models, which do not compensate for the decreased σx2(t) over time, we observed that the prevalence of mating events between migrants was constant and that the value of r(xi,xj) was similar with or without migration for a given generation t and α (Supplementary Figs. 25–27).

Fig. 7.

Fig. 7.

The effects of continuous migration differed between mate choice models, driven by the percent of mating events between new migrants. a and b) Correlation in global ancestry proportion between mates, r(xi,xj), increased over time in simulations under the increasing preference model with continuous migration, for some α. This behavior was not observed in any simulations without migration (see Fig. 2) or under the social group model with continuous migration. c) Under the increasing preference model with continuous migration, mating between migrants was increasingly prevalent over time and far more frequent than expected under random mating. d) Mating between migrants under the social group model with continuous migration did not occur more often than expected by chance.

In contrast, the social group model neither takes into consideration similarity in global ancestry proportion nor increases the strength of mating bias over time. Thus, for simulations under this model, new migrants are equally likely to mate with anyone within their social group regardless of global ancestry proportion; as expected, we observed a constant prevalence in mating events between migrants (Fig. 7d). Additionally, because σx2(t) decreases over time, migrants become increasingly dissimilar from potential mates. As a result, continuous migration always decreased r(xi,xj) under this model relative to the single-pulse admixture scenario, controlling for generation and α (Fig. 7c).

Discussion

While it is appreciated that humans, like individuals in many other natural populations, do not choose mates at random, population genetic theory and methods to account for assortative mating in empirical data remain largely underdeveloped. In the context of ancestry-assortative mating, extension of existing theory on how assortative mating shapes the expected distribution of a trait in a population (Wright 1921; Norris et al. 2019; Kim et al. 2021; Border et al. 2022; Muralidhar et al. 2022; Horwitz et al. 2023) is made difficult by ambiguity about what the relevant phenotype is. Furthermore, global ancestry proportion is an unusual quantitative phenotype, in that its trait value is integrated across every locus in the genome. How this genome-wide involvement might impact analyses, such as selection scans and association studies, that attempt to distinguish implicated loci from neutral loci remains unclear. Simulations of ancestry-assortative mating that accurately recapitulate key summaries of empirical data are crucial to unraveling these impacts. Here, we considered 2 related prerequisite questions: first, are there multiple mechanisms of mate choice compatible with the observed correlation in global ancestry proportion between spouses in human populations? Second, does the choice of a particular mathematical function for defining biased mating meaningfully impact the conclusions drawn from the resulting simulations? We compared 4 models, including one that considers social groups rather than quantitative similarity in global ancestry proportion as the mechanism of mate choice, to better understand how we ought to think about modeling and correcting for ancestry-assortative mating going forward.

We turn first to assumptions commonly made by statistical genetics methods (Zaitlen et al. 2017; Pfennig and Lachance 2023; Huang et al. 2024), which focus on the observed empirical correlation in global ancestry between mates: namely, that this correlation is constant over time and that parameters of interest are comparable between empirical data and simulated data with the same correlation coefficient. In our simulations, we find that r(xi,xj) is not stable over time and decays toward zero under most models (Fig. 2). Thus, assuming that bias in mating is constant over time, the correlation in global ancestry proportion between spouses observed in a contemporary sample is likely smaller than it was in previous generations. In addition, we find that multiple models can produce the same r(xi,xj) while differing in the underlying mating structure (Fig. 4) and the distribution of local ancestry tract lengths (Fig. 5). However, we find that the effects of these differences between models on estimated time since admixture are likely to be small, likely because the effects on mean local ancestry tract length are similar (Fig. 6).

Prior studies that have modeled biased mate choice to develop theory about assortative mating have either not considered how the decrease in variance in global ancestry proportion across individuals over time impacts the correlation in global ancestry proportion between mates (Kim et al. 2021; Muralidhar et al. 2022) or explicitly modeled a constant correlation coefficient in the face of decreasing variance (Huang et al. 2024). Our work highlights that variance in global ancestry proportion across individuals plays an essential role in determining whether a positive correlation can be observed and, furthermore, whether mating is effectively random (Fig. 3; Supplementary Fig. 12). Prior theory has also prioritized the role of individual mate choice preference for similarity in global ancestry proportion (|xixj|). In humans particularly, there is a wide array of social science research to support the role of sociological factors in mating outcomes. To wit, we found that a simplistic model imposing a barrier to mating between 2 social groups was sufficient to generate signatures of ancestry-assortative mating that could be observed for 20–50 generations post-admixture. To our knowledge, this type of model has not been used before to model ancestry-assortative mating in admixed populations but may be representative of how social categories like race, ethnicity, and socioeconomic status influence spouse choice.

The 4 models that we consider in the present study are not the only possible models but rather represent 2 classes of model worthy of further theoretical exploration: ancestry-similarity and social group models. For ancestry-similarity models, we have demonstrated that not all variants of this model type can sustain sufficiently high levels of variance in global ancestry proportion across individuals to continue to observe a positive correlation in global ancestry proportion between mates (at least, without additional in-migration events). The increasing preference model presents a mathematically simple strategy to compensate for this decay in σx2(t): increasing the choosiness of individuals. While this is effective in maintaining a high r(xi,xj), it is likely unrealistic in practice, making individuals too attuned to small differences between mates (Supplementary Fig. 2b). This overcompensation is heightened in the continuous migration scenario, leading to unexpected (and undesirable) behavior (Fig. 7). Future development of ancestry-similarity models should consider alternative approaches to tune the selectiveness of individuals over time. For instance, we might want to model a constant degree of bias (i.e.max(ψi,j)min(ψi,j)) over time.

In including the social group model in this study, we aimed only to demonstrate that categorical social barriers can mediate ancestry-assortative mating, even when individuals do not directly use ancestry information to make mating decisions. To that end, we designed a proof-of-concept version of this model class, which is likely overly simplistic for drawing conclusions about real-world human populations. Future development of this class could include greater social complexity, for instance, more than 2 social groups, alternative rules for how individuals “inherit” their social group membership, and asymmetric barriers between groups. Each of these added layers could be implemented in different ways. For example, in a simulation with more than 2 groups, barriers might be more permeable between some pairs of groups than others.

A growing body of research suggests that patterns of assortative mating are not stable over time (e.g. Mare 1991; Sunde et al. 2024). Under an ancestry-similarity model, this could reflect changes in preference over time, potentially to account for decreasing differences between potential mates (as discussed above). Under a social group model, this could reflect changes in social mobility or acceptance of intergroup mating. Thus, modeling this type of change over time would be an interesting future direction for both classes of model, although it remains outside the scope of the present work.

Here, we focus exclusively on biased mate choice as a mechanism for generating a positive correlation in global ancestry proportion between mates, comparing potential models for implementing that mechanism. Future work should also consider other mechanisms that require non-Wright–Fisher frameworks to incorporate additional parameters (e.g. birth and death rates, dispersal rates, etc.). For instance, in real-world populations, geographic structure (i.e. isolation by distance over continuous space) is also a major driver of nonrandom mating with respect to ancestry. It remains an open question whether geography alone could produce patterns of assortative mating that resemble empirical observations, as well as how geography might interact with social barriers to shape the opportunities for individuals to encounter one another and act on biased mate preferences.

Taken together, our results emphasize that theory incorporating nonrandom mating must carefully consider how mate choice is conceptualized and modeled. While our focus is primarily on how to better model ancestry-assortative mating, we also make a few recommendations for future empirical studies of ancestry-assortative mating in humans. First, observing a correlation in global ancestry proportion between spouses can reflect multiple mechanisms of mate choice and should not be interpreted as unequivocal evidence of a preference for mates with similar global ancestry proportion. Second, the magnitude of the correlation coefficient is limited by variance in global ancestry proportion across individuals. Thus, a positive correlation in global ancestry proportion in a real-world population should be interpreted as specific to the time point sampled. Third, there are multiple mating structures that can give rise to the same Pearson correlation coefficient, and visualization of these patterns (e.g. with a hexagonal bin plot or hurricane plot of Δx) may help to disambiguate these possibilities.

As with any simulation study, our results are limited by what we have elected to model or not model. We highlight 2 important caveats. First, we do not model genetic variation within and between source populations. Empirical analyses rely on these data to first infer global and local ancestry, a process that is sensitive to how researchers define discrete source populations from continuous human genetic variation. Having bypassed this step, we cannot comment on how estimation error might interact with our results. Second, we have focused on a single pulse of admixture, turning to a continuous migration scenario only to highlight an unexpected result from the increasing preference model. More realistic models of human populations almost certainly involve complex migration dynamics, including multiple pulses of migration, differing contributions from source populations, changes in population size, asymmetries in mate preferences and in the ability of individuals to enact their mate choices, monogamous mating, and multiway admixture. Inclusion of these additional factors into future models is likely to impact the results. However, each additional factor drastically expands the range of possible implementations and space of parameter values. As such, these more complex models are likely to be most useful when targeted to matching the parameters of a population of interest whose history is well understood and less well suited to a general exploration of parameter space.

Supplementary Material

iyaf022_Supplementary_Data
iyaf022_Peer_Review_History

Acknowledgments

We thank Joshua G. Schraiber (ORCiD: 0000-0002-7912-2195), Michael M. Hoffman (ORCiD: 0000-0002-4517-1562), John Barton (ORCiD: 0000-0003-1467-421X), Shyamalika Gopalan (ORCiD: 0000-0002-2608-8472), and members of the Goldberg lab for helpful discussions.

Contributor Information

Dashiell J Massey, Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA.

Zachary A Szpiech, Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA.

Amy Goldberg, Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA.

Data availability

All scripts used for simulations, analyses, and figures are available on GitHub at: https://github.com/agoldberglab/ancestry-assortative-mating-simulation.

Supplemental material available at GENETICS online.

Funding

This work was supported by the National Institute of General Medical Sciences (R35 GM133481 to AG and R35 GM146926 to ZAS).

Literature cited

  1. Arauna LR, Bergstedt J, Choin J, Mendoza-Revilla J, Harmant C, Roux M, Mas-Sandoval A, Lémée L, Colleran H, François Aet al. 2022. The genomic landscape of contemporary western remote Oceanians. Curr Biol 32(21):4565–4575. e4566. 10.1016/j.cub.2022.08.055 [DOI] [PubMed] [Google Scholar]
  2. Avadhanam S, Williams AL. 2022. Simultaneous inference of parental admixture proportions and admixture times from unphased local ancestry calls. Am J Hum Genet. 109(8):1405–1420. 10.1016/j.ajhg.2022.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Beleza S, Campos J, Lopes J, Araujo II, Hoppfer Almada A, Correia e Silva A, Parra EJ, Rocha J. 2012. The admixture structure and genetic variation of the archipelago of Cape Verde and its implications for admixture mapping studies. PLoS One. 7(11):e51103. 10.1371/journal.pone.0051103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Blossfeld H-P. 2009. Educational assortative marriage in comparative perspective. Annu Rev Sociol. 35(1):513–530. 10.1146/annurev-soc-070308-115913. [DOI] [Google Scholar]
  5. Bonilla C, Shriver MD, Parra EJ, Jones A, Fernandez JR. 2004. Ancestral proportions and their association with skin pigmentation and bone mineral density in Puerto Rican women from New York City. Hum Genet. 115(1):57–68. 10.1007/s00439-004-1125-7 [DOI] [PubMed] [Google Scholar]
  6. Border R, Athanasiadis G, Buil A, Schork AJ, Cai N, Young AI, Werge T, Flint J, Kendler KS, Sankararaman S, et al. 2022. Cross-trait assortative mating is widespread and inflates genetic correlation estimates. Science. 378(6621):754–761. 10.1126/science.abo2059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Browning JR. 1951. Anti-miscegenation laws in the United States. Duke Bar J. 1(1):26–41. 10.2307/1370809. [DOI] [Google Scholar]
  8. Burger R, Schneider KA. 2006. Intraspecific competitive divergence and convergence under assortative mating. Am Nat. 167(2):190–205. 10.1086/499375. [DOI] [PubMed] [Google Scholar]
  9. Burley N. 1983. The meaning of assortative mating. Ethol Sociobiol. 4(4):191–203. 10.1016/0162-3095(83)90009-2. [DOI] [Google Scholar]
  10. Buss DM, Barnes M. 1986. Preferences in human mate selection. J Pers Soc Psychol. 50(3):559–570. 10.1037/0022-3514.50.3.559. [DOI] [Google Scholar]
  11. Carr DB, Littlefield RJ, Nicholson WL, Littlefield JS. 1987Scatterplot matrix techniques for large n. J Am Stat Assoc. 82:424–436. 10.2307/2289444 [DOI] [Google Scholar]
  12. Chiappori P-A. 2020. The theory and empirics of the marriage market. Annu Rev Econom. 12(1):547–578. 10.1146/annurev-economics-012320-121610. [DOI] [Google Scholar]
  13. De La Mare JK, Lee AJ. 2023. Assortative preferences for personality and online dating apps: individuals prefer profiles similar to themselves on agreeableness, openness, and extraversion. Pers Individ Dif. 208:112185. 10.1016/j.paid.2023.112185. [DOI] [Google Scholar]
  14. Dieckmann U, Doebeli M. 1999. On the origin of species by sympatric speciation. Nature. 400(6742):354–357. 10.1038/22521. [DOI] [PubMed] [Google Scholar]
  15. Domingue BW, Fletcher J, Conley D, Boardman JD. 2014. Genetic and educational assortative mating among US adults. Proc Natl Acad Sci U S A. 111(22):7996–8000. 10.1073/pnas.1321426111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fibla J, Maceda I, Laplana M, Guerrero M, Alvarez MM, Burgueño J, Camps A, Fàbrega J, Felisart J, Grané J, et al. 2022. The power of geohistorical boundaries for modeling the genetic background of human populations: the case of the rural Catalan Pyrenees. Front Genet. 13:1100440. 10.3389/fgene.2022.1100440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fisher RA. 1918. The correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edinburgh. 52(2):399–433. 10.1017/S0080456800012163. [DOI] [Google Scholar]
  18. Fogel AS, McLean EM, Gordon JB, Archie EA, Tung J, Alberts SC. 2021. Genetic ancestry predicts male-female affiliation in a natural baboon hybrid zone. Anim Behav. 180:249–268. 10.1016/j.anbehav.2021.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Funk ER, Mason NA, Palsson S, Albrecht T, Johnson JA, Taylor SA. 2021. A supergene underlies linked variation in color and morphology in a holarctic songbird. Nat Commun. 12(1):6833. 10.1038/s41467-021-27173-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Goldberg A, Rastogi A, Rosenberg NA. 2020. Assortative mating by population of origin in a mechanistic model of admixture. Theor Popul Biol. 134:129–146. 10.1016/j.tpb.2020.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gravel S. 2012. Population genetics models of local ancestry. Genetics. 191(2):607–619. 10.1534/genetics.112.139808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Greenwood J, Guner N, Kocharkov G, Santos C. 2014. Marry your like: assortative mating and income inequality. Am Econ Rev. 104(5):348–353. 10.1257/aer.104.5.348. [DOI] [Google Scholar]
  23. Haller BC, Galloway J, Kelleher J, Messer PW, Ralph PL. 2019. Tree-sequence recording in SLiM opens new horizons for forward-time simulation of whole genomes. Mol Ecol Resour. 19(2):552–566. 10.1111/1755-0998.12968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Haller BC, Messer PW. 2023. SLiM 4: multispecies eco-evolutionary modeling. Am Nat. 201(5):E127–E139. 10.1086/723601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hamid I, Korunes KL, Beleza S, Goldberg A. 2021. Rapid adaptation to malaria facilitated by admixture in the human population of Cabo Verde. eLife. 10:e63177. 10.7554/eLife.63177 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hellenthal G, Busby GBJ, Band G, Wilson JF, Capelli C, Falush D, Myers S. 2014. A genetic atlas of human admixture history. Science. 343(6172):747–751. 10.1126/science.1243518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Henz U, Mills C. 2017. Social class origin and assortative mating in Britain, 1949–2010. Sociology. 52(6):1217–1236. 10.1177/0038038517726479. [DOI] [Google Scholar]
  28. Horwitz TB, Balbona JV, Paulich KN, Keller MC. 2023. Evidence of correlations between human partners based on systematic reviews and meta-analyses of 22 traits and UK Biobank analysis of 133 traits. Nat Hum Behav. 7(9):1568–1583. 10.1038/s41562-023-01672-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Howe LJ, Battram T, Morris TT, Hartwig FP, Hemani G, Davies NM, Smith GD. 2021. Assortative mating and within-spouse pair comparisons. PLoS Genet. 17(11):e1009883. 10.1371/journal.pgen.1009883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Huang J, Kleman N, Basu S, Shriver MD, Zaidi AA. Interpreting SNP heritability in admixed populations. bioRxiv 551959, 10.1101/2023.08.04.551959, 4 August 2024, preprint: not peer reviewed. [DOI]
  31. Jiang Y, Bolnick DI, Kirkpatrick M. 2013. Assortative mating in animals. Am Nat. 181(6):E125–E138. 10.1086/670160. [DOI] [PubMed] [Google Scholar]
  32. Kalmijn M. 1998. Intermarriage and homogamy: causes, patterns, trends. Annu Rev Sociol. 24(1):395–421. 10.1146/annurev.soc.24.1.395. [DOI] [PubMed] [Google Scholar]
  33. Keddy-Hector AC. 1992. Mate choice in non-human primates. Am Zool. 32(1):62–70. 10.1093/icb/32.1.62. [DOI] [Google Scholar]
  34. Kim J, Edge MD, Goldberg A, Rosenberg NA. 2021. Skin deep: the decoupling of genetic admixture levels from phenotypes that differed between source populations. Am J Phys Anthropol. 175(2):406–421. 10.1002/ajpa.24261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kirkpatrick M. 1982. Sexual selection and the evolution of female choice. Evolution. 36(1):1–12. 10.2307/2407961. [DOI] [PubMed] [Google Scholar]
  36. Klinkova E, Hodges JK, Fuhrmann K, de Jong T, Heistermann M. 2005. Male dominance rank, female mate choice and male mating and reproductive success in captive chimpanzees. Folia Primatol (Basel). 26(2):357–484. 10.1007/s10764-005-2929-6. [DOI] [Google Scholar]
  37. Kondrashov AS. 1983. Multilocus model of sympatric speciation I. One character. Theor Popul Biol. 24(2):121–135. 10.1016/0040-5809(83)90036-9. [DOI] [PubMed] [Google Scholar]
  38. Kopp M, Servedio MR, Mendelson TC, Safran RJ, Rodriguez RL, Rodríguez RL, Hauber ME, Scordato EC, Symes LB, Balakrishnan CNet al. 2018. Mechanisms of assortative mating in speciation with gene flow: connecting theory and empirical research. Am Nat. 191:1–20. 10.1086/694889 [DOI] [PubMed] [Google Scholar]
  39. Korunes KL, Soares-Souza GB, Bobrek K, Tang H, Araujo II, Goldberg A, Beleza S. 2022. Sex-biased admixture and assortative mating shape genetic variation and influence demographic inference in admixed Cabo Verdeans. G3 (Bethesda). 12(10):jkac183. 10.1093/g3journal/jkac183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lande R. 1981. Models of speciation by sexual selection on polygenic traits. Proc Natl Acad Sci U S A. 78(6):3721–3725. 10.1073/pnas.78.6.3721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Liang M, Shishkin M, Shchur V, Nielsen R. 2024. Understanding admixture fractions: theory and estimation of gene-flow. J Math Biol. 89(5):47. 10.1007/s00285-024-02146-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Loh PR, Lipson M, Patterson N, Moorjani P, Pickrell JK, Reich D, Berger B. 2013. Inferring admixture histories of human populations using linkage disequilibrium. Genetics. 193(4):1233–1254. 10.1534/genetics.112.147330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Luo S, Klohnen EC. 2005. Assortative mating and marital quality in newlyweds: a couple-centered approach. J Pers Soc Psychol. 88(2):304–326. 10.1037/0022-3514.88.2.304. [DOI] [PubMed] [Google Scholar]
  44. Mare RD. 1991. Five decades of educational assortative mating. Am Sociol Rev. 56(1):15–32. 10.2307/2095670. [DOI] [Google Scholar]
  45. Mas Sandoval A, Mathieson S, Fumagalli M. 2023. The genomic footprint of social stratification in admixing American populations. eLife. 12:2022.2011.2016.516754. 10.7554/eLife.84429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Melo MC, Salazar C, Jiggins CD, Linares M. 2009. Assortative mating preferences among hybrids offers a route to hybrid speciation. Evolution. 63(6):1660–1665. 10.1111/j.1558-5646.2009.00633.x. [DOI] [PubMed] [Google Scholar]
  47. Mooney JA, Agranat-Tamir L, Pritchard JK, Rosenberg NA. 2023. On the number of genealogical ancestors tracing to the source groups of an admixed population. Genetics. 224(3):iyad079. 10.1093/genetics/iyad079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Moorjani P, Patterson N, Hirschhorn JN, Keinan A, Hao L, Atzmon G, Burns E, Ostrer H, Price AL, Reich D, et al. 2011. The history of African gene flow into Southern Europeans, Levantines, and Jews. PLoS Genet. 7(4):e1001373. 10.1371/journal.pgen.1001373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Muralidhar P, Coop G, Veller C. 2022. Assortative mating enhances postzygotic barriers to gene flow via ancestry bundling. Proc Natl Acad Sci U S A. 119(30):e2122179119. 10.1073/pnas.2122179119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Nagoshi CT, Johnson RC, Danko GP. 1990. Assortative mating for cultural identification as indicated by language use. Behav Genet. 20(1):23–31. 10.1007/BF01070737. [DOI] [PubMed] [Google Scholar]
  51. Nagylaki T. 1978. The correlation between relatives with assortative mating. Ann Hum Genet. 42(1):131–137. 10.1111/j.1469-1809.1978.tb00938.x. [DOI] [PubMed] [Google Scholar]
  52. Natola L, Seneviratne SS, Irwin D. 2022. Population genomics of an emergent tri-species hybrid zone. Mol Ecol. 31(20):5356–5367. 10.1111/mec.16650. [DOI] [PubMed] [Google Scholar]
  53. Norris ET, Rishishwar L, Wang L, Conley AB, Chande AT, Dabrowski AM, Valderrama-Aguirre A, Jordan IK. 2019. Assortative mating on ancestry-variant traits in admixed Latin American populations. Front Genet. 10:359. 10.3389/fgene.2019.00359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Otto SP, Servedio MR, Nuismer SL. 2008. Frequency-dependent selection and the evolution of assortative mating. Genetics. 179(4):2091–2112. 10.1534/genetics.107.084418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Parra FC, Amado RC, Lambertucci JR, Rocha J, Antunes CM, Pena SDJ. 2003. Color and genomic ancestry in Brazilians. Proc Natl Acad Sci U S A. 100(1):177–182. 10.1073/pnas.0126614100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Pennings PS, Kopp M, Meszena G, Dieckmann U, Hermisson J. 2008. An analytically tractable model for competitive speciation. Am Nat. 171(1):E44–E71. 10.1086/523952. [DOI] [PubMed] [Google Scholar]
  57. Pfennig A, Lachance J. 2023. Challenges of accurately estimating sex-biased admixture from X chromosomal and autosomal ancestry proportions. Am J Hum Genet. 110(2):359–367. 10.1016/j.ajhg.2022.12.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Powell DL, Moran BM, Kim BY, Banerjee SM, Aguillon SM, Aguillon SM, Fascinetto-Zago P, Langdon QK, Schumer M. 2021. Two new hybrid populations expand the swordtail hybridization model system. Evolution. 75(10):2524–2539. 10.1111/evo.14337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Rettelbach A, Kopp M, Dieckmann U, Hermisson J. 2013. Three modes of adaptive speciation in spatially structured populations. Am Nat. 182(6):E215–E234. 10.1086/673488. [DOI] [PubMed] [Google Scholar]
  60. Risch N, Choudhry S, Via M, Basu A, Sebro R, Eng C, Beckman K, Thyne S, Chapela R, Rodriguez-Santana JR, et al. 2009. Ancestry-related assortative mating in Latino populations. Genome Biol. 10(11):R132. 10.1186/gb-2009-10-11-r132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Robinson MR, Kleinman A, Graff M, Vinkhuyzen AAE, Couper D, Miller MB, Peyrot WJ, Abdellaoui A, Zietsch BP, Nolte IM, et al. 2017. Genetic evidence of assortative mating in humans. Nat Hum Behav. 1(1):0016. 10.1038/s41562-016-0016. [DOI] [Google Scholar]
  62. Robinson CE, Thyagarajan H, Chippindale AK. 2023. Evolution of reproductive isolation in a long-term evolution experiment with Drosophila melanogaster: 30 years of divergent life-history selection. Evolution. 77(8):1756–1768. 10.1093/evolut/qpad098. [DOI] [PubMed] [Google Scholar]
  63. Schumer M, Powell DL, Delclos PJ, Squire M, Cui R, Andolfatto P, Rosenthal GG. 2017. Assortative mating and persistent reproductive isolation in hybrids. Proc Natl Acad Sci U S A. 114(41):10936–10941. 10.1073/pnas.1711238114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Schwartz CR. 2013. Trends and variation in assortative mating: causes and consequences. Annu Rev Sociol. 39(1):451–470. 10.1146/annurev-soc-071312-145544. [DOI] [Google Scholar]
  65. Sebro R, Hoffman TJ, Lange C, Rogus JJ, Risch NJ. 2010. Testing for non-random mating: evidence for ancestry-related assortative mating in the Framingham Heart Study. Genet Epidemiol. 34(7):674–679. 10.1002/gepi.20528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Sebro R, Peloso GM, Dupuis J, Risch NJ. 2017. Structured mating: patterns and implications. PLoS Genet. 13(4):e1006655. 10.1371/journal.pgen.1006655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Sebro R, Risch NJ. 2012. A brief note on the resemblance between relatives in the presence of population stratification. Heredity (Edinb). 108(5):563–568. 10.1038/hdy.2011.124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Seger J. 1985. Unifying genetic models for the evolution of female choice. Evolution. 39(6):1185–1193. 10.2307/2408776. [DOI] [PubMed] [Google Scholar]
  69. Setchell JM, Wickings EJ. 2006. Mate choice in male mandrills (Mandrillus sphinx). Ethology. 112(1):91–99. 10.1111/j.1439-0310.2006.01128.x. [DOI] [Google Scholar]
  70. Shriver MD, Parra EJ, Dios S, Bonilla C, Norton H, Jovel C, Pfaff C, Jones C, Massac A, Cameron N, et al. 2003. Skin pigmentation, biogeographical ancestry and admixture mapping. Hum Genet. 112(4):387–399. 10.1007/s00439-002-0896-y. [DOI] [PubMed] [Google Scholar]
  71. Smadja CM, Loire E, Caminade P, Severac D, Gautier M, Ganem G. 2022. Divergence of olfactory receptors associated with the evolution of assortative mating and reproductive isolation in mice. Peer Community J. 2. 10.24072/pcjournal.190 [DOI] [Google Scholar]
  72. Smieja M, Stolarski M. 2018. Assortative mating for emotional intelligence. Curr Psychol. 37(1):180–187. 10.1007/s12144-016-9501-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Spear ML, Diaz-Papkovich A, Ziv E, Yracheta JM, Gravel S, Torgerson DG, Hernandez RD. 2020. Recent shifts in the genomic ancestry of Mexican Americans may alter the genetic architecture of biomedical traits. eLife. 9:e56029. 10.7554/eLife.56029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Sunde HF, Eftedal NH, Cheesman R, Corfield EC, Kleppesto TH, Seierstad AC, Ystrom E, Eilertsen EM, Torvik FA. 2024. Genetic similarity between relatives provides evidence on the presence and history of assortative mating. Nat Commun. 15(1):2641. 10.1038/s41467-024-46939-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Tenesa A, Rawlik K, Navarro P, Canela-Xandri O. 2016. Genetic determination of height-mediated mate choice. Genome Biol. 16(1):269. 10.1186/s13059-015-0833-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Torche F. 2010. Educational assortative mating and economic inequality: a comparative analysis of three Latin American countries. Demography. 47(2):481–502. 10.1353/dem.0.0109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Tung J, Charpentier MJ, Mukherjee S, Altmann J, Alberts SC. 2012. Genetic effects on mating success and partner choice in a social mammal. Am Nat. 180(1):113–129. 10.1086/665993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Van Belle S, Estrada A, Ziegler TE, Strier KB. 2009. Sexual behavior across ovarian cycles in wild black howler monkeys (Alouatta pigra): male mate guarding and female mate choice. Am J Primatol. 71(2):153–164. 10.1002/ajp.20635. [DOI] [PubMed] [Google Scholar]
  79. Veller C, Coop GM. 2024. Interpreting population- and family-based genome-wide association studies in the presence of confounding. PLoS Biol. 22(4):e3002511. 10.1371/journal.pbio.3002511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Verdu P, Rosenberg NA. 2011. A general mechanistic model for admixture histories of hybrid populations. Genetics. 189(4):1413–1426. 10.1534/genetics.111.132787. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Woodman JP, Cole EF, Firth JA, Perrins CM, Sheldon BC. 2023. Disentangling the causes of age-assortative mating in bird populations with contrasting life-history strategies. J Anim Ecol. 92(5):979–990. 10.1111/1365-2656.13851. [DOI] [PubMed] [Google Scholar]
  82. Wright S. 1921. Systems of mating. III. Assortative mating based on somatic resemblance. Genetics. 6(2):144–161. 10.1093/genetics/6.2.144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Wright S. 1950. Genetical structure of populations. Nature. 166(4215):247–249. 10.1038/166247a0. [DOI] [PubMed] [Google Scholar]
  84. Xie Y, Cheng S, Zhou X. 2015. Assortative mating without assortative preference. Proc Natl Acad Sci U S A. 112(19):5974–5978. 10.1073/pnas.1504811112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Yamamoto K, Sonehara K, Namba S, Konuma T, Masuko H, Miyawaki S, Kamatani Y, Hizawa N, Ozono K, Yengo L, et al. 2023. Genetic footprints of assortative mating in the Japanese population. Nat Hum Behav. 7(1):65–73. 10.1038/s41562-022-01438-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Zaitlen N, Huntsman S, Hu D, Spear M, Eng C, Oh SS, White MJ, Mak A, Davis A, Meade K, et al. 2017. The effects of migration and assortative mating on admixture linkage disequilibrium. Genetics. 205(1):375–383. 10.1534/genetics.116.192138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Zou JY, Park DS, Burchard EG, Torgerson DG, Pino-Yanes M, Song YS, Sankararaman S, Halperin E, Zaitlen N. 2015. Genetic and socioeconomic study of mate choice in Latinos reveals novel assortment patterns. Proc Natl Acad Sci U S A. 112(44):13621–13626. 10.1073/pnas.1501741112. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

iyaf022_Supplementary_Data
iyaf022_Peer_Review_History

Data Availability Statement

All scripts used for simulations, analyses, and figures are available on GitHub at: https://github.com/agoldberglab/ancestry-assortative-mating-simulation.

Supplemental material available at GENETICS online.


Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES