Skip to main content
Proceedings of the Royal Society B: Biological Sciences logoLink to Proceedings of the Royal Society B: Biological Sciences
. 2019 Mar 20;286(1899):20182664. doi: 10.1098/rspb.2018.2664

Genomic evidence for MHC disassortative mating in humans

Claire Dandine-Roulland 1,, Romain Laurent 1, Irene Dall'Ara 1, Bruno Toupance 1, Raphaëlle Chaix 1,
PMCID: PMC6452061  PMID: 30890093

Abstract

Although pervasive in many animal species, the evidence for major histocompatibility complex (MHC) disassortative mating in humans remains inconsistent across studies. Here, to revisit this issue, we analyse dense genotype data for 883 European and Middle Eastern couples. To distinguish MHC-specific effects from socio-cultural confounders, the pattern of relatedness between spouses in the MHC region is compared to the rest of the genome. Couples from Israel exhibit no significant pattern of relatedness across the MHC region, whereas across the genome, they are more similar than random pairs of individuals, which may reflect social homogamy and/or cousin marriages. On the other hand, couples from The Netherlands and more generally from Northern Europe are significantly more MHC-dissimilar than random pairs of individuals, and this pattern of dissimilarity is extreme when compared with the rest of the genome. Our findings support the hypothesis that the MHC influences mate choice in humans in a context-dependent way: MHC-driven preferences may exist in all populations but, in some populations, social constraints over mate choice may reduce the ability of individuals to rely on such biological cues when choosing their mates.

Keywords: mate choice, MHC, sexual selection, GAIN-ADHD

1. Background

High-density genomic data have recently become a useful source of information regarding the factors influencing mate choice in human populations. In particular, assortative mating signals with respect to ancestry [1,2], social status [3,4] and phenotypic traits [5] have been reported. In addition, inbreeding levels are contrasted among human populations [68], probably reflecting variable frequencies of consanguineous unions in these populations [9]. Non-random mating processes with respect to biological or socio-cultural factors are attracting growing interest among geneticists because they may generate cryptic genetic structures [10,11], potentially biasing kinship and heritability estimation [12] and generating false positives when performing association studies [1317].

One of the natural candidates when looking for non-random mating processes is the major histocompatibility complex (MHC), an ancient multigene family playing a fundamental role in immunity. MHC genes code for receptors located on the surface of a variety of immune and non-immune cells. Their role is to bind self and non-self antigens and to present them to T cells, thus initiating immune responses when antigens from intracellular and extracellular pathogens or malignant cells are recognized [1821]. Antigen presentation is a codominant trait and heterozygotes present all antigens displayed by both homozygotes, which potentially increases their resistance to pathogens [18,22].

The MHC is known for its extremely high level of polymorphism [21,23], with over 9000 class I and 2500 class II alleles in human species [24,25]. Pathogen-driven balancing selection has probably contributed to such polymorphism, as suggested by the greater MHC polymorphism in pathogen-rich regions at a worldwide scale [26]. MHC non-random mating has been proposed as a complementary mechanism potentially promoting heterozygosity in the offspring and contributing to the maintenance of MHC polymorphism at the species level [18,19]. Indeed, although the mechanisms through which MHC influences odour and mate choice remain disputed [27], a recent meta-analysis, summarizing decades of research on vertebrate species, and in particular, in mammals, birds, fishes and reptiles, supports a female preference for MHC-diverse males and a preference for MHC dissimilarity when dissimilarity is characterized across multiple loci [28]. Preference for MHC-diverse males could provide direct benefits to females (healthier mates), as well as indirect genetic benefits to the offspring (rare beneficial alleles are more likely to be carried by MHC-diverse individuals and to be transmitted to the next generation). Preference for MHC dissimilarity could provide indirect genetic benefits to the offspring through either optimization of their immune response, or avoidance of close inbreeding if MHC functions as a kin recognition system. Selection for specific MHC alleles may also exist (for example, see [29]).

However, the situation in humans and non-human primates is more controversial [3032]. A trend for female preference for MHC diversity was reported for non-human primates, and this effect was statistically significant in humans [31]. However, no consensus was reached regarding the preference for MHC dissimilarity, with reports of preference for MHC dissimilarity, MHC similarity or no preference at all [31,32]. Such inconsistencies may result from variable study designs throughout this field of research: some studies investigated mating preferences (focusing on odour or facial cues) while other looked at mating outcomes. An additional confounding factor is the level of ethnic heterogeneity in the population sample: signals of MHC similarity preference are overrepresented in studies with ethnically heterogeneous population samples [31]. In addition, MHC non-random mating in humans may be particularly context-dependent and may operate only under certain socio-cultural contexts. In particular, social homogamy with respect to ethnicity, language, religion, socio-economic status [3,4,33], as well as alliance rules (such as cousin marriages, whose prevalence varies across populations [9,34]) may reduce the ability of individuals to rely on biological cues when choosing their mates. Consequently, we do not expect MHC and other biological factors to have the same weight on mate choice in all human populations.

The availability of genomic data, especially when spousal data have been collected, offers an unprecedented opportunity to investigate such context dependency by disentangling biological from socio-cultural factors acting on mate choice. Notably, while socio-cultural factors shape the diversity of every portion of the genome, making spouses appear more similar or dissimilar than expected by chance, non-random mate choice with respect to the MHC will only target and shape the diversity of the MHC region. Taking advantage of this rationale, Chaix et al. previously developed a methodological frame to search for MHC non-random mating signals using an outlier approach [35,36]: indeed, under the hypothesis of MHC disassortative mating, we expect the MHC dissimilarity between spouses to be extreme in comparison to random pairs of individuals, as well as extreme in comparison to the rest of the genome (which can be considered representative of the null expectation of spouse relatedness, taking into account socio-cultural factors). Using such a methodological framework, Chaix et al. previously detected a signature of MHC disassortative mating in a sample of European American origin [35]. However, this signature has been challenged by a study claiming that such signal of MHC non-random mating was driven by outliers, was lost when minor modifications were made to the analysis procedure and that opposite-sex non-spouse pairs showed a dissimilarity signal as strong as spouses [37]. In addition, this study reported that the signal of MHC non-random mating was not found when using a more recent version of the data comprising a larger number of individuals but fewer genetic markers [37]. Laurent & Chaix disagreed with these criticisms [38,39], but this debate highlighted the need for larger datasets to explore more thoroughly the occurrence of MHC non-random mating in human populations. A recent analysis of 872 European American couples concluded for an absence of significant MHC disassortative mating [40]. However, while most studies [35,37,38] used the classical definition of the MHC region spanning 3.6 Mb [23], this study focused on a less stringent definition of the MHC region spanning over 4.9 Mb [40], a choice that may have diluted the signal of non-random mating (a non-significant trend for MHC dissimilarity between spouses was reported for this extended definition of the MHC). Here, we propose to reexamine the hypothesis of MHC disassortative mating in another large sample, the GAIN-ADHD dataset [4144] which includes 883 European and Middle Eastern couples. We use the same methodological framework as the one Chaix et al. originally proposed [35], but we also include the changes in the methodological procedure proposed by Derti et al. [37].

2. Material and methods

(a). GAIN-ADHD dataset

GAIN-ADHD is a genome-wide association study for attention deficit with hyperactivity disorder (ADHD) based on parent–child trios from the International Multisite ADHD Genetics (IMAGE) Project sampled in seven countries: Belgium, Germany, Ireland, The Netherlands, the UK, Spain and Israel. In Germany, The Netherlands, the UK and Israel, participants were recruited from two centres. More details about recruitment and clinical description of samples are given in the papers describing this dataset [4144]. This dataset has been cleaned and quality controlled by the National Center for Biotechnology Information (NCBI) using the GAIN QA/QC software package (version 0.7.4) developed by G. Abecasis and S. Gopalakrishnan at the University of Michigan, as described by Lasky-Su et al. [43]. The cleaned dataset contains 2732 individuals (including 895 couples) and 430 055 autosomal SNPs. Among them, 1327 SNPs were located in the MHC region (positions 29 700 000–33 300 000 on chromosome 6, in build 36.2 coordinates). This definition of the MHC is based on the classical definition of the MHC provided by Horton et al. [23], starting after the MOG gene and ending before the COL11A2 gene (when reading the short arm of the chromosome 6 from telomere to centromere), and encompassing the classical class I, II and III subregions.

(b). Preliminary analyses

We excluded all children from this dataset. In addition, we removed 8418 SNPs with minor allele frequency below 5%. Because our methodology assumes that observations are independent of each other, we removed redundant SNPs by applying an LD (linkage disequilibrium) pruning procedure retaining a subset of SNPs in low LD (r2 < 0.4), using the R packages gaston [45] and GPop [46]. However, similar results were found when performing our relatedness analyses on a dataset without LD pruning (results not shown). Furthermore, in order to remove closely related individuals, we computed the genetic correlation between standardized genotypes (i.e. genotypes standardized to have mean zero and variance 1) for all pairs of individuals [47], thus identifying pairs of individuals with genetic relatedness higher than 1/8 (which is the expected relatedness between first cousins). Then, we removed for each of these pairs, one of the two individuals. In total, nine individuals were removed (six individuals from Israel, one from the UK and two from The Netherlands). Finally, we performed a principal component analysis (PCA) and we further removed six additional individuals recruited in the UK that clustered with Israel samples. The dataset ready for our relatedness analyses comprises 1798 unrelated individuals (including 883 couples and 32 single individuals) and 205 019 SNPs, of which 421 SNPs were within the MHC region. Mean heterozygosity computed over these 421 MHC SNPs for each population sample are provided in electronic supplementary material, table S1.

(c). Relatedness analyses

We estimated the genetic relatedness between spouses using the Rousset relatedness coefficient [48] between spouses i and j, defined as

Rij=QijQm1Qm, 2.1

where Qij is the proportion of variants which are identical by state between the two spouses and Qm is the mean proportion of variants which are identical by state on average in the population (computed by comparing all possible pairs of individuals in each population sample). At a given SNP, the proportion of identical variants for a given pair of individuals was set to 0 if both individuals were homozygous and carrying a different allele (e.g. 00 and 11), 1 if both individuals were homozygous and carrying the same allele (e.g. 00 and 00), and 0.5 in all others cases. We checked that setting this proportion to 1 (instead of 0.5) for pairs of heterozygous individuals (01 and 01) did not impact our findings. The relatedness between spouses was summarized across the MHC region and the genome by computing for each couple i and j the mean ratio Rij over all SNPs and then averaging this ratio over all couples. In addition, to exclude the possibility of outliers influencing our findings, we checked that similar results were found when focusing on the median of the ratio Rij over all couples.

Significance was assessed by permuting individuals among the couples 105 times, attributing a new wife to each husband. We estimated a two-sided p-value defined as the proportion of permutations in which the permuted couples had more extreme relatedness than the real couples.

In addition, to control for genome-wide effects, we asked whether the MHC region was unusual relative to the rest of the genome in terms of similarity or dissimilarity between spouses. To do so, we compared the MHC to all sliding windows of 3.6 Mb (in increments of 300 kb) having at least 100 variants, not overlapping a centromere and having similar recombination rate (±10%) as the MHC (1117 windows met these criteria). Here, it is crucial to take into account the recombination rate because the MHC has particular LD structure [49] and the variance of any statistics (here the relatedness coefficient) computed throughout the genome is influenced by the local recombination rate, such that more extreme values are expected in regions of low recombination [50,51]. We estimated the proportion of such 1117 windows exhibiting more extreme relatedness coefficient between spouses than the MHC. Such a proportion corresponds to a two-sided p-value as relatedness coefficients computed for all genomic windows similar to the MHC in terms of length and recombination rate provide an empirical null distribution.

To further check whether our results were robust to changes in the methodological procedure, we re-estimated the relatedness between spouses using two alternative statistics: first, the genetic correlation Cij between spouses i and j based on their standardized genotypes [47]

Cij=l=1qZilZjlq1, 2.2

where Zil is the standardized genotype of individual i at locus l, and q is equal to the number of SNPs included in the analysis.

Second, we computed a Z-score, indicating the deviation between the mean relatedness between spouses in comparison to non-spouses and defined as below

Z=mspousesmnonspousesσnonspouses, 2.3

where mspouses is the mean Rousset coefficient between spouses and mnon-spouses and σnon-spouses are the mean and the standard error of Rousset coefficient between all possible pairs of individuals except spouses.

In addition, we computed a Z-score, indicating the deviation between the mean relatedness between spouses in comparison to opposite-sex non-spouse pairs and defined as

Z=mspousesmoppositesexnonspousesσoppositesexnonspouses, 2.4

where mspouses is the mean Rousset coefficient between spouses and mopposite-sex non-spouses and σopposite-sex non-spouses are the mean and the standard error of Rousset coefficient between all opposite-sex non-spouse pairs. Opposite-sex non-spouse pairs include all male–female pairs that are not spouse to each other.

3. Results

We investigated MHC non-random mating signal in European and Middle Eastern populations using the GAIN-ADHD dataset, which comprises genotype data for parent–child trios sampled in Belgium, Germany, Ireland, The Netherlands, the UK, Spain and Israel. Principal component analyses (electronic supplementary material, figures S1 and S2) confirmed that most samples from the same country clustered together. In addition, while samples from Israel and Spain clustered apart from other countries, no striking genetic boundary was observed between Northern European countries (Belgium, Germany, Ireland, The Netherlands and the UK). Consequently, we analysed each country independently as well as a merged dataset grouping all individuals from Northern European countries (and referred here as the Northern European population). The motivation behind such grouping was to gain power in our statistical analyses by increasing sample size, while introducing a limited amount of genetic substructure. After quality control and preliminary analyses (including an LD-pruning step to obtain a dataset of approximately independent SNPs), the dataset comprises 883 couples (table 1) and 205 019 SNPs, including 421 SNPs within the MHC region.

Table 1.

Sample size and mean Rousset relatedness coefficient (R) between spouses at the genome-wide level and at the MHC. Italic values correspond to significant p-values considering the 5% threshold.

population sample size (number of couples) genome-wide
MHC
relatedness between spouses (R) p-valuea relatedness between spouses (R) p-valueb p-valuec
Northern Europe 1330 (658) 8 × 10−4 <10−5 −0.009 0.003 0
 Belgium 72 (36) 8 × 10−4 0.033 −0.011 0.343 0.264
 Ireland 172 (86) 5 × 10−4 0.032 −0.018 0.068 0.003
 Germany 217 (108) 5 × 10−4 0.009 −0.010 0.170 0.073
 Netherlands 613 (302) 4 × 10−4 10−4 −0.010 0.040 0.010
 UK 256 (126) −8 × 10−5 0.696 −0.010 0.157 0.080
Spain 139 (69) 10−4 0.734 −0.010 0.179 0.120
Israel 329 (156) 0.003 <10−5 0.003 0.575 0.625

aTwo-sided p-value defined as the proportion of spouse permutations yielding a more extreme R than the R observed between real spouses.

bTwo-sided p-value defined as the proportion of spouse permutations yielding a more extreme R than the R observed between real spouses.

cTwo-sided p-value defined as the proportion of 3.6 Mb genomic windows (with similar recombination rate as the MHC) exhibiting a more extreme R between spouses than the MHC.

We measured the genetic relatedness between spouses using Rousset relatedness coefficient (R) [48] (table 1; electronic supplementary material, figure S3). In all populations, the mean relatedness coefficients between spouses for the MHC region were negative, ranging from −0.018 in Ireland to −0.009 in Northern Europe, except in Israel where R was equal to 0.003. We tested whether spouses were more MHC-similar or -dissimilar than random pairs of individuals by permuting individuals between couples (attributing a new husband randomly to each wife) (figure 1 and table 1). We observed that real spouses from The Netherlands were more MHC-dissimilar than random pairs of spouses (p = 0.040) and marginally significant results were reached for spouses from Ireland (p = 0.068). When analysing all spouses from Northern Europe together (and permuting individuals between couples regardless of their country), we observed again that spouses were significantly more MHC-dissimilar from each other than random pairs of individuals (p = 0.003). p-values in Belgium, Germany, the UK, Spain and Israel were above 0.1.

Figure 1.

Figure 1.

Distribution of mean Rousset relatedness coefficient computed at the MHC level for real spouses (red line) and for permuted spouses (blue distribution, 105 permutations).

To control for genome-wide effects, and putative socio-demographic factors influencing mate choice, we compared these observations to the pattern of genetic relatedness across the genome. Genome-wide spouses were significantly more genetically similar than random pairs of individuals in all populations (p < 0.05), except in Spain and the UK (table 1; electronic supplementary material, figure S4). In particular, spouses from Israel exhibit similarity across the genome much higher (R = 0.003, p < 10−5) than the similarity between spouses observed in the other populations (R ≤ 8 × 10−4).

To further control for genome-wide effects, we asked whether the MHC region was unusual relative to similar regions across the genome in terms of similarity or dissimilarity between spouses, by comparing the average MHC relatedness between spouses to that of all genomic windows having the same length (3.6 Mb) and similar recombination rate (±10%) as the MHC (figure 2 and table 1). Only 0.3%, 1%, 7.3% and 8% of windows exhibited a more extreme level of genetic relatedness between spouses than the MHC when focusing on Ireland, The Netherlands, Germany and the UK, respectively. Moreover, in the merged Northern European population, the MHC exhibited the strongest non-random signal among all genomic windows with similar length and recombination rate. The MHC was not statistically unusual in comparison to the rest of the genome in terms of relatedness between spouses in Belgium, Spain and Israel (table 1). Note that this approach, which compares the MHC to genomic windows with similar recombination rate, also controls for variation in the level of genetic diversity throughout the genome since recombination and genetic diversity are correlated to each other [52]. In particular, the number of windows having a similar (±10%) recombination rate to that of the MHC (1117) is very close to the number of windows having a similar (±10%) recombination rate and heterozygosity to those of the MHC (from 1055 to 1113), confirming that the recombination rate is a good proxy for the genetic diversity. In other words, it is unlikely that the high diversity of the MHC biases our findings. To further confirm this, we compared the MHC to regions of the genome having similar recombination rate and similar level of heterozygosity (±10%) as the MHC and we replicated our findings: no window had more extreme relatedness coefficient between spouses than the MHC in Northern Europe and only 0.3 and 1% of the windows had more extreme relatedness between spouses than the MHC in Ireland and The Netherlands.

Figure 2.

Figure 2.

Mean Rousset relatedness coefficient between spouses across 3.6 Mb sliding windows (in increments of 300 kb) throughout the genome, plotted against the window recombination rate. The red point corresponds to the MHC.

In addition, we checked that our results were robust to changes in the analysis procedure, as proposed by a study [37] that criticized the original statistical approach of the study by Chaix et al. [35]. These changes were as follows. (i) We performed a slight modification in the way the Rousset coefficient was computed by setting the proportion of identical variants between pairs of heterozygous individuals (01 and 01) to 1, instead of 0.5. (ii) We checked that our findings were not driven by outliers by performing all analyses with the median (rather than mean) Rousset relatedness coefficient between spouses. In addition, we considered alternative statistics assessing relatedness between spouses, that is (iii) the mean genetic correlation between spouses' standardized genotypes [47] and (iv) a Z-score indicating the deviation in genetic relatedness between spouses and non-spouses. These four changes in the analysis procedure had limited effect on our results, and strong evidence for MHC non-random mating was still found in Northern Europe and The Netherlands and occasionally in Ireland, as well as in Belgium, Germany and the UK (electronic supplementary material, tables S2–S5). In addition, the signals of MHC non-random mate choice in Northern Europe remain significant after multiple testing correction (the significance threshold with the Bonferroni correction is 0.006 and p-values are below 0.003 for Northern Europe).

Last but not least, we asked whether opposite-sex non-spouse pairs exhibit a similar signal of MHC dissimilarity as the spouses, as suggested by Derti et al. [37]. This hypothesis was not supported by the data since trends for MHC similarity were observed between opposite-sex non-spouse pairs in most population samples (p > 0.1) except in Germany, where a significant signal of dissimilarity was found (R = −9 × 10−4, p = 0.03) (table 2). In addition, we computed a Z-score indicating the deviation in genetic relatedness between spouses and opposite-sex non-spouse pairs. This analysis confirmed again that spouses were extreme in terms of MHC dissimilarity in comparison to opposite-sex non-spouse pairs in Northern Europe (p < 0.01) and marginally so in Ireland and The Netherlands (p < 0.06) (table 2).

Table 2.

Mean Rousset relatedness coefficient (R) between opposite-sex non-spouses and Z-score indicating the deviation of Rousset relatedness coefficient between spouses and opposite-sex non-spouses at the MHC. Italic values correspond to significant p-values considering the 5% threshold.

population opposite-sex non-spouses
spouses versus opposite-sex non-spouses
relatedness (R) p-valuea Z-score p-valueb p-valuec
Northern Europe 10−4 0.470 −0.102 0.003 9 × 10−4
 Belgium 3 × 10−5 0.975 −0.159 0.308 0.241
 Ireland 5 × 10−4 0.372 −0.173 0.055 0.052
 Germany −9 × 10−4 0.030 −0.100 0.248 0.241
 Netherlands 10−4 0.148 −0.098 0.057 0.051
 UK −3 × 10−4 0.991 −0.123 0.124 0.062
Spain 10−4 0.863 −0.149 0.157 0.079
Israel 0.002 0.178 0.021 0.727 0.657

aTwo-sided p-value defined as the proportion of 3.6 Mb genomic windows (with similar recombination rate as the MHC) exhibiting a more extreme R between opposite-sex non-spouses than the MHC.

bTwo-sided p-value defined as the proportion of spouse permutations yielding a more extreme Z-score than the Z-score observed for real spouses.

cTwo-sided p-value defined as the proportion of 3.6 Mb genomic windows (with similar recombination rate as the MHC) exhibiting a more extreme Z-score than the MHC.

4. Discussion

In this study, we found a signal of MHC non-random mating among Dutch couples, as well as in a merged population sample of Northern European origin: not only these couples were significantly more MHC-dissimilar than random pairs of individuals, but this pattern of dissimilarity was extreme when compared to the rest of the genome, both globally and when broken into windows having the same length and similar recombination rate as the MHC. This signal of non-random mating in Northern Europe was robust to multiple testing correction, and our observations were globally robust to small changes in the analysis procedure, particularly modifications in the relatedness measures suggested by the study [37] that criticized the original statistical approach proposed by Chaix et al. [35]. In addition, spouses from Ireland, but also from Belgium, Germany, the UK and Spain, exhibit tendencies for MHC dissimilarity, which became occasionally significant when performing these changes in the relatedness measures. This suggests a putative lack of power in these population samples to reach significance thresholds. To test this hypothesis, we performed a post hoc power analysis, evaluating the influence of sample size on the ability to detect a level of MHC non-random mating similar to the one observed among the 302 Dutch couples (electronic supplementary material, note and table S6). This analysis shows that this power is 5.5%, 9.1%, 12%, 12.9% and 17.3% when the number of couples drops to 36, 69, 86, 108 and 126, as this is the case for population samples from Belgium, Spain, Ireland, Germany and the UK, respectively. This sharp decrease in statistical power with sample size may explain why the trend for MHC dissimilarity between spouses from Ireland, the UK, Belgium and Spain was not significant.

In addition, we confirmed that such significant MHC dissimilarity signals in The Netherlands and in the merged Northern European sample were specific to spouses and cannot be generalized to all opposite-sex pairs as suggested by Derti et al. [37]. Indeed, positive MHC relatedness coefficients were observed among opposite-sex non-spouse pairs in most population samples, except in the German population sample, where opposite-sex non-spouse pairs exhibited a significant signal of MHC dissimilarity compared to the rest of genome. Although this would warrant further analysis, we hypothesize that such signal in Germany may relate to differences between males and females MHC composition that could derive from sex-specific post-copulatory selective pressures or from sexually antagonistic selection as reported in several species [5357].

Because the dissimilarity among Dutch and more generally Northern European couples was MHC-specific and not found at the genome-wide level, it may result from MHC disassortative mating in these populations, possibly triggered by our olfactory capacity to discriminate MHC-mediated odours [18,22,27]. We explored the pattern of spouse dissimilarity throughout the MHC using a sliding-window approach and observed that the genetic dissimilarity among spouses was distributed all along the MHC, rather than localized to particular parts of the MHC, which suggests that such disassortative mating might be driven by a summation of effects over multiple genes (electronic supplementary material, figure S5). This is in line with previous analyses that reported negative results at the level of individual SNPs, HLA alleles and amino acids [35,40] and with previous observations from vertebrates species [28], which reported significant female choice for dissimilarity only when dissimilarity was characterized across multiple loci. In addition, figure S5 in the electronic supplementary material shows that the signal of non-random mating drops quickly when moving away from the classical MHC region, potentially explaining why a recent study of a large European American dataset focusing on a less stringent definition of the MHC region spanning over 4.9 Mb reported a non-significant trend for MHC dissimilarity between these spouses [40].

All couples from this study were selected for having offspring, one of them being affected by the ADHD [4143]. It is unlikely that the inclusion criteria regarding ADHD is affecting our findings since no association was reported between MHC and ADHD [41,58]. In addition, even in the case of an association between the MHC and ADHD, this may create a bias towards MHC similarity (rather than dissimilarity) among parents of children affected by the ADHD, a pattern found in a study of spouses having a child affected by multiple sclerosis [59], a disease strongly associated with MHC [60]. Consequently, we believe that our signal is indicative of MHC disassortative mating in the general population. However, it is possible that the pattern reported here is restricted to fertile spouses, rather than couples in general, thus reflecting post-copulatory rather than pre-copulatory pressures [6163]. Such possibility was also discussed in a previous study by Chaix et al. reporting MHC disassortative mating among spouses of European American origin (having at least a child) [35]. Nevertheless, MHC disassortative mating was also found among Hutterite couples [64], in which all couples were included, regardless of whether they had a child or not (C. Ober 2008, personal communication).

The benefits of MHC disassortative mating are still debated. Such process could either increase or optimize the level of MHC heterozygosity in the offspring, thus enhancing their resistance to multiple pathogens. Alternatively, the MHC could function as a kin recognition system to minimize inbreeding at the genome-wide level [18,22,65]. Under this latter hypothesis, a correlation should exist between spouse relatedness at the MHC and genome-wide levels. However, no such correlation was found in Ireland, The Netherlands and more general Northern Europe samples (electronic supplementary material, figure S6 and table S7). This supports the hypothesis that in this dataset, MHC disassortative mating pattern aims at enhancing the immune resistance of offspring rather than limiting their inbreeding.

When looking at the genome-wide pattern of relatedness among spouses, we found a signal of similarity between spouses in all samples except the UK and Spain. Such a pattern complements previous observations [2] and may result from the tendency to mate with someone from the same location or neighbourhood [66,67] (likely to be genetically closer than a more distant individual [68,69]), or from the tendency to mate with a distant cousin [9]. Alternatively, such a pattern may be generated under random mating if the data were collected from different geographical areas and then aggregated for the analysis. However, no genetic stratification was detected for all population samples except for Israel (electronic supplementary material, figures S1 and S2), which supports the hypothesis of geographical homogamy (tendency to mate with a ‘neighbour’) and/or distant cousin marriages as the mating process generating such assortative patterns in the European samples.

The Israeli population sample exhibits an assortative signal much stronger than the ones seen in the other population samples. In addition, we detected a specific genetic structure in this sample, with one clear genetic cluster, as well as two other less well-defined clusters, and a number of admixed individuals in between them (electronic supplementary material, figures S1 and S2). The clusters are not geographically based since they mixed individuals sampled in Tel Aviv and Jerusalem (electronic supplementary material, figure S2). In order to further understand such structures, we performed a PCA analysis of the Israel sample merged with European and Middle Eastern populations samples from the HGDP dataset [70] (electronic supplementary material, figure S7). We observed that the two less well-defined clusters fall in between European and Middle Eastern populations, as observed in previous studies of Jewish populations [71]. However, the more defined cluster distinguishes itself from all European and Middle Eastern populations, suggesting a strong genetic drift effect for this cluster. Such genetic structures may relate to the fact that the Israel samples analysed here include both Ashkenazi and non-Ashkenazi (from Arab countries) Jews (R. P. Ebstein 2017, personal communication). We hypothesize that such social stratification constrains mate choice and that the observed signal of genome-wide assortative mating reflects social homogamy [33], possibly combined with cousin marriages, a frequent practice in this country [9]. On the other hand, in Israel, the MHC exhibits no departure from the null hypothesis of random mating (table 1). This is true also when replicating the analysis in each of the three genetic clusters seen on the PCA analysis (electronic supplementary material, table S8). These observations support the hypothesis that whenever social rules over mate choice are strong, individuals are less prone to rely on biological cues, and particularly on the MHC, when choosing a mate. Similar finding was reported for the Yoruba population, in which Chaix et al. found a signal of genome-wide assortative mating possibly reflecting cousin marriages, but no significant signal of MHC non-random mating [35].

Finally, to estimate the biological importance of the signal of MHC non-random mating detected in The Netherlands and in Northern Europe, as well as its consequences in terms of offspring genetic diversity, we used the female choice model proposed by Hedrick [65]. Assuming a two-allele model, we estimated the strength of female choice (relative preference of MHC-dissimilar males over MHC similar males) to be around 7%. The decrease in homozygosity in offspring was estimated to be around 1% (electronic supplementary material, table S9). These estimates, based on this simple model, suggest that the weight of the MHC on mate choice in these populations is significant but weak, and may easily be overwhelmed by socio-cultural factors. This may explain why this effect was not found in the Israel population sample, nor in other populations such as the Yoruba [35] and South Amerindians [72], where the role played by socio-cultural factors on mate choice is important. A decrease of 1% in offspring homozygosity seems weak too and it is difficult to assess whether this has any consequence on the fitness of the offspring compared to offspring born from random pairings. However, even if the effect on offspring is low in contemporary populations, this does not mean there is no MHC non-random mating process in these populations. The signal we detect may be the heritage of millions of years of evolution, still acting to some extent in human populations, with no particular benefit for the offspring.

To conclude, our findings support the view that the MHC influences mate choice in humans in a context-dependent way [32]. In particular, the role played by the MHC seems to depend on the socio-cultural context. Indeed, we found evidence for MHC disassortative mating in Dutch and more generally Northern European human populations, but not in the Israeli sample, where a strong genome-wide signature of assortative mating was observed, probably reflecting social constraints over mate choice, such as social homogamy and/or cousin marriages. From these findings, we hypothesize that the MHC may influence mate choice in human populations only in the context of weak socio-cultural constraints over mate choice. To test this hypothesis, this approach should be extended to more population samples from all continents, taking into account the variability of human social organizations [34]. This would help assessing the respective weights of the biological and socio-cultural factors influencing mate choice in human populations. In addition, because the MHC is associated with a wide spectrum of complex diseases [73], the consequences of such MHC non-random mate choice over kinship estimation and association studies remain an open question.

Supplementary Material

Supplementary Figures and Tables
rspb20182664supp1.pdf (772KB, pdf)

Supplementary Material

R Script
rspb20182664supp2.txt (51.4KB, txt)

Acknowledgements

Samples for the International Multi-Center ADHD Genetics Project were provided by the following investigators: S. Faraone (PI), R. Anney, P. Asherson, J. Sergeant, R. Ebstein, B. Franke, M. Gill, A. Miranda, F. Mulas, R. Oades, H. Roeyers, A. Rothenberger, T. Banaschewski, J. Buitelaar, E. Sonuga-Barke (site PIs), M. Daly, C. Lange, N. Laird, J. Su and B. Neale (statistical analysis team). We thank Peter Donnelly for helpful discussions.

Ethics

This study is based on data from the GAIN-ADHD study. Ethical approval for the study was obtained from National Institute of Health registered ethical review boards for each centre (subjects were recruited from 12 specialist centres in 8 countries). Detailed information sheets were provided and informed consent obtained from the majority of children and from all of their parents.

Data accessibility

The GAIN-ADHD dataset was obtained from the database of Genotypes and Phenotypes (dbGaP) found at http://www.ncbi.nlm.nih.gov/gap, through dbGaP accession number 34465-5. Codes are available as supplementary material. R package GPop is available in GitHub (https://github.com/genostats/GPop).

Authors' contributions

R.C., B.T., R.L. and C.D.-R. designed the study; C.D.-R., R.L., I.D. and R.C. analysed the data; B.T. implemented the female choice model; R.C. and C.D.-R. wrote the manuscript.

Competing interests

The authors declare no competing interests.

Funding

Funding support for the International Multisite ADHD Genetics (IMAGE) project was provided by NIH grants R01MH62873 and R01MH081803 to S.V. Faraone and the genotyping of samples was provided through the Genetic Association Information Network (GAIN). C.D.-R. was supported by a postdoctoral grant from the French Museum National d'Histoire Naturelle.

References

  • 1.Risch N, et al. 2009. Ancestry-related assortative mating in Latino populations. Genome Biol. 10, R132 ( 10.1186/gb-2009-10-11-r132) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Sebro R, Peloso GM, Dupuis J, Risch NJ. 2017. Structured mating: patterns and implications. PLoS Genet. 13, e1006655 ( 10.1371/journal.pgen.1006655) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Abdellaoui A, et al. 2015. Educational attainment influences levels of homozygosity through migration and assortative mating. PLoS One 10, 1–15. ( 10.1371/journal.pone.0118935) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Abdellaoui A, et al. 2013. Association between autozygosity and major depression: stratification due to religious assortment. Behav. Genet. 43, 455–467. ( 10.1007/s10519-013-9610-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Robinson MR, et al. 2017. Genetic evidence of assortative mating in humans. Nat. Hum. Behav. 1, 0016 ( 10.1038/s41562-016-0016) [DOI] [Google Scholar]
  • 6.Pemberton TJ, Absher D, Feldman MW, Myers RM, Rosenberg NA, Li JZ. 2012. Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet. 91, 275–292. ( 10.1016/j.ajhg.2012.06.014) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Leutenegger A-L, Sahbatou M, Gazal S, Cann H, Génin E. 2011. Consanguinity around the world: what do the genomic data of the HGDP-CEPH diversity panel tell us? Eur. J. Hum. Genet. 19, 583–587. ( 10.1038/ejhg.2010.205) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Gazal S, Sahbatou M, Babron M-C, Génin E, Leutenegger A-L. 2015. High level of inbreeding in final phase of 1000 Genomes Project. Sci. Rep. 5, 17453 ( 10.1038/srep17453) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bittles AH, Black ML. 2010. Consanguineous marriage and human evolution. Annu. Rev. Anthropol. 39, 193–207. ( 10.1146/annurev.anthro.012809.105051) [DOI] [Google Scholar]
  • 10.Wright S. 1951. The genetical structure of populations. Ann. Eugen. 15, 323–354. ( 10.1111/j.1469-1809.1949.tb02451.x) [DOI] [PubMed] [Google Scholar]
  • 11.Crow JF, Felsenstein J. 1968. The effect of assortative mating on the genetic composition of a population. Eugen. Q. 15, 85–97. ( 10.1080/19485565.1968.9987760) [DOI] [PubMed] [Google Scholar]
  • 12.Sebro R, Risch NJ. 2012. A brief note on the resemblance between relatives in the presence of population stratification. Heredity (Edinb). 108, 563–568. ( 10.1038/hdy.2011.124) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Marchini J, Cardon LR, Phillips MS, Donnelly P. 2004. The effects of human population structure on large genetic association studies. Nat. Genet. 36, 512–517. ( 10.1038/ng1337) [DOI] [PubMed] [Google Scholar]
  • 14.Helgason A, Yngvadóttir B, Hrafnkelsson B, Gulcher J, Stefánsson K. 2004. An Icelandic example of the impact of population structure on association studies. Nat. Genet. 37, 90–96. ( 10.1038/ng1492) [DOI] [PubMed] [Google Scholar]
  • 15.Freedman ML, et al. 2004. Assessing the impact of population stratification on genetic association studies. Nat. Genet. 36, 388–393. ( 10.1038/ng1333) [DOI] [PubMed] [Google Scholar]
  • 16.Redden DT, Allison DB. 2005. The effect of assortative mating upon genetic association studies: spurious associations and population substructure in the absence of admixture. Behav. Genet. 36, 678–686. ( 10.1007/s10519-006-9060-0) [DOI] [PubMed] [Google Scholar]
  • 17.Cardon LR, Palmer LJ. 2003. Population stratification and spurious allelic association. Lancet 361, 598–604. ( 10.1016/S0140-6736(03)12520-2) [DOI] [PubMed] [Google Scholar]
  • 18.Apanius V, Penn D, Slev PR, Ruff LR, Potts WK. 1997. The nature of selection on the major histocompatibility complex. Crit. Rev. Immunol. 17, 179–224. ( 10.1615/CritRevImmunol.v17.i2.40) [DOI] [PubMed] [Google Scholar]
  • 19.Hedrick PW. 1994. Evolutionary genetics of the major histocompatibility complex. Am. Nat. 143, 945–964. ( 10.1086/285643) [DOI] [Google Scholar]
  • 20.Bernatchez L, Landry C. 2003. MHC studies in nonmodel vertebrates: what have we learned about natural selection in 15 years? J. Evol. Biol. 16, 363–377. ( 10.1046/j.1420-9101.2003.00531.x) [DOI] [PubMed] [Google Scholar]
  • 21.Parham P, Ohta T. 1996. Population biology of antigen presentation by MHC class I molecules. Science 272, 67–74. ( 10.1126/science.272.5258.67) [DOI] [PubMed] [Google Scholar]
  • 22.Milinski M. 2006. The major histocompatibility complex, sexual selection, and mate choice. Annu. Rev. Ecol. Evol. Syst. 37, 159–186. ( 10.1146/annurev.ecolsys.37.091305.110242) [DOI] [Google Scholar]
  • 23.Horton R, et al. 2004. Gene map of the extended human MHC. Nat. Rev. Genet. 5, 889–899. ( 10.1038/nrg1489) [DOI] [PubMed] [Google Scholar]
  • 24.Robinson J, Halliwell JA, Hayhurst JD, Flicek P, Parham P, Marsh SGE. 2015. The IPD and IMGT/HLA database: allele variant databases. Nucleic Acids Res. 43, D423–D431. ( 10.1093/nar/gku1161) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Trowsdale J, Knight JC. 2013. Major histocompatibility complex genomics and human disease. Annu. Rev. Genomics Hum. Genet. 14, 301–323. ( 10.1146/annurev-genom-091212-153455) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Prugnolle F, Manica A, Charpentier M, Guégan JF, Guernier V, Balloux F. 2005. Pathogen-driven selection and worldwide HLA class I diversity. Curr. Biol. 15, 1022–1027. ( 10.1016/j.cub.2005.04.050) [DOI] [PubMed] [Google Scholar]
  • 27.Penn D, Potts W. 1998. How do major histocompatibility complex genes influence odor and mating preferences? Adv. Immunol. 69, 411–436. ( 10.1016/S0065-2776(08)60612-4) [DOI] [PubMed] [Google Scholar]
  • 28.Kamiya T, O'Dwyer K, Westerdahl H, Senior A, Nakagawa S. 2014. A quantitative review of MHC-based mating preference: the role of diversity and dissimilarity. Mol. Ecol. 23, 5151–5163. ( 10.1111/mec.12934) [DOI] [PubMed] [Google Scholar]
  • 29.Brouwer L, Barr I, Van De Pol M, Burke T, Komdeur J, Richardson DS.. 2010. MHC-dependent survival in a wild population: evidence for hidden genetic benefits gained through extra-pair fertilizations. Mol. Ecol. 19, 3444–3455. ( 10.1111/j.1365-294X.2010.04750.x) [DOI] [PubMed] [Google Scholar]
  • 30.Setchell JM, Huchard E. 2010. The hidden benefits of sex: evidence for MHC-associated mate choice in primate societies. Bioessays 32, 940–948. ( 10.1002/bies.201000066) [DOI] [PubMed] [Google Scholar]
  • 31.Winternitz J, Abbate JL, Huchard E, Havlíček J, Garamszegi LZ. 2017. Patterns of MHC-dependent mate selection in humans and nonhuman primates: a meta-analysis. Mol. Ecol. 26, 668–688. ( 10.1111/mec.13920) [DOI] [PubMed] [Google Scholar]
  • 32.Havlicek J, Roberts SC. 2009. MHC-correlated mate choice in humans: a review. Psychoneuroendocrinology 34, 497–512. ( 10.1016/j.psyneuen.2008.10.007) [DOI] [PubMed] [Google Scholar]
  • 33.Kalmijn M. 1998. Intermarriage and homogamy: causes, patterns, trends. Annu. Rev. Sociol 24, 395–421. ( 10.1146/annurev.soc.24.1.395) [DOI] [PubMed] [Google Scholar]
  • 34.Godelier M. 2004. Métamorphoses de la parenté. Paris, France: Fayard. [Google Scholar]
  • 35.Chaix R, Cao C, Donnelly P. 2008. Is mate choice in humans MHC-dependent? PLoS Genet. 4, 1–5. ( 10.1371/journal.pgen.1000184) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Laurent R, Toupance B, Chaix R. 2012. Non-random mate choice in humans: insights from a genome scan. Mol. Ecol. 21, 587–596. ( 10.1111/j.1365-294X.2011.05376.x) [DOI] [PubMed] [Google Scholar]
  • 37.Derti A, Cenik C, Kraft P, Roth FP. 2010. Absence of evidence for MHC-dependent mate selection within HapMap populations. PLoS Genet. 6, e1000925 ( 10.1371/journal.pgen.1000925) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Laurent R, Chaix R. 2012. MHC-dependent mate choice in humans: why genomic patterns from the HapMap European American dataset support the hypothesis. Bioessays 34, 267–271. ( 10.1002/bies.201100150) [DOI] [PubMed] [Google Scholar]
  • 39.Laurent R, Chaix R.. 2012. HapMap European American genotypes are compatible with the hypothesis of MHC-dependent mate choice (response to DOI 10.1002/bies.201200023, Derti and Roth). Bioessays 34, 871–872. ( 10.1002/bies.201200075) [DOI] [PubMed] [Google Scholar]
  • 40.Qiao Z, Powell JE, Evans DM. 2018. MHC-dependent mate selection within 872 spousal retirement study. Genes (Basel) 9, 1–13. ( 10.3390/genes9010053) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Kuntsi J, Neale BM, Chen W, Faraone SV, Asherson P. 2006. The IMAGE project: methodological issues for the molecular genetic analysis of ADHD. Behav. Brain Funct. 2, 27 ( 10.1186/1744-9081-2-27) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Zhou K, et al. 2008. Linkage to chromosome 1p36 for attention-deficit/hyperactivity disorder traits in school and home settings. Biol. Psychiatry 64, 571–576. ( 10.1016/j.biopsych.2008.02.024) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Lasky-Su J, et al. 2008. Genome-wide association scan of the time to onset of attention deficit hyperactivity disorder. Am. J. Med. Genet. B Neuropsychiatr. Genet. 147, 1355–1358. ( 10.1002/ajmg.b.30869) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Brookes K, et al. 2006. The analysis of 51 genes in DSM-IV combined type attention deficit hyperactivity disorder: association signals in DRD4, DAT1 and 16 other genes. Mol. Psychiatry 11, 934–953. ( 10.1038/sj.mp.4001869) [DOI] [PubMed] [Google Scholar]
  • 45.Perdry H, Dandine-Roulland C.. 2017. Package R ‘gaston’ [version 1.4.9].
  • 46.Dandine-Roulland C. 2018. Package R ‘GPop’ [version 1.0].
  • 47.Speed D, Balding DJ. 2015. Relatedness in the post-genomic era: is it still useful? Nat. Rev. Genet. 16, 33–44. ( 10.1038/nrg3821) [DOI] [PubMed] [Google Scholar]
  • 48.Rousset F. 2002. Inbreeding and relatedness coefficients: what do they measure? Heredity (Edinb. ). 88, 371–380. ( 10.1038/sj.hdy.6800065) [DOI] [PubMed] [Google Scholar]
  • 49.De Bakker PIW, et al. 2006. A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC. Nat. Genet. 38, 1166–1172. ( 10.1038/ng1885) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wall JD. 1999. Recombination and the power of statistical tests of neutrality. Genet. Res. 74, 65–79. ( 10.1017/S0016672399003870) [DOI] [Google Scholar]
  • 51.McVean G, Spencer CCA, Chaix R. 2005. Perspectives on human genetic variation from the HapMap project. PLoS Genet. 1, 0413–0418. ( 10.1371/journal.pgen.0010054) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Spencer CCA, Deloukas P, Hunt S, Mullikin J, Myers S, Silverman B, Donnelly P, Bentley D, McVean G. 2006. The influence of recombination on human genetic diversity. PLoS Genet. 2, 1375–1385. ( 10.1371/journal.pgen.0020148) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Astolfi P, Martinetti M, Gigli-Berzolari F, Cuccia M. 1990. The effect of parental and maternal-fetal histocompatibility at MHC on sex ratio in offspring. Tissue Antigens 35, 172–177. ( 10.1111/j.1399-0039.1990.tb01775.x) [DOI] [PubMed] [Google Scholar]
  • 54.Dorak MT, Lawson T, Machulla HKG, Mills KI, Burnett AK. 2002. Increased heterozygosity for MHC class II lineages in newborn males. Genes Immun. 3, 263–269. ( 10.1038/sj.gene.6363862) [DOI] [PubMed] [Google Scholar]
  • 55.Radvany RM, Vaisrub N, Ober C, Patel KM, Hecht F. 1987. The human sex ratio: increase in first-born males to parents with shared HLA-DR antigens. Tissue Antigens 29, 34–42. ( 10.1111/j.1399-0039.1987.tb01547.x) [DOI] [PubMed] [Google Scholar]
  • 56.Schad J, Dechmann DKN, Voigt CC, Sommer S. 2011. MHC class II DRB diversity, selection pattern and population structure in a neotropical bat species, Noctilio albiventris . Heredity (Edinb.) 107, 115–126. ( 10.1038/hdy.2010.173) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Roved J, Hansson B, Tarka M, Hasselquist D, Westerdahl H. 2018. Evidence for sexual conflict over major histocompatibility complex diversity in a wild songbird. Proc. R. Soc. B 285, 20180841 ( 10.1098/rspb.2018.0841) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Bidwell LC, Gray JC, Weafer J, Palmer AA, de Wit H, MacKillop J.. 2017. Genetic influences on ADHD symptom dimensions: examination of a priori candidates, gene-based tests, genome-wide variation, and SNP heritability. Am. J. Med. Genet. B Neuropsychiatr. Genet. 174, 458–466. ( 10.1002/ajmg.b.32535) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Khankhanian P, Gourraud P-A, Caillier SJ, Santaniello A, Hauser SL, Baranzini SE, Oksenberg JR. 2010. Genetic variation in the odorant receptors family 13 and the mhc loci influence mate selection in a multiple sclerosis dataset. BMC Genomics 11, 626 ( 10.1186/1471-2164-11-626) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Goris A. 2012. Comment: the HLA region in multiple sclerosis. Neurology 79, 544 ( 10.1212/WNL.0b013e318263c45b) [DOI] [PubMed] [Google Scholar]
  • 61.Ziegler A, Kentenich H, Uchanska-Ziegler B. 2005. Female choice and the MHC. Trends Immunol. 26, 496–502. ( 10.1016/j.it.2005.07.003) [DOI] [PubMed] [Google Scholar]
  • 62.Ziegler A, Dohr G, Uchanska-Ziegler B. 2002. Possible roles for products of polymorphic MHC and linked olfactory receptor genes during selection processes in reproduction. Am. J. Reprod. Immunol. 48, 34–42. ( 10.1034/j.1600-0897.2002.01097.x) [DOI] [PubMed] [Google Scholar]
  • 63.Birkhead TR, Pizzari T. 2002. Postcopulatory sexual selection. Nat. Rev. Genet. 3, 262–673. ( 10.1038/nrg774) [DOI] [PubMed] [Google Scholar]
  • 64.Ober C, Weitkamp LR, Cox N, Dytch H, Kostyu D, Elias S. 1997. HLA and mate choice in humans. Am. J. Hum. Genet. 61, 497–504. ( 10.1086/515511) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Hedrick PW. 1992. Female choice and variation in the major histocompatibility complex. Genetics 132, 575–581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Bossard JH. 1932. Residential propinquity as a factor in marriage selection. Am. J. Sociol. 38, 219–224. ( 10.1086/216031) [DOI] [Google Scholar]
  • 67.Fix AG. 1999. Migration and colonization in human microevolution. Cambridge, UK: Cambridge University Press. [Google Scholar]
  • 68.Novembre J, et al. 2008. Genes mirror geography within Europe. Nature 456, 98–101. ( 10.1038/nature07331) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Ralph P, Coop G. 2013. The geography of recent genetic ancestry across Europe. PLoS Biol. 11, e1001555 ( 10.1371/journal.pbio.1001555) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Li JZ, et al. 2008. Worldwide human relationships inferred from genome-wide patterns of variation. Science 319, 1100–1104. ( 10.1126/science.1153717) [DOI] [PubMed] [Google Scholar]
  • 71.Behar DM, et al. 2010. The genome-wide structure of the Jewish people. Nature 466, 238–242. ( 10.1038/nature09103) [DOI] [PubMed] [Google Scholar]
  • 72.Hedrick PW, Black FL. 2004. HLA and mate selection: no evidence in South Amerindians. Am. J. Hum. Genet. 61, 505–511. ( 10.1086/515519) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Traherne JA. 2008. Human MHC architecture and evolution: implications for disease association studies. Int. J. Immunogenet. 35, 179–192. ( 10.1111/j.1744-313X.2008.00765.x) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figures and Tables
rspb20182664supp1.pdf (772KB, pdf)
R Script
rspb20182664supp2.txt (51.4KB, txt)

Data Availability Statement

The GAIN-ADHD dataset was obtained from the database of Genotypes and Phenotypes (dbGaP) found at http://www.ncbi.nlm.nih.gov/gap, through dbGaP accession number 34465-5. Codes are available as supplementary material. R package GPop is available in GitHub (https://github.com/genostats/GPop).


Articles from Proceedings of the Royal Society B: Biological Sciences are provided here courtesy of The Royal Society

RESOURCES