Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2022 Sep 6;76(10):2464–2468. doi: 10.1111/evo.14588

Luck, lottery, or legacy? The problem of confounding. A reply to Harden

Graham Coop 1,, Molly Przeworski 2
PMCID: PMC9627830  NIHMSID: NIHMS1826963  PMID: 35915930

Abstract

A reply to Harden's response to Coop and Przeworski (2022).


In Coop and Przeworski (2022), we argue that Harden (2021) misinterprets findings from genome‐wide association studies (GWAS) of educational attainment and overstates their relevance for our understanding of social inequalities. We focus on three key steps of the book's central argument: (1) the reliance on the metaphor of polygenic scores as drawn from a “genetic lottery” analogous to a randomized controlled trial; (2) the attribution of all GWAS signals to the brain of the focal individual; and (3) the treatment of GWAS findings as reflecting genetic causes of inequality within an ancestry group, but as uninterpretable between ancestry groups. Each of these three steps is invalid, for largely the same reason: not following through on what it means for the genetic effects identified in GWAS to be confounded by genomic background and by environmental effects correlated with that background. Thus, the “meta‐message” of our review is not, as Harden (2022) writes in her reply, that “It's complicated” but rather that “It's confounded.”

This confounding may not preclude uses of polygenic scores for educational attainment as control variables in the social sciences. But it does mean that these scores cannot be interpreted as identifying genetic causes of inequality. There is a long history of using confounded biological and genetic measures to explain poverty and inequality. Although Harden stakes out a different political position, her book nevertheless participates in that same tradition by downplaying the limitations of GWAS.

Here, we clarify some erroneous statements about our review and the GWAS literature in her reply. We also note some distortions, including in the quotations. As an example, Harden cites us as saying “we …fully grant the book's starting point …GWAS [genome‐wide association studies] undoubtedly capture some causal genetic effects.” The quotation provided combines two sentences that are five pages apart in our review; what we write at the outset is, in fact: “that educational attainment is heritable was documented before GWAS and is in some sense trivial …We thus fully grant the book's starting point.”

THE ELISION OF DIFFERENT TYPES OF GWAS

GWAS associate trait variation with genetic loci. In controlled or randomized environments, those associations can help to identify genetic causes. In human GWAS, the hope is to disentangle genetic and environmental effects through the use of statistical controls. These controls are not perfect, however, and for some traits, notably behavioral ones, substantial confounding of genetic effects remains (Kong et al., 2018; Mostafavi et al., 2020; Selzam et al., 2019). That problem can be mitigated in family‐based designs: the randomization of genotypes transmitted from parents to offspring allows direct genetic effects‐the effects of alleles carried by a person on that person‐to be teased apart. But a standard GWAS does not have that property. Instead, associations identified in a standard GWAS absorb indirect genetic effects and numerous confounders. As summarized in a recent review article: “There are at least three sources of confounding in GWAS: (i) environmental confounding, where allele frequencies and environmental effects vary in a correlated way across different geographic regions…; (ii) genetic confounding…or (iii) assortative‐mating confounding…These forms of confounding are conceptually different, but in practice they are often intertwined. ” (Young et al., 2019). To date, almost all GWAS have been of the standard design. Therefore, despite Harden's emphasis on causal identification and genetic lotteries—per the title—almost all of the genetic evidence marshaled in the book is potentially confounded, often to an unknown degree (Fletcher, 2022). Harden notes some of these limitations at various points, but inconsistently, and without pursuing their implications for her central claims.

In her reply, Harden acknowledges limitations in drawing inferences about differences between families from sib‐GWAS, but incorrectly argues that these do not extend to trio studies or other family‐GWAS designs. In reality, the same limitations apply. As noted in Young et al. (2019), despite some technical differences, all nuclear family‐GWAS designs similarly use “parental genotypes as controls to separate direct from indirect genetic effects and other confounding effects.” In some contexts, those direct genetic effects can be interpreted as causes. When it comes to interpreting trait differences between people from different families, however, or to predicting traits, all the effects come into play, not just direct effects (Becker et al., 2021; Okbay et al., 2022; Young et al., 2022).

Harden also describes a related method inaccurately in her reply, writing that “Relatedness disequilibrium regression (RDR)…was described by Dr. Przeworski in one of her previous co‐authored papers as giving ‘unbiased estimates of direct genetic effects’ (Young et al., 2019).” The Young et al. quote is actually not a definition of RDR, about which the paper contains only one description, in the legend of Figure 4: as providing “an estimate of the SNP heritability using a within family method” (Young et al., 2019). As for Harden's claim that “Results from RDR suggest that, yes, genetic influences account for variation in educational attainment, and not just within‐family variation,” we are unsure what she has in mind, other than a loose restatement of RDR being a method to estimate heritability.

For educational attainment specifically, confounders are a major concern. Indeed, the RDR estimate of heritability, which is designed to be less inflated by environmental confounders, is substantially lower than other estimates (Young et al., 2018; Figure 4 in Young et al., 2019). In her book, Harden provides numbers for various correlates of educational attainment, but never reports how much of the GWAS signal and the resulting polygenic prediction for educational attainment have been attributed to direct genetic effects versus other sources, such as indirect effects and population‐structure confounding. In her reply, she refers only to one meta‐analysis (Wang et al., 2021), on the basis of which she states that “there are ‘indirect’ genetic effects on educational outcomes= .08, 95% CI = .07 to .09), but these indirect genetic effects are smaller than the ‘direct’ genetic effects= .17, 95% CI = .13 to .20).” These numbers are meant to represent the effects (regression coefficients) of direct versus indirect effects on educational attainment. But the estimate reported for “indirect effects” is actually for only one of two parents (Wang et al., 2021); moreover, it includes potential effects of confounding which, alongside other statistical issues, complicates the comparison to direct genetic effects (Fletcher et al., 2021; Mostafavi et al., 2020; Trejo & Domingue, 2018; Wang et al., 2021; Young et al., 2022).

Multiple papers that go unmentioned explicitly decompose the variance in educational attainment explained by polygenic scores. These studies find that in European‐ancestry individuals, as little as a quarter to a third of the variance explained by the polygenic score is attributable to direct genetic effects, with the other two‐thirds to three‐quarters ascribed to a complex mixture of indirect genetic effects and population‐structure confounders (Howe et al., 2022; Mostafavi et al., 2020; Okbay et al., 2022; Young et al., 2022). Thus, the latest GWAS for educational attainment reports that, although  12‐16% of the variance in educational attainment among people of European ancestry is predicted by polygenic scores, less than 5% of the variance is due to direct genetic effects (Okbay et al., 2022). A recent paper, which Harden co‐authored, attempts to tease apart parental indirect effects and confounders by analyzing siblings of the parents as well as nuclear families; it concludes that much of the non‐direct genetic effects may be due to “dynastic stratification in environments relevant to success in school” (Nivard et al., 2022), a form of confounding.

THE INTERPRETATION OF GWAS HITS FOR EDUCATIONAL ATTAINMENT AS “BUILT IN” DIFFERENCES AMONG INDIVIDUALS

In her book, Harden misinterprets the GWAS enrichment analysis to imply that all GWAS associations can be ascribed to genes active in the brain, writing, e.g., that “Whatever genes are doing to make it more or less likely for some people to succeed in education, they are doing it in people's brains, not their hair or livers or skin or bones.” [p. 137]. In fact, the gene enrichment analysis for the GWAS simply shows that the associations are found more often than expected by chance near genes also expressed in the brain; it does not rule out causal effects mediated by other tissues.

Elsewhere, she writes that: “Thousands upon thousands of genetic variants matter for educational attainment …These genes exert their effects via largely unknown cellular processes that are happening in neurons and other brain cells. These cellular effects are already happening during prenatal development, and their effects on the individual organism are already evident in childhood …” [p. 148]. In so doing, she repeatedly uses phrasing that ascribes the entire predictive power of a standard‐GWAS polygenic score to the activity of genes in the brain of the focal individual, which again misinterprets gene enrichment analyses, and ignores the fact that much of the GWAS signal stems from indirect effects and confounding.

Harden does not address these points in her reply, and instead argues that “it is doubtful that any credible account of the causal paths from genetic differences between people to educational inequality will somehow circumvent psychological differences between people, or that these psychological differences will somehow not involve the brain.” Psychological traits undoubtedly play a role in educational attainment; that was never in dispute. The issue is rather that enrichment analyses and related studies leave open other causal genetic paths that are not about psychology: for instance, children at risk for chronic childhood diseases may tend to fall behind in their schooling. Moreover, Harden's argument once again ignores the consequences of confounding: namely that genetic differences among individuals identified by GWAS cannot be assumed to be causally related to educational attainment. As just one example: to the extent that the GWAS is confounded by dynastic stratification in environments, psychological differences may not explain why children find themselves trapped in under‐performing schools.

THE RELIANCE ON TYPOLOGICAL NOTIONS OF POPULATIONS

We point out in our review that Harden cannot have it both ways: she cannot simultaneously ignore the limitations of GWAS when it is conducted in individuals of European ancestry and yet invoke them to forestall comparisons of GWAS findings between ancestry groups. In her reply, Harden acknowledges the lack of any clear boundary between ancestry groups and the difficulty of controlling for confounders, but argues that these are unavoidable features of the social sciences, adding “The study of environmental influences is every bit as difficult [as that of genetic influences]. For many environments of interest, experimentation is impossible.” We agree: the lack of control or randomization of the environment only makes it that much more difficult to interpret polygenic scores for educational attainment from a standard GWAS. As a consequence, the big jump is not from one “ancestral population” to another, as described in her book, but from within‐ to between‐family comparisons, where one loses the ability to tease apart genetic effects from environmental ones.

Harden writes that “Because of the difficulty of that ‘big jump’, Coop and Przeworski conclude that ‘current PGS [polygenic scores] for educational attainment are neither interpretable nor meaningful.”' In point of fact, we wrote that current educational attainment polygenic scores are “neither interpretable nor particularly meaningful” because, even for direct effects, there is little understanding of the developmental or physiological mechanisms through which they act (in addition to the usual problem of causal loci rarely being distinguished from the variants that tag them). Nor has the field yet figured out where up to three‐quarters of the standard GWAS signal is coming from.

In her reply, Harden does not mention that existing polygenic scores for educational attainment were developed based on GWAS‐enrollees of European ancestry and have substantially reduced prediction accuracy in individuals of less similar genetic ancestry (Martin et al., 2019; Mills & Rahal, 2020; Okbay et al., 2022; Privé et al., 2022). In her book, she does, and moving from ancestry groups to racial labels, she describes a near future in which we will have a polygenic score “that is as strongly related, statistically, to academic achievement in Black students as it is in White students.” [p. 191] (This conflation of ancestry and race is particularly consequential when considering Black Americans, who vary greatly in their proportions of recent African and European ancestries, in ways that are correlated with both geography and economic opportunity (e.g., Baharian et al., 2016; Micheletti et al., 2020).) Harden warns the reader, however, that such comparisons between Black and White Americans would be “scientifically and ethically wrong” [p. 192]. To explain why comparisons of genetic effects between groups are invalid, and will remain so even when polygenic scores have been constructed for both, she refers to statistical issues that arise from confounders and the lack of environmental homogeneity. But as we outline in our review, similar problems, differing in degree rather than in kind, apply to the move from within‐family studies to standard GWAS.

Harden interprets our charge that she is trying to have it both ways as somehow related to polygenic scores being “simultaneously meaningful…and limited, by virtue of their contextual dependence and imperfect portability” , adding “As the author Maggie Nelson wrote: ‘There is a lot to be learned from having it both ways' ”. In reality, we were making the point that she cannot overlook the effects of confounding when interpreting polygenic scores within European‐ancestry GWAS individuals—let alone within the racial grouping of “Black Americans”—only to resurrect them when seeking protection against the implications of her claims for between ancestry‐group comparisons. The truth is that polygenic scores for educational attainment are also confounded within ancestry groupings, in ways that greatly complicate interpretation.

IMPLICATIONS

In concluding our review, we caution that the interpretability of polygenic scores for educational attainment “matters greatly when they are used to elucidate, let alone redress, social inequalities.” Harden recasts our statement in terms of ethical commitments, writing: “I am wary of the review's conclusion that ‘it matters greatly’ whether the causal paths connecting genetics to educational attainment are … physical phenotypes such as skin or hair color that society discriminates against, rather than mediated through the development of neurocognitive phenotypes …Are people's claims on society for their inclusion and participation, regardless of difference, any less valid if the difference is psychological versus physiological? I think not.” Of course not. Where the interpretability of polygenic scores matters greatly is if, as Harden advocates in her book, they are to be a basis for putting ethical commitments into practice.

We also convey our “doubt[s] that overstating our understanding of the genetics of behaviors is going to increase empathy.” In her reply, Harden ignores the word “overstate”, arguing that “research has shown that ‘understanding of the genetics of behaviors,’ ranging from sexual orientation to body size to serious mental illness, can indeed increase people's empathic attitudes, reducing ascriptions of blame and increasing support for civil rights.” She contends that what would really convince Americans and in particular “American conservatives” to support greater equality is evidence from behavioral genetics. The two references provided by Harden in support of her claim are more circumspect, concluding that “biogenetic explanations for mental disorders confer mixed blessings for stigma” (Kvaale et al., 2013) and that “persuasion tended to be conditioned on citizens’ political values” (Garretson & Suhay, 2016).

As scientists, our responsibility is to be deliberate about the questions we pose and transparent about the strength of our findings. That includes asking ourselves whether, as geneticists, we may end up overstating the utility of genetics in addressing social problems, recasting structural and historical inequities as embodied or psychological characteristics of individuals. In that regard, when the goal is addressing inequality, it seems worthwhile to take a broader perspective than that of the US. Many countries redistribute income and develop social programs to a greater extent than does the US, without invoking genetics, let alone GWAS.

1.

Associate Editor: Dr. J. Mallet

Handling Editor: Dr. T. Chapman

ACKNOWLEDGMENTS

Funding was provided by the National Institutes of Health (NIH R35 GM136290 awarded to GC and R01 HG011432 co‐awarded to MP).

Contributor Information

Graham Coop, Email: gmcoop@ucdavis.edu.

Molly Przeworski, Email: mp3284@columbia.edu.

REFERENCES

  1. Baharian, S. , Barakatt, M. , Gignoux, C.R. , Shringarpure, S. , Errington, J. , Blot, W.J. , Bustamante, C.D. , Kenny, E.E. , Williams, S.M. , Aldrich, M.C. , et al. (2016) The great migration and African‐American genomic diversity. PLoS genetics, 12(5), e1006059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Becker, J. , Burik, C.A. , Goldman, G. , Wang, N. , Jayashankar, H. , Bennett, M. , Belsky, D.W. , Karlsson Linnér, R. , Ahlskog, R. , Kleinman, A. , et al. (2021) Resource profile and user guide of the polygenic index repository. Nature human behaviour, 5(12), 1744–1758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Coop, G. & Przeworski, M. (2022) Lottery, luck, or legacy. A review of “The Genetic Lottery: Why DNA matters for social equality”. Evolution, 76(4), 846–853.35225362 [Google Scholar]
  4. Fletcher, J. (2022) Backdoor to a Dead End: A Review Essay The Genetic Lottery: Why DNA Matters for Social Equality by Kathryn Paige Harden. Population and Development Review , 48.
  5. Fletcher, J. , Wu, Y. , Li, T. & Lu, Q. (2021) Interpreting polygenic score effects in sibling analysis. BioRxiv. [DOI] [PMC free article] [PubMed]
  6. Garretson, J. & Suhay, E. (2016) Scientific communication about biological influences on homosexuality and the politics of gay rights. Political Research Quarterly, 69(1), 17–29. [Google Scholar]
  7. Harden, K.P. (2021) The genetic lottery: why DNA matters for social equality. Princeton University Press. [Google Scholar]
  8. Harden, K.P. (2022) On Genetics and Justice: A Reply to Coop and Przeworski (2022). Evolution, Accepted. [DOI] [PMC free article] [PubMed]
  9. Howe, L.J. , Nivard, M.G. , Morris, T.T. , Hansen, A.F. , Rasheed, H. , Cho, Y. , Chittoor, G. , Ahlskog, R. , Lind, P.A. , Palviainen, T. , et al. (2022) Within‐sibship genome‐wide association analyses decrease bias in estimates of direct genetic effects. Nature genetics, 54(5), 581–592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Kong, A. , Thorleifsson, G. , Frigge, M.L. , Vilhjalmsson, B.J. , Young, A.I. , Thorgeirsson, T.E. , Benonisdottir, S. , Oddsson, A. , Halldorsson, B.V. , Masson, G. , et al. (2018) The nature of nurture: Effects of parental genotypes. Science, 359(6374), 424–428. [DOI] [PubMed] [Google Scholar]
  11. Kvaale, E.P. , Gottdiener, W.H. & Haslam, N. (2013) Biogenetic explanations and stigma: A meta‐analytic review of associations among laypeople. Social science & medicine, 96, 95–103. [DOI] [PubMed] [Google Scholar]
  12. Martin, A.R. , Kanai, M. , Kamatani, Y. , Okada, Y. , Neale, B.M. & Daly, M.J. (2019) Clinical use of current polygenic risk scores may exacerbate health disparities. Nature genetics, 51(4), 584–591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Micheletti, S.J. , Bryc, K. , Esselmann, S. G.A. , Freyman, W.A. , Moreno, M.E. , Poznik, G.D. , Shastri, A.J. , Agee, M. , Aslibekyan, S. , Auton, A. , et al. (2020) Genetic consequences of the transatlantic slave trade in the Americas. The American Journal of Human Genetics, 107(2), 265–277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Mills, M.C. & Rahal, C. (2020) The GWAS diversity monitor tracks diversity by disease in real time. Nature genetics, 52(3), 242–243. [DOI] [PubMed] [Google Scholar]
  15. Mostafavi, H. , Harpak, A. , Agarwal, I. , Conley, D. , Pritchard, J.K. & Przeworski, M. (2020) Variable prediction accuracy of polygenic scores within an ancestry group. eLife, 9, e48376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Nivard, M. , Belsky, D. , Harden, K.P. , Baier, T. , Ystrom, E. & Lyngstad, T.H. (2022) Neither nature nor nurture: Using extended pedigree data to elucidate the origins of indirect genetic effects on offspring educational outcomes.
  17. Okbay, A. , Wu, Y. , Wang, N. , Jayashankar, H. , Bennett, M. , Nehzati, S.M. , Sidorenko, J. , Kweon, H. , Goldman, G. , Gjorgjieva, T. , et al. (2022) Polygenic prediction of educational attainment within and between families from genome‐wide association analyses in 3 million individuals. Nature genetics, 54(4), 437–449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Privé, F. , Aschard, H. , Carmi, S. , Folkersen, L. , Hoggart, C. , O'Reilly, P.F. & Vilhjálmsson, B.J. (2022) Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort. The American Journal of Human Genetics, 109(1), 12–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Selzam, S. , Ritchie, S.J. , Pingault, J.‐B. , Reynolds, C.A. , O'Reilly, P.F. & Plomin, R. (2019) Comparing within‐and between‐family polygenic score prediction. The American Journal of Human Genetics, 105(2), 351–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Trejo, S. & Domingue, B.W. (2018) Genetic nature or genetic nurture? introducing social genetic parameters to quantify bias in polygenic score analyses. Biodemography and Social Biology, 64(3‐4), 187–215. [DOI] [PubMed] [Google Scholar]
  21. Wang, B. , Baldwin, J.R. , Schoeler, T. , Cheesman, R. , Barkhuizen, W. , Dudbridge, F. , Bann, D. , Morris, T.T. & Pingault, J.‐B. (2021) Robust genetic nurture effects on education: A systematic review and meta‐analysis based on 38,654 families across 8 cohorts. American Journal of Human Genetics, 108(9), 1780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Young, A.I. , Benonisdottir, S. , Przeworski, M. & Kong, A. (2019) Deconstructing the sources of genotype‐phenotype associations in humans. Science, 365(6460), 1396–1400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Young, A.I. , Frigge, M.L. , Gudbjartsson, D.F. , Thorleifsson, G. , Bjornsdottir, G. , Sulem, P. , Masson, G. , Thorsteinsdottir, U. , Stefansson, K. & Kong, A. (2018) Relatedness disequilibrium regression estimates heritability without environmental bias. Nat Genet, 50(9), 1304–1310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Young, A.I. , Nehzati, S.M. , Benonisdottir, S. , Okbay, A. , Jayashankar, H. , Chanwook, L. , Cesarini, D. , Benjamin, D.J. , Turley, P. & Kong, A. (2022) Mendelian imputation of parental genotypes improves estimates of direct genetic effects. Nature Genetics . [DOI] [PMC free article] [PubMed]

Articles from Evolution; International Journal of Organic Evolution are provided here courtesy of Wiley

RESOURCES