Abstract
Drawing on psychological and sociological theories of crime causation, we tested the hypothesis that genetic risk for low educational attainment (assessed via a genome-wide polygenic score) is associated with offending. We further tested hypotheses of how polygenic risk relates to the development of antisocial behavior from childhood through adulthood. Across the Dunedin and E-Risk birth cohorts of individuals growing up 20 years and 20,000 kilometres apart, education polygenic scores predicted risk of a criminal record, with modest effects. Polygenic risk manifested during primary schooling, in lower cognitive abilities, lower self-control, academic difficulties, and truancy, and predicted a life-course persistent pattern of antisocial behavior that onsets in childhood and persists into adulthood. Crime is central in the nature/nurture debate, and findings reported here demonstrate how molecular-genetic discoveries can be incorporated into established theories of antisocial behavior. They also suggest the hypothesis that improving school experiences might prevent genetic influences on crime from unfolding.
Keywords: crime, genetics, antisocial behavior, longitudinal
INTRODUCTION
Advances in the genomic sciences are now making it possible to investigate genetic influences on behavior at the molecular-genetic level, using genome-wide association studies (GWAS; Visscher et al., 2017). Although effect sizes for individual genetic variants revealed in GWAS are tiny, it is possible to aggregate the effects of millions of variants across the genome to construct polygenic scores, which index a person’s position on a continuum of genetic propensity toward specific phenotypes (Dudbridge, 2013). Associations between individuals’ polygenic scores and behaviors are non-deterministic, with a polygenic score on the higher or lower end of the continuum slightly increasing or decreasing the odds of an outcome. One of the largest and most successful GWAS for a social-science outcome has been conducted for educational attainment (Okbay et al., 2016). Because educational attainment is a central phenotype in the nomological net of constructs in the psychological and social sciences, genetic discoveries for education may have implications for research and theory about outcomes that are known to be linked to education. Here we tested the hypothesis that polygenic influences on educational attainment predict criminal offending.
We derived the hypothesis that polygenic influences on educational attainment predict criminal offending by integrating genetic discoveries about educational attainment with established theories about crime. First, individuals with lower polygenic scores for educational attainment tend to complete less schooling, which is a correlate of criminal offending (Stattin & Magnusson, 1995). Truncated education may leave people with fewer legitimate methods to achieve wealth or status, increasing incentives to pursue crime (Agnew, 1992; Merton, 1938). Second, polygenic scores for educational attainment partly reflect early-emerging traits that affect success in school, such as cognitive ability and self-control (Belsky et al., 2016); these traits also increase risk for crime (Gottfredson & Hirschi, 1990; Moffitt, 1993a). Third, low polygenic scores for education may predict poor school performance and academic frustration, which reduce a protective factor that helps deter young people from crime, namely, commitment to school and its social norms (Catalano, Oesterle, Fleming, & Hawkins, 2004).
Based on these considerations, we tested the hypothesis that molecular-genetic predictors of educational attainment would forecast individuals’ criminal offending. We tested this hypothesis in two birth cohorts comprising nearly 3,000 participants. In both cohorts we linked genetic data to official police records. We additionally investigated whether the effects of polygenic scores on criminal offending would survive after accounting for two significant markers of a criminogenic family environment: growing up in socioeconomic deprivation and having antisocial parents.
Our developmental study examines how the association between genetic influences on educational attainment and criminal offending emerges over time and in concert with (and independently of) educational attainment itself. First, we examined early-emerging psychological and behavioral risk factors that may connect genetic differences between individuals to their risk of criminal offending. Second, we tested the hypothesis that genetic influences would be particularly strong among individuals who show a pattern of antisocial behavior that begins in childhood and thereafter follows a persistent pattern into adulthood, often referred to as “life-course persistent” antisocial behavior (Moffitt, 1993b).
MATERIALS AND METHODS
Samples
E-Risk cohort
Participants in the first cohort were members of the Environmental Risk (E-Risk) Longitudinal Twin Study, which tracks the development of a birth cohort of 2,232 British participants (Figure 1). The sample was drawn from a larger birth register of twins born in England and Wales in 1994-1995 (Trouton, Spinath, & Plomin, 2002). Full details about the sample are reported elsewhere (Moffitt & E-Risk Study Team, 2002). Briefly, the E-Risk sample was constructed in 1999-2000, when 1,116 families (93% of those eligible) with same-sex 5-year-old twins participated in home-visit assessments. This sample comprised 56% monozygotic (MZ) and 44% dizygotic (DZ) twin pairs; sex was evenly distributed within zygosity (49% male). Families were recruited to represent the UK population of families with newborns in the 1990s, on the basis of residential location throughout England and Wales and mother’s age. Teenaged mothers with twins were over-selected to replace high-risk families who were selectively lost to the register through non-response. Older mothers having twins via assisted reproduction were under-selected to avoid an excess of well-educated older mothers. The study sample represents the full range of socioeconomic conditions in the UK, as reflected in the families’ distribution on a neighborhood-level socioeconomic index (ACORN [A Classification of Residential Neighbourhoods], developed by CACI Inc. for commercial use) (Odgers, Caspi, Bates, Sampson, & Moffitt, 2012): 25.6% of E-Risk families live in “wealthy achiever” neighborhoods compared to 25.3% nationwide; 5.3% vs. 11.6% live in “urban prosperity” neighborhoods; 29.6% vs. 26.9% live in “comfortably off” neighborhoods; 13.4% vs. 13.9% live in “moderate means” neighborhoods, and 26.1% vs. 20.7% live in “hard-pressed” neighborhoods. E-Risk underrepresents “urban prosperity” neighborhoods because such households are likely to be childless.
Home-visits assessments took place when participants were aged 5, 7, 10, 12 and, most recently, 18 years, when 93% of the participants took part. At ages 5, 7, 10, and 12 years, assessments were carried out with participants as well as their mothers (or primary caretakers); the home visit at age 18 included interviews only with participants. Each twin was assessed by a different interviewer. These data are supplemented by searches of official records and by questionnaires that are mailed, as developmentally appropriate, to teachers, and co-informants nominated by participants themselves. The Joint South London and Maudsley and the Institute of Psychiatry Research Ethics Committee approved each phase of the study. Parents gave informed consent and twins gave assent between 5-12 years and then informed consent at age 18.
Dunedin cohort
Participants in the second cohort were members of the Dunedin Multidisciplinary Health and Development Study, a longitudinal investigation of health and behavior in a birth cohort (Figure 1). Dunedin participants (N=1,037; 91% of eligible births; 52% male) were all individuals born between April 1972 and March 1973 in Dunedin, New Zealand, who were eligible on the basis of residence in the province and who participated in the first assessment at age 3. Full details about the sample are reported elsewhere (Poulton, Moffitt, & Silva, 2015). The cohort represented the full range of socioeconomic status (SES) in the general population of New Zealand’s South Island. On adult health, the cohort matches the New Zealand National Health and Nutrition Survey on key health indicators (e.g. body mass index, smoking, visits to the doctor).
Assessments were carried out at birth and ages 3, 5, 7, 9, 11, 13, 15, 18, 21, 26, 32, and, most recently, 38 years, when 95% of the 1,007 participants still alive took part. At each assessment wave, participants are brought to the Dunedin research unit for a full day of interviews and examinations. These data are supplemented by searches of official records and by questionnaires that are mailed, as developmentally appropriate, to parents, teachers, and peers nominated by the participants themselves. The Otago Ethics Committee approved each phase of the study and informed consent was obtained from all participants.
Genotyping and imputation
We used Illumina HumanOmni Express 12 BeadChip arrays (Version 1.1; Illumina, Hayward, CA) to assay common single-nucleotide polymorphism (SNP) variation in the genomes of cohort members. We imputed additional SNPs using the IMPUTE2 software (Version 2.3.1; https://mathgen.stats.ox.ac.uk/impute/impute_v2.html; (Howie, Donnelly, & Marchini, 2009)) and the 1000 Genomes Phase 3 reference panel (Abecasis et al., 2012). Imputation was conducted on autosomal SNPs appearing in dbSNP (Version 140; http://www.ncbi.nlm.nih.gov/SNP/; (Sherry et al., 2001)) that were “called” in more than 98% of the samples. Invariant SNPs were excluded. The E-Risk cohort contains monozygotic twins, who are genetically identical; we therefore empirically measured genotypes of one randomly-selected twin per pair and assigned these data to their monozygotic co-twin. Prephasing and imputation were conducted using a 50-million-base-pair sliding window. The resulting genotype databases included genotyped SNPs and SNPs imputed with 90% probability of a specific genotype among the European-descent members of the E-Risk cohort (N=1,999 participants in 1,011 families) and the non-Maori members of the Dunedin cohort (N=918). We analyzed SNPs in Hardy-Weinberg equilibrium (p > .01).
Polygenic scoring
Polygenic scoring was conducted following the method described by Dudbridge (Dudbridge, 2013) using PRSice (Euesden, Lewis, & O’Reilly, 2015). Briefly, SNPs reported in the most recent GWAS results released by the Social Science Genetic Association Consortium (Okbay et al., 2016) were matched with SNPs in the E-Risk and Dunedin databases. For each SNP, the count of education-associated alleles was weighted according to the effect estimated in the GWAS. Weighted counts were summed across SNPs to compute polygenic scores. We used all matched SNPs to compute polygenic scores irrespective of nominal significance for their association with educational attainment and linkage disequilibrium between SNPs (see Supplementary Materials; Table S1 for results from analyses of polygenic scores computed using a clumping approach that takes linkage disequilibrium into account. The pattern of findings was similar to using non-clumped scores.).
To control for possible population stratification, we conducted a principal components analysis of our genome-wide SNP database using PLINK v1.9 (Chang et al., 2015). Analyses were conducted separately in the E-Risk and Dunedin databases. In the E-Risk database, one twin was selected at random from each family for principal components analysis. SNP-loadings for principal components were applied to co-twin genetic data to compute principal component values for the full sample. The 10 principal components explained 2.8% of variance in the education polygenic score in the E-Risk cohort and 1.2% in the Dunedin cohort. Within each database, we residualized polygenic scores for the first ten principal components estimated from the genome-wide SNP data. The residualized score was normally distributed. We standardized residuals to Mean=0, Standard deviation=1 for analysis. In replication of the GWAS from which our education polygenic score was derived (Okbay et al., 2016), the score predicted educational attainment, measured as the highest degree completed at the time of the age-18 assessment in E-Risk (β=.21 [95%CI .16, .26], R2= .045) and the age-38 assessment in Dunedin (β=.17 [95%CI .10, .23], R2= .028). For analyses where we reported the effects of having a ‘lower’ polygenic score, we reverse-coded the score, so that higher numbers indicate a lower polygenic score for education, i.e. a greater genetic risk for low educational attainment.
Criminal offending and trajectories of antisocial behavior
In the E-Risk cohort, official records of participants criminal offending were obtained through UK Police National Computer (PNC) record searches conducted in cooperation with the UK Ministry of Justice. Records include complete histories of cautions and convictions for participants cautioned or convicted in the UK beginning at age 10 years, the age of criminal responsibility. Our data are complete through age 19 years. Criminal offending was recoded into a binary variable to reflect whether participants had been cautioned or convicted or not.
In the Dunedin cohort, information on officially recorded criminal offending was obtained by searching the central computer system of the New Zealand Police, which provides details of all New Zealand convictions and sentences and Australian convictions communicated to the New Zealand Police. Searches were completed following each assessment, at ages 18, 21, 26, 32, and 38 (last search completed in 2013). Official records of criminal conviction were available from 14 years of age onwards, the age from which criminal conviction for all types of offenses was permissible. Criminal offending was recoded into a binary variable to reflect whether participants had been convicted or not.
Using data from the Dunedin cohort, we supplemented the analyses by studying developmental trajectories of parent, teacher and self-reported antisocial behavior from childhood to adulthood. These trajectories in the Dunedin cohort have been developed and described in previous articles about antisocial behavior in the Dunedin cohort (Odgers et al., 2007, 2008). Briefly, antisocial conduct problems were assessed at ages 7, 9, 11, 13, 15, 18, 21, and 26 years through scoring six key symptoms of DSM-IV conduct disorder as being present or absent at each age according to reports from parents, teachers, and study members: physical fighting, bullying others, destroying property, telling lies, truancy, and stealing (APA, 1994). Symptoms were adapted across the age span to ensure that the measures were developmentally appropriate (e.g. work absenteeism was substituted for truancy at older ages). Growth mixture modeling was used to identify subgroups of participants that followed unique trajectories in their antisocial behavior over time. A four-class model represented the best empirical fit to the data according to several indices of model fit and classification accuracy. Across gender, the majority of participants (50%) displayed low levels of antisocial behavior across time (the “always-low” group); 22% exhibited antisocial behavior only in childhood (“childhood-limited”); 19% were characterized by adolescent-limited antisocial behavior (“adolescent-limited”); and 9% displayed persistently high levels of antisocial behavior across the years (“life-course persistent”). A recent follow-up study in the Dunedin cohort validated this classification, showing that life-course persistent cohort members averaged more convictions between ages 26 and 38 years compared to adolescent-limited cohort members, of whom very few received any convictions after age 26 (Rivenbark et al., forthcoming).
Potential explanatory variables
We evaluated three sets of explanatory variables in both cohorts. Psychometric details about these measures are provided in previous publications.
Criminogenic family environment
In the E-Risk cohort, socioeconomic status was defined using a standardized composite of parents’ income, education and social class (Trzesniewski, Moffitt, Caspi, Taylor, & Maughan, 2006). We reverse-coded the variable to reflect socioeconomic deprivation. Parental antisocial behavior was assessed as father’s and mother’s antisocial personality (Caspi et al., 2001), reported by mothers using the Young Adult Behavior Checklist (Achenbach, 1997), modified to obtain lifetime data and supplemented with questions from the Diagnostic Interview Schedule (DIS; (18)). Reports about parents’ antisocial behavior were averaged and standardized to M=0, SD=1.
In the Dunedin cohort, socioeconomic status was measured using a 6-point scale that assessed parents’ occupational statuses, defined based on average income and educational levels derived from the New Zealand Census. Parents’ occupational statuses were assessed when participants were born and again at subsequent assessments up to age 15 years. The highest occupational status of either parent was averaged across the childhood assessments (Poulton et al., 2002) and the variable standardized to M=0, SD=1 and reverse-coded to reflect socioeconomic deprivation. Parental antisocial behavior was assessed as father’s and mother’s history of antisocial behavior, using items from the DIS (Robins et al., 1995), reported by mothers and fathers (Milne et al., 2008). Reports about parents’ antisocial behavior were averaged and standardized to M=0, SD=1.
Poor educational qualifications
In the E-Risk cohort, poor educational qualifications were assessed by whether participants did not obtain or scored a low average grade (grade D-G) on their General Certificate of Secondary Education (GCSE) (21.9% of participants). GSCEs are a standardized examination taken at the end of compulsory education at age 16 years.
In the Dunedin cohort, poor educational qualifications were assessed by whether participants did not sit their English or Maths school certificate exam, a national examination held at about 15 years of age that was, at the time the Dunedin cohort was growing up, the most basic educational qualification in New Zealand (17.3% of participants).
Early-emerging psychological and behavioral risk factors
In the E-Risk cohort, participants’ cognitive ability was individually assessed at age 5 using a short form of the Wechsler Preschool and Primary Scale of Intelligence-Revised (WPPSI-R; (21)) comprising Vocabulary and Block Design subtests. IQ scores were prorated (i.e., the full-scale IQ score was estimated from two subscales) following procedures described by Sattler (1992, pp. 998–1004 (Sattler, 1992)). IQ scores were standardized to M=0, SD=1. We reverse-coded the variable to reflect low cognitive ability. Participants’ low self-control during their first decade of life was measured using a multi-occasion/multi-informant strategy, following Moffitt et al. (Moffitt et al., 2011). Briefly, a self-control factor was estimated via multiple measures, including observational ratings of participants’ lack of control (age 5 years), parent and teacher reports of poor impulse control (ages 5, 7 and 10 years), self-reports of inattentive and impulsive behavior (age 7 years), and interviewer judgments of the personality trait of conscientiousness (age 10 years). Based on principal components analysis, the standardized measures were averaged into a single composite score (M=0, SD=1). Participants’ academic performance was assessed using teacher reports when participants were 7 and 10 years old. Teachers were asked to rate participant’s performance in English and Maths on a 5-point scale ranging from ‘far below average’ (1) to ‘far above average’ (5). Scores were averaged across academic subjects and age and standardized to M=0, SD=1. We reverse-coded the variable so higher scores reflected more academic difficulties. Truancy was assessed using mother and teacher reports at ages 7 and 10. Truancy was considered present if either mothers or teachers reported truancy at either age (3.2%).
In the Dunedin cohort, participants’ cognitive ability was individually assessed at ages 7, 9, and 11 years using the Wechsler Intelligence Scale for Children – Revised (WISC-R; (Wechsler, 1974). Scores were averaged across age and standardized to M=0, SD=1. We reverse-coded the variable to reflect low cognitive ability. Participants’ low self-control was measured using multiple measures of self-control: observational ratings of participants’ lack of control (ages 3 and 5) and parent, teacher, and self-reports of impulsive aggression, overactivity, lack of persistence, inattention, and impulsivity (ages 5, 7, 9, and 11) (Moffitt et al., 2011). Based on principal components analysis, the standardized measures were averaged into a single composite score (M=0, SD=1). Participants’ academic performance was assessed at ages 7 and 9, when mothers were asked to rate their children’s performance in reading, printing, arithmetic and spelling on a 3-point scale ranging from ‘slow’ (0) to ‘above average’ (2). Mothers’ responses were averaged across age and standardized to M=0, SD=1. We reverse-coded the variable to reflect academic difficulties. Truancy was assessed using mother and teacher reports at ages 7, 9 and 11. Truancy was considered present if mothers or teachers reported truancy at any age (7.7%).
Statistical analyses
We used liability threshold models to estimate genetic, shared environmental and non-shared environmental influences on criminal offending in the E-Risk cohort. We used Poisson regression models with robust standard errors to estimate relative risks for the binary dependent outcome of having a criminal record and to investigate whether criminogenic family environment, poor educational qualifications and early-emerging risk factors could explain the effects. Formal mediation analyses for binary outcomes, as implemented in Stata, was used to test whether poor educational qualifications and early-emerging risk factors accounted for a significant portion of the genetic association with offending. In the mediation analyses, 95% confidence intervals were obtained from 500 bootstrap replications; in E-Risk, this was done accounting for the clustering of the twin data. We used survival analysis to test whether participants with lower versus higher polygenic scores tended to become convicted earlier in life and multinomial logistic regression to estimate relative risks for membership in different developmental trajectories of antisocial behavior. Both twins were included in the analyses in the E-Risk cohort; we accounted for non-independence of observations of twins within families by clustering standard errors at the family level. We also repeated the main analyses including only one randomly selected twin of each pair in the E-Risk cohort (Supplementary Materials; Table S2). R2 and pseudo-R2-based measures of effect size for all outcomes are reported in the Supplementary Materials (Table S3).
RESULTS
Are there genetic influences on official records of offending?
Official records of participants’ cautions and convictions were obtained through national police record searches; through age 19 years in the E-Risk cohort and age 38 years in the Dunedin cohort. Using the E-Risk twin design, we first sought to replicate findings from previous twin and adoption studies of quantitative genetic influences on criminal offending, to establish that there was a basis to proceed with our analyses of testing for an association between a molecular-based polygenic score and offending. Monozygotic twins in E-Risk were more similar in their criminal offending (tetrachoric correlation rmz=.81; 95%CI .71, .90) than dizygotic twins (rdz=.61; 95%CI .44, .77), indicating genetic influences. A univariate liability threshold model was used to estimate genetic, shared environmental and non-shared environmental influences on criminal offending. Genetic influences accounted for an estimated 41% (95%CI 05, 81) of the variance in offending in the E-Risk cohort. 40% of the variance was accounted for by shared environmental influences (95%CI 02, 70) and 19% by non-shared environmental influences (95%CI 11, 30). These estimates are consistent with meta-analytic findings from twin studies of antisocial behavior (Rhee & Waldman, 2002).
Do participants’ polygenic scores predict their official records of offending?
We next tested the hypothesis that molecular-genetic predictors of educational attainment would forecast participants’ criminal offending. In both cohorts, participants with lower polygenic scores for educational attainment were at greater risk to grow up to have a criminal record. The increase in risk was modest: a standard deviation decrease in the polygenic score was associated with a 20-30% greater risk of having been cautioned or convicted (incidence-rate ratio IRR=1.33 [95%CI 1.13, 1.55], p <.01 in E-Risk and IRR=1.21 [95%CI 1.09, 1.34], p<.01 in Dunedin; Table 1, Panel A). Effect sizes were similar across the two cohorts (Figure 2) and across sex (IRR=1.25 [95%CI 1.13, 1.39], p<.01 for males and IRR=1.30 [95%CI 1.08, 1.57], p<.01 for females, with participants pooled across cohorts). Results also showed that although having a low polygenic score for educational attainment increased the risk of officially recorded offending, even among individuals with very low scores the majority had no criminal record (Supplementary Materials, Table S4).
Table 1.
E-Risk cohort (N=1,857)a | Dunedin cohort (N=898)a | |||
---|---|---|---|---|
| ||||
Bivariate | Multivariate | Bivariate | Multivariate | |
| ||||
IRR (95%CI) | IRR (95%CI) | IRR (95%CI) | IRR (95%CI) | |
Panel A. Effect of participants’ polygenic scores on their official records of offending | ||||
| ||||
Low polygenic score for educational attainmentb | 1.33 (1.13, 1.55) | – | 1.21 (1.09, 1.34) | – |
| ||||
Panel B. Polygenic effect on offending, accounting for a criminogenic family environment | ||||
| ||||
Low polygenic score for educational attainmentb | 1.34 (1.14, 1.57) | 1.20 (1.01, 1.42) | 1.23 (1.11, 1.36) | 1.17 (1.05, 1.30) |
Socioeconomic deprivation | 2.26 (1.88, 2.71) | 1.96 (1.58, 2.44) | 1.40 (1.25, 1.57) | 1.35 (1.19, 1.52) |
Parental antisocial behavior | 1.49 (1.34, 1.66) | 1.18 (1.03, 1.35) | 1.16 (1.05, 1.27) | 1.07 (.96, 1.18) |
| ||||
Panel C. Polygenic effect on offending, accounting for leaving school with poor qualifications | ||||
| ||||
Low polygenic score for educational attainmentb | 1.33 (1.13, 1.56) | 1.19 (1.02, 1.39) | 1.21 (1.09, 1.34) | 1.14 (1.03, 1.26) |
Leaving school with poor qualifications | 4.27 (3.27, 5.59) | 4.02 (3.05, 5.28) | 2.83 (2.31, 3.46) | 2.71 (2.21, 3.32) |
| ||||
Panel D. Polygenic effect on offending, accounting for early-emerging psychological and behavioral risk factors | ||||
| ||||
Low polygenic score for educational attainmentb | 1.30 (1.11, 1.53) | 1.22 (1.05, 1.42) | 1.21 (1.09, 1.35) | 1.14 (1.02, 1.27) |
Low cognitive ability | 1.58 (1.35, 1.85) | 1.11 (.93, 1.32) | 1.27 (1.14, 1.43) | .99 (.86, 1.13) |
Low self-control | 2.06 (1.83, 2.32) | 1.80 (1.58, 2.04) | 1.54 (1.43, 1.67) | 1.48 (1.34, 1.64) |
Academic difficulties in primary school | 1.68 (1.47, 1.93) | 1.17 (1.00, 1.37) | 1.30 (1.16, 1.45) | 1.02 (.89, 1.17) |
Truancy | 3.92 (2.65, 5.80) | 1.98 (1.38, 2.83) | 2.11 (1.61, 2.76) | 1.25 (.91, 1.72) |
Crime records data were obtained for 93% (1,857/1,999) and 98% (898/918) of the E-Risk and Dunedin study samples, respectively.
The polygenic score was reverse-coded in these analyses, so that a higher score indicates a greater genetic risk for low educational attainment.
IRR = Incidence-rate ratio
CI = Confidence interval.
Is the influence of participants’ polygenic scores on future offending accounted for by criminogenic family environment?
We next examined whether genetic influences increased risk for officially recorded offending independently of a criminogenic family environment, as indicated by growing up in socioeconomic deprivation and having parents who display antisocial behavior. We conducted this test for two reasons. First, we aimed to rule out the possibility that genetic associations with criminal offending solely reflected gene-environment correlations, whereby parents pass on genetic variants for low educational attainment to their children and also create an environment that increases their children’s risk of offending. Second, we aimed to test whether polygenic scores predicted offending over and above two well-established, global predictors of criminal offending that contain both genetic and environmental influences. In both cohorts, Poisson regression models indicated that, as expected, participants who grew up in socioeconomically deprived families and who had parents who displayed antisocial behavior were at greater risk to have a criminal record (Table 1, Panel B, bivariate models). We also observed a correlation between participants’ polygenic scores and these two features of the environments they grew up in. Participants with lower polygenic scores for education were more likely to have grown up in socioeconomically deprived households (r=.23 [95%CI .17, .29] p<.01 in E-Risk and r=.16 [95%CI .09, .22] p<.01 in Dunedin) and with parents who displayed antisocial behavior (r=.06 [95%CI .01, .11], p<.05 in E-Risk and r=.13 [95%CI .06, .19] p<.01, in Dunedin). After accounting for these two indicators of criminogenic family environment by including them in the model as covariates, participants’ polygenic scores continued to forecast their criminal offending (Table 1, Panel B, multivariate models).
Is the effect of polygenic scores accounted for by leaving school with poor qualifications?
We investigated characteristics that may connect genetic differences between individuals with official records of criminal offending. Because the polygenic score we used comes from a GWAS of educational attainment, we first tested whether it predicted offending because it was associated with poor educational qualifications. Our findings provided some support for poor educational qualifications as an explanation for the link between polygenic scores for educational attainment and offending. Participants with lower polygenic scores were more likely to leave school with poor educational qualifications, in both the E-Risk and Dunedin cohorts (polychoric r=.21 [95%CI .13, .28] p<.01 and polychoric r=.19 [95%CI .09, .29] p<.01, respectively). Poor educational qualifications, in turn, were associated with criminal offending (Table 1, Panel C, bivariate models). Once poor qualifications were included as a covariate in the model, the effect of polygenic scores on offending was reduced (Table 1, Panel C, multivariate models). Formal mediation analyses indicated that poor qualifications accounted for a significant portion of the genetic association with offending (Supplementary Materials, Table S5). However, two results indicate that there is more to the observed genetic effect than poor educational qualifications alone. First, the polygenic score continued to predict offending in both cohorts after qualifications were included in the model (Table 1, Panel C, multivariate models). Second, in the E-Risk cohort, cautions and convictions are recorded from age 10 onwards; examining their cumulative distribution across ages revealed that the majority of participants with criminal records received their first caution or conviction before the end of compulsory schooling (Figure 3). This finding indicates that polygenic influences predicting offending already manifested earlier in life, while participants were still in school. We therefore turned our attention to the first decade of life.
Are influences of participants’ polygenic scores on future offending accounted for by early-emerging psychological and behavioral risk factors?
Our findings indicated that part of the reason why participants with lower polygenic scores are at greater risk to become involved in crime is that they display a constellation of psychological and behavioral risk factors for school failure as well as offending from a young age. As children, participants with lower polygenic scores for educational attainment exhibited lower cognitive ability (E-Risk: r=.14 [95%CI .09, .19] p<.01; Dunedin: r=.22 [95%CI .15, .29] p<.01), poorer self-control (E-Risk: r=.06 [95%CI .00, .11] p=.050; Dunedin: r=.13 [95%CI .06, .20] p<.01), more academic difficulties in primary school (E-Risk: r=.14 [95%CI .08, .20] p<.01; Dunedin: r=.19 [95%CI .12, .25] p<.01) and, in the Dunedin cohort, more truancy (E-Risk: r=.08 [95%CI -.05, .20] p=.19; Dunedin: r=.15 [95%CI .03, .28] p<.01). Each of these risk factors in turn predicted offending (Table 1, Panel D, bivariate models). Including all early-emerging risk factors in the same model reduced the prediction of participants’ polygenic scores on criminal records in both cohorts (Table 1, Panel D, multivariate models). Formal mediation analyses indicated that early-emerging risk factors accounted for a significant portion of the genetic association with offending (Supplementary Materials, Table S5).
Do polygenic scores predict the timing and persistence of antisocial behavior across the life-course?
In the Dunedin cohort, we first analyzed if lower polygenic scores for educational attainment predicted an earlier onset of offending. Survival analyses indicated that those with lower education polygenic scores tended to get convicted earlier in life (hazard ratio=1.25 [95%CI 1.10, 1.42], p<.01; Figure 4a). Second, we tested if polygenic risk was particular to the life-course persistent pattern of antisocial behavior, as opposed to shorter-term involvement. We analyzed life-course trajectories of antisocial behavior as reported by parents, teachers and participants themselves from childhood to adulthood, by comparing the polygenic scores of participants classified in our previous research into four developmental subtype groups according to their longitudinal pattern of participation in antisocial behavior: always-low; childhood-limited; adolescent-limited; and life-course persistent (Figure 4b). Results from multinomial regression models supported our hypothesis that participants with lower polygenic scores would be significantly more likely to belong to the life-course persistent subtype as compared to the always-low-antisocial subtype (relative-risk ratio RRR=1.36 [95%CI 1.07, 1.73] p<.05). As also hypothesized, participants with lower polygenic scores were not at significantly greater risk of belonging to the childhood-limited or adolescent-limited subtype, relative to the always-low-antisocial subtype (RRR=1.13 [95%CI .95, 1.33] p=.16, and RRR=1.16 [95%CI .97, 1.39] p=.10, respectively).
DISCUSSION
In two birth cohorts of individuals growing up 20 years and 20,000 kilometres apart, we tested the hypothesis that molecular-genetic predictors of educational attainment, summarized in a polygenic score, would predict criminal offending. We chose to examine a polygenic score derived from GWAS of education because low educational attainment and criminal offending are linked through established criminological theories. Our findings revealed that participants with lower polygenic scores for educational attainment were more likely to grow up to have a criminal record. This effect was small in size, but robust across replication. Although we observed a gene-environment correlation, whereby individuals with lower polygenic scores were more likely to grow up in criminogenic environments, genetic associations remained after accounting for familial predictors of offending including socioeconomic deprivation and parental antisocial behavior. Early-emerging psychological and behavioral risk factors for school failure and crime, including low cognitive ability, poor self-control, academic difficulties and truancy, connected differences in DNA with participants’ later criminal offending. The effect of participants’ polygenic scores extended to the onset of their criminal offending and trajectories of antisocial behavior across the life course, as indexed by developmental trajectories of life-course persistent antisocial behavior. Taken together, the findings show how polygenic discoveries for educational attainment can be used to study pathways leading from genes to offending. They also suggest the hypothesis that early-emerging risk factors that influence whether children have a good or bad experience of school may serve as intervention targets to prevent some of the genetic influences on offending from unfolding.
Three aspects of the present study further bolster the substance of our finding of a link between polygenic scores for educational attainment and offending. First, our analyses across two population-representative cohorts revealed that the findings were robust, answering calls for reproducibility in psychological and genomic science. Second, retention rates in both cohorts are high (93% and 95% in the E-Risk and Dunedin cohorts, respectively), reducing the risk of biased estimates when examining behaviors such as offending, that are susceptible to attrition. Third, in both cohorts, the research linked participants’ genetic information with electronic crime records. Although crime records under-detect offending, they have the advantages of being less susceptible than self-reports to reporting biases, recall failure and concealment. Integrating genetic information with administrative records is increasingly being used to advance medical research (McCarty et al., 2011), but is not yet widely adopted in the social sciences.
It may seem surprising that genetic variants identified in a GWAS for educational attainment predict criminal offending. However, this hypothesis was derived by incorporating theoretical accounts of crime causation with recent genomic discoveries about educational attainment. More generally, the findings illustrate how established social-science theories can guide the characterisation of genomic discoveries for human behavior (Belsky, Moffitt, & Caspi, 2013).
Although we identified characteristics that mediated some of the influence of polygenic scores on offending, we could not explain all of the effect. Our inability to do so can fuel follow-up work. Polygenic scores may predict offending via early-emerging deficits in neurocognitive functioning, such as the ability to learn from rewards and punishments. Another possibility is that individuals with lower polygenic scores have deficits in systems that influence socio-emotional processing, putting them at greater risk of experiencing difficulties in school as well as engaging in delinquent behavior.
The findings should be interpreted in light of limitations. First, polygenic influences shared with education may reflect only a small proportion of all genetic influences on crime and may exert their effects via different pathways. A recent GWAS on antisocial behavior (Tielbeek et al., in press) reported a genetic correlation with educational attainment of r = –0.52, indicating both genetic overlap with educational attainment and unique genetic effects on crime. Whether or not the developmental processes and mechanisms driving shared and unique genetic effects are the same remains to be tested. Second, the findings cannot be generalized to individuals of non-European ancestry because allele frequencies, linkage disequilibrium patterns, and environmental moderators of the association may vary across populations (Rosenberg et al., 2010). Third, we analyzed official crime records, which reflect only a portion of offenders and offenses. Reassuringly, we obtained similar findings when analysing trajectories of persistent conduct problems derived from parent, teacher and self-reports. Fourth, the polygenic score for education accounted for only a small portion of variation in criminal offending, both in relative and absolute terms. The small effect size relative to other predictors (e.g., parental antisocial behavior, cognitive ability) is to be expected, because these comprise both environmental and genetic influences. Against this background, it is remarkable that the polygenic score accounted for additional variance beyond traditional risk factors. As GWASs become larger, the proportion of variance accounted for by molecular-genetic variables will likely increase. Finally, there are additional hypotheses that can be tested using the polygenic score, which we have not examined. For example, we did not have enough statistical power to examine interactions between the polygenic score and socioeconomic status and we restricted our analyses to two broad indicators of antisocial behavior – criminal offending and life-course persistent antisocial behavior – rather than zooming in on finer distinctions. We also did not test hypotheses about the interplay between children’s genes and aspects of their environment, such as parenting. Our priority was to conduct an initial examination of genetic links with “workhorse” phenotypes in criminology, and to test robustness across cohorts. This research can now be taken forward to test further hypotheses in other, larger samples.
The findings have implications for public and scientific debates about genetic research on (anti)-social behavior. First, a key result from this and previous studies is that discoveries in GWAS of educational attainment are related not only to education but to life-course success and adversity more generally (Belsky et al., 2016). These findings are in line with the notion of educational attainment as a proxy-phenotype for related phenotypes (Rietveld et al., 2014). They also underscore the pervasiveness of pleiotropy, i.e. the phenomenon that genomic discoveries for one particular phenotype also predict related outcomes. Together with polygenicity–i.e., the observation that one outcome is influenced by many genes–findings of pleiotropy are moving sociogenomic research further away from a deterministic paradigm of “one-gene-one-outcome” and toward an understanding that many genes affect many outcomes, through their influences on early-emerging characteristics that shape life-course development.
Second, our findings hark back to the nature/nurture debate and the question of whether criminals are ‘born’ or ‘made’ (Wilson & Herrnstein, 1985). Using a polygenic scoring approach that overcomes lingering reservations about the validity of twin and adoption studies (Burt & Simons, 2014), our findings demonstrate that some children are born with genetic propensities that are associated with their risk to offend. However, our findings do not support a view of genetics as destiny. Many children who carry few education-associated alleles develop reasonable behavioral control, complete schooling, and become productive members of society. Others develop behavioral problems, drop out of school, and commit crimes. Alongside environmental factors, genetics explain a small proportion of these individual differences in life outcomes. Genetic risk operates through a series of intermediate phenotypes that are also under the influence of the social environment and that can provide targets for intervention, such as low self-control and academic difficulties. Intervening with these early-emerging characteristics and behaviors, for example through early training of self-control and academic skills (Heckman, 2006), may be one strategy to disrupt pathways from genes to offending.
Finally, some people look back at the fraught history of behavioral genetics and wonder whether genetic influences on social behavior should be studied at all. Instead of fearing sociogenomic research, or focusing on genetics to the neglect of other risk factors, here we incorporate molecular genetic predictors into existing sociological and psychological theories, and find that a polygenic score for education acts much like any other risk factor for offending: it has modest, probabilistic effects that are mediated by characteristics and behaviors criminologists have studied for decades. Our study demonstrates that existing theories in the social and behavioral sciences can accommodate molecular-genetic discoveries by weaving them into the frameworks of understanding that we already have about human behavior.
Supplementary Material
Acknowledgments
The E-Risk Study is funded by UK MRC grant G1002190. The Dunedin Multidisciplinary Health and Development Research Unit is funded by the NZ HRC and NZ MBIE. We thank Dunedin Study founder Phil Silva. This research was supported by NIA Grant AG032282, NICHD grant HD077482, MRC Grant MR/P005918/1, the Jacobs Foundation and the Avielle Foundation. Data-support was provided by Duke’s Social Science Research Institute and NC Biotechnology Center grant 2016-IDG-1013. D.W. Belsky and C.L. Odgers are supported by Fellowships from the Jacobs Foundation. A.L. Beckley is supported by a FORTE Marie Curie International Fellowship . L. Arseneault is the Mental Health Leadership Fellow for the UK ESRC.
Footnotes
AUTHOR CONTRIBUTIONS
J. Wertz, T.E. Moffitt and A. Caspi conceived the study and wrote the manuscript. A. Caspi, T.E. Moffitt, L. Arseneault, R. Poulton, S. Hogan, N. Morgan, D.L. Corcoran, J. A. Prinz, K. Sugden, and B.S. Williams collected the data, and it was analyzed by J.Wertz, D.W. Belsky, R.M. Houts and C.L. Odgers . J. Wertz, T.E. Moffitt, A. Caspi, D.W. Belsky, A.L. Beckley and J.C. Barnes interpreted the results. All authors reviewed drafts, provided critical feedback and approved the final manuscript.
References
- Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, McVean GA. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. doi: 10.1038/nature11632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Achenbach TM. Manual for the Young Adult Self-Report and Young Adult Behavior Checklist. Burlington, VT: University of Vermont; 1997. [Google Scholar]
- Agnew R. Foundation for a general strain theory of crime and delinquency. Criminology. 1992;30:47–88. [Google Scholar]
- APA. Diagnostic and Statistical Manual of Mental Disorders. 4th. Washington, DC: American Psychiatric Association; 1994. [Google Scholar]
- Belsky DW, Moffitt TE, Caspi A. Genetics in population health science: Strategies and opportunities. American Journal of Public Health. 2013;103(Suppl):S73–83. doi: 10.2105/AJPH.2012.301139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Belsky DW, Moffitt TE, Corcoran DL, Domingue B, Harrington H, Hogan S, Caspi A. The genetics of success: How single-nucleotide polymorphisms associated with educational attainment relate to life-course development. Psychological Science. 2016;27:957–972. doi: 10.1177/0956797616643070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burt CH, Simons RL. Pulling back the curtain on heritability studies: Biosocial criminology in the postgenomic era. Criminology. 2014;52:223–262. [Google Scholar]
- Caspi A, Taylor A, Smart M, Jackson J, Tagami S, Moffitt TE. Can women provide reliable information about their children’s fathers? Cross-informant agreement about men’s lifetime antisocial behaviour. Journal of Child Psychology and Psychiatry. 2001;42:915–920. doi: 10.1111/1469-7610.00787. [DOI] [PubMed] [Google Scholar]
- Catalano RF, Oesterle S, Fleming CB, Hawkins JD. The importance of bonding to school for healthy development: Findings from the Social Development Research Group. Journal of School Health. 2004;74:252–261. doi: 10.1111/j.1746-1561.2004.tb08281.x. [DOI] [PubMed] [Google Scholar]
- Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: Rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dudbridge F. Power and predictive accuracy of polygenic risk scores. PLoS Genetics. 2013;9:e1003348. doi: 10.1371/journal.pgen.1003348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Euesden J, Lewis CM, O’Reilly PF. PRSice: Polygenic Risk Score software. Bioinformatics. 2015;31:1466–1468. doi: 10.1093/bioinformatics/btu848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gottfredson MR, Hirschi T. A general theory of crime. Stanford, CA: Stanford University Press; 1990. [Google Scholar]
- Heckman JJ. Skill formation and the economics of investing in disadvantaged children. Science. 2006;312:1900–1902. doi: 10.1126/science.1128898. [DOI] [PubMed] [Google Scholar]
- Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. Plos Genetics. 2009;5:e1000529. doi: 10.1371/journal.pgen.1000529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCarty CA, Chisholm RL, Chute CG, Kullo IJ, Jarvik GP, Larson EB, Wolf WA. The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Medical Genomics. 2011;4:13. doi: 10.1186/1755-8794-4-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Merton RK. Social structure and anomie. American Sociological Review. 1938;3:672–682. [Google Scholar]
- Milne BJ, Moffitt TE, Crump R, Poulton R, Rutter M, Sears MR, Caspi A. How should we construct psychiatric family history scores? A comparison of alternative approaches from the Dunedin Family Health History Study. Psychological Medicine. 2008;38:1793–1802. doi: 10.1017/S0033291708003115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moffitt TE. The neuropsychology of conduct disorder. Development and Psychopathology. 1993a;5:135–151. [Google Scholar]
- Moffitt TE. Adolescence-limited and life-course-persistent antisocial behavior: A developmental taxonomy. Psychological Review. 1993b;100:674–701. [PubMed] [Google Scholar]
- Moffitt TE, Arseneault L, Belsky D, Dickson N, Hancox RJ, Harrington H, Caspi A. A gradient of childhood self-control predicts health, wealth, and public safety. Proceedings of the National Academy of Sciences of the United States of America. 2011;108:2693–2698. doi: 10.1073/pnas.1010076108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moffitt TE, E-Risk Study Team Teen-aged mothers in contemporary Britain. Journal of Child Psychology and Psychiatry. 2002;43(6):727–42. doi: 10.1111/1469-7610.00082. [DOI] [PubMed] [Google Scholar]
- Odgers CL, Caspi A, Bates CJ, Sampson RJ, Moffitt TE. Systematic social observation of children’s neighborhoods using Google Street View: A reliable and cost-effective method. Journal of Child Psychology and Psychiatry. 2012;53:1009–17. doi: 10.1111/j.1469-7610.2012.02565.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Odgers CL, Caspi A, Broadbent JM, Dickson N, Hancox RJ, Harrington H, Moffitt TE. Prediction of differential adult health burden by conduct problem subtypes in males. Archives of General Psychiatry. 2007;64:476–84. doi: 10.1001/archpsyc.64.4.476. [DOI] [PubMed] [Google Scholar]
- Odgers CL, Moffitt TE, Broadbent JM, Dickson N, Hancox RJ, Harrington H, Caspi A. Female and male antisocial trajectories: From childhood origins to adult outcomes. Development and Psychopathology. 2008;20:673–716. doi: 10.1017/S0954579408000333. [DOI] [PubMed] [Google Scholar]
- Okbay A, Beauchamp JP, Fontana MA, Lee JJ, Pers TH, Rietveld CA, Benjamin DJ. Genome-wide association study identifies 74 loci associated with educational attainment. Nature. 2016;533:539–542. doi: 10.1038/nature17671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poulton R, Caspi A, Milne BJ, Thomson WM, Taylor A, Sears MR, Moffitt TE. Association between children’s experience of socioeconomic disadvantage and adult health: A life-course study. The Lancet. 2002;360:1640–1645. doi: 10.1016/S0140-6736(02)11602-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poulton R, Moffitt TE, Silva PA. The Dunedin Multidisciplinary Health and Development Study: Overview of the first 40 years, with an eye to the future. Social Psychiatry and Psychiatric Epidemiology. 2015;50:679–93. doi: 10.1007/s00127-015-1048-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rhee SH, Waldman ID. Genetic and environmental influences on antisocial behavior: A meta-analysis of twin and adoption studies. Psychological Bulletin. 2002;128:490–529. [PubMed] [Google Scholar]
- Rietveld CA, Esko T, Davies G, Pers TH, Turley P, Benyamin B, Smithkk BH. Common genetic variants associated with cognitive performance identified using the proxy-phenotype method. Proc Natl Acad Sci. 2014;111:13790–13794. doi: 10.1073/pnas.1404623111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rivenbark J, Odgers CL, Caspi A, Harrington HL, Hogan S, Houts RM, Poulton R, Moffitt TE. The high societal costs of childhood conduct problems: Evidence from electronic medical records and administrative records of social services and crime. Journal of Child Psychology and Psychiatry. doi: 10.1111/jcpp.12850. (forthcoming) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robins LN, Cottler L, Bucholz KK, Compton W. Diagnostic Interview Schedule for DSM-IV. St Louis, MO: Washington University School of Medicine; 1995. [Google Scholar]
- Rosenberg NA, Huang L, Jewett EM, Szpiech ZA, Jankovic I, Boehnke M. Genome-wide association studies in diverse populations. Nature Reviews Genetics. 2010;11:356–366. doi: 10.1038/nrg2760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sattler JM. Assessment of Children: WISC-III and WPPSI-R Supplement. San Diego, CA: J.M. Sattler; 1992. [Google Scholar]
- Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. dbSNP: the NCBI database of genetic variation. Nucleic Acids Research. 2001;29:308–311. doi: 10.1093/nar/29.1.308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stattin H, Magnusson D. Onset of official delinquency: Its co-occurrence in time with educational, behavioral, and interpersonal problems. British Journal of Criminology. 1995;35:417–449. [Google Scholar]
- Tielbeek JJ, Johansson A, Polderman TA, Rautiainen M, Jansen P, Taylor M, Posthuma D. Genome-wide association studies of a broad spectrum of antisocial behavior. JAMA Psychiatry. doi: 10.1001/jamapsychiatry.2017.3069. (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trouton A, Spinath FM, Plomin R. Twins early development study (TEDS): A multivariate, longitudinal genetic investigation of language, cognition and behavior problems in childhood. Twin Research. 2002;5:444–448. doi: 10.1375/136905202320906255. [DOI] [PubMed] [Google Scholar]
- Trzesniewski KH, Moffitt TE, Caspi A, Taylor A, Maughan B. Revisiting the association between reading achievement and antisocial behavior: New evidence of an environmental explanation from a twin study. Child Development. 2006;77:72–88. doi: 10.1111/j.1467-8624.2006.00857.x. [DOI] [PubMed] [Google Scholar]
- Visscher PM, Wray NR, Zhang Q, Sklar P, McCarthy MI, Brown MA, Yang J. 10 years of GWAS discovery: Biology, function, and translation. The American Journal of Human Genetics. 2017;101:5–22. doi: 10.1016/j.ajhg.2017.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wechsler D. Manual for the Wechsler Intelligence Scale for Children – Revised. New York, NY: Psychological Corporation; 1974. [Google Scholar]
- Wechsler D. Wechsler Preschool and Primary Scale of Intelligence–Revised. London, UK: Psychological Corporation; 1990. [Google Scholar]
- Wilson JQ, Herrnstein RJ. Crime and human nature: The definitive study on the causes of crime. New York, NY: Simon & Schuster; 1985. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.