Commentary on ‘Regulatory genomic circuitry of human disease loci by integrative epigenomics’ by C. Boix et al. Nature, 2021, 590, 300–307.
In the last two decades, genome-wide association studies (GWASs) took advantage of advances in genome-wide genotyping technology, and large population and patient datasets, to explore the role of common variants on phenotypic traits and disease susceptibility. GWAS have the major advantage of being hypothesis free, thus allowing investigators to uncover associations between cardiovascular traits and genetic polymorphisms, which can help point to unexpected pathways involved in cardiovascular pathophysiology such as the important role of inflammatory-related genes in coronary artery disease (CAD)1 or as the identification of the PITX2 transcription factor in atrial fibrillation.2 According to the GWAS catalogue database (release of 25 March 2021), 1329 polymorphism-cardiovascular trait associations have been so far described. This growing catalogue of genome-wide (P < 5 × 10−8) and nominally (P < 0.05) significant variants has also opened the door to creating polygenic risk scores (PRS) and other personalized medicine approaches, by identifying individuals at risk of developing specific cardiovascular diseases or sub-groups of patients with a more severe prognosis.
But despite their remarkable success, GWAS also have their limitations. Most notably, the core output of GWAS, being long lists of disease-associated genes and single-nucleotide polymorphisms (SNPs), provide no additional insights on potential mechanisms of action. As a result of this, in parallel with the mounting success of GWAS, has been the growing list of disease-associated genes and SNPs for which we have little or no idea about how they might cause disease. This issue is compounded by the fact that the large majority of genetic signals (90%) are located in non-coding regions of the genome, likely corresponding to transcriptional regulatory elements.3 Thus, a major challenge that has arisen, has been understanding the impact of variants in such regulatory elements, and the added complexity of needing to consider not only gene regulation but also the tissue pleiotropy of an enhancer. That is, a given enhancer or transcriptional regulatory element might have a profound effect in a specific tissue (i.e. in liver), but have no effect or even the oppositive effect in another tissue (i.e. the heart). At a minimum, any attempt at a mechanistic molecular understanding of these factors requires the integration of: (i) GWAS data comprising hundreds of disease-associated SNPs and genes; (ii) epigenetic data, obtained from specific human tissues since the location and role of the regulatory elements (promoter or enhancer) are tissue- and species-dependent; (iii) transcriptomic (RNA-seq) data sets to assess human tissue gene expression levels (or even gene isoforms) that can be correlated with genetic variants (eQTL: expression Quantitative Trait Loci) and which can then point to causal variants or target genes thus controlled or modulated.
In a recent publication in Nature, Boix et al. in their study ‘Regulatory genomic circuitry of human disease loci by integrative epigenomics’, undertook a tour de force where they tackle these exact challenges.4 They leverage unprecedented network data and tools to disclose, among other things: (i) the effect of individual variants on multiple traits through effects in different tissues; (ii) the combined effect of multiple variants on the expression pattern of individual genes in multiple different tissues; and (iii) the combined effect of multiple genes in one or multiple tissues. They then developed a repository called Epimap (http://compbio.mit.edu/epimap) based on the uniform processing of 833 biosamples (comprising 33 tissue categories) for 18 epigenomics marks resulting in 2850 observed and 14 510 imputed tracks. This comprehensive annotation revealed more than 2 million enhancers, with an average of 13 enhancers modulating each gene and 1.5 genes controlled by each enhancer. Boix et al. also demonstrated enhancer co-regulation and described 300 enhancer modules including 290 tissue-specific modules. These different layers of data integration appear to be essential for understanding the genetic architecture underlying the broad phenotypic traits encountered in common and complex cardiovascular diseases such as CAD (Figure 1). As an example, Boix et al. described 56 ‘unifactorial’ traits enriched in only one tissue such as the QT interval in heart (where the heart is the only tissue that might contribute to controlling the QT interval), whereas 192 ‘multifactorial’ traits find enrichment on average in five different tissues such as waist-to-hip ratio in adipose, muscle, kidney, and digestive tissues, and finally, 26 ‘polyfactorial’ traits in 14 tissue categories on average such as CAD that was associated with enrichment in 19 tissues including liver, heart, adipose, muscle, and endocrine samples.
Figure 1.
Pleotropic impact at the gene- and tissue-level of variants identified in GWAS. The integration of different layers of data (epigenomic and transcriptional datasets) is essential to understand the genetic architecture underlying the broad phenotypic traits encountered in common and complex cardiovascular diseases such as CAD. Organ's cartoons have been taken from https://smart.servier.com.
In addition, their study provides an unprecedented integrative comprehension at the gene and tissue level of the pleotropic impact of variants identified in GWAS. Common variants identified by GWAS (P < 5 × 10−8) are mostly considered as modest contributors to disease inheritance. However, this study supports the emerging concept of the likely larger contribution of variants exhibiting nominally significant P-values (P < 0.05 but >5 × 10−8), as also suggested by studies examining either gene-regulatory networks5 or polygenic risk scores6,7 for CAD. Furthermore, their study supports the concept that common variant effects may be potentiated by combined genetic factors impacting the same gene-pathway, in combination with other (rare or low frequency) variants, drugs, or environments.8,9
The study by Boix et al. is at the same time a rich scientific resource, but also a lesson regarding the profound and magnificent complexity of the human genome and the causal basis of common diseases like CAD. Their enhancer modules and trait-trait networks open the door to understanding fine gene regulation on one hand and broad distance gene regulation pleiotropy orchestrated by enhancers of multiple genes and multiple tissues. The study also underscores the importance of developing global tissue research programs to decipher the broad phenotypic spectrum of complex diseases, while also highlighting the complex and multi-layered analyses that are required to meaningfully interpret these datasets. More newsworthy than ever, multidisciplinary research that integrates complex multi-layered datasets from various sources appears to be key for deciphering the full complexity of cardiovascular diseases.
Precision medicine is being increasingly applied to manage patients with Mendelian disorders, where genotype can directly inform phenotype and management decisions can be guided by patient-specific information. But given their remarkably complex genetic and epigenetic basis, common diseases like CAD, obesity and hypertension lag behind Mendelian disorders in terms of being able to apply precision medicine. However, by continuing to advance our understanding of why individual patients develop common complex diseases, we foresee that advanced approaches such as integrative epigenomics, in combination with other techniques, will eventually help to guide patient management decisions. For example, while PRS have so far been marginal in their ability to predict disease or disease severity, integrative epigenomics will likely facilitate the fine-tuning of PRS by re-assigning weights to differing variants according to their specific pleiotropic effects. So, while we are not there yet, through integrative epigenomics and other advances it seems that we are inching towards the long-held goal that precision medicine will one day inform the delivery of the right treatment, to the right patient, at the right time, for every patient.
Conflict of interest: none declared.
Funding
J.B. is supported by the research program Etoiles montantes des Pays de la Loire REGIOCARD RPH081-U1087-REG-PDL, by the ANR LEARN (R21006NN) and by the ANR EJPRD LQTS-NEXT. J.C.K. acknowledges research support from the National Institutes of Health (R01HL130423, R01HL135093, R01HL148167-01A1) and New South Wales health grant RG194194.
Authors
Biography: Julien Barc' s group leads genetics and epigenetics research programs on the inherited cardiac arrhythmia at l institut du thorax in Nantes, France, where he obtained his PhD in 2009 in genetics of ventricular cardiac arrhythmia. Julien spent then 5 years at the department of Experimental Cardiology of Amsterdam, the Netherlands, a world leader group in genetics and functional studies on cardiac arrhythmias. He dedicated the early part of his carrier to identify new syndromes, genes and mutations associated with inherited cardiac arrhythmia at risk of sudden death. More recently he developed research programs on patient population to investigate the role of common variants in cardiac disorders. This conducts Julien to explore the role of regulatory region by initiating epigenomics projects and develop models based on genome editing in induced pluripotent stem cell-derived cardiomyocytes.
Biography: Professor Jason Kovacic graduated from The University of Melbourne Medical School in 1994, and then undertook cardiology training at St Vincent s Hospital in Sydney including a PhD in cardiovascular medicine at the Victor Chang Cardiac Research Institute. In 2007, he relocated to the USA, initially to do a postdoc at the National Heart, Lung and Blood Institute (NHLBI) at the National Institutes of Health (NIH) in Bethesda, Maryland. After this postdoc he then moved to The Icahn School of Medicine at Mount Sinai in New York. In early 2020 Jason was appointed as the Executive Director of the Victor Chang Cardiac Research Institute in Sydney, Australia. He continues to practice as a clinical cardiologist and runs a research program focused on atherosclerosis, fibromuscular dysplasia and other vascular diseases.
References
- 1. Ruparelia N, Chai JT, Fisher EA, Choudhury RP.. Inflammatory processes in cardiovascular disease: a route to targeted therapies. Nat Rev Cardiol 2017;14:133–144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Roselli C, Chaffin MD, Weng L-C, Aeschbacher S, Ahlberg G, Albert CM, Almgren P, Alonso A, Anderson CD, Aragam KG, Arking DE, Barnard J, Bartz TM, Benjamin EJ, Bihlmeyer NA, Bis JC, Bloom HL, Boerwinkle E, Bottinger EB, Brody JA, Calkins H, Campbell A, Cappola TP, Carlquist J, Chasman DI, Chen LY, Chen Y-DI, Choi E-K, Choi SH, Christophersen IE, Chung MK, Cole JW, Conen D, Cook J, Crijns HJ, Cutler MJ, Damrauer SM, Daniels BR, Darbar D, Delgado G, Denny JC, Dichgans M, Dörr M, Dudink EA, Dudley SC, Esa N, Esko T, Eskola M, Fatkin D, Felix SB, Ford I, Franco OH, Geelhoed B, Grewal RP, Gudnason V, Guo X, Gupta N, Gustafsson S, Gutmann R, Hamsten A, Harris TB, Hayward C, Heckbert SR, Hernesniemi J, Hocking LJ, Hofman A, Horimoto ARVR, Huang J, Huang PL, Huffman J, Ingelsson E, Ipek EG, Ito K, Jimenez-Conde J, Johnson R, Jukema JW, Kääb S, Kähönen M, Kamatani Y, Kane JP, Kastrati A, Kathiresan S, Katschnig-Winter P, Kavousi M, Kessler T, Kietselaer BL, Kirchhof P, Kleber ME, Knight S, Krieger JE, Kubo M, Launer LJ, Laurikka J, Lehtimäki T, Leineweber K, Lemaitre RN, Li M, Lim HE, Lin HJ, Lin H, Lind L, Lindgren CM, Lokki M-L, London B, Loos RJF, Low S-K, Lu Y, Lyytikäinen L-P, Macfarlane PW, Magnusson PK, Mahajan A, Malik R, Mansur AJ, Marcus GM, Margolin L, Margulies KB, März W, McManus DD, Melander O, Mohanty S, Montgomery JA, Morley MP, Morris AP, Müller-Nurasyid M, Natale A, Nazarian S, Neumann B, Newton-Cheh C, Niemeijer MN, Nikus K, Nilsson P, Noordam R, Oellers H, Olesen MS, Orho-Melander M, Padmanabhan S, Pak H-N, Paré G, Pedersen NL, Pera J, Pereira A, Porteous D, Psaty BM, Pulit SL, Pullinger CR, Rader DJ, Refsgaard L, Ribasés M, Ridker PM, Rienstra M, Risch L, Roden DM, Rosand J, Rosenberg MA, Rost N, Rotter JI, Saba S, Sandhu RK, Schnabel RB, Schramm K, Schunkert H, Schurman C, Scott SA, Seppälä I, Shaffer C, Shah S, Shalaby AA, Shim J, Shoemaker MB, Siland JE, Sinisalo J, Sinner MF, Slowik A, Smith AV, Smith BH, Smith JG, Smith JD, Smith NL, Soliman EZ, Sotoodehnia N, Stricker BH, Sun A, Sun H, Svendsen JH, Tanaka T, Tanriverdi K, Taylor KD, Teder-Laving M, Teumer A, Thériault S, Trompet S, Tucker NR, Tveit A, Uitterlinden AG, Van Der Harst P, Van Gelder IC, Van Wagoner DR, Verweij N, Vlachopoulou E, Völker U, Wang B, Weeke PE, Weijs B, Weiss R, Weiss S, Wells QS, Wiggins KL, Wong JA, Woo D, Worrall BB, Yang P-S, Yao J, Yoneda ZT, Zeller T, Zeng L, Lubitz SA, Lunetta KL, Ellinor PT.. Multi-ethnic genome-wide association study for atrial fibrillation. Nat Genet 2018;50:1225–1233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Edwards SL, Beesley J, French JD, Dunning AM.. Beyond GWASs: illuminating the dark road from association to function. Am J Hum Genet 2013;93:779–797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Boix CA, , James BT, , Park YP, , Meuleman W, , Kellis M. Regulatory genomic circuitry of human disease loci by integrative epigenomics. Nature 2021;590:300–307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Zeng L, Talukdar HA, Koplev S, Giannarelli C, Ivert T, Gan L-M, Ruusalepp A, Schadt EE, Kovacic JC, Lusis AJ, Michoel T, Schunkert H, Björkegren JLM.. Contribution of gene regulatory networks to heritability of coronary artery disease. J Am Coll Cardiol 2019;73:2946–2957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Khera AV, Chaffin M, Aragam KG, Haas ME, Roselli C, Choi SH, Natarajan P, Lander ES, Lubitz SA, Ellinor PT, Kathiresan S.. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet 2018;50:1219–1224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Inouye M, Abraham G, Nelson CP, Wood AM, Sweeting MJ, Dudbridge F, Lai FY, Kaptoge S, Brozynska M, Wang T, Ye S, Webb TR, Rutter MK, Tzoulaki I, Patel RS, Loos RJF, Keavney B, Hemingway H, Thompson J, Watkins H, Deloukas P, Di Angelantonio E, Butterworth AS, Danesh J, Samani NJ; UK Biobank CardioMetabolic Consortium CHD Working Group. Genomic risk prediction of coronary artery disease in 480,000 adults: implications for primary prevention. J Am Coll Cardiol 2018;72:1883–1893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Cerrone M, Remme CA, Tadros R, Bezzina CR, Delmar M.. Beyond the one gene-one disease paradigm: complex genetics and pleiotropy in inheritable cardiac disorders. Circulation 2019;140:595–610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Bezzina CR, Barc J, Mizusawa Y, Remme CA, Gourraud J-B, Simonet F, Verkerk AO, Schwartz PJ, Crotti L, Dagradi F, Guicheney P, Fressart V, Leenhardt A, Antzelevitch C, Bartkowiak S, Borggrefe M, Schimpf R, Schulze-Bahr E, Zumhagen S, Behr ER, Bastiaenen R, Tfelt-Hansen J, Olesen MS, Kääb S, Beckmann BM, Weeke P, Watanabe H, Endo N, Minamino T, Horie M, Ohno S, Hasegawa K, Makita N, Nogami A, Shimizu W, Aiba T, Froguel P, Balkau B, Lantieri O, Torchio M, Wiese C, Weber D, Wolswinkel R, Coronel R, Boukens BJ, Bézieau S, Charpentier E, Chatel S, Despres A, Gros F, Kyndt F, Lecointe S, Lindenbaum P, Portero V, Violleau J, Gessler M, Tan HL, Roden DM, Christoffels VM, Le Marec H, Wilde AA, Probst V, Schott J-J, Dina C, Redon R.. Common variants at SCN5A-SCN10A and HEY2 are associated with Brugada syndrome, a rare disease with high risk of sudden cardiac death. Nat Genet 2013;45:1044–1049. [DOI] [PMC free article] [PubMed] [Google Scholar]