Abstract
Clinical presentation of congenital heart disease is heterogeneous, making identification of the disease-causing genes and their genetic pathways and mechanisms of action challenging. By using in vivo electrocardiography, transthoracic echocardiography and microcomputed tomography imaging to screen 3,894 single-gene-null mouse lines for structural and functional cardiac abnormalities, here we identify 705 lines with cardiac arrhythmia, myocardial hypertrophy and/or ventricular dilation. Among these 705 genes, 486 have not been previously associated with cardiac dysfunction in humans, and some of them represent variants of unknown relevance (VUR). Mice with mutations in Casz1, Dnajc18, Pde4dip, Rnf38 or Tmem161b genes show developmental cardiac structural abnormalities, with their human orthologs being categorized as VUR. Using UK Biobank data, we validate the importance of the DNAJC18 gene for cardiac homeostasis by showing that its loss of function is associated with altered left ventricular systolic function. Our results identify hundreds of previously unappreciated genes with potential function in congenital heart disease and suggest causal function of five VUR in congenital heart disease.
Subject terms: Genetic predisposition to disease, Cardiovascular genetics
In a multicenter research program coordinated by the International Mouse Phenotyping Consortium, Spielmann et al. analyze the cardiac function and structure in ~4,000 monogenic mutant mice and identify 705 mouse genes involved in cardiac function, 75% of which have not been previously linked to cardiac heritable disease in humans. Using the UK Biobank human data, the authors validate the link between cardiovascular disease and some of the newly identified genes to illustrate the resource value and potential of their mutant mouse collection.
Main
Human cardiovascular disease (CVD) is multifactorial with genetic and exogenous contributors. Almost all areas of clinical adult and pediatric cardiology include many rare CVDs that are associated with complex underlying genetic etiologies. These genetic disorders include subsets of inherited arrhythmias, cardiomyopathies, vascular diseases and/or structural heart defects that present as heterogeneous phenotypes with variable penetrance and expression1.
Arrhythmias are caused by anomalies in the electrical conduction system of the heart that regulates and coordinates myocardial contraction. The severity of arrhythmias varies from clinically benign to life threatening, causing sudden cardiac death2. Cardiac myocytes are excitable cells that generate an action potential in response to a stimulus that elicits a contractile response. Phases of the cardiac action potential, such as the P wave, the PR interval, the QRS complex or the QT interval, reflect atrial and ventricular depolarization or repolarization and can be measured by surface electrocardiography (ECG). Thus, any changes in ECG intervals can indicate abnormalities in cardiac conduction or electrical instability of the heart muscle leading to cardiac arrhythmia.
Primary cardiomyopathies are diseases of the heart muscle often associated with dysfunction of the sarcomere’s myofibrillar proteins, actin and myosin. Clinical manifestations include systolic or diastolic dysfunction. Abnormalities on echocardiography, such as enlarged ventricular chambers (that is, dilatation), myocardial wall thickening (that is, hypertrophy) or left ventricular (LV) systolic dysfunction indicate pathologic myocardial remodeling and/or cardiomyopathies.
Structural heart defects are the most prevalent congenital condition in human infants and result from perturbations to normal cardiac development causing structural abnormalities of the heart itself, the adjacent arteries and veins or a combination of heart and cardiac vasculature3. For example, ventricular septal defect (VSD) is a developmental malformation of the interventricular septum, allowing blood flow between the lumens of the two ventricles. Isolated VSD4 occurs in approximately two to six of every 1,000 live births and accounts for more than 20% of all human congenital heart disease (CHD), making it the second most common congenital heart defect after bicuspid aortic valve defects5.
Complete understanding of the pathogenesis of human congenital heart defects depends on knowing the genes (or potential candidates) that can cause or contribute to its manifestation and severity, knowledge of the relative penetrance and expressivity of these genes and informative mammalian models to identify a particular variant’s mechanism of action6. While many genes have been associated with CHD, the detection rate for rare congenital structural heart defect variants is low, at about 50% in syndromic CHD and about 30% in nonsyndromic CHD7. A large proportion of CHD, particularly in severely affected individuals, occurs in families in which there is no history of CHD.
In this study, we used 3,894 single-gene-knockout mouse lines (null deletion alleles) produced by the International Mouse Phenotyping Consortium (IMPC) for unbiased discovery of genes corresponding to monogenic heart disease phenotypes. We identified structural and functional cardiac abnormalities in adult mice by clinical in vivo ECG and transthoracic echocardiography (TTE) and by ex situ iodine contrast high-spatial-resolution microcomputed tomography (micro-CT) imaging of embryo hearts to reproduce the effects of known genes and identify hitherto unknown genes that as null alleles corresponded to congenital monogenic cardiac rhythm disorders, cardiomyopathies and structural heart defects. However, we have not considered vascular diseases. Our study identified 705 genes associated with cardiac rhythm disorder, myocardial hypertrophy and/or ventricular dilation phenotypes, 70% (486) of which have not, to our knowledge, been previously linked to cardiac function or cardiac disease. We further investigated a subset of the 486 genes not yet associated with CHD for causal inferences.
Results
Cardiac abnormalities associated with null alleles for 705 genes
We generated and analyzed phenotype data using TTE and ECG from 3,894 IMPC mouse lines homozygous for a single-gene null knockout mutation or heterozygous if homozygous mutation was lethal. All mutants and corresponding control animals were young adult mice (12 weeks of age) of the C57BL/6N background. This dataset included 3,836 of 3,894 lines with ECG data, 1,398 of 3,894 lines with ECG and TTE data and 58 lines with TTE data only. We identified 705 lines (genes) with at least one abnormal cardiac phenotype by applying the statistics of a soft windowing approach8 that compensates for temporal batch effect combined with q-value correction of the genotype contribution P values (R package qvalue9, version 2.14.1, R version 3.5.3 (ref. 8)) to account for multiple testing (Supplementary Data 1). The significance value for phenotyping deviance on these q values was set to 0.05. A total of 3,189 gene knockouts showed no significant differences in ECG and TTE parameters between mutants and controls. Separated by test, 424 gene knockouts had differences in ECG only between knockout mice and controls, 243 knockouts had differences in TTE only and 38 knockouts had significant differences for both tests (Extended Data Fig. 1a,b). A comprehensive overview of the null alleles and the detected phenotypes is provided in Supplementary Table 1. As examples, we focus here on five genes with the highest number of significant (that is, abnormal) parameters from a single test method, that is, five genes from ECG testing and five genes from TTE testing.
Candidate genes for cardiac conduction system disease and rhythm disorders
Among 424 gene knockouts with ECG alterations, homozygous Gatm-, Pla2g10-, Elmod1- and Masp1-knockout mice and mice heterozygous for Cap2 had the highest number of significant phenotype differences compared to controls. These five genes, given the presence of multiple abnormal ECG phenotype parameters, are most likely to be associated with abnormal function of the cardiac conduction system and cardiomyocyte arrhythmias. The gene Gatm (glycine amidinotransferase, l-arginine–glycine amidinotransferase) encodes a mitochondrial enzyme involved in creatine biosynthesis with a potential role in embryonic and central nervous system development10. Homozygous deletion of Gatm resulted in low heart rate with prolonged QRS width and rate-corrected QT (QTc) and ST interval length (Fig. 1a). Pla2g10 encodes phospholipase A2 group 10, a calcium-dependent enzyme, which catalyzes the release of arachidonic acid from cell membrane phospholipids11. Pla2g10 homozygous knockout mice had a lower heart rate with prolonged PR, PQ and QTc interval length, and females were predominantly affected (Fig. 1b). Elmod1 encodes ELMO/CED-12 domain-containing 1, which acts as a GTPase-activating protein for the ARF family of small G proteins12. Homozygous deletion of Elmod1 had no effect on heart rate but shortened the duration of PR, PQ and ST intervals. Masp1 encodes mannan-binding lectin serine protease 1, a serine protease that functions as a component of the lectin pathway of complement activation with an essential role in innate and adaptive immune responses13. Homozygous deletion of Masp1 resulted in lower heart rate with prolonged QRS and ST intervals. Cap2 encodes adenylate cyclase-associated protein 2 (yeast) (CAP), a protein related to the Saccharomyces cerevisiae CAP protein involved in the cyclic AMP pathway that regulates filament dynamics and is implicated in a number of complex molecular and morphological developmental processes, including mRNA localization and the establishment of cell polarity14. Heterozygous deletion of Cap2 caused a lower heart rate and prolonged QTc and ST durations, both indicative of ventricular conduction delay (Fig. 1c). None of these five genes with the highest number of significant differences in ECG parameters have previously been reported to be associated with cardiac conduction system disease and rhythm disorders.
Candidate and known genes for monogenic cardiomyopathies
Of the 243 gene knockouts with significant TTE alterations, homozygous Fzd8-, Leprotl1-, Abhd3-, Ints6l- and Alpk3-knockout mouse lines had the highest number of significant abnormal phenotypes compared to controls. The gene Fzd8 (encoding frizzled class receptor 8) is an intronless member of the frizzled gene family that encodes frizzled receptors that play a critical role when coupled to the canonical WNT–β-catenin pathway in LV remodeling and cardiac development15. Homozygous loss of Fzd8 resulted in reductions of LV internal diameters (LVID) during systolic and diastolic contraction (LVIDs and LVIDd) with increased myocardial wall thickness (LV anterior wall (LVAW)s, LVAWd and LV posterior wall (LVPW)s) and decreased systolic function measured by fractional shortening and ejection fraction. In this study, all homozygous Fzd8-null mice showed distinct features of hypertrophic cardiomyopathy with normal ECG, strongly suggesting a role for Fzd8 in cardiomyopathy. Identical hypertrophic cardiomyopathy phenotypes were identified in Leprotl1 (encoding homozygous leptin receptor overlapping transcript-like 1) mutants compared to controls (Fig. 1d). The product of Leprotl1 negatively regulates cell surface expression of the growth hormone (GH) receptor in the liver and is an important metabolic regulator stimulating lipolysis, preventing protein catabolism and reducing insulin-dependent glucose disposal during nutrient starvation. Leprotl1 was also shown to be a molecular link between nutritional signals and GH activity that influences body growth and metabolism16. A hypertrophic cardiomyopathy phenotype is unrecognized for Leprotl1. By contrast, mice with homozygous loss of Ints6l (integrator complex subunit 6 like) or Abhd3 (abhydrolase domain-containing 3) had an increase in LV diameters (LVIDs and LVIDd) with decreased systolic function, indicated by reduced fractional shortening and ejection fraction. These findings confirmed decreased thickness of both LVAW and LVPW, suggesting dilated LV cardiomyopathy in both mutant lines. A cardiomyopathy phenotype has not been reported previously for these two genes. The gene Alpk3 (α-kinase 3) is involved in cardiomyocyte differentiation and is associated with several forms of cardiomyopathy17. Homozygous Alpk3-knockout mice had features of both hypertrophic and dilated forms of cardiomyopathy, reproducing the same phenotype previously reported for an Alpk3 mouse model18. Our test data confirmed reproducible myocardial hypertrophy in Alpk3-null mice, characterized by increased end-diastolic LVAW and LVPW thickness. Several changes more typically associated with dilated cardiomyopathy included marked increases in LV end-diastolic and end-systolic volumes and increased LV diameters (LVIDs and LVIDd) as well as reduced ejection fraction and fractional shortening, suggestive of LV chamber dilation (Fig. 1e). Of the five representative examples with abnormal TTE reported here, four represent unidentified candidate genes for cardiomyopathy and one provides validation of a previously reported association.
Thirty-eight of the gene-knockout mouse lines that we tested had abnormal phenotypes as shown by both ECG and TTE. Heterozygous loss of smoothened, frizzled class receptor (Smo) resulted in prolonged PR and PQ in ECG, and, under isoflurane anesthesia, Smo mutants showed normal cardiac function and normal LV dimensions but a higher heart rate as shown by TTE. Other examples included scavenger receptor class B member 1 (Scarb1)19,20 and collagen type I α2 (Col1a2)21–23, known for their role in CVD, and adaptor-related protein complex AP-4 ε1 (Ap4e1) and LSM1 homolog, mRNA degradation associated (Lsm1), which our data identify as undetected candidates for cardiac defects. The product of Ap4e1 is a component of the adaptor protein complex, the components of which are involved in both vesicle formation and cargo selection. AP-4-deficiency syndrome arises from defects in protein coats involved in intracellular protein sorting referred to as ‘coatopathies’, which are multi-systemic and often impact the central nervous system24. Homozygous loss of Ap4e1 resulted in decreased heart rate, confirmed by increased RR intervals in electrocardiograms. Ape4e1-knockout mice also had impaired LV myocardial function with reduced stroke volume, indicating reduced volume of blood pumped out of the left ventricle during each systolic cardiac contraction. At the same time, LVIDs was reduced, but systolic function increased, indicated by greater fractional shortening and ejection fraction (Fig. 1f). A previously published Ap4e1-null mouse model had motor deficits and neurological defects, which are consistent with phenotypes observed in humans with AP-4-deficiency syndrome24.
Candidate genes causing heart VSD
We used ex situ imaging of the embryo heart in homozygous lethal or homozygous subviable single-gene-knockout mice to identify structural heart defects. Of the mouse lines studied here, 65% were homozygous knockout mice (corresponding to loss of function (LOF) in human) and 35% were heterozygous knockout mice. We observed a variety of structural heart defects in mutants with lethal and subviable phenotypes but most frequently VSD. Homozygous single-gene deletion of Sirt1, Stambp, Casz1, Wfdc2, Tmem161b, Nxn, Dnajc18, Gnao1 and Slc25a1 resulted in substantial structural heart anomalies when compared to control littermate embryos. In young adult heterozygous mice, the same genes caused ECG abnormalities (Sirt1, Stambp, Casz1 and Wfdc2) and abnormal TTE measurements (Tmem161b, Nxn, Dnajc18, Gnao1 and Slc25a1). Complementary gross morphological examination of the embryos confirmed structural cardiac abnormalities in homozygous lethal knockout mice for five additional genes: Zfp503, Ubr4, Furin, Smo and Shox2 (Table 1). In young adult heterozygous mice, the same genes caused ECG (Zpf503) or TTE alterations (Ubr4 and Furin) or both ECG and TTE alterations (Smo) when compared to age- and sex-matched controls (Supplementary Data 1). Young adult heterozygous Shox2-mutant animals had not been tested by ECG or TTE at the time of publication; therefore, no adult data were available.
Table 1.
Gene | Allele | Primary viability | Secondary viability | Adult TTE | Adult ECG | Embryo GM abnormalities | Embryo micro-CT VSD |
---|---|---|---|---|---|---|---|
Sirt1 | tm1b | Subviable | Viable, E18.5 | No hit | rMSSD ↑ | Nervous system, eyes, immune system | |
Stambp | tm1b | Subviable | Viable, E18.5 | No hit | QTc ↓ | All normal | |
Casz1 | tm1b | Lethal | Viable, E14.5 | No hit | ST, QTc ↓ | Bone, cartilage, muscle | |
Wfdc2 | tm1b | Subviable | Viable, E14.5 | No hit | RR ↑ | Respiratory system | |
Tmem161b | tm1b | Lethal | Viable, E14.5 | Stroke volume, cardiac output ↑ | No hit | Nervous system, eye | |
Nxn | tm1b | Lethal | Viable, E18.5 | Ejection fraction, fractional shortening, HR (TTE) ↓ | No hit | Craniofacial morphology, bone, cartilage, muscle | |
Dnajc18 | tm1b | Subviable | Viable, E18.5 | LVIDd, stroke volume, cardiac output ↑ | No hit | All normal | |
Gnoa1 | tm1b | Lethal | Viable, E18.5 | LVIDd, LVIDs, end-systolic diameter, end-diastolic diameter ↓ | No hit | Reproductive system | |
Scl25a1 | tm1b | Lethal | Viable, E18.5 | LVAWd, LVAWs, cardiac output, HR (TTE) ↓ | No hit | Abnormal embryo size, abnormal embryo development, embryonic growth retardation, pallor, abnormal facial morphology, exencephaly, abnormal head size, abnormal head shape, protruding tongue, abnormal limb morphology, abnormal tail morphology | |
Genes positive for abnormal heart morphology in the VSD network | |||||||
Gene | Allele | Primary viability | Secondary viability | Adult TTE | Adult ECG | Embryo GM abnormalities | Embryo micro-CT VSD |
Smo | tm1b | Lethal | Viable, E9.5–E12.5 | HR (TTE) ↑ | PQ ↑; HR ↓; CV, QTc dispersion, ST, rMSSD, RR, HRV, PR ↑ | Abnormal embryo development, abnormal embryo turning | Abnormal pericardium morphology |
Ubr4 | tm1b | Lethal | Viable, E9.5 | LVIDd ↓, ejection fraction ↑ | No hit | None | Heart looping defects |
Furin | tm1b | Lethal | Viable, E9.5 | Ejection fraction, fractional shortening ↓; LVIDs ↑ | No hit | Abnormal brain (forebrain, midbrain and hindbrain), eye morphology, neural tube | Aortic sac, ventricular chamber, atrioventricular canal, heart trabeculation, arterial chamber, dorsal aorta |
Shox2 | tm1b | Lethal | Lethal, E11–E13 | No hit | No hit | Craniofacial defects, wavy spinal cords and edema, especially around the heart | Edema around the heart |
Zfp503 | tm1b | Subviable | Viable, E15.5 | No hit | HR ↓, RR ↑ | Sternum, left lung and right lung, cranial, accessory, caudal and middle lobe anomalies | Ductus arteriosus, pulmonary trunk, heart ventricle |
Characterization of nine genes involved in VSD (top) and five genes associated with abnormal heart morphology (bottom) by target allele, primary and secondary viability, significant (q value set at 0.05) adult ECG and TTE phenotypes, gross morphology (GM) and microcomputed (micro-CT) visualization of fetal hearts (left, healthy control; right, mutant heart with structural VSD). Note that arrows indicate phenotype differences in knockout compared with C57BL/6N mice: increased, ↑; decreased, ↓. HR, heart rate; HRV, heart rate variability; CV, coefficient of variation; RR, interval between successive Rs;rMSSD, root mean square of successive differences; bold (male) and italic (female) text in adult TTE and ECG columns indicates sex effect.
Ex situ screening of embryonic hearts identified structural heart defects in mutants for genes previously associated with CVD (Stambp, Nxn, Slc25a1, Smo and Shox2) as well as hitherto unassociated candidate genes (Sirt1, Casz1, Wfdc2, Tmem161b, Dnajc18, Zfp503 and Ubr4). These 14 genes, all of which when deleted caused a structural VSD in the heart, were further investigated for common elements using CIDeR-based multifactorial interaction network analysis (http://mips.helmholtz-muenchen.de/cider25). This analysis identified associations for an additional 45 IMPC genes from our study and 13 interacting non-IMPC genes associated with structural heart defects (Fig. 2).
Association of Gnao1 with VSD
To exploit the utility of our VSD dataset, we enriched our deep literature curation and the automated text mining tool ‘Geneshot’ (ref. 26) with human VSD data. This analysis identified genes known to be linked to VSDs (SIRT1 (ref. 27), SLC25A1 (ref. 28) and NXN28) and genes with no direct human experimental evidence for VSDs (TMEM161B, WFDC2, SHOX2, GNAO1, UBR4 (ref. 29) and FURIN30) apart from rare case studies that suggest association with structural heart defects for some of these genes. An example is one clinical report of a single patient with neurological disease and a STAMBP mutation with congenital malformations of the ventricular septum31. Further evidence is found in hypermethylation of the WFDC2 promoter in fetal myocardia with VSDs32 and association of an NKX2-5 variant, a gene suppressed by the regulator SHOX2, in a family with autosomal dominant inherited VSDs33 and cardiac arrhythmia34. GNAO1, the α subunit of the guanine nucleotide-binding protein, has no link to VSD but does have an association with infantile epileptic encephalopathy35, a neurological disorder not previously reported to be associated with congenital heart defects. Consequently, our data demonstrate that loss of Gnao1 results in a structural heart defect (that is, VSD) in mice and it should henceforth be considered a VSD gene candidate.
Unbiased large-scale cardiac phenotyping of knockout mouse lines identifies 486 unrecognized genes involved in cardiac function or development
To distinguish genes with a known association with human heart abnormalities from genes with no previously published association, we used comprehensive upstream literature-based Pharos (https://pharos.nih.gov/1) analysis of all 705 genes irrespective of the test modality and type of phenotype abnormality that we identified. This analysis indicated that 178 of the 664 mouse–human orthologous genes (41 nonorthologous genes) were previously linked to heart disease, some of which functionally converge on skeletal and cardiac muscle structural integrity (for example, LDB3)36, cardiac development (BMP10)37, calcium signaling (CACNB2 and DSG2)38,39, sodium channels (SCN2B)40 and sarcomere contraction cycle (TNNC1 and MYH2)30,41. Notably, 486 orthologous genes had not been previously associated with a cardiac condition and likely represent unappreciated candidate genes for CVD and more yet for CHD (Fig. 3a).
Mouse genes associated with CHD phenotypes are often regulated by the transcription factor TBX20
To explore the regulatory elements most important for normal cardiogenesis and therefore likely to play a role in congenital heart defects, we conducted chromatin immunoprecipitation followed by sequencing (ChIP–seq) analysis (ChIP-X enrichment analysis (ChEA) 2016 dataset of publicly available ChIP–seq experiments42). We probed for genes associated with a significant cardiac phenotype for cardio-specific enrichment compared to the 3,189 IMPC knockout genes that had no statistically significant associated cardiac phenotype using an unbiased comprehensive analysis. Genes previously identified as associated with abnormal cardiac function in knockout mice showed strong enrichment (P = 0.004; odds ratio, 1.31; combined score, 7.2) for heart-specific targets of the T-box transcription factor (TBX)20 next to TBX3 and TBX5 (both not significant; Supplementary Table 2), while all 3,189 genes with no significant phenotypes did not. Interestingly, the vast majority of the identified TBX20 target genes (66 of 93, ~70%) have no previous known link to CVD or are poorly annotated genes (13 of 93, ~14%; Fig. 3c). Additional gene expression analysis in both mouse and human hearts (Extended Data Fig. 2a–h) identified higher expression levels for positive genes (mean, 62.8 reads per kb of transcript per million mapped reads (RPKM); minimum, 0.01 RPKM; maximum, 10,303.04 RPKM) than for negative genes (mean, 55.22 RPKM; minimum, 0.01 RPKM; maximum, 3,270.97 RPKM) averaged across different developmental stages, stratified by ECG or TTE (Supplementary Data 2).
Mouse genes associated with CHD phenotypes are enriched in congenital heart defect variants in human patients
With TBX20 enrichment confirmed, we further explored the translational advantage and whether our single-gene-knockout mouse mutants can provide important information on the contribution of individual gene variants to CHD and help to prioritize variants that are highly likely to contribute to or cause CHD. Recognizing that many CHDs have a complex profile of genetic mutations, we nevertheless focused on two types of mutations in this comparison, de novo mutations and LOF variants. The latter are LOF mutations that most closely resemble a single-gene knockout, whereas de novo mutations are those that appear in an individual despite not being identified in their parents. This is where the single-gene-knockout mouse is particularly useful. Here we provide additional evidence for the utility of unknown LOF candidate genes for CHD in mice to translate basic research data to inform prioritization of inherited but undiagnosed mutations in patients with CHD. We used ortholog matching between our set of 486 unknown LOF CHD candidate genes and two large-scale sequencing studies of patients with CHD and their controls: the US Pediatric Cardiac Genomics Consortium (PCGC, https://benchtobassinet.com)43 and the UK 100,000 Genomes Project (100KGP, https://www.genomicsengland.co.uk/)44,45.
Focusing on the 486 genes that our data and analysis designate as unassociated with CVD, alignment showed that 213 of them (44%) overlapped with human de novo and/or LOF gene alleles. More precisely, 40 genes with LOF mutations and one gene with a de novo mutation (UST) were aligned to data in the 100KGP, whereas 168 genes with LOF mutations and 55 genes with de novo mutations were found in the PCGC dataset (Fig. 4a). Because of their rarity, detection of de novo LOF mutations can be a powerful way to discover unappreciated CHD risk genes1. Here we present five representative genes causative of structural heart abnormalities in different stages of embryo cardiac development (Casz1, Dnajc18, Pde4dip, Rnf38 and Tmem161b; Fig. 4b). The gene PDE4DIP46, encoding a scaffold protein, and CASZ1, a gene that encodes a zinc finger transcription factor essential for normal cardiovascular morphogenesis47, have been identified as both a de novo and an LOF variant in human studies and could be described as ‘de novo LOF mutations’. Heterozygous Pde4dip-mutant mice had decreased ventricular inner dimensions (LVIDd and LVIDs) with increased fractional shortening but normal ECG, whereas homozygous LOF deletion of Pde4dip caused early lethality, attributed to ventricular hypertrophy. Casz1-heterozygous mice had shortened QTc and ST intervals, indicating abbreviated ventricular conduction time, whereas homozygous loss of Casz1 caused structural defects in the heart atrium, the mitral valve and the heart ventricle, VSD and early embryo lethality. The RNF38 (encoding ring finger protein 38) gene is poorly characterized and not known to be associated with human disease. However, we were able to detect prolonged QRS, ST and QTc intervals in heterozygous Rnf38 mutants, potentially indicating ventricular conduction delay and abnormal depolarization, while homozygous loss of Rnf38 resulted in ventricular hypertrophy and early embryo lethality. The TMEM161B (encoding transmembrane protein 161B) gene has no hitherto direct experimental evidence for involvement in CHD except for preliminary data of a locus including TMEM161B identified in a cognitively impaired patient with comorbidities that included VSD48. In mice, heterozygous Tmem161b mutants had increased stroke volume and cardiac output that was more severe in females, whereas homozygous loss of Tmem161b caused manifold structural malformations affecting the vena cava, atria, ventricles and the mitral valve in addition to VSD. Another poorly characterized gene, DNAJC18 (DNAJ heat-shock protein family (HSP40) member C18), encodes a type III HSP40–DNAJ protein family member that is involved in endoplasmic reticular protein transfer and degradation49. Endoplasmic reticulum stress-associated pathways such as the unfolded protein response pathway play an important role in cardiac pathophysiology50. However, heart abnormalities in the absence of DNAJC18 have not been reported. Here, heterozygous Dnajc18-mutant mice had enlarged LVIDd with enhanced stroke volume and cardiac output, indicating slight LV dilation, while homozygous deletion (Dnajc18−/−) caused septal wall defects, severe structural cardiac malformations and early embryo death.
Notably, RCN3 (encoding reticulocalbin 3) is a poorly characterized gene preliminary associated with lung cancer35,51. However, heterozygous Rcn3-mutant mice had impaired LV inner dimensions (LVIDd and LVIDs) with increased LVPWs, while homozygous deletion of Rcn3 caused immediate death at birth from compressed hearts and respiratory distress. This gene was not identified among the genes carrying de novo or LOF mutations in the 100KGP and PCGC studies, but structural defects in homozygous and heterozygous mutants were compelling, indicating that this may be a previously undiscovered CHD candidate gene.
UK Biobank highlights potential mouse-to-human translational targets
We further aimed to identify genes with a role in healthy and abnormal cardiac function in mice that could inform about human cardiology and CVD. We used the UK Biobank and matched our genotype–phenotype information from ECG and TTE. Matching with structural heart defects is not applicable in this database because it is inclusive of healthy individuals.
Association testing for ECG parameters was performed based on extracted data from precomputed summary statistics of the pan-ancestry UK Biobank dataset (from the Neale laboratory, http://www.nealelab.is/data) for three phenotypes: ventricular rate (a surrogate of the heart rate), P duration and QRS duration (UK Biobank phenotype codes 12336, 12338 and 12340, respectively). For all genes, we assessed both strict boundaries of each gene (genome build 37) and a longer sequence of the same gene ±500 kb. The greatest significance was observed in variants of ARHGAP24, SMARCB1, ASPA, FAM53B and ABHD17B (Supplementary Table 3a). Further analysis of the gene sequence alone identified variants in SETBP1 (for example, 18:42,424,594[C:T]; P = 6.393 × 10−17) and VTI1A (for example, 10:114,505,465[G:A]; P = 1.770 × 10−13) that had significant associations with QRS duration. Eight additional genes showed suggestive P values (defined as P < 5 × 10−6 and P > 5 × 10−8) across all three phenotypes. Analysis of the extended sequence (gene ± 500 kb) increased the number of genes showing suggestive associations to 42, and variants in or near two additional genes showed genome-wide significant associations with QRS duration: PRAG1 (for example, 8:8,501,230[A:G]; P = 7.572 × 10−9) and PCYOX1L (for example, 5:149,165,478[C:CT]; P = 3.479 × 10−8) (Supplementary Table 3b). Interestingly, we have mouse-to-human phenotypes, even for single phenotypes, that are identical, for example, Cd300e and CD300E (heart ventricular rate) or Fndc3b and FNDC3B (QRS interval length). Other genes have an associated ECG phenotype but not the same one in mice and humans, for example, Tyms (QRS) and TYMS (ventricular rate), and a few genes have an associated TTE phenotype in mice and an associated ECG phenotype in humans (Supplementary Table 3c).
Association tests for TTE parameters were carried out based on LV end-diastolic and systolic volumes (LVedv, LVesv), LV stroke volume (SV), SV index (SV adjusted for body surface area), cardiac output and cardiac index. In addition, three additional non-cardiac parameters were included such as body surface area, age and sex for genomes from the UK Biobank database with a minimum of five of the six selected TTE parameters. Five nonsynonymous coding variants were associated with a significance threshold of P < 1 × 10−5 (Supplementary Table 3d), three of which are 3′ and 5′ untranslated region variants for DNAJC18 (Supplementary Table 3e). To assess noncoding consequences, we performed an overlap of qualifying variants associated with significant expression quantitative trait loci (eQTL) for the genes of interest at P < 1 × 10−5 for each magnetic resonance imaging (MRI) feature in each tissue from the GTEx expression database (see Methods for details). We found overlap for 50 different gene–trait pairs. The eQTL data confirmed suggestive evidence of an association between DNAJC18 LOF and increased LV end-systolic volume and decreased systolic volume index. An additional mouse–human alignment with LV image-derived phenotypes from UK Biobank52 confirmed this association between DNAJC18 LOF and LV systolic function. This analysis also revealed same-genome-wide associations with LV end-systolic–diastolic volume for five genes (FNDC3B, SMARCB1, HILPDA, ALPK3 and FBLIM1) and an association between CDSN loss and increased stroke volume. Interestingly, we have mouse-to-human phenotypes, even for single phenotypes, that are identical, for example, in Alpk3 and ALPK3 (LVIDd, LVIDs, LVedv and LVesv) or in Hilpda and HILPDA (LVIDs and LVesv). Alpk3 is of particular interest because this gene is involved in cardiomyocyte differentiation and is associated with several forms of cardiomyopathy17. Some mouse genes, however, have ECG data but no TTE data, and thus phenotypes cannot be the same in mouse–human comparisons (Supplementary Table 3f).
Study limitations
We generated and analyzed 3,984 IMPC single-gene-knockout mouse lines on a C57BL/6N inbred background. The study focus is to access, through systematic genotyping–phenotyping in mice, a landscape of previously unknown or poorly described genes playing a role in CHD with great translational potential. We have data on so-called proof-of-concept genes but by no means data on all known cardiac genes such as Ttn53 and Scn5a54. Analysis and description of the complete data quantity was beyond the feasible scope of the presented work. However, all production–genotype data and phenotyping results (raw and statistically analyzed) are freely available (http://ftp.ebi.ac.uk/pub/databases/impc/all-data-releases). Intersection of human and mouse genes was limited primarily due to the lack of CHD gene saturation in both reviewed human studies, the mouse data we generated and our selection of LOF and de novo variants.
Discussion
Identification of the genes, mutations and their mechanisms of action that cause and/or contribute to human inherited or de novo CHD is far from complete. We used in vivo ECG and echocardiography of young adult mice and high-resolution micro-CT imaging of embryos to identify 705 single-gene-knockout mouse lines with cardiac abnormalities. Our analysis of this gene set suggests that 75% of the genes have not been previously identified as genes associated with CHD and confirms that 25% are known cardiac genes, an important validation of our methodology to screen for CHD phenotypes and a requisite step forward in identifying genes that warrant further investigation for their role in human CHD.
We identified 297 genes that, until now, were not connected to electrical conduction disorders in the research and clinical literature. Single-gene mutation of each of these genes in knockout mouse lines resulted in significantly abnormal electrical conduction periods (that is, altered ECG intervals). Examples include Gatm, Cap2 and Elmod1, which had a range of ECG abnormalities. The product of GATM is known to facilitate creatine synthesis; therefore, it is probable that abnormal creatine synthesis in cardiomyocytes impairs the dynamic requirements for cardiac excitation conduction55 in homozygous Gatm-knockout mice. The abnormal depolarization pattern observed in heterozygous Cap2-mutant mice may be reflective of an impairment in the contractility of cardiac muscle cells56. By contrast, Elmod1 is a poorly characterized gene, only recently associated with hair cell stereocilia dysmorphology and deafness in mice12. Mutations in the lectin complement pathway (for example, MASP1) cause the rare autosomal recessive Carnevale, Mingarelli, Malpuech and Michels syndromes (3MC) that have a spectrum of developmental phenotypes but rarely include cardiac anomalies57. Homozygous Masp1-knockout mice, however, were viable and showed ventricular conduction delay with abnormal depolarization. Adding Elmod1 and Masp1 to the list of genes that are known to influence the heart’s electrical conduction provide unappreciated candidates for investigation.
A comprehensive assessment of all 297 genes that we identified with abnormal ECG in mutant mice is beyond the scope this study, but, given the examples above, it is very likely that this set of 297 genes represents a important resource of unknown genes involved in the cardiac conduction system, some of which may have translational potential for clinical diagnosis and therapeutic approach. These genes may be shown to have direct impact on the action potential or an impact on the genetic signaling pathways involving these genes in the development of cardiac conduction system diseases, such as sinus or atrioventricular node dysfunction or altered sub-Hisian conduction.
Our cardiac phenotyping screen also identified 297 single-gene-deletion mouse lines with abnormal dimensions and/or function of the LV myocardium (that is, altered TTE parameters). None of these genes have published links to hypertrophic and/or dilated cardiomyopathies. For example, Fzd8- and Leprotl1-mutant mice had distinct TTE features of hypertrophic cardiomyopathy. The known contribution of Fzd8 to canonical β-catenin signaling and the WNT pathway15 is a likely mechanism to explore and potentially assign to the variant Fzd8. By contrast, the hypertrophic cardiomyopathy that we identified in Leprotl1-mutant mice suggests functions of this gene in addition to its known role in growth (that is, GH in the liver) and metabolism16. Many genes in this set are very poorly understood, yet single-gene mutation in mice and focused cardiac phenotyping identified abnormalities consistent with cardiomyopathy. Deletion of Ints6l or Abhd3, for example, resulted in dilated LV cardiomyopathy with decreased systolic function. Ints6l is a gene with limited information, almost no functional annotation and no known disease association before our identification of a heart phenotype consistent with cardiomyopathy. The product of ABHD3 is known to play a multi-faceted role in the catabolism of medium-chain phospholipids, previously confirmed in an Abhd3-knockout mouse model with elevated myristoyl (C14)-phospholipids, including the bioactive lipid C14-lysophosphatidylcholine58. Here, we provide evidence for the mode of action of Abhd3 in the myocardium. Conduction system deterioration in young adult heterozygous Smo-mutant mice was also intriguing. Smo is known to encode a G protein-coupled receptor that interacts with the patched protein, a receptor for Hedgehog proteins and a fundamental actor in embryonic development and postnatal tissue homeostasis59. Smo is also a gene that has been long debated in the context of heart disorders, particularly for its relevance in the Hedgehog signaling pathway and thus cardiac disorders. Our in vivo data show that diminished expression of Smo severely affects the electrical conduction system with prolonged signal from sinus node to the ventricles in structurally normal hearts, a previously unidentified association with a monogenic form of primary conduction disease that may lead to an expansion of gene function. Interestingly, homozygous E9.5 Smo-knockout mice died at or around E9.5 from multiple abnormalities that included craniofacial defects and malformation of ventricles and atria (Supplementary Fig. 1). These multiple abnormalities strongly suggest that the product of Smo has essential and pleiotropic roles in organogenesis and normal cardiac conduction system development during early embryonic development. Surprisingly, we found no link between cardiac disease and RCN3 in human studies. Its mouse ortholog, however, showed a very compelling association with LV dilatation in heterozygous null adult mice and with compressed hearts with lethal respiratory distress in homozygous neonates. Reticulocalbin 3 is encoded by a poorly characterized gene with an unvalidated association with lung cancer35,51. From our work in mice, Rcn3 is a previously undiscovered cardiac gene with an important role in cardiac development and myocardial remodeling. This gene should be the subject of further research and be included in other studies of the genetics of CHD. Systematic mouse phenotyping can complement the human data and identify genes not previously associated with CVD. Embryonic heart diagnostics of mouse embryos from knockout mouse lines that were embryo lethal or neonate subviable in particular provide a valuable resource for discovering genes causal to structural heart defects.
This approach enabled identification of cardiac phenotypes in lethal and subviable ‘essential’ genes with no previous association with normal heart development or CHD. We identified a variety of structural heart defects, with the most highly prevalent being VSD, a congenital abnormality affecting the wall dividing the left and right ventricles5. We confirmed that mutations in mouse genes that resulted in VSD are networked with known transcription factors (for example, NKX2-5 or TBX20 (refs. 60,61)), chromatin-modifying genes regulated by processes essential for cardiogenesis (for example, Hdac1 and Smarcb1 (ref. 62)) and functional pathways (for example, smoothened and WNT signaling63). The connectivity of mouse genes associated with the VSD phenotype with key components of cardiogenesis offered a plausible rationale for the cardiac anomalies observed in this study. As an example, homozygous Gnao1-knockout mice produced by the IMPC had VSD, although a different allele of Gnao1 previously showed its involvement in the regulation of Ca2+ channels in the heart with no VSD64. VSD in Gnao1-null mutants was not previously reported. In humans, GNAO1 (encoding guanine nucleotide-binding protein G(o) subunit α) is associated with infantile epileptic encephalopathy35, but our mouse data suggest that GNAO1 is an unappreciated candidate gene involved in VSD in humans.
Analyses of regulatory networks enriched across our entire set of mouse genes with cardiac phenotype(s) showed strong enrichment of TBX20 targets among genes previously identified in knockouts with an impaired electrical conduction system or dysmorphic myocardium. TBX20 target enrichment added unknown functional evidence to support association of our gene set with abnormal cardiac structure phenotypes but no prior association with heart abnormalities or CHD. TBX20 is one of the critical regulators of heart development across different species65–67 and is associated with human CHD and adult cardiomyopathies68–71. Intersectional analysis of the 486 mouse genes and their human orthologs showed that the majority are previously unknown heart genes. We showed overlap between our genes and those of patients with CHD from the US PCGC43 and the UK 100KGP44,45. From this data-intersection analysis, we selected five genes with clinical human variants of unknown relevance (VUR) and showed that mutations in Casz1 (zinc finger protein castor homolog 1), Dnajc18 (DNAJ homolog subfamily C member 18), Pde4dip (myomegalin), Rnf38 (E3 ubiquitin-protein ligase RNF38) and Tmem161b (transmembrane protein 161B) are pathogenic in the mouse heart. Deletion of all five genes caused severe structural defects in the heart that resulted in the death of mutant embryos. This experimental body of evidence suggests that these genes should be prioritized as pathogenic candidates in human sequencing studies.
In the UK Biobank, although biased for selection of older adults (40–69 years) and ‘healthy volunteers’ (ref. 72), we also found evidence of association between homologs of 31 genes from TTE and 42 of our genes of interest from ECG and MRI features of cardiac function. This particularly highlights these genes as potential targets for mouse-to-human translation and warrants further investigation into their role in healthy heart function and cardiac disease. One association that we highlighted was the association between LOF of DNAJC18, a gene classified as VUR in the PCGC and the UK 100KGP, and increased LV end-systolic volume and decreased systolic volume index. Likewise, the association of SIRT1 with alterations in QRS interval length in the UK Biobank matches the results of the enrichment analysis shown in ‘Hairball’, where SIRT1 was directly associated with heart diseases alongside 11 other genes from our study. Overall, genes associated with TTE phenotypes in our dataset are associated with the same phenotypes in the UK Biobank and therefore strengthen our findings from the IMPC mouse models. We noted that the overlap analysis with human association data was performed only for single studies. With the release of large-scale whole-exome-sequencing data, such as UK Biobank whole-exome sequencing, gene-based datasets for rare variant analysis are beginning to emerge. However, these datasets are not yet easy to query (https://genebass.org/73), which prevents efficient analysis. Enabling programmatic access is a key driver of large dataset adoption and should be a priority for large biobank analysts.
To date, over 500 human genes have been linked to CHD, including those encoding transcription factors, cell signaling molecules and structural proteins important for heart development. The large number of gene variants identified thus far clearly shows that diverse mechanisms lead to congenital heart defects6. Nevertheless, many variants are still considered to be of unknown relevance (for example, VUR), and de novo variants are emerging. Inherited cardiac disease is a generic term covering a wide variety of relatively rare diseases of the heart, mainly divided into disorders that are primarily arrhythmogenic with no obvious changes in structure and diseases that are associated with structural change, such as dilated or hypertrophic cardiomyopathy74. Knowledge of genetic contributions to congenital arrhythmias has made many advances, especially in heritable monogenic diseases (for example, long QT); however, the genes causing or contributing to a substantial proportion of congenital monogenic cardiac conditions have yet to be discovered75.
We believe that the set of 705 genes included in this work is fundamentally useful to the cardiovascular community by identifying a substantial number of human–mouse orthologous genes that, when individually mutated (one mutation per mouse line), result in (but not necessarily cause) a cardiovascular phenotype. The subset of 486 genes with no previously reported association or role in CHD or more yet in CVD (Fig. 5), regardless of allele type, provides an unreported gene set of potential interest to the cardiovascular community. Additionally, LOF is by far the best-studied mechanism in human patients with CHD with and without developmental disabilities. Currently available datasets from the field of human genetics have helped uncover several genes with heterozygous de novo LOF variants, which to date have shown the strongest association and effect size with syndromic and nonsyndromic CHD. Nevertheless, estimates of these studies in the field69,76 can only explain about 10–40% of cases, with a higher proportion in those with neurodevelopmental disorders. Although some of these studies have examined biallelic variants, current sample sizes are often too small to detect reliable associations. Detailed phenotypic studies are needed to further unravel the underlying mechanism and reveal milder and stronger phenotypic associations depending on the background allele. These further studies on genes identified here will substantially augment the usefulness of this resource to uncover VUR and unappreciated CHD genes. Therefore, these mice are a starting point for the cardiovascular community to validate human findings and expand detailed downstream molecular investigation of genes available to the community to promote curative strategies.
Methods
Ethics approval and consent to participate
All IMPC institutes that breed mice and collect phenotyping data are guided by their individual ethical review, licensing and accrediting bodies, reflecting the national legislation under which they operate. Cardiovascular mouse phenotyping was carried out under the auspice of the following animal protocols: Baylor College of Medicine (AN-5896), the German Mouse Clinic Helmholtz Zentrum München (144-10 and 15-168), the Institut Clinique de la Souris Mouse Clinical Institute (4789-2016040511578546v2), Medical Research Council Harwell (97/3384), Nanjing University (NRCMM9), the Rikagaku Kenkyūjo Tsukuba Institute (Exp11-011, 12-011, 13-011, 14-009, 14-017, 15-009 and 16-008), the Centre for Phenogenomics (0153, 0275, 0277 and 0279), Jackson Laboratory (11005) and the University of California Davis (20863).
The International Mouse Phenotyping Consortium
The IMPC is a collaborative, inter-institutional research initiative of 12 major research centers across Europe, North America and Asia77–79 that aims to generate and phenotype a null mutant for every protein-coding gene in the mouse genome (https://www.mousephenotype.org/)51,77,80. This IMPC program involves the systematic and standardized characterization of more than 20,000 knockout mice strains carried out under the uniform operating procedures detailed in the International Mouse Phenotyping Resource of Standardized Screens (IMPReSS), which was developed and validated during the pilot programs EUMORPHIA and EUMODIC81,82.
IMPC gene nomination
Each IMPC center selects its genes freely; some make selections according to the preferences of funders, and others select based on scientific interest, a colocated receptor community and center-specific special focus. To this end, some centers have an organized network of experts to nominate genes of particular interest; other centers chose clones that were available when only targeted embryonic stem cells were available during the initial phase of the IMPC. The only rule is to avoid duplication and therefore not nominate genes more than once. Genes that are already known and described in their full function and disease association are of less interest. The IMPC mission focuses on characterizing many of the poorly understood genes (the ‘ignorome’) that have little to no information and publication. This strategy has made substantial contributions to our understanding of mammalian gene function in terms of sexual dimorphism83, pleiotropy77 and disease79,84,85.
The breadth of original data identifies previously uncharacterized disease genes in the mouse and additional phenotypes for genes with existing mutant lines mimicking the associated disorder.
There is also a feature on the IMPC website (https://www.mousephenotype.org) with which the public community can express interest in specific genes and actively participate in the nomination process.
IMPC data collection
The IMPC has 21 international phenotyping centers that use high-throughput pipelines to phenotype knockout mouse strains. These centers are generating homozygous knockout mouse strains for every protein-coding gene in the mouse genome, producing seven males and seven females per gene.
After weaning, viability of offspring was monitored. When homozygous mice were viable, the animals were phenotyped following the procedures as indicated in the early adult (EA) pipeline. When homozygous mice were not viable, additional homozygous mice entered the embryo pipeline and the heterozygous mice, when viable, were phenotyped following the EA pipeline. For sample sizes, 14 homozygous knockout mice (seven female and seven male mice) were phenotyped for each gene. Wild-type mice were continually tested using the same protocols to provide baseline ‘normal’ values.
The phenotyping pipeline generated multiple types of data. Phenotype parameters were continuous data (for example, cholesterol levels) or qualitative values (for example, presence or absence of cataracts). Additionally, images (for example, X-rays, echocardiograms, LacZ staining, histology) were collected as part of the protocols. Statistical analysis was performed to identify outlier values. These outliers were then annotated with terms from the mouse phenotype ontology, for example, high circulating cholesterol. This text contained data from the EA pipeline. This pipeline contained a wide range of in vivo phenotyping procedures carried out from 9 to 15 weeks with body weights measured weekly from 4 to 16 weeks (detailed information on the ‘IMPC EA pipeline’ is provided at https://www.mousephenotype.org/impress/PipelineInfo).
Cardiovascular data were gathered at week 12 with ECG and/or echocardiography. Some procedures were mandatory for all centers, assuring key data were provided (see https://www.mousephenotype.org/impress/index for high-level detail).
At the time when this study was carried out, the following centers provided ECG and TTE data: Baylor College of Medicine, the Helmholtz Centre Munich, MRC Harwell and the Institut Clinique de la Souris. ECG data were provided by Jackson Laboratory, the RIKEN Tsukuba Institute BioResource Center, the Toronto Centre for Phenogenomics and the University of California, Davis. In addition, TTE data were provided by Nanjing University. These centers were not yet involved in the data release (DR)10.1 data collection for this publication: the Wellcome Trust Sanger Institute, the Czech Centre for Phenogenomics and the Korea Mouse Phenotyping Center.
Mouse phenotyping
We used data collected from the IMPC phenotyping pipeline including cardiovascular screening, carried out at 12 weeks of age, to determine cardiovascular health using two different procedures, namely, high-throughput electrocardiogram recordings of conscious mice if not stated differently (IMPC_ECG_001 and IMPC_ECG_002) and TTE recordings of anesthetized mice if not stated differently (IMPC_ECH_001) to assess the morphology and functionality of the heart.
Electrocardiography recording
As even modest handling of mice may induce alterations in heart rate86, each mouse was permitted to acclimatize on the ECG recording platform for 10 min before measurement. Furthermore, cage mates were placed on the adjacent platform unit to provide company. Electrocardiograms were recorded in a dim and quiet analysis room. To eliminate circadian influences, electrocardiograms were recorded at the same time of day. A disposable lead plate (Mouse Specifics) was embedded in the floor of the platform and spaced to provide contact between the electrodes and animals’ paws, providing an ECG signal equivalent to that of Einthoven lead II. Only runs in which at least 15 ECG beats could be included in the analysis were chosen. Data were analyzed using standard protocols for ECG signal analysis by eMouse (Mouse Specifics). The software uses a peak-detection algorithm to find the peak of R waves and to calculate heart rate. The software plots its interpretation of P, Q, R, S and T for each beat so that heart rate, QRS duration, PQ interval, PR interval, QT interval and ST interval are measured and reported automatically. In addition, each trace was examined for clear P, Q, R, S and T peaks before accepting the automatic calculations. Sensitivity was corrected manually in case R peaks were not chosen correctly. Noise and motion artifacts are rejected automatically by the software. As in mice the T wave often merges with the final part of the QRS complex87, the software automatically defines the end of the T wave of each signal as the point where the signal intersects the isoelectric line. HRV was calculated as the mean of the differences between sequential heart rates for the complete set of ECG signals. QT intervals were rate corrected (QTc) by applying the equation recommended by Mitchell et al.88. Detailed experimental protocols for the IMPC phenotyping procedures are available for general access at https://www.mousephenotype.org/IMPReSS.
Transthoracic echocardiography recording
Body weights were taken shortly before TTE. For TTE recordings of anesthetized mice, the animal was placed in an induction chamber and anesthetized with gaseous anesthetic. Once the animal was sedated, it was moved to a nose cone for hair removal in the area of measurement. Once the hair was removed, the animal was moved to the imaging platform, its paws were taped to the ECG lead plates, and a rectal probe was inserted to monitor body temperature, which was maintained at 36–37 °C. During imaging, anesthesia was adjusted to maintain proper heart rate and prevent the animal from waking up. For TTE examination of awake animals, mice were firmly held by the nape of the neck (in the supine position) in the palm of one hand with the tail held tightly between the last two fingers. Prewarmed ultrasound gel was placed on the chest at the area of imaging, and TTE recordings started in short-axis mode with papillary muscles being the point of reference. Images were obtained in parasternal long-axis B mode and short-axis views in M mode with at least three images per mode. Once imaging was complete, the animal was removed from the platform and allowed to recover on top of a heating pad.
Qualitative and quantitative measurements were made offline using analytical software (VisualSonics). LVIDs, LVIDd, systolic and diastolic interventricular septum thickness (IVSs and IVSd), LVPWs and LVPWd were measured in three consecutive beats according to the American Society of Echocardiography leading-edge method89 as a measure of the actual visualized thickness of the ventricular septum and other chamber dimensions as defined by the actual tissue–blood interface. Papillary muscles should be excluded from the cavity in the tracing. Fractional shortening was calculated as percent fractional shortening = ((LVIDd − LVIDs)LVIDd−1) × 100. Ejection fraction was calculated as percent ejection fraction = 100 × ((LVvold − LVvols)LVvold−1) with LVvol = ((7.0(2.4 + LVID)−1 × LVID3), where LVvold is end-diastolic volume and LVvols is end-systolic volume.
The corrected LV mass was calculated as 0.8(1.053 × ((LVIDd + LVPWd + IVSd)3 − LVIDd3)). SV is the volume of blood pumped from one ventricle of the heart with each beat. The stroke volume of the left ventricle was obtained by subtracting LVvols from LVvold. Heart rate was determined from the cardiac cycles recorded on the M-mode tracing, using at least three consecutive systolic intervals. In addition, respiratory rate was calculated by measuring three consecutive respiratory intervals.
Importantly, each center records relevant metadata parameters including equipment manufacturer, equipment model, recording environment, anesthetic agent and anesthetic dose.
Statistical methods
The workflow can be described in two major stages: (1) data collection (in Supplementary Methods) and (2) statistical analysis.
The statistical analysis complies with the IMPC statistical pipeline and implementation in R90 (version 3.4.0) and applies the R packages SmoothWin91 and PhenStat92–94. Outcomes of the statistical pipeline, including 57,500+ analyses and results, are then assigned a mammalian phenotype term using the standard definitions in IMPReSS (https://www.mousephenotype.org/impress/index). In brief, the Mammalian Phenotype Ontology is a community effort to provide standard terms for annotating phenotypic data (http://www.informatics.jax.org/vocab/mp_ontology). This browser can be used to view terms, definitions and term relationships in a hierarchical display (http://www.informatics.jax.org/userhelp/VOCAB_mp_browser_help.shtml).
Statistical results as well as the raw data and the assigned mammalian phenotype terms are available at https://zenodo.org. Windowed and standard (non-windowed79) analyses for the IMPC data remain comparable on this web interface (https://zenodo.org).
Human and mouse orthologs
To obtain human orthologs for mouse genes, we applied the protein-coding file from the HGNC website (20190617) and the HCOP file (human-to-mouse orthologs 20190617) (fixed for 12 unique inference methods; that is, methods that appear multiple times were counted only once). We considered good-quality orthologs as ‘protCod_score ≥ 5_max’ and defined ortholog-inference score ≥5 (that is, the inference is supported by five or more methods); both directions (mouse-to-human and human-to-mouse) with ‘max’ score (filtering out ‘dup_max’ and ‘no_max’ scores) had a total of 16,882 inferences, included one-to-one and one-to-many orthologs.
IMPC embryonic data
Knockout mouse strains differ in viability, with up to one-quarter of mutations being lethal. Each IMPC gene-knockout strain is assessed for viability by examination of litters produced from mating heterozygous animals. A mutation is declared lethal if no homozygous null pups are detected at genotyping age (P13–P17), while it is declared subviable if homozygous null pups constitute less than 12.5% of the progeny. For lethal mutations, embryos are phenotyped in the embryonic or perinatal lethal pipeline.
Preparation of mouse embryos for micro-CT imaging and phenotyping was described previously51. Briefly, pregnant timed mated dams were euthanized according to an approved protocol from the local Institutional Animal Care and Usage Committee (IACUC). Embryos were dissected and collected in warm 1× PBS, while yolk sacs from each embryo were collected for genotyping. Each embryo was immersed in individual sample tubes with a sufficient amount of ice-cold 4% paraformaldehyde and fixed at 4 °C (E8.5–E15.5, overnight; E18.5, 3 d). After fixation, a hydrogel-based tissue-scaffolding protocol (STABILITY) was used to prevent sample shrinkage and deformation for embryos at E15.5 and beyond51,95,96. To scaffold the embryo with hydrogel, each sample was transferred to a 50-ml conical tube, immersed in 20 ml STABILITY buffer and incubated at 4 °C for 3 d to allow the polymer to diffuse through the embryo. Samples were then placed in a desiccator to remove the air in the sample tubes with a bench-top vacuum for 10 min, followed by purging with nitrogen gas at 10 psi for 5 min. The cross-linking reaction was initialized by incubating samples at 37 °C for 3 h. After cross-linking, the external hydrogels were removed from the specimens, and the samples were stored in 1× PBS with 0.1% (wt/vol) sodium azide at 4 °C until ready for imaging. Iodine solution (0.1 N) was used to contrast the soft tissue of the embryo for micro-CT imaging. Samples were immersed in iodine solution for staining on a nutator at room temperature. The minimum required staining time depends on the stage and size of the embryos (E8.5–E15.5, overnight; E18.5, 3 d). Embryos were then mounted in 56-mm capped sample tubes in 1% (wt/vol) agarose immediately before imaging. The raw data for three-dimensional imaging of the samples were acquired with a SKYSCAN 1272 micro-CT scanner (Bruker) along the anterior–posterior axis, and acquired projection images were then reconstructed by NRecon Reconstruction (version 1.6.9.8, Bruker) software. Image-processing software CTVox (Bruker) and Slicer (https://www.slicer.org/) were used for rendering the reconstructed three-dimensional volume data and data analysis.
Briefly, among lethal lines, approximately 50% of animals die before E9.5, 15% are viable at E9.5 and lethal before E15.5 and the remaining 35% are usually lethal prenatally between E15.5 and birth. Among our 486 genes of interest, 321 (66%) were viable and their corresponding lines will not be subjected to embryo phenotyping and 24% were lethal. Based on previous IMPC distributions, of the 77 lethal lines, we would expect 50% to be lethal before E9.5, leaving 38 genes for which we would expect to obtain embryonic imaging data. Among these, we would expect data for six lines at E9.5 (15% of lines viable at E9.5 are then lethal at E12.5 or E15.5) and E15.5 or E18.5 data for the remaining 32 lines. For the subviable lines, we would expect data at E18.5 for 29 genes. Thus, even when starting with a large collection of lines, only a limited set would have embryo data due to operational and biological reasons.
UK Biobank database
We extracted data for LVedv, LVesv, SV, SV index (SV adjusted for body surface area), cardiac output and cardiac index. In addition, three additional non-cardiac parameters were examined such as body surface area, age and sex for genomes of 39,624 individuals in the UK Biobank database when at least one phenotype was not missing. We averaged values over the second and third visit to minimize missingness and adjusted MRI traits for age and sex as well as LVedv and LVesv for body surface area. We restricted the analysis to complete cases only and excluded adjusted outlier values above 200, 4 and 10 for both end volumes, cardiac index and cardiac output, respectively. We performed step 1 of REGENIE 2 version 2.2 using the 20 first genetic principal components on the directly typed version 2 release of UK Biobank filtered for MAF (0.01), MAC (100), HWE (P > 1 × 10−15) and a sample and genotyping rate of 0.9. To build the set of qualifying variants for which association tests are performed, we queried the Ensembl REST API Homology endpoint and selected only human genes with a one-to-one homology status with the genes of interest. We extracted gene coordinates using the Ensembl REST API Lookup endpoint and extended them by 1 Mbp on either side. The resulting regions were merged using bedtools merge and extracted from the imputed version 2 dataset, restricted to the samples present in the phenotype file using plink version 2.00a3LM AVX2 Intel (4 August 2021). Association was carried out on the resulting 13,193,809 variants using step 2 of REGENIE version 2.2, and suggestively significant variants (log10 (P > 5)) were extracted using R data.table (Supplementary Table 3).
Exon coordinates were queried using the Ensembl API Lookup endpoint, and exonic variants were identified using bedtools intersect. The consequences of the resulting variants were assessed using the Ensembl REST API VEP endpoint. For eQTL mapping, we downloaded GTEx version 8, mapped IDs to build 37 coordinates using the ‘2017-06-05_v8’ lookup table and subset the data to only include eQTL for the genes of interest. We excluded variants for which eQTL direction was not concordant across tissues for a given gene and merged these eQTL data with the REGENIE output. We aggregated variants by gene and classified them based on whether eQTL and GWAS directions were concordant or discordant and computed the resulting proportion. For all genes considered, most variants (77% at the lowest) agreed on either concordance or discordance, meaning that a positive effect on expression clearly correlated with an increased (for concordant genes) or decreased (for discordant genes) value of the measured trait. We also report, for every gene–trait pair, the minimum association P value found for any variant that is also an eQTL (Supplementary Table 3).
Reporting Summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
Hands-on mouse IMPC phenotyping has been carried out by a large number of laboratory staff with superb experimental skills and unsurpassed dedication. We profoundly apologize for not being able to name all these individuals due to space limitations. M.H.d.A., V.G.-D. and H.F. thank all past and present GMC technicians and animal caretakers for their outstanding work. J.D.H. thanks the French National Centre for Scientific Research (CNRS), the French National Institute of Health and Medical Research (INSERM), the University of Strasbourg and all members of the Mouse Clinical Institute (MCI-ICS) for their help and helpful discussions during the project. R.E.B., S.A.M., J.K.W. and K.L.S. thank the members of the Jackson Laboratory Research Animal Facility and Laboratory Informatics Team for their outstanding support. This research was made possible through access to the data and findings generated by the 100KGP. PCGC data were generated by the PCGC under the auspices of the National Heart, Lung, and Blood Institute’s Bench to Bassinet Program (http://www.benchtobassinet.org/). The PCGC program is funded by the National Heart, Lung, and Blood Institute, the National Institutes of Health (NIH), US Department of Health and Human Services through grants U01HL098123, U01HL098147, U01HL098153, U01HL098162, U01HL098163 and U01HL098188. This text was not prepared in collaboration with investigators of the PCGC, has not been reviewed and/or approved by the PCGC and does not necessarily reflect the opinions of PCGC investigators or the NHLBI. The IMPC has been supported by NIH grants U54 HG006364, NIH U42 OD011175 and NIH UM1 OD023221 (K.C.K.L. and C. McKerlie); NIH UM1OD023222 (J.M.W., S.M. and R. Braun); MRC grant code MC_A410 and NIH UM1HG006370 (T.F.M., H. Parkinson, P.F., D.S., J.M., V.M.F. and H.M.); MRC funding for the IMPC (reference 53658) (S.M.W.); Genome Canada and Ontario Genomics (OGI-051) (C. McKerlie and S.D.M.B.). The Czech Centre for Phenogenomics is supported by RVO68378050 and projects LM2015040, LM2018126, CZ.1.05/1.1.00/02.0109, CZ.1.05/2.1.00/19.0395, CZ.02.1.01/0.0/0.0/16_013/0001789 and CZ.02.1.01/0.0/0.0/18_046/0015861 (R.S.); the French Agence Nationale de la Recherche grants ANR-10-IDEX-0002-02, ANR-10-LABX-0030-INRT and ANR-10-INBS-07 PHENOMIN (Y.H.); European Union Horizon 2020 (IPAD-MD funding 653961); and the German Center for Diabetes Research (DZD) (M.H.d.A.). The Cambridge-Suda Genomic Research Center of the Soochow University was supported by the National Key R&D program of China (2018YFA0801100)(Y.X.).
Extended data
Author contributions
A.M.W., J.R., M.E.D., J.D.H., A.L.B., A.M.F., C. McKerlie, L.M.J.N., K.C.K.L., T. Sorg, Y.H., H. Parkinson, S.E.W., S.D.M.B., A.-M.M., X.G., T.S., J.K.S., Y.X., R.S., J.K.W., S.A.M., K.L.S., R.E.B., H.G., H.F., J. Prochazka, S.P., A.A., G.P., M.-C.B., N.S., V.G.-D. and M.H.d.A. conceived and supervised the study. S.M., W.W., L.M.J.N., K.C.K.L., M. Selloum, G.P.T.-V., F. Mammano, O.E., A.F., P.F., D. Mayr, C. Seisenberger, A. Bürger, F.G., A.A., G.P., M.-C.B., S. Jacqout, D.A.-H., P.C., P.A., J.S., G.C., M. Mackenzie, A. Cauldner, J.K., W.G., S.A.M., A.M.S., R.U., C.K., L.O.G., K.A.P., M. Mckay, V.L., S. Clarke, Q.L., G.S., A. Creighton, L. Lintott, M.G., M.P., S.T., A.P., M.C. and J.A.W planned and performed mouse production. A.M.W., J.K.W., J.R., C.-W.H., C.L.R., A.E.C., C.S.W., A.M.F., L.M.J.N., C. McKerlie, L. Bower., D.C., G.B.A., O.W., H.J., N.S., Y.-L.C., A.S.-M., S.S., J.P., S.P., R.S., J.A.A.-P., L. Becker, O.V.A., J.C.-W., T.K.-R., M.K., C. Stoeger, R. Gerlini, B.R., L.G., S.M.H., A.Y., H.M., M. Tamura, W.H., A. Gaspero, J.R.G., A.G., R. Bohat, L.W., M.L.M., S.K., T.L.R., I.L., D.G.L., R.C.S., S.V., H.C., M. Stewart, S.C., S.W., S. Johnson, J.C., Z.S.-K., H. Phelps, R.H., C.N., Z.L., M.-F.C., F.R., W.B.R., M.A.M., J.J.C., J.P.L., I.V., Z.B., D.Q., R. Guo, A.D.’S., Z.H., S. Camilleri, M. Ganguly, H.A., M.E., X.S., K.D., C.X., K.R., V.L., S.R.T., E. Wolf and J.J.G. participated in phenotyping and data acquisition. D.S.W., A.M.W., J.K.W., J.M.W., J.R., G.M., C.-W.H., C.L.R., A.E.C., C.S.W., L. Bower, S. Leblanc, H.M., T.I.O., N.S., V.G.-D., M.A.Ö., L.Z., L.T., R.K.A.B.-S., C.L., J. Prochazka, S.P., R.S., A. Bezginov, D.M., D.C., E.A., A.G., N.W.R., E.Z. and M.-P.H. performed data QC and data analyses. S. Leblanc, T.I.O., G.M., H.H.M., T.F.M., H.W., G.F., G. Frishmann, C. Montrone, A. Ruepp, J.M., V.M.F., D.S., H.P., A.-M.M., D.C., R.K. and M. McFarland conducted bioinformatic analyses. D.S.W., T.I.O., S. Leuchtenberger, A.S.-M., Y.-L.C., J.R., M.E.D., C.-W.H., A.M.F., L.M.J.N., G.B.A., T. Sorg, Y.H., A. Görlach, S.D.M.B., K.C.K.L., N.S., H.H.M., G.M., V.G.-D., M.H.d.A., M.W., P.M.-K. and R.T. contributed to manuscript preparation and writing.
Peer review
Peer review information
Nature Cardiovascular Research thanks Morten S. Olesen and the other, anonymous, reviewers for their contribution to the peer review of this work.
Funding
Open access funding provided by Helmholtz Zentrum München - Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH).
Data availability
All data generated or analyzed during this study are included in this published article and its Supplementary Information. IMPC data are open access for public. For single-gene search, visit https://www.mousephenotype.org/data/search. For batch queries, please visit https://www.mousephenotype.org/data/batchQuery. To download a particular DR, visit http://ftp.ebi.ac.uk/pub/databases/impc/all-data-releases/. Support for particular DR (here we used DR10.1) download can be found at https://www.mousephenotype.org/help/programmatic-data-access/.
Competing interests
T.I.O. has received honoraria or consulted for Abbott, AstraZeneca, Chiron, Genentech, Infinity Pharmaceuticals, Merz Pharmaceuticals, Merck Darmstadt, Mitsubishi Tanabe, Novartis, Ono Pharmaceuticals, Pfizer, Roche, Sanofi and Wyeth. None of these funds were used for this research project.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Lists of authors and their affiliations appear at the end of the paper.
These authors jointly supervised this work: Valerie Gailus-Durner, Martin Hrabe de Angelis.
Change history
4/20/2022
A Correction to this paper has been published: 10.1038/s44161-022-00072-2
Contributor Information
Martin Hrabe de Angelis, Email: hrabe@helmholtz-muenchen.de.
IMPC consortium:
Juan A. Aguilar-Pimentel, Lore Becker, Lillian Garrett, Sabine M. Hölter, Oana V. Amarie, Julia Calzada-Wack, Tanja Klein-Rodewald, Patricia da Silva-Buttkus, Christoph Lengger, Claudia Stoeger, Raffaele Gerlini, Birgit Rathkolb, Daniela Mayr, John Seavitt, Angelina Gaspero, Jennie R. Green, Arturo Garza, Ritu Bohat, Leeyean Wong, Melissa L. McElwee, Sowmya Kalaga, Tara L. Rasmussen, Isabel Lorenzo, Denise G. Lanza, Rodney C. Samaco, Surabi Veeraragaven, Juan J. Gallegos, Petr Kašpárek, Silvia Petrezsélyová, Ruairidh King, Sara Johnson, James Cleak, Zsombor Szkoe-Kovacs, Gemma Codner, Matthew Mackenzie, Adam Caulder, Janet Kenyon, Wendy Gardiner, Hayley Phelps, Rhys Hancock, Claire Norris, Michayla A. Moore, Audrie M. Seluke, Rachel Urban, Coleen Kane, Leslie O. Goodwin, Kevin A. Peterson, Matthew Mckay, Jenn J. Cook, Jacob P. Lowy, Michael McFarland, Joshua A. Wood, Brandon J. Willis, Heather Tolentino, Todd Tolentino, Michael Schuchbauer, Jason Salazar, Jennifer Johnson, Rebecca Munson, Abdel Ayadi, Guillaume Pavlovic, Marie-Christine Birling, Sylvie Jacquot, Dalila Ali-Hadji, Philippe Charles, Philippe Andre, Marie-France Champy, Fabrice Riet, Igor Vukobradovic, Zorana Berberovic, Dawei Qu, Ruolin Guo, Abigail D’Souza, Ziyue Huang, Susan Camilleri, Milan Ganguly, Hibret Adissu, Mohammed Eskandarian, Xueyuan Shang, Kyle Duffin, Catherine Xu, Kyle Roberton, Valerie Laurin, Qing Lan, Gillian Sleep, Amie Creighton, Lauri Lintott, Marina Gertsenstein, Monica Pereira, Sandra Tondat, Amit Patel, Maribelle Cruz, Alex Bezginov, David Miller, Wang Hy, Atsushi Yoshiki, Nobuhiko Tanaka, Masaru Tamura, Zhiwei Liu, Olga Ermakova, Anna Ferrara, Paolo Fruscoloni, Claudia Seisenberger, Antje Bürger, and Florian Giesert
Genomics England Research Consortium:
J. C. Ambrose, P. Arumu gam, R. Bevers, M. Bleda, F. Boardman-Pretty, C. R. Boustred, H. Brittain, M. J. Caulfield, G. C. Chan, T. Fowler, A. Giess, A. Hamblin, S. Henderson, T. J. P. Hubbard, R. Jackson, L. J. Jones, D. Kasperaviciute, M. Kayikci, A. Kousathanas, L. Lahnstein, S. E. A. Leigh, I. U. S. Leong, F. J. Lopez, F. Maleady-Crowe, M. McEntagart, F. Minneci, L. Moutsianas, M. Mueller, N. Murugaesu, A. C. Need, P. O‘Donovan, C. A. Odhams, C. Patch, D. Perez-Gil, M. B. Pereira, J. Pullinger, T. Rahim, A. Rendon, T. Rogers, K. Savage, K. Sawant, R. H. Scott, A. Siddiq, A. Sieghart, S. C. Smith, A. Sosinsky, A. Stuckey, M. Tanguy, A. L. Taylor-Tavares, E. R. A. Thomas, S. R. Thompson, A. Tucci, M. J. Welland, E. Williams, K. Witkowska, and S. M. Wood
Extended data
is available for this paper at 10.1038/s44161-022-00018-8.
Supplementary information
The online version contains supplementary material available at 10.1038/s44161-022-00018-8.
References
- 1.Watkins, W. S. et al. De novo and recessive forms of congenital heart disease have distinct genetic and phenotypic landscapes. Nat. Commun.10, 4722 (2019). 10.1038/s41467-019-12582-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Landstrom, A. P., Dobrev, D. & Wehrens, X. H. T. Calcium signaling and cardiac arrhythmias. Circ. Res.120, 1969–1993 (2017). 10.1161/CIRCRESAHA.117.310083 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liu, Y. et al. Global birth prevalence of congenital heart defects 1970–2017: updated systematic review and meta-analysis of 260 studies. Int. J. Epidemiol.48, 455–463 (2019). 10.1093/ije/dyz009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Penny, D. J. & Vick, G. W. III Ventricular septal defect. Lancet377, 1103–1112 (2011). 10.1016/S0140-6736(10)61339-6 [DOI] [PubMed] [Google Scholar]
- 5.Dakkak, W. & Oliver, T. I. Ventricular septal defect. in StatPearlshttps://www.ncbi.nlm.nih.gov/books/NBK470330/ (StatPearls, 2019). [PubMed]
- 6.Williams, K., Carson, J. & Lo, C. Genetics of congenital heart disease. Biomolecules9, 879 (2019). [DOI] [PMC free article] [PubMed]
- 7.Zu, B., Zheng, Z. & Fu, Q. Molecular diagnostics for congenital heart disease: a narrative review of the current technologies and applications. J. BioX Res.3, 92–96 (2020). [Google Scholar]
- 8.Haselimashhadi, H. et al. Soft windowing application to improve analysis of high-throughput phenotyping data. Bioinformatics36, 1492–1500 (2019). 10.1093/bioinformatics/btz744 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gelb, B. et al. The Congenital Heart Disease Genetic Network Study: rationale, design, and early results. Circ. Res.112, 698–706 (2013). 10.1161/CIRCRESAHA.111.300297 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sandell, L. L., Guan, X. J., Ingram, R. & Tilghman, S. M. Gatm, a creatine synthesis enzyme, is imprinted in mouse placenta. Proc. Natl Acad. Sci. USA100, 4622–4627 (2003). 10.1073/pnas.0230424100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Pan, Y. H. et al. Crystal structure of human group X secreted phospholipase A2. Electrostatically neutral interfacial surface targets zwitterionic membranes. J. Biol. Chem.277, 29086–29093 (2002). 10.1074/jbc.M202531200 [DOI] [PubMed] [Google Scholar]
- 12.Johnson, K. R., Longo-Guess, C. M. & Gagnon, L. H. Mutations of the mouse ELMO domain containing 1 gene (Elmod1) link small GTPase signaling to actin cytoskeleton dynamics in hair cell stereocilia. PLoS ONE7, e36074 (2012). 10.1371/journal.pone.0036074 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sekine, H., Takahashi, M., Iwaki, D. & Fujita, T. The role of MASP-1/3 in complement activation. Adv. Exp. Med. Biol.735, 41–53 (2013). 10.1007/978-1-4614-4118-2_3 [DOI] [PubMed] [Google Scholar]
- 14.Jansen, S., Collins, A., Golden, L., Sokolova, O. & Goode, B. L. Structure and mechanism of mouse cyclase-associated protein (CAP1) in regulating actin dynamics. J. Biol. Chem.289, 30732–30742 (2014). 10.1074/jbc.M114.601765 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bergmann, M. W. WNT signaling in adult cardiac hypertrophy and remodeling: lessons learned from cardiac development. Circ. Res.107, 1198–1208 (2010). 10.1161/CIRCRESAHA.110.223768 [DOI] [PubMed] [Google Scholar]
- 16.Touvier, T. et al. LEPROT and LEPROTL1 cooperatively decrease hepatic growth hormone action in mice. J. Clin. Invest.119, 3830–3838 (2009). 10.1172/JCI34997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Al Senaidi, K. et al. Phenotypic spectrum of ALPK3-related cardiomyopathy. Am. J. Med. Genet.179, 1235–1240 (2019). [DOI] [PubMed] [Google Scholar]
- 18.Van Sligtenhorst, I. et al. Cardiomyopathy in α-kinase 3 (ALPK3)-deficient mice. Vet. Pathol.49, 131–141 (2012). 10.1177/0300985811402841 [DOI] [PubMed] [Google Scholar]
- 19.Linton, M. F., Tao, H., Linton, E. F. & Yancey, P. G. SR-BI: a multifunctional receptor in cholesterol homeostasis and atherosclerosis. Trends Endocrinol. Metab.28, 461–472 (2017). 10.1016/j.tem.2017.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Muthuramu, I. et al. Hepatocyte-specific SR-BI gene transfer corrects cardiac dysfunction in Scarb1-deficient mice and improves pressure overload-induced cardiomyopathy. Arterioscler. Thromb. Vasc. Biol.38, 2028–2040 (2018). 10.1161/ATVBAHA.118.310946 [DOI] [PubMed] [Google Scholar]
- 21.Zhou, R. et al. Endocrine role of bone in the regulation of energy metabolism. Bone Res.9, 25 (2021). 10.1038/s41413-021-00142-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Tang, C. M. et al. CircRNA_000203 enhances the expression of fibrosis-associated genes by derepressing targets of miR-26b-5p, Col1a2 and CTGF, in cardiac fibroblasts. Sci. Rep.7, 40342 (2017). 10.1038/srep40342 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Boraschi-Diaz, I. et al. Metabolic phenotype in the mouse model of osteogenesis imperfecta. J. Endocrinol.234, 279–289 (2017). 10.1530/JOE-17-0335 [DOI] [PubMed] [Google Scholar]
- 24.De Pace, R. et al. Altered distribution of ATG9A and accumulation of axonal aggregates in neurons from a mouse model of AP-4 deficiency syndrome. PLoS Genet.14, e1007363 (2018). 10.1371/journal.pgen.1007363 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lechner, M. et al. CIDeR: multifactorial interaction networks in human diseases. Genome Biol.13, R62 (2012). 10.1186/gb-2012-13-7-r62 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lachmann, A. et al. Geneshot: search engine for ranking genes from arbitrary text queries. Nucleic Acids Res.47, W571–W577 (2019). 10.1093/nar/gkz393 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Shan, J. et al. Genetic analysis of the SIRT1 gene promoter in ventricular septal defects. Biochem. Biophys. Res. Commun.425, 741–745 (2012). 10.1016/j.bbrc.2012.07.145 [DOI] [PubMed] [Google Scholar]
- 28.Cohen, I. et al. A novel homozygous SLC25A1 mutation with impaired mitochondrial complex V: possible phenotypic expansion. Am. J. Med. Genet.176, 330–336 (2018). 10.1002/ajmg.a.38574 [DOI] [PubMed] [Google Scholar]
- 29.Nakaya, T. et al. p600 plays essential roles in fetal development. PLoS ONE8, e66269 (2013). 10.1371/journal.pone.0066269 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Anderson, R. H., Brown, N. A. & Mohun, T. J. Insights regarding the normal and abnormal formation of the atrial and ventricular septal structures. Clin. Anat.29, 290–304 (2016). 10.1002/ca.22627 [DOI] [PubMed] [Google Scholar]
- 31.Faqeih, E. A. et al. Novel STAMBP mutation and additional findings in an Arabic family. Am. J. Med. Genet.167a, 805–809 (2015). 10.1002/ajmg.a.36782 [DOI] [PubMed] [Google Scholar]
- 32.Zhu, C. et al. Screening for differential methylation status in fetal myocardial tissue samples with ventricular septal defects by promoter methylation microarrays. Mol. Med. Rep.4, 137–143 (2011). 10.3892/mmr.2011.434 [DOI] [PubMed] [Google Scholar]
- 33.Duncan, E. L. et al. Genome-wide association study using extreme truncate selection identifies novel genes affecting bone mineral density and fracture risk. PLoS Genet.7, e1001372 (2011). 10.1371/journal.pgen.1001372 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Nielsen, J. B. et al. Biobank-driven genomic discovery yields new insight into atrial fibrillation biology. Nat. Genet.50, 1234–1239 (2018). 10.1038/s41588-018-0171-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ananth, A. L. et al. Clinical course of six children with GNAO1 mutations causing a severe and distinctive movement disorder. Pediatr. Neurol.59, 81–84 (2016). 10.1016/j.pediatrneurol.2016.02.018 [DOI] [PubMed] [Google Scholar]
- 36.Vatta, M. et al. Mutations in Cypher/ZASP in patients with dilated cardiomyopathy and left ventricular non-compaction. J. Am. Coll. Cardiol.42, 2014–2027 (2003). 10.1016/j.jacc.2003.10.021 [DOI] [PubMed] [Google Scholar]
- 37.Huang, J. et al. Myocardin regulates BMP10 expression and is required for heart development. J. Clin. Invest.122, 3678–3691 (2012). 10.1172/JCI63635 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Soldatov, N. M. CACNB2: an emerging pharmacological target for hypertension, heart failure, arrhythmia and mental disorders. Curr. Mol. Pharmacol.8, 32–42 (2015). 10.2174/1874467208666150507093258 [DOI] [PubMed] [Google Scholar]
- 39.Rasmussen, T. B. et al. Mutated desmoglein-2 proteins are incorporated into desmosomes and exhibit dominant-negative effects in arrhythmogenic right ventricular cardiomyopathy. Hum. Mutat.34, 697–705 (2013). 10.1002/humu.22289 [DOI] [PubMed] [Google Scholar]
- 40.Amin, A. S., Asghari-Roodsari, A. & Tan, H. L. Cardiac sodium channelopathies. Pflugers Arch. Eur. J. Physiol.460, 223–237 (2010). 10.1007/s00424-009-0761-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Rani, D. S. et al. A complete absence of missense mutation in myosin regulatory and essential light chain genes of south Indian hypertrophic and dilated cardiomyopathies. Cardiology141, 156–166 (2018). 10.1159/000495027 [DOI] [PubMed] [Google Scholar]
- 42.Kuleshov, M. V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res.44, W90–W97 (2016). 10.1093/nar/gkw377 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hoang, T. T. et al. The Congenital Heart Disease Genetic Network Study: cohort description. PLoS ONE13, e0191319 (2018). 10.1371/journal.pone.0191319 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Moss, C. & Wernham, A. The 100 000 Genomes Project: feeding back to patients. BMJ361, k2441 (2018). 10.1136/bmj.k2441 [DOI] [PubMed] [Google Scholar]
- 45.Turnbull, C. et al. The 100 000 Genomes Project: bringing whole genome sequencing to the NHS. BMJ361, k1687 (2018). 10.1136/bmj.k1687 [DOI] [PubMed] [Google Scholar]
- 46.Schunkert, H. et al. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nat. Genet.43, 333–338 (2011). 10.1038/ng.784 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hou, Y. et al. A preliminary study on RCN3 protein expression in non-small cell lung cancer. Clin. Lab.62, 293–300 (2016). 10.7754/Clin.Lab.2015.150411 [DOI] [PubMed] [Google Scholar]
- 48.Gijsbers, A. C. et al. Additional cryptic CNVs in mentally retarded patients with apparently balanced karyotypes. Eur. J. Med. Genet.53, 227–233 (2010). 10.1016/j.ejmg.2010.06.003 [DOI] [PubMed] [Google Scholar]
- 49.Goodwin, E. C. et al. BiP and multiple DNAJ molecular chaperones in the endoplasmic reticulum are required for efficient simian virus 40 infection. mBio2, e00101–e00111 (2011). 10.1128/mBio.00101-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Groenendyk, J., Sreenivasaiah, P. K., Kim, D. H., Agellon, L. B. & Michalak, M. Biology of endoplasmic reticulum stress in the heart. Circ. Res.107, 1185–1197 (2010). 10.1161/CIRCRESAHA.110.227033 [DOI] [PubMed] [Google Scholar]
- 51.Dickinson, M. E. et al. High-throughput discovery of novel developmental phenotypes. Nature537, 508–514 (2016). 10.1038/nature19356 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Aung, N. et al. Genome-wide analysis of left ventricular image-derived phenotypes identifies fourteen loci associated with cardiac morphogenesis and heart failure development. Circulation140, 1318–1330 (2019). 10.1161/CIRCULATIONAHA.119.041161 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Herman, D. S. et al. Truncations of titin causing dilated cardiomyopathy. N. Engl. J. Med.366, 619–628 (2012). 10.1056/NEJMoa1110186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Derangeon, M., Montnach, J., Baró, I. & Charpentier, F. Mouse models of SCN5A-related cardiac arrhythmias. Front. Physiol.3, 210 (2012). 10.3389/fphys.2012.00210 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Zervou, S., Whittington, H. J., Russell, A. J. & Lygate, C. A. Augmentation of creatine in the heart. Mini Rev. Med. Chem.16, 19–28 (2016). 10.2174/1389557515666150722102151 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Sequeira, V., Nijenkamp, L. L., Regan, J. A. & van der Velden, J. The physiological role of cardiac cytoskeleton and its alterations in heart failure. Biochim. Biophys. Acta1838, 700–722 (2014). 10.1016/j.bbamem.2013.07.011 [DOI] [PubMed] [Google Scholar]
- 57.Rooryck, C. et al. Mutations in lectin complement pathway genes COLEC11 and MASP1 cause 3MC syndrome. Nat. Genet.43, 197–203 (2011). 10.1038/ng.757 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Long, J. Z. et al. Metabolomics annotates ABHD3 as a physiologic regulator of medium-chain phospholipids. Nat. Chem. Biol.7, 763–765 (2011). 10.1038/nchembio.659 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Deshpande, I. et al. Smoothened stimulation by membrane sterols drives Hedgehog pathway activity. Nature571, 284–288 (2019). 10.1038/s41586-019-1355-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.McCulley, D. J. & Black, B. L. Transcription factor pathways and congenital heart disease. Curr. Top. Dev. Biol.100, 253–277 (2012). 10.1016/B978-0-12-387786-4.00008-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Fu, K. et al. A temporal transcriptome and methylome in human embryonic stem cell-derived cardiomyocytes identifies novel regulators of early cardiac development. Epigenetics13, 1013–1026 (2018). 10.1080/15592294.2018.1526029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Cui, M., Wang, Z., Bassel-Duby, R. & Olson, E. N. Genetic and epigenetic regulation of cardiomyocytes in development, regeneration and disease. Development145, dev171983 (2018). [DOI] [PMC free article] [PubMed]
- 63.Wiegering, A., Rüther, U. & Gerhardt, C. The role of Hedgehog signalling in the formation of the ventricular septum. J. Dev. Biol.5, 17 (2017). [DOI] [PMC free article] [PubMed]
- 64.Valenzuela, D. et al. Gαo is necessary for muscarinic regulation of Ca2+ channels in mouse heart. Proc. Natl Acad. Sci. USA94, 1727–1732 (1997). 10.1073/pnas.94.5.1727 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Cai, X. et al. Tbx20 acts upstream of Wnt signaling to regulate endocardial cushion formation and valve remodeling during mouse cardiogenesis. Development140, 3176–3187 (2013). 10.1242/dev.092502 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Boogerd, C. J. et al. Tbx20 is required in mid-gestation cardiomyocytes and plays a central role in atrial development. Circ. Res.123, 428–442 (2018). 10.1161/CIRCRESAHA.118.311339 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Witzel, H. R., Cheedipudi, S., Gao, R., Stainier, D. Y. & Dobreva, G. D. Isl2b regulates anterior second heart field development in zebrafish. Sci. Rep.7, 41043 (2017). 10.1038/srep41043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Vasilescu, C. et al. Genetic basis of severe childhood-onset cardiomyopathies. J. Am. Coll. Cardiol.72, 2324–2338 (2018). 10.1016/j.jacc.2018.08.2171 [DOI] [PubMed] [Google Scholar]
- 69.Jin, S. C. et al. Contribution of rare inherited and de novo variants in 2,871 congenital heart disease probands. Nat. Genet.49, 1593–1601 (2017). 10.1038/ng.3970 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Yang, X. F. et al. Relationship between TBX20 gene polymorphism and congenital heart disease. Genet. Mol. Res.15, gmr7374 (2016). [DOI] [PubMed]
- 71.Huang, R. T. et al. CASZ1 loss-of-function mutation associated with congenital heart disease. Gene595, 62–68 (2016). 10.1016/j.gene.2016.09.044 [DOI] [PubMed] [Google Scholar]
- 72.Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am. J. Epidemiol.186, 1026–1034 (2017). 10.1093/aje/kwx246 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Karczewski, K. J. et al. Systematic single-variant and gene-based association testing of 3,700 phenotypes in 281,850 UK Biobank exomes. Preprint at medRxiv10.1101/2021.06.19.21259117 (2021).
- 74.Lipshultz, S. E. et al. Cardiomyopathy in children: classification and diagnosis: a scientific statement from the American Heart Association. Circulation140, e9–e68 (2019). 10.1161/CIR.0000000000000682 [DOI] [PubMed] [Google Scholar]
- 75.Isbister, J. & Semsarian, C. Sudden cardiac death: an update. Intern. Med. J.49, 826–833 (2019). 10.1111/imj.14359 [DOI] [PubMed] [Google Scholar]
- 76.Sifrim, A. et al. Distinct genetic architectures for syndromic and nonsyndromic congenital heart defects identified by exome sequencing. Nat. Genet.48, 1060–1065 (2016). 10.1038/ng.3627 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Brown, S. D. M. et al. High-throughput mouse phenomics for characterizing mammalian gene function. Nat. Rev. Genet.19, 357–370 (2018). 10.1038/s41576-018-0005-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Hrabe de Angelis, M. et al. Analysis of mammalian gene function through broad-based phenotypic screens across a consortium of mouse clinics. Nat. Genet.47, 969–978 (2015). 10.1038/ng.3360 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Bowl, M. R. et al. A large scale hearing loss screen reveals an extensive unexplored genetic landscape for auditory dysfunction. Nat. Commun.8, 886 (2017). 10.1038/s41467-017-00595-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Brown, S. D. & Moore, M. W. The International Mouse Phenotyping Consortium: past and future perspectives on mouse phenotyping. Mamm. Genome23, 632–640 (2012). 10.1007/s00335-012-9427-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Gailus-Durner, V. et al. Introducing the German Mouse Clinic: open access platform for standardized phenotyping. Nat. Methods2, 403–404 (2005). 10.1038/nmeth0605-403 [DOI] [PubMed] [Google Scholar]
- 82.Mallon, A. M., Blake, A. & Hancock, J. M. EuroPhenome and EMPReSS: online mouse phenotyping resource. Nucleic Acids Res.36, D715–D718 (2008). 10.1093/nar/gkm728 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Rozman, J. et al. Identification of genetic elements in metabolism by high-throughput mouse phenotyping. Nat. Commun.9, 288 (2018). 10.1038/s41467-017-01995-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Meehan, T. F. et al. Disease model discovery from 3,328 gene knockouts by the International Mouse Phenotyping Consortium. Nat. Genet.49, 1231–1238 (2017). 10.1038/ng.3901 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Swan, A. L. et al. Mouse mutant phenotyping at scale reveals novel genes controlling bone mineral density. PLoS Genet.16, e1009190 (2020). 10.1371/journal.pgen.1009190 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Desai, K. H. et al. Cardiovascular indexes in the mouse at rest and with exercise: new tools to study models of cardiac disease. Am. J. Physiol.272, H1053–H1061 (1997). [DOI] [PubMed] [Google Scholar]
- 87.Wehrens, X. H., Kirchhoff, S. & Doevendans, P. A. Mouse electrocardiography: an interval of thirty years. Cardiovasc. Res.45, 231–237 (2000). 10.1016/S0008-6363(99)00335-1 [DOI] [PubMed] [Google Scholar]
- 88.Mitchell, G. F., Jeron, A. & Koren, G. Measurement of heart rate and Q–T interval in the conscious mouse. Am. J. Physiol.274, H747–H751 (1998). [DOI] [PubMed] [Google Scholar]
- 89.Sahn, D. J., DeMaria, A., Kisslo, J. & Weyman, A. Recommendations regarding quantitation in M-mode echocardiography: results of a survey of echocardiographic measurements. Circulation58, 1072–1083 (1978). 10.1161/01.CIR.58.6.1072 [DOI] [PubMed] [Google Scholar]
- 90.R Core Team R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2020).
- 91.Haselimashhadi, H. et al. OpenStats: a robust and scalable software package for reproducible analysis of high-throughput phenotypic data. PLoS ONE15, e0242933 (2020). 10.1371/journal.pone.0242933 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Karp, N. A. et al. Applying the ARRIVE Guidelines to an in vivo database. PLoS Biol.13, e1002151 (2015). 10.1371/journal.pbio.1002151 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Kurbatova, N., Karp, N. & Mason, J. PhenStat: Statistical Analysis of Phenotypic Datahttps://bioc.ism.ac.jp/ (2016).
- 94.Kurbatova, N., Karp, N., Mason, J. & Haselimashhadi, H. PhenStat: statistical analysis of phenotypic data. R package version 2.30.0 (2021).
- 95.Wong, M. D., Spring, S. & Henkelman, R. M. Structural stabilization of tissue for embryo phenotyping using micro-CT with iodine staining. PLoS ONE8, e84321 (2013). 10.1371/journal.pone.0084321 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Kemp, J. P. et al. Phenotypic dissection of bone mineral density reveals skeletal site specificity and facilitates the identification of novel loci in the genetic regulation of bone mass attainment. PLoS Genet.10, e1004423 (2014). 10.1371/journal.pgen.1004423 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Li, H. Z. et al. Study on the contribution rate of follow-up formula to the nutrient intake of infants and young children aged 7–24 months in China. Zhonghua Yu Fang Yi Xue Za Zhi51, 65–69 (2017). [DOI] [PubMed] [Google Scholar]
- 98.Xu, J.-H. et al. Prevalence and spectrum of NKX2-5 mutations associated with sporadic adult-onset dilated cardiomyopathy. Int. J. Heart58, 521–529 (2017). 10.1536/ihj.16-440 [DOI] [PubMed] [Google Scholar]
- 99.Zhou, Y.-M. A novel TBX20 loss‑of‑function mutation contributes to adult‑onset dilated cardiomyopathy or congenital atrial septal defect. Mol. Med. Rep.14, 3307–3314 (2016). 10.3892/mmr.2016.5609 [DOI] [PubMed] [Google Scholar]
- 100.Zhou, G. et al. NetworkAnalyst 3.0: a visual analytics platform for comprehensive gene expression profiling and meta-analysis. Nucleic Acids Res.47, W234–W241 (2019). 10.1093/nar/gkz240 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data generated or analyzed during this study are included in this published article and its Supplementary Information. IMPC data are open access for public. For single-gene search, visit https://www.mousephenotype.org/data/search. For batch queries, please visit https://www.mousephenotype.org/data/batchQuery. To download a particular DR, visit http://ftp.ebi.ac.uk/pub/databases/impc/all-data-releases/. Support for particular DR (here we used DR10.1) download can be found at https://www.mousephenotype.org/help/programmatic-data-access/.