Somatic mutations in single human cardiomyocytes reveal age-associated DNA damage and widespread oxidative genotoxicity

Sangita Choudhury; August Yue Huang; Junho Kim; Zinan Zhou; Katherine Morillo; Eduardo A Maury; Jessica W Tsai; Michael B Miller; Michael A Lodato; Sarah Araten; Nazia Hilal; Eunjung Alice Lee; Ming Hui Chen; Christopher A Walsh

doi:10.1038/s43587-022-00261-5

Author manuscript; available in PMC: 2022 Aug 31.

Published in final edited form as: Nat Aging. 2022 Aug 11;2(8):714–725. doi: 10.1038/s43587-022-00261-5


Sangita Choudhury ^1,2,3,13,14,✉, August Yue Huang ^1,2,3,13, Junho Kim ^1,2,3,4, Zinan Zhou ^1,2,5, Katherine Morillo ¹, Eduardo A Maury ^1,2,3,5,6, Jessica W Tsai ^1,2,5,7, Michael B Miller ^1,2,3,8, Michael A Lodato ⁹, Sarah Araten ¹, Nazia Hilal ^1,2, Eunjung Alice Lee ^1,2,3,14,✉, Ming Hui Chen ^1,2,10,11,14,✉, Christopher A Walsh ^1,2,3,5,12,14,✉

¹Division of Genetics and Genomics, Manton Center for Orphan Disease, Department of Pediatrics, Boston Children’s Hospital, Boston, MA, USA.

²Departments of Pediatrics, Harvard Medical School, Boston, MA, USA.

³Broad Institute of MIT and Harvard, Cambridge, MA, USA.

⁴Department of Biological Sciences, Sungkyunkwan University, Suwon, South Korea.

⁵Department of Neurology, Harvard Medical School, Boston, MA, USA.

⁶Bioinformatics & Integrative Genomics Program and Harvard/MIT MD-PhD Program, Harvard Medical School, Boston, MA, USA.

⁷Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.

⁸Division of Neuropathology, Department of Pathology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA.

⁹Department of Molecular, Cell and Cancer Biology, University of Massachusetts Medical School, Worcester, MA, USA.

¹⁰Department of Cardiology, Boston Children’s Hospital, Boston, MA, USA.

¹¹Harvard Medical School, Boston, MA, USA.

¹²Howard Hughes Medical Institute, Boston Children’s Hospital, Boston, MA, USA.

¹³These authors contributed equally: Sangita Choudhury, August Yue Huang.

¹⁴These authors jointly supervised this work: Sangita Choudhury, Eunjung Alice Lee, Ming Hui Chen, Christopher A. Walsh.

Author contributions

S.C., M.H.C., A.Y.H., E.A.L. and C.A.W. conceived and designed the study. C.A.W., E.A.L. and M.H.C. supervised the study. S.C., A.Y.H., E.A.L., M.H.C. and C.A.W. wrote the article. S.C., Z.Z., K.M., J.W.T., M.B.M., M.A.L., S.A. and N.H. isolated single nuclei and performed whole-genome amplification as well as single-cell WGS. A.Y.H. performed sSNV calling from single-cell WGS data and performed bioinformatics analyses. J.K. performed quality control and read alignment of single-cell WGS data. E.A.M. and A.Y.H. analyzed GTEx expression data and performed knockout cell modeling. All authors reviewed and edited the article.

^✉ Correspondence and requests for materials should be addressed to Sangita Choudhury, Eunjung Alice Lee, Ming Hui Chen or Christopher A. Walsh: Sangita.choudhury@childrens.harvard.edu; ealice.lee@childrens.harvard.edu; Minghui.chen@cardio.chboston.org; christopher.walsh@childrens.harvard.edu

PMCID: PMC9432807 NIHMSID: NIHMS1832113 PMID: 36051457

Abstract

The accumulation of somatic DNA mutations over time is a hallmark of aging in many dividing and nondividing cells but has not been studied in postmitotic human cardiomyocytes. Using single-cell whole-genome sequencing, we identified and characterized the landscape of somatic single-nucleotide variants (sSNVs) in 56 single cardiomyocytes from 12 individuals (aged from 0.4 to 82 years). Cardiomyocyte sSNVs accumulate with age at rates that are faster than in many dividing cell types and nondividing neurons. Cardiomyocyte sSNVs show distinctive mutational signatures that implicate failed nucleotide excision repair and base excision repair of oxidative DNA damage, and defective mismatch repair. Since age-accumulated sSNVs create many damaging mutations that disrupt gene functions, polyploidization in cardiomyocytes may provide a mechanism of genetic compensation to minimize the complete knockout of essential genes during aging. Age-related accumulation of cardiac mutations provides a paradigm to understand the influence of aging on cardiac dysfunction.

Advanced age is the most critical risk factor for heart disease and heart failure¹. Most heart failure occurs in individuals older than 65 years, and yet we have an incomplete understanding of how aging promotes heart failure. With aging, cardiac function is impaired at the organ level as well as at the cardiomyocyte level². In the adult heart, cardiomyocyte cell size is stable, and proliferation is either very rare or absent. Nevertheless, cardiomyocytes show changes in activation, contraction and relaxation, hypertrophy and, in some cases, cell loss with aging³. The underlying cause of these many aging phenotypes is likely molecular in nature, but its mechanism is not well understood.

Somatic mutations are genomic changes that escape the DNA repair machinery, and occur after the formation of the zygote and throughout the whole lifespan. The accumulation of somatic DNA mutations has been demonstrated to be a hallmark of aging in many human cell types, including postmitotic neurons^4–7. In most cases, they have little or no phenotypic consequences, as most mutations have no effect on cellular function. In some cases, these mutations are toxic, or even lethal, to the mutant cell. Recent evidence suggests that somatic mutations play a role in cancer as well as in other common diseases^8–10, including coronary artery disease¹¹. Each mutational process leaves a characteristic mutational mark in the genomic context (a so-called mutational signature), and many of these mutational signatures have been linked to specific mutagens and biological processes, such as tobacco smoke or the deficiency of DNA repair processes¹². Understanding mutational signatures and their mechanisms of formation might help unveil the mechanism of DNA damage and disease progression in the aging heart. This prompted us to evaluate the landscape of somatic single-nucleotide variants (sSNVs) and associated mutational signatures in aging human cardiomyocytes.

Results

Cardiomyocyte polyploidization starts early in life.

To identify and isolate cardiomyocyte nuclei from heart tissue, we purified cardiac nuclei from fresh-frozen, unfixed, left ventricle human myocardium by density sedimentation (Supplementary Table 1)¹³. Cardiac nuclei were identified with cardiac troponin T (cTroponin T) staining (Fig. 1a,b,d and Extended Data Fig. 1a), which is a well-characterized cardiomyocyte-specific marker¹⁴. Although the pericentriolar material 1 (PCM1) antibody is used widely to identify cardiomyocyte nuclei, it has been shown that about 20% of PCM1-positive nuclei correspond to nonmyocyte cells¹⁵. Careful doublet exclusion was performed by plotting the area for forward scatter (FSC) and side scatter (SSC) against the height (H) or width (W); H versus W or area (A) allows the separation of doublets from single nuclei containing 4n amounts of DNA. Nuclei containing 4n amounts of DNA have double the A and H values, whereas W is roughly the same as that of nuclei containing 2n amounts of DNA (Fig. 1a,b; DAPI-H versus DAPI-W). We systematically evaluated cardiomyocyte nucleus ploidy in 50 human heart tissue samples from individuals ranging from 44 days to 81 years of age (Fig. 1c, showing only representative age group plots). Polyploid cardiomyocyte nuclei were identified based on the intensity of the nuclear stain 4′,6-diamidino-2-phenylindole (DAPI). Fluorescence-activated nuclei sorting (FANS; Fig. 1c,d), as well as Amnis imaging flow cytometry (Fig. 1e,f), revealed the presence of modest numbers of tetraploid cardiomyocyte nuclei even in the newborn heart tissue. Cardiomyocyte ploidy was further confirmed after nuclear sorting with karyotyping (Fig. 1g). Single-cell gene expression analysis by droplet digital PCR (ddPCR) from tetraploid nuclei indicates that tetraploid nuclei all express cardiac marker genes PLN, TNNT2 and MYH7 and lack fibroblast and endothelial marker genes (Fig. 1h and Extended Data Fig. 1b), suggesting that all tetraploid nuclei are cardiomyocytes, and making it unlikely that tetraploid nuclei represent sorting artifacts. Extensive studies by fluorescence in situ hybridization and immunostaining have indicated that human cardiomyocyte nuclei are mostly diploid at birth and start to become polyploid mainly in the second decade of life¹⁶, and that 60% of human cardiomyocyte nuclei have higher ploidy at an advanced age^17–19. Our data indicate that polyploidization in cardiomyocytes is evident in the neonatal heart but confirm that it becomes more common with age.

Fig. 1 | — a,b,d, Representative flow cytometry analysis of cardiac nuclei with Alexa Fluor 488 conjugate cTroponin T with careful doublet exclusion for 1278 (a), 5657 (b), and 1465 (d). c, DNA content histograms of heart cell nuclei with varied ages. The first peak indicates diploid (2n) nuclei, the second peak indicates tetraploid (4n) nuclei and the third and fourth peaks indicate multiploid (>4n) nuclei. d,e, Evaluation of cardiomyocyte nuclei ploidy by flow cytometry (d) and Amnis imaging flow cytometry (e), showing diploid (2N), tetraploid (4N) and multiploid (multiple N) cardiomyocyte nuclei proportion in case 1465. f, Representative photomicrographs of isolated cardiomyocyte nuclei (n = 4 independent experiments; images from 500 cells per experiment were examined from n = 20 cases), confirming DNA content of a single tetraploid and multiploid cardiomyocyte nuclei. Scale bar, 20 μm. g, Flow cytometry sorted cardiomyocyte nuclei karyotyping confirming cardiomyocyte chromosomes numbers, 46 (2n), 92 (4n) and 138 (>4n) in cardiomyocyte nuclei isolated from case 1465 (n = 6; 15–20 nuclei counted each time). Scale bar, 10 μm. h, Representative ddPCR analysis from flow cytometry sorted nuclei (n = 4, from 12 cases). cTroponin-T-positive 2n, 4n and greater than 4n are highly enriched for *PLN* and *MYH7* (cardiac markers), whereas cTroponin-T-negative 2n nuclei express *PDGFB* or *CD31* (markers for fibroblast and endothelial cells). The bottom clusters on the plot represent the negative droplets and the upper clusters represent the droplets that are positive for the respective reference assay. The plus sign indicates cells sorted from cTroponin-T-positive staining population; the dash indicates cells sorted from cTroponin-T-negative population; the blank box indicates that cells were sorted based solely on ploidy status

Somatic mutations increase with age in cardiomyocytes.

We evaluated the genome-wide burden of sSNVs using single-cell whole-genome sequencing (WGS) of DNA amplified from 48 tetraploid and 8 diploid single-cardiomyocyte nuclei from postmortem hearts of three infant (<4 years), six middle-aged (30–66 years) and three aged individuals (>75 years) (Fig. 2a and Table 1). Cardiomyocyte nuclei were isolated from the left ventricle, and DNA was amplified using multiple displacement amplification (MDA)²⁰ followed by quality control steps (Methods) and then high-coverage (>50×) WGS on selected, well-amplified cells¹⁰ (Supplementary Table 2). We identified single-cell sSNVs from each cardiomyocyte (Supplementary Table 3) using a modified version of the LiRA²¹ algorithm, which uses read phasing information from adjacent germline variants to distinguish true somatic mutations from technical artifacts arising during whole-genome amplification and sequencing. The LiRA algorithm has been demonstrated to effectively remove most false positives²¹. We also estimated the genome-wide sSNV burden for each cardiomyocyte (Supplementary Table 4), after taking cell-specific dropout rates and sequencing depth distributions into consideration to account for the tetraploidy effect on detection sensitivity and power calculation, and subtracting the contribution of potential artifacts due to MDA amplification based on their highly distinctive nucleotide substitution pattern^22,23 (Methods). We also measured the MDA amplification evenness across the genome for each single-cell using median absolute pairwise difference (MAPD) and coefficient of variation (CoV) of binned normalized copy number ratios, and incorporated them as covariates in our subsequent regression analyses.
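The two amplification-evenness covariates named above can be illustrated with a minimal sketch. The text does not give the exact formulas, so the definitions below are assumptions based on common usage: MAPD is taken as the median absolute difference of log2 copy ratios between adjacent bins, and CoV as the population standard deviation divided by the mean. The function names and the toy bin values are hypothetical.

```python
import math
import statistics


def mapd(copy_ratios):
    """Median absolute pairwise difference of log2 copy ratios
    between neighbouring genomic bins (assumed definition)."""
    logs = [math.log2(r) for r in copy_ratios]
    diffs = [abs(b - a) for a, b in zip(logs, logs[1:])]
    return statistics.median(diffs)


def cov(copy_ratios):
    """Coefficient of variation of binned copy ratios
    (population standard deviation over mean; assumed definition)."""
    return statistics.pstdev(copy_ratios) / statistics.mean(copy_ratios)


# Hypothetical normalized copy ratios for two cells (not real data).
evenly_amplified = [1.0, 1.05, 0.95, 1.0, 1.02, 0.98]
unevenly_amplified = [1.0, 2.0, 0.5, 1.5, 0.4, 1.8]

for name, bins in [("even", evenly_amplified), ("uneven", unevenly_amplified)]:
    print(name, round(mapd(bins), 3), round(cov(bins), 3))
```

Lower values of both scores indicate more uniform amplification, which is why they are natural covariates when regressing sSNV burden on age.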

Fig. 2 | — a, Schematic of approach. Nuclei are isolated from frozen postmortem heart tissues with varied ages, and sorted based on ploidy content by FANS. Sorted nuclei are amplified by Φ29 polymerase-mediated MDA for WGS and subsequent sSNV burden and signature analysis. The numbers of examined nuclei (n) are noted. b, Estimated sSNV density in tetraploid cardiomyocytes. Tetraploid cardiomyocytes showed an increased number of sSNVs with increased age (P = 7.3 × 10⁻⁴, mixed-effects regression). Single tetraploid cardiac nuclei were obtained from normal postmortem left ventricle hearts of individuals ranging from 0.4 to 82.7 years of age (n = 48 cells from 12 donors). c, Genome-wide distribution of SNVs. Autosomal sSNV density in the 10 Mb genomic window is plotted for each individual. Green, blue and red curves represent infant, middle-aged and aged hearts, respectively. d, Estimated sSNV density in diploid cardiomyocytes. Aged diploid cardiomyocytes (75 years, n = 4 cells) showed an increase of around sevenfold (P = 0.014, two-tailed t-test) in sSNV burden compared with infant cardiomyocytes (0.4 years, n = 4 cells). Error bar, ± s.e.m. e, Age-dependent increases of sSNV density in cardiomyocytes versus neurons, hepatocytes and lymphocytes. Red, blue, green and purple dotted lines indicate the mixed-effects regression lines of cardiomyocytes, neurons, hepatocytes and lymphocytes, respectively.

Table 1 | Case information analyzed in this study for sequencing

Case ID	Age (years)	Height (inches, range)	HW (g)	BW (lb, range)	Sex	Diagnosis	Cause of death	Hypertrophy/fibrosis?	RIN	PMI (h)
1278	0.4	20–25	42	15–20	M	Normal	SIDS	No	NA	8
1864	2	35–40	59	30–35	F	Normal	Laryngitis and bronchiolitis associated with beta hemolytic streptococcus group A infection	No	7.2	8
6032	4	40–45	110	55–60	M	Normal	Head and neck injuries, accident	No	7	25
1863	30	65–70	300	125–130	F	Normal	Multiple injuries, accident	No	8.8	7
1104	35	70–75	450	225–230	M	Normal	Multiple injuries, accident	No	8.4	12
1028	39	65–70	228	125–130	F	Normal	Accident	No	8.9	21
936	49.2	65–70	280	155–160	F	Normal	Liver cirrhosis	No	7.8	7
5919	63	70–75	430	185–190	M	Normal	Drowning	No	6.1	12
5828	66	60–65	320	140–150	F	Normal	Accident	No	8.2	18
5084	75	65–70	400	205–210	F	Normal	Accident	No	NA	12
5840	75.3	70–75	650	225–230	M	Hypertrophy	Ruptured abdominal aortic aneurysm	No; LV 1.6 cm, Septum 1.6 cm, RV 0.4 cm	7.8	17
5657	82.2	70–75	700	235–240	M	Hypertrophy	Natural death	No; LV 1.8 cm, Septum 1.8 cm, RV 0.2 cm	8.4	22


HW, heart weight; BW, body weight; LV, left ventricle; NA, not available; RIN, RNA integrity number; RV, right ventricle; SIDS, sudden infant death syndrome; PMI, postmortem interval.

Tetraploid cardiomyocytes showed a significant increase of sSNV with age (P = 7.3 × 10⁻⁴, mixed-effects regression; Fig. 2b) at a rate of 0.010 sSNV Mb⁻¹ year⁻¹ (or 124 sSNV cell⁻¹ year⁻¹), and the sSNVs were distributed broadly across the genome (Fig. 2c). Note that, in each age group, there are notable intraindividual and interindividual variations, particularly in the aged group, where a few outlier nuclei showed a very high sSNV burden (>2 sSNV Mb⁻¹). We further confirmed the age-dependent increase in diploid cardiomyocytes, in which the aged heart showed significantly more sSNVs than the infant heart (P = 0.014, two-tailed t-test; Fig. 2d). We observed nearly doubled per-genome sSNV rate in tetraploid cardiomyocytes when compared with diploid cardiomyocytes, but, after normalizing to their different genomic size, no significant difference was observed between tetraploid and diploid cardiomyocytes obtained from infant and aged donors (P = 0.11 and 0.86, two-tailed Wilcoxon test; Extended Data Fig. 2a), suggesting a consistent increase of sSNV in human heart muscle cells with age, regardless of cardiomyocyte nuclear ploidy. Since two of our donors were affected by ventricular hypertrophy (cases 5657 and 5840), we remeasured the age-associated increase after excluding cardiomyocytes obtained from these two donors, and still observed a consistent increase rate (0.010 sSNV Mb⁻¹ year⁻¹; P = 6.1 × 10⁻⁴, mixed-effects regression).

To better understand the age-dependent sSNV accumulation in cardiomyocytes, we first compared the accumulation rates between cardiomyocytes and neurons^10,23, another nondividing human cell type. We found that cardiomyocytes accumulate sSNVs around three times faster than neurons (0.003 sSNV Mb⁻¹ year⁻¹ or 19 sSNV cell genome⁻¹ year⁻¹; P = 2.1 × 10⁻⁵ between cardiomyocyte and neuron, mixed-effects regression; Fig. 2e), suggesting that cardiomyocytes and neurons might undergo different mutational processes during aging. The age-dependent increase in cardiomyocytes and the cardiomyocyte-neuron difference remained significant (P < 0.05) even after controlling potential confounding factors including MAPD and CoV scores (measurement of amplification evenness), sequencing depth, library insert size, number of heterozygous germline SNVs and postmortem interval (PMI), as well as excluding C > T sSNVs (Extended Data Fig. 3; Methods). We further compared the sSNV accumulation rate in cardiomyocytes with hepatocytes²⁴—the liver cells of another highly active metabolic organ—and mitotically active lymphocyte blood cells²⁵ profiled by other groups (Fig. 2e). We found that cardiomyocytes accumulated sSNV at a rate similar to that of hepatocytes (0.009 sSNV Mb⁻¹ year⁻¹ considering diploid genome or 55 sSNV cell⁻¹ year⁻¹; P = 0.54 between cardiomyocyte and hepatocyte, mixed-effects regression) but significantly higher than lymphocytes (0.004 sSNV Mb⁻¹ year⁻¹ or 22 sSNV cell⁻¹ year⁻¹; P = 2.0 × 10⁻⁴ between cardiomyocyte and lymphocyte, mixed-effects regression).
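The per-megabase and per-cell rates quoted above are related by genome size and ploidy. The sketch below makes that conversion explicit. The haploid genome size of 3,100 Mb is an assumption, chosen because it reproduces the reported 124 sSNV cell⁻¹ year⁻¹ for tetraploid cardiomyocytes; the callable genome size actually used in the analysis is not stated in this section. The function name is hypothetical.

```python
# Assumed haploid genome size in Mb (not stated in the text).
HAPLOID_GENOME_MB = 3100


def per_cell_rate(rate_per_mb, ploidy, haploid_mb=HAPLOID_GENOME_MB):
    """Convert an sSNV rate per Mb per year into sSNVs per cell per year."""
    return rate_per_mb * ploidy * haploid_mb


# (rate per Mb per year, ploidy) as reported in the text.
reported = {
    "cardiomyocyte (tetraploid)": (0.010, 4),
    "neuron": (0.003, 2),
    "hepatocyte": (0.009, 2),
    "lymphocyte": (0.004, 2),
}

for cell_type, (rate, ploidy) in reported.items():
    print(f"{cell_type}: {per_cell_rate(rate, ploidy):.1f} sSNV per cell per year")
```

Under this assumption the cardiomyocyte value matches the reported figure, while the diploid cell types come out close to, but not exactly at, the reported 19, 55 and 22. That gap is expected if the published per-Mb rates are rounded and the effective genome size differs from the value assumed here.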

Signature analysis identifies distinct mutational processes in aging cardiomyocytes.

Different types of mutagenesis processes manifest different mutational signatures on the genome, offering insight into the molecular mechanisms involved in their formation²⁶. Using 10,407 sSNVs identified from 48 tetraploid cardiac nuclei (Supplementary Table 3), we first studied the base substitution distribution of age-accumulated sSNVs by subtracting the sSNV profiles of infant cardiomyocytes from those of aged cardiomyocytes, and observed that C > T and T > C mutations accumulated predominantly during the aging process (Fig. 3a). To further decipher mutational processes in the aging heart, we deconvoluted the sSNV profiles of all cardiomyocytes into mutation signatures using non-negative matrix factorization (NMF)²⁷. Cardiac sSNVs were best fit by four distinct mutational signatures, referred to in this study as Signatures A, B, C and D (Extended Data Fig. 4 and Extended Data Fig. 5). Signature B consisted mainly of C > T mutations, depleted at CpG dinucleotides, and matched closely with mutational signatures that were previously ascribed to artifacts created by MDA amplification²², so Signature B was removed from mutation burden analysis and not considered further. We compared our cardiac signatures with single-base substitution (SBS) signatures annotated by the COSMIC (Catalogue Of Somatic Mutations In Cancer) database²⁸, a curated list of reference signatures generated from distinct types of human cancer, many of which have been linked mechanistically to various environmental factors and intrinsic processes. Further, we confirmed that tetraploid and diploid cardiomyocytes showed a similar contribution of Signatures A, C and D (Extended Data Fig. 2b,c,d).
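As a sketch of how a base substitution distribution such as the one described above is typically tabulated, each sSNV can be collapsed onto the pyrimidine strand so that, for example, a G > A change is counted as C > T. This is the standard convention behind the six substitution classes and 96 trinucleotide channels, but the code is illustrative only and is not the pipeline used in the study. The function names and the example variants are hypothetical.

```python
from collections import Counter

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def classify_snv(ref_context, alt):
    """Collapse an SNV onto the pyrimidine strand.

    ref_context: reference trinucleotide centred on the mutated base.
    alt: alternate base on the same strand.
    Returns (substitution class, pyrimidine-centred trinucleotide).
    """
    ref = ref_context[1]
    if ref in "AG":  # purine reference: report the opposite strand
        ref_context = reverse_complement(ref_context)
        alt = COMPLEMENT[alt]
        ref = ref_context[1]
    return f"{ref}>{alt}", ref_context


# Hypothetical variants: (trinucleotide context, alternate base).
variants = [("ACG", "T"), ("TGA", "A"), ("CAT", "G"), ("GCT", "A")]
spectrum = Counter(classify_snv(ctx, alt)[0] for ctx, alt in variants)
print(dict(spectrum))
```

Counting the second element of the returned tuple together with the class gives the 96-channel profile that signature extraction methods such as NMF take as input.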

Signature A comprised mainly C > T and T > C mutations (Fig. 3b), and its contribution in cardiomyocytes increased with age (Fig. 3c). Signature A closely resembled, and was dominated by, SBS5 (Fig. 3d and Extended Data Fig. 6), an age-related, ‘clock-like’ signature previously observed in many cancers²⁶, normal cycling cells⁶ and single neurons⁵. This signature has been proposed to reflect faulty repair of deamination of methylated cytosines to thymine that frequently occurs in the CpG context²⁹.
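Resemblance between an extracted signature and a reference signature is commonly quantified with cosine similarity between their channel-probability vectors. The text here does not state which metric was used, so the sketch below is an assumption about a typical approach rather than a description of the authors' method. The six-channel vectors are made-up numbers for illustration, not values from the study or from COSMIC.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical proportions over C>A, C>G, C>T, T>A, T>C, T>G.
extracted_signature = [0.10, 0.05, 0.35, 0.08, 0.32, 0.10]
clock_like_reference = [0.12, 0.06, 0.33, 0.09, 0.30, 0.10]
oxidative_reference = [0.60, 0.05, 0.15, 0.05, 0.10, 0.05]

print(cosine_similarity(extracted_signature, clock_like_reference))
print(cosine_similarity(extracted_signature, oxidative_reference))
```

A value near 1 indicates nearly identical profiles; in practice the comparison is run over all 96 trinucleotide channels rather than six classes.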

Signature C was distinct from the other signatures due to the prominent enrichment of C > A mutations (Fig. 3b). C > A mutations often reflect faulty repair of 8-oxoguanine (8-hydroxyguanine (8-oxo-Gua)), created by oxidative DNA damage³⁰—one of the most common threats to genome stability³¹. The heart is one of the most highly metabolic organs, with large oxidative demands resulting in production of reactive oxygen species (ROS)³², which is thought to generate 8-oxo-Gua³³. Decomposition of Signature C revealed significant contributions of SBS8, SBS18, SBS32 and SBS39 (Fig. 3d). SBS8 and SBS18 have been associated with the transcription-coupled repair of damaged guanine by ROS via nucleotide excision repair (NER)³⁴ and base excision repair (BER)³⁵ pathways, respectively, suggesting that the C > A mutations may reflect the accelerated accumulation of oxidative DNA damage that overwhelms these repair pathways.

Signature D was also enriched for C > T and T > C mutations but distinct from Signature A in its trinucleotide context (Fig. 3b). Signature D closely resembled COSMIC SBS44 (Extended Data Fig. 6) along with significant contributions from SBS6, SBS39, SBS42 and SBS46 (Fig. 3d). Both SBS6 and SBS44 have been associated with defective DNA mismatch repair (MMR) machinery and are increased in tumors associated with loss of MMR genes²⁶. MMR is regulated by a small set of MMR-specific proteins in all cells³⁶. Mutagenesis in the absence of one of the core MMR factors is shaped by the sequence spectrum of the unrepaired mismatches, which themselves are the product of the insertional specificity and proofreading activity of DNA polymerases. Signature D showed a striking similarity to the mutational signature of MSH6-defective human HAP1 cells, or the DLD-1 human colorectal cancer cell line, dominated by C > T and T > C mutations³⁷ in a range of contexts, and small contributions of C > A, C > G, T > A, and T > G mutations. Signature D hence likely reflects a defect in the repair of damage that involves almost all mismatches.

Comparative signature analysis between cardiomyocytes, neurons, hepatocytes and lymphocytes shows shared and distinct mutational signatures (Fig. 3c). The age-dependent accumulation of Signature A was observed in all four cell types (P < 0.001, mixed-effects regression; Fig. 3c); we did not observe a significant difference in the rate of increase between cardiomyocytes and neurons (P = 0.24), which might suggest a similar ‘clock-like’ accumulation of such mutations among postmitotic cell types. Signature C also increased with age in all four cell types, but at a relatively similar rate (Fig. 3c). On the other hand, the dramatic increase in the contribution of Signature D with age was observed only in cardiomyocytes, not in neurons, hepatocytes or lymphocytes (P < 0.001 between cardiomyocyte and other cell types, mixed-effects regression; Fig. 3c). sSNVs that accumulated in aging cardiomyocytes showed broader substitution types with enrichment in untranscribed strands, such as T > C and T > G mutations, than those in neurons, whereas age-accumulated sSNVs in neurons showed enrichment of C > T and T > C mutations in transcribed strands (Fig. 3e). This putative MMR-related Signature D, which accumulates specifically in aged cardiomyocytes, seems to represent a distinct mutational process in the heart. The uniqueness of the signature to heart cells—both diploid and tetraploid—also argues strongly against this signature representing any sort of universal technical artifact.

Potential sources and mechanisms of mutation formation in the aging heart.

To understand how mutations can be formed and accumulate during aging in the absence of cell cycling, it is important to recognize that MMR, BER and NER all involve steps of exonuclease removal of a segment of DNA, followed by replication of the remaining strand to reconstitute double-stranded DNA, using the nondamaged strand as template. Although oxidative base lesions are commonly repaired via the BER pathway, and NER is the main pathway for the repair of bulky adducts and other helix-distorting lesions, recent evidence has suggested a role for NER proteins in the repair of oxidative damage through interactions with BER proteins^38–40. Our analysis of mutational signatures (Fig. 3b,d) suggested a model in which oxidative stress leads to an increased burden of damaged bases, which might overwhelm the NER, BER and MMR machinery in aged cardiomyocytes.

Using the RNA-seq expression data from the GTEx⁴¹ portal, which compiles data from 168 nondiseased donors with available heart and brain tissue gene expression profiles, we observed an overall lower gene expression for the core components of the MMR complex (MLH3, MSH2, MSH3, MSH6, PMS1 and PMS2) in heart samples compared with brain samples (P = 2.5 × 10⁻⁸, two-tailed paired Wilcoxon test; Fig. 4a), and a significantly stronger decrease of those gene expression levels with aging in heart samples (P = 0.04, linear regression; Fig. 4a), suggesting compromised MMR activity in the aged heart, with the exception of one MMR complex protein, MLH1. Existing evidence from primary endometrial cancer studies indicates a highly variable expression pattern of MMR complex proteins even in clinical cases, and the decrease of one or more MMR complex proteins is considered damaging⁴². The age-dependent decrease in gene expression in human cardiomyocytes was further confirmed for most MMR complex genes in a recent single-cell RNA-seq dataset⁴³.

Fig. 4 | — a, mRNA expression level of MMR complex components indicating significant downregulation of MMR genes with increased age in heart samples but not in brain samples (n = 168 donors). Heart versus brain (P = 8.4 × 10⁻⁸, two-tailed paired Wilcoxon test); age effect in heart (P = 0.04, linear regression). Boxplot with whisker denotes minimum, 25%, median, 75% quartiles and maximum. b, Aged heart (n = 5 donors) showing significantly higher oxidative damage than infant heart (n = 5 donors), quantified by 8-OHdG assay (P = 0.008, two-tailed Wilcoxon test). Error bar, ± s.e.m. c, Association of sSNVs with gene expression level. Signature A accumulated more sSNVs in highly expressed genes in both aged cardiomyocytes and neurons (P = 0.09 and 0.004, linear regression). Signatures C and D are dominant in aged cardiomyocytes without significant correlation with gene expression level. n = 10,407 and 19,101 sSNVs of tetraploid cardiomyocytes and neurons. Error bar, ± 95% confidence interval. d, Aged cardiomyocytes showed excess exonic and nonsynonymous sSNVs when compared with germline mutations (two-tailed Fisher’s exact test), suggesting relaxed constraint of negative selection; n = 10,407 and 19,101 sSNVs of tetraploid cardiomyocytes and neurons. Error bar, ± s.e.m. e, Cardiomyocyte-specifically enriched GO categories (FDR-adjusted P < 0.05, permutation test); x axis denotes the enrichment P value for sSNVs in cardiomyocytes and neurons separately. Cardiac sSNVs are enriched in MMR pathways and pathways involved in metabolism and kinase signaling. f, Prediction model for the effect of sSNVs on abundance of KO tetraploid cardiomyocytes (n = 48 cells from 12 donors). The tetraploid model demonstrates a significantly lower KO rate than the diploid model (P = 4.9 × 10⁻⁴, two-tailed paired Wilcoxon test), especially in aged hearts, implying the protective impact of polyploidization against deleterious sSNV accumulation. Error bar, ± s.e.m. 
g, Mechanism of sSNV occurrence in heart and adaptation of cardiomyocyte to the polyploid genome. Cardiomyocytes with increased age show increased level of oxidative stress and ROS. DNA repair pathways including MMR, NER and BER in aged cardiomyocytes might not function effectively to repair this increased load of DNA damage and lead to the accumulation of sSNVs. Cardiomyocytes with higher ploidy can better tolerate the deleterious effect of these mutational burdens.

We also examined the expression of NER and BER pathway genes among GTEx heart and brain samples (Extended Data Fig. 7). We observed a twofold slower reduction of gene expression for NER and BER pathway genes in heart samples with age (−0.014 year⁻¹; P = 0.22 and 0.04) than the MMR pathway (−0.030 year⁻¹). This suggested that the MMR pathway might be affected more severely during the aging of cardiomyocytes, which could potentially explain the cardiomyocyte-specific accumulation of sSNVs from the MMR-related Signature D.
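The comparison above rests on the slope of expression against age for each pathway. A minimal ordinary least-squares sketch follows. The reported slopes (−0.014 and −0.030 year⁻¹) come from the text, while the toy age and expression values are hypothetical and serve only to show how such a slope is computed. The regression model actually used, including any covariates, is not described in this section.

```python
def ols_slope(x, y):
    """Slope of the least-squares line y = a + b * x."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx


# Hypothetical donor ages (years) and normalized expression values.
ages = [20, 40, 60, 80]
expression = [3.0, 2.4, 1.8, 1.2]
print(ols_slope(ages, expression))

# Ratio of the slopes reported in the text (MMR versus NER/BER).
fold_difference = -0.030 / -0.014
print(round(fold_difference, 2))
```

The ratio of the two reported slopes is a little over 2, which is the basis for describing the NER and BER decline as roughly twofold slower than that of MMR.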

Under physiological conditions, ROS are scavenged by the antioxidant system, but when the ROS concentration is very high, oxidative damage occurs to DNA. Guanosine is the most oxidized among the DNA nucleobases, with 8-hydroxy-2-deoxyguanosine (8-OHdG) being a product of oxidative DNA damage and considered as a biomarker of oxidative stress. High levels of 8-OHdG have been correlated with various age-related cardiovascular disease progression⁴⁴, but the exact causal relationship has not been fully elucidated. We directly assessed potential oxidative damage in the left ventricular cardiomyocytes (n = 10, five infant and five aged donors) using an enzyme-linked immunosorbent assay (Methods). We found that the level of 8-OHdG in aged hearts was more than twice as high as that in infant hearts (P = 0.008, two-tailed Wilcoxon test; Fig. 4b). These data suggest that increased oxidative stress leads to elevated 8-OHdG, which may overwhelm DNA repair systems, and contributes, at least in part, to the increased DNA mutational burden.

Functional impact of sSNVs in the aging process.

To further investigate whether the occurrence of sSNVs is associated with defective gene transcription and function, we stratified and compared cardiac and neuronal sSNVs by using the expression profiles of corresponding tissues in GTEx. By estimating the mutational signature composition of genic sSNVs in each gene expression quartile and for each age group, we found that Signature A sSNVs are enriched in highly expressed genes at a similar level in both aged cardiomyocytes and aged neurons (Fig. 4c, upper), suggesting Signature A as a common transcription-associated signature⁴⁵. In contrast, Signatures C and D showed higher contributions in aged cardiomyocytes than in aged neurons, without a strong association with gene transcription level (Fig. 4c, middle and lower), indicating that Signatures C and D might result from mechanisms distinct from Signature A, and relatively specific to cardiomyocytes. We found a higher proportion of exonic and nonsynonymous mutations in aged cardiomyocytes when compared with germline heterozygous SNVs (Fig. 4d and Extended Data Fig. 8), which could be explained by the relaxed constraint of negative selection in the somatic context. Gene ontology (GO) analysis found that genes involved in mismatch repair, mitochondria organization and phosphatidylinositol 3-kinases (PI3Kinase) pathways showed sSNV enrichment in aged cardiomyocytes (false discovery rate (FDR)-adjusted P < 0.05, permutation test; Fig. 4e). More specifically, we observed protein-altering deleterious somatic mutations in kinase pathway genes of aged cardiomyocytes (Supplementary Table 5), such as WNK2 (ref. ⁴⁶), TAOK3 (ref. ⁴⁷) and BAZ1B⁴⁸, which play key roles in the regulation of electrolyte homeostasis, cell signaling survival, proliferation activities, and DNA damage response. Additionally, protein-altering mutations were identified in ZMYM6 (ref. ⁴⁹) and DOCK6 (ref. ⁵⁰) that are associated with cytoskeletal organization (Supplementary Table 5).

Although many heterozygous mutations would likely compromise cardiomyocyte function, it is expected that deleterious gene ‘knockout’ (KO) mutations in haploinsufficient genes would be especially damaging if all the alleles are affected, and that there may be a threshold for the accumulation of such KOs above which cardiomyocyte function would deteriorate. Higher ploidy cells with more copies of chromosomes may potentially represent adaptive mechanisms to guard cells against these KO mutations. Therefore, we compared the accumulation of gene KOs in diploid versus tetraploid cardiomyocytes using a prediction model. In this model, at least two coincident deleterious sSNV events in a diploid cell, or four deleterious sSNVs within one gene in a tetraploid cell, cause loss of function. Diploid cardiomyocytes had an average probability of 0.2% of getting one or more genes completely knocked out by age 60 years, with this probability increasing to 1% by age 80 years, implying that a substantial fraction of cardiomyocytes would carry damaging mutations (Fig. 4f). In contrast, tetraploid cardiomyocytes showed a significantly lower probability of gene KO (P = 4.9 × 10⁻⁴, two-tailed paired Wilcoxon test), with less than 0.2% of cells with KO genes at age of 80 years (Fig. 4f). These data strongly suggest that tetraploid cardiomyocytes are more effective in averting the loss of gene function caused by age-related mutations (Fig. 4g).

Discussion

Our data show that each individual cardiomyocyte has a profoundly distinctive genome, harboring sSNVs accumulated throughout the lifetime. After controlling for the genomic size difference between tetraploid and diploid cells, cardiomyocytes accumulate age-related sSNVs at rates higher than neurons and lymphocytes, but similar to hepatocytes. This finding indicates that the heart and liver, two highly metabolically active organs, harbor a higher load of somatic mutations and have the tendency to become polyploid, potentially to endure oxidative stress. Although the accumulation rates of hepatocytes and cardiomyocytes are similar, cardiomyocyte sSNVs display distinct signatures of mutagenic processes, with a predominant contribution of Signature D that has been associated with defects in the MMR pathway.

Existing literature shows that cardiomyocyte genome ploidies and nuclear counts vary widely across different mammalian species¹⁹. Polyploidization is a characteristic feature of mammalian cardiomyocytes and can be stress-induced and/or developmentally programmed^51,52. Polyploidization not only plays a role in increasing cell size and metabolic output, but also promotes alterations in the transcriptome and metabolome. Polyploidy also frequently confers resistance to environmental stresses not tolerated by diploid cells. Our prediction models show that tetraploid cardiomyocytes have a significantly lower probability of complete gene KO compared with diploid cardiomyocytes, indicating that cardiomyocyte polyploidization potentially offers a mechanism to ameliorate the deleterious effects of this rapid mutation accumulation. Human cardiomyocytes initially are mainly diploid, though tetraploid cardiomyocytes begin to appear soon after birth. The formation of hexaploid, octoploid and potentially higher ploidy cells arises with advanced age.

Our data indicate that aging results in increased generation, decreased repair, or both, of oxidative DNA lesions. Age-related myocardial sSNVs have distinctive C > A mutations, which are hallmarks of oxidative damage, and direct quantification by enzyme-linked immunosorbent assay shows increased oxoguanine in the aged heart. Decomposition of Signature C with COSMIC signatures indicates significant contributions of SBS8 and SBS18. This finding strongly suggests that the damaged guanine accumulates via disrupted NER, linked to SBS8, as well as disrupted BER, linked to SBS18. Most likely these C > A mutations are from the accelerated accumulation of oxidized nucleotides that overwhelm these repair pathways, since our finding from GTEx expression data indicates that expression of both NER and BER pathway genes is changed only mildly during heart aging. Oxidized guanine nucleotides reflect the presence of increased ROS, which has previously been associated with various cardiovascular diseases, and can be generated by a variety of processes including inflammation and mitochondrial dysfunction that are well studied in cardiovascular disease. Our findings suggest that increased oxidative stress leads to elevated 8-OHdG, which may overwhelm NER and BER repair machinery, resulting in increased DNA mutational burden. Another mutational signature that is uniquely enriched in cardiomyocytes compared with neurons suggests a role of defective MMR in mutation generation. It is known that failure of DNA MMR is associated with a strikingly elevated rate of base substitution mutations and, as a consequence, tumors with MMR deficiency are amongst those with the highest number of somatic mutations. Our findings of increased sSNV counts in the aged heart suggest that MMR may not be efficient at correcting mismatched nucleotides in aged cardiomyocytes, contributing to the large increase in sSNVs.

Here we show that the number of sSNVs and the likelihood of disrupting essential gene function in human cardiomyocytes increases significantly with age, suggesting that the age-related cellular dysfunction in aged cardiomyocytes could be due partially to somatic mutations, although more studies will be needed to draw a causal relationship between mutational burden and age-associated decrease in cardiac function. We identified a total of 10,407 sSNVs from 48 tetraploid cardiac nuclei by LiRA. Among 75 coding sSNVs, 36 were predicted to be damaging (Supplementary Table 5). We also observed seven sSNVs shared by two cardiomyocytes (no sSNVs shared by three or more cardiomyocytes); six of these were shared by cardiomyocytes obtained from the same donor, indicating early somatic mutations in the common lineage of these cardiomyocytes. More comprehensive measurements of somatic mutation from healthy and diseased myocardium will be needed to decipher the functional impact of somatic mutation in aging and heart disease. In this study, we investigated only single nucleotide variants, but cardiomyocytes could also carry other types of somatic mutations, including indels and structural variations. MDA requires extensive quality control to identify well-amplified samples, so that our samples are biased towards cells that amplified well and evenly, that is, cells more likely to have intact genomes. Importantly, the sSNV increase rate in neurons amplified by MDA (19 sSNV per cell genome per year) is consistent with the rate estimated by a recently developed duplex sequencing protocol⁶ without genome amplification (20 sSNV per cell genome per year), suggesting the accuracy of MDA-based sSNV analysis. Newer methods, such as primary template amplification (PTA)⁷ or META-CS⁵³ may provide an improved means to study somatic mutations in diploid and multiploid cardiomyocytes with broader genomic coverage for variant calling and better distinction of sSNVs from single-stranded lesions. 
Nevertheless, these results provide an early view into the mutational landscape of terminally differentiated cardiomyocytes. Our analysis of human cardiomyocytes lays a foundation for better understanding of the genomic landscape and mechanisms driving mutation accumulation in aging cardiomyocytes that may help develop new treatments to reduce age-related cardiomyocyte dysfunction.

Methods

Human tissues and DNA sample preparation.

This study was approved by the Boston Children’s Hospital institutional review board. Samples were processed according to a standardized protocol under the supervision of the National Institutes of Health (NIH) NeuroBioBank ethics guidelines. Research on these deidentified specimens and data was performed at Boston Children’s Hospital with approval from the Committee on Clinical Investigation (S07-02-0087 with waiver of authorization, exempt category 4). All human tissues were obtained from the NIH NeuroBioBank at the University of Maryland. Once we received the tissue from the BioBank, DNA degradation evaluation was performed by isolating DNA from the tissue and performing gel electrophoresis as well as a Genomic DNA Screen Tape Station. Tissues with fragmented DNA were not selected for further studies.

Statistics and reproducibility.

No statistical methods were used to predetermine sample size. The experiments were not randomized, and the investigators were not blinded to allocation during experiments and outcome assessment.

Isolation of cardiomyocyte nuclei.

Cardiac nuclei were isolated using a density sedimentation protocol¹³. Briefly, 100 mg of cardiac tissue from the left ventricle was homogenized in 5 ml of ice-cold lysis buffer (0.32 M sucrose, 5 mM CaCl₂, 3 mM magnesium acetate, 2.0 mM EDTA, 0.5 mM EGTA, 10 mM Tris-HCl (pH 8.0) and 1 mM dithiothreitol (DTT)). The suspension was further dounced (20 strokes) with a type A pestle in a glass douncer (Sigma). The homogenized solution was filtered through 100 and 70 μm strainer (Pluriselect) and centrifuged for 7 min at 700g at 4 °C and the crude nuclear pellets were resuspended in 2.1 M sucrose solution (2.1 M sucrose, 3 mM magnesium acetate, 1 mM DTT and 10 mM Tris-HCl, pH 8.0). This was then layered onto a cushion of 5 ml 2.1 M sucrose solution and centrifuged at 30,000g for 1 h at 4 °C. The pellet from each tube was then resuspended with 1.5 ml nuclei storage buffer (0.43 M sucrose, 70 mM KCl, 2 mM MgCl₂, 10 mM Tris-HCl (pH 7.2) and 5 mM EGTA) for further analysis.

Flow cytometry.

Accurate identification of cardiomyocyte nuclei is crucial for the analysis of this study. Single cardiac nuclei were isolated using FANS-based cardiac troponin T staining and nuclear DAPI intensity, using FACSAria (20 psi, 100-μm nozzle, Becton Dickinson Biosciences). Cardiac nuclei were identified using a sequential gating strategy. Initial size gates for FSC versus SSC were set to select the large cardiac nuclei with high FSC and SSC corresponding to larger and more granular cells. Cell doublet discrimination was performed by a combination of high forward scatter height and area FSC-H/FSC-A and SSC-H versus SSC-W plots. H versus W or A allows separating the doublets from the single-cells containing 4n amounts of DNA. Cells containing 4n amounts of DNA have double the A and H values whereas W is roughly the same as cells containing 2n amounts of DNA. Cardiomyocytes are the only tetraploid cell in cardiac tissue, so ploidy is a convenient way to purify them. However, to rule out any possibility that tetraploid cells accumulate mutations in different ways from diploid cells, or that the amplification, sequencing and calling process performs differently in tetraploid cells, we carried out replicate analysis of diploid cells by isolating diploid cardiomyocytes using cardiomyocyte-specific markers (cardiac troponin T). The genome of every single nucleus was amplified using MDA²⁰.

Ploidy quantification by imaging flow cytometry and karyotyping.

We used a FlowSight Imaging Flow Cytometer to combine the advantages of traditional flow cytometry and microscopy to verify the cardiac nuclei ploidy. Left ventricular cardiomyocyte nuclei were isolated and stained with Hoechst (Invitrogen, catalog no. H3570; 1:1,000) and the cardiomyocyte marker cardiac troponin T (Abcam, catalog no. ab56357; 1:250). The nuclei were resuspended in PBS at a concentration of 2 × 10⁷ cells ml⁻¹. The DNA content was detected by a FlowSight Imaging Flow Cytometer (Luminex), equipped with a ×20 objective lens, and analyzed by image analysis software (IDEAS). The percentage of 2n, 4n and greater than 4n ploidy was determined by setting gates using the calibration with nuclei of noncardiomyocytes at 2n. The software separated the cardiomyocyte nuclei using brightfield, Hoechst and Troponin T images.

Chromosomes were visualized by Giemsa staining (GIBCO KaryoMAX Giemsa Stain Stock Solution, catalog no. 10092–013) according to the manufacturer’s protocol. Briefly, sorted cardiac nuclei from 1465 were treated with a hypotonic solution (0.075 M KCl) and preserved in their swollen state with Carnoy’s fixative; the nuclei were then dropped onto slides and air-dried. The slides were stained with freshly prepared Giemsa staining solution (3:1 ratio of Gurr Buffer and Giemsa Stain) and visualized at ×100 magnification.

Expression analysis of sorted cardiac nuclei.

ddPCR assays were performed to test cardiomyocyte-specific gene expression using TaqMan probes for PLN (Hs01848144), TNNT2 (Hs00943911), MYH7 (Hs01110632), CD31 (Hs00169777_m1) and PDGFB (Hs01019589). A QX100 Droplet Digital PCR System (Bio-Rad) was used with standard parameters. We measured the numbers of droplets that were positive and negative for each gene using QuantaSoft software.

Library preparation and WGS.

We sequenced 48 tetraploid and 8 diploid single cardiac nuclei from 12 individuals with ages from 0.4 to 82 years, including three infant (<4 years), six middle-aged (30–66 years) and three aged (>75 years) individuals (Table 1). From each of the 12 individuals, we sequenced four single tetraploid cardiac nuclei. Low coverage library preparations were carried out according to the manufacturer’s instructions (QIAseq FX single-cell DNA Kit). MDA-amplified DNA libraries were prepared with the Illumina TruSeq Nano LT sample preparation kit. Bulk DNA was extracted using the QIAamp DNA Mini kit with RNase A treatment. Paired-end sequencing (150 bp × 2) was performed on a HiSeq X Ten instrument. Single-cell and bulk WGS library preparations and sequencing were done at Macrogen Genomics. The sequencing depth of cardiomyocytes was comparable with that of neurons we previously studied (Extended Data Fig. 9).

Read alignment and postprocessing.

Reads generated from single-cell WGS were aligned against the GRCh37 human reference genome by BWA (v.0.7.15) with default parameters. Duplicate reads were marked by MarkDuplicates of Picard (v.2.8) and then postprocessed with local realignment around indels and base quality score recalibration using Genome Analysis Toolkit (GATK) (v.3.5).

Measuring the evenness of genome amplification in single-cells.

We measured the evenness of genome amplification in single-cells using two metrics: MAPD and CoV. MAPD is the median value of absolute differences between the copy number ratios of neighboring bins with variable lengths, where bins were divided to have the same number of uniquely mapping reads. CoV is the measure of variance of bin-wise copy number ratios, calculated by taking the ratio of their standard deviation to the mean. Both higher MAPD and CoV scores represent greater unevenness of single-cell genome amplification. Binning, GC normalization, segmentation and copy estimation were performed following the previous single-cell copy number analysis protocol⁵⁴, to obtain the copy number ratio per bin and calculate MAPD and CoV scores.
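The two evenness metrics can be illustrated with a minimal sketch that follows the definitions above. The function names and the toy copy number ratios are hypothetical; the binning, GC normalization and segmentation steps are not reproduced, and the use of the population standard deviation (and of untransformed ratios) is an assumption, as the text does not specify either.

```python
from statistics import mean, median, pstdev


def mapd(ratios):
    """Median of absolute differences between copy number ratios of neighboring bins."""
    diffs = [abs(b - a) for a, b in zip(ratios, ratios[1:])]
    return median(diffs)


def cov(ratios):
    """Coefficient of variation: standard deviation of bin-wise ratios over their mean."""
    return pstdev(ratios) / mean(ratios)


# Toy per-bin copy number ratios (invented for illustration only).
even_cell = [1.0, 1.1, 0.9, 1.0, 1.05, 0.95]
uneven_cell = [1.0, 2.0, 0.3, 1.6, 0.2, 0.9]

# Higher scores indicate less even amplification.
print(mapd(even_cell), cov(even_cell))
print(mapd(uneven_cell), cov(uneven_cell))
```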

Estimation of amplification dropout rates in single-cell WGS data.

Germline heterozygous SNVs were identified from bulk WGS data using HaplotypeCaller of GATK with default parameters (-stand_call_conf 30.0 -stand_emit_conf 30.0 -ploidy 2)⁵⁵, and only those reported by the 1000 Genomes Project were subsequently considered as high-confidence variants to estimate the dropout rate. For each single-cell, a germline heterozygous SNV was considered a locus dropout if the total coverage in single-cell WGS was less than 5× and an allelic dropout if the number of reads supporting either the reference or the mutant allele was less than two. The genome-wide locus- and allele-dropout rates were then calculated as the proportions of dropout sites among all germline heterozygous SNVs.
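As a minimal sketch of the classification rule above, the following assumes that total coverage equals the sum of reference- and mutant-supporting reads and that a site failing the coverage threshold is counted only as a locus dropout; neither detail is stated in the text. Function names and read counts are hypothetical.

```python
def classify_site(ref_reads, alt_reads, min_total=5, min_allele=2):
    """Label one germline heterozygous SNV in a single cell."""
    total = ref_reads + alt_reads  # assumption: coverage = ref + alt reads
    if total < min_total:
        return "locus_dropout"
    if ref_reads < min_allele or alt_reads < min_allele:
        return "allelic_dropout"
    return "ok"


def dropout_rates(sites):
    """Proportion of locus and allelic dropout among all heterozygous sites."""
    labels = [classify_site(ref, alt) for ref, alt in sites]
    n = len(labels)
    return {
        "locus": labels.count("locus_dropout") / n,
        "allelic": labels.count("allelic_dropout") / n,
    }


# Toy (ref, alt) read counts at five heterozygous sites (invented).
sites = [(10, 12), (0, 3), (15, 1), (8, 0), (20, 20)]
rates = dropout_rates(sites)
print(rates)
```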

Somatic SNV calling from single-cell WGS data.

We performed phasing-based linked read analysis using the LiRA method (v.2018Feb)²¹ to identify sSNVs in single-cells using around 30× WGS data of nonheart tissue from the same individual as bulk germline controls. As input to LiRA, SNVs from each single-cell and bulk sample were called using the HaplotypeCaller from GATK with default parameters (-stand_call_conf 30.0 -stand_emit_conf 30.0 -ploidy 2); germline SNVs identified from bulk samples were further phased by Shapeit 2 (v.904)⁵⁶. LiRA distinguishes true somatic mutations from base-calling or amplification errors by leveraging the linkage information between the somatic candidate and adjacent phased germline mutations. For each single-cell, the threshold for the number of phasable reads supporting the mutant allele was calculated to achieve a true positive rate of more than 90%. The detection sensitivity in phasable regions was estimated from germline SNVs and then used to adjust the LiRA-detected sSNV count to the real count. sSNV density per megabase was calculated for each single-cell, taking into account the doubled genomic size of tetraploid cardiomyocytes. We considered sSNVs in autosomes only to avoid potential detection bias in sex chromosomes between sexes.
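The final two steps (sensitivity adjustment and ploidy-aware density) can be sketched as follows. This is a simplified illustration and not LiRA's own procedure: the function name, the single sensitivity value, and the round genome size used in the example are all hypothetical placeholders.

```python
def ssnv_density_per_mb(called, sensitivity, diploid_genome_mb, ploidy=2):
    """Sensitivity-adjusted sSNV density per megabase.

    called            -- number of sSNVs detected in the cell
    sensitivity       -- detection sensitivity estimated from germline SNVs (0-1)
    diploid_genome_mb -- callable diploid autosomal genome size in Mb (placeholder)
    ploidy            -- 2 for diploid, 4 for tetraploid cells
    """
    estimated_count = called / sensitivity
    genome_mb = diploid_genome_mb * ploidy / 2  # tetraploid genome is doubled
    return estimated_count / genome_mb


# Invented inputs: 300 calls at 50% sensitivity, 6,000 Mb diploid genome.
diploid_density = ssnv_density_per_mb(300, 0.5, 6000.0, ploidy=2)
tetraploid_density = ssnv_density_per_mb(300, 0.5, 6000.0, ploidy=4)
print(diploid_density, tetraploid_density)
```

With the same adjusted count, the tetraploid density is half the diploid density because the count is spread over twice the genomic size.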

Somatic SNV burden correction for tetraploid cells.

Each tetraploid cardiomyocyte contains two sets of diploid genomes (that is, four haplotypes). Theoretically, somatic mutations present in one out of the four haplotypes can be called by LiRA only when the reads violating complete linkage were lost due to allelic or locus dropout (Extended Data Fig. 10a). In comparison, germline mutations can be called in the same way as in the diploid cells. Therefore, the LiRA-estimated power from germline SNVs should be corrected by cell-specific allelic or locus dropout rates before applying to the count of identified sSNVs in tetraploid cells.

Theoretically, the dropout status of a site in a tetraploid cell (S_4n) was determined by the status of its two diploid origin cells (Extended Data Fig. 10b): a locus dropout happened in a tetraploid cell only when both of its diploid origin cells were locus dropout, whereas an allelic (reference or mutant allele) dropout happened in a tetraploid cell when two copies of the corresponding alleles from two diploid origin cells were lost by locus or allelic dropout. Using simulated tetraploid cells by an in silico mixture of single-cell WGS data from two diploid cells from the same individual, we observed that the locus dropout rate in simulated tetraploid cells generally equals the product of the dropout rates in the two original diploid cells, suggesting independence of dropout occurrences across the genome between MDA-amplified cells.

Cell-specific allelic and locus dropout rates were estimated for each tetraploid cell, and the dropout rate for its two diploid origins was then calculated from the above matrix under the assumption of equal rates between the two diploid origins. The count and density of LiRA-called sSNVs were further adjusted by the locus and allelic dropout rate of the corresponding diploid origins and the sensitivity loss due to the decrease of per-haplotype sequencing depths from diploid to tetraploid cells.
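For locus dropout only, the relationship described above can be sketched as follows: the tetraploid rate is taken as the product of the two diploid-origin rates, so under the equal-rate assumption the diploid-origin rate is the square root of the tetraploid rate. The allelic dropout case depends on the status matrix in Extended Data Fig. 10b and is not reproduced here; function names are hypothetical.

```python
import math


def tetraploid_locus_dropout(diploid_rate_1, diploid_rate_2):
    """Locus dropout in a tetraploid cell requires dropout in both diploid origins."""
    return diploid_rate_1 * diploid_rate_2


def diploid_origin_locus_dropout(tetraploid_rate):
    """Invert the product under the assumption of equal rates in both origins."""
    return math.sqrt(tetraploid_rate)


print(tetraploid_locus_dropout(0.2, 0.3))
print(diploid_origin_locus_dropout(0.04))
```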

Mutational signature analysis.

Mutational signatures were decomposed de novo by the NMF-based mutational signature framework²⁷ using MutationalPatterns (v.1.8.0), using the 96 trinucleotide contexts of sSNVs detected from tetraploid cardiomyocytes in this study as well as nondisease neurons that were previously studied¹⁰. For most of the downstream analyses, except the regression modeling of sSNV density where we used all neurons to cover the whole age span, only neurons from the same donors of the studied cardiomyocytes were considered to better control for donor background. We estimated signature stability and reconstruction error and found that four signatures best fit the observed sSNV profiles (Extended Data Fig. 4). We then compared our de novo signatures (Signatures N1, N2, N3 and N4) with previously reported signatures in neurons (Signatures A, B and C)¹⁰ and signatures potentially resulting from MDA artifacts (SBS scE and scF)²². As shown in Extended Data Fig. 5, Signatures N1, N2 and N3 resemble Signature B (as well as SBS scF), A and C, respectively, whereas Signature N4 did not show high similarity to any of these signatures, suggesting a potential cardiac-specific signature (renamed as Signature D).
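Signature comparisons of this kind are commonly quantified by cosine similarity over the 96 trinucleotide channels, the measure used for the COSMIC comparison in Extended Data Fig. 6; that the same measure underlies the comparison described above is an assumption. A minimal sketch with toy four-channel vectors (real signatures have 96 channels):

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two non-negative signature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Toy signature profiles (invented; each sums to 1).
sig_de_novo = [0.50, 0.30, 0.15, 0.05]
sig_known_similar = [0.45, 0.35, 0.15, 0.05]
sig_known_different = [0.05, 0.05, 0.10, 0.80]

print(cosine_similarity(sig_de_novo, sig_known_similar))
print(cosine_similarity(sig_de_novo, sig_known_different))
```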

To remove the potential contamination of MDA artifacts, we decomposed the sSNV profile of each single-cell into Signatures N2/A, N3/C, N4/D as well as SBS scE and scF (Signature N1/B was not included because it was nearly identical to SBS scF) using MutationalPatterns. We then calculated the signature-specific sSNVs density for each single-cell by multiplying the overall sSNV density and the cell-specific weight for each signature. The contributions of SBS scE and scF were subtracted from the overall sSNV density for subsequent burden analyses.
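The density bookkeeping in this step can be sketched as follows. The signature fitting itself (performed with MutationalPatterns) is not reproduced; the weights below are invented, and normalizing them to sum to one is an assumption.

```python
def signature_densities(overall_density, weights):
    """Split a cell's overall sSNV density by its per-signature weights."""
    total = sum(weights.values())
    return {name: overall_density * w / total for name, w in weights.items()}


def artifact_corrected_density(overall_density, weights, artifacts=("scE", "scF")):
    """Subtract the contribution of artifact signatures from the overall density."""
    per_signature = signature_densities(overall_density, weights)
    artifact_part = sum(per_signature[a] for a in artifacts if a in per_signature)
    return overall_density - artifact_part


# Toy cell: overall density of 1.0 sSNV per Mb and invented signature weights.
weights = {"A": 0.4, "C": 0.2, "D": 0.2, "scE": 0.1, "scF": 0.1}
print(signature_densities(1.0, weights))
print(artifact_corrected_density(1.0, weights))
```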

Burden and list of sSNVs in hepatocytes and lymphocytes.

Genome-wide sSNV burdens and sSNV lists of MDA-amplified hepatocytes and lymphocytes profiled by single-cell WGS were extracted from Brazhnik et al.²⁴ and Zhang et al.²⁵, respectively. The contributions of SBS scE and scF were estimated and subtracted from the overall sSNV burden for each hepatocyte and lymphocyte, following the same strategy as that applied to each cardiomyocyte and neuron.

Mixed-effects modeling of somatic SNV density.

To study the age-dependent somatic mutation accumulation and the rate difference between cardiomyocytes and other cell types, we performed linear mixed-effects regression modeling using the lme4 (v.1.1–23) and lmerTest (v.3.1–2) R packages. Overall and signature-specific sSNV density per megabase was modeled as a continuous outcome. Age was modeled as a fixed effect, whereas donor and cell type were modeled as random effects, because cells from the same donor and cell type may share the biological environment and thus not be independent in terms of sSNV density. The maximum likelihood method was used to fit linear mixed-effects regression models.

To test the age effect of sSNV density in healthy individuals, we fitted the model y_ij = β × ρ_j + μ + θ_j + ε_ij where y_ij is the sSNV density in cell i of donor j, β is the fixed effect of age, ρ_j is the age of donor j, μ is the number of sSNVs at birth, θ_j is the random effect of each donor following a normal distribution with mean 0 and variance τ, and ε_ij is the measurement error of each cell following a normal distribution with mean 0 and variance σ_ij. We observed a significant age-association of sSNV density in tetraploid cardiomyocytes (P = 7.3 × 10⁻⁴; Fig. 2b), and further confirmed similar sSNV densities between tetraploid and diploid cardiomyocytes (Extended Data Fig. 2).

To test the difference in the age effect between different cell types, we fit the model y_ijk = (β + γ_k) × ρ_j + μ + θ_jk + ε_ijk where y_ijk is the sSNV density in cell i from cell type k of donor j, β is the fixed effect of age, γ_k is the fixed effect of cell type k on age (interaction terms between age and cell type), ρ_j is the age of donor j, μ is the number of sSNVs at birth, θ_jk is the random effect of the donor-cell type pair following a normal distribution with mean 0 and variance τ, and ε_ijk is the measurement error of each cell following a normal distribution with mean 0 and variance σ_ijk. As shown in Fig. 2e, we observed that cardiomyocytes showed an age effect that was significantly larger than neurons and lymphocytes (P = 5.0 × 10⁻³ and 8.7 × 10⁻⁴) but not hepatocytes (P = 0.53).

To control for potential confounding factors in sSNV detection, we introduced δ_ij into the previous models, where δ_ij denotes one of the potential confounding factors including MAPD and CoV scores (measurement of amplification evenness), sequencing depth, library insert size, number of heterozygous germline SNVs and PMI of donors. We confirmed that the age-dependent increase in cardiomyocytes and the cardiomyocyte-neuron difference remained statistically significant (P < 0.05), suggesting that the sSNV accumulation pattern we found in cardiomyocytes was very unlikely to be caused by these technical issues.

Mutation spectrum and strand bias analysis.

The LiRA-identified sSNVs were grouped into three categories according to the age of their cell donor: infant (<4 years), middle-aged (30–66 years) and aged (>75 years), and then the mutation spectrum and strand bias were calculated for each age category. The transcriptional strands of genic sSNVs were assigned based on the UCSC TxDb annotations by MutationalPatterns⁵⁷, where mutated bases (‘C’ or ‘T’) on the same strand as the gene direction were categorized as ‘untranscribed’ and on the opposite strand as ‘transcribed.’ To characterize sSNV accumulation during aging, we further estimated the mutation spectrum and strand bias for the net increase of sSNVs between infant and aged categories. Specifically, we first measured the absolute sSNV count for each mutation type by multiplying its proportion and the average sSNV burden for each age category, and then subtracted the sSNV count for each mutation type between infant and aged categories. The statistical significance of strand bias was determined by the Poisson test.
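The net-increase calculation can be sketched as below: proportions are converted to absolute counts using the average burden of each age category, and infant counts are subtracted from aged counts. All numbers are invented and only two mutation types are shown for brevity.

```python
def absolute_counts(proportions, mean_burden):
    """Convert a mutation-type spectrum (proportions) to absolute sSNV counts."""
    return {mtype: p * mean_burden for mtype, p in proportions.items()}


def net_increase(infant_props, infant_burden, aged_props, aged_burden):
    """Per-type sSNV gain between the infant and aged categories."""
    infant = absolute_counts(infant_props, infant_burden)
    aged = absolute_counts(aged_props, aged_burden)
    return {mtype: aged[mtype] - infant.get(mtype, 0.0) for mtype in aged}


# Toy spectra and average burdens (invented for illustration).
infant_props = {"C>A": 0.1, "C>T": 0.9}
aged_props = {"C>A": 0.3, "C>T": 0.7}
gain = net_increase(infant_props, 100, aged_props, 1000)
total_gain = sum(gain.values())
gain_spectrum = {mtype: g / total_gain for mtype, g in gain.items()}
print(gain, gain_spectrum)
```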

Gene expression analysis.

The expression matrix ‘Gene read counts’ (GTEx Analysis v.8) for left ventricle of the heart and frontal cortex (BA9) of the brain was downloaded from the GTEx portal, since these two regions are the corresponding source tissues for our single-cell cardiomyocytes and neurons, respectively. The per-gene expression value was then normalized for each tissue sample after estimating sample-specific size factor and dispersion as well as modeling tissue, age and gender factors using DESeq2 (ref. ⁵⁸) (v.1.24.0) with the recommended protocol and default parameters. To study the age-dependent changes in MMR activity in the heart and brain, we extracted the expression levels of core components of the MMR complex (MLH1, MLH3, MSH2, MSH3, MSH6, PMS1 and PMS2) as well as genes in the NER and BER pathways annotated by the KEGG database⁵⁹. Individuals with both heart and brain expression profiles were binned according to their ages. Individuals with ages less than 40 years and more than 70 years were excluded due to the small sample size (n ≤ 10). For the remaining 168 individuals, we calculated the average expression levels of the MMR, NER and BER genes in heart and brain samples, separately, and tested their association with age using the linear regression model.

To investigate the relationship between somatic mutation and gene expression, we assigned genes into four quartiles based on their average expression values in heart or brain samples across all GTEx individuals. Cardiac and neuronal sSNV densities were calculated for each quartile of genes, after normalizing by gene length and per-cell sSNV detection power. The standard deviation of sSNV density was estimated using a permutation test, in which the quartile classification of genes was shuffled randomly and the permuted sSNV densities were calculated for 1,000 rounds. We further performed an NMF-based mutational signature decomposition for sSNVs located in each quartile of genes, to estimate the relative contributions of Signature A, Signature C, Signature D, SBS scE and SBS scF for each quartile. The sSNV density for each signature was calculated by multiplying the overall sSNV density by the signature contribution. We also performed the above analysis by using the expression profiles from aged individuals (>75 years) only and observed robust results.
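A minimal sketch of the quartile stratification and the label-shuffling permutation is given below. It assumes rank-based quartiles with equal gene counts and omits the normalization by per-cell detection power; gene names, counts and lengths are invented, and the function names are hypothetical.

```python
import random
from statistics import pstdev


def assign_quartiles(expression):
    """Map each gene to an expression quartile (1 = lowest, 4 = highest) by rank."""
    ranked = sorted(expression, key=expression.get)
    n = len(ranked)
    return {gene: 1 + (4 * i) // n for i, gene in enumerate(ranked)}


def quartile_density(quartiles, ssnv_counts, gene_length_mb):
    """sSNVs per Mb of gene length within each quartile."""
    density = {}
    for q in (1, 2, 3, 4):
        genes = [g for g, gq in quartiles.items() if gq == q]
        length = sum(gene_length_mb[g] for g in genes)
        density[q] = sum(ssnv_counts.get(g, 0) for g in genes) / length
    return density


def permutation_sd(quartiles, ssnv_counts, gene_length_mb, rounds=1000, seed=0):
    """Standard deviation of quartile densities under shuffled quartile labels."""
    rng = random.Random(seed)
    genes = list(quartiles)
    labels = [quartiles[g] for g in genes]
    samples = {q: [] for q in (1, 2, 3, 4)}
    for _ in range(rounds):
        rng.shuffle(labels)
        permuted = quartile_density(dict(zip(genes, labels)), ssnv_counts, gene_length_mb)
        for q, value in permuted.items():
            samples[q].append(value)
    return {q: pstdev(values) for q, values in samples.items()}


# Toy data: eight genes of 1 Mb each with increasing expression.
expression = {f"g{i}": float(i) for i in range(1, 9)}
lengths = {gene: 1.0 for gene in expression}
counts = {"g8": 4, "g7": 2, "g3": 1}
quartiles = assign_quartiles(expression)
print(quartile_density(quartiles, counts, lengths))
print(permutation_sd(quartiles, counts, lengths, rounds=200))
```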

Functional enrichment analysis.

GO functional enrichment analysis was performed using GOseq (v.1.34.1). We assigned a binary value ‘0’ or ‘1’ to each RefSeq gene according to whether any sSNV was present in the gene in any single-cell and built the sSNV-gene table for cardiac and neuronal sSNVs separately. GOseq uses the Wallenius approximation method to test the enrichment of sSNVs for each GO term, after applying a probability weighting function to control for potential bias from gene length. Genes without any GO annotation were ignored when calculating the total gene count. GO terms with fewer than five member genes with sSNVs were excluded to avoid ascertainment bias. GO terms with more than 1,000 member genes were also excluded.

To identify GO terms that were specifically enriched in cardiomyocytes but not in neurons, we performed a permutation test among all GO terms with P < 0.01 for either cardiac or neuronal sSNVs. For each tested GO term, we compared the observed rank difference in GOseq’s P value between cardiac and neuronal sSNVs against the expected null distribution, which was estimated by 1,000 rounds of random shuffling of the sSNV-gene tables. The FDR method was applied to correct for multiple hypothesis testing.
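The text does not name the specific FDR procedure; the sketch below assumes the widely used Benjamini-Hochberg step-up adjustment, which may differ from what was actually applied. The P values are invented.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Toy permutation P values for four GO terms (invented).
adjusted = benjamini_hochberg([0.01, 0.04, 0.03, 0.5])
print(adjusted)
```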

Measurement of oxidative stress.

The levels of 8-OHdG and 8-oxo-Gua were measured in 250 ng total nucleic acids extracted from left ventricular cardiomyocytes using a competitive enzyme-linked immunosorbent assay kit (Cayman Chemical, catalog no. 589320) according to the manufacturer’s instructions. The samples were incubated for 1 h with a monoclonal antibody against 8-OHdG in a microtiter plate precoated with 8-OHdG and 8-oxo-Gua. The final color was developed by the addition of 3,3′,5,5′-tetramethylbenzidine, and absorbance was measured at 450 nm. The samples were diluted at 1:50 with enzyme immunoassay buffer before assaying.

Modeling accumulation of gene KOs in cardiomyocytes.

Accumulation of exonic, deleterious gene KO mutations might be detrimental to proper cell function. These mutations can be ‘biallelic’ in the case of diploid cells, or ‘quadallelic’ in the case of tetraploid cells. The number of sSNVs identified in this study were used to estimate the accumulation of gene KOs in single-cells, using an extension of the method described in Lodato et al.⁵. To account for genes that are highly dosage sensitive, and thus can be haploinsufficient, we included a factor to capture the probability of a mutation landing on an allele of a gene with a high pLI score. The pLI metric measures the probability of loss-of-function intolerance⁶⁰, and genes with pLI score greater than 0.90 are considered highly dosage sensitive. These high pLI score genes comprise 17% of all genes. Consequently, the calculation was computed as follows:

$$n = (\text{number of estimated sSNVs}) \times \frac{\text{total number of deleterious variants}}{\text{total number of variants}} \times p$$

$$d_{i} = \{\text{event that gene } i \text{ has at least one mutation}\}$$

$$\pi_{i} = \{\text{event that gene } i \text{ has a high pLI score}\}$$

$$D = \{\text{probability of a gene having a deleterious mutation}\}$$

$$\Pr(\mathrm{KO} \mid \pi, D, n) = \pi \times \left(1 - (1 - D)^{n}\right) + (1 - \pi)\left(1 - e^{-nD}\right)$$

where p is the ploidy factor that captures the probability of obtaining a mutation on the remaining alleles of the gene (for example, p = 0.5 for diploid genomes, and p = 0.125 for tetraploid genomes) and n is the expected number of deleterious mutations. Considering the similar sSNV density but halved genomic size in diploid cardiomyocytes when compared with tetraploid cardiomyocytes (Extended Data Fig. 2), the genome-wide sSNV burden in diploid cardiomyocytes of each individual was calculated as 50% of the burden in corresponding tetraploid cardiomyocytes. The mean and s.e.m. were calculated across all cells per individual, for tetraploid and diploid cardiomyocytes separately. Regressions were performed using an exponential model to capture the nonlinear trend of the probability of obtaining cells with KO genes with age. All calculations were performed using custom R scripts.
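The calculation above can be transcribed into a short sketch. The original analysis used custom R scripts that are not shown, so this Python version is only an illustration of the stated formula. Using 0.17 for π follows the stated fraction of high-pLI genes and is an assumption about how π was set; the burden, the deleterious fraction and D below are invented placeholders.

```python
import math


def expected_deleterious(n_ssnv, n_deleterious, n_variants, ploidy_factor):
    """n: estimated sSNVs x deleterious fraction x ploidy factor p."""
    return n_ssnv * (n_deleterious / n_variants) * ploidy_factor


def ko_probability(n, D, pi=0.17):
    """Pr(KO | pi, D, n) = pi*(1-(1-D)^n) + (1-pi)*(1-exp(-n*D))."""
    return pi * (1 - (1 - D) ** n) + (1 - pi) * (1 - math.exp(-n * D))


# Placeholder inputs: the diploid burden is taken as 50% of the tetraploid
# burden, as described in the text; all other numbers are hypothetical.
tetraploid_burden = 4000
D = 1e-4
n_diploid = expected_deleterious(tetraploid_burden * 0.5, 1, 100, ploidy_factor=0.5)
n_tetraploid = expected_deleterious(tetraploid_burden, 1, 100, ploidy_factor=0.125)
print(ko_probability(n_diploid, D), ko_probability(n_tetraploid, D))
```

Under these placeholder inputs the tetraploid probability is lower because the smaller ploidy factor outweighs the doubled burden.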

Reporting summary.

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Extended Data

Extended Data Fig. 3 | — A, Uncorrected sSNV density of tetraploid cardiomyocytes. B–C, sSNV density of tetraploid cardiomyocytes after correcting for two metrics about amplification evenness, MAPD (B) and CoV (C). D, sSNV density of tetraploid cardiomyocytes after excluding C > T mutations. Tetraploid cardiomyocytes showed consistent age-dependent accumulation of sSNVs, robust to amplification evenness (p = 3.9 × 10⁻⁴ for MAPD correction and 1.3 × 10⁻³ for CoV correction, mixed-effects model) and the exclusion of C > T mutations (p = 3.3 × 10⁻⁵, mixed-effects model).

Extended Data Fig. 4 | Signature metrics for *de novo* mutational signature analysis. *De novo* mutational signature analysis was performed using non-negative matrix factorization (NMF), in which the factorization rank determines the number of signatures used to decompose the target matrix of sSNVs. Four signatures maximized the cophenetic correlation coefficient and best fit the observed sSNV matrix.

Extended Data Fig. 5 | Comparison of mutational signatures identified in this and other studies. (Left panel) *De novo* mutational signatures identified from single human cardiomyocytes in this study. (Middle panel) Previously published signatures identified from single human neurons (Lodato et al.). (Right panel) Recently published signatures thought to represent artifacts of single-cell whole-genome amplification, SBS scE and scF, from a study of cultured cells (Petljak et al.). Signature D/N4 was present in cardiomyocytes but not in neurons. Signature B/N1, identified in cardiomyocytes and neurons, resembles the artifact signature SBS scF and was therefore excluded from further mutational burden and signature analyses in this study.

Extended Data Fig. 6 | Hierarchical clustering between cardiac and COSMIC cancer signatures. Unsupervised clustering was performed among single-cardiomyocyte-derived signatures (Signatures A, C, and D) and cancer-derived COSMIC signatures (SBS1–60), using cosine similarity across the 96 trinucleotide-context classes to measure pairwise distance. Signature A resembles SBS5 and Signature D resembles SBS44.
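
The cosine-similarity comparison described in this legend can be sketched in Python as follows. This is a minimal illustration rather than the pipeline used in the study: the function names are hypothetical, the two profiles are toy vectors rather than real signatures, and converting similarity to a distance as 1 − similarity is an assumption.

```python
import math

N_CONTEXTS = 96  # 6 substitution types x 16 flanking-base combinations


def cosine_similarity(a, b):
    """Cosine of the angle between two mutation-spectrum vectors."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("profiles must be non-zero")
    return dot / (norm_a * norm_b)


def cosine_distance(a, b):
    """Assumed distance for clustering: 1 - cosine similarity."""
    return 1.0 - cosine_similarity(a, b)


# Hypothetical toy profiles (not real signatures): a flat spectrum and one
# dominated by a single trinucleotide context.
flat = [1.0 / N_CONTEXTS] * N_CONTEXTS
peaked = [0.0] * N_CONTEXTS
peaked[0] = 1.0

print(round(cosine_similarity(flat, flat), 6))
print(round(cosine_distance(flat, peaked), 6))
```

A matrix of such pairwise distances between all signatures is the kind of input a hierarchical clustering routine would take.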

Extended Data Fig. 7 | mRNA expression level of NER (A) and BER (B) pathway genes in the GTEx heart and brain samples (n = 186 donors). Heart vs. brain (p = 3.7 × 10⁻¹⁵ and 1.9 × 10⁻¹⁵ for NER and BER, two-tailed paired Wilcoxon test); age effect in heart (p = 0.04 for BER, linear regression). Box plots with whiskers denote the minimum, 25% quartile, median, 75% quartile, and maximum.

Extended Data Fig. 8 | Distribution of sSNVs across different categories of genic annotation (A) and mutation type (B). sSNVs identified from tetraploid cardiomyocytes (n = 10,407) and neurons (n = 19,101) were grouped according to the age group of donors. Error bar, mean ± 95% CI.

Extended Data Fig. 9 | Sequencing depth and down-sampling performance for cardiomyocytes and neurons. A, No systematic difference in sequencing depth (p = 0.51, two-tailed Wilcoxon test) between tetraploid cardiomyocytes (n = 48) and neurons (n = 155). Eight outlier cardiomyocytes (in the dashed rectangle) from two donors were intentionally sequenced at doubled sequencing depth. Box plots with whiskers denote the minimum, 25% quartile, median, 75% quartile, and maximum. B, LiRA-estimated sSNV burden remained generally robust across sequencing depths at or above the average depth in neurons, denoted by the dashed line. Four outlier cardiomyocytes from (A) were randomly chosen, and their sequencing reads were down-sampled *in silico* to 25%, 50%, and 75% of the original sequencing depth. Error bar, mean ± 95% CI.

Extended Data Fig. 10 | Model for calling sSNVs from diploid and tetraploid cells by LiRA. A, In diploid cells, LiRA identified the complete linkage between each sSNV candidate and its adjacent germline heterozygous mutation, which distinguishes true sSNVs from technical artifacts. In tetraploid cardiomyocytes, sSNVs present in one of the four haplotypes could be called by LiRA when the reads violating complete linkage were lost due to allelic or locus dropout. B, Dropout status of a tetraploid cell as determined by its two diploid origin cells. Rows and columns denote the dropout status of the two diploid origin cells, respectively. mut-allele, mutant allele of the germline mutation; ref-allele, reference allele of the germline mutation.

Supplementary Material

Table S1. Ploidy case info (NIHMS1832113-supplement-Table_S1_Ploidy_Case_Info.xlsx, 11.9 KB)

Table S2. Sequencing info (NIHMS1832113-supplement-Table__S2_Sequencing_info.xlsx, 14.4 KB)

Table S3. Mutational site info (NIHMS1832113-supplement-Table__S3_Mutational_Site_info.xlsx, 785.4 KB)

Table S4. Mutational rate info (NIHMS1832113-supplement-table__S4_Mutational_rate_info.xlsx, 18.5 KB)

Table S5. Heart exonic mutation (NIHMS1832113-supplement-Table_-S5_heart_exonic_mutation.xlsx, 17.1 KB)

Acknowledgements

We thank R. S. Hill and J. N. Partlow for assistance; R. Mathieu from the Boston Children’s Hospital Flow Cytometry Core, the IDDRC Molecular Genetics Core Facility and S. Pechhold of the UMass Medical School Flow Cytometry Core Facility for imaging flow analysis; S. Sengupta from Arbor biotechnology for assisting with FlowJo analysis; and M. Bhaumik of the Boston Children’s Hospital Mouse Gene Manipulation Core for setting up the karyotyping experiment. We also thank the IDDRC Cellular Imaging Core, funded by NIH P50 HD105351 and S10OD016453, and the Research Computing group at Harvard Medical School for assistance. Human tissue was obtained from the NIH NeuroBioBank at the University of Maryland, and we thank the donors and their families for their invaluable donations for the advancement of science. S.C. is supported by the American Heart Association (SDG) and NHLBI (R01HL152063). C.A.W. and E.A.L. are supported by the Manton Center for Orphan Disease Research and the Allen Discovery Center for Human Brain Evolution, funded by the Paul G. Allen Frontiers Program. C.A.W. is an Investigator of the Howard Hughes Medical Institute. This work is supported in part by NINDS as a supplement to R01NS032457. E.A.L. is also supported by NIH (DP2AG072437; R01AG070921) and the Suh Kyungbae Foundation. E.A.M. is supported by NIGMS (T32GM007753) and NLM (T15LM007092). M.B.M. is supported by NIH (K08AG065502, T32HL007627), the Brigham and Women’s Hospital Program for Interdisciplinary Neuroscience through a gift from Lawrence and Tiina Rand, the donors of the Alzheimer’s Disease Research program of the BrightFocus Foundation (A20201292F), and the Doris Duke Charitable Foundation Clinical Scientist Development Award (2021183).

Footnotes

Code availability

The code used in this study is available upon reasonable request to the corresponding authors.

Competing interests

The authors declare that they have no competing interests.

Extended data is available for this paper at https://doi.org/10.1038/s43587-022-00261-5.

Supplementary information: The online version contains supplementary material available at https://doi.org/10.1038/s43587-022-00261-5.

Data availability

Single-cardiomyocyte WGS data are deposited in NCBI dbGaP under accession number phs002284.v1.p1. The data are available under controlled-use conditions set by human privacy regulations. Other data are available upon request.

References

1. Kannel WB & Vasan RS. Is age really a non-modifiable cardiovascular risk factor? Am. J. Cardiol 104, 1307–1310 (2009).
2. Strait JB & Lakatta EG. Aging-associated cardiovascular changes and their relationship to heart failure. Heart Fail. Clin 8, 143–164 (2012).
3. Sheydina A, Riordon DR & Boheler KR. Molecular mechanisms of cardiomyocyte aging. Clin. Sci 121, 315–329 (2011).
4. Franco I et al. Somatic mutagenesis in satellite cells associates with human skeletal muscle aging. Nat. Commun 9, 800 (2018).
5. Lodato MA et al. Somatic mutation in single human neurons tracks developmental and transcriptional history. Science 350, 94–98 (2015).
6. Abascal F et al. Somatic mutation landscapes at single-molecule resolution. Nature 593, 405–410 (2021).
7. Luquette LJ et al. Ultraspecific Somatic SNV and Indel Detection in Single Neurons Using Primary Template-Directed Amplification (Cold Spring Harbor Laboratory, 2021).
8. Martincorena I et al. Somatic mutant clones colonize the human esophagus with age. Science 362, 911–917 (2018).
9. Lee-Six H et al. The landscape of somatic mutation in normal colorectal epithelial cells. Nature 574, 532–537 (2019).
10. Lodato MA et al. Aging and neurodegeneration are associated with increased mutations in single human neurons. Science 359, 555–559 (2018).
11. Weakley SM et al. Role of somatic mutations in vascular disease formation. Expert Rev. Mol. Diagn 10, 173–185 (2010).
12. Helleday T, Eshtad S & Nik-Zainal S. Mechanisms underlying mutational signatures in human cancers. Nat. Rev. Genet 15, 585–598 (2014).
13. Bergmann O & Jovinge S. Isolation of cardiomyocyte nuclei from post-mortem tissue. J. Vis. Exp 65, 4205 (2012).
14. Bergmann O et al. Evidence for cardiomyocyte renewal in humans. Science 324, 98–102 (2009).
15. Cui M & Olson EN. Protocol for single-nucleus transcriptomics of diploid and tetraploid cardiomyocytes in murine hearts. STAR Protoc 1, 100049 (2020).
16. Lazar E, Sadek HA & Bergmann O. Cardiomyocyte renewal in the human heart: insights from the fall-out. Eur. Heart J 38, 2333–2342 (2017).
17. Mollova M et al. Cardiomyocyte proliferation contributes to heart growth in young humans. Proc. Natl Acad. Sci. USA 110, 1446–1451 (2013).
18. Herget GW, Neuburger M, Plagwitz R & Adler CP. DNA content, ploidy level and number of nuclei in the human heart after myocardial infarction. Cardiovasc. Res 36, 45–51 (1997).
19. Derks W & Bergmann O. Polyploidy in cardiomyocytes: roadblock to heart regeneration. Circ. Res 126, 552–565 (2020).
20. Dean FB et al. Comprehensive human genome amplification using multiple displacement amplification. Proc. Natl Acad. Sci. USA 99, 5261–5266 (2002).
21. Bohrson CL et al. Linked-read analysis identifies mutations in single-cell DNA-sequencing data. Nat. Genet 51, 749–754 (2019).
22. Petljak M et al. Characterizing mutational signatures in human cancer cell lines reveals episodic APOBEC mutagenesis. Cell 176, 1282–1294 (2019).
23. Miller MB et al. Somatic genomic changes in single Alzheimer’s disease neurons. Nature 604, 714–722 (2022).
24. Brazhnik K et al. Single-cell analysis reveals different age-related somatic mutation profiles between stem and differentiated cells in human liver. Sci. Adv 6, eaax2659 (2020).
25. Zhang L et al. Single-cell whole-genome sequencing reveals the functional landscape of somatic mutations in B lymphocytes across the human lifespan. Proc. Natl Acad. Sci. USA 116, 9014–9019 (2019).
26. Alexandrov LB et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020).
27. Alexandrov LB, Nik-Zainal S, Wedge DC, Campbell PJ & Stratton MR. Deciphering signatures of mutational processes operative in human cancer. Cell Rep. 3, 246–259 (2013).
28. Tate JG et al. COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res. 47, D941–D947 (2019).
29. Waters TR & Swann PF. Thymine-DNA glycosylase and G to A transition mutations at CpG sites. Mutat. Res 462, 137–147 (2000).
30. Viel A et al. A specific mutational signature associated with DNA 8-oxoguanine persistence in MUTYH-defective colorectal cancer. eBioMedicine 20, 39–49 (2017).
31. Cadet J & Davies KJA. Oxidative DNA damage & repair: an introduction. Free Radic. Biol. Med 107, 2–12 (2017).
32. Taverne YJ, Bogers AJ, Duncker DJ & Merkus D. Reactive oxygen species and the cardiovascular system. Oxid. Med. Cell. Longev 2013, 862423 (2013).
33. Aguiar PH et al. Oxidative stress and DNA lesions: the role of 8-oxoguanine lesions in Trypanosoma cruzi cell viability. PLoS Negl. Trop. Dis 7, e2279 (2013).
34. Jager M et al. Deficiency of nucleotide excision repair is associated with mutational signature observed in cancer. Genome Res. 29, 1067–1077 (2019).
35. Thibodeau ML et al. Base excision repair deficiency signatures implicate germline and somatic MUTYH aberrations in pancreatic ductal adenocarcinoma and breast cancer oncogenesis. Mol. Case Stud 5, a003681 (2019).
36. Kunkel TA & Erie DA. Eukaryotic mismatch repair in relation to DNA replication. Annu. Rev. Genet 49, 291–313 (2015).
37. Nemeth E et al. Two main mutational processes operate in the absence of DNA mismatch repair. DNA Repair 89, 102827 (2020).
38. Limpose KL, Corbett AH & Doetsch PW. BERing the burden of damage: pathway crosstalk and posttranslational modification of base excision repair proteins regulate DNA damage management. DNA Repair 56, 51–64 (2017).
39. Shafirovich V & Geacintov NE. Removal of oxidatively generated DNA damage by overlapping repair pathways. Free Radic. Biol. Med 107, 53–61 (2017).
40. Kumar N, Moreno NC, Feltes BC, Menck CF & Houten BV. Cooperation and interplay between base and nucleotide excision repair pathways: from DNA lesions to proteins. Genet. Mol. Biol 43, e20190104 (2020).
41. GTEx Consortium. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
42. Backes FJ et al. Prospective evaluation of DNA mismatch repair protein expression in primary endometrial cancer. Gynecol. Oncol 114, 486–490 (2009).
43. Koenig AL et al. Single-cell transcriptomics reveals cell-type-specific diversification in human heart failure. Nat. Cardiovasc. Res 1, 263–280 (2022).
44. Kroese LJ & Scheffer PG. 8-hydroxy-2′-deoxyguanosine and cardiovascular disease: a systematic review. Curr. Atheroscler. Rep 16, 452 (2014).
45. Blokzijl F et al. Tissue-specific mutation accumulation in human adult stem cells during life. Nature 538, 260–264 (2016).
46. Moniz S et al. Protein kinase WNK2 inhibits cell proliferation by negatively modulating the activation of MEK1/ERK1/2. Oncogene 26, 6071–6081 (2007).
47. Raman M, Earnest S, Zhang K, Zhao Y & Cobb MH. TAO kinases mediate activation of p38 in response to DNA damage. EMBO J. 26, 2005–2014 (2007).
48. Oppikofer M et al. Non-canonical reader modules of BAZ1A promote recovery from DNA damage. Nat. Commun 8, 862 (2017).
49. Bai SW et al. Identification and characterization of a set of conserved and new regulators of cytoskeletal organization, cell morphology and migration. BMC Biol. 9, 54 (2011).
50. Mosaddeghzadeh N & Ahmadian MR. The RHO family GTPases: mechanisms of regulation and signaling. Cells 10, 1831 (2021).
51. Matsuyama D & Kawahara K. Oxidative stress-induced formation of a positive-feedback loop for the sustained activation of p38 MAPK leading to the loss of cell division in cardiomyocytes soon after birth. Basic Res. Cardiol 106, 815–828 (2011).
52. Bergmann O et al. Dynamics of cell generation and turnover in the human heart. Cell 161, 1566–1575 (2015).
53. Xing D, Tan L, Chang CH, Li H & Xie XS. Accurate SNV detection in single cells by transposon-based whole-genome amplification of complementary strands. Proc. Natl Acad. Sci. USA 118, e2013106118 (2021).
54. Baslan T et al. Genome-wide copy number analysis of single cells. Nat. Protoc 7, 1024–1041 (2012).
55. Van der Auwera GA et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics 43, 11.10.1–11.10.33 (2013).
56. Delaneau O, Marchini J & Zagury JF. A linear complexity phasing method for thousands of genomes. Nat. Methods 9, 179–181 (2011).
57. Blokzijl F, Janssen R, van Boxtel R & Cuppen E. MutationalPatterns: comprehensive genome-wide analysis of mutational processes. Genome Med. 10, 33 (2018).
58. Love MI, Huber W & Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
59. Kanehisa M, Furumichi M, Sato Y, Ishiguro-Watanabe M & Tanabe M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021).
60. Karczewski KJ et al. The ExAC browser: displaying reference data information from over 60 000 exomes. Nucleic Acids Res. 45, D840–D845 (2017).

