Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

medRxiv logoLink to medRxiv
[Preprint]. 2026 Jun 7:2026.05.20.26353725. [Version 3] doi: 10.64898/2026.05.20.26353725

Wavelet Decomposition-Based Genomic Analysis of the Human Electrocardiogram

Salma Zainana, Larissa Palma Lauer, Tuomo Kiiskinen, Robert j Tibshirani, Trevor Hastie, Euan Ashley, Jack William O'Sullivan, Manuel A Rivas
PMCID: PMC13228739  PMID: 42238462

Abstract

The electrocardiogram (ECG) encodes the electrical activity of the heart across multiple timescales, yet standard clinical analysis collapses this rich signal into a handful of scalar measurements that discard most of the waveform's structure. Whether the frequency signals lost in this reduction carry heritable biological information relevant to cardiovascular disease risk remains unclear. Here we decompose resting 12-lead ECGs from 47,052 White British UK Biobank participants into 84 frequency-specific energy features using Daubechies-6 wavelet analysis across 12 leads and 7 decomposition levels, and perform independent genome-wide association analyses on each feature. We identify 67 independent loci and refine these to 101 high-confidence causal variants (posterior inclusion probability > 0.80) through Bayesian fine-mapping; associated loci converge on genes governing cardiac conduction and myocardial integrity, including SCN5A, TTN, KCNQ1, and DSP, alongside less-characterized cardiomyopathy candidates. SNP-based heritability estimates range from 0.03 to 0.26, with the strongest signals in mid-frequency bands (D6-D4, ~4-32 Hz) of Lead I and aVR, and strong inter-lead genetic correlations indicate a coordinated genetic architecture underlying the waveform. Integrating these features with FinnGen R12 cardiovascular phenotypes reveals genetic correlations reaching 0.56 with heart failure, driven predominantly by energy in the highest-frequency band (D1, 125-250 Hz), a spectral range routinely filtered from clinical ECGs and previously regarded as acquisition noise. These results reframe the electrocardiogram as a multi-frequency genetic phenotype, expand the set of cardiac loci discoverable from ECG data, and implicate high-frequency cardiac electrical activity as an underexplored dimension of cardiovascular disease risk.

Full Text

The Full Text of this preprint is available as a PDF (24.6 MB). The Web version will be available soon.


Articles from medRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES