Abstract
Objective
This paper proposes a novel method to localize origins of premature ventricular contractions (PVCs) from 12-lead electrocardiography (ECG) using convolutional neural network (CNN) and a realistic computer heart model.
Methods
The proposed method consists of two CNNs (Segment CNN and Epi-Endo CNN) to classify among ventricular sources from 25 segments and from epicardium (Epi) or endocardium (Endo). The inputs are the full time courses and the first half of QRS complexes of 12-lead ECG, respectively. After registering the ventricle computer model with an individual patient’s heart, the training datasets were generated by multiplying ventricular current dipoles derived from single pacing at various locations with patient-specific lead field. The origins of PVC are localized by calculating the weighted center of gravity of classification returned by the CNNs. A number of computer simulations were conducted to evaluate the proposed method under a variety of noise levels and heart registration errors. Furthermore, the proposed method was evaluated on 90 PVC beats from 9 human patients with PVCs and compared against ablation outcome in the same patients.
Results
The computer simulation evaluation returned relatively high accuracies for Segment CNN (~78%) and Epi-Endo CNN (~90%). Clinical testing in 9 PVC patients resulted an averaged localization error of 11 mm.
Conclusion
Our simulation and clinical evaluation results demonstrate the capability and merits of the proposed CNN-based method for localization of PVC.
Significance
This work suggests a new approach for cardiac source localization of origin of arrhythmias using only the 12-lead ECG by means of CNN, and may have important applications for future real-time monitoring and localizing origins of cardiac arrhythmias guiding ablation treatment.
Index Terms: 12-lead ECG, Convolutional Neural Network, Source Localization, Endocardium, Epicardium, Premature Ventricular Contraction, Cardiac Arrhythmia, Whole Heart Segmentation
I. Introduction
Since its invention, the 12-lead ECG has been widely used as a diagnostic test for cardiac arrhythmias and it can be recorded rapidly with portable equipment for a long time by using a Holter monitor. [1] A normal heart beat would produce four entities on the ECG: a P wave, a QRS complex, a T wave and a U wave. They represent different electrophysiological stages of the heart: atrial depolarization, ventricular depolarization, ventricular repolarization and papillary muscle repolarization. Among them, the QRS complex is of the most interest to cardiac electrophysiologists since a majority of ventricular arrhythmias are reflected by the changes of morphology on the QRS complex.
A premature ventricular contraction (PVC) is a type of ectopic beat, in which the heart beat is initiated by an ectopic pacemaker in the ventricles. It is one of the most common ventricular arrhythmias and its prevalence is associated with many factors. The prevalence of PVC is >6% among 15,792 adults (aged 45–65 years) from the US based on a 2-minute ECG in a large cross-sectional analysis. [2] If not treated in time, the condition of PVC may degenerate into other ventricular arrhythmias such as ventricular tachycardia (VT) and ventricular fibrillation (VF) and finally lead to sudden cardiac death. Radiofrequency catheter ablation [3] is a minimally invasive procedure that by delivering energy to the sections of the heart who are prone to producing arrhythmias, the arrhythmias are terminated and the patient is treated. To localize the site of ablation, pace-mapping is a predominant technique being used in clinical settings. [4] If the morphology of QRS complex by pacing at a site matches well with the VT or PVC observed on the 12-lead ECG, this site is considered to be a potential ablation site. Pace-mapping is done by stimulating at different endocardial sites, so it is invasive. Non-invasively, researchers have been investigating features of the QRS complex obtained from the 12-lead ECG to help identify ablation targets for PVC and VT. [5]–[7] Some typical characteristics of the QRS complex used for localization are QRS width, QRS axis, QRS patterns (qR, QS, RsR’ and so on), R wave amplitude and concordance.
Some characteristics of the QRS complex are easy to calculate automatically, however, for other features that involve pattern recognition, it is subjective to some degree and requires expertise obtained from a long-time training.
Neural network is well known for recognizing patterns and classification. [8]–[10] The accuracy can be higher than 90% given ample training samples. [11] The input to the neural network can be the original time course of the 12-lead ECG, [12] features extracted from time domain, [13] statistical features, [14], [15] features extracted from frequency domain, [16] components resulting from different transforms, [17]–[19] and outputs from some clustering algorithms. [20] The application of neural network can be the classification among ventricular arrhythmias, [12], [15], [16] classification between normal heart beat and arrhythmias [17], [21], [22] or among all types of arrhythmias. [20] Another important application is the discrimination between healthy subjects and patients, such as patients with myocardial infarction [23]–[25] and patients with coronary artery disease. [18] These applications of neural network are mostly concerned about the state of the heart, whether it is in arrhythmias or have cardiac diseases.
We propose to classify and localize origins of cardiac arrhythmias throughout the ventricles by applying neural networks to 12-lead ECG. The ventricles are divided into 25 segments based on standard myocardial segmentation of the left ventricle [26] and expanded it to the right ventricle. A realistic anisotropic ventricle computer model is used to generate QRS complexes of 12-lead ECG from pacing at all possible ventricular locations. The time course of QRS complexes generated are then fed to convolutional neural network (CNN) for classification and localization purpose.
Epicardial mapping and ablation have expanded considerably in the past few years. It was reported that 13% to 17% VT ablation procedures were epicardial mapping or ablation. [27], [28] On ECG, epicardial VT is reflected as a slow onset of the QRS because the initial part of the wave front progresses slowly until it reaches the Purkinje system at the subendocardium. [29] The intracardiac delay of electrical conduction produces a slurred initial part of the QRS complex (pseudo Δ wave) [30] So the initial part (first half) of the QRS is critical for the detection of an epicardial or endocardial focus. In this study, we use the first half of QRS complexes generated by the ventricle computer model as input to the CNN to classify between an epicardial ectopic beat and an endocardial ectopic beat.
Thus, our proposed method consists of two neural networks: Segment CNN with 25 classifications and Epi-Endo CNN with 2 classifications. The localization of origins of PVC is a function of probability distribution outputs of the two CNNs and the center of gravity of each segments in the ventricular model. Our proposed method was applied to the real 12-lead ECG collected from 9 PVC patients who underwent ablation treatment.
There are other studies to localize the origin of PVC from the 12-lead ECG. Van Dam et al. used myocardial activation imaging technique based on an equivalent double layer model to localize the PVC origin, [31]–[33] although positions of 12-lead ECG were unknown, thus the accuracy of results was reduced. Also, a quantitative measurement of localization error was lacking. In our study, electrodes’ positions were digitized and the average localization error was presented by calculating the spatial distance between the CNN predicted sites of origin of PVC and successful ablation sites recorded from EP study in the patient.
Recently, a patient specific model based intracardiac electrograms simulation was done to resemble clinical body surface signals. [34] Our study would be considered a step forward as the model-based noninvasive identification of ablation targets.
For a more accurate estimation of cardiac activation sequence to localize origins of PVC or VT, a noninvasive 3-dimensional cardiac electrical imaging technique was developed based on the body surface potential mapping. [35], [36] It was evaluated under a variety of animal studies [37]–[41], in pathological hearts [42], and applied to detect high-frequency drivers of atrial fibrillation. [43]
II. Method
The rationale behind this study is that we assume PVCs are generated by focal sources, and if we train CNN with all the possible 12-lead ECGs resulting from a single-site pacing covering the ventricular volume with a certain level of noise, the CNN will be able to identify which segment the origin of PVC lies in and whether it is an epicardial or endocardial source given a set of 12-lead ECGs. And depending on the probability distribution of CNN output, we could also give an estimation of source location based on the classification information.
A. Realistic Anisotropic Ventricle Computer Model
The ventricular excitation was simulated by a cellular automaton heart model. As described in the paper by He et al., [44] heart geometry was constructed from CT images of a human subject and discretized into around 38,000 cellular units with a side length of 1.5 mm. Then a generalized ventricle conduction anisotropy was incorporated into the model. The myocardial fiber orientations rotated counterclockwise over 120° from the outermost layer (epicardium, −60°) to the innermost layer (endocardium, +60°) with equivalent increment between the consecutive layers. The conduction velocity along the fiber was 0.6 m/s, and 0.2 m/s transverse to the fiber, respectively. Also the longitude intracellular conductivity was set as 0.3 S/m and the transverse intracellular conductivity was 0.075 S/m, respectively. The action potential and the vector of local fiber orientation were set individually over all the ventricular cellular units. The equivalent current-dipole density of each unit was computed as the product of the myocardial conductivity tensor and the spatial gradient of instantaneous transmembrane potential. Each dipole has three orthogonal components. Finally, cellular units were further grouped into 3,887 dipoles according to their segment number. The time resolution of electric potentials is 1 millisecond. This ventricle current-dipole model was used previously in the simulation study of a 3-dimensional cardiac imaging technique. [35], [36]
B. Segmentation of the Whole Ventricle and Extraction of Epicardium and Endocardium
In the ventricle current-dipole model which consists of 3,887 cardiac dipoles, the left ventricle was segmented according to AHA standardized myocardial segmentation into 17 segments, [26] and the right ventricle was segmented in a similar way as the left ventricle into 8 segments. Thus, the whole ventricle was classified into 25 segments in total. The position and the number of each segment are shown in the middle column of Fig. 1. The number of cardiac dipoles each segment includes is in the parentheses on the bottom. The left column in Fig. 1 is a visualization of 25 segments. The dipoles lying on epicardium and endocardium were identified and labeled as 1 and 2 respectively, and all other dipoles were assigned 3 as transmural dipoles. Both the left side and the right side of the septum are considered as endocardium.
C. 12-lead ECG for Simulation
The torso and lungs were segmented from a high-resolution male torso MRI images (ViP V2.0 IT’IS Foundation, Zürich, Switzerland). Then the boundary element model of heart-torso was built using a commercial software (Curry 6.0, Neuraoscan, North Carolina, USA). 208 body surface electrodes were placed on the front and back of body surface. By multiplying the current-dipoles with the leadfield matrix between each dipole and each electrode, we could get body surface potentials. Among these signals, nine were chosen to generate 12-lead ECG. Three electrodes were selected to represent the electrodes placed on left leg, left arm, and right arm, respectively. So lead I, II III and lead aVL, aVR and aVF could be derived from the signals of these electrodes. Other six electrodes were selected as V1 to V6 according to their anatomical locations.
D. Training and Testing of Convolutional Neural Network
Convolutional neural network (CNN) is a type of deep, feed-forward artificial neural network. We used a deep learning toolbox in MATLAB developed by Rasmus Berg Palm [45] to setup, train and test CNN. It uses a sigmoid function as the activation function for feed forward propagation and a gradient descent projection method as the back propagation algorithm. Convolution and pooling were done in 2-dimension (2-D). The stride, namely the number of unit that convolutional kernel shifts each time, is equal to 1. There is no padding in convolutional layers. The 2-D convolutional kernel has an equal size along both directions. To ensure the applicability of the convolutional kernel, the input should also be in square shape. Thus, we added 4 more leads derived from three limb leads (LL for left leg, RA for right arm, LA for left arm): LL−RA−LA; LA−RA−LL; RA−LA−LL; (LL+RA+LA)/3. The last one corresponds to the Wilson central terminal. [46] Along the time dimension, we down-sampled the time courses of QRS (or first half of QRS) to 16 time points before adding noise in simulation. Therefore, each input set of ECGs is a 16 ×16 square matrix. This size results in the fast response of both CNNs.
In order to localize the origins of PVCs, we utilized the classification information from two types of CNN: Segment CNN and Epi-Endo CNN. The study diagram of this section is shown in Fig. 2.
Segment CNN
The cardiac dipole components resulted from pacing were multiplied by the lead field matrix to generate body surface electrical potentials. Thus, we would have 3,887 sets of 12-lead ECGs corresponding to pacing at 3,887 locations in the ventricular model. QRS complexes were extracted from each lead. A certain level of Gaussian white noise (20 dB, 10 dB or 5 dB) was added to all the leads 10 times to generate noise contaminated 12-lead QRS complexes. Adding a certain level of noise 10 times is for increasing the robustness of Segment CNN to this level of noise since we would have ample training samples representing different noise variations. In total, our data pool had 38,870 sets of noise-contaminated 12-lead QRS complexes. 10% of the whole data was used for testing, which means 3,887 sets were the testing data. In order to test the CNN without segment bias, within each segment, 155 (≈ 3,887/25) sets of 12-lead QRS complexes were randomly selected each time for testing. The remaining 90% data were used for training procedure. The Segment CNN consisted of 6 layers: Input layer (A set of 12-lead QRS complexes), Hidden layer 1 (convolutional layer with a kernel size of 5), Pooling layer 1 (sampling scale of 1), Hidden layer 2 (convolutional layer with a kernel size of 3), Pooling layer 2 (sampling scale of 2), and Output layer (25 neurons for 25 segments). The batch size was 23, alpha was 1 and the number of epochs was 10. After a ten-fold cross-validation, the accuracy and precision rate of each segment of Segment CNN were calculated.
Epi-Endo CNN
Similar to Segment CNN, 1,709 dipole sources located on epicardium and endocardium were identified and were used to generate single-site pacing 12-lead ECGs. QRS complexes were extracted and the first half of 12-lead QRS complexes was used. Gaussian white noise of a specific SNR was added 10 times. Thus for Epi-Endo CNN, we had 17,090 sets of first half of QRS complexes from 12 leads. 10% of this data was used for testing, which means 855 (≈1,709/2) sets were randomly selected from each category. The remaining 90% data were used for training. The Epi-Endo CNN consisted of 6 layers: Input layer (first half of QRS complexes from 12-lead ECGs), Hidden layer 1 (convolutional layer with a kernel size of 5), Pooling layer 1 (sampling scale of 2), Hidden layer 2 (convolutional layer with a kernel size of 3), Pooling layer 2 (sampling scale of 2), and Output layer (2 neurons corresponding to epicardium and enodcardium). Parameters were set as following: batch size = 25, alpha = 3 and number of epochs = 10. After a ten-fold cross-validation, the accuracy and precision rate of Epi or Endo were calculated.
E. Localization of Origins of PVCs
After training procedures, we tested the localization performance of these two CNNs by feeding a new set of data, which included 12-lead ECGs with a certain SNR resulting from pacing at all possible dipole locations. For each set of 12-lead ECGs, Segment CNN would assign a probability to each segment and Epi-Endo CNN would tell how possible it was from epicardium or endocardium. Based on these two probability distributions, the estimation of source location was calculated as following:
(1) |
In (1), S is source location, Pi is the normalized probability of ith segment and its adjacent segments, the output of Segment CNN. N is the number of adjacent segments each segment has based on the ventricle segmentation in Fig. 1. Pj is the probability of Epi or Endo, the output of Epi-Endo CNN. And j is 1 for Epi; 2 for Endo. CoGij is the center of gravity of Epi or Endo dipole sources in ith segment.
The segment with the maximum probability from Segment CNN was considered as the output segment. Its adjacent segments (sharing boundaries with the output segment) were determined based on the ventricle segmentation in Fig. 1. The endocardial center of gravity and epicardial center of gravity in each segment were calculated. By multiplying spatial locations of epicardial and endocardial centers of gravity with the probabilities of epicardium and endocardium from Epi-Endo CNN, we obtained epi-endo-informed centers of gravity of the output segment and its adjacent segments. This process corresponds to the multiplication inside the parentheses in (1). Since some dipole sources locate near the boundary of two segments, it was probable that Segment CNN would assign it to its neighboring segment. This is why we took the adjacent segments of an output segment into consideration to decrease the localization error (LE). And it was done by first normalizing the output probabilities of Segment CNN only among the output segment and its adjacent segments, and then multiplying the normalized probability with each epi-endo-informed center of gravity. Adding all up, we estimated the source location.
F. Application to PVC Patients
We collected data from 9 PVC patients (Male = 3, Female = 6, average age = 48.3 ± 12.3 years old). The patient statistics were summarized in Table 1. None of them have undergone ablation procedure before and none of them have structural heart disease. Study protocols were approved by the Institutional Review Board of University of Minnesota and Shanghai Ruijin Hospital (affiliated with Shanghai Jiao Tong University School of Medicine, Shanghai, China). Written consent forms were obtained from all patients. For each patient, we had CT images of heart, lungs and torso. Then a boundary element model was built including lungs and torso by using a commercial software (Curry 6.0, Neuroscan, North Carolina, USA). After the segmentation of patient’s heart, we registered our ventricular model with it. Prior to the EP study, 208 Ag-AgCl carbon electrodes were placed on the front and back of body surface to record body surface electrical potentials with 2kHz sampling rate for about 10–30 minutes and positions of electrodes were digitized (Fastrak, Polhemus Inc., Colchester, VT, USA). During the recording, all of the patients have spontaneous PVCs recorded and 2 of them also have non-sustained VT detected. Nine electrodes were selected to form 12-lead ECGs later based on their anatomical positions. We registered the generic model with the patient’s heart based on the least sum of distances between the two. By using Curry 6.0, the leadfield between these nine electrodes and our generic ventricular model was generated. After the EP study, we also collected CARTO files, which contained spatial locations of successful ablation sites. And CARTO data were registered with the patient’s heart based on the landmarks of the geometry of ventricles recorded by the CARTO system. The successful ablation sites are considered as the origins of PVCs in this study.
Table 1.
Patient | Age | Gender | Diagnosis | Heart Rate (bpm) |
Blood Pressure (mmHg) |
Antiarrhythmic Medication |
Medical History | PVCs/24 h | Ablation Outcome |
---|---|---|---|---|---|---|---|---|---|
1 | 41 | F | PVC | 74 | 116/77 | None | Diabetes | 15160 | RVOT free wall |
2 | 53 | M | PVC, RBBB | 70 | 105/63 | Bisoprolol | Allergic to penicillin | 25630 | Left Anterior Fascicle |
3 | 71 | F | PVC, Nonsustained AT | 70 | 117/61 | Aspirin, Lipitor | Atherosclerosis of coronary artery and carotid artery, cervical disc herniation | 21100 | RVOT free wall |
4 | 54 | F | PVC | 80 | 118/67 | Moracizine | Allergic to cephalosporin | 13441 | RVOT free wall |
5 | 37 | M | PVC, Nonsustained VT | 95 | 137/86 | None | Sublingual gland excision | 27161/23 h | RVOT free wall |
6 | 42 | F | PVC | 78 | 115/75 | None | None | 26337 | RVOT free wall |
7 | 43 | M | PVC | 82 | 150/80 | None | None | 39476 | RVOT septum |
8 | 33 | F | PVC | 84 | 100/55 | None | Cardiac arrhythmias, asthma | 51226 | RVOT free wall |
9 | 61 | F | PVC | 82 | 129/79 | Amiodarone, Moracizine, metoprolol | Paroxysmal palpitation | 8470 | RVOT septum |
AT = atrial tachycardia; F = female; M = male; PVC = premature ventricular contraction; RBBB = right bundle branch block; RVOT = right ventricular outflow tract; VT = ventricular tachycardia.
QRS complexes of 10 PVCs were exported from body surface potential recordings of each patient. Body surface recordings were first filtered by a bandpass filter from 1 Hz to 30 Hz. After filtering, the QRS and the first half of QRS were down-sampled to 16 time points respectively to construct input for Segment CNN and Epi-Endo CNN. To estimate the SNR, signals of the same length as QRS complexes of PVCs between heart beats were also exported and were considered as noise. Average SNR is defined as the average of SNR of each channel. Then we trained and tested two CNNs using simulated 12-lead ECGs with the average SNR between QRS complexes and Gaussian white noise. Training and testing details are described in Section D. The only difference is that the leadfield used to generate 12-lead ECGs here is specific to each patient. Finally, 12-lead ECGs of 10 PVCs were fed into two trained CNNs to determine source locations.
G. Statistical Analysis
The accuracy of Segment CNN and Epi-Endo CNN were presented as mean ± SD. The localization error was defined as the spatial distance between the estimated source location and the location of the dipole source in the generic ventricular model that generates the 12-lead ECGs by pacing in simulation, and was defined as the spatial distance between the estimated source location and successful ablation sites in patients’ CARTO data.
III. Results
Table 2 shows the accuracy and precision of individual segment when Segment CNN was trained and tested on three different Gaussian white noise levels. Accuracy is defined as the percentage of dipole sources being correctly classified within each segment. Precision is defined as when Segment CNN predicts it is in one segment, how often it is correct. Fig. 3 shows the precision matrices of Segment CNN. Each column is the precision vector for this segment. When the probability of Segment CNN assigning this segment to other segments is higher than 5%, it is labeled in black in the figure. For example, when trained and tested on 20 dB, for segment 4, Segment CNN would have a relative higher probability of assigning segment 19 and segment 10 to segment 4 even though the precision for segment 4 is 74.1% from Table 2. Table 3 is the precision matrices of Epi-Endo CNN trained and tested on different noise levels.
Table 2.
SNR=20 dB | SNR=10 dB | SNR=5 dB | ||||
---|---|---|---|---|---|---|
| ||||||
Unit (%) | Accuracy | Precision | Accuracy | Precision | Accuracy | Precision |
1 | 75.6 | 72.7 | 77.0 | 65.1 | 63.6 | 53.9 |
2 | 80.5 | 85.2 | 61.9 | 78.4 | 54.5 | 70.7 |
3 | 78.4 | 73.9 | 72.0 | 61.1 | 65.7 | 48.2 |
4 | 79.6 | 74.1 | 71.6 | 64.0 | 56.3 | 51.2 |
5 | 85.7 | 62.3 | 80.0 | 61.6 | 60.3 | 46.8 |
6 | 83.7 | 72.4 | 80.6 | 68.8 | 69.0 | 52.3 |
7 | 84.4 | 79.6 | 77.4 | 74.4 | 59.0 | 58.3 |
8 | 81.5 | 85.5 | 71.2 | 82.3 | 60.7 | 74.0 |
9 | 89.0 | 74.6 | 83.2 | 66.5 | 71.2 | 49.3 |
10 | 82.5 | 73.5 | 68.4 | 63.1 | 46.8 | 56.7 |
11 | 72.5 | 79.7 | 67.3 | 75.3 | 49.9 | 63.1 |
12 | 62.5 | 77.5 | 52.0 | 73.9 | 41.7 | 62.6 |
13 | 81.6 | 75.4 | 77.4 | 71.6 | 58.6 | 69.8 |
14 | 66.5 | 86.1 | 65.8 | 75.5 | 53.3 | 61.8 |
15 | 79.6 | 72.7 | 75.9 | 70.2 | 60.5 | 60.3 |
16 | 77.5 | 83.2 | 83.5 | 66.6 | 77.3 | 60.8 |
17 | 92.6 | 78.3 | 80.4 | 83.7 | 80.2 | 54.2 |
18 | 58.4 | 84.0 | 52.5 | 75.7 | 40.4 | 62.5 |
19 | 54.8 | 91.0 | 48.5 | 85.7 | 25.5 | 71.4 |
20 | 87.1 | 70.8 | 84.4 | 61.7 | 81.7 | 52.3 |
21 | 81.7 | 70.4 | 74.8 | 62.6 | 71.6 | 47.1 |
22 | 79.5 | 75.1 | 72.7 | 69.2 | 63.0 | 58.8 |
23 | 75.4 | 93.8 | 62.6 | 80.9 | 42.8 | 77.3 |
24 | 62.6 | 91.1 | 56.6 | 88.7 | 42.9 | 82.5 |
25 | 89.3 | 89.3 | 80.2 | 89.7 | 57.7 | 79.0 |
Table 3.
SNR=20 dB | SNR=10 dB | SNR=5 dB | ||||
---|---|---|---|---|---|---|
| ||||||
Unit (%) | Epi | Endo | Epi | Endo | Epi | Endo |
Epi | 85.56 | 9.94 | 72.79 | 18.64 | 61.36 | 21.79 |
Endo | 14.44 | 90.06 | 27.21 | 81.36 | 38.64 | 78.21 |
When we tested the localization performance of two CNNs, we tested them by feeding them 3,887 new sets of 12-lead ECGs by pacing at a single site with a variety of noise levels (25 dB, 20 dB, 10 dB, 5 dB, 0 dB, −5 dB and −10 dB). Fig. 4 presents the LEs along with average accuracies of Segment CNN and Epi-Endo CNN. The left part of the figure is the average LE when Segment CNN and Epi-Endo CNN were tested on 12-lead ECGs from all possible pacing sites in the ventricular model with different noise levels. And during the ten-fold cross validation, namely trained and tested on 20 dB 12-lead ECGs, the average accuracy of Epi-Endo CNN was 87.68 % and the average accuracy of Segment CNN was 77.71 %. The middle part and the right part are presented in a similar way as the left part. The exact numbers displayed on Fig. 4 are also included in Table 4.
Table 4.
CNN SNR | 20 dB | 10 dB | 5 dB | |
---|---|---|---|---|
Segment Accuracy (%) | 77.71±0.01 | 71.11±0.53 | 58.18±0.27 | |
EpiEndo Accuracy (%) | 87.68±0.61 | 76.40±0.07 | 66.20±0.01 | |
| ||||
Average LE(mm) of Different Data SNR | 25 dB | 10.37±9.74 | 10.56±9.73 | 11.55±9.74 |
20 dB | 10.43±9.77 | 10.58±9.74 | 11.55±9.75 | |
10 dB | 10.76±9.65 | 10.61±9.64 | 11.56±9.82 | |
5 dB | 11.38±9.51 | 10.94±9.68 | 11.73±9.80 | |
0 dB | 13.09±9.97 | 11.73±9.65 | 12.54±9.80 | |
−5 dB | 16.63±11.52 | 14.32±10.57 | 14.66±10.36 | |
−10 dB | 22.47±13.28 | 19.36±11.94 | 19.96±11.85 |
IV. Discussion
This study shows the feasibility of utilizing convolutional neural network to localize origins of PVCs from 12-lead ECGs. We tested the method in both simulation data and clinical patients’ data, and obtained good localization errors in both situations.
A. Properties of Segment CNN and Epi-Endo CNN
The information in Fig. 3, Table 2 and Table 3 could be obtained when we train and test two CNNs on any patient-specific leadfield and under any noise level. It will guide us and give us a prior knowledge of how good two CNNs perform before we feed testing (or clinical) data to calculate the source location. From Fig. 3, we could see as SNR decreases, it is more likely for Segment CNN to assign dipole sources from other segments to one segment, which means the precision rates have dropped. The number of blocks labeled in the precision matrix increases as SNR decreases. Table 3 is the precision matrices of Epi-Endo CNN under different noise level. The common feature of these three precision matrices is that it is more likely for Epi-Endo CNN to assign sources on endocardial surfaces as epicardial sources. From Table 2, we could know the percentage of dipole sources in each segment that are correctly detected, namely the accuracy of each segment. And also how confident we are in the detection results if one dipole source is predicted to be from some segment.
B. Localization Errors of CNN
From Fig. 4 and Table 4, we could summarize the following trends of the performance of two CNNs: (1) as SNR drops, the accuracies of both CNNs decrease; (2) as SNR of input data drops, the average LE increases; (3) the average LE is the result of two factors, one is the SNR that two CNNs are trained and tested, the other is the similarity between input data and tested data. The accuracy of Segment CNN of 10 dB is only 6 % less than that of 20 dB. So when the SNR of input data is 10 dB and less than 10 dB, the average LEs of CNNs trained with 10 dB are all smaller than those of CNNs trained with 20 dB. This is when the similarity factor plays a dominant role. However, when the accuracy of Segment CNN drops to 58 %, which is 13 % less than that of 10 dB, the accuracy plays an important role in determining source locations. Because even though input data of 5 dB is more similar to CNNs trained with 5 dB, the average LE is larger than that of CNNs trained with 10 dB and input data of 5 dB. Actually, the average LEs of CNNs trained with 5 dB are all larger than those of CNNs trained with 10 dB. Finally, when we compare the performance between CNNs trained with 20 dB and trained with 5 dB, the average LEs of input data with 0 dB and less of CNNs trained with 5 dB are smaller than those of CNNs trained with 20 dB. This is because even with a much higher training accuracy, CNNs trained with 20 dB fail to classify 12-lead ECGs that are too unlike the training data.
A conclusion from the above simulation results is when the SNR of input data is very low, it is best to train and test CNNs with around 10 dB to minimize LEs, and when the SNR of input data is good, it is best to train and test CNNs with a noise level that is the same as the SNR of input data.
C. Localization Errors of CNN with Heart Registration Errors
Since we register our ventricle current-dipole model with patient’s heart and train and test CNNs using this model and finally use CNNs to localize origins of PVCs, it is necessary to investigate the influences of heart registration errors on the localization performance of CNNs. So Fig. 5 shows the average LE when we trained and tested CNNs with correct registration, and applied to localization when the input 12-lead ECGs were generated by the leadfield with heart registration errors. The SNR of training and testing data is the same as the SNR of input data for testing localization performance.
Three categories of heart registration errors were investigated: shift, rotation and scaling. Overall, all the average LEs are larger than those of correct registration. Within shift group, because heart is already very close to the front body surface, so we could only move heart forward by 5 mm. For comparison purpose, we also moved heart backwards by 5 mm. Heart moving to the right always generated a larger LE than moving to the left. Within rotation group, heart rotating to the right always generated a larger LE than rotating to the left. Within scaling group, there is no obvious trend. Among all the registration errors, we found that shift group gave higher LEs than the other two groups on average.
Based on the above analysis of the registration error results, we should pay attention when we need to shift the ventricular model for the registration with patient’s heart. And CNNs are generally robust to a variety of heart registration errors if we compare these LEs with those that are from correct registrations: 10.43 ± 9.77 mm (20 dB), 10.61 ± 9.64 mm (10 dB) and 11.73 ± 9.80 mm (5 dB). Generally, the above heart registration errors would increase average LE by 1 to 2 mm.
D. Application to PVC Patients
From the last column in Table 1, which is the ablation record from CARTO files, the origins of PVC lied in different regions in the ventricles, 1 patient had PVC originated from the left anterior fascicle, 6 patients were ablated at free-wall side of right ventricular outflow tract, and PVCs in the left 2 patients were detected in the septal side of right ventricular outflow tract. On average, the localization error of 9 patients is 10.9 ± 5.5 mm, which is good and reasonable considering the source location was estimated solely from 12-lead ECGs. The average median registration error is 2.4 ± 0.2 mm. Registration error is one factor that contributes to the final localization error. The present results demonstrated the feasibility of employing a generic anisotropic ventricle current-dipole model to mimic cardiac activities in a patient without structural heart disease. Generally, the more similar a patient’s heart is with the ventricular model, the more accurate the predictions will be. And this also shows the capability of our method to estimate origins of PVCs from different areas.
E. Merits and Limitations
Merits
This method can be applied to the localization of PVC and focal ventricular tachycardia (VT) since the training data is generated from single-site pacing. It is a stable method in the sense of two-fold: once CNNs are trained on a patient, they are applicable to all the other PVCs and focal VTs from the same patient; structures and parameters of both CNNs all remain the same across different subjects and noise levels. It is a robust method because average LEs under different noise levels and heart registration errors fluctuate in a range of 1 to 2 mm. Since a generic ventricle current-dipole model is used, this method does not require high-resolution contrast cardiac CT images. Anatomical CT images show the shape and orientation of ventricles are good enough for the registration purpose. Thus, we reduce patients’ burden by avoiding injecting contrast into the body. Training and testing of CNNs could be done off-line prior to the EP study and it generally takes about 20 minutes. The total analysis time including building the boundary element model, registration of the ventricular model, calculating the leadfield between electrodes and ventricular model, and training and testing CNNs would not be longer than 2 hours in a 64-bit operating system with 3.40 GHz Intel i7 processor. Once CNNs have been trained, it would take 0.2 s to calculate the source location given QRS complexes from 12-lead ECGs. Other than the previous methods which requires calculating statistical features from 12-lead ECGs, some expertise of recognizing patterns from 12-lead ECGs, and are usually limited in the types of ventricular arrhythmias, the method we present here is easy to implement, relieves the burden on electrophysiology specialists, and finally helps guide ablation procedure by providing potential source locations throughout the ventricles.
Limitations
Because it is a single-site pacing ventricle current-dipole model, this method is not applicable to multisite excitation activities and reentry VT. The sources of localization errors mainly come from two places: one is the registration between the ventricular model and patient’s heart, the other is the registration of 12-lead electrodes. While the latter can be reduced by digitizing more anatomical landmarks to help the registration of electrodes, the former is somehow unavoidable because we use a generic ventricular model. In the future, we could employ a personalized cardiac electrophysiology model as have been used in cardiac electrical imaging studies [36]-[43][47] to overcome the first source of localization errors.
V. Conclusion
We have proposed a novel method to localize the site of origin of premature ventricular contractions from 12-lead ECG using convolutional neural networks and a realistic anisotropic ventricle computer model. We have evaluated our method under various numerical experiments, and achieved good overall performance. By applying it to real data in a group of 9 PVC patients, we have shown the capability of our method to target the potential ablation site in premature ventricular contractions patients. This work suggests a new approach for cardiac source localization of origin of arrhythmias using only the 12-lead ECG by means of CNN, and may have important applications for future real-time monitoring and localizing origins of cardiac arrhythmias guiding ablation treatment.
Acknowledgments
This work was supported in part by NIH HL080093 and NSF CBET-0756331, and by the Chinese “111 Project” (B08020).
The authors are grateful to Dr. Zhaoye Zhou for assistance in data collection and help discussions.
Contributor Information
Ting Yang, Biomedical Engineering Department, University of Minnesota, Minneapolis, MN 55455, USA.
Long Yu, Biomedical Engineering Department, University of Minnesota, Minneapolis, MN 55455, USA.
Qi Jin, Department of Cardiology, Shanghai Ruijin Hospital, Shanghai, 200025 China.
Liqun Wu, Department of Cardiology, Shanghai Ruijin Hospital, Shanghai, 200025 China.
Bin He, Biomedical Engineering Department, University of Minnesota, Minneapolis, MN 55455, USA.
References
- 1.Corday E, et al. Detection of Phantom Arrhythmias and Evanescent Electrocardiographic Abnormalities: Use of Prolonged Direct Electrocardiocording. JAMA. 1965 Aug;193(6):417–421. doi: 10.1001/jama.1965.03090060007001. [DOI] [PubMed] [Google Scholar]
- 2.Simpson RJ, et al. Prevalence of premature ventricular contractions in a population of African American and white men and women: the Atherosclerosis Risk in Communities (ARIC) study. Am. Heart J. 2002;143(3):535–540. doi: 10.1067/mhj.2002.120298. [DOI] [PubMed] [Google Scholar]
- 3.Klein LS, et al. Radiofrequency catheter ablation of ventricular tachycardia in patients without structural heart disease. Circulation. 1992 May;85(5):1666–1674. doi: 10.1161/01.cir.85.5.1666. [DOI] [PubMed] [Google Scholar]
- 4.Josephson ME, et al. Ventricular activation during ventricular endocardial pacing. II. Role of pace-mapping to localize origin of ventricular tachycardia. Am. J. Cardiol. 1982 Jul;50(1):11–22. doi: 10.1016/0002-9149(82)90003-0. [DOI] [PubMed] [Google Scholar]
- 5.Jadonath RL, et al. Utility of the 12-lead electrocardiogram in localizing the origin of right ventricular outflow tract tachycardia. Am. Heart J. 1995;130(5):1107–1113. doi: 10.1016/0002-8703(95)90215-5. [DOI] [PubMed] [Google Scholar]
- 6.Kamakura S, et al. Localization of optimal ablation site of idiopathic ventricular tachycardia from right and left ventricular outflow tract by body surface ECG. Circulation. 1998;98(15):1525–1533. doi: 10.1161/01.cir.98.15.1525. [DOI] [PubMed] [Google Scholar]
- 7.Josephson ME, Callans DJ. Using the twelve-lead electrocardiogram to localize the site of origin of ventricular tachycardia. Heart Rhythm. 2005;2(4):443–446. doi: 10.1016/j.hrthm.2004.12.014. [DOI] [PubMed] [Google Scholar]
- 8.Setiono R, Liu H. Neural-network feature selector. IEEE Trans. Neural Netw. 1997 May;8(3):654–662. doi: 10.1109/72.572104. [DOI] [PubMed] [Google Scholar]
- 9.Rowley HA, et al. Neural network-based face detection. IEEE Trans. Pattern Anal. Mach. Intell. 1998 Jan;20(1):23–38. [Google Scholar]
- 10.Heermann PD, Khazenie N. Classification of multispectral remote sensing data using a back-propagation neural network. IEEE Trans. Geosci. Remote Sens. 1992 Jan;30(1):81–88. [Google Scholar]
- 11.Foody GM, Arora MK. An evaluation of some factors affecting the accuracy of classification by an artificial neural network. Int. J. Remote Sens. 1997 Mar;18(4):799–810. [Google Scholar]
- 12.Chow H-S, et al. Detection of ventricular ectopic beats using neural networks. Computers in Cardiology 1992, Proceedings of. 1992:659–662. [Google Scholar]
- 13.de Chazal P, Reilly RB. Automatic classification of ECG beats using waveform shape and heart beat interval features. Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP’03). 2003 IEEE International Conference on. 2003;2:II–269. [Google Scholar]
- 14.Osowski S, Linh TH. ECG beat recognition using fuzzy hybrid neural network. IEEE Trans. Biomed. Eng. 2001;48(11):1265–1271. doi: 10.1109/10.959322. [DOI] [PubMed] [Google Scholar]
- 15.Gothwal H, et al. Cardiac arrhythmias detection in an ECG beat signal using fast fourier transform and artificial neural network. J. Biomed. Sci. Eng. 2011;4(04):289. [Google Scholar]
- 16.Minami K, et al. Real-time discrimination of ventricular tachyarrhythmia with Fourier-transform neural network. IEEE Trans. Biomed. Eng. 1999;46(2):179–185. doi: 10.1109/10.740880. [DOI] [PubMed] [Google Scholar]
- 17.Kaya Y, Pehlivan H. Classification of Premature Ventricular Contraction in ECG. Int J Adv Comput Sci Appl. 2015;6(7):34–40. [Google Scholar]
- 18.Allahverdi N, et al. Diagnosis of Coronary Artery Disease Using Deep Belief Networks; 2. Int. Conf. Eng. Nat. Sci; p. 9. [Google Scholar]
- 19.Sharma LN, et al. Multiscale Energy and Eigenspace Approach to Detection and Localization of Myocardial Infarction. IEEE Trans. Biomed. Eng. 2015 Jul;62(7):1827–1837. doi: 10.1109/TBME.2015.2405134. [DOI] [PubMed] [Google Scholar]
- 20.Özbay Y, et al. A fuzzy clustering neural network architecture for classification of ECG arrhythmias. Comput. Biol. Med. 2006;36(4):376–388. doi: 10.1016/j.compbiomed.2005.01.006. [DOI] [PubMed] [Google Scholar]
- 21.Javadi M, et al. Improving ECG classification accuracy using an ensemble of neural network modules. PLoS One. 2011;6(10):e24386. doi: 10.1371/journal.pone.0024386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kiranyaz S, et al. Real-Time Patient-Specific ECG Classification by 1-D Convolutional Neural Networks. IEEE Trans. Biomed. Eng. 2016 Mar;63(3):664–675. doi: 10.1109/TBME.2015.2468589. [DOI] [PubMed] [Google Scholar]
- 23.Hedén B, et al. Acute myocardial infarction detected in the 12-lead ECG by artificial neural networks. Circulation. 1997;96(6):1798–1802. doi: 10.1161/01.cir.96.6.1798. [DOI] [PubMed] [Google Scholar]
- 24.Stamkopoulos T, et al. ECG analysis using nonlinear PCA neural networks for ischemia detection. IEEE Trans. Signal Process. 1998;46(11):3058–3067. [Google Scholar]
- 25.Haraldsson H, et al. Detecting acute myocardial infarction in the 12-lead ECG using Hermite expansions and neural networks. Artif. Intell. Med. 2004;32(2):127–136. doi: 10.1016/j.artmed.2004.01.003. [DOI] [PubMed] [Google Scholar]
- 26.Cerqueira MD, et al. Standardized myocardial segmentation and nomenclature for tomographic imaging of the heart. A statement for healthcare professionals from the Cardiac Imaging Committee of the Council on Clinical Cardiology of the American Heart Association. Circulation. 2002 Jan;105(4):539–542. doi: 10.1161/hc0402.102975. [DOI] [PubMed] [Google Scholar]
- 27.Sacher F, et al. Epicardial ventricular tachycardia ablation. J. Am. Coll. Cardiol. 2010;55(21):2366–2372. doi: 10.1016/j.jacc.2009.10.084. [DOI] [PubMed] [Google Scholar]
- 28.Della Bella P, et al. Epicardial Ablation for Ventricular TachycardiaClinical Perspective. Circ. Arrhythm. Electrophysiol. 2011;4(5):653–659. doi: 10.1161/CIRCEP.111.962217. [DOI] [PubMed] [Google Scholar]
- 29.Fernández-Armenta J, Berruezo A. How to recognize epicardial origin of ventricular tachycardias? Curr. Cardiol. Rev. 2014;10(3):246–256. doi: 10.2174/1573403X10666140514103047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Berruezo A, et al. Electrocardiographic recognition of the epicardial origin of ventricular tachycardias. Circulation. 2004;109(15):1842–1847. doi: 10.1161/01.CIR.0000125525.04081.4B. [DOI] [PubMed] [Google Scholar]
- 31.van Dam PM, et al. Application of the fastest route algorithm in the interactive simulation of the effect of local ischemia on the ECG. Med. Biol. Eng. Comput. 2009 Jan;47(1):11–20. doi: 10.1007/s11517-008-0391-2. [DOI] [PubMed] [Google Scholar]
- 32.van Dam PM, et al. Quantitative localization of premature ventricular contractions using myocardial activation ECGI from the standard 12-lead electrocardiogram. J. Electrocardiol. 2013;46(6):574–579. doi: 10.1016/j.jelectrocard.2013.08.005. [DOI] [PubMed] [Google Scholar]
- 33.Potyagaylo D, et al. Influence of Modeling Errors on the Initial Estimate for Nonlinear Myocardial Activation Times Imaging Calculated With Fastest Route Algorithm. IEEE Trans. Biomed. Eng. 2016 Dec;63(12):2576–2584. doi: 10.1109/TBME.2016.2561973. [DOI] [PubMed] [Google Scholar]
- 34.Cabrera-Lozoya R, et al. Image-Based Biophysical Simulation of Intracardiac Abnormal Ventricular Electrograms. IEEE Trans. Biomed. Eng. 2017 Jul;64(7):1446–1454. doi: 10.1109/TBME.2016.2562918. [DOI] [PubMed] [Google Scholar]
- 35.Liu Z, et al. Noninvasive reconstruction of three-dimensional ventricular activation sequence from the inverse solution of distributed equivalent current density. IEEE Trans. Med. Imaging. 2006;25(10):1307–1318. doi: 10.1109/tmi.2006.882140. [DOI] [PubMed] [Google Scholar]
- 36.Yu L, et al. Temporal sparse promoting three dimensional imaging of cardiac activation. IEEE Trans. Med. Imaging. 2015;34(11):2309–2319. doi: 10.1109/TMI.2015.2429134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Han C, et al. Noninvasive imaging of three-dimensional cardiac activation sequence during pacing and ventricular tachycardia. Heart Rhythm. 2011;8(8):1266–1272. doi: 10.1016/j.hrthm.2011.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Han C, et al. Noninvasive cardiac activation imaging of ventricular arrhythmias during drug-induced QT prolongation in the rabbit heart. Heart Rhythm. 2013;10(10):1509–1515. doi: 10.1016/j.hrthm.2013.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Han C, et al. Noninvasive reconstruction of the three-dimensional ventricular activation sequence during pacing and ventricular tachycardia in the canine heart. Am. J. Physiol.-Heart Circ. Physiol. 2012;302(1):H244–H252. doi: 10.1152/ajpheart.00618.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Han C, et al. Imaging cardiac activation sequence during ventricular tachycardia in a canine model of nonischemic heart failure. Am. J. Physiol.-Heart Circ. Physiol. 2015;308(2):H108–H114. doi: 10.1152/ajpheart.00196.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Liu C, et al. Noninvasive Mapping of Transmural Potentials During Activation in Swine Hearts From Body Surface Electrocardiograms. IEEE Trans. Med. Imaging. 2012 Sep;31(9):1777–1785. doi: 10.1109/TMI.2012.2202914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Zhou Z, et al. Noninvasive Imaging of 3-Dimensional Myocardial Infarction From the Inverse Solution of Equivalent Current Density in Pathological Hearts. IEEE Trans. Biomed. Eng. 2015 Feb;62(2):468–476. doi: 10.1109/TBME.2014.2358618. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Zhou Z, et al. Noninvasive Imaging of High-Frequency Drivers and Reconstruction of Global Dominant Frequency Maps in Patients With Paroxysmal and Persistent Atrial Fibrillation. IEEE Trans. Biomed. Eng. 2016;63(6):1333–1340. doi: 10.1109/TBME.2016.2553641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.He B, et al. Noninvasive imaging of cardiac transmembrane potentials within three-dimensional myocardium by means of a realistic geometry anisotropic heart model. IEEE Trans. Biomed. Eng. 2003;50(10):1190–1202. doi: 10.1109/TBME.2003.817637. [DOI] [PubMed] [Google Scholar]
- 45. [Accessed: 11-Aug-2017];Deep Learning Toolbox - File Exchange - MATLAB Central. [Online]. Available: http://www.mathworks.com/matlabcentral/fileexchange/38310-deep-learning-toolbox.
- 46.Wilson FN, et al. Electrocardiograms that represent the potential variations of a single electrode. Am. Heart J. 1934 Apr;9(4):447–458. [Google Scholar]
- 47.Giffard-Roisin S, et al. Non-Invasive Personalisation of a Cardiac Electrophysiology Model from Body Surface Potential Mapping. IEEE Trans. Biomed. Eng. 2017;PP(99):1–1. doi: 10.1109/TBME.2016.2629849. [DOI] [PubMed] [Google Scholar]