Skip to main content
Health Research Alliance Author Manuscripts logoLink to Health Research Alliance Author Manuscripts
. Author manuscript; available in PMC: 2022 Apr 4.
Published in final edited form as: IEEE Access. 2021 Sep 14;9:127433–127443. doi: 10.1109/access.2021.3112678

Deep Learning-Based Diagnosis of Peripheral Artery Disease via Continuous Property-Adversarial Regularization: Preliminary in Silico Study

SOOHO KIM 1,2, JIN-OH HAHN 3, BYENG DONG YOUN 1,2,4
PMCID: PMC8979332  NIHMSID: NIHMS1741723  PMID: 35382437

Abstract

This paper presents a novel deep learning-based arterial pulse wave analysis (PWA) approach to diagnosis of peripheral artery occlusive disease (PAD). Naïve application of deep learning to PAD diagnosis can be hampered by the fact that securing a large amount of longitudinal dataset encompassing diverse PAD severity as well as anatomical and physiological variability presents formidable challenge. Training of a deep neural network (DNN) to a small training dataset raises the risk of overfitting the PAD diagnosis algorithm only to the individuals in the training dataset while deteriorating its ability to generalize also to other individuals who may exhibit a large variability in anatomical and physiological characteristics beyond the training dataset. To overcome these obstacles, we propose a continuous property-adversarial regularization (CPAR) approach to robust generalization of a DNN against scarce datasets. Our approach fosters the exploitation of latent features that can facilitate the intended task independently of confounding property-induced disturbances. by regularizing the extraction of disturbance-dependent latent features in the network’s feature extraction layer. By training and testing a deep convolutional neural network (CNN) for PAD diagnosis using scarce virtual datasets, we illustrated that the CNN trained by our approach was superior to a conventionally trained CNN in detecting and assessing the severity of PAD against disturbances originating from diversity in the patients’ height and arterial stiffness: when trained with one-time pulse wave signal measurement at ankle and brachial arteries in a small number of patients, our approach achieved detection accuracy of >90% and severity assessment of 0.83 in r2 value, which were >15% and >40% improvement over conventional approach without CPAR. In addition, we ascertained the advantage of our approach in efficient training and robust generalization of DNN by contrasting it to multi-task learning which promotes the exploitation (as opposed to regularization in CPAR) of disturbance-dependent latent features in fulfilling the intended tasks.

INDEX TERMS: Deep learning, continuous domain-adversarial learning, peripheral artery disease, cardiovascular disease, arterial pulse wave analysis, convolutional neural network

I. INTRODUCTION

Peripheral artery occlusive disease (PAD) is a highly prevalent vascular disease entailing high morbidity and mortality risks. The United States and the world are estimated to have >8 million (2000 [1]) and >200 million (in 2010 [2]) patients with PAD. In addition, PAD may become even more prevalent with societal aging. Regardless, PAD is underdiagnosed with low primary care awareness [3].

PAD diagnosis in clinical settings requires imaging-based techniques [4]–[6], which involve invasive procedures and/or expensive equipment. Hence, these techniques are not suited to high-throughput PAD screening and surveillance. On the other hand, the ankle-brachial index (ABI), a widely employed technique for PAD screening, has been criticized for its poor accuracy and robustness in diagnosing PAD [7].

The analysis of arterial pulse wave signals (called here-after the pulse wave analysis (PWA)) can play a pivotal role in PAD diagnosis. Indeed, PAD alters the morphology (i.e., shape) of arterial pulse wave signals (e.g., arterial blood pressure (BP) signals) by affecting the pulse wave propagation and reflection characteristics in the arteries. Hence, PWA has the potential to outperform techniques built upon discrete fiducial points (i.e., features) in the arterial pulse wave signals (e.g., ABI) by allowing the exploitation of the arterial pulse wave signals in their entirety [8]–[11]. A practical advantage of PWA is that it may be relevant to affordable and high-throughput PAD screening and surveillance with convenient arterial pulse wave signals measured at the extremity locations (e.g., arm and ankle).

The mainstream of existing work on PWA has resorted to empirical feature selection. Hence, PWA may be combined with modern deep learning (DL) to leverage its ability for automatic feature selection [11]. Yet, an obvious challenge in incorporating DL into PWA is dataset limitation constraint: naïve application of DL to PWA-based PAD diagnosis requires massive longitudinal datasets associated with PAD progression in time, which preferably also encompass diverse anatomical and physiological characteristics as well as disease severity levels. Yet in reality, only scarce (and possibly non-longitudinal) dataset from a small number of patients associated with limited anatomical and physiological characteristics may be available, which may pose formidable challenges in training a deep neural network (DNN) for PAD diagnosis which is generalizable to a wide range of anatomical and physiological characteristics. In fact, the morphology of the arterial pulse wave signals depends not only on PAD but also on the anatomical and physiological characteristics of a patient. For example, mechanical properties of the arterial wall and height of the patient can alter the shape of arterial pulse wave signals by affecting the way forward and backward traveling BP waves are superposed – mechanical properties (e.g., arterial stiffness, diameter, and thickness) by altering pulse wave velocity (PWV) and height by altering the timings of superposition even for a given PWV [12]. Hence, if a DNN is trained using scarce dataset from a small number of patients (who possibly encompass only a narrow range of anatomical and physiological characteristics), the resulting PAD diagnosis algorithm may be overfitted only to the individuals in the training dataset, while its ability to generalize to other individuals (who may exhibit a large variability in anatomical and physiological characteristics beyond the training dataset) may be deteriorated.

To overcome these obstacles, we propose a continuous property-adversarial regularization (CPAR) approach to robust generalization of a DNN against scarce datasets. Inspired by domain-adversarial learning [13], we extend the conventional discrete domain-adversarial regularization to enable regularization and generalization of a DNN across multiple continuous properties by reformulating the anti-domain classification formalism used in discrete domain-adversarial regularization to a novel anti-property regression formalism applicable to multiple continuous properties. In this way, our approach fosters the exploitation of latent features that can facilitate the intended task independently of confounding property-induced disturbances, by regularizing the extraction of disturbance-dependent latent features in the network’s feature extraction layer. By training and testing a deep convolutional neural network (CNN) for PAD diagnosis using scarce virtual datasets, we illustrated that the CNN trained by CPAR was superior to a conventionally trained CNN in detecting PAD and assessing its severity against disturbances originating from diversity in the patients’ height and arterial stiffness.

This paper is organized as follows. Section II covers the details of the CPAR approach and how it was applied to develop and test a CNN-based PAD diagnosis algorithm. Section III presents results, and Section IV provides discussion of the results. Section V concludes the paper with future directions.

II. METHODS

In this section, we first present the CPAR approach to robust generalization of a DNN against continuous property-induced disturbances. Then, we apply the CPAR approach to PAD diagnosis via CNN-based PWA, in order to efficiently train a CNN using relatively scarce dataset from a small number of patients so that it can robustly generalize to broad patient cohort with wide anatomical and physiological variability.

A. CONTINUOUS DOMAIN-ADVERSARIAL REGULARIZATION

In circumstances in which a DNN is trained using a scarce dataset consisting of only a small number of samples, the DNN may be overfitted to the training dataset and thus exhibit poor performance when applied to an unseen dataset whose samples are associated with properties not covered by those in the training dataset. Two alternative approaches may be employed to mitigate such a degradation in the generalization performance of a DNN: transductive learning and regularization. In transductive learning, a DNN is pre-trained with the training dataset and subsequently refined in the target dataset [14]–[16]. In a prior work, a CNN was pre-trained using chest X-ray images associated with adults, and then was refined using chest X-ray images associated with pediatric population for pediatric pneumonia diagnosis [17]. In another prior work, a CNN was pre-trained using digital mammography images, and then was refined using digital tomosynthesis images for breast cancer diagnosis [18]. Hence, transductive learning requires two training phases and labeled target dataset. In regularization, a DNN is trained with a regularization penalty imposed on the loss function to bias and generalize the DNN [19]. To properly bias the DNN to maximize its generalization capability, the regularization penalty must be judiciously formulated by considering the intended task and the characteristics of the dataset. Otherwise, the DNN may still be overfitted, or it may be underfitted and generalize too broadly, ultimately leading to poor performance in both training and target datasets. Regularization-based generalization has been pursued in various applications [20], [21]. For instance, a prior work on fault diagnosis of rotating machinery regularized the discriminant structure of a DNN to extract robust latent features generalizable to unseen domains [22]. In PAD diagnosis based on PWA, the morphology of arterial pulse signals (which are presented to the DNN as inputs) are influenced by PAD as well as other confounding factors, e.g., anatomical (e.g., height of a subject) and physiological (e.g., arterial biomechanical properties of a subject) characteristics of individual patients. A unique nature of these confounding factors is that they are continuous rather than discrete. Hence, an ideal regularization penalty for training a DNN suited to PAD diagnosis based on a scarce dataset must facilitate the generalization of the DNN across a wide range of continuous disturbances. In this work, we propose a novel regularization approach that can address such a challenge.

Our regularization approach is inspired by the state-of-the-art domain-adversarial learning. In domain-adversarial learning, a DNN is trained to exploit the features relevant to the task but independent of the domain via domain-adversarial training [13], [23]. This is accomplished by defining the shift between a source domain (DS; training dataset) and a target domain (DT; test dataset and datasets encountered in real application of the DNN) as the H-divergence and utilizing it in training a DNN: the H-divergence serves as a penalty term in training the feature extraction layer, where it is minimized to set the parameters in the feature extraction layer so that the consequent features are regularized and domain-independent [24], [25]. Our CPAR approach extends the conventional domain-adversarial training (applicable mostly to discrete and categorical domains) to continuously connected source and target domains. Consider a domain defined by a continuous parameter P:

P{pp_pp¯} (1)

where the lower bound p_ and the upper bound p¯ specify the range of p. For a given sample x in the domain associated with pxP, we define the source and target domains as follows:

DS(p)={xd(p,px)ε}DT(p)={xd(p,px)>ε} (2)

where d (x, y) ≜ tanh |xy| is a distance measure between x and y, and ε > 0. Let the mapping in a DNN associated with the feature extraction layer, label prediction layer, and domain regression layer as Gf (x), Gl (Gf (x)), and Gη (Gf (x)), respectively (Figure 1). Denoting the set of latent features derived from the samples in the source and target domains pertaining to pP as follows:

FS(p)={Gf(x)xDS(p)}FT(p)={Gf(x)xDT(p)} (3)

the effective H-divergence dHeff(P) between FS (p) and FT (p) across the entire (source and target) domain can be derived by integrating the H-idvergence dH(FS(p),FT(p)) associated with a specific value of p over all possible values of p in the domain. Given nS (p) the number of samples from the source domain and nT (p) the number of samples from the target domain, dH(FS(p),FT(p)) can be computed empirically as follows [13]:

dH(FS(p),FT(p))2{1[1nS(p)i=1nS(p)I[η(Gf(xi)=0p)]+1nT(p)j=1nT(p)I[η(Gf(xj)=1p)]]} (4)

where η(·| p) is a binary classifier for source vs target domains in the hypothesis class H(p) associated with p, while I [·] is an indicator (i.e., I = 1 if the predicate is true and I = 0 otherwise). Then, dHeff(P) is given by:

dHeff(P)=pdH(FS(p),FT(p)) (5)

where the summation is an approximation to the integration over p. Then, it is possible to regularize the latent features against P (i.e., make them independent of P) by training a DNN so that dHeff(P) is minimized, which allows the DNN to robustly perform the intended labeling task both in the source and target domains [13]. Inspired by Eq. (5), we conceive the following loss function for domain regression:

LD(θf,θηP)=p[1nS(p)+nT(p)k=1nS(p)+nT(p)LD(xk,θf,θηp)]=p[1nS(p)+nT(p)k=1nS(p)+nT(p)×{(1e(p,pxk))log1d(pxk,Gη(Gf(xk)))+e(p,pxk)log11d(pxk,Gη(Gf(xk)))}] (6)

where e(p,pxk){1,d(p,pxk)ε0,d(p,pxk)>ε, and (Gf (xk)) is the domain regressed by the DNN when xk is inputted. We employ the standard 2-norm-based loss function for label prediction:

LL(θf,θlP)=p[1nS(p)+nT(p)k=1nS(p)+nT(p)LL(xk,θf,θlp)]=p[1nS(p)+nT(p)k=1nS(p)+nT(p)(ykGl(Gf(xk)))2] (7)

where yk is the label associated with xk. Then, training a DNN with CPAR boils down to solving the following set of optimization problems:

θl*=arg min θlLL(θf,θlP)θη*=arg min θηLD(θf,θηP)θf*=arg min θfLL(θf,θlP)+λ1LD(θf,θηP) (8)

where λ > 0 is the regularization weight. It is noted that the way LD (θf, θn | P) is incorporated into the training of θη and θf are distinct in Eq. (8). Indeed, given that LD (θf, θη | P) > 0 and its value increases as domain regression fails, θη can be optimized by minimizing LD (θf, θn | P). In contrast, if LD (θf, θη | P) is incorporated into the training of θf in the classical way (i.e., optimizing θf by minimizing LL (θf l | P) − λLD (θf, θη | P)), then the cost function may behave poorly: (i) the loss function is no longer lower-bounded since LD (θf, θη | P) can grow indefinitely, and (ii) CPAR cannot be effectively enforced when LD (θf, θη | P) ≈ 0 because the contribution of LD (θf, θη | P) to the gradient becomes negligible. Hence, our choice of the loss function for training of θf has several advantages: (i) it is positive and lower-bounded at zero; (ii) it is large when LD (θf, θη | P) is small, which promotes regularization by CPAR; and (iii) it gets dominated by LL (θf l | P) as LD (θf, θη | P) grows indefinitely, which promotes training of θη to improve the domain regression layer while minimizing the CPAR action.

FIGURE 1.

FIGURE 1.

A CNN architecture for peripheral artery occlusive disease (PAD) diagnosis via CNN-based PWA.

It is worth mentioning that our CPAR approach can be readily extended to simultaneously deal with multiple domains pertaining to more than one continuous parameters Pi{pipi_pip¯i}, simply by connecting multiple domain regression layers to the feature extraction layer (see Figure 1 for a CNN with two domain regression layers). In this case, the loss functions in Eq. (8) are extended to the following, where NP is the number of domain parameters:

θl*=arg min θlPiLL(θf,θlPi)θηi*=arg min θηiLD(θf,θηiPi),   i=1,NPθf*=arg min θfPiLL(θf,θlPi)+λi1LD(θf,θηiPi) (9)

B. PAD DIAGNOSIS BASED ON CONTINUOUS PROPERTY-ADVERSARIAL REGULARIZATION

We applied the CPAR approach in Section II.A to the development of a CNN for PAD diagnosis under dataset limitation constraints. As shown in Figure 1, the CNN architecture consists of a modified AlexNet [26] structure (whose efficacy in PAD diagnosis was demonstrated in our prior work [11]) combined with two domain regression layers. The CNN receives arterial pulse wave signals at an arm (brachial artery) and an ankle (tibial artery) as inputs to perform PWA and diagnose PAD. The label prediction layer predicts PAD severity. The domain regression layers are used in the training phase to maximize the extraction of features independent of two key anatomical and physiological characteristics affecting the morphology of arterial pulse wave signals and thus the efficacy of PAD diagnosis: height and arterial stiffness (using PWV as a surrogate measure). Details follow.

1). VIRTUAL DATASET CREATION

We created virtual datasets for training and testing of a CNN for PAD diagnosis using an established lumped-parameter mathematical model of human arterial tree based on the transmission line (TL) theory [27] (Figure 2). The mathematical model consists of 55 TLs, each of which represents a segment of arterial tree characterized by segment-specific viscous, elastic, and inertial properties. Full details of the mathematical model is provided in He et al. [27]. This mathematical model was validated using physiological measurements as well as the results of other studies, and was used in the study of arterial stenosis and arterial viscoelasticity [10], [28], [29] as well as PAD diagnosis [10], [11].

FIGURE 2.

FIGURE 2.

A lumped-parameter mathematical model of human arterial tree based on the transmission line theory.

We created virtual datasets by perturbing a few anatomical and arterial mechanical parameters in the lumped-parameter mathematical model from their respective nominal values. We specifically varied (i) height, (ii) stiffness, diameter, and thickness of all the arteries, and (iii) resistances associated with terminal arteries. Then, we simulated the mathematical model equipped with these parameters to obtain a wide range of virtual brachial and tibial artery BP signals. We considered both inter-individual variability (IIV) and intra-individual uncertainty in perturbing these parameters. We considered the IIV in height in the range of 162–198cm (+/−10% perturbation with respect to nominal value). We considered the IIV in arterial diameter, thickness, and peripheral load resistance parameters of +/−20% perturbation around their respective nominal values. We considered the IIV in arterial stiffness to replicate the range of PWV observed in adults of 50–60 years in age in hypertension cohort (systolic BP>160mmHg and diastolic BP>100mmHg; PWV 4.8–15.1m/s) [30]. In combination with the IIV associated with arterial diameter, thickness, and peripheral load resistances, PWV in the range of 4.4–15.8m/s was obtained by perturbing arterial stiffness over −20%–400% of its nominal value. To create multiple arterial pulse signal samples from each virtual patient obtained with the IIVs described above, we considered intra-individual uncertainty in the form of lognormal distributions associated with the five anatomical (i.e., height) and arterial mechanical (i.e., arterial stiffness, diameter, thickness, and peripheral load resistance) parameters, with subject-specific values (nominal plus IIV) as mean and coefficient of variation of 0.01.

We considered PAD at the abdominal aorta (which is one of the most common PAD locations). To simulate PAD with increasing severity, we decreased the diameter of the abdominal aorta from its nominal value. We defined the PAD severity as the degree of occlusion: 0% when normal, and 100% when fully occluded.

To create training dataset, we randomly sampled 32 virtual patients based on the IIV described above and a randomly chosen PAD severity (from within 0%–80%). From each virtual patient, we created 1,000 random samples subject to intra-individual anatomical and physiological variability by applying the intra-individual uncertainty described above to the five anatomical and arterial mechanical parameters. Then, we repeatedly simulated the mathematical model 32,000 times by characterizing it by each of the 32,000 random parameter samples and the corresponding PAD severity levels to create brachial and tibial BP pulse signals. The resulting 32,000 arterial BP pulse signals pertaining to the 32 virtual patients along with the PAD severity, height, and PWV associated with these arterial BP pulse signals formed the training dataset.

To create test dataset, we sampled a large number of virtual patients by widely changing the IIV corresponding to the five anatomical and arterial mechanical parameters by 5% increments (5 (height: −10%, −5%, 0%, +5%, +10%) × 9 (artery diameter: −20%, −15%, …+15%, +20%) × 9 (artery thickness: −20%, −15%, …+15%, +20%) × 9 (peripheral load resistance: −20%, −15%, …+15%, +20%) × 77 (artery stiffness: −20%, −15%, …+395%, +400%) = 280,665) and PAD severity by 10% increment (from within 0%–80%). From each virtual patient at a specific PAD severity level, we created 10 random samples subject to intra-individual anatomical and physiological variability by applying the intra-individual uncertainty described above to the five anatomical and arterial mechanical parameters. Then, we repeatedly simulated the mathematical model 25,259,850 (=280,665 × 9 PAD severity levels) times by characterizing it by each of the 2,806,650 random parameter samples and all the PAD severity levels (0%–80%) to create brachial and tibial BP pulse signals. The resulting 25,259,850 arterial BP pulse signals pertaining to the 280,665 virtual patients along with the PAD severity associated with these arterial BP pulse signals formed the test dataset. The rationale behind creating a large-size test data was to extensively evaluate the efficacy of the proposed PAD diagnosis approach via CNN-based PWA.

2). TRAINING, TESTING, AND ANALYSIS OF CNN FOR PAD DIAGNOSIS

The morphology of arterial pulse wave signals are influenced by confounding factors arising from anatomical and arterial mechanical characteristics as well as PAD severity. According to the Moens-Korteweg equation [12], arterial stiffness, diameter, and thickness alter the shape of arterial pulse wave signals by changing PWV (which impacts the timings with which forward and backward traveling pulse waves are superimposed [12]). In addition, height also alters the shape of arterial pulse wave signals by changing the length of pulse wave travel paths. To achieve robust PAD diagnosis accuracy irrespective of these disturbances, we applied the CPAR approach to the training of a CNN for PAD diagnosis via PWA. Our prior work suggested that height and arterial stiffness exerts large influence on the morphology of arterial pulse wave signals among all the disturbances due to anatomical and arterial mechanical characteristics [11]. Hence, we included two domain regression layers in the CNN as shown in Figure 1, and used the CPAR approach so that the features extracted from the brachial and tibial BP pulse signals and inputted to the label prediction layer for PAD diagnosis do not include those indicative of height and arterial stiffness.

Considering that direct non-invasive measurement of arterial stiffness is extremely challenging if not impossible, we employed PWV as a surrogate of arterial stiffness. Although PWV is dependent on arterial diameter and thickness as well as arterial stiffness [12], the observation from our prior work showed that the effect of arterial stiffness is predominant in altering the shape of arterial BP pulse signals among all the arterial mechanical properties [11], which suggests that PWV may serve as a credible surrogate of arterial stiffness.

We evaluated the efficacy of our CPAR-trained CNN-based PWA approach to PAD diagnosis for its ability to detect PAD and assess PAD severity by training and testing the CNN using the training and test datasets outlined in Section II.A.1. To robustly evaluate our approach, we repeatedly created the training and test datasets 10 times, evaluated the performance of the CNN, and reported the efficacy of our approach in terms of the aggregated performance of the CNN obtained from the 10 tests. In training the CNN, we used the ADAM optimizer with α = 0.5, β = 0.999, the learning rate of 0.0001, and the regularization weight (λ) of 0.002.

Our performance metrics included (i) sensitivity, specificity, accuracy, and area under the ROC curve (AUC) as the measures of PAD detection and (ii) r2 value and root-mean-squared error (RMSE) between actual vs CNN-predicted PAD severity as the measures of PAD severity assessment, all derived using the test dataset. We evaluated the detection accuracy of our CPAR-trained CNN-based PWA approach to PAD diagnosis against a number of PAD severity values as the PAD labeling threshold. To examine the advantage of PWA over conventional ABI technique, we compared the performance metrics associated with PWA and ABI. To enable objective comparison, we mapped ABI value to PAD severity via a pre-calibrated polynomial regression model relating ABI to PAD severity (which was derived based on the virtual patient characterized by the nominal anatomical and physiological parameter values).

To scrutinize the role of CPAR in promoting the use of features independent of domain disturbances, we compared the PAD diagnosis efficacy of the CNN trained with (i) CPAR, (ii) conventional learning without CPAR (in which no domain-adversarial regularization is enforced), and (iii) multi-task learning (which, as opposed to CPAR, fosters the exploitation of the features commonly useful for PAD diagnosis and domain regression) using the test dataset. In addition, we investigated the performance of our CNN-based PWA approach to PAD diagnosis with respect to the extent to which CPAR is applied, by comparing the CNN trained with CPAR applied to height only, PWV only, and both height and PWV using the test dataset. Finally, to assess the validity of PWV as a credible surrogate of arterial stiffness, we also trained the CNN with arterial stiffness used for domain regularization, and compared its performance with the CNN with PWV used for domain regularization using the test dataset.

III. RESULTS

Figure 3 shows (a) the receiver operating characteristic (ROC) curve and (b) the Bland-Altman plot between actual vs predicted PAD severity associated with our CNN-based PWA approach to PAD diagnosis. Figure 4 compares the PAD detection accuracy associated with the CNN trained with (i) CPAR, (ii) conventional learning without CPAR, and (iii) multi-task learning. Figure 5 compares the PAD detection accuracy of the CNN trained with CPAR applied to (i) height only, (ii) PWV only, and (iii) both height and PWV. Figure 6 compares the PAD detection accuracy of the CNN trained with (i) PWV and (ii) arterial stiffness used for domain regularization, respectively. TABLE 1 summarizes the PAD severity assessment accuracy associated with the ABI technique as well as CNN trained with (i) CPAR, (ii) conventional learning without CPAR, and (iii) multi-task learning.

FIGURE 3.

FIGURE 3.

Efficacy of CNN-based PWA approach to PAD diagnosis. (a) Receiver operating characteristic (ROC) curve. (b) Bland-Altman plot between actual vs predicted PAD severity.

FIGURE 4.

FIGURE 4.

PAD detection accuracy associated with the CNN trained with CPAR, conventional learning without CPAR, and multi-task learning. (a) Sensitivity. (b) Specificity. (c) Accuracy. (d) AUC.

FIGURE 5.

FIGURE 5.

PAD detection accuracy of the CNN trained with CPAR applied to height only (H), PWV only (PWV), and both height and PWV (H&PWV). (a) Sensitivity. (b) Specificity. (c) Accuracy. (d) AUC.

FIGURE 6.

FIGURE 6.

PAD detection accuracy of the CNN trained with PWV (PWV) and arterial stiffness (E) used for domain regularization.

TABLE 1.

PAD severity assessment accuracy associated with the ABI technique as well as CNN trained with (i) CPAR, (ii) conventional learning without CPAR, and (iii) multi-task learning.

ABI DL (NO CPAR) DL (CPAR) MULTI-TASK LEARNING
RMSE [%] 23.0 15.0 9.5 16.9
r2 Value 0.034 0.588 0.834 0.478

IV. DISCUSSION

Current PAD diagnosis requires imaging-based methods not suited to convenient screening and surveillance purposes. In addition, the ABI technique suffers from poor diagnostic accuracy. DL-based PWA has the potential to make a leap in PAD diagnosis by virtue of its ability to automatically exploit the features indicative of PAD from readily measurable arterial pulse wave signals. However, the morphology of arterial pulse wave signals are influenced by anatomical and arterial mechanical characteristics as well as PAD. If combined with only scarce dataset from a small number of patients (who possibly encompass only a narrow range of anatomical and physiological characteristics) available for training of CNN, these confounding factors can deteriorate the training of CNN by increasing the risk of the CNN overfitted only to the individuals in the training dataset while deteriorating its generalizability to unseen individuals who may exhibit more diverse anatomical and physiological characteristics beyond the training dataset. To enable robust diagnosis of PAD against these challenges, we developed a novel CPAR approach and applied it to the development of a CNN-based PWA approach to PAD diagnosis. Below we demonstrate the potential of CPAR in enabling robust PAD diagnosis against confounding anatomical and physiological disturbances.

A. EFFICACY OF CPAR

The CNN trained with CPAR exhibited adequate PAD detection and assessment accuracy (Figure 3). It showed AUC consistently higher than 0.9 across all the detection thresholds considered in this work, and achieved nearly unbiased severity assessment (bias <0.1%).

Comparing the CNN trained with CPAR against the CNN trained without CPAR and another CNN trained with multi-task learning ascertained the role of CPAR in robustifying the CNN against anatomical and arterial mechanical disturbances (Figure 4 and TABLE 1). The CNN trained with CPAR was superior to the CNN trained without CPAR and the CNN trained with multi-task learning in both detection and severity assessment aspects. Specifically, the CPAR-trained CNN exhibited superior sensitivity while maintaining specificity comparable to its competitors (by virtue of its unbiased estimation of PAD severity, in contrast to its competitors in which PAD severity was underestimated on the average), thereby achieving superior accuracy and AUC characteristics. It is noted that the CNN trained without CPAR was superior to the CNN trained with multi-task learning. Noting that (i) the CNN trained without CPAR does not enforce any action against domain disturbances, and that (ii) multi-task learning fosters the extraction of latent features indicative of domain disturbances in performing PAD diagnosis, the results suggest that regularizing the latent features with CPAR can improve the performance of the intended task susceptible to domain-induced disturbances by making them independent of (or at least less dependent on) domain disturbances.

The advantage of CPAR may also be ascertained from the fact that the efficacy of CNN-based PWA approach to PAD diagnosis was improved as CPAR was applied simultaneously to multiple domain-induced disturbances than to one domain disturbance at a time (Figure 5). The CNN trained with CPAR to regularize against height and PWV exhibited consistently higher sensitivity than those trained to regularize only against height or PWV, despite slightly lower specificity than the height-regularized CNN (which, again, appears to be due to its unbiased estimation of PAD severity, in contrast to its competitorsinwhichPADseveritywasunderestimatedonthe average). The CNN regularized against both height and PWV was persistently superior to the one regularized only against PWV in terms of accuracy and AUC. In terms of accuracy and AUC characteristics, the CNN regularized against both height and PWV was not consistently superior to its competitor regularized only against height. However, considering all the performance aspects (sensitivity in particular), we contend that the CNN regularized against both height and PWV may still be superior to the ones regularized only against height or PWV.

All in all, our results strongly demonstrate the potential of CPAR in realizing CNN-based PWA approach to PAD diagnosis. CPAR can derive a CNN robust against anatomical and arterial mechanical properties altering the morphology of arterial BP pulse signals, by virtue of its ability to foster the exploitation of latent features independent of the domain-induced disturbances in performing PAD diagnosis task even when the dataset available for training is scarce and non-longitudinal.

B. POTENTIAL FOR CLINICAL APPLICABILITY

A few critical questions arise regarding the potential of the CNN-based PWA approach to PAD diagnosis for real-world clinical application: (i) is it superior to the current state-of-the-art; and (ii) does the use of PWV as a surrogate of arterial stiffness drastically impact the PAD diagnosis efficacy?

The CNN-based PWA approach to PAD diagnosis was much superior to the ABI technique in terms of both detection and severity assessment (TABLE 1). One implication is that the exploitation of the entire arterial pulse signals is more efficacious than the use of isolated fiducial points therein in diagnosing PAD. This implication justifies the complexity in the measurement and analytics required for PWA. Another implication is that the CNN-based PWA approach can leverage CPAR to improve its robustness against domain-induced disturbances due to the anatomical and arterial mechanical characteristics, whereas the ABI technique is inevitably prone to those disturbances.

The use of PWV instead of (practically unmeasurable) arterial stiffness did not largely compromise the efficacy of CNN-based PAD diagnosis (Figure 6). Indeed, the former exhibited only a marginal drop in the sensitivity, accuracy, and AUC characteristics relative to the latter, while the specificity was more or less maintained. This implies that, although PWV is not a perfect surrogate of arterial stiffness due to its reliance on other arterial mechanical properties, it may still be a viable choice to enable real-world implementation of the CNN-based PWA approach to PAD diagnosis in conjunction with CPAR-augmented learning with regularization.

All in all, our results suggest that CPAR-trained CNN-based PWA approach to PAD diagnosis may open up unprecedented opportunities for improving the convenience and accuracy of PAD screening and surveillance.

V. CONCLUSION

In this paper, we proposed CPAR, an enabling methodology for robust training and generalization of a DNN to detect and assess the severity of PAD against continuous property-induced disturbances arising from anatomical and arterial mechanical characteristics. The results demonstrated the ability of the CPAR to promote the extraction of latent features independent of domain disturbances, thereby robustifying the performance of PAD diagnosis against a wide range of IIV in height and arterial stiffness. Given the proof-of-concept nature of this work based on virtual in silico dataset, future work must investigate the in vivo efficacy of the proposed PAD diagnosis approach based on CPAR-trained CNN. In addition, it may be worth investigating the potential of CPAR-trained deep learning in the diagnosis of diseases other than PAD.

Acknowledgments

This work was supported in part by the National Research Foundation of Korea under Grant 2020R1A2C3003644, and in part by the American Heart Association under Grant 18IPA34110466.

Biographies

SOOHO KIM received the B.S. degree in mechanical and aerospace engineering from Seoul National University, Seoul, South Korea, in 2017, where he is currently pursuing the Ph.D. degree. He is also a Professional at OnePredict Inc. His current research interests include prognostics and health management (PHM) of industrial systems and human health based on machine learning. He was the Winner of PHM SocietyData Challenge, in 2017, and PHM Asia Pacific Data Challenge, in 2017.

graphic file with name nihms-1741723-b0001.gif

JIN-OH HAHN (Senior Member, IEEE) received the B.S. and M.S. degrees in mechanical engineering from Seoul National University, in 1997 and 1999, respectively, and the Ph.D. degree in mechanical engineering from Massachusetts Institute of Technology (MIT), in 2008. He is currently with the University of Maryland, where he is an Associate Professor with the Department of Mechanical Engineering, a Faculty Affiliate with Robert E. Fischell Institute for Biomedical Devices, and a Faculty Affiliate with the Applied Mathematics, Statistics, and Scientific Computation (AMSC) Program. His current research interests include applications of dynamical systems, control, and machine learning theory to health monitoring, fault diagnostics, and maintenance and treatment of dynamical systems with emphasis on health and medicine. He was a recipient of the IEEE EMBS Most Impactful Paper Award from the IEEE Engineering in Medicine and Biology Society in 2019, the Faculty Early Career Development (CARRER) Award from the National Science Foundation (NSF) in 2018, the Best Bio-Systems and Healthcare Paper Award from the Bio-Systems and Healthcare Technical Committee in the Dynamic Systems and Control (DSC) Division of ASME in 2017, the Young Investigator Program Award from the Office of Naval Research (ONR) in 2014, the Young Investigator Grant Award from the Korean-American Scientists and Engineers Association (KSEA) in 2013, and the Outstanding Gemstone Mentor Award from the Gemstone Honors Program at the University of Maryland in 2019. He is an Associate Editor of ASME Journal of Dynamic Systems, Measurement and Control.

graphic file with name nihms-1741723-b0002.gif

BYENG DONG YOUN received the B.S. degree in mechanical engineering from Inha University, Incheon, South Korea, in 1996, the M.S. degree in mechanical engineering from Korea Advanced Institute of Science and Technology, Daejeon, South Korea, in 1998, and the Ph.D. degree in mechanical engineering from the University of Iowa, Iowa City, IA, USA, in 2001. He is currently a Professor with the Department of Mechanical and Aerospace Engineering, Seoul National University, Seoul, South Korea. He was a recipient of the ASME IDETC Best Paper Awards in 2001 and 2008, the ISSMO/Springer Prize for a Young Scientist in 2005, the IEEE PHM Competition Winner in 2014, and the PHM Society Data Challenge Competition Winner in 2014, 2015, and 2017.

graphic file with name nihms-1741723-b0003.gif

REFERENCES

  • [1].Allison MA, Ho E, Denenberg JO, Langer RD, Newman AB, Fabsitz RR, and Criqui MH, “Ethnic-specific prevalence of peripheral arterial disease in the United States,” Amer. J. Preventive Med, vol. 32, no. 4, pp. 328–333, Apr. 2007, doi: 10.1016/j.amepre.2006.12.010. [DOI] [PubMed] [Google Scholar]
  • [2].Fowkes FGR, Rudan D, Rudan P, Aboyans P, Denenberg J, and Dermott PM, “Comparison of global estimates of prevalence and risk factors for peripheral artery disease in 2000 and 2010: A systematic review and analysis,” Lancet, vol. 382, no. 9901, pp. 1329–1340, 2013, doi: 10.1016/S0140-6736(13)61249-0. [DOI] [PubMed] [Google Scholar]
  • [3].Hirsch A, Criqui M, Treat-Jacobson D, and Regensteiner J, “Peripheral arterial disease detection, awareness, and treatment in primary care,” J. Amer. Med. Assoc, vol. 286, no. 11, pp. 1317–1324, 2001, doi: 10.1001/jama.286.11.1317. [DOI] [PubMed] [Google Scholar]
  • [4].Guthaner DF, Wexler L, Enzmann DR, Riederer SJ, Keyes GS, Collins WF, and Brody WR, “Evaluation of peripheral vascular disease using digital subtraction angiography,” Radiology, vol. 147, no. 2, pp. 393–398, May 1983, doi: 10.1148/radiology.147.2.6340157. [DOI] [PubMed] [Google Scholar]
  • [5].Cavallo AU, Koktzoglou I, Edelman RR, Gilkeson R, Mihai G, Shin T, and Rajagopalan S, “Noncontrast magnetic resonance angiography for the diagnosis of peripheral vascular disease,” Circulat., Cardiovascular Imag, vol. 12, no. 5, May 2019, Art. no. e008844, doi: 10.1161/CIRCIMAGING.118.008844. [DOI] [PubMed] [Google Scholar]
  • [6].Romano M, Mainenti PP, and Imbriaco M, “Multidetector row CT angiography of the abdominal aorta and lower extremities in patients with peripheral arterial occlusive disease: Diagnostic accuracy and interobserver agreement,” Eur. J. Radiol, vol. 50, no. 3, pp. 303–308, 2004, doi: 10.1016/S0720-048X(03)00118-9. [DOI] [PubMed] [Google Scholar]
  • [7].Nelson MR, Quinn S, Winzenberg TM, Howes F, Shiel L, and Reid CM, “Ankle-brachial index determination and peripheral arterial disease diagnosis by an oscillometric blood pressure device in primary care: Validation and diagnostic accuracy study,” BMJ Open, vol. 2, no. 5, pp. 1–6, 2012, doi: 10.1136/bmjopen-2012-001689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Ghasemi Z, Lee JC, Kim C-S, Cheng H-M, Sung S-H, Chen C-H, Mukkamala R, and Hahn J-O, “Estimation of cardiovascular risk predictors from non-invasively measured diametric pulse volume waveforms via multiple measurement information fusion,” Sci. Rep, vol. 8, no. 1, Dec. 2018, Art. no. 10433, doi: 10.1038/s41598-018-28604-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Ebrahimi S, Jason N, Sean PCM, and Hahn MJ, “Model-based cardiovascular disease diagnosis: A preliminary in-silico study,” Biomech. Model. Mechanobiol, vol. 16, no. 2, pp. 549–560, 2017, doi: 10.1007/s10237-016-0836-8. [DOI] [PubMed] [Google Scholar]
  • [10].Xiao H, Avolio A, and Huang D, “A novel method of artery stenosis diagnosis using transfer function and support vector machine based on transmission line model: A numerical simulation and validation study,” Comput. Methods Programs Biomed, vol. 129, pp. 71–81, Jun. 2016, doi: 10.1016/j.cmpb.2016.03.005. [DOI] [PubMed] [Google Scholar]
  • [11].Kim S, Hahn J-O, and Youn BD, “Detection and severity assessment of peripheral occlusive artery disease via deep learning analysis of arterial pulse waveforms: Proof-of-concept and potential challenges,” Frontiers Bioeng. Biotechnol, vol. 8, p. 720, Jun. 2020, doi: 10.3389/fbioe.2020.00720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Mukkamala R, Hahn J-O, Inan OT, Mestha LK, Kim C-S, Toreyin H, and Kyal S, “Toward ubiquitous blood pressure monitoring via pulse transit time: Theory and practice,” IEEE Trans. Biomed. Eng, vol. 62, no. 8, pp. 1879–1901, Aug. 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Ganin Y, “Domain-adversarial training of neural networks,” J. Mach. Learn. Res, vol. 17, no. 1, pp. 2030–2096, 2016. [Google Scholar]
  • [14].Huh M, Agrawal P, and Efros AA, “What makes imagenet good for transfer learning?” 2016, arXiv:1608.08614. [Online]. Available: http://arxiv.org/abs/1608.08614
  • [15].Pan SJ and Yang Q, “A survey on transfer learning,” IEEE Trans. Knowl. Data Eng, vol. 22, no. 10, pp. 1345–1359, Oct. 2010, doi: 10.1109/TKDE.2009.191. [DOI] [Google Scholar]
  • [16].Ghafoorian M, “Transfer learning for domain adaptation in MRI: Application in brain lesion segmentation,” in Proc. Int. Conf. Med. Image Comput. Comput.-Assist. Intervent Feb. 2017, pp. 516–524, doi: 10.1007/978-3-319-66179-7_59. [DOI] [Google Scholar]
  • [17].Liang G and Zheng L, “A transfer learning method with deep residual network for pediatric pneumonia diagnosis,” Comput. Methods Programs Biomed, vol. 187, Apr. 2020, Art. no. 104964, doi: 10.1016/j.cmpb.2019.06.023. [DOI] [PubMed] [Google Scholar]
  • [18].Samala RK, Chan HP, Hadjiiski L, Helvie MA, and Richter CD, “Breast cancer diagnosis in digital breast tomosynthesis: Effects of training sample size on multi-stage transfer learning using deep neural nets,” IEEE Trans. Med. Imag, vol. 38, no. 3, pp. 686–696, Mar. 2019, doi: 10.1109/TMI.2018.2870343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Girosi F, Jones M, Poggio T, Girosi F, Jones M, and Poggio T. (1995). Communicated by Vladimir Vapnik Regularization Theory and Neural Networks Architectures. [Online]. Available: http://direct.mit.edu/neco/article-pdf/7/2/219/812917/neco.1995.7.2.219.pdf
  • [20].Li H, Pan SJ, Wang S, and Kot AC, “Domain generalization with adversarial feature learning,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit, 2018, pp. 5400–5409. [Google Scholar]
  • [21].Wu D, Xia S-T, and Wang Y, “Adversarial weight perturbation helps robust generalization,” 2020, arXiv:2004.05884. [Online]. Available: http://arxiv.org/abs/2004.05884
  • [22].Han T, Li YF, and Qian M, “A hybrid generalization network for intelligent fault diagnosis of rotating machinery under unseen working conditions,” IEEE Trans. Instrum. Meas, vol. 70, 2021, Art. no. 3520011, doi: 10.1109/TIM.2021.3088489. [DOI] [Google Scholar]
  • [23].Zhang Y and Yang Q, “A survey on multi-task learning,” 2017, arXiv:1707.08114. [Online]. Available: http://arxiv.org/abs/1707.08114
  • [24].Ben-David S, Blitzer J, Crammer K, and Pereira F, “Analysis of representations for domain adaptation,” in Proc. Adv. Neural Inf. Process. Syst, 2006, p. 137. [Google Scholar]
  • [25].Ben-David S, Blitzer J, Crammer K, Kulesza A, Pereira F, and Vaughan JW, “A theory of learning from different domains,” Mach. Learn, vol. 79, nos. 1–2, pp. 151–175, May 2010, doi: 10.1007/s10994-009-5152-4. [DOI] [Google Scholar]
  • [26].Krizhevsky A, Sutskever I, and Hinton GE, “ImageNet classification with deep convolutional neural networks,” Commun. ACM, vol. 60, no. 6, pp. 84–90, May 2017, doi: 10.1145/3065386. [DOI] [Google Scholar]
  • [27].He W, Xiao H, and Liu X, “Numerical simulation of human systemic arterial hemodynamics based on a transmission line model and recursive algorithm,” J. Mech. Med. Biol, vol. 12, no. 1, Mar. 2012, Art. no. 1250020, doi: 10.1142/S0219519411004587. [DOI] [Google Scholar]
  • [28].Xiao H, Avolio A, and Zhao M, “Modeling and hemodynamic simulation of human arterial stenosis via transmission line model,” J. Mech. Med. Biol, vol. 16, no. 5, Aug. 2016, Art. no. 1650067, doi: 10.1142/S0219519416500676. [DOI] [Google Scholar]
  • [29].Xiao H, Tan I, Butlin M, Li D, and Avolio AP, “Arterial viscoelasticity: Role in the dependency of pulse wave velocity on heart rate in conduit arteries,” Amer. J. Physiol.-Heart Circulat. Physiol, vol. 312, no. 6, pp. H1185–H1194, Jun. 2017, doi: 10.1152/ajpheart.00849.2016. [DOI] [PubMed] [Google Scholar]
  • [30].The Reference Values for Arterial Stiffness’ Collaboration, “Determinants of pulse wave velocity in healthy people and in the presence of cardiovascular risk factors: Establishing normal and reference values,” Eur. Heart J, vol. 31, no. 19, pp. 2338–2350, Oct. 2010, doi: 10.1093/eurheartj/ehq165. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from IEEE access : practical innovations, open solutions are provided here courtesy of Health Research Alliance manuscript submission

RESOURCES