Abstract
Recent developments in cardiovascular modelling allow us to simulate blood flow in an entire human body. Such model can also be used to create databases of virtual subjects, with sizes limited only by computational resources. In this work, we study if it is possible to estimate cardiovascular health indices using machine learning approaches. In particular, we carry out theoretical assessment of estimating aortic pulse wave velocity, diastolic and systolic blood pressure and stroke volume using pulse transit/arrival timings derived from photopletyshmography signals. For predictions, we train Gaussian process regression using a database of virtual subjects generated with a cardiovascular simulator. Simulated results provides theoretical assessment of accuracy for predictions of the health indices. For instance, aortic pulse wave velocity can be estimated with a high accuracy (r > 0.9) when photopletyshmography is measured from left carotid artery using a combination of foot-to-foot pulse transmit time and peak location derived for the predictions. Similar accuracy can be reached for diastolic blood pressure, but predictions of systolic blood pressure are less accurate (r > 0.75) and the stroke volume predictions are mostly contributed by heart rate.
Author summary
Recently there has been a strong trend for self-monitoring of your cardiovascular health and new wearable sport trackers and mobile applications are coming to the market everyday. However, such solutions are mostly taking advantage of heart rate measurement. Other health indices such as blood pressure and pulse wave velocity reflecting to the condition of cardiovascular system would also be of great interest, but such solutions for continuous monitoring are barely existing or are at least unreliable. In this paper, we use computational modelling to assess theoretical capabilities of such measurements. We concentrate on predicting health indices using on pulse transmit time type of measurements. Such measurements could be carried out, for example, with photopletyshmography sensor or an optical sensor already found from several wearable sport trackers. We use cardiovascular modelling to create a database of “virtual subjects”, which is applied with machine learning to construct predictors for health indices. Our findings suggest that aortic pulse wave velocity and diastolic blood pressured could be predicted with a high accuracy, but predictions of systolic blood pressure are less accurate.
Introduction
This paper considers continuous monitoring of cardiac health using computational modelling. Stiffening of the arterial wall, such as aorta, causes reduction in the pulsatile properties in the vascular tree, accelerates the vascular premature ageing and predisposes to the dysfunction of the heart, brain and other organs [1, 2]. Aortic stiffness can be measured by using invasive methods or medical imaging such as ultrasound [3] and MRI [2]. Another indicator reflecting the cardiac performance is stroke volume (SV), which is typically measured using Doppler ultrasound [4]. However, these imaging modalities typically require special expertise and are only carried out clinically. On the other hand, aortic stiffness is associated with the unfavourable changes in the diastolic and systolic blood pressures (DBP/SBP), which can have several negative consequences in cardiac function and structure [1]. Ambulatory home measurements of DBP and SBP use the techniques based on inflated cuffs, but continuous recording is still cumbersome. It would be helpful to find unobtrusive methods for the long-term monitoring of these cardiac indices during the daily activities and sleep.
Arterial stiffness is often assessed by measuring pulse wave velocity (PWV), which is increased in stiffer arteries. The PWV can be estimated by measuring arrivals of pulse waves at two arterial sites:
The travel time is commonly referred as pulse transit time (PTT). Arrival of the pulse wave to distal arterial sites can be easily measured by using a photoplethysmogram (PPG), which is an optical non-invasive sensor that can be placed, for example, in a wearable device [5]. On the other hand, in order to predict aortic stiffness reliably, the first arterial site should be located at the beginning of aorta (for measurement of aortic valve opening). However, a measurement of valve opening can require a device such as phonocardiograph, ultrasound or MRI.
To overcome this difficulty, PTT is often approximated using pulse arrival time (PAT) which uses the R- wave of electrocardiogram (ECG) as a reference timing [6]. However, there exists controversy in the clinical accuracy of using PAT in the predictions due to variations in pre-ejection period (PEP) from the R-wave to aortic valve opening [7, 8]. An alternative approach is to approximate the reference with a measurement from another distal site near aorta. For example, the gold standard for aortic PWV measurement is to measure differences of pulse arrivals to carotid and femoral arteries.
The estimation of blood pressure from arrival of pulse waves has also been largely studied; see e.g. [6, 9, 10]. Although promising results have been reported, clinical use of these techniques is still limited. Haemodynamic alterations can have significant effects on the accuracy [11].
A common problem with the clinical use of the above methodologies is that the development and validation of the methods typically require a large set of measurements from real human subjects with sufficient variety. Such data collection can be a very difficult and expensive task.
A preliminary assessment of the methods without extensive data collection can be carried out using simulators. For example, Willemet et al [12, 13] proposed approach to use cardiovascular simulator for generation of a database of “virtual subjects” with sizes limited only by computational resources. In their study, the databases were generated using one-dimensional (1D) model of wave propagation in a artery network comprising of largest human arteries [14]. Such 1D models provide computationally efficient way to simulate blood circulation and are also used in several other applications [15]. There are also studies validating 1D simulations against real measurement [16–18]. The virtual database approach was used to assess accuracy of pulse wave velocity measurements for estimation of aortic stiffness [12] and the accuracy of pulse wave analysis algorithms [13].
The aim of our study is to assess theoretical limitations for the prediction of aortic pulse wave velocity (aPWV), blood pressures (DBP/SBP) and SV from PTT/PAT measurements. We apply a similar virtual database approach to find correlations between these cardiac indices and PTT/PAT timings measured from different locations. In particular, we train Gaussian process regressor to predict the cardiac indices using different combinations of PTT and PAT measurements. The regressor model is trained using a large set of virtual subjects generated using 1D cardiovascular simulator, and the results are validated using another set of virtual subjects. The result of study can give preliminary implications for the accuracy of such predictions in rather ideal circumstances.
Our study is based on the 1D haemodynamic model of entire adult circulations introduced by Mynard and Smolich [19]. It includes heart functions and all larger arteries and veins for both systemic and pulmonary circulation. As heart is included to the model, it can also simulate variations in PEP that are essential in the comparison of PTT and PAT timings.
This paper is organized as follows. Cardiovascular model, generation of virtual subjects and prediction methods are described in Methods and Models section. Results section contains numerical experiments. We will finish with Discussion.
Methods and models
In this section, we begin with a short summary of cardiovascular model and present its numerical discretization.We will also describe the generation of the database of virtual subjects and the computation of Gaussian process predictions.
Blood circulation model
The blood circulation model is based on the 1D haemodynamic model described in [19], which basically extends commonly used 1D wave dynamics model (see e.g. [14]) with heart functions and realistic arteria and venous networks including pulmonary and coronary circulations. The components of the model are shortly summarized below, see [19] for more details.
One-dimensional wave dynamics
Human arterial network is illustrated in Fig 1(a). In 1D modelling, the arterial system is divided into segments (e.g. from aortic root to the branching point of brachiochephalic artery; see e.g. [14, 19]). Each segment is assumed to be a straight compliant tube with the length L. The circular cross-sectional area A(x, t) and the velocity profile U(x, t) are assumed to depend on time t and a single axial coordinate x ∈ [0, L]. To radial direction, the velocity profile is assumed to be axisymmetric and flat which agrees relatively well to experimental data (see e.g. [16]). The governing (nonlinear) equations can be written as [14, 19],
(1) |
(2) |
where p is the pressure, ρ and μ are the density and viscosity of blood, and f is the frictional force. With the axisymmetric and flat velocity profile, the frictional force can be written as f = −22μπU [14].
The pressure-area relationship is written as [19, 20]
(3) |
where A0, P0 and c0 are the cross-sectional area, the pressure and the wave speed at a reference state. We have omitted the wall-viscosity in this study since the treatment of the viscosity would result in significantly higher demands in numerical discretization (remind that our aim is to run the model repeatedly). We choose b = 1 which corresponding to the pressure law used in Alastruey’s model [14, 16]. In Mynard et al [19, 20], the constant b was specified as where Pcollapse is the collapse pressure. However, in our experiments, this choice led to very steep raises in pressures during systolic period due to omitted viscosity.
Heart and valves
The anatomy of heart and blood circulation through heart are illustrated in Fig 1(b). The blood flow through atriums (LA/RA) and ventricles (LV/RV) is modelled using a lumped parameter model introduced in [20], which was extended to include interactions between heart chambers and pericardiac pressure in [19]. The model is illustrated in Fig 2(a).
The relationship between the flow through valves (q) and the transvalvular pressure difference Δp (= pin − pout) is given by the Bernoulli equation,
(4) |
where the Bernoulli resistance Bav and the blood inertance Lav are
(5) |
where Aeff and leff are the effective valve orifice area and length. The valve dynamics are modelled using a state variable ξ which represents the state of the valve (0 ≤ ξ ≤ 1, ξ = 0 for closed, ξ = 1 for open) such that Aeff(t) = (Aeff,max − Aeff,min)ξ(t) + Aeff,min. Valve dynamics are modelled by
(6) |
where Kvo and Kvc are rate coefficients for the valve opening and closing, respectively.
The relationship between the pressure p and the volume V of a heart chamber is given by
(7) |
where ppc is the pericardiac pressure (assumed to depend exponentially on the total chamber volumes; see [19]), Enat is the native elastance of the chamber, Esep is the septal elastance, Vp = 0 is the volume of the chamber in zero pressure, Rs is the source resistance, and p* is the pressure in the contralateral chamber. The native elastance of a chamber is given by
(8) |
where Efw is the freewall elastance of the chamber and μ is the atrioventricular plane piston constant. The time varying freewall elastances for each chamber are modelled by
(9) |
and k is the scaling constant chosen such that . The functional properties of heart are specified via the maximum and minimum free wall elastances (), the timing parameters τ1, τ2 and tonset and the slope parameters m1 and m2. For example, increasing increases the contraction of the heart and the length of the pulse can be adjusted through τ1 and τ2. Fig 2(b) shows an example of the form of Efw.
Vascular beds
Mynard and Smolich [19] describe models for circulation through three types of vascular beds (Fig 3): generic vascular beds, a hepatic vascular bed and coronary vascular beds. The generic vascular bed model (Fig 3(a)) is used for all microvasculature beds except the liver and myocardium. It is based on commonly used three-element Windkessel model and consists of the characteristic impedances Zart and Zven(to couple the connecting 1D arteries to the vascular bed), lumped compliances for the arterial and venous microvasculature (Cart and Cven) and the vascular bed resistance Rvb. The resistance is assumed to be pressure dependent to account for the fact that the atriovenous pressure difference remains positive even with zero vascular bed flow:
(10) |
where ptm = p − pext is the transmural pressure, Pzf is the zero-flow pressure and R0 is the reference resistance.
The hepatic vascular bed (Fig 3(b)) is a modification of the above to account for both arterial and venous inlets in liver. It includes a compartment for the flow from hepatic artery (Rart, Cart) which connects to another compartment (Cp/a) with common portal/arterial pressure.
The coronary vascular bed model (Fig 3) represents blood flow through intramyocardial. The coronary vessels experience a large time-varying myocardial pressures pim caused by the contracting heart muscle. To model depth-wise myocardial pressure, the model includes three layers representing subendocardium, midwall and subepicardium, each layer having three non-linear resistances R1, Rm and R2:
(11) |
where the blood volumes V1 and V2 are are given by
(12) |
The intramyocardial pressures pim is assumed to be the sum of pressure transmitted from the ventricular cavity into the heart muscle and pressure generated mechanically by the thickening heart muscle. See [19] for details.
Numerical solution of the cardiovascular model
Our numerical solution of the wave propagation model is based on the discontinuous Galerkin (DG) method. The derivation of the DG solution for the 1D wave model (1)–(2) is described with details e.g. in [14], and therefore it is only briefly summarized here. We will give more details about the treatment of heart chambers, valves and vascular beds as the numerical treatment differs from [19] due to the different numerical scheme.
The Eqs (1)–(2) can be written in a conservative form as [14]
(13) |
where F is called the flux term. As in the standard finite element method (FEM), each (arterial or venous) 1D segment [0, L] is divided into non-overlapping elements Ωe. In addition, (13) is multiplied with a (vector valued) test function ψ and integrated over the segment. Then the integration by parts gives
where (u, v)Ω = ∫Ω u ⋅ v dx is the standard L2(Ω) inner product. For a numerical solution, U and ψ are approximated which piecewise polynomial vector functions Uδ and ψδ. However, contrary to the standard FEM, the approximation Uδ is not enforced to be continuous across the element boundaries. Another application of the integration by parts gives
(14) |
where the term F* is the (approximative) flux function (determined below). The flux F* is responsible of propagating information through the elements interfaces and is also the key element in the specification of the boundary conditions for the 1D blood vessel segments.
In order to apply a numerical integration scheme for temporal discretization, we need to find such that . As in the standard FEM, this corresponds to finding the coefficients of the approximation of such that (14) is satisfied for a chosen set of test functions. However, since the approximation is discontinuous in DG, the coefficients can be solved separately for each element. The problem is further simplified by using Legendre polynomials as the basis functions of the approximation and test functions, which allows us to treat each basis function separately due to L2-orthogonality.
For the numerical integration, the second-order Adams-Bashforth time integration scheme is used; see [14] for more details.
Characteristic analysis and the flux F*
The determination of the flux F* and numerical boundary conditions is based on the Riemann’s method of characteristics. The characteristic functions (or Riemann’s variables) of the system (13) can be written as (see [14] for the derivation)
(15) |
where the subscripts f and b refer to information moving to forward and backward directions, respectively, and c is the wave speed (local PWV),
(16) |
When considering the pressure-area relationship (3),
(17) |
The fluxes F* at the interfaces of elements are calculated as a solution of a Riemann problem with suitable boundary conditions, see e.g. [14]. The procedure involves finding a unique state (A*, U*) such that
(18) |
where the subscript L and R refer to the value of A and U on the left or the right side of the boundary of the element, respectively. The flux is then given as F* = F(A*, U*).
The boundary conditions for the 1D blood vessel segments are handled similarly by finding a state (A*, U*) satisfying conditions similar to (18). Treatment of the boundary conditions related to splitting and merging arteries/veins is presented in [14]. Treatment of the boundary conditions related to the heart, valve and vascular beds is presented below.
Numerical model for heart chambers
We consider left heart (right heart is handled similarly). The Trapezoidal rule applied to the net flow arriving to LV gives (see Fig 1(b))
(19) |
where the superscript n refers to the n ’th temporal discretization point and Δt is the time step. The above equation can be substituted to (7) to give
(20) |
where .
The output of LV is connected to the inlet of lvot-segment; see Fig 1(b). At the inlet of lvot, we have
(21) |
where and are the DG approximations at the inlet. Since ,
(22) |
where . Plugging in (22) and to (20) gives an equation from which A* can be solved using Newton’s method. Finally, U* can be solved from (21) and and are obtained during the iteration.
The atriums have multiple vein connections; see Fig 1(b). Let and be the DG approximations at the outlet of the j’th connecting 1D-segment (j = 1, …, J). We can write , and further
(23) |
where and ψj is the function (17) with the parameters A0 and c0 corresponding to the outlet of j’th segment. Then similarly as above, we can obtain a group of J equations from which can be simultaneously solved using Newton’s method. However, the multi-dimensional problem can be avoided by noticing that the pressure-area relationship (3) can be inverted easily (i.e. we can find A = A(p)). Then it is equivalent to solve p from the one-dimensional problem
(24) |
where is given by (23) with , where the subscript j refers to the mapping in which the parameters A0, c0 and b in (3) are specified for at the outlet of the j’th segment.
Valves
The application of the forward Euler method to (4) gives
(25) |
The Eq (6) is discretized similarly. For MV and TV, the transvalvular pressure is the pressure difference between artium and ventricle (e.g. for MV).
PV and AV are between 1D segments (e.g. AV is between lvot and the first segment of aorta, see Fig 1(b)). For the outlet of the ventricular outflow tracks, we specify the outflow condition (e.g. ). For the inlet of the 1D segments behind the valve, we specify the inflow to be . These inflow and outflow boundary conditions can be treated similarly as above by finding the states (A*, U*); see e.g. [14] for details. Then the pressures on the both sides of the valve can be computed using the states A* and the pressure-area relationship (3).
Vascular beds
We consider the generic vascular bed model (Fig 3(a)). Arterial and venous flows qart and qven in the generic vascular bed model (sums of all flows from/to 1D-segments) are given by
(26) |
The forward Euler method gives
(27) |
The capillary flows (flow through Rvb) are calculated using Ohm’s law.
Vascular beds are connected to the 1D model as the terminal resistance boundary condition similarly as in [14]. For example, we consider coupling of a 1D-arterial segment to the generic vascular bed model (Fig 3(a)). The flow q though impedance Zart is given by Ohm’s law . We need to find (A*, U*) such that
(28) |
The states A* and U* can be solved by combining the equations as above and applying Newton’s method. Then is the sum of flows from all 1D-outlets (A*U*).
Portal and coronary models in Fig 3(b) and 3(c) can be treated similarly.
Virtual database
The database is created by running the cardiovascular model repeatedly. The model parameters are varied to reflect variations between individual (virtual) subjects.
In [12, 13], the seven parameters were varied: elastic artery PWV, muscular artery PWV, the diameter of elastic arteries, the diameter of muscular arteries, heart rate (HR), SV and peripheral vascular resistance. In their study, the parameters were varied by specifying a few possible values for each parameter and the cardiovascular model was run for all of the resulting 7776 combinations. However, in our study, the cardiovascular model has significantly more model parameters (e.g. parameters related to heart model and valves, vascular beds, …). Such systematic variation of all essential parameters would lead to excessively large number of combinations.
In this study, we choose “sampling” approach in which the model parameters are varied randomly. Our aim is to choose random variations that would represent healthy subject and, where applicable, the range of the parameters is of similar range as in [12]. Some choices can be rather subjective due to the limited amount of (probabilistic) information from related physiological quantities. Our goal is to choose variations to be wide enough so that “real world” can be considered as a subset of the population covered by the variations. However, if more sufficient information about parameters becomes available, it should be rather straightforward to carry out the analysis with the adjusted distributions.
In the following, the superscript (s) refers to a virtual subject for which the parameters are specified. The overbar notation (e.g. ) refers to the values used in [19] (the baseline). Unless otherwise mentioned, the variations are chosen to be normally distributed. Furthermore, the statements such as 10% relative variation should be understood in terms of standard deviations instead explicit ranges of the parameter. We use slightly unconventional notation to denote the Gaussian distribution with mean μ and the standard deviation σ = X/100μ (i.e. X% variation relative to the mean/baseline). The uniform distribution is denoted as .
Vascular networks
The arterial and venous network structure is chosen to be same as in [19] (the length L and A0 at the inlet and outlet for each 1D segment are given in their supplementary materials). To include individual variations of subjects, the lengths are chosen as
(29) |
where the subscript ℓ refers to the ℓth segment. The multiplier a(s) can be understood as a variation in the height of subject total length and represents individual variations of blood vessel segments. With these choices, for example, distances from aortic root to the measurement locations (see below) are 17.0 ± 1.8 cm (left carotid artery) and 88.9 ± 9.1 cm (femoral artery) which are similar to the distances reported in [21, 22]. The arterial diameters D0() are also varied similarly, except we use separate common multipliers a(s) for aorta (20% variation) and rest of segments (10% variation).
The elasticity E of blood vessels is controlled by the reference wave speed (PWV), which can be expressed using the empirical formula [23]
(30) |
where r0 is the reference radius, h is the thickness of the wall and k1, k2 and k3 are empirical constants. Elasticity of systemic arteries, especially aorta, have largest effect to the condition of the cardiovascular system (increased significantly during ageing). Therefore, aorta and other systemic arteries are chosen to include largest variations:
where , and . The coefficient α(s) produces 25% variation to the PWV of systemic arteries which, for aorta, is further amplified with β(s) giving 65% maximum variation. The slope k2 is also varied with 5% variation. To produce small variation between segments, additional 1% variation is added to the local PWV (c0) of each segment.
Heart functions and valve model parameter
The duration of the pulses Tc are chosen as follows. For each subject, HR is drawn from , which is rejected if HR < 50 min−1 to avoid too low heart rates. For normal sinus rhythm, pulse lengths Tc are shown to follow the distribution of a (correlated) pink noise [24]. Therefore, Tc are chosen to be realizations of pink noise with the mean 60/HR and the variance σ2, which varies among the subjects ().
To consider variations in heart pumping, we vary and τ1 and τ2 randomly. For each pulse, we choose
(31) |
where represents the level of variations in heart muscle contraction between pulses, which is modelled to vary between subjects. The valve model parameters Aeff,max, Aeff,min, ℓeff,min, Kvo, and Kvc are varied with 10% variation.
Vascular beds
Microvasculature compliances (C) and the reference capillary resistances (R0 or R0,m) are chosen as and . The mean resistance is increased slightly to provide higher, physiologically more relevant diastolic and systolic pressure levels. For coronary vascular beds (see Fig 3), the resistances R1 and R2 and the initial volumes V0,1 and V0,2 are perturbed with 10% variation.
Generation of the virtual database
We generate two datasets: the first is used to train predictors (training set), and another for the validation of predictions (test set). The generation of the training set is described first.
The model is run repeatedly for the parameter variations described above. The initial state for the solution and the model parameters not specified above are set as in [19]. The 1D-model is discretized using varying number of elements in each segment (Nel = ⌈0.5L ⌉ where L is the length of the segment) and the 3rd/2nd order (arteries/veins) Legendre polynomials. The time stepping for temporal discretization is chosen to be Δt = 2 ⋅ 10−6 s. The level of discretization is experimentally verified to result sufficiently small discretization error (compared to a very dense discretization). We simulate 11 heart cycles to ensure that the simulation has been converged (e.g. the dependency to the initial condition is negligible) and the last pulse of each run is used in the analysis. The model is run 9986 times. However, we noticed that similar results can also be achieved with significantly less samples (e.g. 1000) and therefore we can assume that the size of database is sufficient.
To ensure that simulations represent physiologically reasonable solutions, the filtering criteria used in [12, 13] are also applied here: a simulation is accepted only if 1) DBP at the brachial arteries are higher than 40 mmHg, 2) SBP at the brachial arteries are lower than 200 mmHg, the pulse pressures (SBP—DBP) at the brachial arteries are between 25-100 mmHg, 4), the reflection coefficient of the aortic-iliac bifurcation satisfies |Rf| ≤ 0.3. The reflection coefficient is calculated as
(32) |
where the characteristic admittances Y = Ad/(ρcd) (the subscript d refers to diastole) are for the distal abdominal aorta (Yabd) and the proximal common iliac arteries (Yil,left, Yil,right).
Out of the 9986 cases, 5222 samples are accepted after applying the above filtering criteria. Out of the rejected samples, 4543 have too small or large reflection coefficient, 70 have to too small diastolic BP, 9 have too large systolic pressure, and pulse pressure is too large for 1115 samples. The large portion of rejected samples due to insufficient reflection constants can perhaps be avoided if more precise information about spatial variations of arterial diameters and stiffness would be available.
The test set is generated similarly, but with a denser discretization (Δt = 0.5 ⋅ 10−6 s, 4th/3rd order Legendre polynomials for arteries/veins). This dataset comprises of 943 virtual subjects (1792 before filtering). The training and test set have their own unique virtual patients without overlap.
Simulated PPG signal and calculation of PTT/PATs
In this study, we consider predictions based on pulse transit and arrival timings derived from simulated PPG signal. The measurement locations (xobs) considered in this work are listed in Table 1. PPG signal can be understood as a differential measurement of blood volume under the sensor. If we assume that longitudinal variations in the blood veins are negligible, the blood volume can be assumed to be proportional to A (xobs, t). Therefore PPG signal is simulated by removing the scale information:
(33) |
where Amin and Amax are the minimum and maximum of A (xobs, t) over a period of time. We, however, note that the scale does matter when considering PTT/PAT timings.
Table 1. The sensor locations considered in this work.
Arteria | Abbreviation | xobs |
---|---|---|
Left common carotid artery | LCA | 0.65L |
Right common carotid artery | RCA | 0.6L |
Left/right radialis artery | LRad/RRad | 0.9L |
Right femoral artery | Fem | 0.5L |
Arrival of the pulse can be detected as a valley at the beginning of systolic period when pressure p (xobs, t) starts increasing (foot-to-toot PTT; PTTff). Other timings can also be considered: the peak (maximum; PTTp), the steepest raise (the maximum of the derivate; PTTD), and the location of the dicrotic notch (DAT); see Fig 4. DAT can be detected as the peak in the second derivate during the diastolic period.
The pulse transit times are relative to aortic valve opening which can be easily detected from simulations: we detect a valley in the simulated pressure p (x, t) at aortic root (the inlet of the 1D segment connecting to aortic valve). For pulse arrival times, simulated R-wave locations can chosen to be the initiation of the pulse (foot) in the prescribed Efw for LV.
We note that our simplified PPG signal model does not take into account phenomena such as optical scattering which can induce nonlinear effects to pulse waveform. However, we use PPG signal only to infer timings in the pulse and therefore possible nonlinearities do not have significant effects to results as long as foots, peaks and notches can be estimated accurately. Furthermore, we note that other measurement modalities measuring volume/area of the artery (e.g. ultrasound) can also be considered.
Extraction of aPWV, DBP, SBP and SV
Thea aim is to predict aPWV, DBP, SBP or SV using the combination of PTT/PAT times and/or HR (input). These can be extracted from simulated pulses as follows.
aPWV: the wave speed c (t, x) given by (16) averaged over a pulse (integrated numerically). The location x is chosen to be the center point of the segment of aorta between the branching points of brachiocephalic artery and LCA).
DBP, SBP: the minimum and maximum value of p (x, t) at the aortic root (the inlet of the 1D segment connecting to aortic valve)
SV: the integral of flow q = AU at the aortic root over the pulse (calculated numerically).
There are also other options to specify aortic PWV. For example, we can use the foot-to-foot aortic PWV by detecting arrivals of pulses to aortic root and the aortic-iliac bifurcation, but this leads only to very minor differences in the results (the Pearson correlation for between these aPWVs is r > 0.99). Relationships between these different options are studied in [12].
The distributions of selected metrics of the generated virtual database are shown in Fig 5. As a general finding, we note that there are strong correlations between DAT and pulse length (1/HR) signals (Pearson correlation r = 0.96 − 0.98). Due to this strong correlation, using HR and DAT as input provides very similar predictions (which can also be seen in the results below).
Gaussian process model for predictions
We apply Gaussian process regression for the computation of predictors. GPs are widely used, for example, in machine learning, hydrogeology and analysis of computer experiments (e.g. see [25–27]). GPs also provide flexible predictors that can handle non-linear relationship between input data and the response variable as well as uncertainties in the data. However, we note that any other class of regressions capable of nonlinear relationships can also be used for the analysis. For example, similar results can be achieved with multivariate adaptive regression splines [28].
A GP is a stochastic process f (z) () such that f (z1), …, f (zn) is a multivariate Gaussian random variable for all combinations of z1, …, zn. It can be described by the specifying mean function μ (z) = E(f (z)) and the covariance function k (z, z′) = cov(f (z), f (z′)). For more details, see e.g. [25].
Consider a case in which the inputs z are a vector of PTT or PATs and possibly HR and y is the response variable (aPWV, DBP, SBP or SV). We model the response variables as
(34) |
where h (z) is a vector of (deterministic) basis functions, β is a vector of basis function coefficients, f (z) is a GP with zero mean and covariance function k (z, z′), and ϵ is an Gaussian white noise. The first term represents mean behavior of the GP model. The GP term models non-linear relationship between input data and the response variable as well as correlated uncertainties in the data.
Training data comprises of input-output pairs {(zi, yi); i = 1, …, N }. We assume that yi’s are output of the above model i.e. yi = y (z1). Furthermore, let be inputs for which we want to calculate predictions. Then Y = (y1, …, yN) and are both Gaussian and the conditional distribution of Y′ given Y is (see e.g. [25], Appendix A.2]),
(35) |
where μY and ΣY denotes the mean and covariance of Y and ΣYY′ is the cross-covariance of Y and Y′. The means and covariances can be calculated by pluggin in the model (34), which gives
(36) |
(37) |
where h (Z′) and k (Z′, Z) are shorthand notations for the vector and matrix with the components and , respectively. The above conditional mean gives us an prediction of Y′ with a confidence estimate given by the conditional covariance.
In this study, the covariance function are chosen to be Matern kernel function with ν = 3/2 with a separate length scales for each input parameter. This kernel function can be written as
(38) |
where σ2 is the variance and ℓm are the length scales for each input. We note that the choice of the kernel function does not have a large effect to the results as our sample size is large. For example, our experiments show that use of the squared exponential covariance function gives very similar results with differences of the same scale as the prediction uncertainty.
The predictors are computed using fitrgp function in MATLAB Machine Learning Toolbox which provides numerically efficient implementation for the GP regression. The basis functions h(z) are chosen to be linear. The fitrgp function also estimates hyperparameters by minimizing the negative loglikelihood,
(39) |
where . The optimization is carried out using a subset of observations to avoid high computational load. The parameters of fitrgp related to this hyperparameter optimization are chosen to be the default values.
Results
In this section, we apply GP regression to predict aPWV, DBP, SBP and SV using combinations of different type of PTT/PAT timings and HR as input. We train a GP predictor separately for each considered combination as described above. For validation, we apply the trained predictor to the test set and calculate Pearson correlation between the predictions and ground truth values. Tables A-H in S1 Appendix also report 95% confidence intervals (CI) for the Pearson correlations (BCa bootstrapping intervals [29]). Each table also highlights selected predictions with largest Pearson correlations. However, we note that the order of Pearson correlations should be considered as indicate rather than a definite order of performance due to the uncertainty especially when differences are small.
Predictions of aPWV
Fig 6 shows predictions of aPWV for a selected set of combinations when the measurement location is LCA. Table A in S1 Appendix summarizes the results for the complete set of combinations.
The results show that using PTTff or PTTD as a single input gives moderate accuracy and predictions using either HR, PTTp, or DAT are insufficient. Performance can be improved by combining multiple different timings. For example, the accuracy is significantly improved if both PTTff and PTTp are used for predictions (r = 0.90). Furthermore, including also DAT provides the accuracy of r = 0.94, and adding other timings does not significantly improve accuracy any further.
Measurements from RCA provide less accurate predictions (Table B in S1 Appendix): for example, the combination of PTTff, PTTp, PTTD and DAT provides one of highest accuracies for RCA (r = 0.79), but is still only moderate. Such results can be expected as pulse waves travel shorter distance in aorta and also travel through brachiocephalic artery (see Fig 1) inducing additional variations to the (average) wave speeds.
Performance of wrist measurements (LRad / RRad) are even worse (see Table C in S1 Appendix for LRad; results for RRad are similar). For example, the highest accuracy (r = 0.73) can be achieved with the combination of PTTff, PTTp, PTTD and DAT. This is also expected as relative large part of the arterial tree to these measurement locations are comprised of brachial and radialis arteries with their own variations to PWV. On the other hand, measurements from lower limb could provide better performance: for right femoral artery, we can achieve r = 0.75 using PTTff and r = 0.84 using PTTff, PTTp, PTTD and DAT (Table D in S1 Appendix). In this case, pulse travels though the whole aorta to reach these measurement locations.
As mentioned above, in practice, the R-peak location in ECG signal is often used as a surrogate to aortic valve opening due to simpler measurement. However, using PATs gives only mediocre accuracy compared to PTT due to the physiological variations in PEP [7, 8]. Our finding are similar, see for example, Fig A and Table E in S1 Appendix for LCA. The highest accuracy is r = 0.79 (e.g. PATff, PATp, PATD and HR) which is significantly worse compared to using PTTs.
Another approach to avoid measurement of aortic valve opening is to consider differences of pulse arrival times to two distal locations. Such setup also allows us to avoid the influence of PEP variations. Results for measurement between LCA and Fem can be seen in Fig B and Table F in S1 Appendix: difference of PTTff gives r = 0.76 which is slightly better than using normal PTTff measurement from Fem, but not as good as normal PTTff measurement from LCA. The highest accuracy (r = 0.87) can be obtained, for example, with PTTff, PTTp, PTTD and HR. The predictions of PWV that use the difference between LCA and RCA or the difference between LRad and RRad are less accurate (r ≈ 0.75 − 0.78 at best); see Tables G and H in S1 Appendix.
Predictions for blood pressure
Figs 7 and 8 show predictions for DBP and SBP for selected PTT time combinations when measurements are taken from LCA; see also Table A in S1 Appendix for all combinations. For DBP, predictions using PTTff as a single input achieves very low accuracy (r = 0.33). Significantly more accurate predictions can be achieved using HR (r = 0.85) or DAT (r = 0.86). For SBP, the performance of PTT based predictions is better but still quite low (r = 0.58 for PTTff and r = 0.60 for PTTp). Predictions can be improved by adding additional input timings. For DBP, combining PTTff with HR or DAT gives r = 0.92 and the highest accuracy r = 0.94 is obtained with PTTff, PTTp, PTTD and DAT. Additional input timings also improves performance of SDB predictions: PTTff and HR/DAT results in r = 0.735 and the highest accuracy is r = 0.75 (PTTff, PTTp, PTTD and DAT). Findings the other measurements locations are similar; see Tables B, C and D in S1 Appendix.
We also consider predictions from pulse arrival times (i.e. using R-peak as a reference timing). Compared to PTT times, the results are of mixed accuracy; see Table E for PAT measurements from LCA. For DBP, using PATff as single input yields insufficient predictions (r = 0.19), but PATp gives moderate accuracy (r = 0.67). Combinations of different PAT timings can even achieve higher accuracy than using PTTs: for example, r = 0.95 with PATff and DAT and r = 0.96 for PATff, PATp, PATD and DAT. For SBP, PATff provides slightly better accuracy compared to PTTff(r = 0.62), but otherwise results are similar.
As with aPWV, we consider differences of pulse transit/arrival times measured with two sensor. Measuring between LCA and Fem gives very similar performance to PTT measurements from LCA (Table F in S1 Appendix). However, other considered setups provide less accurate results: see Table G in S1 Appendix for differences between LCA and RCA measurements and Table H for differences between measurements from radialis arteries.
Prediction of SV
Results show that HR has largest contribution to the predictions of SV, meanwhile performance with pulse transit or arrival timings (without HR information) can only provide moderate accuracy at best. For example, Fig G and Table A in S1 Appendix show the predictions using measurements from LCA. Predictions with HR as a single input reaches r = 0.81, but predictions using PTTff or PTTD are insufficient estimates (r < 0.25) and predictions with PTTp are of moderate accuracy (r = 0.60). SV can be predicted with good accuracy with DAT, but this is due to the strong correlation between HR and DAT as mentioned above. Furthermore, significant improvements will not be achieved by combining several inputs. For example, highest accuracy is r = 0.83 which can be obtained, for example, with PTTff, PTTD and HR. Results are similar for all other measurement setups; see Tables B-H in S1 Appendix.
Discussion
This paper assessed theoretical limitations for the prediction of aortic pulse wave velocity (aPWV), DBP/SBP and SV from pulse transit and arrival time measurements. We applied a virtual database approach proposed by Willemet et al [12, 13] in which a cardiovascular simulator is used to generate a database of virtual subjects. In this work, we applied one-dimensional haemodynamic model by Mynard and Smolich [19] to construct a simulator for entire adult circulation. This simulator was used to generate a large database of synthetic blood circulations with varied physiological model parameters. The generated database was then used as training data for Gaussian process regressors. Finally, these trained regressors were applied to another synthetic database (test set) to assess capability of regressors to predict aPWV, SDB, DBP and SV using different combinations pulse transit/arrival time and HR measurements.
The results indicate that aPWV and DBP can be estimated from PPG signal with a high accuracy (Pearson correlation r > 0.9 between true and predicted values for measurement from left carotid artery) when, in addition to foot-to-foot PTT time, information about the peak and dicrotic notch location is also given as input to the predictor. The predictions of SDB were less accurate (r = 0.75 at best). For SV, accurate predictions were mostly based on heart rate, with only a very minor improvement in accuracy when PTT timings were also included as inputs.
As this was entirely in silico study, it is not guaranteed that the result can be applicable to the real world as is. However, the aim of the study was to give preliminary results about correlations between the cardiac indices and PTT/PAT timings and the applicability of such predictions. The hope is that the results could to be extended to real clinical applications in future research.
The limitations to be addressed in future are the following. First, the cardiovascular model has its limitations. Although previous studies have shown that similar cardiovascular models can be used to simulate human physiology relatively well [16–18], not all physiological phenomena are fully covered in the Mynard’s model. One example of such phenomenon is respiration. The effect of respiration can be important as the breathing and cardiac cycles are in a close interaction. Several physiological factors, such as the changes in the intrathoracic pressure and the variation in the interbeat intervals modulate the cardiac mechanics and blood outflow from the heart. Even the timing of the shorter cardiac cycles coupled with the longer respiratory cycles has effects on the central circulation. When we considering a healthy heart, the effects of respiration can perhaps be managed by interpreting different virtual subjects to represent inspiratory and expiratory phases of the breathing. Other phenomena that are not covered by the model are, for example, gravity and baroreceptors. Furthermore, lumped parameter models that are used for heart and vascular beds were relatively simple approximations. However, new analytical methods allow us to bridge the models and human bodily functions [30].
The chosen baselines and variations of the model parameters were chosen to represent healthy subject. The choices, however, can be subjective due to the limited amount of (probabilistic) information. Our attempt were to produce variations such that the virtual population covered by the chosen parameter variations includes real physiological variations. We, however, emphasize that the presented approach is not limited to the chosen parameters variations and it can be adjusted if more precise information becomes available.
Due to the limited phenomena covered by the model, the results may not be reliable when considering subjects with medical conditions. For example, the simplified heart model and variations of related model parameter may not present subjects with heart diseases.
In this study, we only considered pulse transit and arrival type of time information as the input to the predictor. Predictions could potentially be improved with other kinds of additional information. For example, aortic PWV predictions could be improved by using information about the distances between aorta and/or measurement points. Information about arterial path lengths could have been easily used in our simulation analysis, but in practice such information would require clinical measurements such as MRI [21, 22]. On the other hand, the arterial path length are often estimated using the body lengths or measuring distances of certain points in the body [21, 22]. Such information was not used in this simulation study as precise statistical knowledge of connection between such body measurement and arterial length was not available. Instead, Gaussian process regressors implicitly marginalize predictions over different arterial lengths that are present in the virtual database.
Ultimately it would be beneficial to develop approaches that do not need reference measurement (aortic valve opening/R-peak). For example, Choudhury et al [31] presented a machine learning algorithm which uses raise times and pulse widths derived from PPG signal to predict DBP and SBP. Furthermore, deep learning approaches could perhaps be used to infer optimal information from PPG waveform. These are subject of our future research.
Supporting information
Data Availability
Data can be simulated with methods described in the manuscript.
Funding Statement
JH was supported by the Seamless Patient Care project (Business Finland - former Tekes - the Finnish Funding Agency for Technology and Innovation grant funding grant 1939/31/2015). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Lyle AN, Raaz U. Killing me unsoftly: causes and mechanisms of arterial stiffness. Arter Thromb Vasc Biol. 2017;37(2):e1–e11. 10.1161/ATVBAHA.116.308563 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Lia Y, Hicksonb SS, McEniery CM, Wilkinson IB, Khir AW. Stiffening and ventricular—arterial interaction in the ascending aorta using MRI: ageing effects in healthy humans. Journal of Hypertension. 2019;37:347–355. 10.1097/HJH.0000000000001886 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Negoita M, Hughes AD, Parker KH, Khir AW. A method for determining local pulse wave velocity in human ascending aorta from sequential ultrasound measurements of diameter and velocity. Physiological Measurement. 2018;39:114009 10.1088/1361-6579/aae8a0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Ihlen H, Amlie JP, Dale J, Forfang K, Nitter-Hauge S, Otterstad JE, et al. Determination of cardiac output by Dopler echocardiography. British Heart Journal. 1984;51:54–60. 10.1136/hrt.51.1.54 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Moraes JL, Rocha MX, Vasconcelos GG, Vasconcelos FJE, de Albuquerque VHC, Alexandria AR. Advances in Photopletysmography Signal Analysis for Biomedical Applications. Sensors. 2018;18(6):1894 10.3390/s18061894 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Gao M, Olivier NB, Mukkamala R. Comparison of noninvasive pulse transit time estimates as markers of blood pressure using invasive pulse transit time measurements as a reference. Physiol Rep. 2016;4(10):e12768 10.14814/phy2.12768 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Balmer J, Pretty C, Davidson S, Desaive T, Kamoi S, Pironet A, et al. Pre-ejection period, the reason why the electrocardiogram Q-wave is an unreliable indicator of pulse wave initialization. Physiol Meas. 2018;39(9):095005 10.1088/1361-6579/aada72 [DOI] [PubMed] [Google Scholar]
- 8. Kortekaas MC, van Velzen MHN, Grüne F, Niehof SP, Stolker RJ, Huygen FJPM. Small intra-individual variability of the pre-ejection period justifies the use of pulse transit time as approximation of the vascular transit. PLoS One. 2018;13(10):e0204105 10.1371/journal.pone.0204105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Mukkamala R, Hahn J, Inan OT, Mestha LK, Kim C, Töreyin H, et al. Toward Ubiquitous Blood Pressure Monitoring via Pulse Transit Time: Theory and Practice. IEEE Trans Biomed Eng. 2015;62(8):1879–1901. 10.1109/TBME.2015.2441951 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Payne RA, Symeonides CN, Webb DJ, Maxwell SRJ. Pulse transit time measured from the ECG: an unreliable marker of beat-to-beat blood pressure. J Appl Physiol. 2006;100:136–141. 10.1152/japplphysiol.00657.2005 [DOI] [PubMed] [Google Scholar]
- 11. Rader F, Victor RG. The Slow Evolution of Blood Pressure Monitoring. JACC: Basic to Translational Science. 2017;2(6):643–645. 10.1016/j.jacbts.2017.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Willemet M, Chowienczyk P, Alastruey J. A database of virtual healthy subjects to assess the accuracy of foot-to-foot pulse wave velocities for estimation of aortic stiffness. Am J Physiol Heart Circ Physiol. 2015;309:H663–H675. 10.1152/ajpheart.00175.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Willemet M, Vennin S, Alastruey J. Computational assessment of hemodynamics-based diagnostic tools using a database of virtual subjects: Application to three case studies. J Biomech. 2016;49:3908–3914. 10.1016/j.jbiomech.2016.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Alastruey J, Parker KH, Sherwin SJ. Arterial pulse wave haemodynamics. In: 11th Int Conf Pressure Surges (eds. S. Anderson). Bedford, UK; 2012. p. 401–442.
- 15. van de Vosse FN, Stergiopulos N. Pulse Wave Propagation in the Arterial Tree. Annu Rev Fluid Mech. 2011;43(467–499). 10.1146/annurev-fluid-122109-160730 [DOI] [Google Scholar]
- 16. Alastruey J, Khir AW, Matthys KS, Segers P, Sherwin SJ, Verdonck PR, et al. Pulse wave propagation in a model human arterial network: Assessment of 1-D visco-elastic simulations against in vitro measurements. J Biomech. 2011;44:2250–2258. 10.1016/j.jbiomech.2011.05.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Matthys KS, Alastruey J, Peiró J, Khir AW, Segers P, Verdonck PR, et al. Pulse wave propagation in a model human arterial network: assessment of 1-D numerical simulations against in vitro measurements. J Biomech. 2007;40(15):3476–3486. 10.1016/j.jbiomech.2007.05.027 [DOI] [PubMed] [Google Scholar]
- 18. Olufsen MS, Peskin C, Kim WY, Pedersen EM, Nadim A, Larsen J. Numerical Simulation and Experimental Validation of Blood Flow in Arteries with Structured-Tree Outflow Conditions. Ann Biomed Eng. 2000;28:1281–1299. 10.1114/1.1326031 [DOI] [PubMed] [Google Scholar]
- 19. Mynard JP, Smolich JJ. One-Dimensional Haemodynamic Modeling and Wave Dynamics in the Entire Adult Circulation. Ann Biomed Eng. 2015;43(6):1443–1460. 10.1007/s10439-015-1313-8 [DOI] [PubMed] [Google Scholar]
- 20. Mynard JP, Davidson MR, Penny DJ, Smolich JJ. A simple, versatile valve model for use in lumped parameter and one-dimensional cardiovascular models. Int J Numer Meth Biomed Engng. 2012;28:626–641. 10.1002/cnm.1466 [DOI] [PubMed] [Google Scholar]
- 21. Sugawara J, Hayashi K, Tanaka H. Arterial Path Length for Arterial Stiffness: Methodological Consideration. Am J Hypertens. 2016;29(11):1237–1244. 10.1093/ajh/hpw075 [DOI] [PubMed] [Google Scholar]
- 22. Bossuyta J, Veldea SVD, Azermaia M, Vermeerscha SJ, Backera TLMD, Devosc DG, et al. Noninvasive assessment of carotid-femoral pulse wave velocity: the influence of body side and body contours. J Hypertens. 2013;31:946–951. 10.1097/HJH.0b013e328360275d [DOI] [PubMed] [Google Scholar]
- 23. Olufsen MS. Structured tree outflow condition for blood flow in larger systemic arteries. Am J Physiol Heart Circ Physiol. 1999;276:H257–H268. 10.1152/ajpheart.1999.276.1.H257 [DOI] [PubMed] [Google Scholar]
- 24. Scarsoglio S, Guala A, Camporeale C, Ridolfi L. Impact of atrial fibrillation on the cardiovascular system through a lumped-parameter approach. Med Biol Eng Comput. 2014;52(11):905–920. 10.1007/s11517-014-1192-4 [DOI] [PubMed] [Google Scholar]
- 25. Rasmussen CE, Williams CKI. Gaussian Processes for Machine Learning Adaptive Computation and Machine Learning, Cambridge, MA: The MIT Press; 2006. [Google Scholar]
- 26. Rubin Y. Applied Stochastic Hydrogeology Oxford University Press; 2003. [Google Scholar]
- 27. Kennedy MC, O’Hagan A. Bayesian calibration of computer models. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2001;63(3):425–464. 10.1111/1467-9868.00294 [DOI] [Google Scholar]
- 28. Friedman JH. Multivariate adaptive regression splines. Annals of Statistics. 1991;19(1):1–141. [Google Scholar]
- 29. Efron B. Better Bootstrap Confidence Intervals. Journal of the American Statistical Association. 1987;82(397):171–185. 10.2307/2289153 [DOI] [Google Scholar]
- 30. Casas B, Lantz J, Viola F, Cedersund G, Bolger AF, Carlhäll CJ, et al. Bridging the gap between measurements and modelling: a cardiovascular functional avatar. Sci Rep. 2017;7:6214 10.1038/s41598-017-06339-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Choudhury AD, Banerjee R, Sinha A, Kundu S. Estimating blood pressure using Windkessel model on Photoplethysmogram. In: Conf Proc IEEE Eng Med Biol Soc.; 2014. p. 4567–70. [DOI] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data can be simulated with methods described in the manuscript.