Few-shot pulse wave contour classification based on multi-scale feature extraction

Peng Lu; Chao Liu; Xiaobo Mao; Yvping Zhao; Hanzhang Wang; Hongpo Zhang; Lili Guo

doi:10.1038/s41598-021-83134-y

. 2021 Feb 12;11:3762. doi: 10.1038/s41598-021-83134-y

Few-shot pulse wave contour classification based on multi-scale feature extraction

Peng Lu ^1,^2,^✉, Chao Liu ^1,², Xiaobo Mao ^1,², Yvping Zhao ^2,³, Hanzhang Wang ^1,², Hongpo Zhang ⁴, Lili Guo ^5,⁶

PMCID: PMC7881007 PMID: 33580107

Abstract

The annotation procedure of pulse wave contour (PWC) is expensive and time-consuming, thereby hindering the formation of large-scale datasets to match the requirements of deep learning. To obtain better results under the condition of few-shot PWC, a small-parameter unit structure and a multi-scale feature-extraction model are proposed. In the small-parameter unit structure, information of adjacent cells is transmitted through state variables. Simultaneously, a forgetting gate is used to update the information and retain long-term dependence of PWC in the form of unit series. The multi-scale feature-extraction model is an integrated model containing three parts. Convolution neural networks are used to extract spatial features of single-period PWC and rhythm features of multi-period PWC. Recursive neural networks are used to retain the long-term dependence features of PWC. Finally, an inference layer is used for classification through extracted features. Classification experiments of cardiovascular diseases are performed on photoplethysmography dataset and continuous non-invasive blood pressure dataset. Results show that the classification accuracy of the multi-scale feature-extraction model on the two datasets respectively can reach 80% and 96%, respectively.

Subject terms: Biomedical engineering, Cardiovascular diseases, Computer science

Introduction

The collection process of pulse wave signals is simple and non-invasive, enabling convenient acquisition of PWC data^1–3. The classification of PWC can be used to monitor the heart condition. However, no uniform standard exists for marking PWC⁴. The standard diagnostics of PWC is primarily based on an individual’s different clinical knowledge and experience to define the morphological features. The formed diagnostic results are subjective, which are difficult to be recognized by all experts.

Existing classification methods of PWC can be categorized into two: statistical methods and machine-learning methods^5,6. Statistical methods represented by artificial time-frequency domain feature extraction are suitable for stationary sequences⁷. Machine-learning methods represented by convolution neural networks^8,9 (CNN), recursive neural networks^10,11 (RNN), and support vector machines^12,13 are suitable for complex non-linear sequences. Clinically collected PWCs are non-stationary and time varying¹⁴, rendering it suitable for machine-learning methods.

Long short-term memory (LSTM), a variant of RNN, has the ability of mining long-distance time-series data information¹⁵. It is extensively used in machine translation^16,17, fault diagnosis^18,19, speech recognition^20,21, and electrocardiogram classification^22,23. In literature²⁴, the representation of speech signals from an original network is automatically learned by CNN, and then the temporal representation of features is learned by LSTM; In literature²⁵, the features of wearable sensor data is learned by CNN, and then the time dependence between actions are modeled by LSTM.

The common idea of the above methods is to extract high-dimensional features by CNN and subsequently obtain short-series features through LSTM, which can be used to predict and synthesize time series. PWC is a kind of few-shot data that difficultly meets the training needs of complex deep networks²⁶.

In the present study, a recursive-network unit structure based on LSTM is designed and found to be suitable for few-shot PWC. Compared with LSTM structure, it has fewer parameters and faster training rate. Adjacent unit information is transmitted through state variables, information is updated through a forgetting gate, and the long-term dependence of PWC is retained in the form of unit series. Simultaneously, a multi-scale feature-extraction model adapted to few-shot PWC is proposed. The multi-scale feature-extraction model is an integrated model that extracts features from three angles. The spatial features of single-period PWC and the rhythm features of multi-period PWC are extracted through CNN. The long-term dependence features of PWC are retained through RNN. All extracted features are then combined linearly for classification.

Cardiovascular-disease classification experiments are performed on photoplethysmography (PPG) dataset and continuous non-invasive blood pressure (CNBP) dataset. Experimental results show that the classification accuracy of the multi-scale feature-extraction model can reach 80% and 96%, respectively.

The remaining part is organized as follows. In the ’Materials & Methods’ section, the experimental data are stated, and multi-scale feature-extraction model are proposed. In the ’Results’ section, the process of data pre-processing and experiment are stated in detail. Finally, conclusions are drawn in the ’Discussion’ section.

Materials and methods

Data description

At present, the most commonly used methods of collecting pulse wave signals information primarily include PPG and pressure detection. PPG traces the pulsation state of blood vessels and measures pulse wave signals by measuring the attenuated light reflected and absorbed by human blood vessels and tissues²⁷. Pressure-detection method obtains pulse wave signals by directly detecting changes in the pressure of human arteries over time. The cuff is tied directly to the patient’s wrist, and the patient should maintain the same posture during measurement²⁸. Patients feel that the wrist is sore and uncomfortable during long-term measurement. Compared with the pressure-detection method, the operation of PPG is simpler and suitable for long-term measurement, but it is susceptible to environmental interference during the measurement process, and its measurement accuracy and stability are lower than those of the pressure-detection method. For pulse wave signals obtained using different acquisition methods, to verify the generalization ability of the model, this study conducts experiments based on PPG dataset and CNBP dataset.

Considering that the clinically collected pulse wave signals contain substantial noise, pre-processing steps such as data filtering, beat division, etc. are necessary. The specific pre-processing details are described in the ’Results’ section.

Dataset I: The PPG dataset for the non-invasive detection of cardiovascular diseases contains 657 pulse wave records from 219 subjects²⁹. The data covers the age range of 20–89 years old, and the sampling rate is 1000 Hz. Various diseases including hypertension and diabetes are recorded. The dataset provides four labels for normal blood pressure, pre-hypertension, and stage I/II hypertension. Table 1 shows the details of dataset I.
Dataset II: The CNBP dataset based on cardiovascular diseases is collected from the Fifth Affiliated Hospital of Zhengzhou University. The subjects who are 18–83 years old fill-up an informed-consent form before data collection. ZM-300 intelligent pulse wave signal collector is adopted for data acquisition, with a sampling frequency of 200 Hz. A pressure sensor based on a semiconductor strain gauge, with a sensitivity of 0.5 mV/g (bridge voltage of 6 V) and a pressure range of 0–1000 g, is used for the collection of pulse wave signals. The A/D converter is a 4-channel 12-bit converter. Dataset II contains 1326 pulse wave records from 221 subjects under six kinds of pulse pressures. It records various cardiovascular diseases, including hypertension and arrhythmia.

Moreover, the dataset is collected under the control of standard experimental conditions and specifications. All methods are carried out in accordance with relevant guidelines and regulations, and all experimental protocols are approved by the Ethics Committee of Drug Clinical Trials of the Fifth Affiliated Hospital of Zhengzhou University. Table 2 shows the details of dataset II.

Table 1.

Dataset I label and sample size.

Type of disease	Number of samples(group)	Pre-processing tags	Number of samples(group)
Normal	80*3	Normal	593
Pre-hypertension	85*3	Hypertension	617
Hypertension Stage I	34*3
Hypertension Stage II	20*3

Open in a new tab

Table 2.

Dataset II label and sample size.

Type of disease	Number of samples(group)	Pre-processing tags	Number of samples (group)
Normal	156*6	Normal	1582
Arrhythmia	18*6	Desease	1634
Myocardial ischemia	18*6
Hypertension	15*6
Others	14*6

Open in a new tab

Multi-scale feature-extraction model

Few pulse wave datasets of cardiovascular diseases are available. At the same time, the feature division of PWC is unclear, and no uniform marking standard exists. The machine needs to learn PWC features automatically. In view of the above problems, a solution is to establish a feature-extraction model based on few-shot PWC. In this paper, a multi-scale feature-extraction model is proposed.

The multi-scale feature-extraction model for few-shot PWC is shown in Fig. 1. N continuous PWC cycles are used as the network input, and then three network layers are connected in parallel to extract the multi-scale features of PWC. The recursive network layer is used to extract the length features of PWC; the periodic feature-extraction layer is used to extract the features of PWC within a single cycle; and the rhythm feature-extraction layer is used to extract PWC features between multiple cycles. Finally, features extracted from the three network layers are combined in the inference layer to classify PWC.

Recursive network layer

PWC is composed of multiple cycles of pulse fluctuations and belongs to a one-dimensional time series. RNN is a network which is good at dealing with time series. By modeling PWC through RNN, the correlation between several cycles of PWC can be obtained. PWC is segmented in chronological order and sequentially input into RNN. The output state of the hidden layer represents the long-term memory feature extracted as the recursive network layer.

Considering the number of sample points and the complexity of network training, a recursive-network unit structure for few-shot PWC is designed. The forgetting and saving of information at each moment are controlled by the forgetting gate. Compared with the LSTM unit structure, the parameters are fewer. The specific structure is shown in Fig. 2.

In the recursive-network unit for few-shot PWC, the unit state vector c is controlled by the forgetting gate. The number of unit states of the previous-cycle PWC retained in the current-cycle PWC is determined by the forgetting gate. The forward calculation formula of the forgetting gate is as follows:

\begin{matrix} n e t_{f, t} = & W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f} \end{matrix}

\begin{matrix} f_{t} = & σ (n e t_{f, t}) \end{matrix}

where the weighted input of the forgetting gate is represented by $n e t_{f, t}$ , the weight matrix of the forgetting gate is represented by $W_{f}$ , the operation of splicing two vectors into one vector is represented by $[h_{t - 1}, x_{t}]$ , the offset term of the forgetting gate is represented by $b_{f}$ , the forgetting gate is represented by $f_{t}$ , and $σ$ refers to sigmoid function.

The weight matrix $W_{f}$ is formed by splicing $W_{fh}$ and $W_{fx}$ , corresponding to the previous-cycle PWC $h_{t - 1}$ and the current-cycle PWC $h_{t}$ :

\begin{matrix} n e t_{f, t} = [\begin{matrix} W_{fh} & W_{fx} \end{matrix}] [\begin{matrix} h_{t - 1} \\ x_{t} \end{matrix}] + b_{f} = W_{fh} h_{t - 1} + W_{fx} x_{t} + b_{f} \end{matrix}

The input state ${\tilde{c}}_{t}$ of the current-cycle PWC is calculated from the unit output of the previous cycle and the input of the current cycle:

\begin{matrix} n e {t_{\tilde{c},}}_{t} = & W_{c} \times [h_{t - 1}, x_{t}] + b_{c} = W_{ch} h_{t - 1} + W_{cx} x_{t} + b_{c} \end{matrix}

\begin{matrix} {\tilde{c}}_{t} = & t a n h (n e {t_{\tilde{c},}}_{t}) \end{matrix}

where the weighted input of the PWC unit state is represented by $n e t_{\tilde{c}, t}$ ; the weight matrix of the PWC unit state is represented by $W_{c}$ , which is composed of two matrices $W_{ch}$ and $W_{cx}$ ; and the bias term of the forgetting gate is represented by $b_{c}$ .

To update the cell state of the current cycle, the current memory ${\tilde{c}}_{t}$ needs to be combined with the long-term memory $c_{t - 1}$ , and new information needs to be added whilst forgetting some information. The unit state $c_{t - 1}$ of the previous-cycle PWC is multiplied with the corresponding position element of the forgetting gate $f_{t}$ , and the unit state ${\tilde{c}}_{t}$ of the current-cycle PWC is multiplied with the corresponding position element of $1 - f_{t}$ to obtain the state output of the current-cycle PWC $c_{t}$ :

\begin{matrix} c_{t} = d i a g [f_{t}] c_{t - 1} + d i a g [1 - f_{t}] {\tilde{c}}_{t} \end{matrix}

The output is determined by the state of the PWC unit:

\begin{matrix} h_{t} = t a n h (c_{t}) \end{matrix}

As shown in Fig. 3, the recursive network layer is designed as a two-layer structure. The time of the main wave peak of the front PWC is used as the reference, and the single beat PWC of two adjacent troughs is used as the input of the unit. To mine the information that is opposite to the first-layer network-information transmission direction, a layer of reverse network is built on top of each layer of forward network, and the output of the first-layer network is directly used as the input of the reverse network. The output of the lower networks ( $h_{0} \sim h_{n}$ ) is used as the input of the reverse network after random screening by the upper layer:

\begin{matrix} r_{i} \sim B e r n o u l l i (p) \end{matrix}

\begin{matrix} x_{t} = d i a g (r_{t}) \times h_{t} \end{matrix}

where Bernoulli(p) represents the Bernoulli distribution according to probability p and has a value of 0 or 1; $r_{t} = [r_{0}, r_{1}, \dots, r_{n}]$ , n are the dimensions of $h_{t}$ , and $x_{t}$ is the input of the unit node at time t at the reverse network layer.

Periodic feature-extraction layer

The shape and change of PWC in a single cycle contain its main features. Extracting the spatial features of PWC in a single cycle is necessary. However, PWC is sparse and contains little information at a single moment. If all points of PWC are used as input sessions, the number of network parameters becomes too large. At the same time, the training efficiency and accuracy of the model are greatly reduced by too many features.

To effectively extract the features of PWC in a single cycle, one-dimensional CNN is used. The convolution module has the functions of local connection and weight sharing and can extract the spatial position relationship at different time points of a single-cycle PWC. At the same time, it can reduce the amount of network parameters and reduce the model complexity.

The single-cycle PWC is a signal segment with a length of 235, and it is used as the input. The specific structure of the single-period feature of PWC is obtained through two convolution modules, which each module containing a one-dimensional convolution layer and a maximum pooling layer. To reduce the amount of computation in the process of model training and improve the computation speed, ReLU function is used as the activation function. The specific structure is shown in Fig. 4. .

Structure of periodic feature extraction layer.

Rhythm feature-extraction layer

PWC is formed by the periodic and regular pulse of pressure in arteries. However, slight differences exist between PWC cycles, which contain rhythm information and are difficult to observe artificially. The rhythm features represent the feature relationship between cycles and reflects the envelope change of the pulse signal. These features have a classification effect and can improve classification accuracy. To extract the rhythm features of PWC, a two-dimensional CNN is used.

Compared with single-cycle signals, multi-cycle signals have larger data dimensions, and a suitable convolution module scale is more important. Whilst ensuring that features are not discarded, reducing the dimension of data as much as possible is necessary. N adjacent periodic signals are spliced into a feature map, which is used as the input. The specific structure of the single-period features of PWC is obtained through two convolution modules, with each module containing one two-dimensional convolution layer and one max pooling layer. Same as the periodic feature-extraction layer, ReLU function is used as the activation function. The specific structure is shown in Figure 5.

Structure of rhythm feature extraction layer.

Reasoning layer

The reasoning layer contains two fully connected layers, each neuron is fully connected to all neurons in the previous layer, and the local information with category discrimination in the network layer is integrated. The activation function of the fully connected layer adopts the ReLU function, and the training algorithm adopts the error back-propagation algorithm. The output of the last layer is passed through the SoftMax classifier to obtain PWC classification result.

Given that the gate-unit activation function in the recursive network layer is selected as the Sigmoid function, when the training result is close to the true value, the gradient operator is extremely small, and the convergence speed of the model is slow. The cross-entropy loss function is a logarithmic function. When it is close to the upper boundary, it can still be maintained at a high gradient state without affecting the convergence speed of the model. To improve the convergence speed of the model, cross-entropy is used as the loss function:

\begin{matrix} l o s s = - \frac{1}{N} \sum_{i = 1}^{N} (- \sum_{j = 1}^{M} y_{ij} log (p_{ij})) \end{matrix}

where N is the number of samples, M is the number of PWC categories, $y_{ij}$ is the label of PWC, and $p_{ij}$ is the prediction result corresponding to each label.

Results

Evaluation index

We use Accuracy(Acc), Sensitivity (Sen), Specificity(Spe), Precision(Pre), and F1-score for model evaluation. The calculation formula for the five evaluation indicators is as follows³⁰:

\begin{matrix} Acc = & \frac{T P + T N}{T P + T N + F P + F N} \end{matrix}

\begin{matrix} Sen = & \frac{TP}{T P + F N} \end{matrix}

\begin{matrix} Spe = & \frac{TN}{T N + F P} \end{matrix}

\begin{matrix} Pre = & \frac{TP}{T P + F P} \end{matrix}

\begin{matrix} F 1 - s c o r e = & \frac{2 \times P r e \times S e n}{P r e + S e n} \end{matrix}

where TP stands for true positive and is the number of samples predicting abnormal PWC; TN stands for true negative and is the number of samples predicting normal PWC as normal; FP stands for false positive and is the number of samples predicting normal PWC as abnormal; FN stands for false negative and is the number of samples predicting abnormal PWC as normal; Acc represents the overall classification accuracy of the overall model; Sen represents the proportion of abnormal PWC that are matched and measures the model’s ability to recognize abnormal PWC; Spe represents the proportion of normal PWC that are matched and measures the pairing of the classifier, i.e., the ability to recognize normal PWC; and Pre represents the proportion of PWC classified as abnormal that are actually marked as abnormal.

Preprocessing

Pulse wave signals collected in clinical settings is easily affected by noise, including motion artifacts, inherent noise of collecting instruments, and power-supply noise. Using end-to-end training directly reduces the classification accuracy. Therefore, the original signals need to be pre-processed before the experiment.

The frequency of pulse wave signals is primarily distributed between 0.5–2 Hz. Motion artifact is primarily caused by breathing, and the respiratory frequency of normal adults is about 0.2–0.3 Hz. The inherent noise of collecting instruments is above 90 Hz. The power supply noise is 50 Hz/60 Hz. Empirical mode decomposition (EMD) is a signal-processing method based on the time-scale features of the data itself without pre-setting any basis functions³¹. EMD has obvious advantages in dealing with non-stationary and non-linear data. According to the frequency-distribution features of the pulse wave signals, the EMD method is used to remove the noise.

In the process of EMD, cubic spline function is used to interpolate the maximum value sequence and the minimum value sequence to obtain the upper and lower envelopes. The cubic spline interpolation needs to use two adjacent points before and after. Therefore, there will be divergence at the two ends of the data, that is, the derivative of the intrinsic mode-function at the boundary increases, which makes the filtered signal has obvious distortion at the beginning and end. To avoid the transient phenomenon at the beginning and end of the filtered signal, the end points of the curve are added to the spline. The Pearson correlation degree is used to measure the degree of information loss before and after denoising, and the calculation method is shown in equation (16):

\begin{matrix} P e a r s o n = \frac{\sum XY - \frac{\sum X \sum Y}{N}}{\sqrt{(\sum X^{2} - \frac{{(\sum X)}^{2}}{N}) (\sum Y^{2} - \frac{{(\sum Y)}^{2}}{N})}} \end{matrix}

where X represents the set of elements arranged in time in the original PWC, Y represents the set of elements arranged in time after denoising, and N represents the number of sample points.

The value interval of Person is [− 1,1]. When a person is close to 1, X and Y have a strong linear correlation, showing that the information loss is relatively small after denoising. In the experiment, only data with a Person coefficient greater than 0.93 after filtering is retained to ensure that the filtered data retains the original information to the greatest extent. The calculation result of the signal Person coefficient in Fig. 6 is 0.99.

Signals collected clinically are actually the slice data of pulse wave signals. Few records of pulse wave database itself exist; not every record has all categories, and even some categories appear only on a few records. In this case, to ensure that categories are not omitted as far as possible, the PWC should be divided separately into beats³². In the experiment, the PWC is divided into cycles according to the trough position. The signal segment of the adjacent valleys is regarded as one PWC cycle. The specific pre-treatment process is shown in Fig. 6.

Setting of model parameters

The pre-processed PWC is randomly divided into training set and test set at a ratio of 4:1. Under the condition of a certain amount of data, the classification accuracy of the model no longer increases before the number of hidden nodes of the neural network increases to a certain extent³³. Therefore, the number of hidden nodes in the recursive network layer is set to 32, the number of kernels in the periodic feature-extraction layer is set to 32 and 64, the number of kernels in the rhythm feature-extraction layer is set to 32 and 64. After verification through multiple experiments, the network training parameters are set, as shown in Table 3.

Table 3.

Network hyperparameters.

Parameter		Value
Periodic feature-extraction layer		Convolution Layer 1	Convolution Layer 2
	Kernel dimension	7*1	5*1
	Stride size	5*1	5*1
	Amount of kernels	32	64
	Pooling dimension	3*1	3*1
	Pooling stride size	2*1	1*1
Rhythm feature-extraction layer	kernel dimension	7*3	5*3
	Stride size	5*1	5*3
	Amount of kernels	32	64
	Pooling dimension	3*1	3*1
	Pooling stride size	2*1	1*1
Recursive network layer	Input channels	235
	Number of hidden nodes	32
	Bidirectional	True
	Dropout	0.5
Cycle length of PWC		235
Unit cycle		N
Reasoning layers		Feature dimension32/322
Batch size		64
Optimizer		Adam

Open in a new tab

Experiment

The number of PWC cycles input of the model is expressed as a unit cycle. An increase in amount of cycles of model input leads to an increase in depth of the recursive network. Furthermore, the proportion of features extracted by the periodic feature-extraction layer and the rhythm feature-extraction layer increase. At the same time, the network dimension increases leads to decreased training speed of the model. To determine the optimal number of unit cycles for a suitable PWC multi-feature scale model, experiments are conducted for different unit cycles. Training set II is used in the experiment, and each group is trained 50 times. The final results are shown in Table 4. According to the table, when the number of unit cycles is 5, the model has the optimum effect.

Table 4.

The influence of different unit cycles on experimental results.

Unit cycles	Acc(%)	Sen(%)	Spe(%)	Pre(%)	F1
2	93.17	93.25	92.50	92.68	92.97
3	94.35	94.86	92.96	93.12	93.98
4	95.24	93.37	96.03	96.27	94.80
5	96.39	94.66	96.67	96.88	95.75
6	94.34	91.74	93.94	94.34	93.02

Open in a new tab

On the premise of retaining the structure of the pre-processing layer and the reasoning layer, as well as the network parameters, the network layer is changed for comparison experiments. Zhan³⁴ et al. extracted features through CNN for classification. Liu³⁵ et al. extracted PWC timing features through parallel CNN structures, including signal segment features within a period and multi-period features representing cycle relationships; Ghosh³⁶ et al. extracted PWC features based on LSTM to predict systolic and diastolic blood pressure. The training process of different network models is shown in Figure 7.

Changes in classification accuracy of different network training processes.

Performance comparison with other neural network methods under different datasets is shown in Table 5. According to the table, the multi-scale model has good classification performance on dataset I and dataset II.

Table 5.

Performance comparison of multi-scale model and other methods.

Model	Dataset I (PPG)		Dataset II (CNBP)
Model	Acc(%)	F1	Acc(%)	F1
Zhan	76.02	74.82	92.71	92.12
Liu	78.16	78.33	91.94	91.44
Ghosh	72.54	67.54	95.72	95.35
Multi-scale feature extraction model	79.75	79.01	96.22	95.81

Open in a new tab

Discussion

A small-parameter unit structure and a multi-scale feature-extraction model are proposed to obtain improved results under the condition of few-shot pulse wave contour classification. Cardiovascular-disease classification experiments were carried out on PPG dataset and CNBP dataset. To evaluate the classification performance of the multi-scale feature-extraction model, three different classification methods including single neural network model³⁴, parallel neural network model³⁵, and LSTM model³⁶ were introduced for comparison. Comparative studies show that the multi-scale feature-extraction model outperforms the other classification methods in terms of identification accuracy, stability, and sensitivity, and the multi-scale feature-extraction model consumes less time for training. For the proposed novel PWC classification approach, the model is notably sensitive to the number of unit cycles, and we find that the best unit cycle is five. Also, the multi-scale feature-extraction model depends on the division result of the PWC cycle.

Moreover, (a) the classification problem of few-shot PWC is well handled; (b) owing to the limitation of data volume, only binary classification is carried out; (c) PWC must be pre-processed (including EMD filtering and beat segmentation) before feature learning, which may lead to missing partial information. Finally, given the excellent performance of the multi-scale feature-extraction model in the few-shot PWC experiments, the classification of few-shot PWC is interesting and meaningful to study, and it will be further investigated in our future research.

Acknowledgements

This research was supported by the National Key R&D Program of China (Grant no. 2020YFC2006100), the Key Project at Central Government Level (Grant no. 2060302), and the Outstanding Foreign Scientist Support Project in Henan Province (Grant no. GZS2019008).

Author contributions

P.L. conceived the experiments, C.L. and H.W. conducted the experiments, C.L. and H.W. analyzed the results, X.M., Y.Z., H.Z. and L.G. read and revised the paper. All authors reviewed the manuscript.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Fan F, Yan Y, Tang Y, Zhang H. A motion-tolerant approach for monitoring spo(2) and heart rate using photoplethysmography signal with dual frame length processing and multi-classifier fusion. Comput. Biol. Med. 2017;91:291–305. doi: 10.1016/j.compbiomed.2017.10.017. [DOI] [PubMed] [Google Scholar]
2.Phillips JP, Hickey M, Kyriacou PA. Evaluation of electrical and optical plethysmography sensors for noninvasive monitoring of hemoglobin concentration. Sensors. 2018;12:1816–1826. doi: 10.3390/s120201816. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Papini, G. B. et al. Photoplethysmography beat detection and pulse morphology quality assessment for signal reliability estimation. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Seogwipo, Korea, Jul 11–15, 2017, 117–120, 10.1109/EMBC.2017.8036776 (IEEE, 2017). [DOI] [PubMed]
4.Denisse, et al. A review on wearable photoplethysmography sensors and their potential future applications in health care. Int. J. Biosens. Bioelectron. 2018;4:195–202. doi: 10.15406/ijbsbe.2018.04.00125. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Jermana M, et al. Advances in photopletysmography signal analysis for biomedical applications. Sensors. 2018;18:1894. doi: 10.3390/s18061894. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Srihari A, Mousumi P, Srivatsa K. A comparative study and analysis of time series forecasting techniques. SN Comput. Sci. 2020;1:175. doi: 10.1007/s42979-020-00180-5. [DOI] [Google Scholar]
7.Pirhonen M, Vehkaoja A. Fusion enhancement for tracking of respiratory rate through intrinsic mode functions in photoplethysmography. Biomed. Signal Process. Control. 2020;59:101887. doi: 10.1016/j.bspc.2020.101887. [DOI] [Google Scholar]
8.Makridakis S, Spiliotis E, Assimakopoulos V. Statistical and machine learning forecasting methods: Concerns and ways forward. Plos One. 2018;13:e0194889. doi: 10.1371/journal.pone.0194889. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Poernomo A, Kang DK. Biased dropout and crossmap dropout: Learning towards effective dropout regularization in convolutional neural network. Neural Netw. 2018;104:60–67. doi: 10.1016/j.neunet.2018.03.016. [DOI] [PubMed] [Google Scholar]
10.Sui M, Wang Y, Han X, Kloeden PE. Random recurrent neural networks with delays. J. Differ. Equ. 2020;269:8597–8639. doi: 10.1016/j.jde.2020.06.008. [DOI] [Google Scholar]
11.Quax SC, D’Asaro M, van Gerven MAJ. Adaptive time scales in recurrent neural networks. Sci. Rep. 2020;10:11360. doi: 10.1101/800540. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Tang X, Machimura T, Li J, Liu W, Hong H. A novel optimized repeatedly random undersampling for selecting negative samples: A case study in an SVM-based forest fire susceptibility assessment. J. Environ. Manag. 2020;271:111014. doi: 10.1016/j.jenvman.2020.111014. [DOI] [PubMed] [Google Scholar]
13.Wang D, Chen G, Xing D, Liang D. Fault diagnosis method for switch control circuit based on SVM-AdaBoost. J. Meas. Sci. Instrum. 2020;11:251–257. doi: 10.3969/j.issn.1674-8042.2020.03.008. [DOI] [Google Scholar]
14.Peralta E, Lazaro J, Bailon R, Marozas V, Gil E. Optimal fiducial points for pulse rate variability analysis from forehead and finger photoplethysmographic signals. Physiol. Meas. 2019;40:025007. doi: 10.1088/1361-6579/ab009b. [DOI] [PubMed] [Google Scholar]
15.Karim F, Majumdar S, Darabi H, Chen S. Lstm fully convolutional networks for time series classification. IEEE Access. 2018;6:1662–1669. doi: 10.1109/ACCESS.2017.2779939. [DOI] [Google Scholar]
16.Alayba AM, Palade V, England M, Iqbal R. A combined CNN and LSTM model for Arabic sentiment analysis. Lecture Notes Comput. Sci. 2018;11015:179–191. doi: 10.1007/978-3-319-99740-7_12. [DOI] [Google Scholar]
17.Wu S, et al. Dependency-to-dependency neural machine translation. IEEE ACM Trans. Audio Speech Lang. Process. 2018;26:2132–2141. doi: 10.1109/TASLP.2018.2855968. [DOI] [Google Scholar]
18.Huang D, Fu Y, Qin N, Gao S. Fault diagnosis of high-speed train bogie based on LSTM neural network. Sci. China Inf. Sci. 2021;64:119203. doi: 10.1007/s11432-018-9543-8. [DOI] [Google Scholar]
19.Xue ZY, Xiahou KS, Li MS, Ji TY, Wu QH. Diagnosis of multiple open-circuit switch faults based on long short-term memory network for DFIG-based wind turbine systems. IEEE J. Emerg. Select. Top. Power Electron. 2020;8:2600–2610. doi: 10.1109/JESTPE.2019.2908981. [DOI] [Google Scholar]
20.Wang D, Chen J. Supervised speech separation based on deep learning: An overview. IEEE/ACM Trans. Audio Speech Lang. Process. 2018;26:1702–1726. doi: 10.1109/TASLP.2018.2842159. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Zhao, J., Mao, X. & Chen, L. Speech emotion recognition using deep 1d & 2d CNN LSTM networks. Biomed. Signal Process. Control47, 312–323 10.1016/j.bspc.2018.08.035 (2019).
22.Yildirim O. A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification. Comput. Biol. Med. 2018;96:189–202. doi: 10.1016/j.compbiomed.2018.03.016. [DOI] [PubMed] [Google Scholar]
23.Tan JH, et al. Application of stacked convolutional and long short-term memory network for accurate identification of CAD ECG signals. Comput. Biol. Med. 2018;94:19–26. doi: 10.1016/j.compbiomed.2017.12.023. [DOI] [PubMed] [Google Scholar]
24.Dias I, Fatih Demirci M, Adnan Y. Speech emotion recognition with deep convolutional neural networks. Biomed. Signal Process. Control. 2020;59:101894. doi: 10.1016/j.bspc.2020.101894. [DOI] [Google Scholar]
25.Ordonez FJ, Roggen D. Deep convolutional and LSTM recurrent neural networks for multimodal wearable activity recognition. Sensors. 2016;16:115. doi: 10.3390/s16010115. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Xiao, X., Fu, D., Shi, Y. & Wen, J. Optimized mahalanobis-taguchi system for high-dimensional small sample data classification. Comput. Intell. Neurosci.https://kopernio.com/viewer?doi=10.1155%2F2020%2F4609423 (2020). [DOI] [PMC free article] [PubMed]
27.Guo Y, et al. Mobile photoplethysmographic technology to detect atrial fibrillation. J. Am. Coll. Cardiol. 2019;74:2365–2375. doi: 10.1016/j.jacc.2019.08.019. [DOI] [PubMed] [Google Scholar]
28.Senturk U, Polat K, Yucedag I. A non-invasive continuous cuffless blood pressure estimation using dynamic recurrent neural networks. Appl. Acoust. 2020;170:107534. doi: 10.1016/j.apacoust.2020.107534. [DOI] [Google Scholar]
29.Liang Y, Chen Z, Liu G, Elgendi M. A new, short-recorded photoplethysmogram dataset for blood pressure monitoring in china. Sci. Data. 2018;5:180020. doi: 10.1038/sdata.2018.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Tjahjadi H, Ramli K, Murfi H. Noninvasive classification of blood pressure based on photoplethysmography signals using bidirectional long short-term memory and time-frequency analysis. IEEE Access. 2020;8:20735–20748. doi: 10.1109/ACCESS.2020.2968967. [DOI] [Google Scholar]
31.Abdulhay E, et al. Computer-aided autism diagnosis via second-order difference plot area applied to EEG empirical mode decomposition. Neural Comput. Appl. 2020;32:10947–10956. doi: 10.1007/s00521-018-3738-0. [DOI] [Google Scholar]
32.Ding X, Cheng F, Morris R, Chen C, Wang Y. Machine learning-based signal quality evaluation of single-period radial artery pulse waves: Model development and validation. JMIR Med. Inf. 2020;8:e18134. doi: 10.1038/sdata.2018.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Duan M, Li K, Liao X, Li K. A parallel multiclassification algorithm for big data using an extreme learning machine. IEEE Trans. Neural Netw. Learn. Syst. 2018;29:2337–2351. doi: 10.1109/TNNLS.2017.2654357. [DOI] [PubMed] [Google Scholar]
34.Zhan Q, Wang W, Haan G. Analysis of CNN-based remote-PPG to understand limitations and sensitivities. Biomed. Opt. Express. 2020;11:1268–1283. doi: 10.1364/BOE.382637. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Liu G, Zhou W. Aliasing classification of pulse wave characteristics in time-frequency domain based on convolutional neural network (in Chinese) J. Jilin Univ. (Eng. Technol. Edn.) 2020;50:1818–1825. doi: 10.13229/j.cnki.jdxbgxb2019050410.13229/j.cnki.jdxbgxb20190504. [DOI] [Google Scholar]
36.Ghosh, S., Banerjee, A., Ray, N., Wood, P. W. & Boulanger, P. Using accelerometric and gyroscopic data to improve blood pressure prediction from pulse transit time using recurrent neural network. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, CANADA, APR 15-20, 2018, 935–939, https://ieeexplore.ieee.org/document/8461959 (2017).

[CR1] 1.Fan F, Yan Y, Tang Y, Zhang H. A motion-tolerant approach for monitoring spo(2) and heart rate using photoplethysmography signal with dual frame length processing and multi-classifier fusion. Comput. Biol. Med. 2017;91:291–305. doi: 10.1016/j.compbiomed.2017.10.017. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Phillips JP, Hickey M, Kyriacou PA. Evaluation of electrical and optical plethysmography sensors for noninvasive monitoring of hemoglobin concentration. Sensors. 2018;12:1816–1826. doi: 10.3390/s120201816. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Papini, G. B. et al. Photoplethysmography beat detection and pulse morphology quality assessment for signal reliability estimation. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Seogwipo, Korea, Jul 11–15, 2017, 117–120, 10.1109/EMBC.2017.8036776 (IEEE, 2017). [DOI] [PubMed]

[CR4] 4.Denisse, et al. A review on wearable photoplethysmography sensors and their potential future applications in health care. Int. J. Biosens. Bioelectron. 2018;4:195–202. doi: 10.15406/ijbsbe.2018.04.00125. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Jermana M, et al. Advances in photopletysmography signal analysis for biomedical applications. Sensors. 2018;18:1894. doi: 10.3390/s18061894. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Srihari A, Mousumi P, Srivatsa K. A comparative study and analysis of time series forecasting techniques. SN Comput. Sci. 2020;1:175. doi: 10.1007/s42979-020-00180-5. [DOI] [Google Scholar]

[CR7] 7.Pirhonen M, Vehkaoja A. Fusion enhancement for tracking of respiratory rate through intrinsic mode functions in photoplethysmography. Biomed. Signal Process. Control. 2020;59:101887. doi: 10.1016/j.bspc.2020.101887. [DOI] [Google Scholar]

[CR8] 8.Makridakis S, Spiliotis E, Assimakopoulos V. Statistical and machine learning forecasting methods: Concerns and ways forward. Plos One. 2018;13:e0194889. doi: 10.1371/journal.pone.0194889. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Poernomo A, Kang DK. Biased dropout and crossmap dropout: Learning towards effective dropout regularization in convolutional neural network. Neural Netw. 2018;104:60–67. doi: 10.1016/j.neunet.2018.03.016. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Sui M, Wang Y, Han X, Kloeden PE. Random recurrent neural networks with delays. J. Differ. Equ. 2020;269:8597–8639. doi: 10.1016/j.jde.2020.06.008. [DOI] [Google Scholar]

[CR11] 11.Quax SC, D’Asaro M, van Gerven MAJ. Adaptive time scales in recurrent neural networks. Sci. Rep. 2020;10:11360. doi: 10.1101/800540. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Tang X, Machimura T, Li J, Liu W, Hong H. A novel optimized repeatedly random undersampling for selecting negative samples: A case study in an SVM-based forest fire susceptibility assessment. J. Environ. Manag. 2020;271:111014. doi: 10.1016/j.jenvman.2020.111014. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Wang D, Chen G, Xing D, Liang D. Fault diagnosis method for switch control circuit based on SVM-AdaBoost. J. Meas. Sci. Instrum. 2020;11:251–257. doi: 10.3969/j.issn.1674-8042.2020.03.008. [DOI] [Google Scholar]

[CR14] 14.Peralta E, Lazaro J, Bailon R, Marozas V, Gil E. Optimal fiducial points for pulse rate variability analysis from forehead and finger photoplethysmographic signals. Physiol. Meas. 2019;40:025007. doi: 10.1088/1361-6579/ab009b. [DOI] [PubMed] [Google Scholar]

[CR15] 15.Karim F, Majumdar S, Darabi H, Chen S. Lstm fully convolutional networks for time series classification. IEEE Access. 2018;6:1662–1669. doi: 10.1109/ACCESS.2017.2779939. [DOI] [Google Scholar]

[CR16] 16.Alayba AM, Palade V, England M, Iqbal R. A combined CNN and LSTM model for Arabic sentiment analysis. Lecture Notes Comput. Sci. 2018;11015:179–191. doi: 10.1007/978-3-319-99740-7_12. [DOI] [Google Scholar]

[CR17] 17.Wu S, et al. Dependency-to-dependency neural machine translation. IEEE ACM Trans. Audio Speech Lang. Process. 2018;26:2132–2141. doi: 10.1109/TASLP.2018.2855968. [DOI] [Google Scholar]

[CR18] 18.Huang D, Fu Y, Qin N, Gao S. Fault diagnosis of high-speed train bogie based on LSTM neural network. Sci. China Inf. Sci. 2021;64:119203. doi: 10.1007/s11432-018-9543-8. [DOI] [Google Scholar]

[CR19] 19.Xue ZY, Xiahou KS, Li MS, Ji TY, Wu QH. Diagnosis of multiple open-circuit switch faults based on long short-term memory network for DFIG-based wind turbine systems. IEEE J. Emerg. Select. Top. Power Electron. 2020;8:2600–2610. doi: 10.1109/JESTPE.2019.2908981. [DOI] [Google Scholar]

[CR20] 20.Wang D, Chen J. Supervised speech separation based on deep learning: An overview. IEEE/ACM Trans. Audio Speech Lang. Process. 2018;26:1702–1726. doi: 10.1109/TASLP.2018.2842159. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Zhao, J., Mao, X. & Chen, L. Speech emotion recognition using deep 1d & 2d CNN LSTM networks. Biomed. Signal Process. Control47, 312–323 10.1016/j.bspc.2018.08.035 (2019).

[CR22] 22.Yildirim O. A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification. Comput. Biol. Med. 2018;96:189–202. doi: 10.1016/j.compbiomed.2018.03.016. [DOI] [PubMed] [Google Scholar]

[CR23] 23.Tan JH, et al. Application of stacked convolutional and long short-term memory network for accurate identification of CAD ECG signals. Comput. Biol. Med. 2018;94:19–26. doi: 10.1016/j.compbiomed.2017.12.023. [DOI] [PubMed] [Google Scholar]

[CR24] 24.Dias I, Fatih Demirci M, Adnan Y. Speech emotion recognition with deep convolutional neural networks. Biomed. Signal Process. Control. 2020;59:101894. doi: 10.1016/j.bspc.2020.101894. [DOI] [Google Scholar]

[CR25] 25.Ordonez FJ, Roggen D. Deep convolutional and LSTM recurrent neural networks for multimodal wearable activity recognition. Sensors. 2016;16:115. doi: 10.3390/s16010115. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Xiao, X., Fu, D., Shi, Y. & Wen, J. Optimized mahalanobis-taguchi system for high-dimensional small sample data classification. Comput. Intell. Neurosci.https://kopernio.com/viewer?doi=10.1155%2F2020%2F4609423 (2020). [DOI] [PMC free article] [PubMed]

[CR27] 27.Guo Y, et al. Mobile photoplethysmographic technology to detect atrial fibrillation. J. Am. Coll. Cardiol. 2019;74:2365–2375. doi: 10.1016/j.jacc.2019.08.019. [DOI] [PubMed] [Google Scholar]

[CR28] 28.Senturk U, Polat K, Yucedag I. A non-invasive continuous cuffless blood pressure estimation using dynamic recurrent neural networks. Appl. Acoust. 2020;170:107534. doi: 10.1016/j.apacoust.2020.107534. [DOI] [Google Scholar]

[CR29] 29.Liang Y, Chen Z, Liu G, Elgendi M. A new, short-recorded photoplethysmogram dataset for blood pressure monitoring in china. Sci. Data. 2018;5:180020. doi: 10.1038/sdata.2018.20. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Tjahjadi H, Ramli K, Murfi H. Noninvasive classification of blood pressure based on photoplethysmography signals using bidirectional long short-term memory and time-frequency analysis. IEEE Access. 2020;8:20735–20748. doi: 10.1109/ACCESS.2020.2968967. [DOI] [Google Scholar]

[CR31] 31.Abdulhay E, et al. Computer-aided autism diagnosis via second-order difference plot area applied to EEG empirical mode decomposition. Neural Comput. Appl. 2020;32:10947–10956. doi: 10.1007/s00521-018-3738-0. [DOI] [Google Scholar]

[CR32] 32.Ding X, Cheng F, Morris R, Chen C, Wang Y. Machine learning-based signal quality evaluation of single-period radial artery pulse waves: Model development and validation. JMIR Med. Inf. 2020;8:e18134. doi: 10.1038/sdata.2018.20. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR33] 33.Duan M, Li K, Liao X, Li K. A parallel multiclassification algorithm for big data using an extreme learning machine. IEEE Trans. Neural Netw. Learn. Syst. 2018;29:2337–2351. doi: 10.1109/TNNLS.2017.2654357. [DOI] [PubMed] [Google Scholar]

[CR34] 34.Zhan Q, Wang W, Haan G. Analysis of CNN-based remote-PPG to understand limitations and sensitivities. Biomed. Opt. Express. 2020;11:1268–1283. doi: 10.1364/BOE.382637. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Liu G, Zhou W. Aliasing classification of pulse wave characteristics in time-frequency domain based on convolutional neural network (in Chinese) J. Jilin Univ. (Eng. Technol. Edn.) 2020;50:1818–1825. doi: 10.13229/j.cnki.jdxbgxb2019050410.13229/j.cnki.jdxbgxb20190504. [DOI] [Google Scholar]

[CR36] 36.Ghosh, S., Banerjee, A., Ray, N., Wood, P. W. & Boulanger, P. Using accelerometric and gyroscopic data to improve blood pressure prediction from pulse transit time using recurrent neural network. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, CANADA, APR 15-20, 2018, 935–939, https://ieeexplore.ieee.org/document/8461959 (2017).

PERMALINK

Few-shot pulse wave contour classification based on multi-scale feature extraction

Peng Lu

Chao Liu

Xiaobo Mao

Yvping Zhao

Hanzhang Wang

Hongpo Zhang

Lili Guo

Abstract

Introduction

Materials and methods

Data description

Table 1.

Table 2.

Multi-scale feature-extraction model

Figure 1.

Recursive network layer

Figure 2.

Figure 3.

Periodic feature-extraction layer

Figure 4.

Rhythm feature-extraction layer

Figure 5.

Reasoning layer

Results

Evaluation index

Preprocessing

Figure 6.

Setting of model parameters

Table 3.

Experiment

Table 4.

Figure 7.

Table 5.

Discussion

Acknowledgements

Author contributions

Competing interests

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases