Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2021 Dec 8;73:103436. doi: 10.1016/j.bspc.2021.103436

A physiological signal compression approach using optimized Spindle Convolutional Auto-encoder in mHealth applications

Vishal Barot a,, Dr Ritesh Patel b
PMCID: PMC9760972  PMID: 36567676

Abstract

Background and Objectives

The COVID-19 pandemic manifested the need of developing robust digital platforms for facilitating healthcare services such as consultancy, clinical therapies, real time remote monitoring, early diagnosis and future predictions. Innovations made using technologies such as Internet of Things (IoT), edge computing, cloud computing and artificial intelligence are helping address this crisis. The urge for remote monitoring, symptom analysis and early detection of diseases lead to tremendous increase in the deployment of wearable sensor devices. They facilitate seamless gathering of physiological data such as electrocardiogram (ECG) signals, respiration traces (RESP), galvanic skin response (GSR), pulse rate, body temperature, photoplethysmograms (PPG), oxygen saturation (SpO2) etc. For diagnosis and analysis purpose, the gathered data needs to be stored. Wearable devices operate on batteries and have a memory constraint. In mHealth application architectures, this gathered data is hence stored on cloud based servers. While transmitting data from wearable devices to cloud servers via edge devices, a lot of energy is consumed. This paper proposes a deep learning based compression model SCAElite that reduces the data volume, enabling energy efficient transmission.

Results

Stress Recognition in Automobile Drivers dataset and MIT-BIH dataset from PhysioNet are used for validation of algorithm performance. The model achieves a compression ratio of up to 300 fold with reconstruction errors within 8% over the stress recognition dataset and 106.34-fold with reconstruction errors within 8% over the MIT-BIH dataset. The computational complexity of SCAElite is 51.65% less compared to state-of-the-art deep compressive model.

Conclusion

It is experimentally validated that SCAElite guarantees a high compression ratio with good quality restoration capabilities for physiological signal compression in mHealth applications. It has a compact architecture and is computationally more efficient compared to state-of-the-art deep compressive model.

Keywords: Data compression, Energy efficiency, mHealth applications, Physiological signal compression, Spindle Convolutional Auto-encoder

1. Introduction

Wearable devices are computing devices that are a combination of electronics such as sensors and controller boards. These devices can be worn or sticked to human bodies for example watches, bands and glasses etc. The COVID-19 pandemic made us understand the need of continuous remote monitoring, early diagnosis and in time treatment, with limited resources. This has led to an increase in the use of IoT based wearable devices. As reported in [1], the healthcare domain is expected to spend around 20 billion dollars each year till 2023 on IoT based wearable devices. In mHealth applications, the wearable devices are connected to the patient’s mobile phone via Bluetooth. The mobile phone acts as the edge device that relays the information gathered by the wearable device to cloud based medical server. The healthcare providers can keep a check on patient’s vitals using a web interface. Additionally, if the measured vitals are higher or lower than the normal range, an alert or a notification can be sent to the authorities [2]. However, the mHealth architecture is resource constrained. Both wearable devices and edge devices are battery backed and have energy and memory constraint [1]. To address this challenge, we propose compression of the gathered time series - physiological data on the edge device so as to reduce the volume of gathered data. This thereby would reduce the energy consumed in transmission.

We propose Spindle Convolution Auto-encoder Lite (SCAElite) data compression model that is a deep compressive Auto-encoder. It is built with functional layers that extract the local information from physiological signals using convolution operation. The model has two components - convolutional encoder and convolutional decoder. While encoder is deployed on the edge device, decoder is deployed over cloud. As the dimensions increase in the initial few layers, sufficient amount of information is gathered from the physiological signals so as to make sure a quality compression. In the last few layers, the dimensions gradually decrease so as to merge the gathered information into codes, guaranteeing a high compression ratio. This gradual increase and then a decrease in dimensions in the architecture define the spindle structure of the auto-encoder. With the objective of improving the quality of reconstructed signals, the model obtains optimal set of encodings for compression.

Our major contributions are: (1) A deep learning based compression algorithm SCAElite that substantially reduces the data size of physiological signals including quasi-periodic signals such as respiration traces (RESP), electrocardiogram (ECG) etc. (2) Experimental validation of the proposed model over two datasets from PhysioNet - Stress recognition in automobile drivers dataset and MIT-BIH dataset. (3) A thorough performance evaluation of the proposed compression model with a detailed comparison with state-of-the-art deep compressive models in terms of compression ratio, reconstruction error, computational complexity and prediction accuracy.

For validation of the efficacy, we compressed the Stress Recognition in Automobile Drivers dataset from PhysioNet [3] using the proposed SCAElite compression model. This dataset contained physiological signal data such as Electrocardiogram (ECG), Electromyogram (EMG), Galvanic skin response (GSR), Heart rate (HR) and Respiration traces (RESP). The dataset was compressed using SCAElite encoder and the signals were reconstructed from the compressed file using SCAElite decoder. The original volume of dataset was compared with the compressed version to estimate the compression ratio. Root mean squared (RMS) error and Percentage RMS Difference (PRD) were calculated for the signals reconstructed using the decoder. Stress prediction was made over the reconstructed signal data and the accuracy of prediction was compared to that obtained with predictions made on original dataset. Flops were calculated to estimate the computational complexity of the compression model. SCAElite achieves a compression ratio of up to 300 folds, PRD within 8%, prediction accuracy of 98.06% on the reconstructed signals and a reduction in 51.65% in the computational complexity when compared other deep compressive auto-encoders.

For performance comparison with the reference SCAE model [4], SCAElite compression was performed over the MIT-BHI dataset [5] from PhysioNet. ECG signals from the dataset were preprocessed for noise filtering and centralisation and were then compressed using SCAElite encoder. The original volume of dataset was compared with the compressed version to estimate the compression ratio. PRD was calculated for the signals reconstructed using decoder. The model achieved a compression ratio of 106.34, PRD of 8% and a reduction of 51.65% in the computational complexity compared to the SCAE compression model on the same dataset.

The paper is organized as follows: Section 2 provides a review of the existing mHealth applications and compression techniques. Section 3 details the methodology adopted for conducting this research. Section 4 details the SCAElite architecture and the trials conducted for hyperparameter optimization. Section 5 discusses the dataset details. The results in terms of various performance evaluation metrics are presented in Section 6. Section 7 summarizes the results and discusses the efficacy of the model. Section 8 discusses the limitation of the model and concludes the paper.

2. Related work

While COVID-19 spread became a challenge at the global level, the advancements in technology and communication networks contributed in enhancing the global response. Wearable sensor devices and mHealth applications have become more popular due to the continuous remote monitoring capabilities and the facilitated early detection of symptoms and future prediction of a disease. An IoT based smart wearable device called the quarantine band (IoT - Q - Band) was designed to monitor the patient vitals as well as to detect the absconding quarantined subjects [6]. Smart wearable devices were used for designing an IoT framework that would help remotely screen COVID-19 in subjects on the basis of real time data. The evaluation was done by capturing the subject’s heartbeats, body temperature, oxygen saturation (SpO2) and respiration traces (RESP) [7]. Wearable devices were used for analysis of symptoms and the risk factors influencing the COVID-19 virus; generating early predictions on the basis of oxygen saturation (SpO2) and respiration traces (RESP) [8]. A biometric bracelet with infrared thermometer, global positioning system (GPS) module and radio frequency identification (RFID) transferred real time monitoring data to a web based interactive interface helping track the COVID-19 patients real time [9].

Although COVID-19 has accelerated the need of developing a digital framework for remote patient monitoring, devices such as Whoop Strap, VivaLNK Vital Scout, Zephyr BioHarness, Apple Watch, Fitbit, etc. have had a significant impact in the routine lives, improving user experience over time [10], [11], [12]. Biosignals such as electrocardiogram, oxygen level, respiration rate, heart rate, blood pressure, blood glucose etc. can be acquired using wearable devices [13]. Activities such as sleeping, walking, eating, etc. can be recognized tracking hands and legs gestures using smart assistive devices [14]. Apart from healthcare domain, smart home [15], rehabilitation centers [16] as well as skill and working hours assessment in industries [17] make use of wearable devices.

Electrocardiogram (ECG) signal monitoring done at a frequency of 1Mbps generates 80 GBs of data each day for one patient [18]. Applications like fall detection or heart rate monitoring of a patient require instant reporting of data. Other analytical applications like a model for cardiac arrest chances prediction based on ECG pattern or symptom analysis of a disease require storage of data collected over time for future analysis. In mHealth application architecture, both wearable devices and edge devices are battery backed and have a energy and memory constraint [1]. The transmission as well as storage of this huge amount of data generated using wearable devices therefore becomes challenging. Hence, there is a need of an energy efficient data compression technique that would compress the data at edge device so as to minimize total energy consumed in data transmission and volume consumed in storage over cloud [18].

Some of the techniques used for compression are lossless such as the compression using quantizing vectors. It sends codebook indexes instead of original time series [19]. It offers a high compression ratio and low computational complexity but is inefficient for Electrocardiogram (ECG) signal compression due to non-stationarity constraint. A combination of Pulse Code Modulation (PCM), K-means clustering and arithmetic coding techniques can also be used for signal compression [20]. As the demand for robust compression and reconstruction of signals increased, lossy compression algorithms became popular. They offered a high compression with an acceptable quality reconstruction minimizing the power consumed by wearable devices [21], [22].

Lossy compression techniques offer high compression ratio but distortion leads to loss of information that cannot be recovered [23]. Techniques such as Wavelet Transform, Discrete Fourier Transform, Shearlet Transform, and Discrete Cosine Transform were conventionally used for data compression but were later replaced by compressed sensing (CS) based reconstruction techniques [24]. Although these compressed sensing (CS) based techniques were computationally more efficient than wavelet based techniques, they were still complex, took a considerable amount of CPU time for execution [25] and were not suitable for applications requiring reconstruction of real time non-sparse signals. Lossy compression techniques suffer from loss of information and lossless compression techniques suffer from lower compression ratios, striking a trade off. Data driven algorithms such as neural networks have recently found a wide applicability in image compression [26], audio compression [27] as well physiological signals compression [28]. Auto-encoders are neural networks that approximate the identity function such that they are able reconstruct the input signal as much similar with minimum reconstruction error [26]. The study conducted in [27] suggested that Auto-encoders showed reliable compression over Electrocardiogram (ECG) signals. Convolutional Auto-encoders (CAE) are variants of Auto-encoders that produce a good quality restoration of low dimensional signals providing efficient information retrieval [29]. The convolutional neural networks not only preserve the local information but also facilitate expansion of dimensions for extracting features to obtain high level information. They also facilitate reduction in the dimensionality to result in a high ratio compression [4]. Stacked Denoising Auto-encoders (SDAE) is used for reconstruction of Fetal electrocardiogram (FECG) signals [21] from a PhysioNet database. Spindle Convolutional Auto-encoders (SCAE) deep neural network based model compresses the input to a low dimensional code offering a high ratio compression of ECG signals with quality reconstruction [4]. With near lossless compression, accurate predictions can be made over the reconstructed signals. The other way around accuracy of prediction can prove a good estimate of the information loss suffered by a compression model. Deep Convolutional Neural Network, Naïve Bayes, LDA, AdaBoost, and Random Forest were applied for emotion recognition over reconstructed ECG signal [12]. Supervised classifiers such as Support Vector Machine [30] and k-Nearest Neighbors (kNN) [31] were used for emotional stress detection over reconstructed physiological signals. Earlier, these deep compressive neural networks were suitable for deployment on server platforms over cloud only [32]. But, the recent development of the AI chip also known as the Bionic chip or the Neural Processing Unit (NPU) has made deployment of these compute intensive models over mobile devices easy. This has not only reduced response times due to local processing but also reduced the energy consumption increasing battery lives of wearable and edge devices [33].

3. Methodology

In the mHealth application architecture, the subjects were made to wear wearable sensor devices such as smart bracelet, smart belt, smart watch, smart ring, smart shoes etc. These devices measured the subject’s vitals at a preset frequency. The aggregated information was transmitted in the form of signals to the subject’s mobile phone or any other personal digital assistant, acting as the edge device. This wireless communication took place using technologies such as Bluetooth, Zigbee, RFID etc. At the edge device, local conditioning was performed over the measured vitals and if an abnormality was noticed, a notification was sent to the respective healthcare authorities. The data from the edge devices was then relayed via internet to the cloud based medical web server. The data could be displayed on an interactive web interface through which the medical authorities could keep a track of the subject’s health via the medical server. This data can be stored and used for future predictions and analysis in the longer run. As wearable devices and edge devices are battery backed and have energy and memory constraint, we propose compression of the gathered time series - physiological data on the edge device so as to reduce the volume of data gathered thereby reducing the energy consumed in transmission. Fig. 1 depicts the mHealth application architecture.

Fig. 1.

Fig. 1

Proposed architecture.

The proposed SCAElite compression model has two components - convolutional encoder and convolutional decoder. While encoder is deployed on the edge device, decoder is deployed over cloud. Through compression, encoder reduces the data volume enabling energy efficient transmission. On the cloud server, the signals can be reconstructed using the decoder. SCAElite guarantees high compression ratio with a low reconstruction error. As the prediction accuracy over reconstructed signals is higher than that obtained on original dataset, it proves this is a lossless compression approach.

For validation of the efficacy of the model, we compressed the Stress Recognition in Automobile Drivers dataset from PhysioNet [3] using SCAElite compression model. The dataset was compressed using SCAElite encoder and the signals were reconstructed from the compressed file using SCAElite decoder. Its performance was compared with the state-of-the-art deep compressive models such as Fully Connected Neural Network Auto-encoder (FCNN-AE), Convolutional Auto-encoder (CAE) and Spindle Convolutional Auto-encoder (SCAE) [4].

3.1. Performance Metrics

The original volume of dataset was compared with the compressed version to estimate the compression ratio. As stated in [34], compression ratio is defined as follows:

CompressionRatio:CR=C0C1 (1)

where CR is the compression ration achieved, C0 is the size of the original signal and C1 is the size of the reconstructed signal.

RMS error and PRD was calculated for the signals reconstructed using the decoder. PRD helps estimate the compression loss suffered by the model. It includes in its computation the variance in between the original signal and the reconstructed signal. It can be computed using the following formula:

PRD=(G0-GR)2G02*100 (2)

where G0 is the original signal and GR is the reconstructed signal.

Stress prediction was made over the reconstructed signal data and the accuracy of prediction was compared to that obtained with predictions made on original dataset and the accuracies were compared. Fully Connected Neural Network (FCNN) [35] was made use of for stress prediction over the dataset both original as well as the reconstructed one. This neural network had 16 neurons in the input layer, 32 neurons in the hidden layer and 1 neuron in the output layer; binary cross entropy used as a loss function with Adam optimizer, ReLU as the activation function in hidden layers while Sigmoid activation in the output layer; trained for 10 epochs at 0.01 learning rate. Flops were calculated to estimate the computational complexity of the compression model to compare their computational efficiency. SCAElite achieves a compression ratio of up to 300 folds, percentage root mean square difference within 5%, prediction accuracy of 98.06% on the reconstructed signals and a reduction in 51.65% in the computational complexity when compared other deep compressive auto-encoders.

For performance comparison with the reference SCAE model, SCAElite compression was performed over the MIT-BHI dataset [5] from PhysioNet. ECG signals from the dataset were preprocessed for noise filtering and centralisation and were then compressed using SCAElite encoder. The original volume of dataset was compared with the compressed version to estimate the compression ratio. PRD was calculated for the signals reconstructed using decoder. The model achieved a compression ratio of 106.34, a percentage root mean square difference of 8% and a reduction in 51.65% in the computational complexity compared to the SCAE compression model on the same dataset.

4. SCAElite architecture

Spindle Convolution Auto-Encoder Lite (SCAElite) is made up of two components - Convolutional Encoder (CE) and Convolutional Decoder (CD). The hidden layers of convolutional encoder and pooling layers contribute in reducing the size of the input signal. Spindle structure is designed with the objective of obtaining high ratio compression with no information loss. By changing the number of kernels, the structure balances expansion and reduction of signal features. In convolutional encoder, there are 1D convolution layers, max pooling layers, batch normalization and 1*1 convolutional layers [36]. The kernel size is gradually increased in the first few layers to expand dimensionality compressing high level features into codes thereby guaranteeing a good quality compression. Next, kernels are gradually removed reducing their number to merge features representing compressed data, resulting in lower dimension. Hence using convolutional encoder, the input signals are converted to compressed codes which are one kernel outputs. This compression is enhanced by max pooling layer and 1*1 convolutional layers.

Compression:c=g(w,b)(a) (3)

where c is the compressed code, a is the input signal, w is the weight, b is the bias in hidden layers and g(a) is the mapping compression function.

In the convolutional decoder, compressed signal are decoded into features for reconstruction of the original signal. Transposed convolution and max pooling layers are used [37]. There is a linear layer for transformation of features back to original signals. Here again initially dimensions are increased for decoding compressed data and then dimensions are reduced for merging features so as to reconstruct original signal using linear combination.

Decompression:a=f(w,b)(c) (4)

where a is the reconstructed original signal, w is the weight, b is bias, c is the compressed code and f(y) is the mapping reconstruction function.

There are total 17 layers in the architecture. 8 layers in the convolutional encoder (CE) which includes 5 1D convolutional layers initially then one max pooling layer, 1 1D convolutional layer with batch normalization and 1 1D convolutional layer at the end. The convolutional decoder (CD) includes 9 layers namely 2 1D transposed convolutional layers in the beginning followed by a max pooling layer, followed by 5 1D transposed convolutional layers and a linear layer at the end. Fig. 2 shows the overall architecture of the SCAElite model and Table 1 details the model parameters.

Fig. 2.

Fig. 2

SCAElite model architecture.

Table 1.

SCAElite model parameters.

Layer (Type) Output Shape Param#
Input_5 (Input Layer) (None, 300, 1) 0
Conv1D_57 (Conv1D) (None, 300, 2) 4
Conv1D_58 (Conv1D) (None, 300, 4) 132
Conv1D_59 (Conv1D) (None, 300, 4) 516
Conv1D_60 (Conv1D) (None, 300, 2) 514
max_pooling_1D_5 (maxpooling1D) (None, 150, 2) 0
Conv1D_61 (Conv1D) (None, 150, 2) 258
batch_normalization_5 (None, 150, 2) 8
Conv1D_62 (Conv1D) (None, 150, 1) 3
Conv1D_63 (Conv1D) (None, 150, 2) 4
Conv1D_64 (Conv1D) (None, 150, 2) 258
up_sampling1D_9 (None, 300, 2) 0
Conv1D_65 (Conv1D) (None, 300, 4) 260
Conv1D_66 (Conv1D) (None, 300, 4) 260
Conv1D_67 (Conv1D) (None, 300, 1) 5
flatten_5 (flatten) (None, 300) 0
dense_5 (dense) (None, 300) 90300
Total Parameters: 92522 Trainable Parameters: 92518 Non-Trainable Parameters: 4

4.1. Hyperparameter optimization

Using Spindle Convolutional Auto-Encoder [4] as the reference point, we attempted to propose a computationally effective thereby energy efficient model for compression of physiological signals captured using wearable devices, in mHealth applications. In the architecture of a deep neural network, batch normalization provides many different improvements such as speeding up the time spent in training [38]. It also improves the accuracy achieved over the same neural network. Another popular practice is the idea of cutting down certain weights on the basis of their importance. This is technically called pruning. Using pruning, if anything below a certain threshold is removed, it leads to a reduction in the number of parameters with a little impact on the accuracy [38]. This was seen when pruning was performed over Alexnet where parameters got reduced by 9 times and over VGGnet where parameters got reduced by 13 times [38].

There were 8 trials attempted during the phase of experimentation. Table 2 shows the architecture of each trial. Of these trials, trial 4 was the one that proved the best. The number of layers in encoder as well as decoder were 12. Activation function used was tanh and the kernel size increased with subsequent layers while making the forward pass. This model was trained for 2000 epochs.

Table 2.

Architecture details for trials conducted.

Trials Architecture Details
Trial 1 Same architecture as SCAE [4] but without Batch Normalization
Trial 2 Same architecture as SCAE but with increased filters in the first three layers (4,8,8) respectively
Trial 3 First 3 layers same as SCAE but layer 4 and layer 5 with 4,4 filters
Trial 4 First 4 layers same as SCAE. Layer 5 with 2 filters, layer 6 with 2 filters and kernel size 64. Subsequent layers with batch normalization. The final layer has 2 filters with 32 kernel size
Trial 5 First 2 layers and last 3 layers same as SCAE but two more layers with 191 filters added in between
Trial 6 Same architecture as Trial 4, but without Batch Normalization
Trial 7 Same architecture as Trial 4, but removed the middle layers with 191 filters
Trial 8 Same architecture as Trial 4, but added a layer with 64 filters after layer 7

5. Dataset

5.1. Stress recognition in automobile drivers dataset

Stress recognition in automobile drivers dataset published on PhysioNet [3] was used for validation of the efficacy of the proposed compression model. A wearable sensor body network was created to gather physiological signals of 17 different drivers while they drove on a predefined route in and around Boston, Massachusetts for a span of 50 to 90 min. The dataset had 9 attributes: elapsed time, driver, electrocardiogram (ECG), electromyography reading (EMG), foot galvanic skin response (GSR), hand galvanic skin response (GSR), heart rate (HR), marker and respiration traces (RESP). Table 3 depicts sample tuples from Driver 6’s dataset.

  • Elapsed Time – Represents the time stamp of the recorded signals.

  • Driver – Identifies each driver uniquely.

  • ECG – Electrocardiogram Signal of the driver. For normal ECG rates, the heart must be beating at a regular sinus rhythm in between 60 to 100 beats per minute. More specifically, it is 82 beats per minute. Intervals such as P wave must have a normal duration of less than or equal to 0.11 s, PR interval must be between 0.12 to 0.20 s, QRS complex in duration less than 0.12 s, ST segment not depressed more than 0.5 mm, T wave in same direction as QRS complex and around 5 to 6 limbs leads and QT interval in duration less than or equal to 0.40 s for males and 0.44 s for females [39].

  • EMG – Electromyography signal for muscles and nerve cells of the driver. Nerve conduction velocity between 50 to 60 meters per second is normal [40].

  • Foot GSR – Galvanic skin responses of the driver’s foot. GSR refers to the changes in sweat glands that indicate the intensity of a person’s emotional state which is also known as emotional arousal [34].

  • Hand GSR – Galvanic skin responses of the driver’s hand. Something scary, joyful or threatening is in environment leads to change of emotional state. Both negative and positive things can lead to this change in state. GSR hence detects arousal but does not measure the intensity of it [34].

  • HR – Driver’s heart rate. The normal range is 60 to 100 beats per minute for healthy adults [41].

  • Marker – Acts as a label depicting the level of stress the driver undergoes. After recording these biosignals, the drivers were put through a questionnaire that helped label the records. As per the dataset, there are 3 different labels as marked by the drivers wiz low stress, moderate stress and high stress [3]. The labels in marker attribute can be explained as:
    • 1.
      The drivers are in “low stress” while at rest (before or after driving).
    • 2.
      On highways, were there is average amount of traffic, the drivers are in “moderate stress”.
    • 3.
      In city areas, where there are busy streets with a lot of traffic, the drivers are in “high stress”.
  • RESP – Respiration traces of the driver. Normal respiration rate for a healthy adult is 12 to 16 breaths per minute [41].

Table 3.

Stress recognition in automobile driver’s dataset.

Elapsed Time ID ECG EMG Foot GSR Hand GSR HR Marker RESP
‘04.96’ 6 −0.08 0.18 8.9 18.9 93 12.34 36.8
‘04.96’ 6 −0.08 0.18 8.9 18.9 93 12.34 36.8
‘07.29’ 6 −0.07 0.20 8.8 18.9 95 12.29 34.2
‘07.41’ 6 −0.09 0.18 8.8 18.8 95 12.29 34.2
‘07.49’ 6 −0.10 0.16 8.8 18.8 95 12.28 34.0
‘04.97’ 6 −0.09 0.26 8.9 18.9 93 12.26 36.9
‘04.97’ 6 −0.09 0.26 8.9 18.9 93 12.26 36.9

5.2. MIT-BIH Arrhythmia dataset

MIT-BIH Arrhythmia is a biological signal dataset available over PhysioNet [5]. It contains 48 half-hour excerpts of two channel ambulatory ECG readings. This is obtained from 47 subjects. 23 readings from a set of 4000 24-h ambulatory readings were chosen at random. These were collected from a mixed population of inpatients and outpatients at the Boston’s Beth Israel Hospital. Rest 25 readings chosen belonged to the same set but were the ones with lesser common yet clinically significant arrhythmia that could not be captured in the random sample chosen earlier. To compare SCAElite with the reference model SCAE, ECG signals from the MIT-BIH arryhthmia database from PhysioNet [5] were used. These records were all sampled at 360 Hz with 11 bit/sample resolution. Signals were preprocessed for noise removal and centralisation, to achieve a higher compression ratio [4]. SCAElite compression model was applied over the dataset and the performance was evaluated in terms of compression ratio and reconstruction error so as to compare its performance with the state-of-the-art compression model SCAE [4].

6. Results

The training loss history for the FCNN-AE compression model, CAE compression model, SCAE compression model and the proposed SCAElite compression model over the Stress recognition in automobile drivers dataset is represented in the plots in Fig. 3 . The plots include training and test (validation) loss per epoch for each of these models respectively.

Fig. 3.

Fig. 3

Training loss for compression models.

6.1. Compression Ratio (CR)

The compression ratio achieved by FCNN-AE is 5.8, CAE is 31.75, SCAE is 300 and SCAElite is 300. Fig. 4 shows a comparison of the compression ratio achieved by these compression models over the PhysioNet dataset [3].

Fig. 4.

Fig. 4

Comparison of Compression Ratios (CR) of various compression models.

6.2. Computational complexity

The computational complexity of these compression models is measured in terms of Flops. Flop is a basic unit of computation which gives an estimate of how computationally heavy an algorithm is [41]. The computational complexity gives us an estimate of the energy consumed in the execution of an algorithm. FCNN-AE runs for 6065 Flops, CAE for 32806669 Flops, SCAE [4] for 379807 Flops while our proposed SCAElite runs for 184448 Flops. Fig. 5 shows a comparison between computational complexities of these models over the PhysioNet dataset [3].

Fig. 5.

Fig. 5

Comparison of computational complexities of various compression models.

6.3. Prediction accuracy

Stress prediction was made using a feed forward neural network (FCNN) with 16 neurons in the input layer, 32 neurons in the hidden layer and 1 neuron in the output layer; binary cross entropy used as a loss function with Adam optimizer, ReLU as the activation function in hidden layers while Sigmoid activation in the output layer; trained for 10 epochs at 0.01 learning rate [35]. The accuracy achieved over the original dataset was 97.10%. Predictions were then made over the reconstructed signals of different compression models. With FCNN-AE reconstructed signals, the accuracy achieved was is 81.10%, with CAE it was 97.99%, with SCAE it was 98.01% and with our proposed SCAElite was 98.06%. Fig. 6 shows a comparison of the accuracies of these compression models. Fig. 7 shows the training and testing accuracies achieved over the reconstructed signals for each model.

Fig. 6.

Fig. 6

Comparison of accuracies for various compression models achieved using FCNN [35] for st.ress prediction.

Fig. 7.

Fig. 7

Accuracies for various compression models achieved using FCNN [35] for st.ress prediction.

6.4. Reconstruction error

The reconstruction error suffered by a compression model must be as low as possible. FCNN-AE suffers a reconstruction error of 15%, CAE of 9.3% while both SCAE and SCAElite suffer 8% reconstruction error. These are average PRD values calculated after conducting the same experiment with different number of samples. Fig. 8 shows a comparison of the reconstruction error suffered by these compression models over the PhysioNet dataset.

Fig. 8.

Fig. 8

Reconstruction Errors of various compression models.

6.5. MIT-BIH dataset results

The compression ratio achieved by SCAE is 106.45 [4] while that achieved by SCAElite is 106.34. Both SCAE and SCAElite have a low percentage root mean square difference of 8% and the computational complexity of SCAElite is 51.65% less compared to SCAE. Fig. 9 shows the training loss history of SCAElite model over the MIT-BIH dataset. The plot includes training and test (validation) loss per epoch. Table 4 shows performance comparison of SCAE and SCAElite models over the MIT-BIH dataset in terms of compression ratio, reconstruction error and computational complexity.

Fig. 9.

Fig. 9

Training loss for SCAElite compression models.

Table 4.

SCAE versus SCAElite over MIT-BIH dataset.

Performance metrics SCAE SCAElite
Compression ratio 106.45 106.34
Reconstruction error 8% 8%
Computational Complexity (flops) 379807 184448

7. Discussion

With the objective of achieving high quality compression, SCAElite compression model has been proposed. The model guarantees a high compression ratio with good quality restoration capabilities. It provides a more compact architecture compared to the state-of-the-art deep compressive model as it has lesser number of parameters with similar performance in terms of accuracy, compression ratio and reconstruction error. While the architecture ensures good quality compression, the less number of parameters assure there is no overfitting.

On implementing these compression models over the Stress Recognition in Automobile Drivers dataset from PhysioNet, the compression ratio achieved by SCAElite is higher compared to FCNN-AE and CAE while it is equal to the compression ratio achieved by SCAE. The proposed SCAElite model like other compression models minimises root mean square error. The RMS error suffered by FCNN-AE was 0.03, CAE is 0.0072 and SCAE is 0.0063. SCAElite suffers RMS of 0.0058. The stress prediction accuracy achieved over the original dataset was 97.10% while that achieved over the SCAElite compressed - reconstructed dataset is 98.06%. This is due to the pre-processing steps involved before compression, that include removal of noise from the signals. The prediction accuracy being more than that achieved over the original dataset confirms of no information loss making this a near lossless compression. Table 5 summarizes the performance evaluation metrics for all the compression models experimented over the Stress Recognition in Automobile Drivers dataset from PhysioNet. Over the MIT-BIH dataset, the SCAElite compression model achieves a compression ratio of 106.34 and a low reconstruction error of 8%. Table 4 shows the comparison between SCAE and SCAElite in terms of the above performance metrics over the MIT-BIH dataset.

Table 5.

Performance evaluation over Stress Recognition dataset.

No. Approach Compression Ratio Accuracy Flops Error
1 FCNN Auto-Encoder 5.8 81.1% 6065 15%
2 Convolutional Auto-Encoder 31.75 97.99% 32806669 9.3%
3 Spindle Convolutional Auto-Encoder (SCAE) 300 98.01% 379807 8%
4 SCAElite 300 98.06% 184448 8%

The Green500, November 2020 report [42] states that over a battery backed device like a smart wearable watch or an edge device like a mobile phone, one watt energy is consumed per 15.740 Gflops of an algorithm.

Gflops=Flops109 (5)

where Flops are the basic units of computation that an algorithm runs for.

1watt=15.740Gflops (6)

where watt is the unit of energy. A single run of SCAE convolutional encoder [4] consumes 379807 Flops while that of our proposed SCAElite model consumes 184448 Flops. As per these statistics, the energy consumption calculation for SCAE and SCAElite models is presented as under:

For SCAE

Gflops=379807109 (7)
watts=15.740379807*10-9=2.42*10-5 (8)

For SCAElite

Gflops=184448109 (9)
watts=15.740184448*10-9=1.17*10-5 (10)

As per the calculations, energy consumed in a single run of the convolutional encoder of SCAE [4] is 2.42 * 10-5 watts while that consumed by SCAElite is 1.17 * 10-5 watts. This proves that the energy consumed in a single run of the convolutional encoder of SCAE is 51.65% more than that consumed by our proposed SCAElite model. Hence, although in terms of compression ratio, reconstruction error and prediction accuracy, SCAElite gave performance equivalent to SCAE over both the datasets, it proved 51.65% more computationally efficient compared to SCAE, thereby consuming 51.65% less energy.

8. Conclusion

The paper proposes a computationally effective and energy efficient compression model for physiological signal compression in mHealth applications. SCAElite compression model provides high-ratio compression with high quality restoration. The model was experimentally validated by stress recognition in automobile drivers dataset as well as MIT-BIH dataset from PhysioNet. On the stress recognition dataset SCAElite achieves a compression ratio of 300 and on MIT-BIH 106.34. The model provides a low PRD of 8%. SCAElite thus proves to be a promising compression model for physiological signal compression. The computational complexity of SCAElite is 51.65% less compared to that of state-of-the-art deep compressive model. This makes it suitable for wearable devices and edge devices that have power budgets in microwatts or milliwatts ranges. With the use of “AI chips”, implementation of this deep neural network based compression model on edge devices will become convenient. It will then help make the mHealth application architecture energy efficient. A downside of this model is that it is difficult designing a generic model. Thus in future work, there is a need of optimizing the model architecture to result in a generic compression model with guaranteed quality.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  • 1.V. Vadori, E. Grisan, M. Rossi, Biomedical signal compression with time-and subject-adaptive dictionary for wearable devices, in: 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP), IEEE, 2016, pp. 1–6.
  • 2.Hui M. Hong kong is using tracker wristbands to geofence people under coronavirus quarantine. Quartz. 2020;4:1–9. [Google Scholar]
  • 3.A.L. Goldberger, L.A. Amaral, L. Glass, J.M. Hausdorff, P.C. Ivanov, R.G. Mark, J.E. Mietus, G.B. Moody, C.-K. Peng, H.E. Stanley, Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals, circulation 101 (23) (2000) e215–e220. [DOI] [PubMed]
  • 4.Wang F., Ma Q., Liu W., Chang S., Wang H., He J., Huang Q. A novel ecg signal compression method using spindle convolutional auto-encoder. Computer methods and programs in biomedicine. 2019;175:139–150. doi: 10.1016/j.cmpb.2019.03.019. [DOI] [PubMed] [Google Scholar]
  • 5.Moody G.B., Mark R.G. The impact of the MIT-BIH arrhythmia database. IEEE Engineering in Medicine and Biology Magazine. 2001;20(3):45–50. doi: 10.1109/51.932724. [DOI] [PubMed] [Google Scholar]
  • 6.V. Singh, H. Chandna, A. Kumar, S. Kumar, N. Upadhyay, K. Utkarsh, IoT-Q-Band: a low cost internet of things based wearable band to detect and track absconding COVID-19 quarantine subjects, EAI Endorsed Transactions on Internet of Things 6 (21).
  • 7.Mukhtar H., Rubaiee S., Krichen M., Alroobaea R. An IoT framework for screening of COVID-19 using real-time data from wearable sensors. International Journal of Environmental Research and Public Health. 2021;18(8):4022. doi: 10.3390/ijerph18084022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.R. Karthickraja, R. Kumar, S. Kirubakaran, R. Manikandan, et al., COVID-19 prediction and symptom analysis using wearable sensors and IoT, International Journal of Pervasive Computing and Communications.
  • 9.Weizman Y., Tan A.M., Fuss F.K. Use of wearable technology to enhance response to the coronavirus (COVID-19) pandemic. Public Health. 2020;185:221. doi: 10.1016/j.puhe.2020.06.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Seshadri D.R., Davies E.V., Harlow E.R., Hsu J.J., Knighton S.C., Walker T.A., Voos J.E., Drummond C.K. Wearable sensors for COVID-19: a call to action to harness our digital infrastructure for remote patient monitoring and virtual assessments. Frontiers in Digital Health. 2020;2:8. doi: 10.3389/fdgth.2020.00008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Vilarinho T., Farshchian B., Bajer D.G., Dahl O.H., Egge I., Hegdal S.S., Lønes A., Slettevold J.N., Weggersen S.M. 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing. IEEE; 2015. A combined smartphone and smartwatch fall detection system; pp. 1443–1448. [Google Scholar]
  • 12.Santamaria-Granados L., Munoz-Organero M., Ramirez-Gonzalez G., Abdulhay E., Arunkumar N. Using deep convolutional neural network for emotion detection on a physiological signals dataset. amigosIEEE Access. 2018;7:57–67. [Google Scholar]
  • 13.J.A.M. Correa, M.K. Abadi, N. Sebe, I. Patras, Amigos: A dataset for affect, personality and mood research on individuals and groups, IEEE Transactions on Affective Computing.
  • 14.Avci A., Bosch S., Marin-Perianu M., Marin-Perianu R., Havinga P. 23th International conference on architecture of computing systems 2010, VDE. 2010. Activity recognition using inertial sensing for healthcare, wellbeing and sports applications: A survey; pp. 1–10. [Google Scholar]
  • 15.Rashidi P., Cook D.J. Keeping the resident in the loop: Adapting the smart home to the user. IEEE Transactions on systems, man, and cybernetics-part A: systems and humans. 2009;39(5):949–959. [Google Scholar]
  • 16.Patel S., Park H., Bonato P., Chan L., Rodgers M. A review of wearable sensors and systems with application in rehabilitation. Journal of neuroengineering and rehabilitation. 2012;9(1):1–17. doi: 10.1186/1743-0003-9-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kranz M., Möller A., Hammerla N., Diewald S., Plötz T., Olivier P., Roalter L. The mobile fitness coach: Towards individualized skill assessment using personalized mobile devices. Pervasive and Mobile Computing. 2013;9(2):203–215. [Google Scholar]
  • 18.Al-Sa’D M.F., Tlili M., Abdellatif A.A., Mohamed A., Elfouly T., Harras K., O’Connor M.D., et al. A deep learning approach for vital signs compression and energy efficient delivery in mhealth systems. IEEE Access. 2018;6:33727–33739. [Google Scholar]
  • 19.Hooshmand M., Zordan D., Del Testa D., Grisan E., Rossi M. Boosting the battery life of wearables for health monitoring through the compression of biosignals. IEEE Internet of Things Journal. 2017;4(5):1647–1662. [Google Scholar]
  • 20.Hejrati B., Fathi A., Abdali-Mohammadi F. Efficient lossless multi-channel eeg compression based on channel clustering. Biomedical Signal Processing and Control. 2017;31:295–300. [Google Scholar]
  • 21.Muduli P.R., Gunukula R.R., Mukherjee A. 2016 Twenty Second National Conference on Communication (NCC) IEEE; 2016. A deep learning approach to fetal-ecg signal reconstruction; pp. 1–6. [Google Scholar]
  • 22.Zhang Z., Jung T.-P., Makeig S., Rao B.D. Compressed sensing for energy-efficient wireless telemonitoring of noninvasive fetal ecg via block sparse bayesian learning. IEEE Transactions on Biomedical Engineering. 2012;60(2):300–309. doi: 10.1109/TBME.2012.2226175. [DOI] [PubMed] [Google Scholar]
  • 23.Sayood K. Morgan Kaufmann; 2017. Introduction to data compression. [Google Scholar]
  • 24.Mamaghanian H., Khaled N., Atienza D., Vandergheynst P. Compressed sensing for real-time energy-efficient ecg compression on wireless body sensor nodes. IEEE Transactions on Biomedical Engineering. 2011;58(9):2456–2466. doi: 10.1109/TBME.2011.2156795. [DOI] [PubMed] [Google Scholar]
  • 25.Dixon A.M., Allstot E.G., Gangopadhyay D., Allstot D.J. Compressed sensing system considerations for ecg and emg wireless biosensors. IEEE Transactions on Biomedical Circuits and Systems. 2012;6(2):156–166. doi: 10.1109/TBCAS.2012.2193668. [DOI] [PubMed] [Google Scholar]
  • 26.Toderici G., Vincent D., Johnston N., Jin Hwang S., Minnen D., Shor J., Covell M. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017. Full resolution image compression with recurrent neural networks; pp. 5306–5314. [Google Scholar]
  • 27.Reynolds R., Dennis S., Hasan I., Slewa J., Chen W., Tian D., Bobba S., Zwar N. A systematic review of chronic disease management interventions in primary care. BMC family practice. 2018;19(1):1–13. doi: 10.1186/s12875-017-0692-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sriraam N., Eswaran C. Performance evaluation of neural network and linear predictors for near-lossless compression of eeg signals. IEEE Transactions on Information Technology in Biomedicine. 2008;12(1):87–93. doi: 10.1109/TITB.2007.899497. [DOI] [PubMed] [Google Scholar]
  • 29.Hinton G.E., Zemel R.S. Autoencoders, minimum description length, and helmholtz free energy. Advances in neural information processing systems. 1994;6:3–10. [Google Scholar]
  • 30.L. Zhao, L. Yang, H. Shi, Y. Xia, F. Li, C. Liu, Evaluation of consistency of hrv indices change among different emotions, in: 2017 Chinese Automation Congress (CAC), IEEE, 2017, pp. 4783–4786.
  • 31.M.B.H. Wiem, Z. Lachiri, Emotion classification in arousal valence model using mahnob-hci database, International Journal of Advanced Computer Science and Applications 8 (3).
  • 32.Verhelst M., Moons B. Embedded deep neural network processing: Algorithmic and processor techniques bring deep learning to IoT and edge devices. IEEE Solid-State Circuits Magazine. 2017;9(4):55–65. [Google Scholar]
  • 33.A. Roven, What’s the deal with ai chips in the latest smartphones? (Nov 2018). url:https://cutt.ly/tWV7yQ0.
  • 34.B. Farnsworth, What is gsr (galvanic skin response) and how does it work (2018).
  • 35.Azar J., Makhoul A., Barhamgi M., Couturier R. An energy efficient IoT data compression approach for edge machine learning. Future Generation Computer Systems. 2019;96:168–175. [Google Scholar]
  • 36.LeCun Y., Bengio Y., Hinton G. Deep learning. Nature. 2015;521(7553):436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
  • 37.Zeiler M.D., Taylor G.W., Fergus R. 2011 International Conference on Computer Vision. IEEE; 2011. Adaptive deconvolutional networks for mid and high level feature learning; pp. 2018–2025. [Google Scholar]
  • 38.Huang R., Pedoeem J., Chen C. 2018 IEEE International Conference on Big Data (Big Data) IEEE; 2018. Yolo-lite: a real-time object detection algorithm optimized for non-gpu computers; pp. 2503–2510. [Google Scholar]
  • 39.Dickinson D.F. The normal ecg in childhood and adolescence. Heart. 2005;91(12):1626–1630. doi: 10.1136/hrt.2004.057307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Soudmand R., Ward L.C., Swift T.R. Effect of height on nerve conduction velocity. Neurology. 1982;32(4) doi: 10.1212/wnl.32.4.407. 407–407. [DOI] [PubMed] [Google Scholar]
  • 41.Aylott M. Assessment of temperature, pulse and respiration. Clinical Skills in Child Health Practice. 2008:70. [Google Scholar]
  • 42.J. Makino, Analysis of past and present processors, in: Principles of High-Performance Processor Design, Springer, 2021, pp. 65–94.

Articles from Biomedical Signal Processing and Control are provided here courtesy of Elsevier

RESOURCES