Skip to main content
Sensors (Basel, Switzerland) logoLink to Sensors (Basel, Switzerland)
. 2022 Oct 18;22(20):7913. doi: 10.3390/s22207913

Deep Learning-Based Energy Expenditure Estimation in Assisted and Non-Assisted Gait Using Inertial, EMG, and Heart Rate Wearable Sensors

João M Lopes 1,2, Joana Figueiredo 1,2,*, Pedro Fonseca 3, João J Cerqueira 4, João P Vilas-Boas 3,5, Cristina P Santos 1,2
Editor: Carlo Ricciardi
PMCID: PMC9607229  PMID: 36298264

Abstract

Energy expenditure is a key rehabilitation outcome and is starting to be used in robotics-based rehabilitation through human-in-the-loop control to tailor robot assistance towards reducing patients’ energy effort. However, it is usually assessed by indirect calorimetry which entails a certain degree of invasiveness and provides delayed data, which is not suitable for controlling robotic devices. This work proposes a deep learning-based tool for steady-state energy expenditure estimation based on more ergonomic sensors than indirect calorimetry. The study innovates by estimating the energy expenditure in assisted and non-assisted conditions and in slow gait speeds similarly to impaired subjects. This work explores and benchmarks the long short-term memory (LSTM) and convolutional neural network (CNN) as deep learning regressors. As inputs, we fused inertial data, electromyography, and heart rate signals measured by on-body sensors from eight healthy volunteers walking with and without assistance from an ankle-foot exoskeleton at 0.22, 0.33, and 0.44 m/s. LSTM and CNN were compared against indirect calorimetry using a leave-one-subject-out cross-validation technique. Results showed the suitability of this tool, especially CNN, that demonstrated root-mean-squared errors of 0.36 W/kg and high correlation (ρ > 0.85) between target and estimation (R¯2 = 0.79). CNN was able to discriminate the energy expenditure between assisted and non-assisted gait, basal, and walking energy expenditure, throughout three slow gait speeds. CNN regressor driven by kinematic and physiological data was shown to be a more ergonomic technique for estimating the energy expenditure, contributing to the clinical assessment in slow and robotic-assisted gait and future research concerning human-in-the-loop control.

Keywords: artificial intelligence, deep learning, energy expenditure, gait rehabilitation, human-in-the-loop, robotics-based rehabilitation, wearable sensors

1. Introduction

Gait disabilities are among the most frequent disabilities in European countries [1]. Either caused by aging or cardiovascular and/or neurological disorders, impaired gait strongly affects the walking energetic efficiency of persons [2]. Therefore, energy expenditure has gained importance in gait rehabilitation, being a golden marker of gait quality and a primary outcome for evaluating manual or robotics-assisted therapies. It is usually assessed by evaluating the exchanges of oxygen consumption (V.O2) and carbon dioxide production (V.CO2) through indirect calorimetry [3], using wearable gas analyzers. Then, these exchanges of gas are commonly translated into energy using the Brockway’s equation [4].

Powered assistive devices, namely exoskeletons and orthoses, guide users for better and faster functional recovery by introducing assist-as-needed control strategies [5,6,7,8,9]. Energy expenditure has been used in several studies to assess the effectiveness of using such assistive devices [10,11,12,13,14]. For instance, Awad et al. [12] demonstrated that using powered assistive devices in the gait rehabilitation of post-stroke survivors improves their energy efficiency by 10% regarding unpowered assistance, which the authors found to be equivalent to a 32% reduction in the metabolic cost associated with post-stroke walking.

Recent trends point toward the use of exoskeletons and orthoses to reduce the energetic effort and improve the ability of impaired subjects to independently perform activities of daily living through the human-in-the-loop control [15]. This assist-as-needed control strategy evokes personalized and user-oriented assistance [16] by using the energy expenditure as a cost function for optimizing the robotic assistance in real-time [16,17,18]. Thus, the human-in-the-loop control requires accurate and timely energy expenditure estimates to effectively tailor the robotic assistance to reduce subject’s energy expenditure. However, the use of indirect calorimetry may be a problem for control purposes, given its noisy and delayed data [17], hindering the optimization process. Moreover, it is not the most ergonomic setup to use in persons with disabilities since it entails a minimum of invasiveness that should be avoided and may increase their energetic effort.

To overcome these disadvantages, new approaches involving artificial intelligence (AI) algorithms are being studied to attain generalized regression models that timely estimate energy expenditure using easy-to-obtain inputs. These inputs are obtained using smaller and more ergonomic sensors that feed data to regressors, such as linear regression models and neural networks, to replace the use of gas analyzers, which gives noisy and delayed data and can be uncomfortable in long term use. For instance, Ingraham et al. [19] presented a linear regression model to estimate energy expenditure of level, incline, and backward walking, running, cycling, and stair climbing. The authors used multimodal data tracked by wearable sensors, namely breath rate (BR), minute ventilation (MV), heart rate (HR), oxygen saturation (SpO2), acceleration, electrodermal activity, skin temperature, and electromyography (EMG). They have shown reasonable performance, presenting root-mean-squared errors around 1.0 W/kg for the best model. Beltrame et al. [20] used HR, BR, MV, hip acceleration, walking cadence, and HR variation (∆HR) as estimators for a random forest regression model. The results revealed a high correlation with the ground truth signal (r ≥ 0.69). Beltrame et al. [21] also explored the feasibility of a single multilayer perceptron (MLP) to estimate oxygen consumption using the HR, gait speed and grade, rest and activity time, gender, and body mass. The authors found a high correlation between the estimated and target signal (r > 0.90). Zhu et al. [22] explored the potential of a convolutional neural network (CNN) to estimate the energy expenditure when compared to MLP and an activity-specific linear regressor, and the authors found an improvement of more than 30% when using deep learning regressors.

However, the study of AI algorithms’ effectiveness for estimating the energy expenditure at slow gait speeds remains underexplored. These slow gait speeds, often observed in persons with gait disabilities (gait speed ranges from 0.15 to 1.0 m/s, with an average of 0.46 m/s [23]), can entail lower variance in energy expenditure, hindering the estimation process. Moreover, there is still limited evidence of AI algorithms’ feasibility to estimate energy expenditure during robot-assisted gait. Slade et al. [15] estimated the energy expenditure of healthy users for ankle assisted walking at normal speed (1.25 m/s), using ground reaction force (GRF) and EMG signals as inputs. However, GRF signals require force platforms, which are non-wearable nor a setup often available in rehabilitation centers or hospitals. To the authors’ best knowledge, there is no available regression algorithm that relies only on data from ergonomic on-body sensors to accurately estimate energy expenditure while walking in robotic-assisted and slow gait conditions.

To attain these challenges, this work presents a deep learning regression tool for steady-state energy expenditure estimation using estimators tracked by ergonomic, reliable, and clinical-accepted on-body sensors. Novelty arises from a tool that can estimate energy expenditure in both assisted and non-assisted walking by an ankle exoskeleton while considering slow gait speeds (0.22, 0.33, and 0.44 m/s). For this purpose, we compared two deep learning regression networks, namely the long short-term memory (LSTM) and CNN, attending to the results of Zhu et al. [22], using HR, lower limb kinematic, and EMG data as estimators. This study hypothesizes that a deep learning regressor fed by fused gait-related biomechanical and physiological data enables an accurate and timely energy expenditure assessment in basal, assisted, and non-assisted walking, even for slow speeds. We studied this hypothesis through the leave-one-subject-out cross-validation (LOOCV) technique by comparing the estimation of deep learning tool against indirect calorimetry and verify its feasibility. This study presents two-fold contributions: (i) in terms of clinical assessment, it proposes a more ergonomic and rapid technique to estimate the energy expenditure, which is reliable in slow gait, as commonly observed in gait impaired subjects; (ii) in terms of rehabilitation robotics, it supports future research insights regarding human-in-the-loop control by accurately and efficiently estimating the energy expenditure in robotic-assisted conditions through timely and reliable acquired data.

2. Materials and Methods

The development of the deep learning-based tool required a prior data collection. This was conducted under the ethical procedures of the Ethics Committee in Life and Health Sciences (CEICVS 006/2020), following the Helsinki Declaration and the Oviedo Convention. All participants gave their informed consent to be part of the study. Data were collected at the LABIOMEP, Porto Biomechanics Laboratory, University of Porto.

2.1. Participants

Eight healthy participants (four females and four males, see Table 1) were recruited and accepted to participate in this work. A list of eligibility criteria was outlined to conduct the experimental data collection. All subjects that had: (i) 18 or more years, (ii) body mass within 45 and 90 kg, and (iii) height within 150 and 190 cm were included in the study. The inclusion criteria regarding the anthropometric data were imposed given the exoskeleton’s inherent requirements. The subjects were excluded if they reported any disturbance of locomotion or balance, caused by any known neurological or musculoskeletal injury. Table 1 presents the participants’ detailed anthropometric data.

Table 1.

Participants’ anthropometric data and mean (±standard-deviation) regarding gender.

Participant Gender Age
(years)
Body Height
(cm)
Body Mass
(kg)
S01 F 27 162 53
S02 M 25 182 77
S03 M 25 181 74
S04 F 26 162 65
S05 F 23 167 61
S06 M 23 170 81
S07 M 23 170 76
S08 F 22 174 59
Male 4 24.0 ± 1.2 176 ± 6.7 77.0 ± 2.9
Female 4 24.5 ± 2.4 166 ± 5.7 59.5 ± 5.0

2.2. Instrumentation

The participants were instructed to wear shorts and standard sports shoes for better sensor accommodation. Subsequently, we instrumented each participant with the following sensor setup. First, the participants were instrumented with a wireless Polar H10 (Polar Electro Oy, Kempele, Finland, validated in ref. [24]) at the chest, that was used to monitor the HR. Second, they were instrumented with eight wireless EMG surface electrodes from the TrignoTM Avanti Platform (Delsys, Natick, MA, USA) on the tibialis anterior (TA), gastrocnemius lateralis (GL), bicep femoris (BF), and vastus lateralis (VL) muscles of both legs. The sensors were placed following the Surface ElectroMyoGraphy for the Non-Invasive Assessment of Muscles (SENIAM) recommendations [25], and fixed with a white strap, as illustrated in Figure 1. Third, we instrumented the participants with the wireless motion tracker system MVN Awinda (Xsens Technologies B.V., Enschede, The Netherlands, validated in ref. [26]), placing inertial measurement units (IMUs) on the feet, lower-leg, upper-leg, pelvis, and torso. Each IMU was secured with a black strap, as illustrated in Figure 1. Fourth, each participant was instrumented with a K4b2 metabolic respiratory sensor (COSMED, Rome, Italy, validated in ref. [27]), covering the facial respiratory ways, to measure V.O2 and V.CO2. Lastly, the participants were instrumented with an electrical ankle-foot exoskeleton from the H2-Exoskeleton (Technaid S.L., Madrid, Spain) in the lateral side of the right lower limb. This device provides one degree-of-freedom in the sagittal plane.

Figure 1.

Figure 1

Representation of the setup’s instrumentation on the human body: (a) example of a participant during an assisted walking trial, while wearing all sensor setup (kinematic, EMG, and HR sensors and the respirometer); (b) zoomed-in view of the ankle-foot orthosis from the H2-Exoskeleton during the walking trial; and (c) schematic illustrating the IMUs placement on sternum, pelvis, upper leg and lower leg; the EMG sensors on the vastus lateralis, bicep femoris, tibialis anterior, and gastrocnemius lateralis muscles; the Polar H10 heart rate sensor at the chest; the respirometer, covering the user’s facial respiratory ways; and the ankle-foot orthosis fixed on the shank and foot segments of the right lower limb.

2.3. Experimental Protocol

Immediately after the placement of the EMG sensors, the participants performed three maximum voluntary contractions (MVC) for each muscle to normalize EMG data. For the BF muscle, the participants laid on a stretcher in ventral decubitus position with their knee slightly bent. One researcher immobilized the participants’ shank and asked them to perform maximum knee flexion. Regarding the VL muscle, the participants sat down on the same stretcher performing 90 degrees between thighs and shanks. The researcher immobilized their shank and asked the participants to perform maximum knee extension. For the TA and GL muscles, the participants laid on the stretcher, assuming dorsal decubitus position. To perform the MVC, the researcher immobilized the participants’ foot and asked them to perform maximum dorsiflexion of the ankle articulation, in the case of TA muscle, and maximum plantar flexion, in the case of GL muscles. This procedure was repeated for both legs. Afterward, the MVN Awinda sensors were placed and the MVN biomechanical model was calibrated by following the software guidelines: each participant assumed the N-pose, which refers to a neutral position of body segments, in upright position, looking forward with the two arms stretched and hands near the thighs. Each participant held this position for four seconds, and then walked forward in a normal fashion, turned, and walk backwards to the initial position. Subsequently, some anthropometric data, namely gender, height, and body mass, were introduced in the respirometer. After this, each participant experienced a one-day protocol consisting of two sessions: one session with and another without the orthotic device. In both sessions, the subjects performed three walking trials of twelve minutes, one per each gait speed: 0.22, 0.33, and 0.44 m/s. In each trial, the participants were instructed to remain in standing position during the first three minutes to assess the basal energy expenditure, followed by a walking activity on a treadmill that lasted six minutes, and finishing with three minutes in standing position as a recovering period. The participants rested for ten minutes between each trial and/or session. In the assisted session, the ankle-foot exoskeleton assisted according to a position control strategy.

2.4. Data Acquisition

Data acquisition included: (i) V.O2 and V.CO2, in mL/sec, with the K4b2 respiratory sensor breath-by-breath; (ii) the muscular activation of the TA, GL, BF, and VL muscles for both lower limb at 1000 Hz using the 8-channel TrignoTM EMG sensors and the Delsys acquisition software; (iii) the acceleration and angular velocity of lower-leg, upper-leg, feet, pelvis, and torso at 100 Hz using the MVN Awinda; and (iv) the heart rate using a Polar H10. Data acquisition commenced simultaneously for all devices, ensuring time synchronization.

2.5. Data Processing

The energy expenditure (EE), in Watts, was calculated using the V.O2 and V.CO2 following Brockway’s equation [4], depicted in (1).

EE=16.58 V.O2+4.51 V.CO2 (1)

We average each activity (i.e., standing for three minutes, walking for six minutes, and recovering for three minutes) to assess the steady-state energy expenditure considering the last three minutes of each [28]. Due to the high noise of indirect calorimetry, a 95% confidence interval was calculated to eliminate possible outliers. Additionally, the steady-state energy expenditure was normalized by body-mass, as performed in [10,11,12]. Following these processing techniques, a step-like signal of energy expenditure was obtained (Figure 2A–D).

Figure 2.

Figure 2

Example of post-processed data used to estimate the steady-state energy expenditure for one random participant: (A) acceleration; (B) angular velocity; (C) EMG sum; and (D) HR. EE stands for Energy Expenditure, (R/L)LL for Right/Left Lower Leg, (R/L)UL for Right/Left Upper Leg, (R/L)SLL for Right/Left Sum Lower Leg, and (R/L)SUL for Right/Left Sum Upper Leg.

The HR signal was processed by following the same approach. Additionally, the HR was normalized considering the maximum HR expected for the individual considering his/her age, in years.

The kinematic data, namely the acceleration and angular velocity, were first filtered with a fourth-order zero-lag Butterworth filter with a cut-off frequency of 5 Hz [29]. Afterward, the total acceleration and angular velocity (vector’s magnitude) were calculated and low-pass filtered at 0.1 Hz to preserve the low frequencies that belong to the start/stop walking transitions.

EMG signals were processed as follows. First, they were filtered with a zero-lag band-pass filter with cut-off frequencies of 20 and 450 Hz [30,31]. Second, we calculated the envelope using a low-pass filter with a cut-off frequency of 2 Hz and normalized considering the user’s MVC. Third, the signals were low-pass filtered at 0.1 Hz to preserve the low frequencies related to the start/stop walking transitions. Additionally, we calculated the EMG sum (Figure 2B), similar to Ingraham et al. [19], for both lower and upper leg muscles, to have a more general information regarding the muscles’ activation. Figure 2 illustrates an example of the post-processed data that were used to estimate the steady-state energy expenditure.

2.6. AI-Based Regression Models

As regression models, we explored LSTM, suited for sequential data [32], and CNN, due to the reliable performance reported in Zhu et al. [22]. For both models, we fused the following estimators: (i) kinematic data, namely the acceleration (Figure 2A) and angular velocity (Figure 2B) of lower leg, upper leg, feet, pelvis, and torso; (ii) lower and upper leg EMG sum (Figure 2C); (iii) HR (Figure 2D); (iv) gait speed; and (v) anthropometric data, as in [21], as the gender (corresponding to a binary signal), age, and height. The input signals were rescaled between [−1, 1] considering a min-max algorithm.

Regarding the LSTM, we implemented the following architecture. For the first layer, we set the sequence input layer with a sequence length of 300 samples. In the second layer, we implemented one LSTM layer and studied the best number of neurons, considering 10, 50, 100, 150, and 200 neurons. To the best model found in the last step, we studied the effect of adding a second LSTM layer and ranged the number of neurons from 10 to 100 neurons. We introduced one dropout layer (p = 0.5) after each LSTM layer. As the penultimate layer, we added one fully connected layer to the best model found. The last layer is the regression output layer. Figure 3 illustrates an example of a LSTM neural network that was explored in this work.

Figure 3.

Figure 3

LSTM’s architecture of the final model.

For the CNN, we firstly implemented the input layer. At the second layer, we added one convolution layer with 8 filters, and we ranged the filter’s size from 5 [22] to 15 with a resolution of 5. We also tested increasing the number of filters to 16 and introducing a second convolution layer. The filters were initiated with the Glorot initializer and then optimized during the learning process. We used the ReLu layer as the activation function and an average pooling layer of size 2 for each convolution layer. We also introduced one dropout layer (p = 0.5) and one fully connected layer, ranging the number of neurons until a decrease in performance was detected. The last layer is the regression output layer. Figure 4 illustrates one example of CNN configuration that was studied in this work.

Figure 4.

Figure 4

CNN’s architecture of the final model.

For both CNN and LSTM, we used the dropout layer and the L2 regularization method (λ = 0.0001) to prevent the existence of overfitting and gradient vanishing. These regression models were trained with the Adam optimization algorithm using mean-square error (MSE) as the loss function, considering the initial learning rate set to 0.01. We recursively implemented several models by changing the neural networks’ hyperparameters (number of layers, number of neurons of each layer, the introduction of fully connected layers for the LSTM and CNN, and the number of filters and their size for CNN), aiming to find the best model.

Data processing and model implementation were performed using MATLAB® R2019a (MathWorks Inc., Natick, MA, USA) with a machine with an Intel Core i7-3630QM with a maximum clock rate of 2.4 GHz.

2.7. Models’ Evaluation

From the eight participants, we randomly left aside one participant for testing the accuracy of the best model regarding unseen data (test dataset), leaving the remaining seven participants for the training and validation process (training dataset). Given the user-specific variability of the energy expenditure, we implemented a leave-one-subject-out cross-validation technique (LOOCV) with the number of epochs fixed to 1000. This technique enabled us to identify the best model and the respective hyperparameters.

To assess the performance of our models, the following metrics were used, taking the energy expenditure as the ground truth: (i) MSE, (ii) root-mean-squared error (RMSE), (iii) normalized MSE (NMSE), and (iv) Spearman’s correlation coefficient (SCC). The MSE and RMSE were calculated to give a relative accuracy regarding the neural network’s estimation. A value closer to 0 indicates that the neural network performs well. The SCC and NMSE were also evaluated to assess the correlation between target and estimation. A value closer to 1 indicates a perfect fit between both. Additionally, we computed the Bland–Altman plot (enables to compare two measurement techniques, in which the difference between two techniques is plotted against their average) and the linear regression plot with the respective coefficient of determination (R2) to assess, respectively, differences and linearity between energy expenditure estimation using indirect calorimetry or the neural network.

3. Results

3.1. Deep Learning Regression Models Comparison

Table 2 presents the MSE, RMSE, NMSE, and the SCC for the best architectures of LSTM and CNN, and their respective hyperparameters. Regarding the LSTM, we verified that one cell with 150 neurons yields the best model considering this network, with an average MSE of 0.25 W/kg and RMSE of 0.45 W/kg. The NMSE was high (NMSE = 0.67), which shows that this network performs reasonably well in estimating energy expenditure. Increasing the number of neurons to 200 or the number of LSTM layers did not entail an improvement in the energy expenditure estimation, presenting a higher value of MSE (≥0.28 W/kg, RMSE ≥ 0.51 W/kg). The introduction of a fully connected layer also did not improve the tool’s power estimation, presenting a MSE of 0.32 W/kg and RMSE of 0.54 W/kg. Figure 3 shows the best architecture found for the LSTM in this study.

Table 2.

CNN and LSTM accuracy in estimating energy expenditure considering the LOOCV algorithm and the test dataset.

Model Hyperparameters MSE a RMSE b NMSE c SCC d
LOOCV Test LOOCV Test LOOCV Test LOOCV Test
LSTM Neurons 150 0.25
(0.22)
N.A. 0.45
(0.22)
N.A. 0.67
(0.23)
N.A. 0.86
(0.035)
N.A.
Layers 1
Batch size 76
FC Layer 1
CNN Filter size 10 0.14
(0.10)
0.19 0.36
(0.13)
0.44 0.79
(0.12)
0.71 0.87
(0.043)
0.87
No. filters 2 × 8
Batch size 22,800
FC Layer 10

a Mean-Square Error; b Root-Mean-Square Error; c Normalized Mean-Square Error; d Spearman’s Correlation Coefficient.

Regarding CNN, the best model was composed of two convolutional layers of 8 filters of size 10, followed by a fully connected layer of 10 neurons, as shown in Figure 4. The best model presented an average MSE of 0.14 W/kg and RMSE of 0.36 W/kg, being more accurate when compared with the LSTM. With the hyperparameters tuned, it was verified that one convolution layer instead of two, regardless of the filter size (5, 10, or 15) or the number of filters (8, 16), entailed a higher MSE (≥0.19 W/kg, RMSE ≥ 0.41 W/kg). Moreover, we observed that increasing the number of neurons of the fully connected layer to 50 did not improve the estimation’s performance (MSE = 0.22 W/kg, RMSE = 0.44 W/kg).

3.2. Best Model Performance

3.2.1. LOOCV Performance Analysis

Figure 5A illustrates the energy expenditure estimation considering the best model found in this work (CNN with hyperparameters of Table 2) for the worst, medium, and the best prediction subject of the LOOCV algorithm. Figure 5A shows that the CNN has a good capacity to estimate the steady-state energy expenditure and to discriminate between basal and walking energy expenditure. This neural network was able to associate higher energy when the participants were walking, as we can inspect in Figure 5A. This result is supported by a high and positive SCC (above 0.80).

Figure 5.

Figure 5

(A) energy expenditure (EE) estimation for the worst, medium, and the best prediction subject of LOOCV algorithm, marking assisted and non-assisted gait, and basal and walking energy expenditure; (B) Bland-Altman plot of target vs. estimation for the worst, medium, and the best prediction subject; and (C) linear regression plot with the coefficient of determination (R2) for the worst, medium, and the best prediction subject of LOOCV algorithm.

These results are in accordance with Figure 5B,C, illustrating, respectively, the Bland–Altman and the linear regression plots for the worst, medium, and the best subject of the LOOCV. From Figure 5B, we verify that most of the error’s dispersion is within the 95% confidence interval, with a bias closer to 0. The worst subject presented a positive bias of 0.48 W/kg, the medium subject presented a negative bias of −6.5 × 10−2 W/kg, and the best subject presented a positive bias of 6 × 10−2 W/kg. From Figure 5C, the subject with the worst prediction presented an R2 of 0.73, with the linear fit (Figure 5C, black line) slightly deviated from the ideal line (Figure 5C, red line). The subject with medium prediction and the subject with the best prediction presented a best linear fit, closer to the ideal line, with R2 of 0.76 and 0.95, respectively. The mean value of R2 was found to be 0.79 (R¯2 = 0.79 ± 0.12), indicating a high fit regarding the target energy expenditure.

3.2.2. Performance Analysis in New Data

We also evaluated the best model performance regarding unseen data (test dataset). It was verified that it estimated reasonably well the test dataset, as illustrated in Figure 6, presenting an MSE of 0.19 W/kg and an RMSE of 0.44 W/kg. The NMSE and the SCC were positive and above 0.7.

Figure 6.

Figure 6

CNN estimation for the novel subject.

3.2.3. Dependency on Gait Speed and Walking Condition

We investigated if the CNN accuracy depends on gait speed and walking condition (i.e., assisted vs. non-assisted walking). For this analysis, the MSE for each of the three gait speeds was evaluated and considering the trials in which the user walked with and without the ankle-foot exoskeleton. The MSE values were used to create a heatmap, showing the error’s dispersion, as illustrated in Figure 7. By analyzing Figure 7, the CNN presented similar values of MSE for both assisted and non-assisted walking and considering the three gait speeds. The MSE was considered low (≤0.17 W/kg) for all conditions.

Figure 7.

Figure 7

Variation of MSE regarding the gait speed (0.22, 0.33, and 0.44 m/s) and walking condition (assisted vs. non-assisted gait).

4. Discussion

This work presents and validates a deep learning-based tool for steady-state energy expenditure estimation without relying on indirect calorimetry. Although gold standard, indirect calorimetry is an expensive technique, besides being cumbersome and slightly invasive for persons with motor disabilities. Furthermore, it is not adequate for human-in-the-loop control strategies since it takes more than three minutes to reach the steady-state condition, which is a long time for real-time optimization problems. Our investigation sought to find alternative methods to estimate energy expenditure through smaller, ergonomic, and clinically validated sensors, namely inertial, EMG, and heart rate sensors.

To estimate the subjects’ energy expenditure, we implemented and compared two deep learning approaches, namely the CNN and LSTM neural networks. To find the best model that entails a lower estimation error, we evaluated some network’s hyperparameters with a LOOCV algorithm.

4.1. Comparative Analysis of Deep Learning Regression Models

Regarding the LSTM, we verified that increasing the number of neurons until 150 promoted an improvement in the network’s performance. However, we verified an increase of more than 13% in RMSE (∆error ≈ 0.06 W/kg) when overcoming 150 neurons or when introducing another LSTM cell. The introduction of a fully connected layer after the LSTM cells did not improve the network’s performance, resulting in an increase of 19% in RMSE (∆error ≈ 0.08 W/kg). The best model yields one LSTM cell with 150 neurons.

Considering the CNN, we verified that one convolutional layer was not enough to attain the best energy expenditure estimation, resulting in RMSE 13% higher than the best architecture found with two convolutional layers (∆error ≈ 0.05 W/kg). Moreover, increasing the number of neurons of the best CNN’s fully connected layer did not improve the power’s estimation (RMSE increased 21%, ∆error ≈ 0.08 W/kg). The best model yield two convolutional layers of 8 filters each and one fully connected layer with 10 neurons.

By comparing the two deep learning architectures, we verified that the CNN yields a more accurate estimation in all metrics assessed in this work. It presents an improvement of 44% in MSE (∆error ≈ −0.11 W/kg), 20% in RMSE (∆error ≈ −0.090 W/kg), and 18% in NMSE (∆error ≈ 0.12, with values closer to 1.0, indicating a better fitting). Furthermore, the CNN achieved better generalization than the LSTM, since a lower standard deviation was obtained for all metrics. This neural network was reported in ref. [22] as suitable for energy expenditure estimation when compared with an MLP.

4.2. Detailed Analysis of the Best Model

This study demonstrates the suitability of CNN to accurately estimate the subjects’ steady-state energy expenditure based on inertial, EMG, and heart rate sensors.

A deep analysis of Figure 5 reveals that the CNN was able to catch relevant information in the inputs to increase its generalization to different subjects and walking conditions, supported with the low standard-deviation observed. By analyzing Figure 5A, our model are revealed to be accurate in distinguishing the basal from the walking energy expenditure, and it was sensitive to changes in gait speed and walking condition, since an SCC above 0.85 was achieved, indicating that both target and estimation share the same monotony. Figure 5B, which illustrates the Bland–Altman plots, also highlights the suitability of the proposed CNN. The errors between target and estimation were within the 95% confidence interval and the bias was small and close to 0. For the subject with worst estimations, a positive and higher bias (0.48 W/kg) was expected since an underestimation is visible in Figure 5A, especially during the walking condition. For the subject with medium estimations, the CNN slightly overestimates the target energy expenditure and, thus, it would be expected a negative bias. However, this overestimation is not valid for all trials, explaining the negative, yet close bias to 0 (−6.5 × 10−2 W/kg). Regarding the subject with best estimation, the CNN achieved a perfect fit, explaining the low bias of 6 × 10−2 W/kg. These results are supported by Figure 5C, where high linearity is observed (R¯2 = 0.79). Therefore, the steady-state energy expenditure estimation with the CNN is comparable to that obtained with indirect calorimetry.

When estimating energy expenditure for a novel subject, the CNN proved to be reliable. Although the increment of 36% in MSE (∆error ≈ 0.05 W/kg) and 22% in RMSE (∆error ≈ 0.08 W/kg), respectively, when compared to the LOOCV algorithm, these values are within the range observed in Table 2 (0.14 ± 0.1 and 0.36 ± 0.13 W/kg for MSE and RMSE, respectively). The test subject presented a higher variation in the baseline energy expenditure in comparison with other subjects (visible in Figure 6), which introduced an additional challenge in estimating energy expenditure. This may explain the higher error when compared to the LOOCV algorithm. Nevertheless, the SCC was high (0.87) and consistent with the LOOCV algorithm (0.87 ± 0.043), indicating that the inertial, EMG, and heart rate sensors provide enough information to estimate accurately the energy expenditure, regardless of gait speed, walking condition, and basal vs. walking energy expenditure. This was also highlighted in Figure 7, which illustrates the MSE dependency on gait speed and walking conditions. Thus, it seems that there is no dependency regarding gait speed, given the similar values of MSE. Further, the MSE mean values were close to 0 (MSE ≤ 0.17), supporting the conclusion that our tool is accurate in estimating energy expenditure. Yet, we verified a slightly better estimation when the subjects were assisted with the ankle-foot exoskeleton. This may be explained by the fact that gait is more controlled when subjects walk with the ankle-foot exoskeleton. Considering these results, the proposed tool was revealed to be versatile for energy expenditure assessment in multiple rehabilitation scenarios.

4.3. Related Work

Previous works have shown a good performance of AI algorithms in estimating human energy expenditure. For instance, [19] presented an approach to estimate energy expenditure using IMUs, EMG, HR, and, among other physiological signals, the SpO2. The users performed numerous tasks without gait assistance, with a minimum gait speed of 0.6 m/s and a maximum of 2.7 m/s, which resulted in high energy expenditure variation (~10 W/kg). The authors used linear regression models with the best ones presenting errors bellow 1.5 W/kg. Our work innovates by exploring deep learning regressors along with orthotics-based gait assistance considering slow gait speeds, which entailed a lower energy expenditure variation (~4.1 W/kg) when compared to [19]. This lower variation of energy expenditure, conjugated with both assisted and non-assisted gait conditions, may cause an added challenge in the estimation process of our model.

In refs. [20,21], the authors found high correlations between the target and the estimated oxygen uptake by a random forest algorithm and MLP, respectively. Zhu et al. [22] assessed the effectiveness of CNN, MLP, and an activity-specific linear regression model, using inertial, HR, anthropometric data (age, height, weight), and basal metabolic rate. The authors found that the CNN yields the best estimation, resulting in an improvement of more than 30% when compared to the MLP and linear regression model. However, these state-of-the-art approaches may require several minutes (at least 3) to compute the steady-state energy expenditure. Thus, it would entail a higher optimization time for applications with the human-in-the-loop control. Our work innovates over studies from refs. [20,21,22] by (i) estimating the steady-state energy expenditure, which is useful for rapid estimation and reliable use in human-in-the-loop control strategies; (ii) benchmarking two different deep learning regressors, namely the CNN and LSTM, along assisted and non-assisted gait conditions while considering slow gait speeds commonly observed in persons with gait disabilities.

A recent study estimated the steady-state energy expenditure considering ankle assisted walking at 1.25 m/s [15]. The authors used GRF and EMG sensors as inputs of a linear regression model and an LSTM. They found the LSTM was suitable to estimate the end-user energy expenditure, with an average RMSE of 0.40 W/kg for ankle assisted walking, yielding a better model when compared with linear regression (RMSE = 0.43 W/kg). The results were comparable to those obtained with our work, especially for the LSTM neural network. However, our results suggest that using CNN to estimate energy expenditure may improve the estimation power, given the lower RMSE (0.36 W/kg). Additionally, our work innovates the study [15] by exclusively relying on wearable sensor data and exploring slow gait conditions.

4.4. Limitations and Future Insights

The main limitation of our work is the low number of subjects to train the models. Although our trials are long in temporal terms, involving more than 360,000 samples in total to develop the regression models, our algorithm may be affected by the subject’s variability. Even so, an equal gender distribution was guaranteed, which is important to have a more representative population. From a future perspective, we aim to collect more data with different subjects to augment the estimation’s power. Future insights also address the estimator’s selection algorithms, by selecting among data the minimal conjugation of sensors that promotes an accurate estimation and fulfill usability guidelines. Lastly, we aim to collect data with persons that exhibit gait disabilities to assess the differences regarding energy expenditure. The use of transfer learning will endow our models with the ability to estimate the energy expenditure for non-healthy persons.

5. Conclusions

This work presents and validates a deep learning tool for steady-state energy expenditure estimation for both assisted and non-assisted gait conditions considering slow gait speeds typically observed in persons with motor disabilities. Our approach relies on the use of deep learning to promote an accurate estimation of the subjects’ energy expenditure using data collected by reliable, ergonomic, and clinically validated on-body sensors. From the cross-validation results, we verified the suitability of CNN to accurately estimate energy expenditure in basal, assisted, and non-assisted walking, resulting in an improvement of 20% in RMSE when compared to LSTM. Therefore, we propose a versatile tool for estimating the energy expenditure in multiple gait conditions (freely human walking and robotics-based gait rehabilitation) and in optimization problems issues in the human-in-the-loop control strategies.

Author Contributions

Conceptualization, J.F. and C.P.S.; methodology, J.M.L., J.F., P.F. and C.P.S.; software, J.M.L.; validation, J.F. and C.P.S.; formal analysis, J.M.L. and J.F.; investigation, J.M.L., J.F. and C.P.S.; resources, J.P.V.-B. and C.P.S.; data curation, J.M.L. and P.F.; writing—original draft preparation, J.M.L.; writing—review and editing, J.M.L., J.F., J.J.C., J.P.V.-B. and C.P.S.; visualization, J.M.L. and J.F.; supervision, J.J.C., J.P.V.-B. and C.P.S.; project administration, C.P.S.; funding acquisition, J.F., J.P.V.-B. and C.P.S. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee for Research in Life and Health Sciences of University of Minho (CEICVS 006/2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data that support this research will be available upon request.

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

This work has been supported in part by the FEDER Funds through the COMPETE 2020—Programa Operacional Competitividade e Internacionalização (POCI) and P2020 with the Reference Project SmartOs Grant POCI-01-0247-FEDER-039868, and by FCT national funds, under the national support to R&D units grant, through the reference project UIDB/04436/2020 and UIDP/04436/2020, under the FCT scholarship with reference 2020.05708.BD, and under the Stimulus of Scientific Employment with the grant 2020.03393.CEECIND.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Mikolajczyk T., Ciobanu I., Badea D.I., Iliescu A., Pizzamiglio S., Schauer T., Seel T., Seiciu P.L., Turner D.L., Berteanu M. Advanced technology for gait rehabilitation: An overview. Adv. Mech. Eng. 2018;10:1–19. doi: 10.1177/1687814018783627. [DOI] [Google Scholar]
  • 2.Kramer S., Johnson L., Bernhardt J., Cumming T. Energy Expenditure and Cost during Walking after Stroke: A Systematic Review. Arch. Phys. Med. Rehabil. 2016;97:619–632.e1. doi: 10.1016/j.apmr.2015.11.007. [DOI] [PubMed] [Google Scholar]
  • 3.Norman J.F., Bossman S., Gardner P., Moen C. Comparison of the Energy Expenditure Index and Oxygen Consumption Index During Self-Paced Walking in Children with Spastic Diplegia Cerebral Palsy and Children Without Physical Disabilities. Pediatr. Phys. Ther. 2004;16:206–211. doi: 10.1097/01.PEP.0000145930.84009.23. [DOI] [PubMed] [Google Scholar]
  • 4.Brockway J. Derivation of formulae used to calculate energy expenditure in man. Hum. Nutr. Clin. Nutr. 1987;41:463–471. [PubMed] [Google Scholar]
  • 5.Cao J., Xie S.Q., Das R., Zhu G.L. Control strategies for effective robot assisted gait rehabilitation: The state of art and future prospects. Med. Eng. Phys. 2014;36:1555–1566. doi: 10.1016/j.medengphy.2014.08.005. [DOI] [PubMed] [Google Scholar]
  • 6.Meng W., Liu Q., Zhou Z., Ai Q., Sheng B., Xie S.S. Recent development of mechanisms and control strategies for robot-assisted lower limb rehabilitation. Mechatronics. 2015;31:132–145. doi: 10.1016/j.mechatronics.2015.04.005. [DOI] [Google Scholar]
  • 7.Figueiredo J., Félix P., Santos C.P., Moreno J.C. Towards human-knee orthosis interaction based on adaptive impedance control through stiffness adjustment. IEEE Int. Conf. Rehabil. Robot. 2017;2017:406–411. doi: 10.1109/ICORR.2017.8009281. [DOI] [PubMed] [Google Scholar]
  • 8.Jamwal P.K., Hussain S., Ghayesh M.H. Robotic orthoses for gait rehabilitation: An overview of mechanical design and control strategies. Proc. Inst. Mech. Eng. Part H J. Eng. Med. 2020;234:444–457. doi: 10.1177/0954411919898293. [DOI] [PubMed] [Google Scholar]
  • 9.Lopes J.M., Pinheiro C., Figueiredo J., Reis L.P., Santos C.P. Assist-as-needed Impedance Control Strategy for a Wearable Ankle Robotic Orthosis; Proceedings of the IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC); Ponta Delgada, Portugal. 15–17 April 2020; pp. 10–15. [DOI] [Google Scholar]
  • 10.Mooney L.M., Herr H.M. Biomechanical walking mechanisms underlying the metabolic reduction caused by an autonomous exoskeleton. J. Neuroeng. Rehabil. 2016;13:4. doi: 10.1186/s12984-016-0111-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Seo K., Lee J., Lee Y., Ha T., Shim Y. Fully Autonomous Hip Exoskeleton Saves Metabolic Cost of Walking; Proceedings of the IEEE International Conference on Rehabilitation Robotics and Automation (ICRA); Stockholm, Sweden. 16–21 May 2016; [DOI] [Google Scholar]
  • 12.Awad L.N., Bae J., O’Donnell K., De Rossi S.M.M., Hendron K., Sloot L.H., Kudzia P., Sloot L.H., Kudzia P., Allen S., et al. A soft robotic exosuit improves walking in patients after stroke. Sci. Transl. Med. 2017;9:eaai9084. doi: 10.1126/scitranslmed.aai9084. [DOI] [PubMed] [Google Scholar]
  • 13.Malcolm P., Galle S., Van Den Berghe P., De Clercq D. Exoskeleton assistance symmetry matters: Unilateral assistance reduces metabolic cost, but relatively less than bilateral assistance. J. Neuroeng. Rehabil. 2018;15:74. doi: 10.1186/s12984-018-0381-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Martini E., Crea S., Parri A., Bastiani L., Faraguna U., McKinney Z., Molino-Lova R., Pratali L., Vitiello N. Gait training using a robotic hip exoskeleton improves metabolic gait efficiency in the elderly. Sci. Rep. 2019;9:7157. doi: 10.1038/s41598-019-43628-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Slade P., Troutman R., Kochenderfer M.J., Collins S.H., Delp S.L. Rapid energy expenditure estimation for ankle assisted and inclined loaded walking. J. Neuroeng. Rehabil. 2019;16:67. doi: 10.1186/s12984-019-0535-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Zhang J., Fiers P., Witte K.A., Jackson R.W., Poggensee K.L., Atkeson C.G., Collins S.H. Human-in-the-loop optimization of exoskeleton assistance during walking. Science (80-) 2017;356:1280–1284. doi: 10.1126/science.aal5054. [DOI] [PubMed] [Google Scholar]
  • 17.Koller J.R., Gates D.H., Ferris D.P., Remy C.D. “Body-in-the-Loop” Optimization of Assistive Robotic Devices: A Validation Study. Robot. Sci. Syst. 2016;2016:1–10. doi: 10.15607/RSS.2016.XII.007. [DOI] [Google Scholar]
  • 18.Ding Y., Kim M., Kuindersma S., Walsh C.J. Human-in-the-loop optimization of hip assistance with a soft exosuit during walking. Sci. Robot. 2018;3:eaar5438. doi: 10.1126/scirobotics.aar5438. [DOI] [PubMed] [Google Scholar]
  • 19.Ingraham K.A., Ferris D.P., Remy C.D. Evaluating physiological signal salience for estimating metabolic energy cost from wearable sensors. J. Appl. Physiol. 2019;126:717–729. doi: 10.1152/japplphysiol.00714.2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Beltrame T., Amelard R., Wong A., Hughson R.L. Prediction of oxygen uptake dynamics by machine learning analysis of wearable sensors during activities of daily living. Sci. Rep. 2017;7:45738. doi: 10.1038/srep45738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Beltrame T., Amelard R., Villar R., Shafiee M.J., Wong A., Hughson R.L. Estimating oxygen uptake and energy expenditure during treadmill walking by neural network analysis of easy-to-obtain inputs. J. Appl. Physiol. 2016;121:1226–1233. doi: 10.1152/japplphysiol.00600.2016. [DOI] [PubMed] [Google Scholar]
  • 22.Zhu J., Pande A., Mohapatra P., Han J.J. Using Deep Learning for Energy Expenditure Estimation with wearable sensors; Proceedings of the 2015 17th International Conference on E-health Networking, Application & Services (HealthCom); Boston, MA, USA. 14–17 October 2015; pp. 501–506. [DOI] [Google Scholar]
  • 23.Beaman C.B., Peterson C.L., Neptune R.R., Kautz S.A. Differences in self-selected and fastest-comfortable walking in post-stroke hemiparetic persons. Gait Posture. 2010;31:311–316. doi: 10.1016/j.gaitpost.2009.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Polar Research and Technology. Polar H10 Heart Rate Sensor System. 2019. [(accessed on 17 October 2022)]. Available online: https://www.polar.com/en/img/static/whitepapers/pdf/polar-h10-heart-rate-sensor-white-paper.pdf.
  • 25.Hermens H.J., Freriks B., Disselhorst-Glug C., Rau G. Development of recommendations for SEMG sensors and sensor placement procedures. J. Electromyogr. Kinesiol. 2000;10:361–374. doi: 10.1016/S1050-6411(00)00027-4. [DOI] [PubMed] [Google Scholar]
  • 26.Al-Amri M., Nicholas K., Button K., Sparkes V., Sheeran L., Davies J.L. Inertial measurement units for clinical movement analysis: Reliability and concurrent validity. Sensors. 2018;18:719. doi: 10.3390/s18030719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.McLaughlin J.E., King G.A., Howley E.T., Bassett D.R., Ainsworth B.E. Validation of the COSMED K4b2 Portable Metabolic System. Int. J. Sports Med. 2001;22:280–284. doi: 10.1055/s-2001-13816. [DOI] [PubMed] [Google Scholar]
  • 28.Reeves M.M., Davies P.S.W., Bauer J., Battistutta D. Reducing the time period of steady state does not affect the accuracy of energy expenditure measurements by indirect calorimetry. J. Appl. Physiol. 2004;97:130–134. doi: 10.1152/japplphysiol.01212.2003. [DOI] [PubMed] [Google Scholar]
  • 29.Winter D.A. Biomechanics and Motor Control of Human Movement. 4th ed. Volume 2. John Wiley & Sons, Inc.; Hoboken, NJ, USA: 2009. [Google Scholar]
  • 30.De Luca C.J., Donald Gilmore L., Kuznetsov M., Roy S.H. Filtering the surface EMG signal: Movement artifact and baseline noise contamination. J. Biomech. 2010;43:1573–1579. doi: 10.1016/j.jbiomech.2010.01.027. [DOI] [PubMed] [Google Scholar]
  • 31.Gerdle B., Karlsson S., Day S., Djupsjöbacka M. Modern Techniques in Neuroscience Research. Springer; Berlin/Heidelberg, Germany: 1999. Acquisition, Processing and Analysis of the Surface Electromyogram; pp. 705–755. [DOI] [Google Scholar]
  • 32.Goodfellow I., Bengio Y., Courville A. Deep Learning. The MIT Press; Cambridge, MA, USA: 2016. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data that support this research will be available upon request.


Articles from Sensors (Basel, Switzerland) are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES