Abstract
In many applications that include direct human involvement such as control of prosthetic arms, athletic training, and studying muscle physiology, hand force is needed for control, modeling and monitoring purposes. The use of inexpensive and easily portable active electromyography (EMG) electrodes and position sensors would be advantageous in these applications compared to the use of force sensors which are often very expensive and require bulky frames. Among non-model-based estimation methods, Multilayer Perceptron Artificial Neural Networks (MLPANN) has widely been used to estimate muscle force or joint torque from different anatomical features in humans or animals. This paper investigates the use of Radial Basis Function (RBF) ANN and MLPANN for force estimation and experimentally compares the performance of the two methodologies for the same human anatomy, ie, hand force estimation, under an ensemble of operational conditions. In this unified study, the EMG signal readings from upper-arm muscles involved in elbow joint movement and elbow angular position and velocity are utilized as inputs to the ANNs. In addition, the use of the elbow angular acceleration signal as an input for the ANNs is also investigated.
Keywords: Force estimation, neural networks, electromyography, multilayer perceptron, radial basis function
Introduction
In applications, such as control of prostheses,1,2 sports medicine3–5 or ergonomic design analysis,6 the measurement of muscle or limb force is required. The use of force sensors for interface in normal operation is often impractical or inconvenient for direct measurement of forces. Additionally, the high cost of commercially available force/torque sensors makes the alternate method of force estimation from a group of muscles using electromyography (EMG) signals, which requires only comparatively inexpensive electrodes, very attractive.7 The estimated force/torque can also be used for research in biomechanics and muscle physiology.8,9
The torque induced at each body joint is generated by muscle actuators driving the motion of that joint. The generated force in muscles is caused by two processes; activation dynamics and muscle contraction dynamics.10 Muscle contraction dynamics include the mechanical properties of muscle tissues and tendons, which are expressed as force-length and force-velocity relations. The activation dynamics include the voluntary and non-voluntary excitation reflex signals and motor unit recruitment level in muscle. It is well-known that regardless of fatigue, the generated torque in each joint is dependent on muscle activation level (MAL) and joint angle when in a stationary position.11,12 When in motion, joint torque is also a function of joint angular velocity.13 Therefore, joint torque (and force) can be predicted using EMG signals, and joint angle and velocity measurements. In this work, it is assumed that the wrist and shoulder angles are fixed. Thus, hand force can be determined directly from elbow torque.
In the past, parametric and non-parametric model-based approaches have been proposed for muscle force or human joint torque estimation using EMG signals. Parametric approaches have used the Hill muscle model,8,9,14 which takes muscle MAL as an input and outputs the generated muscle force based on muscle length and contraction speed.13 Musculoskeletal kinematic models15 and inverse-dynamics models16 have also been employed to derive joint torque. Nonparametric model-based methods use polynomial functions,1,10,17 linear regression,18 Volterra-series,19 Fast Orthogonal Search (FOS),9,20 Parallel Cascade21 or ANNs.22–27 The advantage of non-parametric estimation of the EMG-force relationship is that it does not require any knowledge about muscle and joint dynamics. Moreover, they often have the capability of accounting for nonlinearities in the EMG-force relationship.
Clancy and Hogan17 proposed the use of a third-order polynomial to estimate the generated torque in the elbow joint under isometrica and quasi-isotonicb conditions. An off-line least squares method was employed to identify the polynomial coefficients. Misener and Morin used an exponential force/velocity function in addition to a third-order polynomial model to estimate the elbow torque.10 The prediction was accurate within 10% for elbow flexion and extension under isotonicc conditions. Clancy et al. used a 15th-order linear finite impulse response (FIR) model to map EMG to torque errors within 7.3% of maximum voluntary contraction during isometric contractions.18 Xu et al.19 and Hashemi et al.21 employed second-order Volterra series and parallel cascade methods to find the EMG-Force relation during isometric flexion of the human elbow joint. Mobasser et al.20 used the FOS, a nonlinear identification method, to map EMG to force. They proposed that the FOS-based force observer would select its pre-determined basis functions from a pool of candidate functions consisting of groupings of linear, cross and square root terms and sinusoidal functions. Later, Mountjoy et al. incorporated Hill-based muscle model as the candidate functions for more accurate force prediction and for estimating upper-arm muscle physiological parameters.9
The first use of ANN for EMG-based force estimation was reported by Sepulveda et al. where they proposed an MLPANN to estimate joint moments in human gait.22 The EMG signals from 16 leg muscles were fed to the ANN to estimate joint moments of the hip, knee and ankle. Savelberg et al. employed MLPANN with a back propagation training method for the estimation of a cat leg dynamic tendon force using recorded EMG data and kinematic information (joints angle and velocity).23,25 The ANN could estimate non-trained data with the root mean square error (RMSE) of less than 15%, under dynamic contraction conditions. Luh et al. utilized the EMG signals collected from the biceps and triceps, and elbow joint angle and velocity in an MLPANN to estimate the elbow joint torques under isokineticd contractions.24 In a hybrid approach, Wang et al.27 employed an MLPANN with two hidden layers to predict the MAL from the EMG signal recorded from the agonist and antagonist elbow muscles. The predicted MAL was then input to a Hill muscle model to calculate the generated muscle force. Finally, the estimated muscle force was employed in an elbow joint geometric model to predict the joint torque. Since the actual MAL was not available, an adjusted back-propagation method was used to train the ANN. The use of MLPANN, such as in the above, has been reported for force estimation on various anatomies of humans or animals under single or often dissimilar operational conditions.
Radial Basis Function ANN (RBFANN) has mostly been used for EMG signal classification.28,29 It has also been used along with acceleration and force data collected during walking to replicate EMG signals from major lower limb muscles for gait analysis.30 RBFANN has also been used to identify changes in the truck muscle stiffness during a sudden load.31 To the authors best knowledge, RBFANN has been reported only for predicting muscle forces using evoked electromyogram (EEMG) signals.32,33 In this work,e RBFANN will be employed in addition to the MLPANN architecture on the data collected from a single human anatomical feature, i.e. elbow joint. The performance of the two ANN architectures will be experimentally studied and compared on a 1-DOF exoskeletal robotic testbed under the same set of operational conditions, i.e. isometric, isotonic and light load.
Methods
In this paper two major architectures are utilized: MLPANN and RBFANN. These two ANNs are both known as universal approximators for nonlinear input-output mapping, which are briefly introduced.35,36
Multilayer Perceptron ANN (MLPANN)
MLP network architecture mimics the structure of the neural networks in the human nervous system. It consists of an interconnection of multiple layers of neurons (Fig. 1A) and can have more than one hidden layer. In the MLP architecture (as shown in Fig. 1A), all the outputs in each layer are connected to the inputs of neurons in the next layer. Moreover, each node has a bias input. The sigmoid function f (x) = 1/(1 + e−x) is often utilized as the activation function, which behaves like a smoothed step function. To train the network for the corresponding weight for each node, the back-propagation method is used. In this method, the error measure for an input vector x is considered to be
(1) |
with y as the desired output, and ŷ as the output of the network, where ||·|| is the vector Euclidean norm and e is the output error. To train the network, the error measure E is minimized by employing an optimization algorithm such as Gradient decent, Conjugate gradient, Quasi-Newton or Levenberg Marquardt (LM).37 The LM method is commonly used in situations involving function approximations with less than a hundred neuron weights where accuracy is paramount.37 The LM training method can be implemented in the MATLAB Neural Network Toolbox using the TRAINLM function. The training algorithm can be written as
(2) |
(3) |
at the kth time iteration, where W is a vector of all network weights, G is the gradient vector of the network errors and δ is an adjustable scalar. When δ is zero, the algorithm is Newton’s method, whereas for large δ the algorithm becomes a gradient descent algorithm.
Radial Basis Function ANN (RBFANN)
On the other hand, an RBF network is constructed of a summation of a number of exponentially decaying localized nonlinear functions (e.g. Gaussian), approximating the input-output mapping globally (Fig. 1B). An RBF network has a single hidden layer, as opposed to an MLP network that can have more than one hidden layer. The output of an RBFANN with N nodes, as shown in Figure 1B, is expressed by
(4) |
where wi is the weight of the output layer for neuron i. f (·) is an RBF function that decreases (or increases) monotonically as the distance between the input x and centroid point μi increases. The parameter vector μi = (μi1, …, μiM) describes the centroid coordinates of the ith node of an RBF network, where M is the dimension of input x.
The Gaussian fg with constant σ described by
(5) |
is the most commonly used radially symmetric function. To train for the centroid and output weight for all nodes, that is
(6) |
the error measure (1) is minimized by using an optimization method, such as the gradient descent method. The optimization is initiated with a random choice of parameters.
RBF is generally faster in the learning process and insensitive to the order of presentation of training data. As a drawback, the number of neurons in RBF, which is necessary to reach a desired accuracy, may be large.
ANN-based Hand Force Observers
Figure 2 illustrates the schematics of the MLPANN and RBFANN force observers. The inputs to the ANNs are the EMG signals recorded from the upper-arm muscles (biceps brachii, triceps brachii, brachialis, and brachioradialis) and the elbow joint angle and angular velocity. The network output is the estimated hand force. The neural networks are trained off-line using the network input data and measured force collected under different conditions, as will be explained in Section 4 (Fig. 2A). The trained networks will then be used for force estimation for real-time applications. The use of elbow joint angular acceleration as an input to ANN will also be explained in Section 5.
One important parameter of choice is the number of nodes in each layer. By increasing this number, training performance improves at the cost of an increase in training time. On the other hand, increasing the number of nodes does not necessarily help the network generalization; hence the network may not truly estimate the hand force. Tests will be conducted in Section 5 to find the number of nodes for both RBFANN and MLPANN resulting in the best achievable generalization for the networks for real-time applications with high accuracy.
Experimental Setup and Procedures
Experimental setup
Figure 3 shows the Queen’s University Arm (QArm1), used for the implementation and comparison of the two MLPANN and RBFANN based force observers. The apparatus is composed of a platform accommodating a Maxon DC motor, a 4:1 cable-driven power transmission system, and an aluminum bar on which the subject’s arm is placed. In this design, the subject’s upper-arm is fixed in position with a brace and the wrist is placed in a second brace attached to a pivoting bar through an ATI Gamma 6-DOF force/torque sensor. The axis of rotation of the bar pivot is aligned with the elbow’s axis of rotation. A high resolution encoder signal provides the motor shaft angle, which is proportional to the elbow angle. The resolution of elbow angle measurement is 1/4000 of a degree. The apparatus can apply up to 6.32 Nm continuous torque to the elbow joint, which is mapped to about 16 N at the wrist. The Quanser WinCon/Venturcom RTX real-time control system is employed for data collection and control of the apparatus at the rate of 1 kHz.
To collect EMG signals from upper-arm muscles, active electrodes containing differential amplifiers are employed to cancel the effect of environmental electromagnetic interference. The recorded raw EMG data is passed through a linear-envelope filter.38 The bias of the recorded EMG is detected by applying a moving average filter, and is removed from the original signal. The resulting signal is then rectified and passed through a second moving average filter with a size of 300. This produces 150 ms of delay in the final EMG signal. The filter length and the resulting delay are chosen to match the average delay between the collected EMG signal (or motor nerve excitations) and the force generated by the muscle.13
Experimental procedure
Electrode attachments
EMG signals are collected from the four muscles biceps, tricps, brachialis and brachioradialis. Due to the size of the biceps and triceps muscles multiple electrodes are placed at different locations for more accurate results. The subject’s shoulder and wrist are placed in the arm braces ensuring that the upper-arm is stabilized and the elbow joint axis coincides with the bar rotation axis. The shoulder is stabilized at 90° abduction, 15° horizontal adduction and neutral pronation-supination. Subjects are asked to relax all muscles not directly involved in elbow flexion/extension.
EMG normalization
The level of EMG readings is dependent on skin impedance at the electrodes location, which may vary between experimental sessions. To compensate for this variability, the EMG readings have to be normalized for each recording session and after electrode placement. A widely used EMG normalization method is the Maximum Voluntary Contraction (MVC) method,13 in which the subject is asked to maximally contract the muscle under study. The corresponding EMG reading is used for normalization of EMG readings from that muscle. The MVC method was originally employed by mobasser, et al. for EMG normalization in the but did not produce satisfactory estimation results.20 In fact, using the MVC method raised concerns about whether or not subjects generated the true maximum force.13
As an alternative approach, at the beginning of each experiment, 6 Nm torque is applied by the robotic arm to the elbow joint in the flexion and extension directions, while the subject is asked to keep the elbow angle at 0° flexion.f The average value of the recorded EMG is used for the normalization of the collected EMG data from the biceps and brachioradialis in flexion and from the lateral triceps in extension. This method has shown better results in comparison to the MVC method. In the case of operation outside a laboratory environment where QArm1 or any other manipulandum is not available, the subject can be asked to hold a specific weight for EMG normalization.
Experiments
In each experiment, after normalization, the elbow joint kinematics and muscle EMG data are recorded under the following sequence of conditions:
Light Load
No torque is applied to the elbow joint by the QArm1 and the subject is asked to move her/his arm in a random pattern for 50 seconds. The only load that is sensed by the subject is the light inertia of QArm1.
Isotonic
QArm1 applies a constant torque to the subject’s elbow joint while the subject moves her/his arm in a random pattern. Equivalent wrist force level of 5 N and 10 N are applied at the joint both in flexion and extension directions for a total duration of 50 seconds.
Isometric
The subject performs a number of isometric flexion and extension tasks at the elbow position of 0° and 30° degrees, while QArm1 is held fixed by the subject’s free hand. The level of force applied by the subject is variable and arbitrary.
For this research, EMG data were collected from the right arm of two female and six male subjects with an average age of 27 and a standard deviation of 5 years. The subjects had no known neuromuscular deficits in their right shoulder, arm or hand. Informed consent was received from each subject and the experiment protocol was explained to each subject prior to the experiment. For each experiment type, subjects were tested in three sessions each. In each recording session, the above described experiment was conducted four times and four sets of data were recorded. The three sessions were conducted at three different times to study the intra-session effect of variations in human subject physiology and EMG electrode placement. The outlier data that were mainly caused by faults in EMG readings, due to amplifier saturation or electrode detachment from skin, were excluded. Therefore, for each experiment type, up to a maximum of twelve data sets were collected and kept for the analysis of each subject performance. Sample data collected from subject F1 performing experiment type I is shown in Figure 4.
Experimental Results
Network settings
In MLPANN, a hyperbolic tangent sigmoid function and linear function were used as activation functions for hidden layers and output layer nodes, respectively. Before training network, the input and output data in each training set were normalized to [−11] to correspond to a [minimum maximum] range of the data. Moreover, the data from the validation sets were normalized by the [minimum maximum] range of the training set. Since the RBFANN activation function is Gaussian and does not saturate, the inputs of RBFANN were not normalized. To avoid overtraining,24 the collected data were down-sampled by 10 for both the MPLANN and RBFANN.
To avoid networks from being trapped in local minima in training processes, ten networks were trained for 1000 epochs with a very small target training goal (smaller than the lowest estimation error that the networks could meet). The best performing network was chosen and trained for an additional 1000 epochs. The codes were implemented in MATLAB on a 3GHz Pentium 4 CPU.
Evaluation criteria
To validate the trained networks, each network was evaluated against all data sets with the three criteria, Relative Mean Square Error (RMSE), Cross-Correlation (CC) and Average Absolute Error (AAE) as formulated below
where yi and ŷi are the measured and the estimated hand forces, respectively, and n is the number of samples. Cross-correlation is a measure of similarity between y and ŷ of the signals regardless of scaling. The closer the value is to 100, the higher the level of similarity between signals.23 In conditions such as light load in which measured force is very small, RMSE becomes very large, hence AAE serves as a better performance index than RMSE.
Number of nodes
Different number of nodes were examined for both types of networks. Two hidden layers were chosen for MLPANN since it is proven that a network type with two hidden layers can approximate any continuous function.35 The average RMSE for validation results for a different number of nodes in the first and second layers for subject M1 are shown in Figure 5. MLPANN with two hidden layers, 4 nodes in the first layer and 3 nodes in the second layer demonstrated the best generalization and the most consistent performance. For RBFANN, as shown in Figure 6, although the training average RMSE is at its minima for a node number higher than 3, the average RMSE for validation which determines network generalization is a minimum within the range of 30–50 nodes. Therefore, 40 nodes are chosen for the RBFANN network.
The training elapsed time for the two network architectures, trained with full-length data collected from experiment type I, is shown in Table 1. The results indicate that training RBFANN is approximately 5–6 times faster than MLPANN. The inclusion of the acceleration signal in ANN inputs increases the training time, with a larger effect on RBFANN. For real-time operations, the online calculations of ANN output takes a fraction of a second since it only requires a forward pass of inputs through the network.
Table 1.
MLPANN | RBFANN | |
---|---|---|
With acceleration | 52.26 sec | 12.43 sec |
Without acceleration | 50.98 sec | 8.89 sec |
Multiple electrodes vs. single electrode
In this work, multiple electrodes were used on biceps and triceps muscles. Three electrodes were placed on the belly of the triceps brachii long head, lateral head and medial head, and two electrodes on the belly of the bicep brachii short head and long head. The locations follow the SENIAM39 recommendations, which are on the line between the acromion and the fossa cubit at 1/3 from the fossa cubit for the biceps and at 50% on the line between the acromion and the olecranon for the triceps. To combine the measured electrodes data and achieve the best overall EMG, the processed and normalized EMG signals from electrodes corresponding to a specific muscle (e.g. Biceps) were averaged at each instance of time.
The ANNs provided better performance when averaged EMG from multiple electrodes were utilized instead of a single electrode for biceps and triceps muscles. The validation results after training on full-length data for all subjects using single-electrode and multiple electrodes EMG data are displayed in Table 2. It can be observed that the use of multiple-electrodes improved force estimation performance by 10%–70%, verifying the results reported in.40 Therefore, in this paper, multiple-electrode EMG data from the biceps and triceps muscles have been collected and input to ANNs.
Table 2.
Subj. F1 | Subj. F2 | Subj. M1 | Subj. M2 | Subj. M3 | Subj. M4 | Subj. M5 | Subj. M6 | |
---|---|---|---|---|---|---|---|---|
Single-electrode | 16 (3) | 20 (7) | 21 (6) | 31 (15) | 62 (64) | 17 (5) | 40 (11) | 35 (18) |
Multiple-electrode | 14 (4) | 11 (4) | 7 (1) | 9 (4) | 32 (11) | 14 (3) | 24 (4) | 20 (7) |
Estimation results
The 12 data sets acquired from each subject across three sessions were used for training 12 networks. Each trained network was validated with the remaining 11 data sets. The network that provided the best validation and generalization performance was recorded. Then the same procedure was conducted for the three segments of data corresponding to the three subtasks, light load, isotonic and isometric. Each subtask resulted in 12 networks out of which one was chosen for the best validation. The training and validation RMSE, RMSE standard deviation, AAE, CC and CC standard deviation results, averaged over all subjects, are presented in Table 3. The detailed training and validation RMSE, RMSE standard deviation, AAE for each subject are also presented in Table 4 in Appendix A.g The right-hand columns represent the result for each subtask based on training with the corresponding subtask of the data for the particular task.
Table 3.
Full-length data set used for training
|
Only subtask data used for training
|
||||||||
---|---|---|---|---|---|---|---|---|---|
Full-length | Isometric | Isotonic | Light load (LL) |
LL + accel. | Isometric | Isotonic | LL | LL + accel. | |
Verification average for RMSE (standard deviation) [AAE] | |||||||||
RBF | 16 (8) [7] | 12 (6) [11] | 34 (29) [6] | [3] | [3] | 14 (9) [12] | 17 (6) [4] | [1.2] | [0.9] |
MLP | 26 (15) [10] | 15 (11) [10] | 111 (43) [11] | [5] | [5] | 12 (8) [9] | 13 (7) [4] | [1.1] | [0.9] |
Verification average for cross correlation (standard deviation) | |||||||||
RBF | 93 (4) | 95 (3) | 88 (6) | 57 (18) | 47 (18) | 94 (3) | 90 (4) | 72 (9) | 85 (6) |
MLP | 85 (8) | 93 (4) | 54 (27) | 27 (20) | 23 (20) | 94 (4) | 92 (5) | 75 (7) | 80 (12) |
Note: The results for the best performing architecture are shown in boldface.
Table 4.
Full-length data used for training
|
Only subtask data used for training
|
||||||||
---|---|---|---|---|---|---|---|---|---|
Full-length | Isometric | Isotonic | Light load (LL) | LL + accel. | Isometric | Isotonic | LL | LL + accel. | |
Sub. F1 | |||||||||
RBF | 14 (4) [4] | 11 (4) [6] | 17 (8) [3] | [2] | [2] | 11 (4) [6] | 13 (4) [3] | [1.3] | [0.9] |
MLP | 32 (6) [6] | 12 (3) [9] | 65 (81) [7] | [5] | [4] | 9 (2) [1] | 8 (2) [2] | [1.3] | [1] |
Sub. F2 | |||||||||
RBF | 11 (4) [6] | 11 (5) [12] | 8 (2) [3] | [1] | [2] | 19 (3) [16] | 9 (2) [1] | [1.3] | [0.7] |
MLP | 31 (5) [10] | 7 (2) [10] | 118 (27) [12] | [7] | [6] | 7 (1) [10] | 4 (1) [2] | [0.8] | [0.7] |
Sub. M1 | |||||||||
RBF | 7 (1) [7] | 5 (2) [11] | 22 (11) [6] | [3] | [3] | 9 (6) [14] | 26 (11) [4] | [1] | [0.6] |
MLP | 17 (6) [10] | 11 (6) [15] | 89 (16) [9] | [6] | [3] | 9 (7) [14] | 28 (15) [5] | [1] | [0.7] |
Sub. M2 | |||||||||
RBF | 9 (4) [6] | 5 (2) [8] | 27 (19) [6] | [5] | [3] | 3 (1) [7] | 17 (5) [5] | [1] | [0.8] |
MLP | 17 (2) [9] | 4 (2) [8] | 97 (23) [12] | [5] | [3] | 5 (2) [8] | 13 (10) [4] | [0.8] | [0.9] |
Sub. M3 | |||||||||
RBF | 32 (11) [7] | 22 (9) [13] | 51 (41) [6] | [5] | [4] | 18 (10) [11] | 27 (16) [4] | [1.8] | [1.6] |
MLP | 55 (11) [10] | 32 (11) [14] | 102 (23) [9] | [7] | [8] | 29 (16) [4] | 15 (10) [3] | [1.8] | [1.8] |
Sub. M4 | |||||||||
RBF | 14 (3) [7] | 11 (3) [11] | 20 (8) [4] | [3] | [3] | 7 (2) [9] | 17 (7) [4] | [1.4] | [1.1] |
MLP | 23 (5) [8] | 9 (3) [10] | 98 (3) [10] | [2] | [2] | 7 (3) [9] | 10 (5) [3] | [1.4] | [1.1] |
Sub. M5 | |||||||||
RBF | 24 (4) [8] | 21 (3) [14] | 28 (13) [6] | [2] | [1] | 33 (17) [17] | 15 (5) [4] | [0.7] | [0.4] |
MLP | 29 (2) [9] | 14 (4) [11] | 106 (11) [10] | [2] | [7] | 9 (2) [9] | 9 (7) [3] | [0.6] | [0.5] |
Sub. M6 | |||||||||
RBF | 20 (7) [10] | 13 (4) [15] | 99 (15) [10] | [4] | [4] | 15 (6) [16] | 15 (5) [4] | [1.3] | [0.8] |
MLP | 5 (31) [17] | 34 (25) [5] | 210 (71) [15] | [8] | [6] | 17 (9) [19] | 21 (6) [6] | [0.8] | [0.7] |
Average of verification results | |||||||||
RBF | 16 (8) [7] | 12 (6) [11] | 34 (29) [6] | [3] | [3] | 14 ([12] | 17 (6) [4] | [1.2] | [0.9] |
MLP | 26 (15) [10] | 15 (11) [10] | 111 (43) [11] | [5] | [5] | 12 (8) [9] | 13 (7) [4] | [1.1] | [0.9] |
The validation results for the isometric condition for both networks trained with full-length data are significantly better than those obtained under isotonic and light load conditions. This can also be seen for subject F1 in Figure 7. In the light load condition, since the measured forces are significantly smaller, the RMSE values are very large. Therefore, the AAE serves as a better performance measure. In general, looking at the RMSE and CC values, both networks are unable to provide accurate estimation of hand force in light load. This is because under the light load condition the recorded EMG signals are small and as such not rich enough for training. The trained network with task subset data performs better since they are trained for the specific condition for which they are validated. This performance improvement can be observed by comparing the sub-figures in the left and the corresponding right-hand columns in Figure 8.
Comparing the performance of the two ANN architectures, RBFANN provides a better estimation for each subtask or all three tasks after full-length data training. However, MLPANN performs better when training is done for a specified subtask such as isotonic and isometric tasks. Adding acceleration as an input to ANN improves the RMSE and CC in the light load condition after training with subtask data, as shown in Figure 8 for RBFANN. The frequency domain analysis results in Figure 9 verify the above results. It can be observed that RBFANN and MLPANN have close performance in isotonic and isometric conditions, whereas RBFANN has superior performance after training with full-length data and when acceleration signal is used as an input in the light load condition.
The results pertaining to the best performing architecture for each subject, trained with full-length and subtask data, are emphasized in Table 4. For the best performing network, validation for full-length data training was RMSE < 32% (CC > 86%), for isometric training RMSE < 18% (CC > 91%), and for isotonic training RMSE < 26% (CC > 84%). For light load training, RMSE < 40%, AAE [the actual indicative in this case] < 1.6 N (CC > 76%).
It is observed that none of the examined architectures is dominant in terms of performance. Moreover, the networks provide better performance when trained with subtask data. Therefore, authors suggest a switching scheme that chooses the best network out of a bank of subtask-trained networks based on the current operating condition. The switching decision can be made by a neural network or a neurofuzzy classifier that detects isometric, isotonic and light load states in real-time.
Discussion
As stated in the Introduction section, MLPANN has been reported for joint torque estimation.22–26 Although the anatomies or experimental conditions are different from ours, in most of the cases, a general comparison of the networks in terms of methodology, network structure and performance would better put our approach and results into perspective. Almost all methods, including ours, use feedforward networks with error backpropogation learning algorithm due to their simplicity. Song et al.26 however, use recurrent MLPANN to account for dynamic situations. The above models relate the EMG and/or joint kinematics to the joint torque. The methods that use single hidden layer22,24 has reported a large number of nodes from 10 to 32 in their hidden layer, and also large number of epochs in the order of 60,000 to reach RMSE% or AAE comparable to the results that we have reported. Two hidden layer networks use in the order of or less than 10 nodes and require significantly lower number of epochs (about 1,000) for training to avoid over-fitting and under-fitting. In terms of inputs, Sepulveda et al. use only the EMG signals to predict joint torque. However, it has been established that joint dynamics is also dependent on joint angular position and velocity. In fact, Song et al. demonstrate torque prediction improvement by up to three folds under dynamic conditions when kinematic data incorporated. Recent research, including ours, has reported the use of joint kinematic data as well. Savelberg et al. has argued that under dynamic conditions the relationship between EMG and force is history dependent.25 Therefore, they use an array of 10 to 30 data points preceding the instant of interest; As such the input layer in their networks contain up to 40 nodes. They have also argued that the number of nodes and the length of the EMG array depend on the desired level of generalization, whether it is intra-session or inter-session.
To improve the quality and reliability of the results we have taken a few measures as discussed in below. MAL is a function of excitation signal firing rate and recruitment of muscle fibers. The measured EMG is an ensemble of the Motor Unit Action Potentials (MUAPs) under the electrode and is heavily dependent on the electrode configuration, placement and signal pre-conditioning. Therefore, the recorded EMG is not an accurate representation of MAL. This factor and the stochastic property of EMG signals affects the repeatability of force estimation based on EMG data. The position of electrodes and the fidelity of the EMG/MAL relation play an important role in the accuracy of estimations based on ANN.24 This is caused by existing crosstalk readings from adjacent muscles. Therefore, active sensor locations should be selected to minimize the crosstalk effect.22 Towards this purpose, as reported in Section 5, multiple electrodes were utilized to record EMG signals from the biceps and triceps muscles. The results showed an improvement between 10% and 70% in estimation accuracy.
To further improve the fidelity of the EMG/MAL relation, a whitening filter.41 was employed. However, since the filter amplifies the higher frequency component of the EMG signals, no significant improvement in estimation performance was observed. The history of EMG data (up to 10 samples) was also employed as input to the networks, but the results were inconclusive and no consistent improvement was observed.
Another significant issue is muscle fatigue which has not been taken into consideration as the experiments were all conducted under non-fatiguing conditions. In addition, in anisometric motion, artifacts, caused by the movement of muscle under the electrode are often a problem. Since part of the hand force in motion is generated by hand dynamics and the dynamics of the grasped object, the inclusion of acceleration in the ANN inputs improves the performance (especially in free motion) as it has been observed in the results presented in Section 5.
In terms of applications, although the use of ANN for force estimation does not provide any insight on the joint dynamic model, it is suitable for applications that only require joint torque or hand force for monitoring purposes, such as in the analysis of athletes activities, and for the control of prosthetic arms.
Conclusion
This paper investigates the use of two artificial neural network architectures RBFANN and MLPANN for force estimation, and compares the performance of the two methodologies for human hand force estimation under an ensemble of operational conditions; isometric, isotonic and light load. A new EMG normalization method was employed as an alternative approach for MVC. The trained ANNs were able to predict the highly nonlinear relation between the joint parameters (EMG, angle and velocity) and the hand generated force. Use of multiple-electrode EMG reading from major muscles improved force estimation accuracy.
It was observed that both networks provide better performance when trained with subtask data. The experiments with three subtasks showed that the best performing architecture for each subject can generalize intra-session force estimation to a maximum RMSE of 40%, 26% and 18% for light load, isotonic and isometric conditions, respectively. Moreover, ANN is able to estimate the full-length data to a maximum RMSE of 40%. The experiments also showed that the performance of neither MLPANN nor RBFANN was dominant for all operating conditions. The training and validation results for both architectures for the isometric condition were significantly better than those for isotonic and light load conditions. RBFANN provided better estimation performance in the light load condition and MLPANN in the isotonic condition. The performance in light load condition improved significantly by including joint angular acceleration in the network inputs.
As none of the examined architectures proved to be dominant in terms of performance across all subtasks, a switching scheme that chooses the best RBF or MLP network out of a bank of subtask-trained networks can be considered as an attractive alternative for enhanced force prediction. An alternative solution can be to combine the MLP and RBF for an assemble network to predict the joint torque, which may result in improved estimation by utilizing different features of the two networks.
Acknowledgments
The authors wish to thank Queen’s University Sensory Motor Systems Group for their collaboration and support in the design and construction of QArm1 and the volunteer subjects for their help and patience. The authors would also like to thank Drs. P. Mousavi and E. Morin for their valuable discussions.
Footnotes
An operational condition involving muscular contractions against resistance without movement such that the length of the muscle does not change, ie, constant joint angle.
An isometric condition with very low constant velocity movement.
An operational condition at which opposing muscles contract and there is controlled movement. In other words, tension is constant while the lengths of the muscles change. In our case, a constant force is applied to joint while joint angle varies.
An operational condition at which the speed of movement is usually controlled, e.g. constant speed, allowing maximal force to be exerted throughout the full range of movement.
A preliminary version of this work appeared in [34].
The zero angle refers to the posture at which the forearm is perpendicular to the upper-arm.
Due to space limitations, the detailed results of CC and the standard deviation of CC for each subject is not included.
Author Contributions
Conceived and designed the experiments: FM, KH. Analysed the data: FM, KH. Wrote the first draft of the manuscript: FM, KH. Contributed to the writing of the manuscript: FM, KH. Agree with manuscript results and conclusions: FM, KH. Jointly developed the structure and arguments for the paper: FM, KH. Made critical revisions and approved final version: FM, KH. All authors reviewed and approved of the final manuscript.
Funding
This work was supported in part by Institute for Robotics and Intelligent Systems (IRIS), Natural Sciences and Engineering Research of Canada (NSERC), and Queen’s University Advisory Research Committee (ARC).
Competing Interests
Author(s) disclose no potential conflicts of interest.
Disclosures and Ethics
As a requirement of publication author(s) have provided to the publisher signed confirmation of compliance with legal and ethical obligations including but not limited to the following: authorship and contributorship, conflicts of interest, privacy and confidentiality and (where applicable) protection of human and animal research subjects. The authors have read and confirmed their agreement with the ICMJE authorship and conflict of interest criteria. The authors have also confirmed that this article is unique and not under consideration or published in any other publication, and that they have permission from rights holders to reproduce any copyrighted material. Any disclosures are made in this section. The external blind peer reviewers report no conflicts of interest.
References
- 1.Cheng HS, Ju MS, Lin CCK. Improving elbow torque output of stroke patients with assistive torque controlled by emg signals. ASME Jour of Biomed Eng. 2003;125:881–6. doi: 10.1115/1.1634284. [DOI] [PubMed] [Google Scholar]
- 2.Moradi M, Hashtrudi-Zaad K, Mountjoy K, Morin E. An emg-based force control system for prosthetic arms. In Proc of IEEE Canadian Conf on Electrical and Computer Engineeirng. 2008:1737–42. [Google Scholar]
- 3.Baudouin A, Hawkins D. A biomechanical review of factors affecting rowing performance. Jour of Sports Medicine. 2002;36:396–402. doi: 10.1136/bjsm.36.6.396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Boyer K, Nigg B. Muscle tunning during running: Implications of an un-tuned landing. ASME Jour of Biomed Eng. 2006;128:815–22. doi: 10.1115/1.2354202. [DOI] [PubMed] [Google Scholar]
- 5.Zitzewitz J, Rauter G, S R, Brunschweiler A, R R. A versatile wire robot concept as a haptic interface sport simulation. In Proc of IEEE Int Conf on Robotics and Automation. 2009:313–8. [Google Scholar]
- 6.Allen C, Karam K, Le Cam P, Hill M, Tindle T. Application of virtual reality devices to the quantitative assessment of manual assembly forces in a factory environment. In Proc of IEEE Int Conf on Industrial Electronics, Control and Instrumentation. 1995:1048–53. [Google Scholar]
- 7.Kim K, Kim D, Jeong Y, Kim K, Park J. A biological man-machine interface for teleoperation. In Proc of Int Symp on Robotics. 2001 Apr;:371–8. [Google Scholar]
- 8.Cavallaro E, Rosen J, Perry J, Burns S. Real-time myo-processors fo a neural controlled powered exoskeleton arm. IEEE Trans on Biomedical Engineering. 2006;53:2387–96. doi: 10.1109/TBME.2006.880883. [DOI] [PubMed] [Google Scholar]
- 9.Mountjoy K, Morin E, Hashtrudi-Zaad K. Use of the fast orthogonal search method to estimate optimal joint angle for upper limb Hill-muscle models. IEEE Trans on Biomedical Engineering. 2010;57:790–8. doi: 10.1109/TBME.2009.2036444. [DOI] [PubMed] [Google Scholar]
- 10.Misener D, Morin E. An emg to force model for the human elbow derived from surface emg parameters. In Proc of IEEE Int Conf of the Engineering in Medicine and Biology Society. 1995:1205–6. [Google Scholar]
- 11.Inman V, Ralston H, Saunders J, Feinstein B, Wright W. Relation of human electromyogram to muscular tension. Electromyography Clinical Neurophysiology. 1952;4:187–94. doi: 10.1016/0013-4694(52)90008-4. [DOI] [PubMed] [Google Scholar]
- 12.Tsuji T, Kaneko M. Estimation and modeling of human hand impedance during isometric muscle contraction. In Proc of the ASME Dynamic Systems and Control Division. 1996;5:575–82. [Google Scholar]
- 13.De Luca C. The use of surface electromyogra-phy in biomechanics. Jour of Applied Biomechanics. 1997;13:135–63. [Google Scholar]
- 14.Zajac F. Muscle and tendon: properties, models, scaling, and application to biomechanics and motor control. Critical Reviews in Biomedical Engineering. 1989;17:359–411. [PubMed] [Google Scholar]
- 15.Pham KD, Morin EL. Dynamic surface EMG/force relation in human elbow flexion. In Proc of the Int Conf of the IEEE Engineering in Medicine and Biology Society. 1995;2:1203–4. [Google Scholar]
- 16.Bogey R, Perry J, Gitter A. An emg-to-force processing approach for determining ankle muscle forces during normal human gait. Neural Systems and Rehabilitation Engineering. 2005 Sep;13(3):302–10. doi: 10.1109/TNSRE.2005.851768. [DOI] [PubMed] [Google Scholar]
- 17.Clancy EA, Hogan N. Relating agonist-antagonist electromyograms to joint torque during isometric, quasi-isotonic, nonfatiguing contractions. IEEE Trans on Biomededical Engineering. 1997;44:1024–8. doi: 10.1109/10.634654. [DOI] [PubMed] [Google Scholar]
- 18.Clancy E, Bida O, Rancourt D. Influ-ence of advanced electromyogram (EMG) amplitude processors on EMG-torque estimation during constant posture, force varying contractions) Jour of Biomechanics. 2006;39:2690–8. doi: 10.1016/j.jbiomech.2005.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Xu L, Zhang Y. Identification of emg-force system using the second-order volterra model. In Proc of Int Conf of the IEEE Engineering in Medicine and Biology Society. 1996;2:569–70. [Google Scholar]
- 20.Mobasser F, Eklund J, Hashtrudi-Zaad K. Estimation of elbow-induced wrist force with EMG signals using Fast Orthogonal Search. IEEE Trans on Biomedical Engineering. 2007;54:683–93. doi: 10.1109/TBME.2006.889190. [DOI] [PubMed] [Google Scholar]
- 21.Hashemi J, Morin E, Mousavi P, Mountjoy K, Hashtrudi-Zaad K. EMG-force relationship using parallel cascade identification. Jour of Electromyography and Kinesiology. 2012;22(3):469–477. doi: 10.1016/j.jelekin.2011.10.012. [DOI] [PubMed] [Google Scholar]
- 22.Sepulveda F, Wells D, Vaughan C. A neural network representation of electromyography and joint dynamics in human gait. Jour of Biomechanics. 1993;26:101–9. doi: 10.1016/0021-9290(93)90041-c. [DOI] [PubMed] [Google Scholar]
- 23.Liu MM, Herzog W, Savelberg HH, CM Dynamic muscle force predictions from EMG: an artificial neural network approach. Jour of Electromyography and Kinesiology. 1999;9:391–400. doi: 10.1016/s1050-6411(99)00014-0. [DOI] [PubMed] [Google Scholar]
- 24.Luh J, Chang G, Cheng C, Lai J, Kuo S. Isokinetic elbow joint torques estimation from surface emg and joint kinematic data: using an artificial neural network model. Jour of Electromyography and Kinesiology. 1999;9:173–83. doi: 10.1016/s1050-6411(98)00030-3. [DOI] [PubMed] [Google Scholar]
- 25.Savelberg H, Herzog W. Prediction of dynamic tendon forces from electromyographic signals: an artificial neural network approach. Jour of Neurosience Methods. 1997;78:65–74. doi: 10.1016/s0165-0270(97)00142-8. [DOI] [PubMed] [Google Scholar]
- 26.Song R, Tong K. Using recurrent artifi-cial neural network model to estimate voluntary elbow torque in dynamic situation. Jour of Biomechanics. 1993;26:101–9. doi: 10.1007/BF02344728. [DOI] [PubMed] [Google Scholar]
- 27.Wang L, Buchanan TS. Prediction of joint moments using a neural network model of muscle activations from emg signals. IEEE Trans on Neural Systems and Rehabilitation Engineering. 2002;10:30–7. doi: 10.1109/TNSRE.2002.1021584. [DOI] [PubMed] [Google Scholar]
- 28.Zalzala A, Chaiyaratana N. Myo-electric signal classification using evolutionary hybrid RBF-MLP networks. In Proc of Int Conf on Evolu-tonay Computation. 2000:691–8. [Google Scholar]
- 29.Wang R, Yang Y, Hu X, et al. A hybrid AB-RBF clasifier for surface electromyography classification. Lecture Notes in Computer Science. 2007;4561:727–35. [Google Scholar]
- 30.Milovanovic I. Radial basis functon (rbf) networks for improved gait analysis. In Proc of Symp on Neural Networks Applications in Electrical Engineering. 2008 [Google Scholar]
- 31.Lawrence B, Bucker G, Mirka G. An adaptive system identification model of the biomechanical response of the human trunk during sudden loading. ASME Jour of Biomed Eng. 2006;128:235–41. doi: 10.1115/1.2165696. [DOI] [PubMed] [Google Scholar]
- 32.Donaldson N, Gollee H, Hunt J, Jarvis J, Kwende M. A radial basis function model of muscle stimulated with irregular inter-pulse intervals. Medical Engineering Physics. 1995;17:431–41. doi: 10.1016/1350-4533(94)00013-y. [DOI] [PubMed] [Google Scholar]
- 33.Erfanian A, Chizeck H. Prediction of electrically stimulated muscle force under isometric conditions using self-constructing neural networks. In Proc of the Conf on Int Functional Electrical Stimulation Society. 2003 [Google Scholar]
- 34.Mobasser F, Hashtrudi-Zaad K. Hand force estimation using electromyography signals. In Proc of IEEE Int Conf on Robotics and Automation. 2005:2642–7. [Google Scholar]
- 35.Haykin S. Neural Network: A comprehensive foundation. Macmillan College Publishing; 1994. [Google Scholar]
- 36.Girosi F, Jones M, Poggio T. Regularization theory and neural networks architectures. Neural Computation. 1995;7:219–69. [Google Scholar]
- 37.Demuth H, Beale M. MATLAB Neural Network Toolbox User’s Guide. MathWorks; 2001. [Google Scholar]
- 38.Winter DA. Biomechanics and Motor Control of Human Movement. Wiley; New York, NY: 1990. [Google Scholar]
- 39. http://seniam.org.
- 40.Clancy E, Hogan N. Multiple site elec-tromyograph amplitude estimation. IEEE Trans on Biomedical Engineering. 1995;42:203–11. doi: 10.1109/10.341833. [DOI] [PubMed] [Google Scholar]
- 41.Clancy E, Hogan N. Estimation of joint torque from the surface emg. In Proc of Int Conf of IEEE Engineering in Medicine and Biology Society. 1991:877–8. [Google Scholar]