Comparison of Classical, Neural Network and Hybrid Models for Hysteretic Single-tendon Catheter Kinematics

Yuan Wang; Pierre E Dupont

doi:10.1109/lra.2024.3504321

. Author manuscript; available in PMC: 2026 Jan 1.

Published in final edited form as: IEEE Robot Autom Lett. 2024 Nov 21;10(1):96–103. doi: 10.1109/lra.2024.3504321

Comparison of Classical, Neural Network and Hybrid Models for Hysteretic Single-tendon Catheter Kinematics

Yuan Wang ¹, Pierre E Dupont ¹

PMCID: PMC11633443 NIHMSID: NIHMS2039215 PMID: 39669770

Abstract

While robotic control of catheter motion can improve tip positioning accuracy, hysteresis arising from tendon friction and flexural deformation degrades kinematic modeling accuracy. In this paper, we compare the capabilities of three types of models for representing the forward and inverse kinematic maps of a clinical single-tendon cardiac catheter. Classical hysteresis models, neural networks and hybrid combinations of the two are included. Our results show that modeling accuracy is best when models are trained using motions corresponding to the anticipated clinical motions. For sinusoidal motions, recurrent neural network models provide the best performance. For point-to-point motions, however, a simple backlash model can provide comparable performance to a recurrent neural network.

Index Terms—: Cardiac catheter, hysteresis, continuum robot, neural networks

I. Introduction

Tendon-actuated catheters are used extensively in percutaneous intracardiac procedures including for the repair and replacement of heart valves, the treatment of cardiac arrhythmias and the repair of occluded vessels. For example, pulmonary vein isolation is a therapeutic procedure used to treat atrial fibrillation, an irregular heart rhythm. In this procedure, a transseptal sheath is guided from the femoral vein in the groin into the right atrium of the heart, across the atrial septum and just into the left atrium (Fig.1).

Fig. 1. — Transcatheter pulmonary vein isolation. (a) Transseptal sheath extends from femoral vein in groin across atrial septum into left atrium. (b) Steerable catheter extending through transseptal sheath is used to ablate tissue at sequence of points encircling pairs of pulmonary veins.

A steerable ablation catheter is inserted through the transseptal sheath which is sequentially positioned at closely spaced sets of points around the pairs of pulmonary veins. At each point, the catheter contacts the tissue and, using radiofrequency energy, creates a scar. The rings of scars prevent abnormal electrical signals arising inside the pulmonary veins from propagating into the left atrium. The catheter pauses at each point for up to 60 seconds to create a scar.

Precise control of point-to-point motions, such as those used in pulmonary vein isolation, is challenging because the kinematic modeling of continuum robots is more difficult than for traditional robots comprised of rigid links and discrete joints. While mechanics-based models of continuum robot designs have been developed [1]–[3], they typically neglect hysteretic phenomena despite its known effect on modeling accuracy [4].

While feedback control can mitigate feedforward modeling inaccuracy, this too has its challenges. Few catheters incorporate tracking or shape sensors owing to the cost and complexity of including sensors in small cross sections. This forces the operator to close the feedback loop using real-time imaging to compensate for kinematic modeling error. This increases the level of attention that the operator must devote to catheter control and is challenging given the limitations of fluoroscopic and ultrasound imaging.

The alternative approach is to incorporate improved kinematic models into the robot controller that include accurate characterizations of hysteresis. The challenge for the robot designer, however, is that, while there are many possible techniques for modeling hysteresis, most investigations consider only one or two methods, do not model both forward and inverse kinematics and do not consider the effect of the anticipated clinical motions on model accuracy. Consequently, there are no definitive studies comparing the performance and characteristics of a broad range of models.

For example, both traditional hysteresis and deadband models as well as neural networks provide potential means of modeling kinematic hysteresis in catheters. Tendon-actuated hysteresis modeling has been considered in the context of mapping tendon tension to tube flexure or tip contact force [5], [6], examining how sheath curvature affects tendon hysteresis [7], and modeling the combined hysteretic behavior of a tendon-actuated catheter driven by a pneumatic artificial muscle [8].

These papers have considered a variety of hysteresis models including traditional models like the Preisach and Prandtl-Ishlinskii models [5], [8], as well as advanced approaches involving recurrent neural networks (RNNs) like Long short-term memory (LSTM) networks [6], [8], [9] and hybrid models combining RNNs and the Preisach model [7].

To provide a unified comparison of model types, this paper presents an experimental comparison of seven hysteretic models for the forward and inverse kinematics. An eighth model, comprised of a feedforward neural network (FNN), is used as a baseline non-hysteretic model for comparison since it can learn a nonlinear kinematic map but lacks the capability to model history dependence. FNNs represent the best performance achievable by a model that ignores hysteresis and so provide a baseline for quantifying the improvement achieved by other models when considering historical information.

Here, we study a single-tendon catheter that flexes in one direction. This design is commonly used clinically such as in the application shown in Fig. 1. Catheter motions are composed of a combination of axial translation, axial rotation and uni-directional flexure. While catheter translation and rotation can be performed smoothly, the tendon-actuated flexure of catheters fabricated from polymeric components exhibits a significant amount of hysteresis.

The hysteresis models compared include two classical models (backlash [10] and Prandt1-Ishlinskii [11]), three neural network models (LSTM, GRU and a feedforward neural network with an input buffer) and two hybrid models (combinations of backlash and LSTM). The paper also investigates two classes of catheter motions: sinusoidal and point-to-point.

The remainder of the paper is arranged as follows. Section II defines the kinematic inputs and outputs. It also introduces the classical models, neural network models, and hybrid models used for hysteretic kinematics modeling. Section III describes the sets of experiments performed to evaluate the models along with modeling results. Conclusions are presented in Section IV.

II. Modeling Hysteretic Kinematics

As shown in Fig. 2, when a tensile force or displacement is applied to a tendon attached to the tip of a flexible catheter, it causes the catheter to deflect. The kinematic input can be considered to be either tendon displacement or tension force. We assume that, while catheter flexure is hysteretic, there is a unique relationship between flexure angle, $θ$ , tip offset from the longitudinal axis, $y$ , and tip displacement along the longitudinal axis, $x$ . Thus, the output of the kinematic map can be taken as one of ${θ, x, y}$ and the remaining two variables can be computed from the first. As described below, we investigated the utility of a variety of models for representing the forward and inverse kinematic maps.

A. Classical hysteresis models

Numerous mathematical models have been developed to describe nonlinear hysteresis behavior exhibited by materials and systems [12], [13]. Here, we consider two models, a backlash model and a frequency-dependent Prandtl–Ishlinskii model.

The virtue of the backlash model is its simplicity. As shown in Fig. 3(a), it is described by four parameters and was proposed to describe and compensate for the dead-zone, hysteresis or backlash characteristics in control systems [10], [14]. The change in output, $v (t)$ , verses input, $u (t)$ , is characterized by two straight lines connected by horizontal line segments and is given by

v (t) = \{\begin{array}{l} m_{L} [u (t) - c_{L}] & if u (t) \leq z_{L} \\ m_{R} [u (t) - c_{R}] & if u (t) \geq z_{R} \\ v (t - 1) & if z_{L} \leq u (t) \leq z_{R} \end{array}

(1)

\begin{array}{r} z_{L} = \frac{v (t - 1)}{m_{L}} + c_{L}, z_{R} = \frac{v (t - 1)}{m_{R}} + c_{R} \end{array}

(2)

Fig. 3. — Hysteresis models. (a) Four-parameter backlash model. (b) Hysteresis loop of Prandtl-Ishlinskii model. (c) Hybrid LSTM-Backlash model (HB1). (d) Parallel hybrid LSTM-Backlash model (HB2).

Here, $m_{L}, m_{R}, c_{L}$ , and $c_{R}$ are parameters defining the slopes and positions of the two straight lines. The inverse form of the backlash model to be used for inverse kinematic modeling is given by

u (t) = \{\begin{array}{l} u (t - 1) & if v (t) = v (t - 1), \\ \frac{v (t)}{m_{L}} + c_{L} & if v (t) < v (t - 1), \\ \frac{v (t)}{m_{R}} + c_{R} & if v (t) > v (t - 1) . \end{array}

(3)

Among those models developed specifically for hysteresis, e.g., Preisach, Prandtl-Ishlinskii (PI), and Bou–Wen, the PI model has merit in that model identification and inversion are somewhat easier than other models [12], [13]. Although the classical PI model does not account for rate-dependent behavior, several modified versions incorporate this capability. We use here the version proposed in [11]. The rate-dependent PI model is formulated by superposing time-dependent play operators and is given by

v (t) : = a_{0} u (t) + \sum_{i = 1}^{k} a_{i} Ψ_{r_{i}} (t) [u], Ψ_{r_{i}} (t) [u] = max (u (t) - r_{i} (t), min (u (t) + r_{i} (t), Ψ_{r_{i}} (t_{j - 1}) [u])) for t \in [t_{j - 1}, t_{j}], r_{i} (t) = ξ i + β |\dot{u} (t)| for i = 1, \dots, k,

(4)

where $Ψ_{r_{i}} (t) [u]$ are the time dependent play operators and $r_{i}$ is a time dependent threshold function of these play operators. In this paper, $k$ is set to 4 and an example loop is depicted in Fig. 3(b) for parameters $\{a_{0}, a_{1}, a_{2}, a_{3}, a_{4}, ξ, β\} = {0.1,0.2,0.3,0.4,0.5,0.1,2}$ .

For modeling the inverse kinematics, we use the inverse form of the rate-dependent PI model given by [15]. The parameters to be optimized are $\{b_{0}, b_{1}, b_{2}, b_{3}, b_{4}, ξ, β\}$ and the model is given by

u (t) : = c_{0} v (t) + \sum_{i = 1}^{k} c_{i} Ψ_{s_{i}} (t) [v], Ψ_{s_{i}} (t) [v] = max (v (t) - s_{i} (t), min (v (t) + s_{i} (t), Ψ_{s_{i}} (t_{j - 1}) [v])) for t \in [t_{j - 1}, t_{j}], r_{i} (t) = ξ i + β |\dot{v} (t)| for i = 1, \dots, k, s_{i} (t) = s_{i - 1} (t) + (r_{i} (t) - r_{i - 1} (t)) \sum_{j = 0}^{i - 1} b_{j} for i = 2, \dots, k, c_{0} = \frac{1}{b_{0}}, c_{i} = \frac{1}{\sum_{j = 0}^{i} b_{j}} - \frac{1}{\sum_{j = 0}^{i - 1} b_{j}} for i = 1, \dots, k .

(5)

B. Neural network models

Neural networks are effective tools for identifying complex patterns in data. Feedforward Neural Networks (FNNs) utilize a single-direction architecture to learn input-output mappings and so lack the capability to model any dependence on the history of motion. We use FNNs in this paper to represent the best that a non-hysteretic model can do in reproducing the forward and inverse kinematics.

Recurrent Neural Networks (RNNs) are specifically designed to handle sequences so their structures allow them to model hysteresis. Long short-term memory (LSTM) networks incorporate memory cells and gating mechanisms which make them more effective at dealing with long sequences [16]. Gated Recurrent Unit (GRU) networks have a simplified architecture compared to LSTMs, which reduces their computational complexity while maintaining their performance in many tasks [17].

In addition to RNNs and GRUs, we also consider a modified FNN which incorporates a history input buffer [18], [19]. In this modification, the input is comprised of the current input plus a sequence of preceding inputs. This FNN with History Input Buffer (FNN-HIB) is able to reproduce hysteretic behavior because it creates a map between a sequence of prior inputs and the current output. Its structure is simpler than that of an RNN, but its implementation requires the use of the input buffer.

In this paper, all of the neural network models incorporate an input layer, two hidden layers of dimension 64 and an output layer. All of the input layers are comprised of a single node except for the FNN-HIB, which is comprised of 50 nodes, serving as the buffer storing a sequence of 49 prior inputs plus the current input value.

C. Hybrid models

Since enhanced modeling performance can often be achieved by combining neural networks with phenomenological models, we also consider two hybrid models integrating neural networks with classical hysteresis models. Fig. 3(c) and (d) shows the two models considered which each combine an LSTM network and a backlash model. In the first model (Fig. 3(c)), the LSTM is used to calculate instantaneous values of the four backlash parameters, $\{m_{L}, m_{R}, c_{L}, c_{R}\}$ .

(m_{L}, m_{R}, c_{L}, c_{R}) = f_{lstm} (u), v_{H B 1} = f_{b} ((m_{L}, m_{R}, c_{L}, c_{R}), u)

(6)

The second model of Fig. 3(d) is a parallel connection of the models. Here, the backlash model has fixed parameters $\{m_{L}, m_{R}, c_{L}, c_{R}\}$ which are learned in an initial training session. Subsequently, the LSTM is trained to model the error of the backlash model. The equations of this hybrid model are

v_{H B 2} = f_{lstm} (u) + f_{b} ((m_{L}, m_{R}, c_{L}, c_{R}), u)

(7)

III. Experiments

We performed experiments on the clinical single-tendon catheter of Fig. 2 using the custom drive system of Fig.4. The catheter is made up of a polymer tube with laser-cut tubing embedded inside. It is comprised of a stiff proximal section and a flexible, steerable distal section. The non-steerable proximal section was constrained using a table-mounted stabilizer block as shown. The steerable distal segment, which allows for bending, measures 6 cm in length. A stainless-steel tendon wire runs through a channel near the outer polymer layer of the catheter, enabling controlled movement of the flexible section.

The proximal end of the tendon was wrapped around a pulley integrated with a cartridge to which the sheath is attached. The pulley was driven thorugh a bevel gear by a DC motor (Faulhaber 2342012CR) via servo drive (Accelnet ACJ-055–09). A PC running ROS 2 was used to control the tendon with the servo drive reporting tendon displacement and tension. An electromagnetic sensor (Ascension trakSTAR) was used to track tip position and orientation with accuracy of ±1.4mm and ±0.35°.

When initializing the robot, tendon slack can cause significant trial-to-trial variation. Tendon pretension, however, creates a precurvature in the sheath. To balance these effects, we conducted preliminary experiments to determine the minimum pretension producing repeatable results.

Experimental trials were conducted using varying levels of pretension. To quantify the repeatability of the catheter’s performance, the average standard deviation across five trials was calculated for each pretension level. The minimum pretension was chosen as the lowest pretension value that yielded a standard deviation less than 0.5°. This minimum pretension value was 4.92N corresponding to an initial bending angle of 20°. The tension $(F)$ was calculated using the equation $F = I K_{t} / r$ , where $I$ represents the measured current, $K_{t}$ denotes the torque constant, and $r$ is the radius of the motor.

We also observed that, with further flexure of the sheath, it would never return to this initial bending angle, but instead would relax to a larger angle, 30°. The initial flexure, occurring only once during the catheter’s initialization, is not modeled since we focus on the catheter’s repeatable behavior flexing over a range of angles greater than or equal to this minimum angle of 30°. To avoid including data from this nonrepeatable initial flexure in the modeling data, a preliminary commanded motion was performed corresponding to a 0.1 Hz sinusoidal bending to a maximum of 90° and then back to the relaxed bending angle of 30°.

Following these initialization steps, custom motions were imposed as tendon displacements to gather data for modeling purposes. The collected data was sampled at 25 Hz to ensure uniformity across the modeling datasets.

Training for all the models included the use of a mean square error (MSE) loss function and the Adam optimizer in the PyTorch framework, with a learning rate set to 0.001. Each model was trained with a data sequence length of 50 and a batch size of 16. The early stopping strategy was adopted to select the best model based on validation performance within 100 epochs.

Four sets of experiments were performed. The first set investigated hysteresis properties, the second set compared alternative kinematic mappings, and the third set evaluated the effect of training motions on model performance. The final set comprehensively compared all models based on what was learned in the preceding experiments. Each set is detailed below.

A. Hysteresis Properties

We first investigated the hysteresis properties for the catheter of Fig. 2. These characteristics include frequency dependence, the selection of kinematic input variable (tendon displacement versus tendon tension), and repeatability.

We examined frequency dependence by comparing the catheter’s bending angle response to tendon displacement and tension across five frequencies of decaying sinusoidal signals. Figs. 5(a) and (b) show that the output amplitudes decrease to a small degree with increasing frequency, indicating a slight frequency dependence.

Furthermore, comparing the hysteresis loops of Figs. 5(a) and (b) reveals that using tendon displacement as the input provides a smoother, more-uniform loop than tendon tension. Anticipating that this loop will be easier to model, we use tendon displacement as the input variable for kinematic modeling.

As shown in Fig. 5(c), we also examined the repeatability of hysteresis by performing multiple trials with the same tendon motion. These results confirm that the kinematic behavior is repeatable and so is amenable to the proposed modeling approach.

B. Model Accuracy versus Choice of Output Variable

For the three output variables shown in Fig. 2, it is expected that they are functionally dependent on each other. For example, if the assumption of constant curvature holds, then the tangent angle $θ$ and tip position coordinates $x$ and $y$ are related by

x = \frac{l}{θ} sin (θ), y = \frac{l}{θ} (1 - cos (θ))

(8)

where $l$ is the length of the bending section of the catheter. Assuming these variables are related to each other, we should be able to train a hysteretic model for one output variable and then compute the other two using a non-hysteretic functional model.

We used preliminary experiments to model the relationship between ${x, y, θ}$ and to compare the relative accuracy of the hysteretic models in mapping directly to the three output variables. These experiments showed that, while ${x, y, θ}$ are not related by the constant curvature assumption, their relationship can be accurately modeled using a quadratic equation as shown in the block of Fig. 6(b).

Furthermore, we determined that the hysteretic mapping from tendon displacement to $x$ and $y$ is more nonlinear than the map from tendon displacement to $θ$ . While the backlash and PI models proved adept at reproducing the mapping from tendon displacement, $d$ , to $θ$ (Fig. 6(a)), they struggled to directly predict $x$ and $y$ . For these models, we found that more accurate maps to $x$ and $y$ could be produced by the composite map of Fig. 6(b). The hysteretic model maps first to $θ$ and then a quadratic function is used to map $θ$ to $x$ or $y$ . An example motion prediction comparing the two mappings for the backlash model is shown in Fig. 6(c).

Since neural networks are particularly adept at capturing nonlinear relationships, it was observed as expected that the mappings of Fig. 6(a) and (b) produced equivalent accuracy for all neural network and hybrid models tested.

C. Modeling Clinical Motions

In this paper, two types of motions are considered for hysteretic kinematic modeling: sinusoidal and point-to-point motions. Sinusoidal motions of varying amplitude and frequency are commonly employed when assessing hysteresis [7], [8], [20]. In clinical settings, such periodic motions are used to create oscillating catheter flexion for scanning motions of an imaging probe, e.g., as discussed in [8].

The most common catheter-based procedures, however, such as the example of pulmonary vein isolation in Fig. 1, require the catheter to move between a sequence of tip positions with long pauses at each location. These point-to-point motions can be modeled as stationary time periods interspersed with segments of constant velocity.

We collected motion data of both types for model training, validation and testing as described below.

1). Sinusoidal Motion Data:

Tendon displacements, $d (t)$ , given by decaying sinusoids were used to represent periodic motions. Data was collected for seven frequencies and three offsets as given below.

d (t) = d_{max} e^{- τ t} (sin (2 π f t - \frac{π}{2}) + c) + d_{offset},

(9)

\begin{array}{l} t & \in & \{0, \dots, t_{\max}\} \\ f & \in & \{0.1, 0.15, 0.2, 0.3, 0.4, 0.45, 0.5\} Hz \\ τ & = & f \log (7 / 6) Hz \\ d_{max} & = & 3 mm \\ \{c, d_{offset}\} & \in & (\{1, 0 mm\}, \{0, 3 mm\}, \{- 1, 6 mm\}) \end{array}

(10)

The maximum time, $t_{max}$ , and decay rate, $τ$ , were selected so that data for seven full cycles could be collected. The maximum tendon displacement, $d_{max}$ , was selected to correspond to a tip bending angle of $90^{\circ}$ . The three pairs of $\{c, d_{offset}\}$ produce oscillations in which the simusoidal maxima are all at the maximum tendon displacement, the sinusoidal means are all at the mean tendon displacement, and the sinusoidal minima are all at the pretension tendon displacement, respectively.

The frequencies $f \in {0.1,0.2,0.4,0.5} Hz$ were used for training while $f = 0.3 Hz$ was used for validation and $f \in {0.15,0.45} Hz$ were used for testing.

2). Point-to-Point Motion Data:

Point-to-point motions were composed of 20 segments which alternated between constant nonzero velocity and zero velocity. The nonzero velocity segments performed tendon displacements connecting 10 positions randomly selected from the range, $d \in {0,6} mm$ . Each displacement was performed at a constant velocity randomly selected in the range 0.5–6mm/sec and followed by a random-duration zero-velocity segment. The tendon displacements $d (t)$ are formulated as below:

d (t) = \{\begin{array}{l} v_{i} (t - t_{2 (i - 1)}) + d_{i - 1}, & t_{2 (i - 1)} \leq t < t_{2 (i - 1) + 1} \\ d_{i}, & t_{2 (i - 1) + 1} \leq t < t_{2 i} \end{array}

(11)

Here, $i = 1,2, \dots, 10, v_{i} \in [0.5,6] mm / sec$ are constant velocities, $d_{0} = 0$ , and $d_{i} \in [0,6] mm$ are the displacements and $t_{2 (i - 1) + 1}$ are the time points at which the motion changes from constant velocity to zero velocity. The duration of nonzero velocity segments $t_{2 (i - 1) + 1} - t_{2 (i - 1)} = \frac{d_{i} - d_{i - 1}}{v_{i}}$ . The duration of stationary segments $t_{2 i} - t_{2 (i - 1) + 1} \in [1,5] sec$ . Also, to ensure that the data was sufficiently exciting, each tendon displacement, $Δ d$ , was forced to satisy $0.5 m m < Δ d < 3 m m$ .

Twenty trials were performed with fifteen of these trials used for training and the remaining five used for validation. Twenty additional trials were performed for testing in which stationary segments had random durations of 1–10 seconds and tendon displacements were not constrained.

During initial training experiments, it was observed that minimizing mean square error is not sufficient for producing well-behaved kinematic models. It is also necessary that the model produce a constant output for a constant input. Otherwise, the model could predict that the catheter tip could continue moving in a neighborhood around the actual configuration. The structure of the backlash model (and the hybrid models constructed from it) is such that it inherently incorporates the constant-input constant-output property. In contrast, neural network models can produce fluctuating outputs under static input conditions due to their tendency to overfit. To overcome this problem, we introduced a new term in the loss function to supplement the mean square error (MSE) term. The new term penalized the ratio of the standard deviation of model output, $σ (\hat{v})$ , to standard deviation of model input, $σ (u)$ , computed over the data sequence length of 50 samples. In the loss function, $L$ , below, $w_{1}$ and $w_{2}$ are scalar weighting functions and $ϵ > 0$ is a small constant intended to prevent a zero denominator.

L = w_{1} MSE (v, \hat{v}) + w_{2} \frac{σ (\hat{v})}{σ (u) + ϵ}

(12)

The weighting factors were selected as $w_{1} = 1$ and $w_{2} \in \{0, 10^{- 4}\}$ , based on empirical evaluation. This new term penalized changes in the output when the input was constant and resulted in much improved performance during stationary portions of point-to-point motions. During model training, both values of $w_{2}$ were compared and the model with lowest error is reported.

3). Training versus Testing Motion Type:

To investigate if the models would be sensitive to the type of motion that they were trained on, we compared how models trained on one type of motion were able to predict the other type of motion.

Fig. 7 displays the results of our evaluation, comparing the performance of LSTM models trained on these two different datasets. When tested on a sinusoidal motion, as shown in Fig. 7(a), the tip angle prediction of the model trained on sinusoidal data matches well with experimental data, whereas the model trained on point-to-point motions fails to capture the sinusoidal patterns accurately. Conversely, when applied to a point-to-point motion, as shown in Fig. 7(b), the model trained on sinusoidal data struggles to handle the stationary periods, while the model trained on point-to-point data successfully maintains a constant output during these periods.

We also trained models on both types of motions, but found that the resulting cross-trained models were not as accurate as models trained and tested on a single motion type. Inverse kinematic modeling produced similar results.

D. Model Comparison

The seven hysteresis models of Section II were compared with the memoryless FNN model with respect to reproducing the catheter’s forward and inverse kinematics using the test data. As noted above, modeling errors are minimized if training and testing are performed on the same motion type (sinusoidal or point-to-point) so only those results are reported here. Forward and inverse maps for the three output variables, ${θ, x, y}$ were modeled. Since modeling errors were comparable for the three outputs, only results for $θ$ are reported here.

Fig. 8 presents a comparison of forward and inverse kinematic modeling errors for all the models. The figure includes the mean, standard deviation and maximum error for each model computed over the test trials. One-sided Wilcoxon signed-rank tests were conducted to determine whether the errors produced by the models were statistically greater or less than each other. Table I summarizes the results of these tests.

TABLE I.

Statistical Equivalence of Models.

		FNN	Backlash	PI	LSTM	GRU	FNN-HIB	HB1
Backlash	FS / IS	< / <
Backlash	FP / IP	< / <
PI	FS / IS	< / <	> / <
PI	FP / IP	< / <	> / >
LSTM	FS / IS	< / <	< / <	< / <
LSTM	FP / IP	< / <	< / ~	< / <
GRU	FS / IS	< / <	< / <	< / <	~ / ~
GRU	FP / IP	< / <	~ / ~	< / ~	> / ~
FNN-HIB	FS / IS	< / <	< / <	< / ~	> / >	> / >
FNN-HIB	FP / IP	< / <	> / >	~ / >	> / >	> / >
HB1	FS / IS	< / <	< / <	< / ~	~ / >	~ / >	< / ~
HB1	FP / IP	< / <	< / ~	< / <	~ / ~	< / ~	< / <
HB2	FS / IS	< / <	< / <	< / <	~ / ~	~ / ~	< / <	~ / <
HB2	FP / IP	< / <	< / <	< / <	~ / <	< / <	< / <	~ / <

Open in a new tab

A “~” indicates equivalent modeling error, while “<” or “>” indicate that the model in the first column on the left has significantly less or greater error than the model in the first row, with a significance level of $p < 0.05$ . FS = Forward model for sinusoidal motions.IS = Inverse model for Sinusoidal motions. FP = Forward model for Point-to-point motions. IP = Inverse model for Point-to-point motions.

In Fig. 8, the leftmost model with the highest error is the FNN. Since it cannot reproduce hysteresis, its prediction represents an average of the hysteresis loop. For the forward kinematic mapping, this results in it overestimating an increasing bending angle by an average of 2.5° for sinusoidal motions and 2.25° for point-to-point motions. For decreasing bending angles, it underestimates the bending angle by the same amounts. The inverse FNN models similarly produce an averaged prediction of tendon displacement.

For sinusoidal motions, all of the neural network models except FNN-HIB and HB1 provide excellent and statistically equivalent performance. Of these, the LSTM and GRU models provide the smallest mean and maximum errors considering both forward and inverse kinematics. For example, the LSTM model reduced forward mean and maximum prediction errors to 17% and 23% of the nonhysteretic FNN model predictions. Inverse mean and maximum prediction errors were reduced to 23% and 66% of the nonhysteretic model predictions.

For point-to-point motions, hybrid model HB2 provided the best error reduction, which was not as substantial as the reduction observed for sinusoidal motions. Specifically, the HB2 model reduced forward mean and maximum prediction errors to 43% and 55% of the nonhysteretic FNN model predictions. Inverse mean and maximum prediction errors were reduced to 39% and 68% of the nonhysteretic model predictions. The LSTM and hybrid HB1 models were second best and provided statistically equivalent error reduction. Of note, while the backlash model produced slightly larger errors than the LSTM and hybrid models, the difference may be acceptable in practice.

Fig. 9 compares the HB2 and FNN-HIB models with the non-hysteretic FNN model for a point-to-point motion trial. The effect of neglecting hysteresis with the FNN model is most obvious during the stationary periods. For the forward kinematics of Fig. 9(a), the non-hysteretic FNN model tends to overestimate tip angle when angle is increasing and underestimate it when tip angle is decreasing. For the inverse kinematics of Fig. 9(b), this relationship of over- and under-estimating tendon displacement is reversed. The FNN-HIB model adds an input buffer to the FNN model enabling it to reduce mean and maximum errors to 66% and 65% in the forward model. While mean error of the inverse model is also reduced to64% of the FNN model, the maximum error of the FNN-HIB model exceeds that of the FNN model.

IV. CONCLUSIONS

This paper investigated the hysteretic kinematic modeling of a single-tendon catheter, offering a comparative analysis of eight models, including two classical hysteresis models, four neural networks, and two hybrid models. Among the neural network models, the feedforward model is used as the non-hysteretic baseline since it can model complex nonlinear functions, but not time-dependent behavior.

In contrast to the neural network models, it was observed that the classical models were not as effective at modeling nonlinear behavior. This meant that while they could be used to directly produce the map from tendon displacement to bending angle, a two part mapping was needed to accurately produce tip positional coordinates. This additional complexity can be be viewed as a disadvantage of the classical models.

On the other hand, the classical models inherently exhibited the property that a constant input produced a constant output. In contrast, RNNs are not constrained in this way and the loss function had to be modified to explicitly penalize output motion when the input was constant.

The impact of anticipated motion type on model training and performance was examined. While a standard approach in modeling hysteresis is to perform training and testing with sinusoidal motions, the clinical applications that employ such motions are limited. Many applications consist of positioning the catheter tip at a sequence of positions. At each position, the catheter is held stationary.

It was observed that smaller modeling errors were obtained for models tested on the same type of motion they were trained on. In addition, errors were lower for sinusoidal motions than for point-to-point motions. This is to be expected since if one were to approximate point-to-point motions as sums of sinusoids, the approximations would require a much larger ranges of frequencies than those used in clinical sinusoidal motions. It is not unreasonable that models trained on a small range of frequencies can reproduce motions with those frequencies better than motions containing a much greater range of frequencies. Similarly, models trained on a larger range of frequencies will produce a good overall fit to motions with many frequencies, but not as good a fit for motions with a few frequencies when compared with models trained specifically on those frequencies.

For those applications that do involve sinusoidal motions, the LSTM and GRU models provide excellent forward and inverse kinematic models. The hybrid models also perform comparably, but since they are more complex, there is no advantage to using them. In contrast, the FNN-HIB model provided slightly worse behavior while also being more complex since its implementation requires an input buffer. The backlash and PI models were inferior to all of the hysteretic neural network models.

For applications using point-to-point motions, the HB2 model edged out the LSTM and HB1 models in reducing error. Interestingly, the backlash model performs almost as well. Its simplicity, using only four parameters, suggests its potential appropriateness for some point-to-point applications. If, however, the output variable is selected to be tip coordinate $x$ or $y$ , instead of $θ$ , then the backlash model does require the composite mapping of Fig. 6 to be used. The classical PI model and the FNN-HIB model provide inferior performance to all of the other models for point-to-point motions.

One limitation of this study was that the proximal portion of the catheter was constrained to be straight. While for transfemoral procedures, much of the proximal length is approximately straight, the distal portion inside the heart is often deflected by an outer sheath. While this deflection will change the hysteresis properties of the catheter, it is unlikely to change the relative performance of competing hysteresis models and so was not considered here. In practical use, however, proximal curvature should be considered (see, e.g., [7]).

Acknowledgments

This paper was recommended for publication by Editor Pietro Valdastri upon evaluation of the Associate Editor and Reviewers’ comments. This work was supported by the NIH under grant R01HL124020.

References

[1].Rucker DC and Webster RJ III, “Statics and dynamics of continuum robots with general tendon routing and external loading,” IEEE Transactions on Robotics, vol. 27, no. 6, pp. 1033–1044, 2011. [Google Scholar]
[2].Dupont PE, Lock J, Itkowitz B, and Butler D, “Design and Control of Concentric-Tube Robots,” IEEE Trans Robot, vol. 26, no. 2, pp. 209–225, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
[3].Chitalia Y, Donder A, and Dupont PE, “Modeling tendon-actuated concentric tube robots,” in 2023 International Symposium on Medical Robotics (ISMR), 2023, pp. 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
[4].Ha J, Fagogenis G, and Dupont P, “Effect of path history on concentric tube robot model calibration,” in The Hamlyn Symposium on Medical Robotics, 2017, p. 77. [Google Scholar]
[5].Chitalia Y, Deaton NJ, Jeong S, Rahman N, and Desai JP, “Towards fbg-based shape sensing for micro-scale and meso-scale continuum robots with large deflection,” IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 1712–1719, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
[6].Xiang P, Zhang J, Sun D, Qiu K, Fang Q, Mi X, Wang Y, Xiong R, and Lu H, “Learning-based high-precision force estimation and compliant control for small-scale continuum robot,” IEEE Transactions on Automation Science and Engineering, pp. 1–13, 2023. [Google Scholar]
[7].Kim D, Kim H, and Jin S, “Recurrent neural network with preisach model for configuration-specific hysteresis modeling of tendon-sheath mechanism,” IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 2763–2770, 2022. [Google Scholar]
[8].Wu D, Zhang Y, Ourak M, Niu K, Dankelman J, and Poorten EV, “Hysteresis modeling of robotic catheters based on long short-term memory network for improved environment reconstruction,” IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 2106–2113, 2021. [Google Scholar]
[9].Li X, Tiong AMH, Cao L, Lai W, Phan PT, and Phee SJ, “Deep learning for haptic feedback of flexible endoscopic robot without prior knowledge on sheath configuration,” International Journal of Mechanical Sciences, vol. 163, p. 105129, 2019. [Google Scholar]
[10].Tao G and Kokotović P, “Adaptive control of systems with backlash,” IFAC Proceedings Volumes, vol. 25, no. 14, pp. 87–93, 1992. [Google Scholar]
[11].Al Janaideh M and Krejčí P, “Prandtl-ishlinskii hysteresis models for complex time dependent hysteresis nonlinearities,” Physica B: Condensed Matter, vol. 407, no. 9, pp. 1365–1367, 2012. [Google Scholar]
[12].Gan J and Zhang X, “A review of nonlinear hysteresis modeling and control of piezoelectric actuators,” AIP Advances, vol. 9, no. 4, p. 040702, April 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
[13].Hassani V, Tjahjowidodo T, and Do TN, “A survey on hysteresis modeling, identification and control,” Mechanical Systems and Signal Processing, vol. 49, no. 1, pp. 209–233, 2014. [Google Scholar]
[14].Vörös J, “Identification of cascade systems with backlash,” International Journal of Control, vol. 83, no. 6, pp. 1117–1124, 2010. [Google Scholar]
[15].Al Janaideh M and Krejčí P, “An inversion formula for a prandtl-ishlinskii operator with time dependent thresholds,” Physica B: Condensed Matter, vol. 406, no. 8, pp. 1528–1532, 2011. [Google Scholar]
[16].Hochreiter S and Schmidhuber J, “Long Short-Term Memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 111997. [DOI] [PubMed] [Google Scholar]
[17].Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, and Bengio Y, “Learning phrase representations using rnn encoder-decoder for statistical machine translation,” arXiv preprint arXiv:1406.1078, 2014. [Google Scholar]
[18].Hao J, Duan J, Wang K, Hu C, and Shi C, “Inverse kinematic modeling of the tendon-actuated medical continuum manipulator based on a lightweight timing input neural network,” IEEE Transactions on Medical Robotics and Bionics, vol. 5, no. 4, pp. 916–928, 2023. [Google Scholar]
[19].Wang Y, McCandless M, Donder A, Pittiglio G, Moradkhani B, Chitalia Y, and Dupont PE, “Using neural networks to model hysteretic kinematics in tendon-actuated continuum robots,” in 2024 International Symposium on Medical Robotics (ISMR), 2024, pp. 1–7. [Google Scholar]
[20].Do T, Tjahjowidodo T, Lau M, Yamamoto T, and Phee S, “Hysteresis modeling and position control of tendon-sheath mechanism in flexible endoscopic systems,” Mechatronics, vol. 24, no. 1, pp. 12–22, 2014. [Google Scholar]

[R1] [1].Rucker DC and Webster RJ III, “Statics and dynamics of continuum robots with general tendon routing and external loading,” IEEE Transactions on Robotics, vol. 27, no. 6, pp. 1033–1044, 2011. [Google Scholar]

[R2] [2].Dupont PE, Lock J, Itkowitz B, and Butler D, “Design and Control of Concentric-Tube Robots,” IEEE Trans Robot, vol. 26, no. 2, pp. 209–225, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] [3].Chitalia Y, Donder A, and Dupont PE, “Modeling tendon-actuated concentric tube robots,” in 2023 International Symposium on Medical Robotics (ISMR), 2023, pp. 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] [4].Ha J, Fagogenis G, and Dupont P, “Effect of path history on concentric tube robot model calibration,” in The Hamlyn Symposium on Medical Robotics, 2017, p. 77. [Google Scholar]

[R5] [5].Chitalia Y, Deaton NJ, Jeong S, Rahman N, and Desai JP, “Towards fbg-based shape sensing for micro-scale and meso-scale continuum robots with large deflection,” IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 1712–1719, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] [6].Xiang P, Zhang J, Sun D, Qiu K, Fang Q, Mi X, Wang Y, Xiong R, and Lu H, “Learning-based high-precision force estimation and compliant control for small-scale continuum robot,” IEEE Transactions on Automation Science and Engineering, pp. 1–13, 2023. [Google Scholar]

[R7] [7].Kim D, Kim H, and Jin S, “Recurrent neural network with preisach model for configuration-specific hysteresis modeling of tendon-sheath mechanism,” IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 2763–2770, 2022. [Google Scholar]

[R8] [8].Wu D, Zhang Y, Ourak M, Niu K, Dankelman J, and Poorten EV, “Hysteresis modeling of robotic catheters based on long short-term memory network for improved environment reconstruction,” IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 2106–2113, 2021. [Google Scholar]

[R9] [9].Li X, Tiong AMH, Cao L, Lai W, Phan PT, and Phee SJ, “Deep learning for haptic feedback of flexible endoscopic robot without prior knowledge on sheath configuration,” International Journal of Mechanical Sciences, vol. 163, p. 105129, 2019. [Google Scholar]

[R10] [10].Tao G and Kokotović P, “Adaptive control of systems with backlash,” IFAC Proceedings Volumes, vol. 25, no. 14, pp. 87–93, 1992. [Google Scholar]

[R11] [11].Al Janaideh M and Krejčí P, “Prandtl-ishlinskii hysteresis models for complex time dependent hysteresis nonlinearities,” Physica B: Condensed Matter, vol. 407, no. 9, pp. 1365–1367, 2012. [Google Scholar]

[R12] [12].Gan J and Zhang X, “A review of nonlinear hysteresis modeling and control of piezoelectric actuators,” AIP Advances, vol. 9, no. 4, p. 040702, April 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] [13].Hassani V, Tjahjowidodo T, and Do TN, “A survey on hysteresis modeling, identification and control,” Mechanical Systems and Signal Processing, vol. 49, no. 1, pp. 209–233, 2014. [Google Scholar]

[R14] [14].Vörös J, “Identification of cascade systems with backlash,” International Journal of Control, vol. 83, no. 6, pp. 1117–1124, 2010. [Google Scholar]

[R15] [15].Al Janaideh M and Krejčí P, “An inversion formula for a prandtl-ishlinskii operator with time dependent thresholds,” Physica B: Condensed Matter, vol. 406, no. 8, pp. 1528–1532, 2011. [Google Scholar]

[R16] [16].Hochreiter S and Schmidhuber J, “Long Short-Term Memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 111997. [DOI] [PubMed] [Google Scholar]

[R17] [17].Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, and Bengio Y, “Learning phrase representations using rnn encoder-decoder for statistical machine translation,” arXiv preprint arXiv:1406.1078, 2014. [Google Scholar]

[R18] [18].Hao J, Duan J, Wang K, Hu C, and Shi C, “Inverse kinematic modeling of the tendon-actuated medical continuum manipulator based on a lightweight timing input neural network,” IEEE Transactions on Medical Robotics and Bionics, vol. 5, no. 4, pp. 916–928, 2023. [Google Scholar]

[R19] [19].Wang Y, McCandless M, Donder A, Pittiglio G, Moradkhani B, Chitalia Y, and Dupont PE, “Using neural networks to model hysteretic kinematics in tendon-actuated continuum robots,” in 2024 International Symposium on Medical Robotics (ISMR), 2024, pp. 1–7. [Google Scholar]

[R20] [20].Do T, Tjahjowidodo T, Lau M, Yamamoto T, and Phee S, “Hysteresis modeling and position control of tendon-sheath mechanism in flexible endoscopic systems,” Mechatronics, vol. 24, no. 1, pp. 12–22, 2014. [Google Scholar]

PERMALINK

Comparison of Classical, Neural Network and Hybrid Models for Hysteretic Single-tendon Catheter Kinematics

Yuan Wang

Pierre E Dupont

Abstract

I. Introduction

Fig. 1.

II. Modeling Hysteretic Kinematics