Abstract
Modeling and controlling cable-driven snake robots is a challenging problem due to nonlinear mechanical properties such as hysteresis, variable stiffness, and unknown friction between the actuation cables and the robot body. This challenge is more significant for snake robots in ophthalmic surgery applications, such as the Improved Integrated Robotic Intraocular Snake (I2RIS), given its small size and lack of embedded sensory feedback. Data-driven models take advantage of global function approximations, reducing complicated analytical models’ challenge and computational costs. However, their performance might deteriorate in case of new data unseen in the training phase. Therefore, adding an adaptation mechanism might improve these models’ performance during snake robots’ interactions with unknown environments. In this work, we applied a model predictive path integral (MPPI) controller on a data-driven model of the I2RIS based on the Gaussian mixture model (GMM) and Gaussian mixture regression (GMR). To analyze the performance of the MPPI in unseen robot-tissue interaction situations, unknown external disturbances and environmental loads are simulated and added to the GMM-GMR model. These uncertainties of the robot model are then identified online using a radial basis function (RBF) whose weights are updated using an extended Kalman filter (EKF). Simulation results demonstrated the robustness of the optimal control solutions of the MPPI algorithm and its computational superiority over a conventional model predictive control (MPC) algorithm.
I. INTRODUCTION
Epiretinal Membrane (ERM) can develop due to various causes, including diabetic retinopathy, retinal vein occlusion, and ocular inflammation following vitreous separation [1]. The progression of ERM leads to metamorphopsia, reduced visual acuity, and central vision loss, significantly interfering with and decreasing the quality of everyday life [2]. Surgeons typically perform the ERM peeling procedure manually using an inner limiting membrane (ILM) micro-forceps, needle picks, and micro-vitreoretinal blades. Multiple attempts at engaging and peeling the membrane are often necessary, which increases the risk of iatrogenic damage. Focal retinal hemorrhages are also reported due to unintentional pinches to the nerve fiber layer caused by hand tremors or human errors [3], and significant injuries are possible.
Several robotic systems were developed to minimize human error, including the Steady-Hand Eye Robot (SHER) 2.0 [5], SHER 2.1 [6], the PRECEYES Surgical System [7], and the Robot-Assisted Minimally Invasive Eye Surgery (iRAM!S) [8]. SHER 3.0 [9] (see Fig. 1), a next-generation cooperative control robotic system, provides enhanced tremor reduction, precise localization, easy integration into surgical environments, and reduced costs and physical footprint. Yet, higher dexterity at the distal end is advantageous in enabling ERM peeling at an optimum approach angle and accessing hard-to-reach structures in the eye.
Fig. 1:

The Steady-Hand Eye Robot (SHER) 3.0 integrated with the I2RIS robot: (A) the real robots, where I2RIS is attached to the end effector of SHER 3.0. (B) The digital twin developed in AMBF [4] provides a useful simulation environment for testing control strategies. (C) The digital twin of the I2RIS alone, and (D) the articulated segment and the gripper of the I2RIS.
Continuum robots, known for their versatility, flexibility, and small size, showed great promise in Minimally Invasive Surgeries (MIS), reducing invasive operations, postoperative pain, surgery times, and costs [10]. These promising capabilities could benefit ophthalmic surgeries as well. For example, the Improved Integrated Robotic Intraocular Snake (I2RIS) [11] adds a small wrist-like mechanism with pitch and yaw degrees of freedom (DoFs) and gripping. Integrating I2RIS with SHER 3.0 (see Fig. 1) can increase the overall dexterity of the SHER 3.0 during membrane peeling procedures. As a result, the combination of SHER 3.0 and I2RIS can optimize surgical accuracy and efficiency while minimizing patient harm caused by human error. Moreover, the I2RIS can provide fine and delicate manipulation and optimize the approach angle for tissue grasp, resulting in reduced tractional forces and hemorrhage during ERM peeling. However, developing kinematic and dynamic models for continuum robots remains challenging due to their complexity compared to traditional rigid-link models, as they involve a large number of DoFs [12], [13], [14]. Furthermore, embedding sensors in continuum robots is challenging due to their small size and limited space [15]. Without reliable feedback from robot configuration, real-time estimation based on complex model-based algorithms becomes challenging, particularly in the presence of uncertainties [12], [16].
Classical model-based control methods, such as Cosserat rod theory and rigid link models, are applied to the kinematic modeling of continuum robots. However, these methods involve trade-off between accuracy and computational complexity, especially under environmental interaction or dynamic contact [12], [15]. The piecewise constant curvature (PCC) model [17] requires ideal assumptions, such as neglecting friction and the mass of the robot, which reduces the accuracy of the estimates [18].
Model-free control methods were developed to overcome these limitations. For example, methods such as local Gaussian Regression [19], Jacobian matrix updates [20] and reinforcement learning [21], [22] provide adaptation to the system. Additionally, researchers employed adaptive algorithms during runtime to control manipulators in combination with Model Predictive Control (MPC). This approach aimed to improve the dynamic performance and reduce steady-state error in continuum robots under various conditions, such as restrictions on design and dynamics [23]. For example, Nonlinear Model Predictive Control (NMPC) is used to control concentric tube robots (CTRs) [24]. Pneumatically actuated continuum robots are controlled by Nonlinear Evolutionary Model Predictive Control (NEMPC), which incorporates Deep Neural Networks (DNN) [25] for dynamics approximation. The main drawback of MPC is its computational demand, resulting in significantly longer processing times for each sampling period [26].
Model Predictive Path Integral (MPPI) control is an advanced, model-based optimal control framework that utilizes stochastic trajectory sampling to generate fast, near-optimal control inputs for complex nonlinear and stochastic systems [27], [28], [29]. Unlike traditional approaches like MPC, MPPI imposes no specific restrictions on the form of the state cost function [30], making it more flexible for a wide range of applications. By continuously generating a series of sampled trajectories through the system’s dynamic model and evaluating their associated costs, MPPI computes an open-loop control sequence by integrating the costs along these paths. One key advantage of MPPI over MPC is its reliance on a gradient-free, sample-based numerical strategy, which makes it particularly effective for handling nonlinear dynamics and non-convex cost functions, where conventional optimization techniques like MPC may struggle. MPPI also benefits from its computational efficiency in certain scenarios, as it avoids solving complex optimization problems at each time step, instead relying on stochastic sampling, which can be parallelized and, using a graphics processing unit (GPU) [31], adapted to real-time applications.
In this paper, we simulated an MPPI control algorithm on a data-driven probabilistic model of the I2RIS based on Gaussian mixture models (GMM) and Gaussian mixture regression (GMR) [32]. The contributions of this work are as follows:
We employed a radial basis function identifier to estimate the states of the I2RIS, pitch, and yaw DoFs coming from the GMM-GMR model.
We designed an extended Kalman filter to update the weights of the RBF identifier to adapt to the unknown external disturbances and synthesized environmental loads added to the GMM-GMR model, trying to improve the adaption of the robot during real tissue interaction.
We presented a discrete-time MPPI controller for the I2RIS model. The MPPI used the estimated states of the I2RIS through the EKF-RBF identifier. We also compared the performance of the MPPI with an MPC in a trajectory tracking problem.
Of note, the previously developed GMM-GMR model is trained offline using experimental data from the robot’s free motion, whose accuracy may be significantly impacted in the presence of unknown environmental loads during real tissue interaction. A Kalman filter can provide learning speeds faster than conventional RBF using gradient descent methods [33].
This paper is organized as follows. Section II describes the materials and methods. Section III presents and discusses the simulation results. Section IV concludes the paper.
II. MATERIALS AND METHODS
To analyze the performance of the MPPI in controlling the I2RIS robot, we used a probabilistic model of the robot developed in a previous study [32]. This model was based on Gaussian mixture models (GMM) and Gaussian mixture regression (GMR) with 15 Gaussian functions () whose parameters (prior probability , mean value , and covariance matrix for the k-th Gaussian function) were optimized by the Expectation Maximization (EM) algorithm [34] using experimental data collected from the I2RIS control inputs (motor angular position) and the output states (pitch and yaw DoFs) measured by two microscopes (see Fig. 2) [32].
Fig. 2:

(A) An overview of the experimental setup, (B) the I2RIS actuation unit, and (C) an image of the articulated segment and the gripper of the I2RIS captured by the camera.
A. System Model
We assume that the robot motion could be characterized by a discrete-time nonlinear dynamic between the control action (motor angular position in ) at time step , denoted as , and the snake tip bending angle for each of the pitch and yaw DoFs as follows:
| (1) |
| (2) |
where is an unknown nonlinear function representing the system dynamics, is the process noise, a zero-mean Gaussian random sequence with covariance , which represents the environmental disturbances and unmodeled dynamics, is an observation (or measurement) of the true state at time is the observation model, and is the observation noise, a zero-mean Gaussian white noise with covariance that is uncorrelated with the process noise, i.e. [35]. Based on the current system setup with two monocular cameras capturing the pitch and yaw motions (see Fig. 2), we can assume that the states are fully observable () when the snake robot is outside the eye. If inside the eye, we may still have a full or partial observation of the tip orientation using optical coherence tomography (OCT) imaging [36], [37].
We also assume that the pitch and yaw motions could be decoupled, each of them modeled by (1) () with a different nonlinear function depending on their cable tension and friction conditions.
B. RBF Identifier
Our objective is to identify the nonlinear function to estimate the robot states , trying to control it toward a desired state . To this end, we use a radial basis function due to its universal function approximation capability and fast learning process [33]. The RBF approximation of the function could be defined as:
| (3) |
where is the RBF estimation of is the number of basis functions, are the weights associated with each basis function, is the -th basis function with center and width , and [] is the state-input pair vector. The centers are determined using -means clustering algorithm, and the widths are set based on the maximum distance between the cluster centers [38].
C. Extended Kalman Filter for RBF Weights Update
To adapt the robot identifier to the snake robot’s external disturbances and environmental loads, the RBF weights are updated online using an extended Kalman filter. The state of the Kalman filter is defined as the vector of RBF weights, .
The system model for the Kalman filter is the dynamics of the RBF in discrete time as follows:
| (4) |
| (5) |
in which (4) specifies that the weights of the RBF are modeled as a stationary process with a process noise with covariance matrix , and (5) is the observation (measurement) model given by the predicted state difference in which is the desired response of the RBF identifier in (3) [39], [40], [41]. Here, is the vector of RBF basis functions, and is the measurement noise with covariance .
The Kalman filter update law for the states (RBF weights) based on the observed data is formulated as follows:
1). Prediction Step:
| (6) |
| (7) |
2). Measurement Update:
| (8) |
| (9) |
| (10) |
| (11) |
where is the updated estimate of the weights, is the updated error covariance matrix, and is the Kalman gain.
D. MPPI Control
The model predictive path integral is a model-based optimal control framework that uses stochastic sampling of trajectories to generate fast optimal control solutions for nonlinear and stochastic systems without restrictions on the state cost function form. MPPI leverages a sampling-based control strategy that constantly generates a sequence of sampled trajectories using a dynamic system model and computes the costs of the sampled trajectories. An open-loop optimal control sequence is then generated by evaluating the path integral of the sampled trajectories [27]. An MPPI control algorithm could be implemented on a dynamic system by taking the following steps:
1). Control Sampling:
At each time step, MPPI generates sampled trajectories with a finite time horizon by rolling out random sequence of control inputs , where stands for the index of the -th sampled trajectory. At each time step, the control sequence for the -th sampled trajectory, , is sampled over a finite horizon from the following distribution [42]:
| (12) |
where is the variance of control sampling noise determining how wide the random samples of the control inputs are. The system state for the -th trajectory, , corresponding to each control sequence in (12) is propagated using the system dynamics as follows:
| (13) |
where is estimated by the RBF identifier with EKF update (3)–(11) to adapt to the external disturbances and unmodeled dynamics.
2). Cost Function:
The goal of the MPPI as a nonlinear model predictive control algorithm is to generate an optimal control policy that minimizes the following cost function over a finite horizon :
| (14) |
where is a quadratically bounded terminal cost and is the stage cost defined as:
| (15) |
| (16) |
in which is positive semidefinite, and and are positive definite weighting matrices [43]. To follow a desired trajectory, we select discrete points (states) along the trajectory, and for every iteration, we set the current state of the robot as or the initial state of the optimization problem, and the next point on the trajectory as the . We keep updating and until all points on the trajectory are tracked.
![]() |
3). Cost Evaluation:
MPPI solves the above optimal control problem numerically based on a sample-based gradient-free stochastic strategy, making it suitable for systems with nonlinear dynamics and non-convex cost functions. For each sampled trajectory, the cumulative cost is computed as:
| (17) |
To ensure that those sampled control sequences that resulted in sampled trajectories with higher costs should have less importance in determining the optimal control solution , we define control weights for the -th trajectory as the exponential of the negative cost on that trajectory, computed as follows:
| (18) |
where is a temperature parameter.
4). Control Update:
The optimal control solution of the MPPI algorithm is updated by averaging the sampled control sequences weighted by the exponentiated cost in (18) as follows:
| (19) |
This optimal control sequence is then applied to the system. For a trajectory tracking problem, Algorithm 1 summarizes the proposed MPPI controller and the RBF identifier with EKF weight updates.
III. RESULTS AND DISCUSSIONS
In this study, we evaluate the performance of the MPPI controller on a probabilistic (GMM-GMR) model of the I2RIS robot developed in a previous study [32] (see Fig. 3). We identify the output of this GMM-GMR model by an RBF whose weights are updated online by an extended Kalman filter to adapt to the potential external disturbances and environmental loads caused during robot-tissue interactions. These external disturbances are modeled as with a variable noise covariance and the unknown environmental loads are modeled as , where is the robot state coming from the GMM-GMR model. We intentionally made these disturbances dependent on the robot state , as it is seen in the real snake robot that the more it is bent, the more nonlinear behavior is observed due to the increase in the cables’ tension and friction [32].
Fig. 3:

Results of the GMM-GMR model in estimating the pitch (a) and yaw (b) degrees for different values of motor shaft angles (rad). A total number of Gaussian functions are used in this model. The blue dots are experimental data points for the input-output pairs (), and the red curve is the GMM-GMR model output.
The RBF identifier has 10 basis functions (), and the weights are initialized by a multivariate normal distribution with mean and covariance as .
Due to the Kalman filter’s computational superiority over gradient descent methods [33], we used an extended Kalman filter to estimate and update the RBF weights. The extended Kalman filter gains are initialized with for the initial estimation of the error covariance, for the process noise covariance, and for the measurement noise covariance with .
The parameters of the MPPI control algorithm used in this study are as follows. We used sampled trajectories over a horizon , a temperature value of , the variance of control noise , terminal cost weight , and stage cost weights and .
We also compared the MPPI controller with a conventional MPC [44] in the presence of external disturbances, measurement noise, and simulated environmental loads.
We defined five desired trajectories, horizontal and vertical ovals (⬭), and infinity signs (∞) with 5 and 10 deg amplitudes for pitch and yaw DOFs, and a star (☆) with an inner and outer radius of 5 and 10 deg. We discretized each trajectory into points and solved the optimal control problem between every two consecutive points, considering the one as the initial state and the next point to be reached on the trajectory, , as . Once the optimal control solution is found, it is applied to the robot’s model (GMM-GMR) to move it toward the next desired point. The new state of the robot is then set as the current state for the next iteration, and another untracked point is set as a new desired state, and so on. This procedure is repeated until all points along a desired trajectory are tracked (see Algorithm 1). The forceps’ approach angle significantly affects successful ERM peeling [45]; hence, such maneuverabilities could provide an optimum approach angle and reduce potential hemorrhages. Fig. 4 shows the performance of the MPPI and MPC in tracking the desired trajectories. Each trajectory is tracked 5 times by each controller. The results are summarized in Table I.
Fig. 4:

Trajectory tracking performance of the MPPI and MPC on the snake robot tip orientation model (GMM-GMR) in the presence of synthesized unknown external disturbances and environmental loads. The MPPI controller uses the estimation of the robot states from the RBF identifier, whose weights are updated using an extended Kalman. Five desired trajectories are defined, including (a) a horizontal oval with 10 and 5 deg for pitch and yaw, (b) a vertical oval, (c) a horizontal infinity sign, (d) a vertical infinity sign with a similar radius for pitch and yaw, and (e) a star with an inner and outer radius of 5 and 10 deg. Each desired trajectory (red) is tracked five times by both MPC (green) and MPPI (blue) controllers. MPPI showed a more robust behavior in the presence of unknown external disturbances with significant superiority in computational efficiency and speed.
TABLE I:
Comparison of MPC and MPPI in tracking five trajectories in terms of computation time and RMSE (deg) for pitch and yaw.
| Controller Type | Desired Trajectory | Computation Time (s) | RMSE Pitch (deg) | RMSE Yaw (deg) |
|---|---|---|---|---|
|
| ||||
| MPC | ⬭ | 1180.46 ± 50.24 | 0.13 | 0.36 |
|
|
1038.62 ± 68.12 | 0.08 | 0.28 | |
| ∞ | 1192.43 ± 47.94 | 0.21 | 0.10 | |
|
|
1134.09 ± 58.73 | 0.19 | 0.14 | |
| ☆ | 992.36 ± 41.06 | 0.77 | 0.82 | |
|
| ||||
| MPPI | ⬭ | 227.64 ± 24.31 | 0.06 | 0.26 |
|
|
217.29 ± 29.48 | 0.07 | 0.27 | |
| ∞ | 222.47 ± 32.12 | 0.17 | 0.07 | |
|
|
201.01 ± 21.86 | 0.10 | 0.18 | |
| ☆ | 200.47 ± 26.04 | 0.32 | 0.44 | |
The simulations are performed on a CPU computer (Intel(R) Core(TM) i5–8250U 1.6GHz 1.8 GHz) in MATLAB (2021b), and the computational time for MPC and MPPI controllers are compared in Table I. It is observed that the MPPI is significantly faster than the MPC () and outperforms the MPC in terms of tracking RMSE. In a future study, we aim to implement this algorithm on both the digital twin and the real robot by further improving the computational speed of the MPPI using GPUs. Due to recent advances in parallel programming with GPUs, MPPI can generate optimal control solutions as fast as 50 Hz [46], which is fast enough for our application.
IV. CONCLUSIONS
In this study, we designed and simulated a model predictive path integral control algorithm for the I2RIS robot. We used Gaussian mixture models and Gaussian mixture regression as a model for simulating the snake robot motion. The parameters of this GMM-GMR model were optimized using experimental data from the snake robot in a previous study. Here, we used a radial basis function for online identification of the robot states (the GMM-GMR model). The weights of the RBF are updated using an extended Kalman filter. Moreover, we compared the performance of the proposed MPPI controller with a conventional MPC in tracking five different trajectories. It is observed that the MPPI controller, along with the EKF-RBF identifier, showed more accurate and robust behavior compared to MPC in the presence of unknown external disturbances and environmental loads. Also, MPPI outperformed MPC in terms of computational time and returned optimal control solutions about 5 times faster than MPC. In the future, we plan to implement this control strategy on the I2RIS robot using a GPU to take advantage of the parallel nature of MPPI for further improvement of computational performance.
Acknowledgments
This work was supported by the U.S. National Institutes of Health under grant numbers R01EB023943 and R01EB034397 and partially by JHU internal funds.
References
- [1].Fung AT, Galvin J, and Tran T, “Epiretinal membrane: a review,” Clinical & Experimental Ophthalmology, vol. 49, no. 3, pp. 289–308, 2021. [DOI] [PubMed] [Google Scholar]
- [2].Kanukollu VM and Agarwal P, “Epiretinal membrane,” in StatPearls [Internet]. StatPearls Publishing, 2023. [PubMed] [Google Scholar]
- [3].Lumi X, Petrovski BE, and Petrovski G, “Simple new technique for macular pucker peel without forceps,” Frontiers in Medicine, vol. 9, p. 947578, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Munawar A and Fischer GS, “An asynchronous multi-body simulation framework for real-time dynamics, haptics and learning with application to surgical robots,” in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 6268–6275. [Google Scholar]
- [5].Fleming I, Balicki M, Koo J, Iordachita I, Mitchell B, Handa J, Hager G, and Taylor R, “Cooperative robot assistant for retinal microsurgery,” in Medical Image Computing and Computer-Assisted Intervention–MICCAI 2008: 11th International Conference, New York, NY, USA, September 6–10, 2008, Proceedings, Part II 11. Springer, 2008, pp. 543–550. [DOI] [PubMed] [Google Scholar]
- [6].Zhao B, Esfandiari M, Usevitch DE, Gehlbach P, and Iordachita I, “Human-robot interaction in retinal surgery: A comparative study of serial and parallel cooperative robots,” in 2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN). IEEE, 2023, pp. 2359–2365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Meenink T, Naus G, de Smet M, Beelen M, and Steinbuch M, “Robot assistance for micrometer precision in vitreoretinal surgery,” Investigative Ophthalmology & Visual Science, vol. 54, no. 15, pp. 5808–5808, 2013. [Google Scholar]
- [8].Nasseri MA, Eder M, Nair S, Dean E, Maier M, Zapp D, Lohmann CP, and Knoll A, “The introduction of a new robot for assistance in ophthalmic surgery,” in 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, 2013, pp. 5682–5685. [DOI] [PubMed] [Google Scholar]
- [9].Alamdar A, Usevitch DE, Wu J, Taylor RH, Gehlbach P, and Iordachita I, “Steady-hand eye robot 3.0: Optimization and benchtop evaluation for subretinal injection,” IEEE transactions on medical robotics and bionics, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Iqbal F, Esfandiari M, Amirkhani G, Hoshyarmanesh H, Lama S, Tavakoli M, and Sutherland G, “Continuum and soft robots in minimally invasive surgery: A systematic review,” IEEE Access, 2025. [Google Scholar]
- [11].Jinno M and Iordachita I, “Improved integrated robotic intraocular snake: Analyses of the kinematics and drive mechanism of the dexterous distal unit,” Journal of medical robotics research, vol. 6, no. 01n02, p. 2140001, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Russo M, Sadati SMH, Dong X, Mohammad A, Walker ID, Bergeles C, Xu K, and Axinte DA, “Continuum robots: An overview,” Advanced Intelligent Systems, vol. 5, no. 5, p. 2200367, 2023. [Google Scholar]
- [13].Rezaeian S, Badie B, and Sheng J, “A telescopic tendon-driven needle robot for minimally invasive neurosurgery,” in 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2023, pp. 10 301–10 307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Ceja JA, Rezaeian S, Vélez-Cordero JR, Hernández-Cordero J, Badie B, and Sheng J, “Towards a robotically steerable laser ablation probe,” in 2024 International Symposium on Medical Robotics (ISMR). IEEE, 2024, pp. 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].da Veiga T, Chandler JH, Lloyd P, Pittiglio G, Wilkinson NJ, Hoshiar AK, Harris RA, and Valdastri P, “Challenges of continuum robots in clinical context: a review,” Progress in Biomedical Engineering, vol. 2, no. 3, p. 032003, 2020. [Google Scholar]
- [16].Amirkhani G, Goodridge A, Esfandiari M, Phalen H, Ma JH, Iordachita I, and Armand M, “Design and fabrication of a fiber bragg grating shape sensor for shape reconstruction of a continuum manipulator,” IEEE Sensors Journal, vol. 23, no. 12, pp. 12 915–12 929, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].López-González A, Tejada JC, and López-Romero J, “Review and proposal for a classification system of soft robots inspired by animal morphology,” Biomimetics, vol. 8, no. 2, p. 192, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Wang Z, Wang T, Zhao B, He Y, Hu Y, Li B, Zhang P, and Meng MQ-H, “Hybrid adaptive control strategy for continuum surgical robot under external load,” IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 1407–1414, 2021. [Google Scholar]
- [19].Fang G, Tian Y, Yang Z-X, Geraedts JM, and Wang CC, “Efficient jacobian-based inverse kinematics with sim-to-real transfer of soft robots by learning,” IEEE/ASME Transactions on Mechatronics, vol. 27, no. 6, pp. 5296–5306, 2022. [Google Scholar]
- [20].Yip MC and Camarillo DB, “Model-less hybrid position/force control: a minimalist approach for continuum manipulators in unknown, constrained environments,” IEEE Robotics and Automation Letters, vol. 1, no. 2, pp. 844–851, 2016. [Google Scholar]
- [21].Kargin TC and Kołota J, “A reinforcement learning approach for continuum robot control,” Journal of Intelligent & Robotic Systems, vol. 109, no. 4, p. 77, 2023. [Google Scholar]
- [22].Karimi Z, Ho S-H, Thach B, Kuntz A, and Brown DS, “Reward learning from suboptimal demonstrations with applications in surgical electrocautery,” in 2024 International Symposium on Medical Robotics (ISMR), 2024, pp. 1–7. [Google Scholar]
- [23].Hyatt P, Johnson CC, and Killpack MD, “Model reference predictive adaptive control for large-scale soft robots,” Frontiers in Robotics and AI, vol. 7, p. 558027, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Khadem M, O’Neill J, Mitros Z, Da Cruz L, and Bergeles C, “Autonomous steering of concentric tube robots via nonlinear model predictive control,” IEEE Transactions on Robotics, vol. 36, no. 5, pp. 1595–1602, 2020. [Google Scholar]
- [25].Hyatt P and Killpack MD, “Real-time nonlinear model predictive control of robots using a graphics processing unit,” IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 1468–1475, 2020. [Google Scholar]
- [26].Amouri A, Cherfia A, and Merabti H, “Nonlinear model predictive control of a class of continuum robots using kinematic and dynamic models,” FME Transactions, vol. 50, no. 2, 2022. [Google Scholar]
- [27].Williams G, Aldrich A, and Theodorou EA, “Model predictive path integral control: From theory to parallel computation,” Journal of Guidance, Control, and Dynamics, vol. 40, no. 2, pp. 344–357, 2017. [Google Scholar]
- [28].Wang Z, Williams G, and Theodorou EA, “Information theoretic model predictive control on jump diffusion processes,” in 2019 American Control Conference (ACC). IEEE, 2019, pp. 1663–1670. [Google Scholar]
- [29].Akshay P, Vrushabh D, Sonam K, Wagh S, and Singh N, “Hamiltonian monte carlo based path integral for stochastic optimal control,” in 2020 28th Mediterranean Conference on Control and Automation (MED). IEEE, 2020, pp. 254–259. [Google Scholar]
- [30].Pezzato C, Salmi C, Spahn M, Trevisan E, Alonso-Mora J, and Corbato CH, “Sampling-based model predictive control leveraging parallelizable physics simulations,” arXiv preprint arXiv:2307.09105, 2023. [Google Scholar]
- [31].Williams G, Rombokas E, and Daniel T, “Gpu based path integral control with learned dynamics,” arXiv preprint arXiv:1503.00330, 2015. [Google Scholar]
- [32].Esfandiari M, Zhou Y, Dehghani S, Hadi M, Munawar A, Phalen H, Usevitch DE, Gehlbach P, and Iordachita I, “A data-driven model with hysteresis compensation for I2RIS robot,” in 2024 International Symposium on Medical Robotics (ISMR). IEEE, 2024, pp. 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Simon D, “Training radial basis neural networks with the extended Kalman filter,” Neurocomputing, vol. 48, no. 1–4, pp. 455–475, 2002. [Google Scholar]
- [34].Moon TK, “The expectation-maximization algorithm,” IEEE Signal Processing Magazine, vol. 13, no. 6, pp. 47–60, 1996. [Google Scholar]
- [35].Stengel RF, Optimal control and estimation. Courier Corporation, 1994. [Google Scholar]
- [36].Arikan D, Zhang P, Sommersperger M, Dehghani S, Esfandiari M, Taylor RH, Nasseri MA, Gehlbach P, Navab N, and Iordachita I, “Real-time deformation-aware control for autonomous robotic subretinal injection under ioct guidance,” arXiv preprint arXiv:2411.06557, 2024. [Google Scholar]
- [37].—, “Towards motion compensation in autonomous robotic subretinal injections,” arXiv preprint arXiv:2411.18521, 2024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Yang X, Li Y, Sun Y, Long T, and Sarkar TK, “Fast and robust RBF neural network based on global k-means clustering with adaptive selection radius for sound source angle estimation,” IEEE Transactions on Antennas and Propagation, vol. 66, no. 6, pp. 3097–3107, 2018. [Google Scholar]
- [39].Haykin S, Kalman filtering and neural networks. John Wiley & Sons, 2004. [Google Scholar]
- [40].Sum J, Leung C-S, Young GH, and Kan W-K, “On the Kalman filtering method in neural network training and pruning,” IEEE Transactions on Neural Networks, vol. 10, no. 1, pp. 161–166, 1999. [DOI] [PubMed] [Google Scholar]
- [41].Pesce V, Silvestrini S, and Lavagna M, “Radial basis function neural network aided adaptive extended Kalman filter for spacecraft relative navigation,” Aerospace Science and Technology, vol. 96, p. 105527, 2020. [Google Scholar]
- [42].Tao C, Kim H, and Hovakimyan N, “RRT guided model predictive path integral method,” in 2023 American Control Conference (ACC). IEEE, 2023, pp. 776–781. [Google Scholar]
- [43].Köhler J, Zeilinger MN, and Grüne L, “Stability and performance analysis of NMPC: Detectable stage costs and general terminal costs,” IEEE Transactions on Automatic Control, vol. 68, no. 10, pp. 6114–6129, 2023. [Google Scholar]
- [44].Esfandiari M, Chan S, Sutherland G, and Westwick D, “Nonlinear model predictive control of robot manipulators using quasi-LPV representation,” in 2019 7th International Conference on Control, Mechatronics and Automation (ICCMA). IEEE, 2019, pp. 116–120. [Google Scholar]
- [45].Chang H-H, Chen S-J, Hsu C-A, and Chou Y-B, “Efficiency and safety of internal limiting membrane peeling with different forceps for macular disease,” Retina, pp. 10–1097, 2022. [DOI] [PubMed] [Google Scholar]
- [46].Pravitra J, Ackerman KA, Cao C, Hovakimyan N, and Theodorou EA, “L1-adaptive MPPI architecture for robust and agile control of multirotors,” in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2020, pp. 7661–7666. [Google Scholar]

