Abstract
Robotic systems frequently operate under changing dynamics, such as driving across varying terrain, encountering sensing and actuation faults, or navigating around humans with uncertain and changing intent. In order to operate effectively in these situations, robots must be capable of efficiently estimating these changes in order to adapt at the decision-making, planning, and control levels. Typical estimation approaches maintain a fixed set of candidate models at each time step; however, this can be computationally expensive if the number of models is large. In contrast, we propose a novel algorithm that employs an adaptive model set. We leverage the idea that the current model set must be expanded if its models no longer sufficiently explain the sensor measurements. By maintaining only a small subset of models at each time step, our algorithm improves on efficiency; at the same time, by choosing the appropriate models to keep, we avoid compromising on performance. We show that our algorithm exhibits higher efficiency in comparison to several baselines, when tested on simulated manipulation, driving, and human motion prediction tasks, as well as in hardware experiments on a 7 DOF manipulator.
Keywords: Probabilistic Inference, Motion and Path Planning, Human-Aware Motion Planning
I. Introduction
Whether operating on the road, on a deep space exploration mission to a distant world such as Europa, or in a household around people, robots frequently face changing dynamics. These changes arise for a variety of reasons, such as traversing changing terrains, faults induced by wear and tear from extreme operating conditions, or navigating around people with uncertain and changing intentions.
In these situations, we can typically obtain models of each dynamics mode via first-principles or data-driven approaches. Nevertheless, during operation, the robot will have uncertainty about which dynamics mode is currently occurring. In order to plan effectively under this uncertainty, we must estimate the dynamics based on sensor measurements. This estimation problem can be posed as a general filtering problem over the space of possible dynamics models. However, this set of models is typically large in practice. Furthermore, since the dynamics switch over time, an optimal estimator must track all possible mode sequences, the number of which grows geometrically in the time step. While there exist approximate estimators [1, 2], these typically still require large model sets at estimation time, which can be computationally expensive to manage.
In this work, we propose to use only a small subset of models for estimation at each time step. The fundamental challenge with such an approach, however, is that the best model may not be in this subset. To address this, our key idea is that the robot can detect when none of the current models sufficiently explain the sensor measurements, which in turn serves as an indicator of when to expand the model set. Equipped with this idea, we propose a multiple model estimator with a novel mechanism for adapting the model set. Starting with a small subset of models, our algorithm only expands that set when the true observations are assigned low likelihood under all models in the current model set. To determine which model to add, we measure the predictive performance of each model currently not in the set and add the one that assigns the highest likelihood to the true observations. Further, to keep the subset as small as possible, we remove any models whose low posterior probability indicates they are no longer needed to explain the measurements; should they become necessary again, we will detect this and add them back later on.
We experimentally evaluate the efficiency, accuracy, and robustness of our algorithm in simulation across the following domains: trajectory tracking for 3 DOF and 6 DOF manipulators that encounter actuation faults; trajectory tracking for a skid-steering vehicle driving across uncertain and changing terrains; and trajectory planning for a Dubins’ car-like robot navigating around a human with changing intentions. Through these evaluations, we show that our adaptive estimation algorithm is computationally favorable compared to non-adaptive estimators, without sacrificing estimation performance. Additionally, we found that our adaptive estimator was actually more accurate at predicting the true system mode in all experiments. We attribute this to an additional layer of filtering in our mechanism for adapting the model set, which provides more stability in predicting the most likely mode.
II. Background and Related Work
In this work, we wish to estimate the state xk at time k, given the sequence of measurements observed, y0:k, and controls taken, u0:k. We assume the system evolves according to:
x_{k+1} = f_{m_k}(x_k, u_k) + w_k,   y_k = g_{m_k}(x_k) + v_k   (1)
where m_k ∈ {1, 2, …, N} is the mode of the system at time k, and w_k and v_k denote the process and measurement noise, respectively. We assume access to a set of candidate models that could characterize the dynamics mode of the robot, obtained from first-principles or identified via data-driven approaches. We further assume that the state and mode are initially unknown. So, at any given time k, we must simultaneously estimate the state and mode of the system. Note that we do, however, assume full knowledge of the system dynamics in Eq. 1.
Consider the state estimator of xk, defined to be the expected state given the sequence of measurements taken up to time k. Then
x̂_k = E[x_k ∣ y_{0:k}] = Σ_{m_{0:k}} E[x_k ∣ y_{0:k}, m_{0:k}] · Pr(m_{0:k} ∣ y_{0:k})   (2)
Observe that such an estimator requires enumerating all possible mode sequences of length k + 1, or N^{k+1} sequences, and thus typically cannot be implemented [3-5]. Even in the case where we make the simplifying assumption that we know the state xk, and only need to estimate mk, this geometric complexity remains.
Due to these issues, prior work has focused on developing approximate methods, which do not track all N^{k+1} mode sequences to estimate the state and/or mode. We broadly divide such related work into fixed model set, online model learning, skill learning, and adaptive model set approaches.
Fixed model set.
There has been a wealth of research on estimation algorithms that maintain a fixed set of models (pre-defined or learned a priori) to simultaneously estimate the state and mode of the system. The Multiple Model Adaptive Estimation (MMAE) algorithm [6] keeps a bank of estimators, one per mode, each of which computes the mode-conditioned estimate E[x_k ∣ y_{0:k}, m_k = i]; traditionally, these are implemented as Kalman filters. To handle tracking of the mode sequence, the MMAE algorithm makes the simplifying assumption that the mode is fixed but unknown at k = 0, and therefore the estimator in Eq. 2 simplifies to:
x̂_k = Σ_{i=1}^{N} E[x_k ∣ y_{0:k}, m_k = i] · Pr(m_k = i ∣ y_{0:k})   (3)
While computationally convenient, such an approximation often does not perform well when the mode evolves over time, as in Eq. 1. To address this, the Interacting Multiple Model (IMM) algorithm [1, 2] adds a mixing step to the estimator update at each time step, to account for the mode switches that may occur over time. While still an approximation, IMM estimation typically outperforms the MMAE algorithm, without considerable overhead [7]. Several other algorithms have been designed to handle such switches, a notable class of these being the generalized pseudo-Bayesian (GPB) estimators [8-10]. Finally, Cully et al. leverage an offline precomputation to enable the system to detect and compensate for failures (e.g. the loss of an actuator) [11]; while no new models are learned online, their system is still capable of adapting to novel situations at execution time. Our method is complementary to these approaches, as we introduce a mechanism for adapting the set of models, rather than keeping a fixed set for all time.
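To make this baseline concrete, the following is a minimal sketch of a single IMM-style update over a bank of mode-conditioned Kalman filters; the transition matrix Pi and the kf_predict_update helper are assumptions for illustration, not the exact implementation used in this work.

```python
import numpy as np

def imm_step(x, P, mu, Pi, u, y, kf_predict_update):
    """One IMM-style update over N mode-conditioned Kalman filters.

    x:  list of N prior state estimates (one per mode)
    P:  list of N prior covariance matrices
    mu: length-N array of prior mode probabilities
    Pi: N x N mode transition matrix, Pi[i, j] = Pr(m_k = j | m_{k-1} = i)
    kf_predict_update(x0, P0, u, y, mode) -> (x_new, P_new, likelihood)
        runs the mode-conditioned Kalman predict/update and returns the
        likelihood it assigns to the measurement y (an assumed helper).
    """
    N = len(x)
    # 1) Mixing probabilities: Pr(previous mode i | current mode j).
    c = Pi.T @ mu                               # predicted mode probabilities
    mix = (Pi * mu[:, None]) / c[None, :]

    x_new, P_new, lik = [], [], np.zeros(N)
    for j in range(N):
        # 2) Mixed initial condition for the filter matched to mode j.
        x0 = sum(mix[i, j] * x[i] for i in range(N))
        P0 = sum(mix[i, j] * (P[i] + np.outer(x[i] - x0, x[i] - x0)) for i in range(N))
        # 3) Mode-matched predict/update.
        xj, Pj, lik[j] = kf_predict_update(x0, P0, u, y, mode=j)
        x_new.append(xj)
        P_new.append(Pj)

    # 4) Mode probability update from the measurement likelihoods.
    mu_new = c * lik
    mu_new /= mu_new.sum()
    # 5) Combined output estimate.
    x_hat = sum(mu_new[j] * x_new[j] for j in range(N))
    return x_new, P_new, mu_new, x_hat
```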
Online model learning.
In Event-Triggered Learning (ETL), the algorithm learns a new model when there is a mismatch between what the existing model predicts, and what is actually measured [12]. Harris et al. introduce a method for measuring the overall performance of a control system for the purposes of indicating that a chosen controller may be insufficient for the task at hand [13]. MOSAIC [14] simultaneously learns the set of all possible models, as well as how to select the subset relevant for the current environment in a single learning-based framework. In contrast to these approaches, our method selects from an existing set of models acquired a priori. These online learning approaches, however, are complementary: a combined algorithm could determine whether to choose from this existing set of models or identify a new model online.
Learning new skills.
Within robot skill learning, some prior work focuses on how to combine existing model sets with new model sets (e.g. of how to accomplish a task). Koert et al. manage a set of skills, represented by Gaussian Mixture Models, and introduce a mechanism for adding and removing skills from this set, analogous to our procedures for adding and removing models from a model set [15]. Similarly, Maeda et al. introduce a mechanism for deciding whether to rely on previously learned skills (represented as Gaussian Process motion primitives) to accomplish a task, or to signal learning of a new skill [16]. While these approaches employ a similar idea of managing a set of models via adding, removing, and updating mechanisms, they do not apply directly to the simultaneous mode and state estimation problem that we address in this paper.
Adaptive model set.
In some cases, such as when computational resources are constrained and the number of possible modes is high, it is desirable to adapt the model set over time. Estimators that choose a model set from an existing set of models are generally referred to as Variable Structure Multiple Model (VSMM) algorithms [17, 18]; our approach resides in this category. One approach is to frame the question of how to adapt the model set as a set of statistical hypothesis tests [4, 19]. However, we would need to perform 2^N of these, which is computationally prohibitive. To address this, certain methods leverage structure in the system, such as the Model-Group Switching (MGS) algorithm [20, 21]. In many robotics settings, however, we often cannot assume such structure (e.g. sensing and actuation faults are unpredictable). Our proposed approach, in contrast, scales linearly (rather than geometrically) with N, and does not assume any structure on the mode switching dynamics.
III. The Adaptive Model Set (AMS) Algorithm
A. Overview
Our proposed Adaptive Model Set (AMS) estimator applies the IMM algorithm with the addition of a model set update via Alg. 1 at each step. We wish to use only a small subset of all possible models at each time step in order to mitigate computational expense.
When to expand the model set.
Our key idea is to expand the current model set only if the current models are insufficient to explain the sensor measurements. We implement this in line 6, which checks a threshold on the measurement likelihood: p(y_k ∣ y_{0:k−1}, m_k = i) > β. If none of the current models can explain the current measurement, then Alg. 1 expands the model set.
Majority voting.
Instead of deciding to expand the model set on the basis of a single measurement, our algorithm considers a majority vote over this decision across NV time steps (lines 6-10). For example, in Fig. 2, the algorithm expands the model set only after a majority of votes between time step k1 and k1 + NV agree that it is necessary to do so. In Fig. 2, grey indicates a vote against adding models, and red indicates a vote for adding models at that time step.
Fig. 2:
An example depicting the operation of our AMS estimator on the Kinova JACO 7 DOF manipulator system. As in an experiment in Sec. VII, the end-effector follows a position trajectory. The person immobilizes one of the robot’s joints at time step k1. By time step k1 + NV, the algorithm detects that the current model set is insufficient to explain the encoder measurements, so it expands the model set appropriately. Later, the algorithm removes the models that are no longer needed to explain the measurements. At time step k2, the person lets go of the robot, allowing the joint to move freely again.
If NV > 1, then the system may have switched modes at some point during the past NV time steps. To account for this, we enumerate all mode sequences of length NV, illustrated by the trees of mode sequences at time steps k1+NV and k2+NV in Fig. 2. Then, for each mode sequence M, we instantiate a filter to compute the state estimate at time step k conditioned on mode sequence M between time steps k – NV + 1 and k. To do so, we instantiate the filter with the state estimate at k – NV + 1 (line 14), and then for each time step up until k, we update the filter with the input and measurements that were received at that time step (lines 15-17). In order to perform these updates, we must store uk,yk, as well as some additional information about the estimates, for the last NV time steps (e.g. for a Kalman filter, we would store the previous state estimate and estimation error covariances).
While we enumerate all mode sequences of length NV, there exist several possible optimizations. For example, we could assume that the system switched modes only once during the voting period.
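As a concrete illustration of this enumeration, the sketch below instantiates one filter per mode sequence of length NV and replays the buffered inputs and measurements through it (cf. lines 14-17 of Alg. 1 as described above); make_filter and the update(u, y, mode=...) interface are assumed for illustration.

```python
from itertools import product

def enumerate_candidate_filters(models, x_init, P_init, u_hist, y_hist, make_filter):
    """Build one filter per mode sequence of length NV by replaying buffered data.

    models: iterable of all candidate mode indices (not just the active set)
    x_init, P_init: stored state estimate and covariance at time k - NV + 1
    u_hist, y_hist: the last NV control inputs and measurements
    make_filter(x0, P0) -> filter object with an update(u, y, mode=...) method
    """
    NV = len(y_hist)
    candidates = {}
    for M in product(models, repeat=NV):         # all N**NV mode sequences of length NV
        f = make_filter(x_init, P_init)
        for mode, u, y in zip(M, u_hist, y_hist):
            f.update(u, y, mode=mode)            # replay the buffered data under this mode
        candidates[M] = f
    return candidates
```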
Which models to add.
Once it has constructed all candidate filters, Alg. 1 selects the best performing models, with respect to an evaluation function q. This evaluation function is specific to the type of filter that we are using, and provides a measure of how well the filter predicts the current observed measurements. For example, if we are using Kalman filters, then q(F_M) is the likelihood the filter assigns to the observed measurement, computed from the filter’s residual (i.e. the difference between the predicted and observed measurement) under mode i, the final mode in the mode sequence M. Finally, Alg. 1 adds the best performing model (line 19), as well as all other filters F_M whose performance is close to the best with respect to q, to the new filter set M_k (lines 20-22).
Removing models.
If a model has low a posteriori probability Pr(mk–1 = i∣ y0:k–1) (denoted as pi in the pseudocode) at the previous time step k – 1, then the algorithm removes it (lines 3-5); hence, models not needed to explain the data seen thus far are not kept in the model set. While prematurely removing a model temporarily degrades the estimator performance, if that model is important, it will simply be added again at a later time step.
Algorithm 1: Updates the model set at time step k, given the latest measurement and control input.
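Since the full pseudocode listing is not reproduced here, the following is a hedged Python sketch of the update as described in Sec. III: remove low-posterior models, vote on whether the active set still explains the measurement, and, once a majority of the last NV votes call for it, score candidate filters and add the best model along with any within γ of it. The helpers build_candidate_filters and evaluate_q, and the exact bookkeeping of the votes, are illustrative assumptions; the line numbers quoted in the text refer to the original listing, not to this sketch.

```python
def ams_model_set_update(active_set, posteriors, likelihoods, votes,
                         alpha, beta, gamma, NV,
                         build_candidate_filters, evaluate_q):
    """Sketch of the AMS model set update at time step k (based on Sec. III).

    active_set:  set of mode indices currently maintained
    posteriors:  dict mode -> Pr(m_{k-1} = i | y_{0:k-1})
    likelihoods: dict mode -> p(y_k | y_{0:k-1}, m_k = i) for the active modes
    votes:       list of the most recent binary votes for expanding the set
    build_candidate_filters() -> dict mapping mode sequence -> replayed filter
        (e.g. a closure around the enumeration sketched earlier)
    evaluate_q(filter) -> score, higher meaning the filter better predicts y_k
    """
    # Remove models whose posterior probability indicates they are no longer needed.
    active_set = {i for i in active_set if posteriors[i] >= alpha}

    # Vote: does any active model explain the current measurement?
    explained = any(likelihoods[i] > beta for i in active_set)
    votes.append(not explained)
    votes[:] = votes[-NV:]                        # keep only the last NV votes

    # Expand only if a majority of the recent votes call for it.
    if len(votes) == NV and sum(votes) > NV / 2:
        candidates = build_candidate_filters()    # one filter per mode sequence
        scores = {M: evaluate_q(f) for M, f in candidates.items()}
        best = max(scores.values())
        for M, s in scores.items():
            if best - s <= gamma:                 # add all models close to the best
                active_set.add(M[-1])             # final mode of the sequence
        votes.clear()                             # reset the vote window after expanding
    return active_set, votes
```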
B. Parameters
Threshold for removing models (α).
The parameter α (line 4) determines when to remove a model, based on the a posteriori probability of that particular model best representing the system mode. In our experiments, we use α = 0.01.
Threshold for expanding the model set (β).
The parameter β is used to determine when to expand the model set (line 6), and may in fact vary over time. For example, in our experiments, since we use Kalman filters that assume Gaussian measurement likelihood distributions, we set β to be the likelihood of a measurement that is two standard deviations away from the predicted measurement.
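For instance, assuming a multivariate Gaussian innovation with covariance S, one way to realize this choice (a sketch under that assumption) is to evaluate the Gaussian density at a Mahalanobis distance of two from the predicted measurement.

```python
import numpy as np

def two_sigma_threshold(S):
    """Gaussian density at Mahalanobis distance 2 from the predicted measurement.

    S is the innovation (predicted-measurement) covariance; measurements whose
    likelihood falls below this value lie more than two standard deviations
    from what the filter predicted.
    """
    d = S.shape[0]
    peak = 1.0 / np.sqrt((2.0 * np.pi) ** d * np.linalg.det(S))
    return peak * np.exp(-0.5 * 2.0 ** 2)
```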
Threshold for adding models (γ).
The parameter γ determines how the algorithm “groups” similarly performing models (line 20). An alternative approach would add only the models whose q-value is above a fixed threshold. In the case where none of the models adequately explain the measurements with respect to this threshold, no models would be added, causing poor performance. Therefore, it is better to add the best models we have relative to one another, rather than with respect to a fixed threshold. In practice, the value of γ depends on the scale of the evaluation function q, and should typically be small.
Number of votes (NV).
The parameter NV determines the number of votes before expanding the model set. This parameter captures a trade-off between robustness (explored experimentally in Sec. V) and computational efficiency, since the number of mode sequences is on the order of N^{NV}.
IV. Performance Evaluation of AMS Estimator
We first evaluate if the AMS estimator enables high estimation accuracy, while decreasing the computational load. We evaluate our estimator on 2 simulated systems: a planar 3 DOF manipulator, and a 6 DOF manipulator, proposed for a future NASA lander mission to Europa [22] (see Fig. 3).
Fig. 3:
NASA 6 DOF Europa lander arm, tracking a position trajectory for the end-effector (the scoop). The actuator on the second joint suffers a temporary 75% degradation. Using the nominal model for controlling the manipulator leads to a trajectory that diverges from the reference, since the controller is unable to compensate for the degradation.
Simulated Behavior.
Each experiment is 3.1 s long, with a time step of 0.01 s. The system starts in the nominal mode, switches to another mode after 0.75 s, and then switches back to nominal after 1.75 s.
Nominal: Manipulator is operating normally. We apply a first-order Euler discretization in time to the manipulator equations (see [23]) to derive an equation of the form in Eq. (1).
Locked joint: A single joint is completely immobile.
Free-swinging joint: All input torque at the free-swinging joint is set to zero.
Degraded actuator: The input torque at the degraded actuator is multiplied by a scalar degradation factor in the interval (0, 1). We consider degradation factors of 0.25, 0.50, and 0.75.
Each of these modes, other than the nominal mode, can occur at each joint. Therefore, to count the total number of modes, we multiply by the number of joints. So, in these experiments, we consider 17 possible modes for the 3 DOF arm, and 32 modes for the 6 DOF arm.
Control Objective.
We consider two control objectives: the first is a jointspace tracking task, where the goal is for the manipulator to follow a time-varying, sinusoidal sequence of joint positions, velocities, and accelerations; the second is a taskspace tracking task, where the goal is for the end-effector to follow a time-varying sequence of positions and velocities in taskspace.
Control Law.
For both control objectives, we use a computed torque control (CTC) law [23]. We design the CTC input based on the model with highest a posteriori probability at each time step.
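As a sketch, a computed torque controller of the standard form in [23] could be written as below, with the inverse dynamics terms evaluated under the maximum a posteriori model; the model API (mass_matrix, coriolis_matrix, gravity_vector) is assumed for illustration.

```python
import numpy as np

def computed_torque(q, dq, q_des, dq_des, ddq_des, model, Kp, Kv):
    """Computed torque control: feedback-linearize with the selected model's
    dynamics, then apply PD feedback on the joint-space tracking error."""
    e, de = q_des - q, dq_des - dq
    # Desired joint acceleration with PD correction on the tracking error.
    a = ddq_des + Kv @ de + Kp @ e
    # Inverse dynamics under the maximum a posteriori model (illustrative API).
    M = model.mass_matrix(q)
    C = model.coriolis_matrix(q, dq)
    g = model.gravity_vector(q)
    return M @ a + C @ dq + g
```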
Independent Variables.
We manipulate whether we adapt the model set or keep it fixed; this leads to three non-adaptive baselines to compare with our adaptive estimator.
Ground-Truth (GT): A Kalman filter using the ground-truth model at each time step.
Nominal (N): A Kalman filter using the nominal internal model.
Interacting Multiple Model (IMM): The IMM algorithm described in Sec. II. Note that the model set includes all possible models at each time step.
Adaptive Model Set (AMS): Our proposed AMS estimator, described in Alg. 1. For the AMS algorithm, we also manipulate NV.
Dependent Measures.
Mode Prediction Accuracy: The percentage of time steps for which the estimator correctly predicts the system mode. Here, the estimator predicts the mode with maximum a posteriori probability.
State Estimation Error: Defined for each time step to be the norm of the difference between the actual state and the estimated state.
Position/Velocity Tracking Error: Defined as the norm of the difference between the desired position/velocity and the actual position/velocity. For the jointspace control objective, position/velocity refers to the joint angles/velocities. For the taskspace control objective, position/velocity refers to the end-effector position/velocity.
Estimator Update Time: Defined for each time step as the amount of time (in seconds) needed to update the state estimate.
All measures reported in the tables in this paper are mean values, with standard error reported in parentheses.
Hypotheses.
H1: Adapting the model set, via our AMS algorithm, performs better with respect to computation time than a non-adaptive estimator.
H2: Adapting the model set, via our AMS algorithm, performs at least as well as a non-adaptive estimator with respect to state estimation and trajectory tracking error.
Trials.
For each algorithm, we conduct 50 trials, each with a different random seed, for every combination of manipulator, true mode the system switches to, and control objective.
Analysis.
For the 3 DOF and 6 DOF systems, our AMS estimator is faster than the IMM estimator on both control objectives, supporting H1. Fig. 4 shows the update times for the 6 DOF experiments; the 3 DOF experiments follow the same trend. Our estimator also performs similarly to or better than the IMM estimator with respect to estimation and tracking errors on both control objectives, shown in Table I, in support of H2. We note that AMS outperforms IMM in mode prediction accuracy, shown in Fig. 4.
Fig. 4:
Performance comparison of an IMM estimator versus our AMS estimator for the 6 DOF arm. Error bars indicate standard deviation.
TABLE I: 3 DOF Manipulator

| | Estimation Error, Joint (rad) | Estimation Error, Task (m) | Tracking Error, Joint (rad) | Tracking Error, Task (m) |
|---|---|---|---|---|
| N | 1.178 (4.7e-3) | 0.202 (7.1e-4) | 1.008 (4.6e-3) | 0.070 (2.7e-4) |
| IMM | 0.047 (4.8e-5) | 0.052 (5.2e-5) | 0.093 (3.0e-4) | 0.019 (7.2e-5) |
| AMS | 0.045 (5.1e-5) | 0.053 (6.3e-5) | 0.082 (3.0e-4) | 0.020 (1.1e-4) |
| GT | 0.044 (4.8e-5) | 0.044 (4.7e-5) | 0.074 (3.0e-4) | 0.012 (6.3e-5) |
We observe no effect of voting for the 3 DOF manipulator, in that the performance of our estimator with and without voting is comparable; however, in the 6 DOF manipulator experiments, we see about a 10% increase in mode prediction accuracy when using our estimator, compared to using the IMM estimator. This is likely because voting is designed to help when there is ambiguity over which model best describes the system mode. These ambiguities are more prone to occur in the 6 DOF experiments, where there are 32 possible models, than in the 3 DOF experiments, where there are only 17 possible models.
We also observe that all estimators perform better on the jointspace objective than on the taskspace objective. In the jointspace objective, the manipulator follows a sinusoidal trajectory; it is likely that this trajectory more persistently excites the system when compared to the taskspace trajectory, leading to better estimator performance.
Summary.
We find that our AMS algorithm is more computationally efficient than the baselines, while not compromising on performance. In some cases our method outperforms the baselines, most notably in mode prediction accuracy. This is partly because our AMS estimator provides an additional layer of filtering: we only expand the model set when the existing models poorly explain the measurements. This prevents the maximum a posteriori probability model from switching as frequently as in the IMM estimator.
V. Robustness to Misspecified Models
Our first experiment analyzed the performance of our AMS algorithm in situations where the models perfectly describe the dynamics of the system. Next, we verify that working with an adaptive model set, rather than the full one as in the IMM algorithm, does not negatively affect accuracy in situations where the models are imperfect: where the noise model is inaccurate, where the link masses are imperfectly characterized, or where we are missing a model of the correct mode altogether. We keep the same 3 DOF manipulator system, as well as the same models, control objective, control law, trials, and dependent measures as in Sec. IV.
Independent Variables.
In addition to the estimation algorithms described in Sec. IV, we further manipulate the following three variables in separate experiments to evaluate robustness: the standard deviation of the simulated sensor noise versus the standard deviation of the sensor noise assumed by the model; whether the link masses are mismodeled (i.e. the actual simulated masses are perturbed randomly in the range [−0.2, 0.2] kg from the modeled values used by the estimator); and whether or not the estimator has access to a model of the mode to which the system switches.
Hypotheses.
H3: Our AMS estimator does not lead to further degradation in performance when faced with (a) mismodeling of the measurement noise; (b) mismodeling of the link masses; and (c) a completely unknown system mode.
Analysis.
For NV = 1, while the errors for our estimator are in some cases greater than for the IMM estimator, the difference is never more than 1 degree on average, as shown in Tables II, III, and IV. Increasing NV beyond 1 showed no effect on performance for the second and third experiments, and hence those results are not shown in the respective tables. For the first experiment, however, increasing NV to 3, 5, and 7 provided additional robustness to the mismodeling of sensor noise as measured by mode prediction accuracy, as shown in Fig. 5. Despite this, there is not necessarily an improvement in performance from the perspective of estimation and tracking errors in Table II. Furthermore, the geometric complexity in NV of the model set expansion starts to have an effect when NV increases to 5 and 7 votes. Overall, we find that the performance of our AMS estimator is similar to the performance of the IMM estimator, in support of H3.
TABLE II: 3 DOF Manipulator, Mismodeled Sensor Noise (H3 (a))

| | Estimation Error, Joint (rad) | Estimation Error, Task (m) | Tracking Error, Joint (rad) | Tracking Error, Task (m) |
|---|---|---|---|---|
| IMM | 0.090 (1.45e-4) | 0.084 (9.01e-5) | 0.137 (3.19e-4) | 0.026 (7.33e-5) |
| AMS1 | 0.095 (1.71e-4) | 0.095 (1.10e-4) | 0.135 (3.67e-4) | 0.023 (7.24e-5) |
| AMS3 | 0.086 (1.65e-4) | 0.091 (1.07e-4) | 0.146 (5.60e-4) | 0.019 (7.65e-5) |
| AMS5 | 0.090 (3.74e-4) | 0.090 (1.21e-4) | 0.180 (1.21e-3) | 0.019 (8.20e-5) |
| AMS7 | 0.096 (6.15e-4) | 0.090 (1.60e-4) | 0.199 (1.66e-3) | 0.020 (8.61e-5) |
TABLE III: 3 DOF Manipulator, Incorrect Link Masses (H3 (b))

| | Estimation Error, Joint (rad) | Estimation Error, Task (m) | Tracking Error, Joint (rad) | Tracking Error, Task (m) |
|---|---|---|---|---|
| IMM | 0.060 (1.02e-4) | 0.057 (6.39e-5) | 0.100 (3.01e-4) | 0.020 (7.16e-5) |
| AMS | 0.070 (1.01e-4) | 0.069 (8.07e-5) | 0.104 (3.02e-4) | 0.021 (6.56e-5) |
TABLE IV: 3 DOF Manipulator, Completely Unknown Mode (H3 (c))

| | Estimation Error, Joint (rad) | Estimation Error, Task (m) | Tracking Error, Joint (rad) | Tracking Error, Task (m) |
|---|---|---|---|---|
| IMM | 0.113 (3.86e-4) | 0.075 (2.14e-4) | 0.267 (9.98e-4) | 0.035 (1.67e-4) |
| AMS | 0.124 (5.16e-4) | 0.084 (2.73e-4) | 0.252 (1.03e-3) | 0.028 (1.37e-4) |
Fig. 5:
Evaluating the robustness of our AMS estimator versus an IMM estimator when sensor noise is mismodeled. The sensor noise is simulated with a larger standard deviation than the one assumed by the model (i.e. the sensor is noisier than modeled). Error bars indicate standard deviation.
Summary.
Our experiments indicate that our estimation algorithm exhibits reasonable robustness to mismodeling when compared to the IMM estimator. Furthermore, our experiments provide evidence that voting may provide robustness to mismodeling of sensor noise; however, we also observe that for NV ≥ 5, the computational advantages of our estimator compared to IMM estimation begin to diminish.
VI. Domain Generalization
So far, our experiments have been restricted to manipulator systems with mode switching caused by some change in the robot’s own internal dynamics. In fact, many interesting switching dynamics arise from changes external to the robot, such as varying environmental conditions or the changing behavior of other agents. For example, a vehicle driving over dirt experiences a change in dynamics that it must compensate for if it suddenly encounters an icy patch on the road, as shown in Fig. 1 (center). Similarly, a robot operating around a human must adapt when the person changes their intention, as shown in Fig. 1 (right). The following two experiments evaluate the performance of our AMS estimator in such situations.
Fig. 1:
We apply our adaptive estimator in three domains: (left) a manipulator moves into contact with a table; (center) the interaction dynamics between the vehicle’s wheels and the ground change when it moves from dirt to icy terrain; (right) a human switches goal locations as they move through a hallway (path shown in color), affecting a nearby robot’s motion plan (shown in grey arrows).
A. Driving on Variable Terrain
Vehicle Model.
We use a kinematic, skid-steering model of the Robotnik Summit XL robot, shown in Fig. 1. The state is the position, orientation, and linear and angular velocities of the robot with respect to the world frame. The input is a commanded velocity for each of the 4 wheels.
Modes.
For each terrain, we model the effect of friction by setting v_{k+1} = γ v_k + (1 − γ) v_{k,in}, where v_{k+1} is the vehicle velocity at the next time step k+1, v_k is the vehicle velocity at the current time step k, and v_{k,in} is the commanded velocity, which we convert from the commanded wheel velocities via the kinematics model. Finally, γ ∈ [0, 1] is a coefficient that represents the amount of friction.
Dirt. We model friction as having negligible effects, so γ = 0.
Sand. We model the friction with γ = 0.01.
Ice. We model the friction with γ = 0.001.
Shallow Mud. Here we set γ = 0; however, we model the vehicle as becoming stuck in the mud if its velocity is below 0.3 m/s.
Deep Mud. γ = 0 as in the model of shallow mud, but here the vehicle becomes stuck if its velocity is below 0.6 m/s.
Single Wheel Stuck. Here we set γ = 0; however, to model a single wheel being stuck, we simply zero out that wheel’s commanded velocity.
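The sketch below collects these terrain modes into a single velocity update; treating “stuck” as the velocity being held at zero is our reading of the mud modes, and the single-wheel-stuck mode is handled upstream by zeroing that wheel’s commanded velocity before the kinematic conversion, so it does not appear here.

```python
import numpy as np

def terrain_velocity_update(v, v_cmd, mode):
    """Next vehicle velocity v_{k+1} = gamma * v_k + (1 - gamma) * v_cmd under a
    given terrain mode (our reading of the mode descriptions in Sec. VI-A)."""
    speed = np.linalg.norm(v)
    if mode == "dirt":
        gamma = 0.0
    elif mode == "sand":
        gamma = 0.01
    elif mode == "ice":
        gamma = 0.001
    elif mode == "shallow_mud":
        if speed < 0.3:                  # stuck: assume the velocity is held at zero
            return np.zeros_like(v)
        gamma = 0.0
    elif mode == "deep_mud":
        if speed < 0.6:
            return np.zeros_like(v)
        gamma = 0.0
    else:
        raise ValueError(f"unknown terrain mode: {mode}")
    return gamma * v + (1.0 - gamma) * v_cmd
```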
Simulated Behavior.
We run each experiment for 1000 time steps, sampled every 1/30 s. The vehicle begins on the dirt terrain. After 3 m, the terrain switches to ice.
Control Objective.
We designed a 5 m straight-line reference trajectory in both position and velocity for the vehicle to track. The vehicle starts and ends at zero velocity.
Local Planner.
In order to track the reference trajectory, we employ a local planner, which performs a Dijkstra search to find a sequence of control inputs (or, local plan) that minimizes tracking error and control efforts, with respect to the model provided to it. We employ a horizon of 5 time steps with a time step of 0.25 s. At each time step, we re-run the local planner and execute the first input in the plan it returns. The local planner uses the model with maximum a posteriori probability from the estimator.
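The paper’s planner performs a Dijkstra search; as a simplified illustration of the same receding-horizon idea, the sketch below exhaustively scores short input sequences under the maximum a posteriori model and executes the first input of the best one. The cost weights and the model.step API are assumptions for illustration.

```python
from itertools import product
import numpy as np

def local_plan(x0, ref, inputs, model, horizon=5, w_track=1.0, w_effort=0.01):
    """Return the first input of the lowest-cost input sequence over the horizon.

    x0:     current state estimate
    ref:    list of reference states, one per step in the horizon
    inputs: discretized set of candidate control inputs (arrays)
    model:  object with step(x, u) -> next state, using the MAP dynamics mode
    """
    best_u, best_cost = None, np.inf
    for seq in product(inputs, repeat=horizon):
        x, cost = x0, 0.0
        for u, r in zip(seq, ref):
            x = model.step(x, u)
            # Penalize tracking error and control effort at each step.
            cost += w_track * np.linalg.norm(x - r) ** 2
            cost += w_effort * np.linalg.norm(u) ** 2
        if cost < best_cost:
            best_u, best_cost = seq[0], cost
    return best_u
```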
Independent Variables.
Same as in Sec. IV.
Dependent Measures.
We measure the mode prediction accuracy, the state estimation error, the position and velocity tracking errors, and the estimator update time, the same as in Sec. IV’s experiment.
Hypotheses.
The same as in Sec. IV’s experiment.
Trials.
For each algorithm, we conduct 20 trials, each with a different random seed.
Analysis.
The results summarized in Table V support both H1 and H2. We see that our AMS estimator outperforms the IMM and nominal estimators with respect to accuracy (both in mode prediction and state estimation) and performance (in tracking the reference trajectory).
TABLE V: Driving on Variable Terrain

| | Mode Pred. Accuracy (%) | Update Time (ms) | Track. Err. Pos. (m) | Track. Err. Vel. (m/s) |
|---|---|---|---|---|
| N | 34.0 (0.02) | 22 (0.1) | 0.41 (0.019) | 0.11 (0.003) |
| IMM | 79.0 (1.9) | 269 (1) | 1.4 (0.252) | 0.25 (0.023) |
| AMS | 99.3 (0.11) | 96 (0) | 0.03 (0.016) | 0.03 (0.002) |
| GT | 100 (0.0) | 31 (0) | 0.03 (0.015) | 0.02 (0.002) |
Summary.
We find that the AMS estimator works well when facing changes in dynamics arising from factors external to the vehicle itself, namely a change of terrain.
B. Human Motion Prediction and Robot Navigation
Human Model.
Each mode corresponds to a goal location in the environment that the human could walk to. The estimator uses a model of the human in the form of Eq. (1), in which the human moves at each time step along the heading u_{i,k}, the angle of the straight-line path from the person’s current position x_k to the i-th goal position g_i: u_{i,k} = atan2(g_{i,y} − x_{k,y}, g_{i,x} − x_{k,x}). T is the time step of the simulation. In the measurement model we have g_i(x_k) = x_k. There is no process noise, but the measurement noise has covariance R_i = diag(0.05², 0.05²). The indoor environment occupancy map is from [24], and the map and the 20 possible human goals are shown in Fig. 1.
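A small sketch of this goal-conditioned motion and measurement model; the constant walking speed v is an assumption made for illustration.

```python
import numpy as np

def human_step(x, goal, T, v=1.0):
    """Propagate the human's 2D position one step toward the given goal.

    The heading is the angle of the straight line from the current position to
    the goal; the speed v is an assumed constant walking speed.
    """
    heading = np.arctan2(goal[1] - x[1], goal[0] - x[0])
    return x + T * v * np.array([np.cos(heading), np.sin(heading)])

def human_measurement(x, rng):
    """Noisy position measurement: y = x + v, with v ~ N(0, diag(0.05^2, 0.05^2))."""
    return x + rng.normal(scale=0.05, size=2)
```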
Robot Model.
We model the robot as a 3D Dubins-car vehicle, with control inputs being the speed and steering angle. The robot has a fixed start and goal state across all trials and uses a spline-based motion planner [25]. At each time step, the robot makes a prediction of the human’s goal, taking the maximum a posteriori probability goal from the estimator. The robot subsequently predicts the human’s trajectory, simply extrapolating forward in time by applying a nominal velocity for a number of time steps into the future. We use this predicted trajectory for robot collision checking.
Simulated Behavior.
For each trial, the human starts at the same position, and follows the same mode switching sequence: goal 18 for the first 7 time steps, then goal 14 for the next 8 time steps, then goal 11 for the next 6 time steps, and finally goal 23 for the remainder of the trial (see Fig. 1). We run each trial for 38 time steps (~ 15.2 s).
Independent Variables.
Same as Sec. IV.
Dependent Measures.
We measure the mode (i.e. goal) prediction accuracy, the state estimation error, and the estimator update time, the same as in Sec. IV’s experiment. Furthermore, to capture the safety of the robot’s path, we measure the minimum distance between the robot and the human for the duration of each trial.
Hypotheses.
The same as in Sec. IV’s experiment.
Trials.
For each algorithm, we conduct 20 trials, each with a different random seed.
Analysis.
We summarize the results in Table VI. Our AMS estimator is about twice as fast to update, when compared to IMM, in support of H1. It also achieves comparable performance on all other measures when compared to the IMM estimator, in support of H2. In fact, we observe that the AMS is actually about 6% more accurate at predicting the human’s goal than IMM, consistent with our findings in the other experiments.
TABLE VI: Human Goal Prediction & Robot Navigation

| | Mode Pred. Accuracy (%) | Update Time (ms) | Safety (m) | Estimation Err. (m) |
|---|---|---|---|---|
| N | 0 (0) | 2 (0) | 0.19 (0) | 3.50 (0.04) |
| IMM | 59.5 (1.4) | 61 (1) | 0.30 (0.04) | 0.05 (0) |
| AMS | 66.0 (2.8) | 30 (1) | 0.32 (0.05) | 0.06 (0.01) |
| GT | 100 (0) | 2 (0) | 0.28 (0.01) | 0.0 (0) |
Summary.
We confirm that the AMS estimator is computationally faster than baseline estimators, and show that it can be employed in settings where the switches arise from the behavior of other agents in the environment.
VII. Evaluation in Hardware
We further evaluate our estimation algorithm in hardware experiments on a Kinova JACO 7 DOF manipulator, shown in Fig. 1. We input desired velocity commands to the manipulator, and have access to measurements of the joint positions via encoders at each of the joints, represented by the output yk+1 = xk+1 + vk+1, where vk+1 is Gaussian noise, as in Sec. II.
Modes.
Nominal. The nominal dynamics model is x_{k+1} = x_k + T u_k + w_k.
Locked joint. Let L_i be a diagonal matrix where the ith diagonal entry is 0, and all others are 1. Then the dynamics model is x_{k+1} = x_k + T L_i u_k + w_k.
End-effector contact. Let J(x_k) be the Jacobian of the end-effector position forward kinematics mapping; then the linear velocity of the end-effector at time step k is v_{e,k} = J(x_k) u_k. Furthermore, let J^#(x_k) be the Moore-Penrose pseudoinverse of J(x_k). Consider a contact with surface normal n̂. Then the dynamics model is x_{k+1} = x_k + T u_{0,k} + w_k, where
u_{0,k} = u_k − J^#(x_k) n̂ n̂^⊤ v_{e,k}   (4)
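A sketch of how the locked-joint and contact modes could be written in code; the contact projection mirrors the reconstruction of Eq. (4) above (removing the commanded end-effector velocity along the surface normal) and should be read as one plausible form, with jacobian(x) an assumed helper.

```python
import numpy as np

def locked_joint_step(x, u, T, i, w):
    """Locked joint i: zero out the i-th commanded joint velocity via L_i."""
    L = np.eye(len(u))
    L[i, i] = 0.0
    return x + T * (L @ u) + w

def contact_step(x, u, T, n_hat, jacobian, w):
    """End-effector contact: remove the component of the commanded end-effector
    velocity along the surface normal n_hat (one plausible reading of Eq. (4))."""
    J = jacobian(x)                       # end-effector position Jacobian
    J_pinv = np.linalg.pinv(J)            # Moore-Penrose pseudoinverse
    v_e = J @ u                           # commanded end-effector linear velocity
    u0 = u - J_pinv @ (np.outer(n_hat, n_hat) @ v_e)
    return x + T * u0 + w
```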
Control Objective.
In the first experiment, the JACO follows an end-effector position trajectory. In the second experiment, the JACO follows an end-effector pose (position and orientation) trajectory, sliding a grasped object across a table. In the third experiment, the JACO follows an end-effector position trajectory, specified by a planner whose task is to have the end-effector reach two position goals in the robot’s workspace, A and B.
Control Law.
Let X_k and X_k^d be the homogeneous transformation matrices representing the actual and desired end-effector poses with respect to a common inertial reference frame, respectively. Let V_{e,k} be the error twist, with matrix representation given by [V_{e,k}] = log(X_k^{−1} X_k^d). Then the tracking control law is u_k = J^#(x_k) K_p V_{e,k}, where K_p is a positive-definite diagonal proportional gain matrix and J^#(x_k) is the damped least-squares pseudoinverse of the body manipulator Jacobian.
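A minimal sketch of the damped least-squares pseudoinverse used in this control law; the damping constant is illustrative.

```python
import numpy as np

def damped_pinv(J, damping=0.01):
    """Damped least-squares pseudoinverse: J^T (J J^T + lambda^2 I)^(-1)."""
    m = J.shape[0]
    return J.T @ np.linalg.solve(J @ J.T + (damping ** 2) * np.eye(m), np.eye(m))
```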
Procedure.
For the first and third experiments, we simulated a locked joint by physically holding that joint in place. For the second experiment, we set the reference trajectory to pass through the table, to ensure contact. In all experiments, we use our AMS algorithm for state estimation.
Analysis.
In the first and third experiments, we find that our estimator is able to reliably identify the locked joint in about 1 s. Once the robot stopped at the end of the reference trajectory, we noticed that the estimator assigned about 50% probability to both the nominal and locked modes; this is expected, since it is not possible to distinguish between these modes when no velocity is commanded at the joint. In the second experiment, we found that our estimator took longer to identify the contact, between 3 and 5 s. This is partially due to certain unmodeled factors occurring during contact, including frictional effects such as stiction, as well as motion of the object in the gripper.
Summary.
Here, we showed the applicability of AMS to predict mode changes on a hardware manipulator system.
VIII. Discussion
Summary.
Robotic systems frequently operate under changing dynamics; to do so effectively requires a good estimator for both the continuous state and the dynamics mode. In this work, we propose such an estimator, which adapts its set of models at each time step via a novel algorithm. We provide a thorough experimental evaluation in simulation and hardware to demonstrate the algorithm’s effectiveness for state estimation under changing dynamics, including actuation faults, driving over various terrain, and navigating around a human with uncertain and changing intent.
Limitations and Future Work.
In future work, we plan to explore “active” algorithms that design control inputs to improve estimates and combine online parameter adaptation with our adaptive model set algorithm. Furthermore, our method is limited to a set of a priori specified models; however, adaptive estimators are amenable to learning models online. Finally, we plan to apply our AMS algorithm to discrete, non-Gaussian settings.
Supplementary Material
Acknowledgement
We thank Arno Rogg for his helpful input on robotic manipulator faults.
This paper was recommended for publication by Editor Dana Kulic upon evaluation of the Associate Editor and Reviewers’ comments. This work was supported by a NASA Space Technology Research Fellowship and an NSF CAREER award.
References
- [1] Blom HAP. “An efficient filter for abruptly changing systems”. The 23rd IEEE Conference on Decision and Control. December 1984.
- [2] Blom HAP and Bar-Shalom Y. “The interacting multiple model algorithm for systems with Markovian switching coefficients”. IEEE Transactions on Automatic Control (1988).
- [3] Hofbaur M. Hybrid Estimation of Complex Systems. Vol. 319. Lecture Notes in Control and Information Sciences. Germany: Springer Verlag, 2005.
- [4] Rong Li X. “Multiple-model estimation with variable structure. II. Model-set adaptation”. IEEE Transactions on Automatic Control 45.11 (November 2000).
- [5] Li XR. “Hybrid Estimation Techniques”. Stochastic Digital Control System Techniques. Ed. by Leondes CT. Vol. 76. Control and Dynamic Systems. Academic Press, 1996.
- [6] Maybeck P. Stochastic Models, Estimation, and Control. Elsevier Science, 1982.
- [7] Hwang I, Balakrishnan H, and Tomlin C. “Performance analysis of hybrid estimation algorithms”. 42nd IEEE Conference on Decision and Control. Vol. 5. December 2003.
- [8] Ackerson G and Fu K. “On state estimation in switching environments”. IEEE Transactions on Automatic Control 15.1 (February 1970).
- [9] Chang CB and Athans M. “State Estimation for Discrete Systems with Switching Parameters”. IEEE Transactions on Aerospace and Electronic Systems AES-14.3 (May 1978).
- [10] Jaffer AG and Gupta SC. “On estimation of discrete processes under multiplicative and additive noise conditions”. Information Sciences 3.3 (1971).
- [11] Cully A, Clune J, Tarapore D, and Mouret J-B. “Robots that can adapt like animals”. Nature 521.7553 (2015).
- [12] Solowjow F and Trimpe S. “Event-triggered Learning”. Automatica 117 (July 2020).
- [13] Harris TJ, Boudreau F, and Macgregor JF. “Performance assessment of multivariable feedback controllers”. Automatica 32.11 (1996).
- [14] Haruno M, Wolpert DM, and Kawato M. “Mosaic model for sensorimotor learning and control”. Neural Computation 13.10 (2001).
- [15] Koert D, Trick S, Ewerton M, et al. “Online learning of an open-ended skill library for collaborative tasks”. 2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids). IEEE, 2018.
- [16] Maeda G, Ewerton M, Osa T, et al. “Active incremental learning of robot movement primitives”. 2017.
- [17] Li X-R and Bar-Shalom Y. “Multiple-model estimation with variable structure”. IEEE Transactions on Automatic Control 41.4 (April 1996).
- [18] Li XR and Jilkov VP. “Survey of maneuvering target tracking. Part V. Multiple-model methods”. IEEE Transactions on Aerospace and Electronic Systems 41.4 (2005).
- [19] Li X-R, Bar-Shalom Y, and Blair WD. “Engineer’s guide to variable-structure multiple-model estimation for tracking”. Multitarget-Multisensor Tracking: Applications and Advances. Vol. 3 (2000).
- [20] Li XR, Zhi X, and Zhang Y. “Multiple-model estimation with variable structure. III. Model-group switching algorithm”. IEEE Transactions on Aerospace and Electronic Systems 35.1 (1999).
- [21] Li XR, Zhang Y, and Zhi X. “Multiple-model estimation with variable structure. IV. Design and evaluation of model-group switching algorithm”. IEEE Transactions on Aerospace and Electronic Systems 35.1 (1999).
- [22] Hand K, Murray A, Garvin J, et al. “Europa Lander Mission: Europa Lander Study 2016 Report”. NASA Tech. Rep. JPL D-97667 (2016).
- [23] Murray RM, Li Z, and Sastry SS. A Mathematical Introduction to Robotic Manipulation. CRC Press, 1994.
- [24] van Opdenbosch D, Schroth G, Huitl R, et al. “Camera-based Indoor Positioning using Scalable Streaming of Compressed Binary Image Signatures”. IEEE International Conference on Image Processing (ICIP 2014). 2014.
- [25] Walambe R, Agarwal N, Kale S, and Joshi V. “Optimal trajectory generation for car-type mobile robot using spline interpolation”. IFAC-PapersOnLine 49.1 (2016).