Exploiting short-term memory in soft body dynamics as a computational resource

K Nakajima; T Li; H Hauser; R Pfeifer

doi:10.1098/rsif.2014.0437

2014 Nov 6;11(100):20140437. doi: 10.1098/rsif.2014.0437

PMCID: PMC4191087 PMID: 25185579

Abstract

Soft materials are not only highly deformable, but they also possess rich and diverse body dynamics. Soft body dynamics exhibit a variety of properties, including nonlinearity, elasticity and potentially infinitely many degrees of freedom. Here, we demonstrate that such soft body dynamics can be employed to conduct certain types of computation. Using body dynamics generated from a soft silicone arm, we show that they can be exploited to emulate functions that require memory and to embed robust closed-loop control into the arm. Our results suggest that soft body dynamics have a short-term memory and can serve as a computational resource. This finding paves the way towards exploiting passive body dynamics for control of a large class of underactuated systems.

Keywords: soft robots, physical reservoir computing, morphological computation, octopus

1. Introduction

In recent years, soft materials have been increasingly used to incorporate flexible elements into robots' bodies. The resulting machines, called soft robots, have significant advantages over traditional articulated robots owing to deformable morphology and safety in interaction [1]. They can adapt their morphology to unstructured environments, and carry and touch fragile objects without causing damage, which makes them applicable for rescue and human interactions, in particular care for the elderly, prosthetics and wearables [2,3]. In addition, they can generate diverse behaviours with simple types of actuation by partially outsourcing control to the morphological and material properties of their soft bodies [4], which is made possible by the tight coupling between control, body and environment [5,6]. In this paper, we build on these perspectives and add a novel advantage of soft bodies, demonstrating that they can be exploited as computational resources.

One of the major differences between rigid and soft bodies can be found in their body dynamics. Soft body dynamics usually exhibit a variety of properties, including nonlinearity, elasticity and potentially infinitely many degrees of freedom, which are difficult to reduce to lower dimensionality. In particular, the number of degrees of freedom is often larger than the number of actuators, which leads to a typical underactuated system [7], and this makes the soft body difficult to control with conventional frameworks. Here, we demonstrate that these properties can, in fact, be highly beneficial in that they can be employed for computation. Our approach is based on a machine learning technique called reservoir computing, which has a particular focus on real-time computing of time-varying input and provides an alternative to computational frameworks based on Turing machines [8–11]. By driving a high-dimensional dynamical system, typically referred to as the reservoir, with a low-dimensional input stream, transient dynamics are generated that operate as a type of temporal and finite kernel that facilitates the separation of input states [10,12]. If the dynamics involve enough nonlinearity and memory, emulating complex nonlinear dynamical systems only requires adding a linear, static readout from the high-dimensional state space of the reservoir. A number of different implementations for reservoirs have been proposed: for example, abstract dynamical systems for echo state networks (ESNs) [8,9], or models of neurons for liquid state machines [10]. Implementations even include using the surface of water in a laminar state [13]. Lately, it has been demonstrated that nonlinear mass spring systems have the potential to serve as reservoirs as well [14,15], and this has been applied in a number of ways [16–18].

In this study, we establish a simple but powerful physical platform with a soft silicone arm and demonstrate, through a number of experiments, that the soft body dynamics can be used as a reservoir. In particular, we focus on the property of short-term memory [19–21], which is the ability to store information about recent input sequences in the transient dynamics of the reservoir. In neuroscience, this property has drawn attention as a mechanism to perform real-time computations on sensory input streams [22,23], which is a prerequisite for cognitive phenomena, such as planning and decision-making. We show that short-term memory also exists in the body dynamics of a soft silicone arm and, in particular, that it can be exploited to control the arm's motions robustly in a closed-loop manner. In other words, the seemingly undesirable properties of soft body dynamics are no longer drawbacks for control but constitute core aspects of the system's functionality.

2. Material and methods

2.1. A soft silicone arm as a computational resource

There have been several soft silicone arms proposed in the literature, which are inspired by the octopus [24–26]. In this paper, we use a soft silicone arm, which has a similar material characteristic to the one proposed in [24]. The platform consists of a soft silicone arm, its sensing and actuation systems, data processing via a PC, and a water tank containing fresh water as an underwater environment (figure 1). By rotating the base of the arm and generating body dynamics induced by the interaction between the underwater environment and the soft silicone material, we aim to show that the sensory timeseries that are reflected in the body dynamics can be exploited as part of a computational device. The unit of timestep t used in this study is a sensing and actuation loop of the PC (this is approx. 0.03 s in physical time). Throughout this study, we observe the behaviour of the system from one side of the tank and use terminology, such as ‘left’ or ‘right’, with respect to this point of view.

The arm embeds 10 bend sensors within the silicone material (figure 1a and see also electronic supplementary material, figure S1). A bend sensor gives a base value when it is straight. If it bends towards the ventral side, the sensor value is smaller, and if it bends towards the dorsal side, the value is larger; the change in value reflects the degree of bend in each case. The sensors are embedded near the surface of the arm, with their ventral sides directed outwards. We numbered these sensors from the base towards the tip as s1–s10. The sensors are embedded alternately, with odd-numbered sensors on the right side of the arm and even-numbered sensors on the left side (figure 1a and electronic supplementary material, figure S1). The base of the arm can rotate left and right through the actuation of a servo motor. The motor commands sent from the PC are binary values, M = {0, 1}. If the command is 0 or 1, the motor is controlled to move from its current position towards the maximum right position (L_right) or the maximum left position (L_left), respectively (figure 1b). The actual servo motor positions are also sent to the PC to monitor the current position of the base rotation θ(t). The positions L_right and L_left were heuristically determined to avoid damaging the motor components. The magnitudes of L_right and L_left are about 46.4°, with the origin of the rotation angle (0°) set where the arm is aligned vertically to the water surface. Throughout this study, θ(t) is linearly normalized to be in the range from 0 to 1. Note that the motor command does not always take the roller position to L_right or L_left; rather, it decides the motor movement direction for each timestep. In addition, if the command is 0 or 1 when the current position is already L_right or L_left, respectively, then the position stays unchanged.
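
The command semantics described above (a binary command sets the direction of motion for one timestep, and the base saturates at either end) can be illustrated with a minimal Python sketch. This is not the authors' code. The function names `step_base` and `simulate_base` are hypothetical, the constant step of 1/9 per timestep is an assumption chosen so that a full sweep takes about nine timesteps, and θ = 0 is taken to correspond to L_right and θ = 1 to L_left.

```python
def step_base(theta, command, step=1.0 / 9.0):
    """Advance the normalized base angle theta by one timestep.

    command 0 moves towards L_right (assumed theta = 0),
    command 1 moves towards L_left (assumed theta = 1).
    The constant step size is an assumption, not a measured value.
    """
    if command not in (0, 1):
        raise ValueError("motor command must be 0 or 1")
    direction = 1.0 if command == 1 else -1.0
    # Clamp so that the base stays put once it has reached an end position.
    return min(1.0, max(0.0, theta + direction * step))


def simulate_base(commands, theta0=0.0, step=1.0 / 9.0):
    """Return the sequence of base angles produced by a command sequence."""
    thetas = []
    theta = theta0
    for m in commands:
        theta = step_base(theta, m, step)
        thetas.append(theta)
    return thetas
```

For example, `simulate_base([1] * 12)` rises from the right end position to the left one in about nine steps and then stays there, which mirrors the saturation behaviour described in the text.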

To exploit the soft silicone arm as a computational resource, we need to determine how to provide inputs I(t) to the system and how to generate corresponding outputs O(t). In this paper, we provide the input to the motor command, m(t)∈ M, and the output is generated by linearly combining all sensory timeseries s_i(t) (i = 1, 2, … , 10) with a weighted sum using the weights w_i (i = 1, 2, … ,10) (figure 2). In addition, a bias is added, which is expressed as b = w₀s₀(t), where s₀(t) is a constant value set to 1. As a result, we have 11 pairs of weights and corresponding sensory timeseries (w_i, s_i(t)) (i = 0, 1, … , 10) in our system. Our system output takes a binary state, O(t) ∈ {0, 1}, which is obtained by thresholding the weighted sum of the sensory values (see the electronic supplementary material for details). To emulate a desired function with our system, we first apply the inputs to the system, which then generate the arm motions, and we collect the corresponding sensory timeseries. Together with the target outputs, we have a training dataset for supervised learning. The linear readout weights are then optimized with simple logistic regression with respect to minimizing the error between the system output and the target output. The performance of the system output is evaluated by comparing with the target output for a new experimental trial (see the electronic supplementary material for details of the training procedures and the logistic regression).
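
As an illustration of the readout described above, the following Python sketch trains 11 weights (one bias plus 10 sensor weights) with logistic regression and produces a binary output by thresholding the weighted sum. It is a sketch under stated assumptions, not the authors' implementation: the training details are given only in the electronic supplementary material, so the plain gradient-descent optimizer, the learning rate, the number of epochs, the threshold at zero and the names `add_bias`, `train_readout` and `readout` are all assumptions made for this example.

```python
import numpy as np


def add_bias(S):
    """Prepend the constant s_0(t) = 1 column to a (T, 10) sensor matrix."""
    S = np.asarray(S, dtype=float)
    return np.hstack([np.ones((S.shape[0], 1)), S])


def train_readout(S, target, lr=0.1, epochs=2000):
    """Fit the weights w_0..w_10 by gradient descent on the logistic loss.

    The optimizer and its hyperparameters are assumptions for illustration.
    """
    X = add_bias(S)
    y = np.asarray(target, dtype=float)
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))  # predicted probability of output 1
        w -= lr * X.T @ (p - y) / len(y)    # gradient of the mean logistic loss
    return w


def readout(S, w):
    """Binary output O(t): threshold the weighted sum of the sensor values.

    A threshold at zero corresponds to a predicted probability of 0.5.
    """
    return (add_bias(S) @ w > 0.0).astype(int)
```

Because the weights are fixed after training and the readout acts only on the sensor values at the current timestep, any memory visible in the output has to come from the sensor timeseries itself.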

Figure 2. — Schematic showing the information processing scheme using the arm. Input is provided to the motor command to generate arm motion, and the embedded bend sensors reflect the resulting body dynamics. By using the detected sensory timeseries, the binary state output is generated by thresholding the weighted sum of the sensory values. See the main text for details.

We used three tasks to evaluate the computational power of our soft silicone arm with the focus on the property of short-term memory. Unlike a conventional computer, our system does not contain explicit memory storage; instead, the memory is expected to be implicitly included in the transient dynamics of the soft body. By assigning a task to the system that requires memory to be carried out and by evaluating its performance, we can characterize its memory capacity.

Our first task is to construct a timer exploiting the soft body dynamics. Triggered by a cue sent at certain timesteps, the arm starts to move from L_right to L_left. The system should output a pulse of predefined length by exploiting the body dynamics. To perform this task, the system has to be able to ‘recognize’ the duration of time that has passed since the cue was launched. This clearly requires memory. By increasing the desired pulse response, we systematically investigate the limits of the physical system to represent memory in its transient body dynamics.

The second task is to perform a closed-loop control exploiting the soft body dynamics. With a periodic square wave function, which switches its motor command from 0 to 1 and from 1 to 0 with a fixed period as a target function, we aim to evaluate the maximal length for the period of the square function that can be embedded in the system. In this task, the system should ‘recognize’ how much time has passed since the motor command switched from 0 to 1 (or from 1 to 0), and it should decide when to switch the motor command to the next position. Again, this task requires memory. Furthermore, this task also evaluates whether the soft body dynamics can be exploited as a computational resource to control the arm's own motion. This is especially interesting as typically the complex dynamics of a soft body are the main obstacles to applying a classic control theoretic approach. Remarkably, in our proposed context, this property is beneficial because it can be exploited as a computational resource.

The third task is an emulation task of functions that require memory. A random binary input sequence is provided to the system, and by exploiting the generated soft body dynamics, the system should emulate two functions simultaneously: the first one is a function that reproduces past inputs with a given delay, and the second one is the N-bit parity checker. Emulations of these functions are commonly used as benchmark tasks to characterize the computational power of the system, and again, both functions require memory. In particular, these functions should be emulated using the same soft body dynamics at the same time, which points to another remarkable property of the approach (typically referred to as multitasking [14]).

In all three tasks, we are adjusting only the linear readout weights, which are fixed after learning, i.e. no memory is present in the readout. Hence, we can confirm that the required memory is purely owing to the property of the soft silicone arm. Unlike conventional computational units (e.g. artificial neural networks), our proposed set-up has a constraint owing to the specifications of the mechanical structure of the system, because inputs are transformed to the mechanical realm. For example, a drastic and frequent switching of the motor command can result in motor overheat and a total stop. We defined the presented tasks to evaluate the memory capacity of our system by taking these physical constraints into consideration. Accordingly, the input/output (I/O) setting in our system slightly differs in each task (see the electronic supplementary material for detailed information on the I/O setting for each task).

2.2. Dynamic property of the silicone arm

We here present the basic property of our arm motion and the step response. Figure 3a shows a typical arm motion when the motor command is switched from 0 to 1. The arm is initially set to L_right, and at t = 0 it starts to move towards L_left. The silicone arm shows characteristic body dynamics because of the interaction with the water (see electronic supplementary material, video S1). In particular, even when the base reaches the position of L_left, the entire arm still shows transient dynamics. Figure 3 clearly shows that because the arm moves from right to left, the right side of the arm bends and the left side of the arm arches according to the water friction.

The dynamic behaviour of the arm can be captured by the responses of the sensors (figure 3b and electronic supplementary material, video S1). When the motor command switches from 0 to 1, θ(t) takes about nine timesteps to reach θ(t) = 1, which forms a physical constraint based on the motor and the mechanical structure of our platform (figure 3b, upper plot and electronic supplementary material, video S1). When the motor command is switched from 0 to 1, all the odd-numbered sensors start to show smaller values than those shown before the motion generation. They take the local minimum at a different timestep, then gradually approach their resting states (figure 3b, middle plot and electronic supplementary material, video S1). Because the arm is passive, the movement of the base rotation propagates from the base towards the tip at a certain velocity. For example, s1 seems to show a direct reflection of the motor actuation because it is embedded close to the base. This effect can be confirmed by checking the local minimum of the sensory response of s1 at around timestep 9, which is the same timestep at which the motor rotation stops. For even-numbered sensors, although all sensors show larger values than the values before the motion generation, some sensors (e.g. s6, s8 and s10) show a smaller value in some timesteps owing to inertia caused by the immediate bend in the left side of the arm (figure 3b, lower plot and electronic supplementary material, video S1). This effect also seems to be propagating from the base towards the tip of the arm. All sensors reach a resting state at around 40 timesteps. In the resting state L_left, the odd-numbered sensors show smaller values, and the even-numbered sensors show greater values than those shown before motion generation (figure 3b and electronic supplementary material, video S1). This phenomenon is the result of gravity; the left side of the arm arches slightly, whereas the right side of the arm bends slightly (figure 3a). 
When the motor command is switched from 1 to 0 with the arm position initially set to L_left, we can observe a similar behaviour with switched roles of the odd- and even-numbered sensors.

3. Results

3.1. Timer task

Our first task is to emulate the function of a timer exploiting the body dynamics of the arm. The task has been chosen as it enables us to investigate systematically the memory inherently present in the soft body dynamics. One of the characteristic properties of our soft body is its transient dynamics during its motion from one state to another, e.g. moving from right to left. In this task, the arm is initially set to L_right and kept at this position. Triggered by the input at t_start, the motor command switches from 0 to 1, when the rotation of the base generates the body dynamics (figure 3a). The timer task consists of producing an output pulse starting from τ_ini timesteps after t_start, which is τ_timer timesteps in length, by exploiting the body dynamics during this transient single motion (see the electronic supplementary material, figure S2, for details). To perform this task, the system has to have a certain amount of memory. In other words, we can evaluate whether the sensory timeseries that reflects the transient dynamics during the motion from L_right to L_left contains sufficient information to recognize the duration of time since the trigger event by applying this task. A similar task was introduced in [8] to demonstrate the existence of short-term memory within an artificial recurrent neural network (e.g. ESN) [9,19]. To demonstrate that such a memory can be found and exploited in a real physical system, we applied this task employing the soft silicone arm. As explained earlier, our system output is generated by thresholding the weighted sum of the sensory values, and the weights are optimized with a simple logistic regression by using a dataset collected in the training phase (see the electronic supplementary material for details). We performed this experiment by varying τ_ini and τ_timer to investigate the relevance of these parameters to the system performance.
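
The input and target signals of the timer task can be sketched as follows in Python. This is an illustrative reading of the description above, not the authors' code: the names `timer_command` and `timer_target` are hypothetical, and the choice that the pulse occupies exactly the timesteps t_start + τ_ini to t_start + τ_ini + τ_timer − 1 is an assumption (the precise convention is given in the electronic supplementary material, figure S2).

```python
import numpy as np


def timer_command(length, t_start):
    """Motor command: 0 (hold at L_right) before t_start, 1 from t_start on."""
    command = np.zeros(length, dtype=int)
    command[t_start:] = 1
    return command


def timer_target(length, t_start, tau_ini, tau_timer):
    """Target output: a pulse of tau_timer timesteps that begins
    tau_ini timesteps after the cue at t_start (assumed indexing)."""
    target = np.zeros(length, dtype=int)
    begin = t_start + tau_ini
    end = min(begin + tau_timer, length)  # truncate a pulse that overruns the trial
    target[begin:end] = 1
    return target
```

With `timer_target(30, 5, 9, 4)`, for instance, the target is 1 on timesteps 14 to 17 and 0 elsewhere.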

Figure 3c shows examples of the averaged system outputs for each τ_timer with τ_ini fixed to 9. As one can see, our system is able to emulate a timer with given duration times τ_timer (see also the electronic supplementary material, video S1). Naturally, the performance decreases when increasing the length of τ_timer. This is caused by the gradual fading of memory within the body dynamics after the initiation of motion generation. This tendency can be found for different settings of τ_ini (figure 3d). As can be seen in figure 3d, the error values (mean-squared error (MSE)) are especially low when around τ_ini < 20 and τ_timer < 20, characterizing the amount of memory that can be exploited with the given soft body. Note that when τ_ini is close to 0, the error values are higher than for other parameters. This is because when the arm starts to move, the effect of the motor rotation takes some time to propagate owing to the softness of the arm (figure 3a,b), and if τ_ini is small, it is difficult to distinguish the sensory values from the values when the arm is stopped.

3.2. Closed-loop control task

We demonstrated in the previous task that we can use the sensory timeseries generated by the transient dynamics to construct a timer. By using the same property, in this second task, we aim to realize a closed-loop control of our soft silicone arm. That is, we aim to demonstrate that the arm's body dynamics can be used to control its own motion. The target motor command sequence is a square wave in which the amplitude alternates at a steady frequency, between m(t) = 0 and 1, with the same duration of timesteps, τ_square (see electronic supplementary material, figure S3a, for details). Similar to the process in the previous task, when the motor command is switched from 0 to 1 (or from 1 to 0), it should recognize the time length of τ_square timesteps and switch the motor command from 1 to 0 (or from 0 to 1). Thus, it requires memory to fulfil this task. Recently, similar types of oscillatory motor command have been used to demonstrate the octopus-inspired swimming motion, called sculling, in a physical platform with an open-loop manner [27]. We aim to emulate this oscillatory wave pattern by using the sensory timeseries from the soft body and close the loop. This is realized by feeding back the system output generated by thresholding the weighted sum of the sensory timeseries as the next motor command to the system (see electronic supplementary material, figure S3b, for details). As with the previous task, we aim to emulate the target output only by adjusting the static linear readout weights.
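
The target signal and the feedback loop described above can be sketched in Python as follows. This is a schematic under stated assumptions rather than the authors' procedure: `square_wave`, `teacher_forced_pairs` and `run_closed_loop` are hypothetical names, the callables `read_sensors` and `readout` stand in for the physical arm and the trained linear readout, and pairing each command with the next command as the training target is an assumption about how the loop is trained.

```python
import numpy as np


def square_wave(length, tau_square, first=0):
    """Binary square wave that holds each value for tau_square timesteps."""
    t = np.arange(length)
    return (first + t // tau_square) % 2


def teacher_forced_pairs(commands):
    """Open-loop training pairs: the command at t and the desired next command."""
    commands = np.asarray(commands)
    return commands[:-1], commands[1:]


def run_closed_loop(read_sensors, readout, m0, steps):
    """Feed the thresholded readout back as the next motor command.

    read_sensors(m): hypothetical callable that applies command m to the arm
                     and returns the current sensor values.
    readout(s):      hypothetical callable that maps sensor values to 0 or 1.
    """
    m = m0
    history = []
    for _ in range(steps):
        s = read_sensors(m)
        m = readout(s)  # the system output becomes the next motor command
        history.append(m)
    return history
```

For example, `square_wave(8, 2)` gives the sequence 0, 0, 1, 1, 0, 0, 1, 1.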

Figure 4a shows an example of a timeseries with the motor commands and sensory values when the system is driven by the closed-loop control emulating a square function with τ_square = 10. The timeseries of the motor command exactly overlaps with the target output, showing that the closed-loop control is successfully embedded (see also electronic supplementary material, video S2). For real-world applications, it is important to investigate whether the system is robust against external perturbation. We investigated the robustness of the system by applying a manual mechanical perturbation disturbing the arm motion (figure 4b,c and electronic supplementary material, video S3). We found that during the perturbation both the sensory timeseries and the system output were affected; however, after removing the disturbance, the system was able to immediately recover its original trajectory (electronic supplementary material, video S3). This can be confirmed by checking the timeseries of the motor commands and their corresponding sensory values, and it implies that our system is robust against external perturbations (figure 4b). Note that although the system output shows a phase shift compared with the target output after the perturbation, it still generates a square function with the required length of τ_square.

To evaluate the maximal length of τ_square of a square function that our system can embed, we investigated an average system output for one period of a square function by clamping the feedback loop from the system output and providing the target output as input for each τ_square (figure 4d and see the electronic supplementary material for details). If the system is driven by the closed-loop control, the error in the system output would propagate to the motor command through the feedback loop, which makes it difficult to evaluate the limitation of the system performance efficiently. In figure 4e, according to the increase of τ_square, the average system output starts to deviate largely from the target output. By calculating the system error by means of the MSE in this setting, we found that the error grows immediately when τ_square becomes larger than 18 (figure 4e). Consistent with this result, we observed that when τ_square is more than 18, the system cannot embed a correct square function anymore, or it simply stops, continuously providing 0 or 1 as output. Thus, we can speculate that our system possesses enough memory to be exploited for embedding a square function up to a length of around τ_square = 18.

3.3. Function emulation tasks

In this final task, we aim to quantitatively characterize the intrinsic computational capacity of our system, particularly focusing on its memory capacity. By providing a random binary sequence to the motor command as input, the system should perform function emulation tasks using the resulting sensory timeseries. Because our system is not an abstract computational unit but has physical and mechanical constraints, we need to define a certain duration of time for one input state or symbol. We call this duration of time τ_state. We found that when a random binary sequence is provided as motor commands in the form of τ_state < 5, the motor overheats and stops. Accordingly, we performed our experiments with τ_state ≥ 5. In addition, we introduced a different timescale for I/O, defined as t’, which takes one input symbol as a unit. This means that t’ is increased by increments of 1 for each τ_state timestep (see the electronic supplementary material, figure S4a, for details).
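
The relation between the two timescales can be made concrete with a short Python sketch. It is illustrative only: the names `symbols_to_commands` and `symbol_index` are hypothetical, and the guard against τ_state < 5 simply mirrors the overheating limit reported above for this particular platform.

```python
import numpy as np


def symbols_to_commands(symbols, tau_state):
    """Hold each input symbol for tau_state timesteps of motor command."""
    if tau_state < 5:
        # Mirrors the limit reported for this platform (motor overheating).
        raise ValueError("tau_state must be at least 5 on this platform")
    return np.repeat(np.asarray(symbols, dtype=int), tau_state)


def symbol_index(t, tau_state):
    """Symbol time t' that contains physical timestep t."""
    return t // tau_state
```

For example, with τ_state = 5 the symbols 1, 0 become five timesteps of command 1 followed by five timesteps of command 0, and timestep t = 9 belongs to symbol t’ = 1.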

The first function we aim to emulate is one that provides a delayed version of the input, i.e. I(t’ − n) (n = 1, 2, … ) (see the electronic supplementary material for details). This task enables the direct evaluation of whether the system contains memory traces of a past input within the current sensory values, and is frequently used to evaluate the memory capacity of dynamical systems [19–21]. For descriptive purposes, we call this the short-term memory task. The second function we aim to emulate is the N-bit parity checker. The output should provide 0 if Σ_{m=0}^{n} I(t’ − m) is an even number; otherwise, it should provide 1, with n = 1, 2, … (see the electronic supplementary material for details). Note that it is actually a ‘(n + 1)-bit parity checker’ in our case. According to the definition, the system needs to retain a memory of the previous n input symbols to emulate this function. In addition, this function is a nonlinear function, which maps the input to a linearly inseparable state [28]. Because we are adjusting only the static linear weights externally, we can evaluate whether the system contains memory and nonlinearity to be exploited. This task is also common in the evaluation of the computational capacity of dynamical systems [29,30]. Along with the definition of the input symbol, we also need to determine how to define a corresponding sensory timeseries. Let us assume that an input symbol was provided at timestep t (= t’τ_state). As a result, the arm generates corresponding transient dynamics until the next input symbol is provided at timestep (t’ + 1)τ_state. We define sensory values at (t’ + 1)τ_state − 1 as corresponding values s_i(t’) for this input symbol, which is one timestep before the next input symbol is provided (see the electronic supplementary material, figure S4a, for details).
By providing random binary input sequences to the system over several trials for each parameter τ_state and n, we collected the sensory timeseries used for training. In the evaluation, both target functions are simultaneously emulated over a previously unseen random input sequence (see the electronic supplementary material, figure S4b, for details).
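
The two target functions and the sampling of the sensory values can be sketched in Python as follows. This is an illustration, not the authors' code. The parity is taken over the current symbol and the previous n symbols, which is how the '(n + 1)-bit' remark is read here; the names `delay_target`, `parity_target` and `sample_sensors` are hypothetical, and marking undefined initial entries with −1 is a convention chosen for this example.

```python
import numpy as np


def delay_target(symbols, n):
    """Target I(t' - n). The first n entries are undefined and set to -1."""
    x = np.asarray(symbols, dtype=int)
    out = np.full(len(x), -1)
    out[n:] = x[:len(x) - n]
    return out


def parity_target(symbols, n):
    """Target 0 if I(t') + I(t' - 1) + ... + I(t' - n) is even, else 1.

    The first n entries are undefined and set to -1.
    """
    x = np.asarray(symbols, dtype=int)
    out = np.full(len(x), -1)
    for t in range(n, len(x)):
        out[t] = x[t - n:t + 1].sum() % 2
    return out


def sample_sensors(S, tau_state):
    """Pick the sensor rows at timesteps (t' + 1) * tau_state - 1,
    i.e. one timestep before the next input symbol is provided."""
    S = np.asarray(S)
    idx = np.arange(tau_state - 1, S.shape[0], tau_state)
    return S[idx]
```

For the symbol sequence 1, 0, 1, 1, 0 and n = 1, the delay target is −1, 1, 0, 1, 1 and the parity target is −1, 1, 1, 0, 1. Both targets can then be regressed from the same sampled sensor matrix with two separate sets of readout weights.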

Examples of the system performance for the short-term memory task and the N-bit parity check task with τ_state = 5 and 11, respectively, can be found in figure 5 and electronic supplementary material, video S4. The system output shows almost a perfect match with the target output when n = 1 and 2 in τ_state = 5 for the short-term memory task (figure 5a) and in τ_state = 11 for the N-bit parity check task (figure 5b). For both tasks, the performance gradually gets worse when the delay n is increased. To evaluate the influence of the parameters of τ_state and n on the system performance, we introduced a measure based on mutual information, MI_n, between the system output and the target output [29]. This measure evaluates the similarity between the system output and the target output and, in our experiment, can take the value of 1 as maximum and 0 as minimum. Additionally, we introduced a measure called ‘capacity’, which is a summation of MI_n over the delays, expressed as Σ_{n=1}^{n_max} MI_n, where n_max is set to 10 in this analysis (see the electronic supplementary material for details). This measure evaluates the system's performance over the delays and can take 10 as maximum and 0 as minimum in our experiment.
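
A measure of this kind can be sketched in Python as below. The exact estimator used in the study is described in the electronic supplementary material, so this example is an assumption: it uses the standard plug-in estimate of mutual information (in bits) between two binary sequences, and the names `mutual_information` and `capacity` are hypothetical.

```python
import numpy as np


def mutual_information(output, target):
    """Plug-in estimate of the mutual information (bits) between two
    binary sequences, computed from their empirical joint distribution."""
    o = np.asarray(output, dtype=int)
    y = np.asarray(target, dtype=int)
    mi = 0.0
    for a in (0, 1):
        for b in (0, 1):
            p_ab = np.mean((o == a) & (y == b))
            p_a = np.mean(o == a)
            p_b = np.mean(y == b)
            if p_ab > 0:  # empty cells contribute nothing
                mi += p_ab * np.log2(p_ab / (p_a * p_b))
    return mi


def capacity(mi_values):
    """Sum of MI_n over the delays n = 1, ..., n_max."""
    return float(np.sum(mi_values))
```

With a balanced binary target, a perfect output gives a value of 1 bit and a constant output gives 0, which matches the stated range. Note that this plug-in estimate also returns 1 for a perfectly inverted output, so it measures statistical dependence rather than agreement.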

Figure 5. — Examples of the output timeseries for the function emulation tasks. (a) Plots showing the example of the performance in the short-term memory task with τ_state = 5. (b) Plots showing the example of the performance in the N-bit parity check task with τ_state = 11. The open squares show the target outputs and the filled squares show the system outputs, and the cases for n = 1, 2, 3 and 4 are shown. (Online version in colour.)

Figure 6a,b shows the results of the average MI_n for each n value and the average capacity for each τ_state for each task (see the electronic supplementary material for details of the setting). For the short-term memory task, when τ_state is increased, the value of MI_n suddenly drops when n is larger than 2 (figure 6a, left). For the capacity, increasing τ_state results, first, in a gradual decrease and then in saturation at a constant value for τ_state > 11 (figure 6a, right). This can be explained by the behaviour of the arm (see electronic supplementary material, video S4)—if the length of the input symbol is short, it is more likely that the current transient dynamics contains the trace of previously provided input symbols. Considering that the arm base takes about nine timesteps to get from one end to the other, if τ_state gets larger than nine timesteps, the trace of previous input symbols starts to fade out gradually. Nevertheless, the arm can possess information about the last input symbol because of the simple one-way bend motion. This explains the maximal performance with respect to MI_n when n = 1. To see the contribution of the physical body to the computational task, we compared the performance with a model that has a readout directly attached to the input (see the electronic supplementary material for details of the setting). We can confirm that this model cannot perform this task at all, suggesting that the performance of our system is purely based on the body dynamics (figure 6a, right).

For the N-bit parity check task, even if τ_state is small (τ_state = 5), when n = 1 (figure 6b, left), MI_n shows a smaller value than when τ_state is larger (τ_state = 10 and 20). When τ_state gets larger (τ_state = 10), MI_n starts to show the highest value when n = 1, and a moderately high value when n = 2. If we increase τ_state further (τ_state = 20), MI_n still shows the highest value when n = 1, but the value when n = 2 starts to decrease. This tendency reflects the results of the capacity (figure 6b, right). The capacity shows a peak around τ_state = 9, 10 and 11. The low values of capacity in τ_state less than 9 and larger than 11 are because of the low values of MI_n for n = 1 and n = 2, respectively. Additionally, in this task, the model with a readout directly attached to the input cannot perform the emulation at all (figure 6b, right). Considering that the N-bit parity check task requires not only memory, but also nonlinearity to perform, this result suggests that, even if the transient dynamics of the arm possesses a high memory capacity when τ_state is low, it does not contain sufficient nonlinearity to be exploited. This is interesting, because this result is not detectable simply by looking at the arm motion. Furthermore, the results show that the amount of computational capacity depends on the type of motion generated in the arm.

We have further characterized the computational power of our system by comparing its performance with a conventional ESN, which has the same I/O settings with the same training procedures for the readout weights, the same number of computational nodes (10 fully coupled nodes with one bias term), and the same length of training and evaluation datasets. It has been shown that the computational performance of an ESN depends on the spectral radius of the reservoir connectivity matrix [11]. In each task, we varied the spectral radius of the ESN from 0.05 to 2.0 and calculated the averaged capacity over 30 trials in each spectral radius value, with a new ESN in each trial. For the short-term memory task, the best capacity value of the ESN was 4.59 ± 0.58 (electronic supplementary material, figure S5, left), whereas our system showed a best capacity value of 2.50 ± 0.08 when τ_state = 5 (figure 6a, right), which was lower than that of the ESN. For the N-bit parity check task, the best capacity value of the ESN was 1.65 ± 0.37 (electronic supplementary material, figure S5, right), and our system showed a similar best capacity value of 1.65 ± 0.07 when τ_state = 11 (figure 6b, right). Considering that soft bodies have multifaceted usages and advantages in addition to the computational abilities presented here, whereas the ESN is focused only on computational tasks, we think that our system performance is at a satisfactory level. Further details of these comparisons are given in the electronic supplementary material.
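
For readers unfamiliar with ESNs, a reservoir of the size used in this comparison can be sketched in Python as follows. The details of the ESN used in the study are given in the electronic supplementary material, so everything beyond '10 fully coupled nodes' and 'spectral radius' is an assumption here: the uniform weight distribution, the tanh activation, the input weights and the names `make_esn` and `run_esn` are chosen for illustration only.

```python
import numpy as np


def make_esn(n_nodes=10, spectral_radius=0.9, seed=0):
    """Random fully coupled reservoir rescaled to a given spectral radius."""
    rng = np.random.default_rng(seed)
    W = rng.uniform(-1.0, 1.0, (n_nodes, n_nodes))
    # Rescale so that the largest absolute eigenvalue equals spectral_radius.
    W *= spectral_radius / np.max(np.abs(np.linalg.eigvals(W)))
    w_in = rng.uniform(-1.0, 1.0, n_nodes)
    return W, w_in


def run_esn(W, w_in, inputs):
    """Drive the reservoir with a scalar input sequence and collect its states."""
    x = np.zeros(W.shape[0])
    states = []
    for u in inputs:
        x = np.tanh(W @ x + w_in * u)  # assumed tanh node nonlinearity
        states.append(x.copy())
    return np.array(states)
```

The resulting state matrix plays the same role as the sensory timeseries of the arm: a constant bias column is added and a static linear readout is trained on top of it.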

4. Discussion

In this study, we have systematically demonstrated that the body dynamics of the soft silicone arm can be exploited as computational resources. In particular, for the closed-loop control task, our results suggest that soft body dynamics can be sufficient to control the body without the need for an external controller to provide additional memory capacity. This can be, for example, directly applied to the recently proposed octopus-inspired swimming robot [27] to generate the arm motion in a closed-loop manner by exploiting the body dynamics itself, which largely outsources to the body the computational load required to generate the motor command. The technique presented here can potentially be applied to a wide class of soft robots, because the main component required is the soft body itself. Consequently, different types of morphology and material properties of robots that increase the computational capacity of the body should be explored in the future. In addition, developments in new types of sensors, which can effectively monitor body dynamics, would make the presented approach usable in additional applications. To conclude, we believe that we have presented a crucial step towards a novel control scheme for soft robots.

In reservoir computing studies, it has been established that to have powerful computational capabilities, a reservoir should have the properties of input separability and fading memory [10]. Input separability is usually achieved by a nonlinear mapping of the low-dimensional input to a high-dimensional state space. Fading memory is the property of retaining the influence of a recent input sequence within the system, which permits integration of stimulus information over time. This guarantees reproducible computation for tasks in which the recent history of the signal is important. Our insight here was to exploit soft body dynamics as a reservoir. Soft bodies with passive dynamics are typically underactuated systems [7]. This naturally maps the actuation signal into the higher dimension of the soft body, which realizes the separability of the actuation signal. Furthermore, the interaction between the body and the environment (in our case, the underwater environment) implements fading memory: once actuated, the body takes a certain amount of time to relax because of the damping effect provided by the environment. Mechanical structures exhibiting these properties can also be exploited with our approach.
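The fading memory property described above can be illustrated with a toy example. The sketch below is not a model of the silicone arm: it assumes a one-dimensional damped system (a leaky integrator with a hypothetical `decay` parameter standing in for environmental damping) and compares two trajectories whose inputs differ only at the first time step.

```python
def simulate(inputs, decay=0.8):
    """Leaky integrator: the state relaxes towards the current input."""
    x = 0.0
    trajectory = []
    for u in inputs:
        x = decay * x + (1.0 - decay) * u
        trajectory.append(x)
    return trajectory

# Two input sequences that differ only in their first value.
shared_tail = [0.5] * 50
traj_a = simulate([1.0] + shared_tail)
traj_b = simulate([0.0] + shared_tail)

# The gap between the two states shrinks by the factor `decay` at every step,
# so the influence of the old input fades while recent inputs still matter.
gaps = [abs(a - b) for a, b in zip(traj_a, traj_b)]
initial_gap = gaps[0]
final_gap = gaps[-1]
```

With stronger damping (smaller `decay`) the influence of past inputs disappears faster, which in this toy setting corresponds to a shorter memory.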

The framework presented in this study may also shed light on the role of the body in biological systems. Such systems have soft bodies that can adapt and behave effectively in a given ecological niche. For example, the octopus does not have any rigid components in its body, but it shows extremely sophisticated behaviour that capitalizes on its body morphology and muscle structures [31]. In particular, we have shown that a form of short-term memory, which is thought to be a functionality of the brain, can also be found in soft body dynamics. We think that this line of study is an interesting research direction to be explored further.

Supplementary Material

Supplementary Information

rsif20140437supp1.pdf (1.1 MB, PDF)

Funding statement

This work was partially supported by the European Commission in the ICT-FET OCTOPUS Integrating Project (EU project no. FP7-231608), and by the JSPS Postdoctoral Fellowships for Research Abroad.

References

1. Pfeifer R, Lungarella M, Iida F. 2012. The challenges ahead for bio-inspired ‘soft’ robotics. Commun. ACM 55, 76–87. (doi:10.1145/2366316.2366335)
2. Trivedi D, Rahn CD, Kier WM, Walker ID. 2008. Soft robotics: biological inspiration, state of the art, and future research. Appl. Bionics Biomech. 5, 99–117. (doi:10.1080/11762320802557865)
3. Kim S, Laschi C, Trimmer B. 2013. Soft robotics: a new perspective in robot evolution. Trends Biotechnol. 31, 287–294. (doi:10.1016/j.tibtech.2013.03.002)
4. Shepherd RF, Ilievski F, Choi W, Morin SA, Stokes AA, Mazzeo AD, Chen X, Wang M, Whitesides GM. 2011. Multi-gait soft robot. Proc. Natl Acad. Sci. USA 108, 20 400–20 403. (doi:10.1073/pnas.1116564108)
5. Pfeifer R, Lungarella M, Iida F. 2007. Self-organization, embodiment, and biologically inspired robotics. Science 318, 1088–1093. (doi:10.1126/science.1145803)
6. Pfeifer R, Bongard J. 2006. How the body shapes the way we think: a new view of intelligence. Cambridge, MA: MIT Press.
7. Tedrake R. 2009. Underactuated robotics: learning, planning, and control for efficient and agile machines. Course Notes for MIT 6.832: Working draft edition.
8. Jaeger H. 2002. Tutorial on training recurrent neural networks, covering BPTT, RTRL, EKF and the ‘echo state network’ approach. GMD report 159, German National Research Center for Information Technology.
9. Jaeger H, Haas H. 2004. Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication. Science 304, 78–80. (doi:10.1126/science.1091277)
10. Maass W, Natschläger T, Markram H. 2002. Real-time computing without stable states: a new framework for neural computation based on perturbations. Neural Comput. 14, 2531–2560. (doi:10.1162/089976602760407955)
11. Verstraeten D, Schrauwen B, D'Haene M, Stroobandt D. 2007. An experimental unification of reservoir computing methods. Neural Netw. 20, 391–403. (doi:10.1016/j.neunet.2007.04.003)
12. Rabinovich M, Huerta R, Laurent G. 2008. Transient dynamics for neural processing. Science 321, 48–50. (doi:10.1126/science.1155564)
13. Fernando C, Sojakka S. 2003. Pattern recognition in a bucket. Lect. Notes Comput. Sci. 2801, 588–597. (doi:10.1007/978-3-540-39432-7_63)
14. Hauser H, Ijspeert AJ, Füchslin RM, Pfeifer R, Maass W. 2011. Towards a theoretical foundation for morphological computation with compliant bodies. Biol. Cybern. 105, 355–370. (doi:10.1007/s00422-012-0471-0)
15. Hauser H, Ijspeert AJ, Füchslin RM, Pfeifer R, Maass W. 2012. The role of feedback in morphological computation with compliant bodies. Biol. Cybern. 105, 595–613. (doi:10.1007/s00422-012-0516-4)
16. Nakajima K, Hauser H, Kang R, Guglielmino E, Caldwell DG, Pfeifer R. 2013. A soft body as a reservoir: case studies in a dynamic model of octopus-inspired soft robotic arm. Front. Comput. Neurosci. 7, 91. (doi:10.3389/fncom.2013.00091)
17. Nakajima K, Hauser H, Kang R, Guglielmino E, Caldwell DG, Pfeifer R. 2013. Computing with a muscular-hydrostat system. In Proc. IEEE Int. Conf. on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013, pp. 1504–1511. (doi:10.1109/ICRA.2013.6630770)
18. Caluwaerts K, D'Haene M, Verstraeten D, Schrauwen B. 2013. Locomotion without a brain: physical reservoir computing in tensegrity structures. Artif. Life 19, 35–66. (doi:10.1162/ARTL_a_00080)
19. Jaeger H. 2001. Short term memory in echo state networks. GMD report 152, German National Research Center for Information Technology.
20. White O, Lee D, Sompolinsky H. 2002. Short-term memory in orthogonal neural networks. Phys. Rev. Lett. 92, 148102. (doi:10.1103/PhysRevLett.92.148102)
21. Ganguli S, Huh D, Sompolinsky H. 2008. Memory traces in dynamical systems. Proc. Natl Acad. Sci. USA 105, 18 970–18 975. (doi:10.1073/pnas.0804451105)
22. Buonomano DV, Maass W. 2009. State-dependent computations: spatiotemporal processing in cortical networks. Nat. Rev. Neurosci. 10, 113–125. (doi:10.1038/nrn2558)
23. Nikolić D, Häusler S, Singer W, Maass W. 2009. Distributed fading memory for stimulus properties in the primary visual cortex. PLoS Biol. 7, e1000260. (doi:10.1371/journal.pbio.1000260)
24. Cianchetti M, Arienti A, Follador M, Mazzolai B, Dario P, Laschi C. 2011. Design concept and validation of a robotic arm inspired by the octopus. Mater. Sci. Eng. C 31, 1230–1239. (doi:10.1016/j.msec.2010.12.004)
25. Calisti M, Giorelli M, Levy G, Mazzolai B, Hochner B, Laschi C, Dario P. 2011. An octopus-bioinspired solution to movement and manipulation for soft robots. Bioinsp. Biomim. 6, 036002. (doi:10.1088/1748-3182/6/3/036002)
26. Martinez RV, Branch JL, Fish CR, Jin L, Shepherd RF, Nunes RMD, Suo Z, Whitesides GM. 2013. Robotic tentacles with three-dimensional mobility based on flexible elastomers. Adv. Mater. 25, 205–212. (doi:10.1002/adma.201203002)
27. Sfakiotakis M, Kazakidi A, Pateromichelakis N, Tsakiris DP. 2013. Octopus-inspired eight-arm robotic swimming by sculling movements. In Proc. IEEE Int. Conf. on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013, pp. 5155–5161. (doi:10.1109/ICRA.2013.6631314)
28. Rumelhart DE, McClelland JL. 1987. Parallel distributed processing: explorations in the microstructure of cognition: vol. 1: foundations. Cambridge, MA: MIT Press.
29. Bertschinger N, Natschläger T. 2004. Real-time computation at the edge of chaos in recurrent neural networks. Neural Comput. 16, 1413–1436. (doi:10.1162/089976604323057443)
30. Snyder D, Goudarzi A, Teuscher C. 2013. Computational capabilities of random automata networks for reservoir computing. Phys. Rev. E 87, 042808. (doi:10.1103/PhysRevE.87.042808)
31. Hochner B. 2012. An embodied view of octopus neurobiology. Curr. Biol. 22, R887–R892. (doi:10.1016/j.cub.2012.09.001)

