Skip to main content
Sensors (Basel, Switzerland) logoLink to Sensors (Basel, Switzerland)
. 2020 Mar 30;20(7):1943. doi: 10.3390/s20071943

Multibody System-Based Adaptive Formation Scheme for Multiple Under-Actuated AUVs

Hai Huang 1, Qirong Tang 2, Guocheng Zhang 1, Tiedong Zhang 1,*, Lei Wan 1, Yongjie Pang 1
PMCID: PMC7180628  PMID: 32235677

Abstract

Underwater vehicles’ coordination and formation have attracted increasing attention since they have great potential for real-world applications. However, such vehicles are usually under-actuated and with very limited communication capabilities. On the basis of the multibody system concept, a multiple autonomous underwater vehicle formation and communication link framework has been established with an adaptive and radial basis function (RBF) strategy. For acoustic communication, a packets transmission scheme with topology and protocol has been investigated on the basis of an acoustic communication framework and transmission model. Moreover, the cooperative localization errors caused by packet loss are estimated through reinforcement learning radial basis function neural networks. Furthermore, in order to realize formation cruising, an adaptive RBF formation scheme with magnitude reduced multi-layered potential energy functions has been designed on the basis of a time-delayed network framework. Finally, simulations and experiments have been extensively performed to validate the effectiveness of the proposed methods.

Keywords: underwater vehicle formation, limited communication, adaptive formation, multibody system

1. Introduction

Currently, Autonomous Underwater Vehicles (AUVs) are increasingly attractive for various underwater tasks such as environmental exploration [1], seabed survey [2], harbor protection [3], and submarine search and rescue [4]. Recent advances in Autonomous Underwater Vehicle (AUV) research have enabled the utilization of multiple AUVs (MAUVs) to realize complex marine missions [5,6,7]. Moreover, although acoustic communications allow the improvement of localization precision, they are still limited with propagating delays and communication channel noise [8]. The upper bound range rate of underwater acoustic channel is 40 km·kb/s [9]. In order to achieve a global objective, each AUV can be taken as a relay and mobile node [10]. Information flow with interconnection topology graphs is necessary for reliable communication [11]. These graphs can have undirected or directed edges for the model of position constraints, information flow, or leader following inter-agent control specifications [12]. Therefore, the MAUVs’ acoustic communication structure should include both communication topology structure [13] and some collision-free data link schemes [14] in order to reduce information congestion and improve flow efficiency [15]. Meng, Shi, and Wang [16] presented a multi-channel scheme based on the multiple access with collision avoidance protocol to improve the network efficiency using code division multiple access (CDMA). Two channels are used for RTS (request to send)/CTS (clear to send) data packets. All nodes in the network are assigned to the same common channel for any packet arrival. One of the disadvantages of this protocol is the centralized nature of the CDMA scheme. In modular and scalable communication architecture, the data infrastructure includes navigation data, tracking data, target data, data request telegrams, etc. [17]. It can be reused, expanded, and bridged for wide area networks. However, the data for transmission is so great that wireless communication was used to assist acoustic communication during the sea trials of two AUVs’ cooperative missions [18]. Guo, Frater, and Ryan [19] proposed an adaptive propagation-delay tolerant media access control (MAC) [20] protocol for an underwater acoustic communication network. The protocol performs an improved handshaking with destination using RTS and CTS frames before transmitting the actual data frame to improve the communication efficiency. In order to further consider the channel blocking effect for simultaneous localization, artificial intelligence-based learning strategies have been employed for controllers [21,22].

In order to realize MAUVs’ formation, the vehicles’ trajectories not only depend on their own dead reckoning and control [23,24], but also depend on group objectives and environmental obstacles. Pramod Abichandani et al. [25] proposed a mixed integer nonlinear programming method for MAUVs’ motion planning under constraint communication. The collision-free trajectories accommodate stricter safety requirements despite intersecting or overlapping paths. Yukun Lin et al. [26] utilized a leader/follower multi-AUV control system to enable the AUVs to drive toward the target through a collision-free path. Mingzhi Chen et al. [27] proposed a novel cooperative hunting algorithm for inhomogeneous MAUVs to achieve quick and active path pursuit and planning. Marcello Farina et al. [28] proposed a distributed predictive control approach for robot coordination. The cost function was defined for rapid exploring formation control [12], coverage sensing, and collision avoidance. However, MAUVs’ formation in unknown environments and navigation in hostile environments [29] are often baffled with vehicle nonlinearity, constraint communications, environmental disturbance, and obstacles. Xiang Cao et al. [30] proposed a target following a cooperative search approach by combining the Glasius bio-inspired neural network with a bio-inspired cascaded tracking control approach to improve their search efficiency and reduce tracking errors. Hao Wang et al. [31] designed a smoothly switching function-based neural network adaptive technique to compensate system uncertainties for cooperative path following. Chengzhi Yuan et al. [32] proposed a learning-based formation scheme for multiple AUVs on heterogeneous nonlinear uncertain dynamics under the virtual leader-following framework, which includes an adaptive observer and a deterministic learning controller. The learned knowledge can be effectively stored in a time-invariant fashion by using radial basis function (RBF) neural networks. However, the formation error and stability not only lie in the controller quality, but they also are affected by the cooperative localization errors caused by packet transmission losses.

This paper proposed an adaptive formation scheme on the basis of multibody system concept, the contributions can be summarized in the following:

  1. On the basis of the multibody system concept, the MAUVs’ formation and communication link framework has been established. The connection between AUVs can be viewed as a springs and damping system. An adaptive control strategy has been set up for multiple under-actuated AUVs formation with a desired formation region and magnitude reduced artificial potential function.

  2. On the basis of the MAUVs’ formation and communication link framework, the packets transmission scheme has been designed with learning-based multi-layered network topology; the cooperative localization errors caused by packet loss are estimated and modified through reinforcement learning RBF neural networks.

  3. On the basis of the MAUVs’ formation and communication link framework, an adaptive RBF formation scheme with magnitude reduced multi-layered potential energy functions has been designed on the basis of the time-delayed network framework. Simulations and experiments have verified the performance of the purposed schemes.

The rest of this study is organized as follows. In Section 2, the MAUVs’ formation and communication link framework will be proposed. An adaptive formation control approach of multiple AUVs will be proposed in Section 3. Simulations and experiments will be discussed and analyzed in Section 4. We will present the conclusion in Section 5.

2. MAUVs’ Formation and Communication Link Framework

2.1. Multibody-Based Formation Framework

When MAUVs cruise in the oceanic environment, they can be taken as mobile nodes. In comparison with the nodes of a normal multibody system, the distances between the nodes and their shapes are changing over time. Their connections are soft and established through communications and formation control approaches. However, the multibody system concept can strengthen the bonds and formation robustness between AUVs. From the multibody system concept (see Figure 1), the constraints between adjacent AUVs are designed through a virtual spring and damping system. During the formation process, the distance constraints between the i-th node and the j-th or the k-th node could be modified or from the variation of the external environment during the formation process. Although the formation is expected to maintain the shape, it is still necessary for shape modification for varied external environments, such as obstacles, narrow passages, dangerous zones, and so forth. Therefore, the shape modification could also lead to changes in the node positions and constraints of the multibody system.

Figure 1.

Figure 1

Multibody system based multiple Autonomous Underwater Vehicle (AUV) formation framework.

2.2. Adaptive Communication Protocol

If we take multiple AUVs’ formation as a multibody system, the mobile AUV nodes should be connected and coordinated over network communication. However, constantly varying the nodes’ distance and transmission latency could lead to the difficulties in data transmission and relative distance observation. Moreover, the energy consumption is correlated with the data transmission distance from source nodes. We adopt a multi-layer-based AUV formation topology according to their relative distance. Linear topology communication is applied for objective AUV nodes far away from the source AUV node while others are closer in the same direction; a one–many contending topology communication is applied for the other leader–follower conditions. The Multiple AUVs network topology is shown in Figure 2.

Figure 2.

Figure 2

Multiple AUVs (MAUV) network topology.

2.2.1. Protocol for Linear Topology

For communication between source node and objective nodes, the protocol at the data link layer includes four-way handshaking access methods for packets waiting and transmission control such as “RTS”, “CTS”, “Data”, “Acknowledgment for Receiving” (ACK), and “Blocked to Send (BTS)”. The communication process is illustrated in Figure 3.

Figure 3.

Figure 3

Communication protocol for linear topology.

After channel contention and selection, the source AUV node and objective AUV nodes realize time consensus through broadcasting and answering. The source AUV node will send information to the objective AUV node 2 through objective AUV node 1.

Secondly, the source AUV node will send information to the source node. The format of the data package is {RTS/overtime, node_pos, node_speed, destination_node}, which denotes the data send request, present position, speed, and destined AUV node (in Figure 3, the objective AUV node 2 is supposed as the destined AUV node). When the “RTS” message has been received by the objective AUV node 1, it will be sent to the objective AUV node 2 immediately. At the same time, objective AUV node 1 will be waiting for the “CTS” message from the objective AUV node 2 or a return timeout frame. When the “RTS” message has been received by the objective AUV node 2, it is informed about the forthcoming message, comes into “response adjustment” status, and sends the “CTS” message to the source AUV node through the objective AUV node 1. When the “CTS” message is received by the objective AUV node 1, it will be transmitted to the source AUV node with the format of the data package as {CTS1/overtime, node1_pos, node1_speed, CTS2/overtime, node2_pos, node2_speed}, which denotes the speed and position of objective nodes. When timeout happens, the source AUV node will send the request again or reselect another objective AUV node.

Thirdly, when the “CTS” message is received, “data” will be sent from the source AUV node to the objective AUV node 2 through the objective AUV node 1. When the objective AUV node 1 received “data”, it will come into “Response adjustment” status and send the data package to the objective AUV node 2. After the data has been received, the objective AUV node 2 will return “ACK” to the source node through the objective AUV node 1. The format of the data package is {ACK1/overtime, node1_pos, node1_speed, ACK2/overtime, node2_pos, node2_speed}, which denotes the speed and position of objective nodes. After “ACK” has been received by the source node, the transmission process will terminated.

2.2.2. Protocol for One-Many Contending Topology

The protocol includes a four-way handshaking access method for “RTS”, “CTS”, “Data”, and “Acknowledgment for receiving”, as well as “Blocked to Send” packets for waiting control. The “Response adjustment” time includes the time of propagation and process delay. Once a source decides to start transmission through one channel, the handshaking process will start and transmit a “Blocked to Send” to other sources (other AUVs) at the same time (see Figure 4).

Figure 4.

Figure 4

Communication protocol for one–many contending topology.

At the first stage, when the RTS frame is received, the destination is notified for the forthcoming transmission. The destination goes to the “Response adjustment” state to receive the packets from its neighbor through the selected channel. A block to send is transmitted to other neighbors so as to alert potential interferers that this channel will be busy for the whole carrying time before it can cause a collision.

At the second stage, the source waits until receiving either “CTS” or a timeout frame. When a timeout occurs, the source is back to the channel contention and selection state. Obviously, the propagation delay between a frame and its “Response adjustment” is at least equal to the length of the frame to be transmitted/received in it so that the node response can be dealt with one after another. Thus, the transmission of an “RTS” frame and reception of a “CTS” frame are two actions that have the same maximum single-trip propagation delay, Pmax. If we define the fixed length gap between a control frame and its consequent frame as “CML”, thus, the gap at the source between RTS and CTS is called CMLRTS, and the gap at the destination between “CTS” and “Data” is called “CMLCTS”. We define:

CMLRTS = CMLCTS = Pmax (1)

for the worst propagation scenario. After receiving the RTS frame, the destination then uses the distance information measured from the “RTS” frame to calculate the time to reply with a “CTS” frame so that the “CTS” frame reaches the source after a “CML” space can be counted. During the gap of “CML”, a potential interferer is avoided for collision-free transmission. Once the “Adjusted Response” state finishes, the source sends the data packets through the corresponding channel and goes to the “ACK” state. In summary, the second stage allows the destination to negotiate with the source, which gives both the source and the destination more flexibility and therefore reduces the chance that the destination fails due to channel collision.

The third stage starts as soon as the “CTS” frame is actually received. During this stage, if the destination receives “Data” from the source, it goes to the “Response adjustment” state to verify that the data packet is coming from the source. Otherwise, a timeout occurs.

At the fourth stage, “ACK” for the corresponding data packets are sent through the selected channel once the “Response adjustment” state finishes. After receiving the first ACK packet, the source finishes its transmission process. The BTS values are reset, and the node goes to a “Channel request” state if there are packets to transmit.

2.3. RBF Learning Network for Localization Errors Estimation

The sound propagation loss is one of the major reasons for cooperative localization errors. It is composed mainly of three aspects: namely, geometrical spreading, attenuation by absorption, and the anomaly of propagation:

10logA(l,f)=k10logl+lα (2)

where α is the absorption coefficient in dB/km, k represents the geometrical spreading factor, l represents the transmission range, and f represents the signal frequency.

If we set Nt as the turbulence noise, Nv as the vehicle noise, Nw as the wind driven wave noise, and Nth as the thermal noise,

N=Nt+Nv+Nw+Nth (3)

therefore, we obtain the channel capacity as:

Etrans=Blog2(1+PtxA(l,f)NB)df (4)

where B is the bandwidth and Ptx is the signal transmission power.

MAUVs in the formation should not only keep the formation configuration to realize purposed missions, but also avoid collision with obstacles. The formation shape and relative distances maintenance are important. If we set pc as the formation center, one obtains:

pc=1Ni=1Npi. (5)

Each AUV can acquire a geometric center by communicating with its neighbors so as to keep the formation. Hence, the error between pc and the desired center, pcd= [xcd,ycd,zcd,θcd,ψcd,ϕcd]T is the desired center of the formation region:

e=pcpcd+W^iTσ(si) (6)

where W^1,iTσ1,i(si)=[W^1,iTσ1,i(si),W^2,iTσ2,i(si),W^3,iTσ3,i(si)]T is the RBF neural network to estimate three dimensional cooperative localization errors caused by the data transmission packets loss and measurement noise. W^=[w1,,wNh] is the weight vector, while si represents the input, including the packet loss, delay, current relative distance and between the AUVS, throughput, and current AUV speed.

The output of the RBF neural network can be expressed in the following:

fi=m=1Nh[wimσ¯(k=1Niξmkμk+δξj)+δwj],i=1,2,,No (7)

where Nh, Ni, and No represent the number of hidden layers, input layers, and output neurons. wim and ξmk denote the network weights, δξj and δwj represent the threshold offsets, and σ¯() denotes the Gaussian function:

σ¯(sri)=exp((sri)T(sri)γi2) (8)

where ri is the center vector of the receptive field. wim can be obtained through the following reinforcement learning algorithm.

w(s(t),ak(t))=w(s(t),ak(t))+α[r(t+1)+γw(s(t+1))w(s(t),ak(t))] (9)

In this algorithm, the action is taken on the packets transmission episode. The actions are chosen through the ε greedy strategy. If ε>>0, the actions are taken randomly of a(t)U(amin,amax). When ε<<1, the system exploits the knowledge through selecting the actions. The actions are selected through the comparisons between a random value of xεU[0,1] and ε:

a(t)={U(amin,amax)ifxεεargmaxaw(s(t),ak(t))ifxε>ε. (10)

The actions represent the power transmission levels. The state is a combination of transmission energy Etrans and channel transmission error evaluation, Perror:

{s(t)=Etrans+PerrorPerror=1(1Berror)Nbit (11)

where Berror is the bit error rate and Nbit is the number of bits in the packet [13]. If each transmission action attempts to transmit the total packets, the rewards are defined as a combination of packets reception and energy power levels:

r(t)=π[(prq(t)1)npts+(nptspEdiss(t))mprnpts2] (12)

where π is the quantization step size factor between two consecutive quantization levels. pr(t) and pEdiss(t) are the packets reception levels and energy dissipation levels, respectively, while mpr is the number of quantized pr(t) levels.

If one defines

p˙cd=L(G)epcd+p+1β (13)

where β is the maximum speed of desired trajectory pd, β=max(pd), is the Kronecker product.

Then, the derivative of the error is given by:

e˙=(L(G)+I)e+1Nj=1Np˙j+1Nj=1N(pjpi)β+W^˙1,iTσ1,i(si) (14)

where W^˙1,iT=Γ1,i(σ(si)η1,i+τ1,iW^1,iT). Γ1,i, τ1,i, and η1,i are free parameters, η1,i=[η11,i,η12,i,η13,i]T.

2.4. Formation Shape Maintenance with Potential Field

Potential functions play a great role in helping AUVs move along the desired gradients directions and finally stabilize at the local minima. The following will define the layered potential functions’ shape for the AUVs to reach the desired region and maintain a formation shape (see Figure 5).

fS(δηi)=[fS1(δηio1),fS2(δηio2),,fSm(δηiom)]T0 (15)

where ηiol=ηiηol, ηol is a constant reference point of the l-th desired region, l=1,2,,m, and m is the total number of objective functions. fSl(δηiol) represents the scalar functions with continuous partial derivatives. From Equation (1), the desired range of AUV motions in the formation is defined as a cylindrical and ring-shape region. For each AUV pi, the desired region is the ring centered around pcd between R1 and R2 with height h. Therefore, the scalar attractive forces of the shape function can be defined as follows.

Layer 1:{fS1(δηio1)=(xixcd)2(yiycd)2R120fS1(δηio1)=(zizcd)2h20fS1(δηio1)=(θiθcd)2θh20Layer 2:{fS2(δηio2)=R12(xixcd)2(yiycd)20andfS2(δηio2)=(xixcd)2(yiycd)2R220fS2(δηio2)=(zizcd)2h20fS2(δηio2)=(θiθcd)2θh20Layer 3:{fS3(δηio2)=R22(xixcd)2(yiycd)20andfS3(δηio2)=(xixcd)2(yiycd)2R320fS3(δηio2)=(zizcd)2h20fS3(δηio2)=(θiθcd)2θh20 (16)

Hence, the center of the desired formation region is:

pcd=[rcd,zcd,θcd]T. (17)

If kl is set as a positive constant, the traditional potential energy function for the desired formation regions in Figure 5 is:

PSl(δηiol)=kl2[max(0,fSl(δηiol))]2={0fSl(δηiol)0kl2fSl2(δηiol)fSl(δηiol)>0. (18)

Figure 5.

Figure 5

The layered region for AUV formation and collision avoidance.

In the consideration with the under-actuated characteristic of AUV, the potential energy functions’ magnitude produced from three-dimensional distances have been reduced to improve the scheme robustness and convergence. On the other hand, since the rudder angle is significant for under-actuated AUV to arrive at desired positions, the yaw error of AUV formation appears to be more important.

{PSm(δpiom)=km2[max(0,In(|fSm(δpiom)|)]={0fSm(δpiom)0km2In(|fSm(δpiom)|)fSm(δpiom)>0m=1,2,3PSm(δpiom)=km2[max(0,fSm(δpiom))]2={0fSm(δpiom)0km2fSm2(δpiom)fSk(δpiom)>0m=4 (19)

Thus, the region error for the i-th AUV is defined as follows.

(PSi(δpiok)pi)T=m=1N(km1max(0,fSm(δpiom))×(fSm(δpiom)δqiom)T)+k4max(0,fS4(δpio4))×(fSk(δpio4)δpio4)T=Δξi (20)

For the collision avoidance conditions, the repulsive forces between AUVs or AUVs and obstacles are defined in the form as:

firep=krep((1pipj)pipjpipj+(1pipoi)pipoipipoi) (21)

where poi is the position vector of the i-th obstacle, the energy functions are defined on the basis of the collision avoidance region:

{g1ij(δηij)=Ri12δηij20g2ij(δηij)=Ri22δηij20gNij(δηij)=RiN2δηij20 (22)

where δηij=ηiηj, g1ij,g2ij,,gNLij are the functions for the first layer, second layer,…, and the innermost layer, respectively, and these layers are continuous and differentiable, while N is the number of layers, and Ri1>Ri2>RiN denote the radius of the first, second, and innermost layers, respectively.

Similar to the equations shown in (19), the collision avoidance energy functions have been magnitude reduced as:

{Q1ij(δpij)=k1ij2[max(0,In|g1ij(δpij)|)]Q2ij(δpij)=k2ij2[max(0,In|g2ij(δpij)|)]QNij(δpij)=kNij2[max(0,In|gNij(δpij)|)] (23)

where kNij>>k2ij>k1ij are positive constants. The potential energy for collision avoidance between the i-th and j-th vehicle is:

Qij(δpij)=Q1ij(δpij)+Q2ij(δpij)++QLij(δpij)=k1ij2[max(0,In|g1ij(δpij)|)]2+k2ij2[max(0,In|g2ij(δpij)|)]2++kNij2[max(0,In|gNij(δpij)|)]2 (24)

and

QNij(δq1ij)δqij=h=1Nkhijmax(0,1ghij(δqij))×(ghij(δqij)δqij)TΔρij. (25)

Therefore, if pcd=[xcd,ycd,zcd,ϕcd,ψcd,θcd]T is set as the desired center of AUV formation, the desired AUV positions and formation shape can be obtained through Equations (16)–(25).

3. Adaptive RBF Formation Scheme

The dynamic equation of the i-th AUV can be expressed as:

Mi(pi)p¨i+Ci(pi)p˙i+Di(pi)p˙i+gi(pi)+Δi(pi)=Ti (26)

where Mi(pi) is the 6×6 mass matrix of the AUV, Ci(pi) is a 6×6 matrix of centrifugal and coriolis terms, Di(pi) is the damping matrix, gi(pi) is the vector of gravitational forces and moments, Δi(pi) is uncertain dynamics, and Ti contains the forces and torques from thrusts. If we define:

p˙ci=p˙cd(αiΔξi+γj=1NiΔρij) (27)

and set Δεi=αiΔξi+γj=1NiΔρij, we have p˙ci=p˙cdΔεi, where p¨ci=p˙cdΔε˙i. We define a sliding vector for the i-th AUV as:

si=p˙ip˙ci=p˙ip˙cd+Δεi. (28)

Thus, we obtain:

s˙i=p¨ip¨cd+Δε˙i. (29)

Substituting Equations (28) and (29) into Equation (27), one has:

Mi(pi)s˙i+Ci(pi)si+Di(pi)si+Mi(pi)p¨cd+Ci(pi)p˙cd+Di(pi)p˙cd+gi(pi)+Δi(pi)=Ti. (30)

According to the adaptive control principle, we obtain:

Mi(pi)p¨cd+Ci(pi)p˙cd+Di(pi)p˙cd+gi(pi)+Δi(pi)=ϒi(pi,p˙i,p˙cd,p¨cd)λi (31)

where ϒi(pi,p˙i,p˙cd,p¨cd) is a known regressor matrix and λi represents the dynamic parameters. Therefore, the RBF-based region based adaptive controller is:

Ti=KsisiKpΔεi+ϒi(pi,p˙i,p˙cd,p¨cd)λ^i+W^iTσ(si). (32)

If we set Li as positive definite matrices, the estimated parameter λ^i is updated as:

λ^i=LiϒiT(qi,q˙i,q˙cd,q¨cd)si. (33)

Therefore,

Mi(qi)s˙i+Ci(qi)si+Di(qi)si+Ksisi+KpΔεi+ϒi(qi,q˙i,q˙cd,q¨cd)Δλi+W^iTσ(si)=0 (34)

where Δλi=λiλ^i.

In order to prove the stability of the RBF-based adaptive formation scheme, we obtain a Lyapunov-like function for the multiple AUVs system as:

V=i=1N12siTMi(qi)si+i=1N12ΔλiTMi(qi)λi+k=1312W˜k,iTΓk,i1W˜k,i+i=1N12αiKpl=16KlPSm(δqlom)+i=1N12γiKpj=1NKijQ2ij(δqij) (35)

We obtain from Equations (20), (31), and (32):

V˙i=i=1NsiTKsisii=1NsiTDi(qi)sii=1NsiTKpΔεi+i=1NαiKpe˙TΔξi+i=1N12γiKpj=1Nih=1Lkhijδq˙ijT[max(0,ghij(δqij))](ghij(δqij)δqij)Tk=1312W˜k,iT(σ(si)ηk,i+τk,iW^k,iT) (36)

If we set ENi=[1,,1Ni]T, the last term of the Equation (36) can be rewritten by using Equation (25):

i=1N12γij=1NiKpe˙Δρiji=1N12γiKpj=1Nih=1Lkhije˙T[max(0,ghij(δqij))](ghij(δqij)δqij)T. (37)

From Equation (22), we can obtain

ghij(δqij)=ghji(δqji)andghij(δqij)δqij=ghji(δqji)δqji. (38)

Thus, the last term of Equation (35) can be written as

i=1N12γiKpj=1Nih=1Lkhije˙T[max(0,ghji(δqji))](ghji(δqji)δqji)T=i=1N12γiKpj=1Nih=1Lkhjie˙T[max(0,ghji(δqji))](ghji(δqji)δqji)T=i=1N12γiKpj=1Nih=1Lkhjie˙T[max(0,ghji(δqji))](ghji(δqji)δqji)T=i=1N12γiKpj=1Nie˙TΔρji=i=1N12γiKpj=1Nie˙TΔρij (39)

Moreover, τk,iW˜k,iW^k,iT12τk,i(W˜k,i2+W˜k,i*2), Wk,i* denotes the ideal constant weights.

Therefore, the time derivative of the Lyapunov function in Equation (37) is

V˙ii=1NsiTKsisii=1NsiTDi(qi)sii=1NKpΔεiTΔεi12τk,i(W˜k,i2+W˜k,i*2)0. (40)

From Equation (40), it can be obtained that si, Δεi, Δξ˙i, Δρ˙ij and Δε˙i are bounded. q¨ri is bounded if e¨ is bounded. Thus, s˙i is bounded from Equation (32). Applying Barbalat’s lemma, we obtain and si0 as t if e˙0. From Equation (28), Δρij0.

Since

Δεi=αiΔξi+γj=1NiΔρij0 (41)

as t, all the error terms are summing yields:

i=1N(αiΔξi+γj=1NiΔρij)0 (42)

Since the interactive forces between AUVs are bi-directional, the summation of all the interactive forces in the systems is zero, we obtain:

i=1NαiΔξi0. (43)

One trivial solution of Equation (43) is Δξi0, which means that all the AUVs remain in the desired region all the time because of Equation (40). This means that each AUV is in the desired region and maintains a minimum distance among themselves simultaneously. On the contrary, if we assume Δξi0, the AUV are outside the desired region. Thus, some of the AUVs must be on the opposite sides of the desired region and their Δξi values can not be cancelled out, which contradicts with the fact that i=1NαiΔξi=0. Therefore, the only possibility is i=1NαiΔξi=0 when Δξi=0. From Equation (41), Δρij=0. Therefore, if and only if all the forces of Δξi are zero or cancelled out, does i=1NαiΔξi=0. This means that some AUVs must be on the opposite sides of the desired region. When there are interactions or coupling among the AUVs from different sides of the desired region, a reasonable weightage can be obtained for Δξi by adjusting αi. Finally, since si0 and Δξi0, we can conclude from Equation (28) that Δρij0. Hence, all the AUVs are synchronized to the same speed and maintain constant distances among themselves at steady state.

4. Simulations and Experiments

In order to analyze and verify the designed communication link framework and formation scheme, simulations and experiments have been launched. In the formation simulations of Figure 6 and Figure 7, comparisons have been made on the proposed adaptive formation scheme with and without the RBF neural network. The disturbance is set with a current speed as 0.1 m/s in the west direction. The simulation includes the formation along a round curve and cruising in the confined channel. Their communications are simulated in the NS-2 simulator on the basis of the communication protocol of Section 2. The formation control simulation platform was established on the basis of AUV hydrodynamic equations.

Figure 6.

Figure 6

Figure 6

Formation simulation along a round curve.

Figure 7.

Figure 7

Formation simulation in the channel.

In Figure 6, the three AUVs are planned to follow a round curve with a line shape, e.g., the followers are planned to maintain the same distance one after another. The protocol for linear topology has been applied for the formation communication on the basis of the network framework of Section 2. Since the radius of the trace curvature is greater than the radius of the AUVs’ gyration, these three AUVs can keep formation cruising precisely. The package loss and data transmission throughput are illustrated in Figure 6b; one can improve the cooperative localization accuracy through reinforcement learning RBF neural network and therefore improve the formation stability. From Figure 6c, the reinforcement learning RBF neural network can compensate and reduce the cooperative localization errors caused by communication loss through Equations (12)–(14).

Channel cooperative exploration is one of the important applications, and it is very difficult for MAUVs because of the change of channel size and curve. Through the reinforcement learning RBF neural network, the MAUVs’ formation can obtain more accurate cooperative localization information. The multibody system-based potential field can help MAUVs maintain and change their formation shape according to the environment. The protocol for one–many contending topology and linear topology have been applied and switched according to the shape requirements.

Offshore experiments of MAUVs formation coverage exploration are illustrated in Figure 8. The vehicles were given folding lines with a 90-degree yaw path to test the formation performance of heterogeneous AUVs. The three AUVs can keep their formation while cruising under the strategies proposed in this study.

Figure 8.

Figure 8

Formation coverage experiments.

5. Conclusions

MAUVs’ formation is of great significance for marine surveys and exploration. In order to realize MAUVs’ formation, this study has focused on their communication and formation. On the basis of the multibody system concept, the MAUVs’ formation and communication link framework has been established with an adaptive RBF strategy. The connection for communication and formation between AUVs can be viewed as a springs and damping system. The packets transmission scheme has been designed with multi-layered network topology, which reduces the packets’ loss rate and improves the throughput of the network. Moreover, through the reinforcement-learning RBF neural networks, an adaptive RBF formation strategy can be improved with more accurate cooperative localization information. Simulations and offshore experiments with multiple heterogeneous under-actuated AUVs testify the performance of proposed method.

Author Contributions

Conceptualization, Y.P.; methodology, L.W. and Q.T.; data curation, G.Z.; writing—original draft preparation, H.H.; writing—review and editing, T.Z. and Q.T. All authors have read and agreed to the published version of the manuscript.

Funding

This project is funded by the National Nature Science Foundation of China, grant number: 61633009, 51579053, 51779059; and Field Fund of the 13th Five-Year Plan for Pre-research Equipment, grant number: 61403120301, and also funded by the Key Basic Research Project of “Shanghai Science and Technology Innovation Plan”, grant number: No.15JC1403300.

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Mohammad Pourmahmood Aghababa 3D path planning for underwater vehicles using five evolutionary optimization algorithms avoiding static and energetic obstacles. Appl. Ocean Res. 2012;38:48–62. doi: 10.1016/j.apor.2012.06.002. [DOI] [Google Scholar]
  • 2.Cheng C.-T., Fallahi K., Leung H., Tse C.K. A Genetic Algorithm-Inspired UUV Path Planner Based on Dynamic Programming. IEEE Trans. Syst. ManCybernytics-Part C Appl. Rev. 2012;42:1128–1134. doi: 10.1109/TSMCC.2011.2180526. [DOI] [Google Scholar]
  • 3.Lee T.-S., Lee B.H. A new hybrid terrain coverage method for underwater robotic exploration. J. Mar. Sci. Technol. 2014;19:75–89. doi: 10.1007/s00773-013-0231-4. [DOI] [Google Scholar]
  • 4.Zeng Z., Lian L., Sammut K., He F., Tang Y., Lammas A. A survey on path planning for persistent autonomy of autonomous underwater vehicles. Ocean Eng. 2015;110:303–313. doi: 10.1016/j.oceaneng.2015.10.007. [DOI] [Google Scholar]
  • 5.Enrico P., Fabio L., Marco C. High Accuracy Buoyancy for Underwater Gliders: The Uncertainty in the Depth Control. Sensors. 2019;19:1831. doi: 10.3390/s19081831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mario A., Antonelli G. Experiments on Sampling/Patrolling with two Autonomous Underwater Vehicles. Robot. Auton. Syst. 2015;67:61–71. doi: 10.1016/j.robot.2014.09.030. [DOI] [Google Scholar]
  • 7.Petritoli E., Leccese F. High accuracy attitude and navigation system for an autonomous underwater vehicle (AUV) Acta Imeko. 2018;7:3–9. doi: 10.21014/acta_imeko.v7i2.535. [DOI] [Google Scholar]
  • 8.Park B.S. Adaptive formation control of underactuated autonomous underwater vehicles. Ocean Eng. 2015;96:1–7. doi: 10.1016/j.oceaneng.2014.12.016. [DOI] [Google Scholar]
  • 9.Zheng H., Wang N., Wu J. Minimizing deep sea data collection delay with autonomous underwater vehicles. J. Parallel Distrib. Comput. 2017;104:99–113. doi: 10.1016/j.jpdc.2017.01.006. [DOI] [Google Scholar]
  • 10.Liang H., Kang F., Li H. UUV formation system modeling and simulation research based on Multi-Agent Interaction Chain. Int. J. Modeling Simul. Sci. Comput. 2015;6:1550019–1–23. doi: 10.1142/S1793962315500191. [DOI] [Google Scholar]
  • 11.Zhang L., Wang T., Zhang F., Xu D. Cooperative Localization for Multi-AUVs Based on GM-PHD Filters and Information Entropy Theory. Sensors. 2017;17:1–16. doi: 10.3390/s17102286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Allotta B., Caiti A., Costanzi R., Corato F.D., Fenucci D., Monni N., Pugi L., Ridolfi A. Cooperative navigation of AUVs via acoustic communication networking: Field experience with the Typhoon vehicles. Auton. Robot. 2016;40:1229–1244. doi: 10.1007/s10514-016-9594-9. [DOI] [Google Scholar]
  • 13.Walls J.M., Eustice R.M. An origin state method for communication constrained cooperative localization with robustness to packet loss. Int. J. Robot. Res. 2014;33:1191–1208. doi: 10.1177/0278364914532390. [DOI] [Google Scholar]
  • 14.Khan J.U., Cho H.-S. Data-Gathering Scheme Using AUVs in Large-Scale Underwater Sensor Networks: A Multihop Approach. Sensors. 2016;16:1626. doi: 10.3390/s16101626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Yan Z., Wu Y., Du X., Li J. Limited Communication Consensus Control of Leader-Following Multi-UUVs in a Swarm System Under Multi-Independent Switching Topologies and Time Delay. IEEE Access. 2018;5:33183–33200. doi: 10.1109/ACCESS.2018.2844817. [DOI] [Google Scholar]
  • 16.Meng T., Shi H., Wang Y. A MAC protocol for underwater sensor networks; Proceedings of the 8th International Conference on Electronic Measurement and Instruments; Xi’an, China. 16–18 August 2007. [Google Scholar]
  • 17.Bharamagoudra M.R., Manvi S.S., Gonen B. Event driven energy depth and channel aware routing for underwater acoustic sensor networks: Agent oriented clustering based approach. Comput. Electr. Eng. 2017;58:1–19. doi: 10.1016/j.compeleceng.2017.01.004. [DOI] [Google Scholar]
  • 18.Brignone L., Alves J., Opderbecke J. GREX sea trials: First experiences in multiple underwater vehicle coordination based on acoustic communication; Proceedings of the OCEANS 2009-EUROPE; Bremen, Germany. 11–14 May 2009; pp. 1–6. [Google Scholar]
  • 19.Guo X., Frater M.R., Ryan M.J. Design of a propagation-delay-tolerant MAC protocol for underwater acoustic sensor networks. IEEE J. Ocean. Eng. 2009;34:170–180. [Google Scholar]
  • 20.Kim S., Yoo Y. Impact of MAC Delay on AUV Localization: Underwater Localization Based on Hyperbolic Frequency Modulation Signal. Sensors. 2018;18:356. doi: 10.3390/s18020356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Guerra R.E.H., Quiza R., Villalongda A., Arenas J., Castaño F. Digital Twin-Based Optimization for Ultraprecision Motion Systems with Backlash and Friction. IEEE Access. 2019 doi: 10.1109/ACCESS.2019.2928141. [DOI] [Google Scholar]
  • 22.Beruvides G., Juanes C., Castaño F., Guerra R.E.H. A self-learning strategy for artificial cognitive control systems; Proceedings of the 2015 IEEE International Conference on Industrial Informatics; Cambridge, UK. 22–24 July 2015; pp. 1180–1185. [Google Scholar]
  • 23.Qi X. Adaptive coordinated tracking control of multiple autonomous underwater vehicles. Ocean Eng. 2014;91:84–90. doi: 10.1016/j.oceaneng.2014.08.019. [DOI] [Google Scholar]
  • 24.Abichandani P., Torabi S., Basu S., Benson H. Mixed Integer Nonlinear Programming Framework for Fixed Path Coordination of Multiple Underwater Vehicles under Acoustic Communication Constraints. IEEE J. Ocean. Eng. 2015;40:864–873. doi: 10.1109/JOE.2014.2379131. [DOI] [Google Scholar]
  • 25.Lin Y., Hsiung J., Piersall R., White C., Lowe C.G., Clark C.M. A Multi-Autonomous Underwater Vehicle System for Autonomous Tracking of Marine Life. J. Field Robot. 2017;34:757–774. doi: 10.1002/rob.21668. [DOI] [Google Scholar]
  • 26.Chen M., Zhu D. A Novel Cooperative Hunting Algorithm for Inhomogeneous Multiple Autonomous Underwater Vehicles. IEEE Access. 2018;6:7818–7828. doi: 10.1109/ACCESS.2018.2801857. [DOI] [Google Scholar]
  • 27.Farina M., Perizzato A., Scattolini R. Application of distributed predictive control to motion and coordination problems for unicycle autonomous robots. Robot. Auton. Syst. 2015;72:248–260. doi: 10.1016/j.robot.2015.06.007. [DOI] [Google Scholar]
  • 28.Li Y., Cui R., Li Z., Xu D. Neural Network Approximation Based Near-Optimal Motion Planning with Kinodynamic Constraints Using RRT. IEEE Trans. Ind. Electron. 2018;65:8718–8729. doi: 10.1109/TIE.2018.2816000. [DOI] [Google Scholar]
  • 29.Liu Y., Jia Y. An iterative learning approach to formation control of multi-agent systems. Syst. Control Lett. 2012;61:148–154. doi: 10.1016/j.sysconle.2011.10.011. [DOI] [Google Scholar]
  • 30.Cao X., Sun H., Jan G.E. Multi-AUV cooperative target search and tracking in unknown underwater environment. Ocean Eng. 2018;150:1–11. doi: 10.1016/j.oceaneng.2017.12.037. [DOI] [Google Scholar]
  • 31.Wang H., Liu K., Li S. Command filter based globally stable adaptive neural control for cooperative path following of multiple underactuated autonomous underwater vehicles with partial knowledge of the reference speed. Neurocomputing. 2018;275:1478–1489. doi: 10.1016/j.neucom.2017.09.095. [DOI] [Google Scholar]
  • 32.Yuan C., Licht S., He H. Formation Learning Control of Multiple Autonomous Underwater Vehicles with Heterogeneous Nonlinear Uncertain Dynamics. IEEE Trans. Cybern. 2018;48:2920–2934. doi: 10.1109/TCYB.2017.2752458. [DOI] [PubMed] [Google Scholar]

Articles from Sensors (Basel, Switzerland) are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES