Bio-inspired self-organized cooperative control consensus for crowded UUV swarm based on adaptive dynamic interaction topology

Liang, Hongtao; Fu, Yanfang; Gao, Jie

doi:10.1007/s10489-020-02104-5

Bio-inspired self-organized cooperative control consensus for crowded UUV swarm based on adaptive dynamic interaction topology

Published: 05 January 2021

Volume 51, pages 4664–4681, (2021)
Cite this article

Download PDF

Applied Intelligence Aims and scope Submit manuscript

Bio-inspired self-organized cooperative control consensus for crowded UUV swarm based on adaptive dynamic interaction topology

Download PDF

Hongtao Liang¹,
Yanfang Fu² &
Jie Gao¹

787 Accesses
19 Citations
Explore all metrics

Abstract

Cooperative control is currently a challenging topic of crowded unmanned underwater vehicle (UUV) swarm. However, individual behavior conflict and chain-avalanche collision involved in this swarm are easily triggered due to the fluctuations and disturbances. In order to address the two problems, a bio-inspired self-organized cooperative control consensus derived from adaptive dynamic interaction topology is investigated in this paper. Firstly, a novel following-interaction framework incorporating the topological interaction and visual interaction is devised to ensure the minimum number and optimal distribution for neighborhoods. Then, an adaptive dynamic computing model inspired by single-nearest-neighbor following and weighted- multiple-nearest-neighbors following is proposed to steer a sensitive following behavior, in which the influence of each individual on this following behavior is described by a nonlinear weight. Finally, a distributed control protocol is put forward by using the proposed following model and mathematics-based potential fields to achieve the cohesive flocking and avoiding collision, and its sufficient conditions is proven by Laypunov and LaSalle invariance principle to accomplish a self- organized cooperative control. Simulation results are presented for illustrating the feasibility and effectiveness of our proposed control approach.

Distributed Formation Control of Autonomous Underwater Vehicles Based on Flocking and Consensus Algorithms

Bio-Inspired Formation Control for UUVs Swarm Based on Social Force Model

Optimal Formation of UUV Groups Based on Shape Theory and Improved Ant Colony Algorithm Under Communication Delay

1 Introduction

Unmanned underwater vehicle (UUV) is an imperative vehicle in oceanic engineering over the past decades, which has a wide application for monitoring, exploration and surveillance, especially in hazardous environment [1,2,3,4,5]. However, due to the limited robustness and adaptability of existing rigid-formation control methods for more complex missions [3], a crowded UUV swarm composed of a large number of homogeneous/heterogeneous and low-cost submersible intelligent robots, has been aroused more compelling interest. However, the cooperative control for this swarm is a challenge because individual behavior conflict and chain avalanche collision derived from unknown fluctuations and disturbances are easily triggered [5].

How to overcome the conflict and collision involved in the crowded UUV swarm, increasing attentions have been paid to self-organized cooperative control [6, 7]. The essence of this control approach in swarm represents a coordination of a group agents to generate and maintain a pattern in a self-organized way. Up to now, numerous studies on self-organized cooperative control have been developed, all of which can be divided into centralized and distributed control manners [7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28]. The centralized manner generally adopts a top-down modeling, where an upper controller is employed to perform the global mission. However, this scheme is vulnerable and unavailable in real applications [8]. Conversely, the distributed manner formed by down-top modeling is attracted more attentions, when a large number of agents are involved. More recently, various approaches in distributed manner have been proposed, such as leader-following [9], virtual structure [10, 11], artificial potential field [12, 13], and consensus control [14, 15].

Leader-following, as a distinguished pattern, is essentially a predefined distributed pattern to achieve a swarm control due to its simplicity and reliability, in which all following agents track a real or desired leader to maintain a specific orientation and distance among agents. For example, Wang et al. [9] proposed a leader-following formation based on sliding mode control for autonomous underwater vehicles (AUVs). Despite existing significant theoretical formalization, this scheme has a fatal limitation that the formation pattern is rigid.

In order to tackle the above weakness of leader-following scheme, the virtual structure approach is developed, where the swarm is regarded as a tight or loose entity and its desired position is assigned to a structure. Basant et al. [10] presented a formulation of cooperative control for a group of AUVs, in which a virtual flocking center was predicted by consensus protocol. Yang et al. [11] proposed a novel formation control based on Jacobi shape for multi-AUVs, where the geometric shape was designed to track a desired trajectory. However, a structure generated in this method is unavailable for avoiding collision, which may take more sensing and computing costs due to the adverse robustness.

To overcome the drawback of strict geometric relationship in both leader-following and virtual structure, the artificial potential field is provided, where each agent moves according to a gradient direction of potential field generated by total sum of all virtual attractive and repulsive forces. Pan et al. [12] designed a distributed formation control based artificial potential field to accomplish an AUV formation. In addition, Zhu et al. [13] designed a self-organized artificial potential filed formation control to avoid obstacles and replan paths for UUV formation. Although the aforementioned works are effective, there exist two common limitations, i.e., local minimum and deadlock phenomenon, when one obtains multiple attractive and repulsive forces, so adaptability is relatively weak.

Consensus control method has been widely utilized in cooperative control fields, where agents make use of information exchange to reach a common consensus on velocity or orientation for a cohesive swarm. Hu et al. [14] stated a consensus for multi-agents with antagonistic interactions and communication noises. Cai et al. [15] investigated a leader-following output consensus for discrete-time multi-agent systems with uncertainties. Apparently, these consensus-based methods are all based on an ideal communication, and the influence of each individual on formation pattern is ignored or simplified.

Unfortunately, some above-mentioned techniques on distributed control are only available for desired environment and lack the self-organized cooperative behavior. In fact, the UUV is always exposed to the disturbed environment, and its sensing and communicating are limited. Therefore, developing high-performance cooperative control is a major challenging task in real application. Behavior-driven methods [7, 16,17,18,19,20,21,22,23,24,25,26,27,28] inspired by collective behaviors in nature, such as a school of fish, a flock of birds, a herd of sheep and a swarm of ants, have received considerable attentions in the field of cooperative control. Two existing modeling styles, swarm-based macroscopic mode and individual-based microscopic mode, are employed to design a behavior-driven method. The former study works on an entire larger-scale swarm while ignoring all underlying interactions. Conversely, the latter focuses on how to describe the cluster motion via individual interaction. Due to intuitive and appealing interaction potentials, the latter has gained increasing interest in cooperative control.

As one landmark work, Reynolds et al. [17] primarily introduced a distinguished swarm model, so-called Boids, in which three behavioral rules, attraction, repulsion and alignment, were devised to establish a flock. Based on this, various models such as Vicsek [18], Couzin [20], and social force model [21], have been proposed for swarm coordination. The importance of aforementioned works is that it successfully shows the behavioral emergence of a group only influenced by its neighbors and environment [19]. However, most previous works devote on the velocity-average mechanism to achieve rendezvous, cohesion and consensus, in which more heterogeneous characteristics are ignored in local interaction.

In recent past decade, with a rapidly technical development of target tracking, image processing, and data analysis, more hidden interactions are excavated such as topological interaction [22], single-neighbor following [23], multiple-nearest-neighbors following [24], visual perception and attention [25]. The above interactions not only reveal the inherent mechanism of behavioral emergence, but also provide some solutions for self-organizing cooperative control. Duan et al. [26] designed a hierarchical network with behavior learning mechanism inspired by single-neighbor following in pigeon flocking, thereby achieving an unmanned aerial vehicle formation control. Liang et al. [27] presented a behavior-driven cooperative control strategy, in which multiple neighbors can form an immune network to achieve an intelligent swarm of UUVs. Yang et al. [28] proposed a control strategy for time-delay self-organized fission behavior of flocking system. Although most above-mentioned works can achieve a cooperative control, the lower synchronous velocity still exists and the robustness against disturbances is terrible, especially under the unpredictable ocean, which may easily trigger conflicts and collisions.

Motivated by above observations, this paper considers the cooperative control problem for crowded UUV swarm. A novel bio-inspired self-organized cooperative control approach inspired by intelligent perception and interactive computing from various biological clusters is creatively proposed. Firstly, a following interaction framework is designed to ensure the minimum number and optimal distribution of the neighborhood via the topological interaction and visual interaction. Then, a novel computing model involved with single-nearest-neighbor following (SNNF) and weighted multiple-nearest-neighbors following (WMNNF) is proposed to achieve an adaptive and dynamic following process, and the influence of individual on the sensitive behavior is described by a nonlinear weight, which is essentially a non-average velocity mechanism unlike the traditional average-mechanism. Moreover, a cooperative control protocol based on the proposed following model and potential field, is designed to achieve a self-organized flocking for crowded UUV swarm. Simulation results demonstrate that our approach can accomplish the cooperative control with a higher effectiveness and robustness in comparison to the existing methods. The main contributions of this paper are summarized as follows:

(i)
A bio-inspired self-organized cooperative control consensus derived from adaptive dynamic interaction topology is firstly proposed for the crowded UUV swarm, by elaborately configuring the latest intelligent perceptions and interactive computing mechanisms.
(ii)
An adaptive dynamic interaction topology is creatively proposed to steer interactions via the minimum number and optimal distribution of neighborhoods, which is essentially a non- average mechanism considering unlike the previous average-mechanism [17,18,19,20,21].
(iii)
A self-organized cooperative control protocol is designed by using the proposed following model and mathematics-based potential fields, which can effectively achieve the swarm cohesion and avoiding collision to solve the individual behavior conflict and chain-avalanche collision.
(iv)
In formulating the proposed approach, three quantitative indexes, such as mean heading, orderness parameter, and scale parameter, are devised to prove that the robustness and efficiency of our approach are better than that of the existing approaches.

The remaining of this paper is organized as follows. Section 2 describes preliminary knowledge and problem statement. Section 3 presents an adaptive dynamic interaction topology. Section 4 designs a self-organized cooperative control protocol. Section 5 performs all simulations. Finally, a conclusion is given in Section 6.

2 Preliminary knowledge and problem statement

2.1 Local interaction

It has been revealed that the local interaction can describe the collective behavior of biological cluster in [19,20,21,22,23,24,25,26,27,28]. In the formulating the local interaction, how to describe an individual action influenced by effects from its neighbors is a key topic. Up to now, the topological interaction, visual interaction, single-nearest-neighbor synergy and multiple-nearest-neighbors synergy have been investigated by bio-inspired scientists in physics and engineering [22,23,24,25].

For topological interaction, it was firstly discovered in analyzing the flight data of starlings [29], where each individual in the group only interacts with the nearest 6 or 8 neighbors. This study subverts a traditional definition of neighbor that an individual can interact with all neighbors who is within a fixed distance as shown in Fig. 1a. The reason for this difference is that a specific number of neighbors is subject to the cortical elaboration of prenumeric ability, rather than individual’s perception ability. It is reported that an interaction network formed by topological interaction, as shown in Fig. 1b, has a trade-off between perceived cost and group robustness [30].

For visual interaction, it is implied that the visual perception and visual attention can provide a solution to depict individual interactions in biological cluster. (i) The visual perception can sense all neighbors around within the sensing range, which is based on limited view (blindness angles). It is intuitively expected to improve synchronization motion as shown in Fig. 2a. (ii) The visual attention is an important psychological mechanism to select the most relevant information from limited view [25], i.e., it can discard a large amount of background information to achieve an efficient screening, as illustrated in Fig. 2b. Hence, some metaphorical mechanisms inspired by visual interaction can be useful to guide the cohesion and avoidance [31].

For single-nearest-neighbor synergy, it states that the individual only tracks a single neighbor. For example, a school of fishes is dominated by an intermittent pairwise interaction between the individual and its nearest neighbor [21]. Similarly. Herbert et al. [32] used a neural network to analyze the tracking behavior in shoaling fish, revealed that each individual actually responded only to the nearest neighbor. In addition, a single-nearest-neighbor synergy based on hierarchical relationship was ensconced in pigeon-Inspired optimization [33]. Clearly, a distinct advantage of this mechanism is smaller computing cost, which is conducive to improving the robustness and sensitiveness.

For multiple-nearest-neighbors synergy, its essence is that an individual can synthesize all influences from multiple-nearest neighbors to make decision for its behavior. Due to its robustness for collective behaviors, several popular models, such as Boids, Vicsek and Couzin [19], all adopt this synergy. However, existing works are all based on velocity-average mechanism, in which accompanying properties, i.e., sensing, communicating, influencing and decision-making abilities, are ignored in this procedure. Actually, these features play an important role in achieving a collective behavior. It was reported that influences from multiple-nearest neighbors have nonlinear characteristic, rather than a simple linear superposition [34]. Thus, the non-average mechanism involved in multiple-nearest-neighbors synergy is a worthy of discussing in flocking control.

2.2 Graphs theory

It is convenient to model the neighbor interactions between agents by an undirected or directed graph [35], where an individual UUV is regarded as a node and the interconnection topology in UUV swarm can be described as a graph.

Suppose that a graph G(V, E, A) consists of a node set V = {v₁, v₂, ⋯, v_n} and a edge setE ⊆ V × V, in which each edge is a pair of vertices (v_i, v_j) such as i ≠ j. If (v_i, v_j) ∈ E, then one usually says that i and j are adjacent.A = [a_ij] ∈ ℝ^n × n is defined as adjacent matrix which is an integer matrix with rows and columns indexed by vertices, such that (v_i, v_j) ∈ E is equal to a_ij = 1, else a_ij = 0. The Laplacian matrix L = [l_ij] ∈ ℝ^n × n is denoted by L = D-A, where D = [d_ij] is a diagonal matrix with $ {d}_{ij}={\sum}_{j=1}^n{a}_{ij} $. The neighbor set for agent i is denoted as N_i = {v_j ∈ V| (v_i, v_j) ∈ E}.

Remark 1:

If (v_i, v_j) ∈ E ⇔ (v_j, v_i) ∈ E, G is an undirected graph, else it is called a directed graph. Furthermore, G is strongly connected if there is a directed path from (v_i, v_j) and (v_j, v_i)between any pair of distinct nodes v_i and v_j. A spanning tree of a directed graph is a directed tree formed by graph edges that connect all nodes of the graph.

Lemma 1:

For a connected graph G, its L is symmetrical and positive semidefinite, and all eigenvalues are non-negative real number which denoted by λ_i ∈ ℂ with an ascending order in magnitude, i.e.,0 = λ₁(L) ≤ λ₂(L) ≤ ⋯ ≤ λ_n(L).

2.3 Problem statement

Consider a group of N agents in the crowded UUV swarm, labeled 1,..., N, moving in horizontal plane. For agent i, its kinematic and dynamic with six-Degree of Freedom can be given by

$$ \Big\{{\displaystyle \begin{array}{l}{\dot{\eta}}_i=J\left({\psi}_i\right){v}_i\\ {}{M}_i{\dot{v}}_i+{C}_i\left({v}_i\right){v}_i+{D}_i\left({v}_i\right){v}_i+{g}_i\left({\eta}_i\right)={\tau}_i+{w}_i\end{array}} $$

(1)

where η_i = [x_i, y_i, ψ_i]^T ∈ ℝ³ is the positions and yaw angle in earth-fixed frame, respectively. v_i = [u_i, v_i, r_i]^T ∈ ℝ³ is the velocity vector in body-fixed frame, where u_i is a surge velocity, v_i is a sway velocity, and r_i is an angular yaw velocity.τ_i = [τ_iu, 0, τ_ir]^T ∈ ℝ³ is a control input vector, and g_i = [g_iu, g_iv, g_ir]^T ∈ ℝ³ denotes the generalized gravitational and buoyancy forces, and w_i = [w_iu, w_iv, w_ir]^T ∈ ℝ³ denotes total unknown external disturbances caused by winds, waves and ocean currents. M_i, C_i(v_i) and D_i(v_i) are the inertia matrix, Coriolis matrix and damping matrix, respectively. TheJ_i(ψ_i)is a transformation matrix given by

$$ {J}_i\left({\psi}_i\right)=\left[\begin{array}{ccc}\cos {\psi}_i& -\sin {\psi}_i& 0\\ {}\sin {\psi}_i& \cos {\psi}_i& 0\\ {}0& 0& 1\end{array}\right] $$

(2)

Due to the hydrodynamic performance induced by the assumption of symmetry in plane and vertical directions, g_i = [g_iu, g_iv, g_ir]^T = 0. Moreover, the technique of state feedback linearization is utilized to simplify the kinematic and dynamic model into a general double-integrator model [36].

Specifically, the nonlinear model (1) can be written as

(3)

where N(v_i)v_i = C_i(v_i)v_i + D_i(v_i)v_i,G(ζ) denotes the hydrodynamic coefficients, and denotes the sum of forces and rudder angles.

Defining ξ_i = [η^T, v^T]^T and h(ξ_i) = η, a standard nonlinearization function can be obtained from (3) as follows

(4)

where is the input of this nonlinear system.

Then, a new transformed coordination with p_i = [h_i(ξ)]^T ∈ ℝ³ and q_i = [∂_f h_i (ξ)]^T ∈ ℝ³ is defined by derivative of h(ξ_i), and its corresponding control input can be defined as

(5)

where $ B\left({\xi}_i\right)={\left[{\partial}_f^2{h}_i\left(\xi \right)\right]}^{\mathrm{T}} $, and Γ(ξ_i) = [∂_g∂_fh_i(ξ_i)] ∈ ℝ^3 × 3.

Thus, the motion model (1) can be represented by a second-order integrator model by (3) and (4), as follows

$$ \Big\{{\displaystyle \begin{array}{l}{\dot{p}}_i={q}_i\\ {}{\dot{q}}_i={u}_i\end{array}}\kern1.5em \left(i=1,\mathrm{2...},N\right) $$

(6)

where p_i ∈ ℝ³ and q_i ∈ ℝ³ are the position and velocity variables of agent i, respectively, and u_i is the control input.

In addition, an underlying interaction topology among agents may dynamically change, which can be described by neighbor set in a connected graph, as follows

$$ {N}_i(t)=\left\{j|{d}_{ij}(t)\le {R}_i,j\in \left\{1,\cdots, N\right\},j\ne i\right\} $$

(7)

where R_i is the sensing radius of agent i, and d_ij(t) = ‖p_i − p_j‖ is an Euclidean distance between agent i and agent j.

Remark 2:

The neighbor graph is essentially a distance-dependent and time-varying topology, there are two reasons given by (i) each agent has a nonuniform sensing strength due to the factors such as limited perception range and narrow motion-space in scale effects, and (ii) all agents are influenced by external disturbances composed of information loss, delayed communication and noises.

Assumption 1:

The velocity is bounded to prevent the collisions caused by motion inertia, which is given by

$$ {q}_i=\Big\{{\displaystyle \begin{array}{l}{q}_i,\kern3.5em \left\Vert {q}_i\right\Vert \le {V}_{\mathrm{max}}\\ {}{V}_{\mathrm{max}}\frac{q_i}{\left\Vert {q}_i\right\Vert },\kern0.75em \left\Vert {q}_i\right\Vert >{V}_{\mathrm{max}}\end{array}} $$

(8)

where V_max is the maximum velocity.

In this paper, our objective is to design a self-organized cooperative control that can guarantee that the crowded UUV swarm can achieve the cohesive flocking and collision avoiding without any splits, under the assumption that the initial network is a connected graph. In particular, the objectives of this paper are formulated as

$$ \underset{t\to \infty }{\lim}\left\Vert {p}_i(t)-{p}_j(t)\right\Vert \le R,\kern0.75em \left(i,j\in N\right) $$

(9)

$$ \underset{t\to \infty }{\lim}\left\Vert {p}_i(t)-{p}_j(t)\right\Vert \ge D,\kern0.75em \left(i,j\in N\right) $$

(10)

$$ \underset{t\to \infty }{\lim}\left\Vert {q}_i(t)-{q}_j(t)\right\Vert \to 0,\kern1em \left(i,j\in N\right) $$

(11)

where R denotes the sensing radius, and D denotes the minimum distance for inter-collision avoidance.

Remark 3:

The objective (9) can preserve the connectivity with any splittings, and objective (10) can guarantee the collision avoidance among all agents, and the objective (11) can reach a velocity consensus for all agents.

3 Adaptive dynamic interaction topology

Based on above-mentioned preliminaries, an adaptive dynamic interaction topology is proposed in this section, and the related framework and model are briefly demonstrated.

3.1 Following-interaction framework

The following-interaction framework composed of three key steps are devised by integrating four bio-inspired interaction mechanisms, as illustrated in Fig. 3.

Step1: The topological interaction is firstly employed to select all potential neighbors with a reasonably prescribed number, unlike the previous local interactions depend on an aprioristic assumption [29]. Therefore, topological interaction can be employed to shrink the number of neighbors in comparison to the traditional metric-distance interaction.

Step2: The visual interaction is devised to ensure the minimum number and optimal distribution of the neighborhood within local interaction. It consists two parts: (i) Due to the restrictions of the equipped sonars, the sensing field of UUV is limited. If this limited field is reasonably considered, it is an optimal strategy that the neighbor who is located in blindness zone is excluded to accomplish the local interaction. (ii) Due to the crowded phenomena existed in topological interaction, each individual has multiple nearest neighbors in a certain direction-interval. If one nearest neighbor who is located in each direction-interval, is selected to form a new neighborhood for each individual. Therefore, this step can further decrease the number and optimize the distribution of neighborhood after topological interaction.

Step3: The local following is essentially a non-average making-decision strategy integrated by single-nearest-neighbor and multiple-nearest-neighbors, rather than an average-velocity mechanism that is commonly assumed to design consensus or protocols. If the control parameter is smaller than an empirical triggered threshold, the SNNF originated from single-nearest-neighbor synergy is adopted to achieve a directed transmission of information, else the weighted-MNNF (WMNNF) originated from multiple-nearest-neighbors is proposed to synthesize all individual differences, in which a weight of each agent is determined by its corresponding interaction control parameter. In this step, advantages of the two following models are formulated in an unified framework, which can provide an adaptive making-decision to address behavior conflict and chain collision.

3.2 Adaptive dynamic computing model

In terms of the proposed following-interaction framework, its corresponding computing model is developed in this subsection. Consider a group of N agents in the crowded UUV swarm, each individual has an omnidirectional view field with sensing radius R and blindness sectorω. The specific procedure is demonstrated as follows.

(i) Consider an omnidirectional view field of individual i, its neighbors setN_iat time t is determined by

$$ {N}_i\left(t,\varphi \right)=\left\{j|{d}_{ij}\le {R}_i,j\in \left\{1,\cdots, N\right\},j\ne i\right\} $$

(12)

where ϕ denotes the view angle, and N_i < N.

Remark 4:

Eq. (12) is essentially a fixed distance interaction determined by sensing radius, and most of existing models all adopt this to decide the interaction intensity [17,18,19,20,21].

(ii) The topological interaction is employed to select a fixed number with κ-nearest neighbors from N_i, determined by

$$ {N}_i^1\left(t,\varphi, \kappa \right)=\underset{j\in {N}_i}{\mathrm{argmin}}\left\{j|{d}_{ij}(t)\right\} $$

(13)

where $ {N}_i^1<{N}_i $.

Remark 5:

Eq. (13) can decreases the number of neighbors in comparison to the fixed-distance interactions, such as Boids, Vicsek, and Couzin [19].

(iii) The blindness angel ω is considered to improve the speed of information transmission via decreasing the omnidirectional view, which is consistent with the blindness sector caused by the restricted physical size [27]. Assuming the UUV is symmetrical about UUV heading, as shown in Fig. 4. Such that, the neighbor marked with the purple circle, who is located in the blindness sector, is excluded from interacting with others.

Hence, a new neighbor set $ {N}_i^2 $ with restricted blindness sector ω is determined by

$$ {N}_i^2\left(t,\varphi, \kappa, \omega \right)=\left\{j|{d}_{ij}(t)\wedge {\varphi}_i-{\omega}_i,j\in {N}_i^1\ \right\} $$

(14)

where $ {N}_i^2<{N}_i^1 $.

(iv) The individual i focuses on correlating and transmission of abnormal behaviors by utilizing the visual attention [37, 38], of which essence is that φ_i − ω_i is divided intom_c sectors along clockwise direction in the heading of agent i, then the nearest individual in each sub-sector is chosen, as shown in Fig. 5.

In particular, the view angle of each sub-sector of individual i is calculated as

$$ {\xi}_{im}=\frac{\varphi_i-{\omega}_i}{m_c}\kern1em \left(m=1,2,\cdots, {m}_c\right) $$

(15)

Then, the neighbor set $ {\varTheta}_i^m $ in m-th sub-sector is determined by

$$ {\varTheta}_i^m(t)=\left\{j|{\xi}_{im}\left(m-1\right)\le {\theta}_{ij}-{\vartheta}_i\le {\xi}_{im}m\right\} $$

(16)

where ϑ_i is the heading of individual i, and θ_ij is the azimuth of individual j relative to individual i. Such that the nearest neighbor in each sub-sector $ {\ell}_i^m $ is chosen as

$$ {\ell}_i^m(t)=\left\{j|j=\underset{j\in {\varTheta}_i^m}{\arg\ \min }{d}_{ij},m=1,2,\cdots, {m}_c\right\} $$

(17)

Finally, all the nearest individuals can generate a new neighbor set $ {N}_i^3 $, which is given by

$$ {N}_i^3\left(t,\varphi, \kappa, \omega, {m}_c\right)=\left\{{\ell}_i^m(t),m=1,2,\cdots, {m}_c\right\} $$

(18)

where $ {N}_i^3<{N}_i^2 $.

Remark 6:

Eqs. (15) ~ (18) are essentially an optimal strategy to select the all candidate neighbors from $ {N}_i^2 $. The reasonable number of sub-sectors and the choosing operation in each sub-sector can guarantee the only one agent existed in each sub-sector, which can address the crowded problem, and further avoid the behavior conflict and chain collision via an optimal distribution in $ {N}_i^3 $.

(v) Based on $ {N}_i^3 $, an interaction control parameter δ_ij(t) is calculated by a synergy of position p_i, velocity q_i and heading ϑ_i to quantify the interaction intensity between individuals i and j [39]:

$$ {\delta}_{ij}(t)=\frac{\alpha_{ij}}{1+\left\Vert {p}_i-{p}_j\right\Vert}\times \frac{\beta_{ij}\left\Vert \left({q}_i-{q}_j\right)\times {\overline{q}}_i\right\Vert }{\sum_{j\in {N}_{i3}}\left\Vert \left({q}_i-{q}_j\right)\times {\overline{q}}_i\right\Vert}\times \frac{\chi_{ij}}{1+\left\Vert {\vartheta}_i-{\vartheta}_j\right\Vert } $$

(19)

where α_ij, β_ijand χ_ij denote the coefficients corresponding with position, velocity and heading, respectively. And, $ {\overline{q}}_i $ is an average-velocity of neighbor set $ {N}_i^3(t) $.

Remark 7:

The δ_ij(t) is related to various factors such as relative distance, speed and heading between individuals, as well as the number of neighbors and spatial distribution. Furthermore, in the evolving process, swarm graph is dynamic, hence δ_ij(t) exhibits a time-varying characteristic.

(vi) The SNNF and WMNNF by virtue of the proposed interaction control parameter δ_ij(t) are to design an adaptive following interaction model $ {u}_i^f(t) $, which is given by

$$ {u}_i^f(t)=\Big\{{\displaystyle \begin{array}{l}{\sum}_{j\in {N}_i^3}{w}_j{u}_j\left(t-\varDelta t\right),\kern1.5em {\delta}_{ij}<{\varDelta}_i\\ {}{u}_j\left(t-\varDelta t\right),\kern5em {\delta}_{ij}\ge {\varDelta}_i\kern0.5em \end{array}} $$

(20)

where Δ_i is an empirical parameter, Δt denotes a sampling period, and w_j represents a weight with control law $ {u}_i^f(t) $ of individual j. The w_j is given by

$$ {w}_j(t)=\frac{u_j(t)}{\sum \limits_{j\in {N}_i^3}{u}_j(t)} $$

(21)

where $ \sum \limits_{j\in {N}_i^3}{w}_j=1 $, and w_j(t) ≥ 0.

Remark 8:

If existing δ_ij < Δ_i, individual i adopts a WMNNF pattern. Conversely, if δ_ij ≥ Δ_i, it adopts a SNNF pattern to follow $ j=\underset{j\in {N}_i^3}{\mathrm{argmax}}\left\{{\delta}_{ij}|{\delta}_{ij}\ge {\varDelta}_i\right\} $ with the largest intensity.

4 Self-organized cooperative control protocol

In this section, a novel self-organized cooperative control protocol is proposed, and its stability analysis is given to illustrate the feasibility and effectiveness.

4.1 Control protocol

The self-organized cooperative control protocol is derived from the proposed adaptive dynamic interaction topology and mathematics-based potential fields. In this approach, each UUV applies a control protocol that consists of four terms

$$ {u}_i(t)={u}_i^p+{u}_i^q+{u}_i^f+{u}_i^{dis},i=1,2,...,N $$

(22)

where $ {u}_i^p $ is the position coordination term, and $ {u}_i^q $ is the velocity consensus term, and $ {u}_i^f $ is the proposed following interaction term, and$ {u}_i^{dis} $ is the disturbance term.

(i)
$ {u}_i^p $ is utilized to adjust the positions between individual i and its neighbors in $ {N}_i^3 $, which embodies the both attraction and repulsion rules [17]. It is given by the gradient-based potential fields as follows

$$ {u}_i^p(t)=\sum \limits_{j\in {N}_i^3}{\psi}_{\alpha}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right){\sigma}_{\varepsilon}\left\Vert {p}_i-{p}_j\right\Vert +\sum \limits_{j\in {N}_i^3}{\psi}_{\beta}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right){\sigma}_{\varepsilon}\left\Vert {p}_i-{p}_j\right\Vert $$

(23)

where, ψ_α and ψ_β are the pairwise smooth attractive and repulsive potential functions, and ‖·‖_σ is a σ-norm of a vector p_i − p_j, which is defined as

$$ {\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma }=\left(1/\varepsilon \right)\left[\sqrt{1+\varepsilon {\left\Vert {p}_i-{p}_j\right\Vert}^2}-1\right] $$

(24)

where ε ∈ (0, 1) is a fixed parameter. The gradient of ‖p_i − p_j‖_σ defined by σ_ε‖p_i − p_j‖ is given by

$$ {\sigma}_{\varepsilon}\left\Vert {p}_i-{p}_j\right\Vert =\nabla {\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma }=\frac{\left\Vert {p}_i-{p}_j\right\Vert }{\sqrt{1+\varepsilon {\left\Vert {p}_i-{p}_j\right\Vert}^2}}=\frac{\left\Vert {p}_i-{p}_j\right\Vert }{1+\varepsilon {\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma }} $$

(25)

where ∇ denotes a gradient operator.

Due to the limited forces provided by actuators, both ψ_α and ψ_β triggered by two continuously bounded active functions Θ_α and Θ_β, are designed to generate repulsive and attractive forces of all agents. Specifically, the repulsive function and its corresponding active function are given by

$$ {\psi}_{\alpha}\left({d}_{ij}\right)={\int}_{{\left\Vert R\right\Vert}_{\sigma}}^{d_{ij}}{\varTheta}_{\alpha }(s) ds $$

(26)

$$ {\varTheta}_{\alpha}\left({d}_{ij}\right)=\Big\{{\displaystyle \begin{array}{l}-{\varpi}_{\alpha }/{d}_{ij}^2,\kern0.5em {d}_{ij}\in \left[0,{\left\Vert R\right\Vert}_{\sigma}\right)\\ {}0,\kern3em {d}_{ij}\in \left[{\left\Vert R\right\Vert}_{\sigma },+\infty \right)\end{array}} $$

(27)

where ϖ_α > 0 is a designing parameter.

Similarly, the attractive function and its corresponding active function are given by

$$ {\psi}_{\beta}\left({d}_{ij}\right)={\int}_0^{d_{ij}}{\varTheta}_{\beta }(s) ds $$

(28)

$$ {\varTheta}_{\beta}\left({d}_{ij}\right)=\Big\{{\displaystyle \begin{array}{l}\frac{-{\varpi}_{\beta}\left({d}_{ij}-{d}_{\mathrm{min}}\right)\left({d}_{ij}-{\left\Vert R\right\Vert}_{\sigma}\right)}{d_{ij}},\kern0.5em {d}_{ij}\in \left[0,{\left\Vert R\right\Vert}_{\sigma}\right)\\ {}0,\kern10.5em {d}_{ij}\in \left[{\left\Vert R\right\Vert}_{\sigma },+\infty \right)\end{array}} $$

(29)

where ϖ_β > 0 is a designing parameter, and d_min represents the minimum local distance.

Hence, the total potential functions for the crowded UUV swarm are given by

$$ V\left(p,q\right)=\frac{1}{2}\sum \limits_i\sum \limits_{j\ne i}{\psi}_{ij}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right) $$

(30)

where ψ_ij = ψ_α + ψ_β.

To simplify the presentation, (23) can be given by

$$ {u}_i^p(t)=\sum \limits_{j\in {N}_i^3}{\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right) $$

(31)

(ii)
$ {u}_i^q $ represents the velocity alignment using consensus-based protocol

$$ {u}_i^q(t)=\sum \limits_{j\in {N}_i^3}{a}_{ij}(q)\left({q}_j-{q}_i\right) $$

(32)

where a_ij > 0 is an element of adjacency matrix.

(iii)
$ {u}_i^f $ represents the proposed following interaction, which is utilized to control individuals tracking the informed agent and make the velocity consensus. Its expression is of form as

$$ {u}_i^f(t)=\Big\{{\displaystyle \begin{array}{l}{\sum}_{j\in {N}_i^3}{w}_j\left({\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {p}_i-{p}_i\right\Vert}_{\sigma}\right)+{\lambda}_1{a}_{ij}\left({q}_i-{q}_j\right)\right),\kern1.5em {\delta}_{ij}<{\varDelta}_i\\ {}{\lambda}_{2i}{\nabla}_{p_i}{\psi}_{ij}\left(\left\Vert {p}_i-{p}_j\right\Vert \right)+{\lambda}_2{a}_{ij}\left({q}_i-{q}_j\right),\kern5em {\delta}_{ij}\ge {\varDelta}_i\kern0.5em \end{array}} $$

(33)

where Δ_i is an empirical parameter determined by

(34)

$$ {\varPhi}_i=\frac{1}{\mid {N}_i\mid +1}\left\Vert \sum \limits_{j\in {N}_i^3\cup i},\frac{q_j}{\left\Vert {q}_j\right\Vert}\right\Vert $$

(35)

where ƛ is positive constant. Moreover, λ_2i and λ₂ are feedback gain coefficients, where λ_2i is given by

$$ {\lambda}_{2i}=\Big\{{\displaystyle \begin{array}{l}{\varUpsilon}_q,\kern10.25em \left\Vert {q}_i\right\Vert \le {V}_{\mathrm{max}}\\ {}\frac{\varUpsilon_q{\varUpsilon}_m}{\left\Vert {q}_i\right\Vert +{\varUpsilon}_m-{V}_{\mathrm{max}}},\kern5.25em \left\Vert {q}_i\right\Vert >{V}_{\mathrm{max}}\kern0.5em \end{array}} $$

(36)

where ϒ_q and ϒ_m are positive constants, and V_max is the maximum velocity given by Eq. (8).

(iv)
$ {u}_i^{dis} $ describes a composing of external disturbances and noises, which is given by

$$ {\dot{u}}_i^{dis}(t)+{T}_d{u}_i^{dis}(t)={K}_d{F}_d $$

(37)

where T_d denotes constant matrix,K_d denotes gain matrix, and F_d denotes the largest amplitude of the white noise. It should be noted that $ {u}_i^{dis} $ is continuous and bounded.

Remark 9:

In control protocol Eq. ( 22 ), the individual characteristics represented by w _j can generate an influence on forming the neighbor set $ {N}_i^3\left(t,\psi, k,\omega, {m}_c\right) $ , which is not similar to the existing models that all individuals are assumed to be an equal state. Thus, the proposed control protocol is a more realistic model.

Remark 10:

In control protocol Eq. (22), each individual in neighbors set is updated within each sampling period, rather than just tracking a fixed target to follow. Therefore, it is essentially different from the leader-following control protocol [9].

Remark 11:

In control protocol Eq. (22), if existing δ_ij < Δ_i,$ {u}_i^f $ is determined by its all neighbors. if existing δ_ij ≥ Δ_i,$ {u}_i^f $ is determined by the individual with largest interaction intensity. Thus, the control protocol in [40] can be regarded as a special case of our proposed control protocol.

Assumption 2:

In the G associated with $ {N}_i^3\left(t,\psi, k,\omega, {m}_c\right) $ , there exists a directed path from the informed agent to any other agent.

4.2 Stability analysis

Theorem 1:

Consider a crowded UUV swarm with dynamic model (1), if the initial graph G(0) is connected and initial state is from the LaSalle invariance principle, and Assumption 2 holds, such that all agents can achieve a self-organized cooperative control consensus under the proposed control protocol (22) ~ (37). Then, following statements hold:

i)
The velocities of all individuals will asymptotically converge to a consensus.
ii)
Almost every final configuration reaches in a local minimum.
iii)
No collisions occur for all t ≥ 0.
iv)
No splittings occur for all t ≥ 0.

Proof

Consider a Lyapunov-like energy function under δ_ij < Δ_i

$$ Q\left(p,q\right)=\frac{1}{2}\sum \limits_{i=1}^N\left(U(p)+{\left({q}_i-{q}_j\right)}^T\left({q}_i-{q}_j\right)\right) $$

(38)

where

$$ U(p)=V\left(p,q\right)+{w}_j\sum \limits_{j={N}_i^3}{\psi}_{ij}\Big({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma } $$

(39)

Substituting (30) into (39), we can obtain

$$ U(p)=\left(1+{w}_j\right)\sum \limits_{j={N}_i^3}{\psi}_{ij}\Big({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma } $$

(40)

Let (p_c, q_c) be the center of mass (COM) of neighbour set, which is given by

$$ {p}_c=\frac{1}{\mathcal{N}}\sum \limits_{i=1}^{\mathcal{N}}{p}_i,{q}_c=\frac{1}{\mathcal{N}}\sum \limits_{i=1}^{\mathcal{N}}{q}_i $$

(41)

where $ \mathcal{N}={N}_i^3(t) $ is of convenience.

Let $ {\tilde{p}}_i={p}_i-{p}_c $ and $ {\tilde{q}}_i={q}_i-{q}_c $ describe the relative positions and velocities between agent i and (p_c, q_c), then $ {p}_i-{p}_j={\tilde{p}}_i-{\tilde{p}}_j={\tilde{p}}_{ij} $,$ {q}_i-{q}_j={\tilde{q}}_i-{\tilde{q}}_j={\tilde{q}}_{ij} $, and $ {u}_i^p(t) $ can be now written as

$$ {u}_i(t)=-\left(1+{w}_j\right)\sum \limits_{j\in \mathcal{N}}{\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)-{a}_{ij}{\tilde{q}}_{ij}-{\lambda}_1{w}_j\sum \limits_{j\in \mathcal{N}}{a}_{ij}{\tilde{q}}_{ij} $$

(42)

Then (38) is rewritten as

$$ Q\left(\tilde{p},\tilde{q}\right)=\frac{1}{2}\sum \limits_{i=1}^{\mathcal{N}}\Big(\left(1+{w}_i\right)\sum \limits_{j\in \mathcal{N}}{\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma }+{{\tilde{q}}_{ij}}^T{\tilde{q}}_{ij}\right) $$

(43)

According to the symmetry of ψ_ij and the symmetric matrices of a_ij, then

$$ \frac{\partial {\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)}{\partial {\tilde{p}}_{ij}}=\frac{\partial {\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)}{\partial {\tilde{p}}_i}=-\frac{\partial {\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)}{\partial {\tilde{p}}_j} $$

(44)

Differentiating (43) gives

$$ \dot{Q}\left(\tilde{p},\tilde{q}\right)=\left(1+{w}_i\right)\sum \limits_{i=1}^{\mathcal{N}}{\nabla}_{{\tilde{p}}_{ij}}{\left(\sum \limits_{j\in \mathcal{N}}{\dot{\psi}}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)\right)}^{\mathrm{T}}{\tilde{q}}_{ij}+{{\tilde{q}}_{ij}}^T{u}_i $$

(45)

Substituting (42) into (45), we have

$$ {\displaystyle \begin{array}{l}\dot{Q}\left(\tilde{p},\tilde{q}\right)=\left(1+{w}_i\right)\sum \limits_{i=1}^{\mathcal{N}}{\nabla}_{{\tilde{p}}_{ij}}{\left(\sum \limits_{j\in \mathcal{N}}{\dot{\psi}}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)\right)}^{\mathrm{T}}{\tilde{q}}_{ij}+\sum \limits_{i=1}^{\mathcal{N}}{{\tilde{q}}_{ij}}^T\left[-\left(1+{w}_j\right)\sum \limits_{j\in \mathcal{N}}{\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)-{a}_{ij}{\tilde{q}}_{ij}-{\lambda}_1{w}_j\sum \limits_{j\in \mathcal{N}}{a}_{ij}{\tilde{q}}_{ij}\right]\\ {}=\sum \limits_{i=1}^{\mathcal{N}}{{\tilde{q}}_{ij}}^T\left(-{a}_{ij}{\tilde{q}}_{ij}-{\lambda}_1{w}_j\sum \limits_{j\in \mathcal{N}}{a}_{ij}{\tilde{q}}_{ij}\right)\\ {}=-{\tilde{q}}^T\left[L\left(\tilde{p}\right)+{a}_{ij}{I}_{\mathcal{N}}\otimes {I}_n\right]{\tilde{q}}^T\end{array}} $$

(46)

where $ \tilde{p}=\mathrm{col}{\left[{p}_1,{p}_2,{p}_{\mathcal{N}}\right]}^T\in {\mathbb{R}}^{\mathcal{N}\times n} $,$ \tilde{q}=\mathrm{col}{\left[{q}_1,{q}_2,{q}_{\mathcal{N}}\right]}^T\in {\mathbb{R}}^{\mathcal{N}\times n} $,⊗ is the Kronecker operator,$ L\left(\tilde{p}\right) $ is the Laplacian matrix of $ G\left(\tilde{p}\right) $, and $ {I}_{\mathcal{N}} $ denotes a $ \mathcal{N} $ dimensional unit vector. Due to the existing a_ij ∈ [0, 1], we can obtain

$$ \dot{Q}\left(\tilde{p},\tilde{q}\right)=-{\tilde{q}}^T\left[L\left(\tilde{p}\right)+{a}_{ij}{I}_{\mathcal{N}}\otimes {I}_n\right]{\tilde{q}}^T\le 0 $$

(47)

Thus, by virtual of Barbalat Lemma, it holds that

$$ \underset{t\to \infty }{\lim}\dot{Q}\left(\tilde{p},\tilde{q}\right)=0 $$

(48)

It is implied that $ Q\left(\tilde{p},\tilde{q}\right) $ is monotonic in (48). Let Ω_c = {(p, q) : Q(p, q) ≤ Q₀} is a compact set, existing Q₀(p₀, q₀) = ψ_σ(0) > 0 is the initial energy. Moreover, it is easy to obtain that $ Q\left(\tilde{p},\tilde{q}\right)\le {Q}_0 $ is closed and bounded for any time internal t ∈ [0, t^∗]. Therefore, the control system is asymptotically stable.

From the LaSalle invariance principle, all solutions $ \left(\tilde{p},\tilde{q}\right) $ of Eq. (22) starting in Ω_ccoverage to its largest invariant set given by

$$ S=\left\{\left(\tilde{p},\tilde{q}\right)\in {\mathbb{R}}^{\mathcal{N}\times n}:\dot{Q}\left(p,q\right)=0\right\} $$

(49)

It can be evident that $ \dot{Q}\left(\tilde{p},\tilde{q}\right)=0 $ and $ \tilde{q}=0 $, and we can deduce that velocities of all agents asymptotically match each other, i.e.,

$$ {q}_1={q}_2=\cdots ={q}_{\mathcal{N}} $$

(50)

Now, part i) and ii) have been proved. Subsequently, we prove part iii) by contradiction strategy. Assume existing an instant time t = t₁ > 0, two individuals k and l collide, i.e., ‖p_l(t₁) − p_k(t₁)‖ ≤ D, where D is the minimum distance between two agents. For all t > 0, we obtain a smooth potential function,

$$ {\displaystyle \begin{array}{l}Q\left(p\left({t}_1\right)\right)=\frac{1}{2}\sum \limits_i\sum \limits_{j\ne i}{\psi}_{\sigma}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right)\\ {}={\psi}_{\sigma}\left({\left\Vert {p}_l\left({t}_1\right)-{p}_k\left({t}_1\right)\right\Vert}_{\sigma}\right)+\frac{1}{2}\sum \limits_{i\in \mathcal{N}\backslash \left\{k,l\right\}}\sum \limits_{j\in \mathcal{N}\backslash \left\{i,k,l\right\}}{\psi}_{\sigma}\left({\left\Vert {p}_i\left({t}_1\right)-{p}_j\left({t}_1\right)\right\Vert}_{\sigma}\right)\\ {}\ge {\psi}_{\sigma}\left({\left\Vert {p}_l\left({t}_1\right)-{p}_k\left({t}_1\right)\right\Vert}_{\sigma}\right)\end{array}} $$

(51)

Hence, existing Q(p(t₁)) ≥ ψ_σ(0) = Q₀. However, it is in contradiction with an inequality

$$ Q\left(p\left({t}_1\right)\right)\le {Q}_0 $$

(52)

Therefore, in terms of (51) and (52) in contradiction with the invariant principle, no two agents collide at any time t ≥ 0.

Furthermore, assume that G(t) switches at time t_k,k = 1, 2, ⋯, and G(t) is a fixed graph on each time-interval [t_k − 1, t_k]. Note that Q₀ is finite and the time derivative ofQ(t) in t ∈ [t₀, t₁] is (47), existing an inequality

$$ Q\left({t}_1\right)\le {Q}_0<\infty, \forall t\in \left[{t}_0,{t}_1\right) $$

(53)

Such that

$$ {\lim}_{d_{ij}\to R}\psi \left({d}_{ij}\right)=\infty $$

(54)

Clearly, it is implied that there is no edges will be lost before t₁ and be added at switching t₁. Similar to the t ∈ [t₀, t₁], time derivative Q(t) on each time-interval [t_k − 1, t_k], is also satisfied with (47). It can be given by

$$ Q\left({t}_k\right)\le Q\left({t}_{k-1}\right)\le {Q}_0<\infty, \forall t\in \left[{t}_{k-1},{t}_k\right),k=1,2\cdots $$

(55)

Thus no edges will be lost before t_k and be added at switching t_k. In addition, by virtue of the Assumption 2, G(t) can be guaranteed to keep connectivity for all t ≥ 0.

From the above proof, we can conclude that each agent in swarm can asymptotically converge to a consensus, and no collisions and splittings occur between any two agents. □.

Remark 12:

For case of δ_ij ≥ Δ_i, its stability proof is similar to the case of δ_ij < Δ_i. Due to space limitations, a more rigorous proof is omitted herein.

5 Simulation results

In this section, we will provide simulation results to demonstrate effectiveness and robustness of our proposed approach. We consider a crowed UUV swarm with N = 51. The initial positions are located in 50 × 50 m such that the initial graph is connected, and initial headings and velocities are chosen from [−π, π] rad and [0, 10]m/s, respectively. Without the loss of generality, the heading of each agent specifies the direction of velocity, and V_max = 15m/s. The nonlinear UUV model can be seen in [27]. The following parameters remain fixed through all simulations: ϕ = 2π, ω = [−π/6, π/6],m_c = 12,κ = 8,α_ij = 1.2,β_ij = 1.2,χ_ij = 1.2, ϖ_α = 100, R = 12m, ϖ_β > 100, d_min = 4m,ε = 0.5,ƛ = 0.5,ϒ_q = 1,ϒ_m = 0.2,T_d = diag [100, 100, 100],K_d = diag [10, 5, 6],F_d = 1, and D = 0.5m.

All experiments are divided into two cases: without external stimuli signal and with external stimuli signal, which are conducted in MATLAB and C++ Platform. In addition, a computer with Intel(R) Core™i7 CPU3.20GHz is utilized for simulations.

5.1 Self-organized cooperative control without external stimuli signal

In the absence of external stimuli signal, assuming that a crowded UUV swarm performs a desired velocity [5, 5]^Τ m/s. The experimental results are demonstrated to prove the asymptotic convergence of the proposed self-organized cooperative control in Figs.6, 7 and 8.

Figure 6 exhibits the flocking trajectories without external stimuli signal. Fig. 6a shows the initial pattern, in which each individual is marked by green circle with arrow, and the scale of arrow denotes the magnitude of initial velocity. Fig. 6b shows the path trajectories, it is clearly observed that the swarm can be self-organized to achieve the asymptotic convergence. Moreover, consensus and synchronicity can be simultaneously guaranteed as shown in Fig. 6c, in which all velocities are equal to[5, 5]^Τ. Fig. 6d plots the relative distance d_ij of crowded UUV swarm, and d_ij satisfies D ≤ d_ij ≤ R. Hence, we can deduce a conclusion that without any collisions and splits occurring in self-organized motion.

Figure 7 plots velocities of all individuals including velocity-x and velocity-y, respectively. Due to great convergence of the proposed consensus, these velocities can be quickly converged at t = 10s, and flocking swarm can be synchronous to the desired velocity [5, 5]^Τ at t = 50s. Additionally, one can be seen that the crowded swarm can keep a relatively stable distance without a larger change after t = 10s in Fig. 8. From all above results, all observations are in close agreement with our theoretical predictions, and it is implied that our approach can achieve a self-organized flocking without triggering any conflict and collision.

5.2 Self-organized cooperative control with external stimuli signal

In the presence of an external stimuli signal, assuming that the crowded UUV swarm performs a desired velocity [5, 5]^Τ m/s, and an informed agent who moves on boundary of swarm changes its heading in responding to the stimuli signal, which is given by

$$ {u}_i^s(t)=\hslash \left({q}_i-{q}_i^s\right) $$

(56)

where ℏ = 10 denotes the feedback gain, and $ {q}_i^s={\left[5,0\right]}^T $ is an expected velocity of agent i who is labeled as informed agent.

In order to verify the efficiency of our proposed control approach, self-organized fission/fusion method (SFF) [39] and single-informed-based distributed consensus (SDC) [40] are employed to perform the comparative experiments. The differences between the three methods are the neighbor number and following mechanism. It is noted that the external stimuli signal appears at t = 60s after flocking achieved, and comparison results are illustrated in Figs. 9, 10 and 11 and Table 1.

Table 1 Comparisons of relative distances (The optimal result is in bold)

Full size table

Figures 9, 10 and 11 present the results of comparisons among SFF, SDC and our proposed approach in the presence of an external stimuli signal, respectively. The Figs. 9a, 10a and 11a exhibit flocking trajectories, and Figs. 9b, 10b and 11b show their final patterns, respectively, in which the agent marked with red circle indicates an informed agent and the dotted line indicates its trajectory after t = 60s. The informed agent changes its heading from [5, 5]^Τ to [5, 0]^Τ at t = 60s, and its neighbors can synchronously change their original heading in Fig. 9. However, all neighbors eventually fail to track the informed agent, which leads to a splitting swarm into two sub-groups. The one sub-group is composed of the only informed agent who tracks the stimuli signal, and the other sub-group consisting 50 individuals moves in a direction between [5, 5]^Τ and [5, 0]^Τ. The similar results are provided in Fig. 10, where the swarm is also separated into two sub-groups. The only difference between Figs. 9 and 10 is that the direction of SDC with respect to the informed agent is smaller than that of SFF. The best performance of our proposed approach can be seen in Fig. 11, in which all neighbors accurately follow the informed agent and the velocity consensus can be guaranteed, such that the flocking pattern remains stable.

Additionally, the relative distances among crowded UUV swarm are considered after external stimuli signal appearing at t = 60s, as shown in Table 1. It is observed that the minimum and maximum relative distance d_ij of our proposed consensus are 1.1845 m and 11.9813 m, respectively, both of which satisfy D ≤ d_ij ≤ R, such that the objectives (9) and (10) can be guaranteed without any splits and collisions even under a stimuli signal. Furthermore, the minimum relative distances of other approaches are larger than the predefined collision distance D = 0.5m. Thus, the SFF and SDC can also achieve the avoiding collisions. However, maximum relative distances, 54.6522 m and 23.4991 m, are larger than the predefined sensing distance R = 12m, which indicates that communicating connections of SFF and SDC are broken due to splits occurring. Obviously, it is evident that our approach is more robust against fluctuations and disturbances in comparison to other approaches.

The reason why these differences are due to different the following mechanisms existed in the three approaches. Clearly, both the SFF and SDC employ the average mechanism, in which two limitations are exposed. (i) All individuals within sensing range are automatically activated to participate in local interaction, where how to optimize information transmission and computing is not fully considered. (ii) The averaged-synergy filters or dilutes the external stimuli signals, such that all heterogeneous propensities and their effects may be ignored. Conversely, our proposed approach can alleviate the above two limitations from two aspects. (i) Our following-interaction framework can ensure the minimum number and optimal distribution of neighborhood. (ii) Our adaptive computing strategy considers all individual differences via single-nearest-neighbor and multiple-nearest-neighbors synergies with nonlinear weights.

5.3 Discussion

In order to further verify the robustness of our proposed approach, three quantitative indicators such as mean heading (MH), orderness parameter (OP) and scale parameter (SP) are devised to demonstrate the comparative performances.

The MH is defined as

$$ {\varphi}_{MH}(t)=\frac{1}{N}\left|\sum \limits_{i=1}^N,{e}^{j{\vartheta}_i}\right| $$

(57)

where N represents the swarm size. Generally, a great φ_MH(t) value shows a worse flocking.

The OP is defined as:

$$ {\varphi}_{OP}=\frac{1}{N}\left|\sum \limits_{i=1}^N,\frac{q_i}{\left\Vert {q}_i\right\Vert}\right| $$

(58)

where ‖q_i‖ represents the Euclidean norm of velocity vector of agent i. Apparently,φ_OP ∈ [0, 1], the swarm is orderness if φ_OP = 1. Conversely, it is unordered whenφ_OP = 0.

The SP is defined as

$$ {\varphi}_{SP}=\sqrt{\frac{1}{N}\sum \limits_{i=1}^N{\left\Vert {p}_i-{p}_c\right\Vert}^2} $$

(59)

where p_c represents the COM given by Eq. (41). Obviously, a smaller φ_SP indicates a more cohesive distribution.

In particular, all simulations are repeated 50 times, and the results are demonstrated in Fig. 12.

Figure 12a demonstrates curves of MH results. At t = 0, all results obtained from three approaches are divergent. At t = 10s, the values asymptotically converge to a constant due to desired head [5, 5]^Τ followed by all UUVs. When an external stimuli signal $ {q}_i^s={\left[5,0\right]}^T $ appears t = 60s, this indicator significantly declines and slightly converges to its minimum at t = 65 s. Although SFF and SDC complete flocking, only our approach can simultaneously guarantee flocking and following behaviors. Therefore, it is implied that the tracking and synchronizing capabilities of our approach outperform SFF and SDC.

As shown in Fig. 12b, the OP result of our approach is sensitive to internal and external disturbances in comparison to other approaches, and greater convergence speed can be obtained to form an ordered flocking, which can be seen when the internal and external disturbances appear at t = 0 and t = 60s, respectively. Additionally, the OP result of our approach eventually converges to 1, which is higher than that of SFF and SDC. The reasons accounting for this result are insufficient information interactions and long convergence time in SFF and SDC. Consequently, it is implied that our approach has an excellent orderness.

Figure 12c illustrates SP results. In initial stage, there exists a fluctuation. After t = 10s, SP results gradually converge. Obviously, this indicator monotonously increases when individuals response to the external stimuli signal, and our approach is better than that of other two methods. The reason why these results occur is that there is no splits in our approach, but other two approaches separate the swarm into two sub-groups. Accordingly, it is clear that our designed approach is effective for UUV swarm in the presence of external stimuli signal.

Summarily, it is clear that our proposed control approach shows better robustness against fluctuations and disturbances in comparison to the SFF and SDC. It is also confirmed that the minimum number and optimal distribution of neighborhood can improve the speed of information dissemination in local interaction. Furthermore, the proposed approach can effectively achieve the swarm cohesion and collision avoidance.

6 Conclusions

In this paper, a novel bio-inspired self-organized cooperative control is developed to address behavior conflict and chain collision for crowded UUV swarm in the presence of fluctuations and disturbances. All simulation results prove that our proposed approach can obtain robust control performance without any collision and split occurring, in comparison to the existing methods via MH, OP and SP indicators. The primary contributions of this paper are summarized as follows:

(i) An optimized following interaction framework is devised by the topological interaction and visual interaction to improve the synchronous velocities and save the sensing costs, in which two constrains, i.e., limited view and crowded phenomenon, are simultaneously considered. This framework can ensure the minimum number and optimal distribution of the neighborhood for each agent in dynamic graph, instead of the previous fixed-number and fixed-distance studies.

(ii) An adaptive dynamic computing model is proposed by incorporating SNNF and WMNNF to establish an effective decision-making strategy for solving the problems of behavior conflict and chain collision, where the influence of each individual on sensitive behavior is synthesized by a nonlinear weight. Therefore, it is essentially a non-average mechanism unlike the traditional average-velocity mechanism.

(iii) By virtue of adaptive dynamic interaction topology, a cooperative control protocol based on the proposed following model and mathematics-based potential fields is designed to steer a self-organized flocking with two abilities of the connectivity-preserving and collision-avoiding. Furthermore, the sufficient condition is analyzed via Laypunov and LaSalle invariance principle. It is confirmed that our approach can be suitable for cooperative control of crowded UUV swarm.

For future works, the jointed effects of time delays and packet loss will be considered in this self-organized cooperative control approach, and the control performance can be further improved towards achieving the lager-scale heterogeneous UUV swarm.

References

Wu Y, Low KH, Lv C (2020) Cooperative Path Planning for Heterogeneous Unmanned Vehicles in a Search-and-Track Mission Aiming at an Underwater Target. IEEE Trans Veh Technol 69(6):6782–6787
Article Google Scholar
Londhe PS, Patre BM (2019) Adaptive fuzzy sliding mode control for robust trajectory tracking control of an autonomous underwater vehicle. Intell Serv Robot 12:87–102
Article Google Scholar
Ingrand F, Ghallab M (2017) Deliberation for autonomous robots: a survey. Artif Intell 247:10–14
Article MathSciNet Google Scholar
Bukhari AC, Kim YG (2013) A research on an intelligent multipurpose fuzzy semantic enhanced 3D virtual reality simulator for complex maritime missions. Appl Intell 38:193–209
Article Google Scholar
Liang HT, Qiang N (2020) Distributed Cooperative Control Based on Dynamic Following Interaction Mechanism for UUV Swarm. 2020 39th Chinese control conference (CCC), Shenyang, China, pp 5092–5097
Google Scholar
Oh H, Shirazi AR, Sun CL, Jin YC (2017) Bio-inspired self-organising multi-robot pattern formation: a review. Robot Auton Syst 91:83–100
Article Google Scholar
Ferrante E, Turgut AE, Huepe C, Stranieri A, Pinciroli C, Dorigo M (2012) Self-organized flocking with a mobile robot swarm: a novel motion control method. Adapt Behav 20(6):460–477
Article Google Scholar
Pandey P, Pompili D, Yi J (2015) Dynamic collaboration between networked robots and clouds in resource-constrained environments. IEEE Trans Autom Sci Eng 12(2):471–480
Article Google Scholar
Wang J, Wang C, Wei Y, Zhang C (2020) Neuroadaptive sliding mode formation control of autonomous underwater vehicles with uncertain dynamics. IEEE Syst J 14(3):3325–3333
Article Google Scholar
Sahu BK, Subudhi B (2018) Flocking Control of Multiple AUVs Based on Fuzzy Potential Functions. IEEE Trans Fuzzy Syst 26(5):2539–2551
Article Google Scholar
Yang H, Zhang F (2012) Robust control of formation dynamics for autonomous underwater vehicles in horizontal plane. J Dyn Syst Meas Control 134:031009
Article Google Scholar
Pan W, Jiang D, Pang Y, Qi Y, Luo D. Distributed Formation Control of Autonomous Underwater Vehicles Based on Flocking and Consensus Algorithms. In: Huang Y, Wu H, Liu H, Yin Z (eds) Intelligent robotics and applications. ICIRA 2017. Lecture Notes in Computer Science, vol 10462. Springer, Cham. https://doi.org/10.1007/978-3-319-65289-4_68
Chen YY, Zhu DQ (2020) Research on the Method of Multi-AUV Formation Control Based on Self-organized Artificial Potential Filed. Control Eng China 26(10):1875–1881
Google Scholar
Hu J, Wu Y, Li T, Ghosh BK (2019) Consensus control of general linear multiagent systems with antagonistic interactions and communication noises. IEEE Trans Autom Control 64(5):2122–2127
Article MathSciNet Google Scholar
Cai YL, Zhang HG, Liang YL, Gao ZY (2020) Reduced-order observer-based robust leader-following control of heterogeneous discrete-time multi-agent systems with system uncertainties. Appl Intell 50:1794–1812
Article Google Scholar
Maupong TM, Rapisard P (2017) Data-driven control: a behavioral approach. Syst Control Lett 101:37–43
Article MathSciNet Google Scholar
Reynolds CW (1987) Flocks, herds, and schools: a distributed behavioral model. Comput Graph 21(4):25–34
Couzin ID, Krause J, Franks NR (2005) Effective leadership and decision-making in animal groups on the move. Nature 433:513–516
Article Google Scholar
Vicsek T, Zafeiris A (2012) Collective motion. Phys Rep 517:71–140
Article Google Scholar
Aldana M, Dossetti V, Huepe C (2007) Phase transitions in systems of self-propelled agents and related network models. Phys Rev Lett 98:095702
Article Google Scholar
Liu MY, Lei XK, Yang PP (2014) Progress of theoretical modelling and empirical studies on collective motion. Chin Sci Bull 59:2464–2483
Article Google Scholar
Grünbaum D, Viscido S, Parrish JK (2005) Extracting interactive control algorithms from group dynamics of schooling fish. Coop Control 309:103–117
Nagy M, Vásárhelyi G, Pettit B, Mariani R, Vicsek T, Biro D (2013) Context-dependent hierarchies in pigeons. Proc Natl Acad Sci 110:13049–13054
Article Google Scholar
Conradt L (2012) Models in animal collective decision-making: Information uncertainty and conflicting preferences. Interface Focus 2:226–240
Article Google Scholar
Anderson JR (2004) Cognitive psychology and its implications. Worth Publishers, New York
Google Scholar
Qiu HX, Duan HB (2020) A multi-objective pigeon-inspired optimization approach to UAV distributed flocking among obstacles. Inf Sci 509:515–529
Article MathSciNet Google Scholar
Liang HT, Fu YF, Kang FJ, Gao J, Ning Q (2020) A Behavior-driven Coordination Control Framework for Target Hunting by UUV Intelligent Swarm. IEEE Access 8(1):4838–4859
Article Google Scholar
Yang PP, Liu MY, Lei XK, Song C (2016) A novel control algorithm for the self-organized fission behavior of flocking system with time delay. Int J Control Autom Syst 14(4):986–997
Article Google Scholar
Khaldi B, Harrou F, Cherif F, Sun Y (2020) Improving robots swarm aggregation performance through the Minkowski distance function. 6th international conference on mechatronics and robotics engineering (ICMRE), Barcelona, Spain, pp 87–91
Google Scholar
Chen C, Chen G, Guo L (2017) On the minimum number of neighbors needed for consensus of flocks. Control Theory Technol 15:327–339
Article MathSciNet Google Scholar
Massé B, Ba S, Horaud R (2018) Tracking gaze and visual focus of attention of people involved in social interaction. IEEE Trans Pattern Anal Mach Intell 40(11):2711–2724
Article Google Scholar
Herbert JE, Perna A, Mann RP, Schaerf TM, Sumpter DJT, Ward AJW (2011) Inferring the rules of interaction of shoaling fish. Proc Natl Acad Sci 108:18726–18731
Article Google Scholar
Duan H, Huo M, Shi Y (2020) Limit-cycle-based mutant multiobjective pigeon-inspired optimization. IEEE Trans Evol Comput 24(5):948–959
Article Google Scholar
Katz Y, Tunstrøm K, Ioannou CC, Huepe C, Couzin ID (2011) Inferring the structure and dynamics of interactions in schooling fish. Proc Natl Acad Sci 108:1870–1872
Article Google Scholar
Godsil C, Royle G (2001) Algebraic graph theory. Springer-Verlag, Berlin
Book Google Scholar
Yan ZP, Liu YB, Zhou JJ, Zhang W, Wang L (2017) Consensus of multiple autonomous underwater vehicles with double independent Markovian switching topologies and timevarying delays. Chin Phys B 26(4):040203
Article Google Scholar
Zhang XY, Jia SM, Li XZ (2017) Improving the synchronization speed of self-propelled particles with restricted vision via randomly changing the line of sight. Nonlinear Dyn 90:43–51
Article Google Scholar
Li P, Duan HB (2019) A flocking model based on selective attention mechanics. Sci Sin Technol 49(9):1040–1050
Article Google Scholar
Yang PP, Tang Y, Song JC (2018) Self-organized fission/fusion method for flocking system based on predictive intelligence. Control Decis 33(12):2270–2276
Google Scholar
Dai S, He S, Lin H, Wang C (2018) Platoon formation control with prescribed performance guarantees for USVs. IEEE Trans Ind Electron 65(5):4237–4246
Article Google Scholar

Download references

Acknowledgments

The authors acknowledge the financial support from the National Natural Science Foundation of China under Grant 11404205, Natural Science Foundation of Shaanxi under Grant 2019JQ-026 and Fundamental Research Funds for Central Universities under Grant GK201903016 and GK201803023. And the authors would like to thank all reviewers and editors who provided extensive valuable feedback.

Author information

Authors and Affiliations

School of Physics and Information Technology, Shaanxi Normal University, Xian, 710119, China
Hongtao Liang & Jie Gao
School of Computer Science and Engineering, Xi’an Technological University, Xi’an, 710032, China
Yanfang Fu

Authors

Hongtao Liang
View author publications
You can also search for this author in PubMed Google Scholar
Yanfang Fu
View author publications
You can also search for this author in PubMed Google Scholar
Jie Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongtao Liang.

Ethics declarations

Conflict of interest

This work is original research and approved by all authors. The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liang, H., Fu, Y. & Gao, J. Bio-inspired self-organized cooperative control consensus for crowded UUV swarm based on adaptive dynamic interaction topology. Appl Intell 51, 4664–4681 (2021). https://doi.org/10.1007/s10489-020-02104-5

Download citation

Accepted: 27 November 2020
Published: 05 January 2021
Issue Date: July 2021
DOI: https://doi.org/10.1007/s10489-020-02104-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Bio-inspired self-organized cooperative control consensus for crowded UUV swarm based on adaptive dynamic interaction topology

Abstract

Similar content being viewed by others

Distributed Formation Control of Autonomous Underwater Vehicles Based on Flocking and Consensus Algorithms

Bio-Inspired Formation Control for UUVs Swarm Based on Social Force Model

Optimal Formation of UUV Groups Based on Shape Theory and Improved Ant Colony Algorithm Under Communication Delay

1 Introduction

2 Preliminary knowledge and problem statement

2.1 Local interaction

2.2 Graphs theory

Remark 1:

Lemma 1:

2.3 Problem statement

Remark 2:

Assumption 1:

Remark 3:

3 Adaptive dynamic interaction topology

3.1 Following-interaction framework

3.2 Adaptive dynamic computing model

Remark 4:

Remark 5:

Remark 6:

Remark 7:

Remark 8:

4 Self-organized cooperative control protocol

4.1 Control protocol

Remark 9:

Remark 10:

Remark 11:

Assumption 2:

4.2 Stability analysis

Theorem 1:

Proof

Remark 12:

5 Simulation results

5.1 Self-organized cooperative control without external stimuli signal

5.2 Self-organized cooperative control with external stimuli signal

5.3 Discussion

6 Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation