1 Introduction

Unmanned underwater vehicle (UUV) is an imperative vehicle in oceanic engineering over the past decades, which has a wide application for monitoring, exploration and surveillance, especially in hazardous environment [1,2,3,4,5]. However, due to the limited robustness and adaptability of existing rigid-formation control methods for more complex missions [3], a crowded UUV swarm composed of a large number of homogeneous/heterogeneous and low-cost submersible intelligent robots, has been aroused more compelling interest. However, the cooperative control for this swarm is a challenge because individual behavior conflict and chain avalanche collision derived from unknown fluctuations and disturbances are easily triggered [5].

How to overcome the conflict and collision involved in the crowded UUV swarm, increasing attentions have been paid to self-organized cooperative control [6, 7]. The essence of this control approach in swarm represents a coordination of a group agents to generate and maintain a pattern in a self-organized way. Up to now, numerous studies on self-organized cooperative control have been developed, all of which can be divided into centralized and distributed control manners [7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28]. The centralized manner generally adopts a top-down modeling, where an upper controller is employed to perform the global mission. However, this scheme is vulnerable and unavailable in real applications [8]. Conversely, the distributed manner formed by down-top modeling is attracted more attentions, when a large number of agents are involved. More recently, various approaches in distributed manner have been proposed, such as leader-following [9], virtual structure [10, 11], artificial potential field [12, 13], and consensus control [14, 15].

Leader-following, as a distinguished pattern, is essentially a predefined distributed pattern to achieve a swarm control due to its simplicity and reliability, in which all following agents track a real or desired leader to maintain a specific orientation and distance among agents. For example, Wang et al. [9] proposed a leader-following formation based on sliding mode control for autonomous underwater vehicles (AUVs). Despite existing significant theoretical formalization, this scheme has a fatal limitation that the formation pattern is rigid.

In order to tackle the above weakness of leader-following scheme, the virtual structure approach is developed, where the swarm is regarded as a tight or loose entity and its desired position is assigned to a structure. Basant et al. [10] presented a formulation of cooperative control for a group of AUVs, in which a virtual flocking center was predicted by consensus protocol. Yang et al. [11] proposed a novel formation control based on Jacobi shape for multi-AUVs, where the geometric shape was designed to track a desired trajectory. However, a structure generated in this method is unavailable for avoiding collision, which may take more sensing and computing costs due to the adverse robustness.

To overcome the drawback of strict geometric relationship in both leader-following and virtual structure, the artificial potential field is provided, where each agent moves according to a gradient direction of potential field generated by total sum of all virtual attractive and repulsive forces. Pan et al. [12] designed a distributed formation control based artificial potential field to accomplish an AUV formation. In addition, Zhu et al. [13] designed a self-organized artificial potential filed formation control to avoid obstacles and replan paths for UUV formation. Although the aforementioned works are effective, there exist two common limitations, i.e., local minimum and deadlock phenomenon, when one obtains multiple attractive and repulsive forces, so adaptability is relatively weak.

Consensus control method has been widely utilized in cooperative control fields, where agents make use of information exchange to reach a common consensus on velocity or orientation for a cohesive swarm. Hu et al. [14] stated a consensus for multi-agents with antagonistic interactions and communication noises. Cai et al. [15] investigated a leader-following output consensus for discrete-time multi-agent systems with uncertainties. Apparently, these consensus-based methods are all based on an ideal communication, and the influence of each individual on formation pattern is ignored or simplified.

Unfortunately, some above-mentioned techniques on distributed control are only available for desired environment and lack the self-organized cooperative behavior. In fact, the UUV is always exposed to the disturbed environment, and its sensing and communicating are limited. Therefore, developing high-performance cooperative control is a major challenging task in real application. Behavior-driven methods [7, 16,17,18,19,20,21,22,23,24,25,26,27,28] inspired by collective behaviors in nature, such as a school of fish, a flock of birds, a herd of sheep and a swarm of ants, have received considerable attentions in the field of cooperative control. Two existing modeling styles, swarm-based macroscopic mode and individual-based microscopic mode, are employed to design a behavior-driven method. The former study works on an entire larger-scale swarm while ignoring all underlying interactions. Conversely, the latter focuses on how to describe the cluster motion via individual interaction. Due to intuitive and appealing interaction potentials, the latter has gained increasing interest in cooperative control.

As one landmark work, Reynolds et al. [17] primarily introduced a distinguished swarm model, so-called Boids, in which three behavioral rules, attraction, repulsion and alignment, were devised to establish a flock. Based on this, various models such as Vicsek [18], Couzin [20], and social force model [21], have been proposed for swarm coordination. The importance of aforementioned works is that it successfully shows the behavioral emergence of a group only influenced by its neighbors and environment [19]. However, most previous works devote on the velocity-average mechanism to achieve rendezvous, cohesion and consensus, in which more heterogeneous characteristics are ignored in local interaction.

In recent past decade, with a rapidly technical development of target tracking, image processing, and data analysis, more hidden interactions are excavated such as topological interaction [22], single-neighbor following [23], multiple-nearest-neighbors following [24], visual perception and attention [25]. The above interactions not only reveal the inherent mechanism of behavioral emergence, but also provide some solutions for self-organizing cooperative control. Duan et al. [26] designed a hierarchical network with behavior learning mechanism inspired by single-neighbor following in pigeon flocking, thereby achieving an unmanned aerial vehicle formation control. Liang et al. [27] presented a behavior-driven cooperative control strategy, in which multiple neighbors can form an immune network to achieve an intelligent swarm of UUVs. Yang et al. [28] proposed a control strategy for time-delay self-organized fission behavior of flocking system. Although most above-mentioned works can achieve a cooperative control, the lower synchronous velocity still exists and the robustness against disturbances is terrible, especially under the unpredictable ocean, which may easily trigger conflicts and collisions.

Motivated by above observations, this paper considers the cooperative control problem for crowded UUV swarm. A novel bio-inspired self-organized cooperative control approach inspired by intelligent perception and interactive computing from various biological clusters is creatively proposed. Firstly, a following interaction framework is designed to ensure the minimum number and optimal distribution of the neighborhood via the topological interaction and visual interaction. Then, a novel computing model involved with single-nearest-neighbor following (SNNF) and weighted multiple-nearest-neighbors following (WMNNF) is proposed to achieve an adaptive and dynamic following process, and the influence of individual on the sensitive behavior is described by a nonlinear weight, which is essentially a non-average velocity mechanism unlike the traditional average-mechanism. Moreover, a cooperative control protocol based on the proposed following model and potential field, is designed to achieve a self-organized flocking for crowded UUV swarm. Simulation results demonstrate that our approach can accomplish the cooperative control with a higher effectiveness and robustness in comparison to the existing methods. The main contributions of this paper are summarized as follows:

  1. (i)

    A bio-inspired self-organized cooperative control consensus derived from adaptive dynamic interaction topology is firstly proposed for the crowded UUV swarm, by elaborately configuring the latest intelligent perceptions and interactive computing mechanisms.

  2. (ii)

    An adaptive dynamic interaction topology is creatively proposed to steer interactions via the minimum number and optimal distribution of neighborhoods, which is essentially a non- average mechanism considering unlike the previous average-mechanism [17,18,19,20,21].

  3. (iii)

    A self-organized cooperative control protocol is designed by using the proposed following model and mathematics-based potential fields, which can effectively achieve the swarm cohesion and avoiding collision to solve the individual behavior conflict and chain-avalanche collision.

  4. (iv)

    In formulating the proposed approach, three quantitative indexes, such as mean heading, orderness parameter, and scale parameter, are devised to prove that the robustness and efficiency of our approach are better than that of the existing approaches.

The remaining of this paper is organized as follows. Section 2 describes preliminary knowledge and problem statement. Section 3 presents an adaptive dynamic interaction topology. Section 4 designs a self-organized cooperative control protocol. Section 5 performs all simulations. Finally, a conclusion is given in Section 6.

2 Preliminary knowledge and problem statement

2.1 Local interaction

It has been revealed that the local interaction can describe the collective behavior of biological cluster in [19,20,21,22,23,24,25,26,27,28]. In the formulating the local interaction, how to describe an individual action influenced by effects from its neighbors is a key topic. Up to now, the topological interaction, visual interaction, single-nearest-neighbor synergy and multiple-nearest-neighbors synergy have been investigated by bio-inspired scientists in physics and engineering [22,23,24,25].

For topological interaction, it was firstly discovered in analyzing the flight data of starlings [29], where each individual in the group only interacts with the nearest 6 or 8 neighbors. This study subverts a traditional definition of neighbor that an individual can interact with all neighbors who is within a fixed distance as shown in Fig. 1a. The reason for this difference is that a specific number of neighbors is subject to the cortical elaboration of prenumeric ability, rather than individual’s perception ability. It is reported that an interaction network formed by topological interaction, as shown in Fig. 1b, has a trade-off between perceived cost and group robustness [30].

Fig.1
figure 1

a Distance Interaction and bTopological Interaction. a The distance interaction selects all neighbors in a specific fixed distance, i.e., sensing region marked by solid circle with radius R. Most of the existing models, i.e., Boids [17], Vicsek [18], and Couzin [20], all rely on this aprioristic assumption. b The topological interaction can generate the interactions within a fixed number of neighbors, for example the specific k in a dotted circle, whose radius is less than that of distance interaction. Compared with the distance interaction, this interaction is more suitable to keep cohesion in the presence of fluctuations and disturbances

For visual interaction, it is implied that the visual perception and visual attention can provide a solution to depict individual interactions in biological cluster. (i) The visual perception can sense all neighbors around within the sensing range, which is based on limited view (blindness angles). It is intuitively expected to improve synchronization motion as shown in Fig. 2a. (ii) The visual attention is an important psychological mechanism to select the most relevant information from limited view [25], i.e., it can discard a large amount of background information to achieve an efficient screening, as illustrated in Fig. 2b. Hence, some metaphorical mechanisms inspired by visual interaction can be useful to guide the cohesion and avoidance [31].

Fig.2
figure 2

a Visual Perception and b Visual Attention

For single-nearest-neighbor synergy, it states that the individual only tracks a single neighbor. For example, a school of fishes is dominated by an intermittent pairwise interaction between the individual and its nearest neighbor [21]. Similarly. Herbert et al. [32] used a neural network to analyze the tracking behavior in shoaling fish, revealed that each individual actually responded only to the nearest neighbor. In addition, a single-nearest-neighbor synergy based on hierarchical relationship was ensconced in pigeon-Inspired optimization [33]. Clearly, a distinct advantage of this mechanism is smaller computing cost, which is conducive to improving the robustness and sensitiveness.

For multiple-nearest-neighbors synergy, its essence is that an individual can synthesize all influences from multiple-nearest neighbors to make decision for its behavior. Due to its robustness for collective behaviors, several popular models, such as Boids, Vicsek and Couzin [19], all adopt this synergy. However, existing works are all based on velocity-average mechanism, in which accompanying properties, i.e., sensing, communicating, influencing and decision-making abilities, are ignored in this procedure. Actually, these features play an important role in achieving a collective behavior. It was reported that influences from multiple-nearest neighbors have nonlinear characteristic, rather than a simple linear superposition [34]. Thus, the non-average mechanism involved in multiple-nearest-neighbors synergy is a worthy of discussing in flocking control.

2.2 Graphs theory

It is convenient to model the neighbor interactions between agents by an undirected or directed graph [35], where an individual UUV is regarded as a node and the interconnection topology in UUV swarm can be described as a graph.

Suppose that a graph G(V, E, A) consists of a node set V = {v1, v2, ⋯, vn} and a edge setE ⊆ V × V, in which each edge is a pair of vertices (vi, vj) such as i ≠ j. If (vi, vj) ∈ E, then one usually says that i and j are adjacent.A = [aij] ∈ n × n is defined as adjacent matrix which is an integer matrix with rows and columns indexed by vertices, such that (vi, vj) ∈ E is equal to aij = 1, else aij = 0. The Laplacian matrix L = [lij] ∈ n × n is denoted by L = D-A, where D = [dij] is a diagonal matrix with \( {d}_{ij}={\sum}_{j=1}^n{a}_{ij} \). The neighbor set for agent i is denoted as Ni = {vj ∈ V| (vi, vj) ∈ E}.

Remark 1:

If (vi, vj) ∈ E ⇔ (vj, vi) ∈ E, G is an undirected graph, else it is called a directed graph. Furthermore, G is strongly connected if there is a directed path from (vi, vj) and (vj, vi)between any pair of distinct nodes vi and vj. A spanning tree of a directed graph is a directed tree formed by graph edges that connect all nodes of the graph.

Lemma 1:

For a connected graph G, its L is symmetrical and positive semidefinite, and all eigenvalues are non-negative real number which denoted by λi ∈ ℂ with an ascending order in magnitude, i.e.,0 = λ1(L) ≤ λ2(L) ≤ ⋯ ≤ λn(L).

2.3 Problem statement

Consider a group of N agents in the crowded UUV swarm, labeled 1,..., N, moving in horizontal plane. For agent i, its kinematic and dynamic with six-Degree of Freedom can be given by

$$ \Big\{{\displaystyle \begin{array}{l}{\dot{\eta}}_i=J\left({\psi}_i\right){v}_i\\ {}{M}_i{\dot{v}}_i+{C}_i\left({v}_i\right){v}_i+{D}_i\left({v}_i\right){v}_i+{g}_i\left({\eta}_i\right)={\tau}_i+{w}_i\end{array}} $$
(1)

where ηi = [xi, yi, ψi]T ∈ 3 is the positions and yaw angle in earth-fixed frame, respectively. vi = [ui, vi, ri]T ∈ 3 is the velocity vector in body-fixed frame, where ui is a surge velocity, vi is a sway velocity, and ri is an angular yaw velocity.τi = [τiu, 0, τir]T ∈ 3 is a control input vector, and gi = [giu, giv, gir]T ∈ 3 denotes the generalized gravitational and buoyancy forces, and wi = [wiu, wiv, wir]T ∈ 3 denotes total unknown external disturbances caused by winds, waves and ocean currents. Mi, Ci(vi) and Di(vi) are the inertia matrix, Coriolis matrix and damping matrix, respectively. TheJi(ψi)is a transformation matrix given by

$$ {J}_i\left({\psi}_i\right)=\left[\begin{array}{ccc}\cos {\psi}_i& -\sin {\psi}_i& 0\\ {}\sin {\psi}_i& \cos {\psi}_i& 0\\ {}0& 0& 1\end{array}\right] $$
(2)

Due to the hydrodynamic performance induced by the assumption of symmetry in plane and vertical directions, gi = [giu, giv, gir]T = 0. Moreover, the technique of state feedback linearization is utilized to simplify the kinematic and dynamic model into a general double-integrator model [36].

Specifically, the nonlinear model (1) can be written as

(3)

where N(vi)vi = Ci(vi)vi + Di(vi)vi,G(ζ) denotes the hydrodynamic coefficients, and denotes the sum of forces and rudder angles.

Defining ξi = [ηT, vT]T and h(ξi) = η, a standard nonlinearization function can be obtained from (3) as follows

(4)

where is the input of this nonlinear system.

Then, a new transformed coordination with pi = [hi(ξ)]T ∈ 3 and qi = [f hi (ξ)]T ∈ 3 is defined by derivative of h(ξi), and its corresponding control input can be defined as

(5)

where \( B\left({\xi}_i\right)={\left[{\partial}_f^2{h}_i\left(\xi \right)\right]}^{\mathrm{T}} \), and Γ(ξi) = [gfhi(ξi)] ∈ 3 × 3.

Thus, the motion model (1) can be represented by a second-order integrator model by (3) and (4), as follows

$$ \Big\{{\displaystyle \begin{array}{l}{\dot{p}}_i={q}_i\\ {}{\dot{q}}_i={u}_i\end{array}}\kern1.5em \left(i=1,\mathrm{2...},N\right) $$
(6)

where pi ∈ 3 and qi ∈ 3 are the position and velocity variables of agent i, respectively, and ui is the control input.

In addition, an underlying interaction topology among agents may dynamically change, which can be described by neighbor set in a connected graph, as follows

$$ {N}_i(t)=\left\{j|{d}_{ij}(t)\le {R}_i,j\in \left\{1,\cdots, N\right\},j\ne i\right\} $$
(7)

where Ri is the sensing radius of agent i, and dij(t) = ‖pi − pj‖ is an Euclidean distance between agent i and agent j.

Remark 2:

The neighbor graph is essentially a distance-dependent and time-varying topology, there are two reasons given by (i) each agent has a nonuniform sensing strength due to the factors such as limited perception range and narrow motion-space in scale effects, and (ii) all agents are influenced by external disturbances composed of information loss, delayed communication and noises.

Assumption 1:

The velocity is bounded to prevent the collisions caused by motion inertia, which is given by

$$ {q}_i=\Big\{{\displaystyle \begin{array}{l}{q}_i,\kern3.5em \left\Vert {q}_i\right\Vert \le {V}_{\mathrm{max}}\\ {}{V}_{\mathrm{max}}\frac{q_i}{\left\Vert {q}_i\right\Vert },\kern0.75em \left\Vert {q}_i\right\Vert >{V}_{\mathrm{max}}\end{array}} $$
(8)

where Vmax is the maximum velocity.

In this paper, our objective is to design a self-organized cooperative control that can guarantee that the crowded UUV swarm can achieve the cohesive flocking and collision avoiding without any splits, under the assumption that the initial network is a connected graph. In particular, the objectives of this paper are formulated as

$$ \underset{t\to \infty }{\lim}\left\Vert {p}_i(t)-{p}_j(t)\right\Vert \le R,\kern0.75em \left(i,j\in N\right) $$
(9)
$$ \underset{t\to \infty }{\lim}\left\Vert {p}_i(t)-{p}_j(t)\right\Vert \ge D,\kern0.75em \left(i,j\in N\right) $$
(10)
$$ \underset{t\to \infty }{\lim}\left\Vert {q}_i(t)-{q}_j(t)\right\Vert \to 0,\kern1em \left(i,j\in N\right) $$
(11)

where R denotes the sensing radius, and D denotes the minimum distance for inter-collision avoidance.

Remark 3:

The objective (9) can preserve the connectivity with any splittings, and objective (10) can guarantee the collision avoidance among all agents, and the objective (11) can reach a velocity consensus for all agents.

3 Adaptive dynamic interaction topology

Based on above-mentioned preliminaries, an adaptive dynamic interaction topology is proposed in this section, and the related framework and model are briefly demonstrated.

3.1 Following-interaction framework

The following-interaction framework composed of three key steps are devised by integrating four bio-inspired interaction mechanisms, as illustrated in Fig. 3.

Fig. 3
figure 3

Following-Interaction Framework. In this framework, the minimum number and optimal distribution of neighbors can be achieved through topological interaction and visual interaction, and the making-decision of individual behavior is adaptive and robust via SNNF and WMNNF. We use a yellow circle, black circle and purple circle to represent the centered individual, interacting individual and non-interacting individual, respectively. It is noted that the centered individual is only a simple reference individual in swarm, rather than a global centered agent in centralized control

Step1: The topological interaction is firstly employed to select all potential neighbors with a reasonably prescribed number, unlike the previous local interactions depend on an aprioristic assumption [29]. Therefore, topological interaction can be employed to shrink the number of neighbors in comparison to the traditional metric-distance interaction.

Step2: The visual interaction is devised to ensure the minimum number and optimal distribution of the neighborhood within local interaction. It consists two parts: (i) Due to the restrictions of the equipped sonars, the sensing field of UUV is limited. If this limited field is reasonably considered, it is an optimal strategy that the neighbor who is located in blindness zone is excluded to accomplish the local interaction. (ii) Due to the crowded phenomena existed in topological interaction, each individual has multiple nearest neighbors in a certain direction-interval. If one nearest neighbor who is located in each direction-interval, is selected to form a new neighborhood for each individual. Therefore, this step can further decrease the number and optimize the distribution of neighborhood after topological interaction.

Step3: The local following is essentially a non-average making-decision strategy integrated by single-nearest-neighbor and multiple-nearest-neighbors, rather than an average-velocity mechanism that is commonly assumed to design consensus or protocols. If the control parameter is smaller than an empirical triggered threshold, the SNNF originated from single-nearest-neighbor synergy is adopted to achieve a directed transmission of information, else the weighted-MNNF (WMNNF) originated from multiple-nearest-neighbors is proposed to synthesize all individual differences, in which a weight of each agent is determined by its corresponding interaction control parameter. In this step, advantages of the two following models are formulated in an unified framework, which can provide an adaptive making-decision to address behavior conflict and chain collision.

3.2 Adaptive dynamic computing model

In terms of the proposed following-interaction framework, its corresponding computing model is developed in this subsection. Consider a group of N agents in the crowded UUV swarm, each individual has an omnidirectional view field with sensing radius R and blindness sectorω. The specific procedure is demonstrated as follows.

(i) Consider an omnidirectional view field of individual i, its neighbors setNiat time t is determined by

$$ {N}_i\left(t,\varphi \right)=\left\{j|{d}_{ij}\le {R}_i,j\in \left\{1,\cdots, N\right\},j\ne i\right\} $$
(12)

where ϕ denotes the view angle, and Ni < N.

Remark 4:

Eq. (12) is essentially a fixed distance interaction determined by sensing radius, and most of existing models all adopt this to decide the interaction intensity [17,18,19,20,21].

(ii) The topological interaction is employed to select a fixed number with κ-nearest neighbors from Ni, determined by

$$ {N}_i^1\left(t,\varphi, \kappa \right)=\underset{j\in {N}_i}{\mathrm{argmin}}\left\{j|{d}_{ij}(t)\right\} $$
(13)

where \( {N}_i^1<{N}_i \).

Remark 5:

Eq. (13) can decreases the number of neighbors in comparison to the fixed-distance interactions, such as Boids, Vicsek, and Couzin [19].

(iii) The blindness angel ω is considered to improve the speed of information transmission via decreasing the omnidirectional view, which is consistent with the blindness sector caused by the restricted physical size [27]. Assuming the UUV is symmetrical about UUV heading, as shown in Fig. 4. Such that, the neighbor marked with the purple circle, who is located in the blindness sector, is excluded from interacting with others.

Fig. 4
figure 4

Neighboring region of the agent i with a blindness angle

Hence, a new neighbor set \( {N}_i^2 \) with restricted blindness sector ω is determined by

$$ {N}_i^2\left(t,\varphi, \kappa, \omega \right)=\left\{j|{d}_{ij}(t)\wedge {\varphi}_i-{\omega}_i,j\in {N}_i^1\ \right\} $$
(14)

where \( {N}_i^2<{N}_i^1 \).

(iv) The individual i focuses on correlating and transmission of abnormal behaviors by utilizing the visual attention [37, 38], of which essence is that φi − ωi is divided intomc sectors along clockwise direction in the heading of agent i, then the nearest individual in each sub-sector is chosen, as shown in Fig. 5.

Fig. 5
figure 5

Neighboring region of the individual i with divided sectors

In particular, the view angle of each sub-sector of individual i is calculated as

$$ {\xi}_{im}=\frac{\varphi_i-{\omega}_i}{m_c}\kern1em \left(m=1,2,\cdots, {m}_c\right) $$
(15)

Then, the neighbor set \( {\varTheta}_i^m \) in m-th sub-sector is determined by

$$ {\varTheta}_i^m(t)=\left\{j|{\xi}_{im}\left(m-1\right)\le {\theta}_{ij}-{\vartheta}_i\le {\xi}_{im}m\right\} $$
(16)

where ϑi is the heading of individual i, and θij is the azimuth of individual j relative to individual i. Such that the nearest neighbor in each sub-sector \( {\ell}_i^m \) is chosen as

$$ {\ell}_i^m(t)=\left\{j|j=\underset{j\in {\varTheta}_i^m}{\arg\ \min }{d}_{ij},m=1,2,\cdots, {m}_c\right\} $$
(17)

Finally, all the nearest individuals can generate a new neighbor set \( {N}_i^3 \), which is given by

$$ {N}_i^3\left(t,\varphi, \kappa, \omega, {m}_c\right)=\left\{{\ell}_i^m(t),m=1,2,\cdots, {m}_c\right\} $$
(18)

where \( {N}_i^3<{N}_i^2 \).

Remark 6:

Eqs. (15) ~ (18) are essentially an optimal strategy to select the all candidate neighbors from \( {N}_i^2 \). The reasonable number of sub-sectors and the choosing operation in each sub-sector can guarantee the only one agent existed in each sub-sector, which can address the crowded problem, and further avoid the behavior conflict and chain collision via an optimal distribution in \( {N}_i^3 \).

(v) Based on \( {N}_i^3 \), an interaction control parameter δij(t) is calculated by a synergy of position pi, velocity qi and heading ϑi to quantify the interaction intensity between individuals i and j [39]:

$$ {\delta}_{ij}(t)=\frac{\alpha_{ij}}{1+\left\Vert {p}_i-{p}_j\right\Vert}\times \frac{\beta_{ij}\left\Vert \left({q}_i-{q}_j\right)\times {\overline{q}}_i\right\Vert }{\sum_{j\in {N}_{i3}}\left\Vert \left({q}_i-{q}_j\right)\times {\overline{q}}_i\right\Vert}\times \frac{\chi_{ij}}{1+\left\Vert {\vartheta}_i-{\vartheta}_j\right\Vert } $$
(19)

where αij, βijand χij denote the coefficients corresponding with position, velocity and heading, respectively. And, \( {\overline{q}}_i \) is an average-velocity of neighbor set \( {N}_i^3(t) \).

Remark 7:

The δij(t) is related to various factors such as relative distance, speed and heading between individuals, as well as the number of neighbors and spatial distribution. Furthermore, in the evolving process, swarm graph is dynamic, hence δij(t) exhibits a time-varying characteristic.

(vi) The SNNF and WMNNF by virtue of the proposed interaction control parameter δij(t) are to design an adaptive following interaction model \( {u}_i^f(t) \), which is given by

$$ {u}_i^f(t)=\Big\{{\displaystyle \begin{array}{l}{\sum}_{j\in {N}_i^3}{w}_j{u}_j\left(t-\varDelta t\right),\kern1.5em {\delta}_{ij}<{\varDelta}_i\\ {}{u}_j\left(t-\varDelta t\right),\kern5em {\delta}_{ij}\ge {\varDelta}_i\kern0.5em \end{array}} $$
(20)

where Δi is an empirical parameter, Δt denotes a sampling period, and wj represents a weight with control law \( {u}_i^f(t) \) of individual j. The wj is given by

$$ {w}_j(t)=\frac{u_j(t)}{\sum \limits_{j\in {N}_i^3}{u}_j(t)} $$
(21)

where \( \sum \limits_{j\in {N}_i^3}{w}_j=1 \), and wj(t) ≥ 0.

Remark 8:

If existing δij < Δi, individual i adopts a WMNNF pattern. Conversely, if δij ≥ Δi, it adopts a SNNF pattern to follow \( j=\underset{j\in {N}_i^3}{\mathrm{argmax}}\left\{{\delta}_{ij}|{\delta}_{ij}\ge {\varDelta}_i\right\} \) with the largest intensity.

4 Self-organized cooperative control protocol

In this section, a novel self-organized cooperative control protocol is proposed, and its stability analysis is given to illustrate the feasibility and effectiveness.

4.1 Control protocol

The self-organized cooperative control protocol is derived from the proposed adaptive dynamic interaction topology and mathematics-based potential fields. In this approach, each UUV applies a control protocol that consists of four terms

$$ {u}_i(t)={u}_i^p+{u}_i^q+{u}_i^f+{u}_i^{dis},i=1,2,...,N $$
(22)

where \( {u}_i^p \) is the position coordination term, and \( {u}_i^q \) is the velocity consensus term, and \( {u}_i^f \) is the proposed following interaction term, and\( {u}_i^{dis} \) is the disturbance term.

  1. (i)

    \( {u}_i^p \) is utilized to adjust the positions between individual i and its neighbors in \( {N}_i^3 \), which embodies the both attraction and repulsion rules [17]. It is given by the gradient-based potential fields as follows

$$ {u}_i^p(t)=\sum \limits_{j\in {N}_i^3}{\psi}_{\alpha}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right){\sigma}_{\varepsilon}\left\Vert {p}_i-{p}_j\right\Vert +\sum \limits_{j\in {N}_i^3}{\psi}_{\beta}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right){\sigma}_{\varepsilon}\left\Vert {p}_i-{p}_j\right\Vert $$
(23)

where, ψα and ψβ are the pairwise smooth attractive and repulsive potential functions, and ‖·‖σ is a σ-norm of a vector pi − pj, which is defined as

$$ {\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma }=\left(1/\varepsilon \right)\left[\sqrt{1+\varepsilon {\left\Vert {p}_i-{p}_j\right\Vert}^2}-1\right] $$
(24)

where ε ∈ (0, 1) is a fixed parameter. The gradient of ‖pi − pjσ defined by σεpi − pj‖ is given by

$$ {\sigma}_{\varepsilon}\left\Vert {p}_i-{p}_j\right\Vert =\nabla {\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma }=\frac{\left\Vert {p}_i-{p}_j\right\Vert }{\sqrt{1+\varepsilon {\left\Vert {p}_i-{p}_j\right\Vert}^2}}=\frac{\left\Vert {p}_i-{p}_j\right\Vert }{1+\varepsilon {\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma }} $$
(25)

where ∇ denotes a gradient operator.

Due to the limited forces provided by actuators, both ψα and ψβ triggered by two continuously bounded active functions Θα and Θβ, are designed to generate repulsive and attractive forces of all agents. Specifically, the repulsive function and its corresponding active function are given by

$$ {\psi}_{\alpha}\left({d}_{ij}\right)={\int}_{{\left\Vert R\right\Vert}_{\sigma}}^{d_{ij}}{\varTheta}_{\alpha }(s) ds $$
(26)
$$ {\varTheta}_{\alpha}\left({d}_{ij}\right)=\Big\{{\displaystyle \begin{array}{l}-{\varpi}_{\alpha }/{d}_{ij}^2,\kern0.5em {d}_{ij}\in \left[0,{\left\Vert R\right\Vert}_{\sigma}\right)\\ {}0,\kern3em {d}_{ij}\in \left[{\left\Vert R\right\Vert}_{\sigma },+\infty \right)\end{array}} $$
(27)

where ϖα > 0 is a designing parameter.

Similarly, the attractive function and its corresponding active function are given by

$$ {\psi}_{\beta}\left({d}_{ij}\right)={\int}_0^{d_{ij}}{\varTheta}_{\beta }(s) ds $$
(28)
$$ {\varTheta}_{\beta}\left({d}_{ij}\right)=\Big\{{\displaystyle \begin{array}{l}\frac{-{\varpi}_{\beta}\left({d}_{ij}-{d}_{\mathrm{min}}\right)\left({d}_{ij}-{\left\Vert R\right\Vert}_{\sigma}\right)}{d_{ij}},\kern0.5em {d}_{ij}\in \left[0,{\left\Vert R\right\Vert}_{\sigma}\right)\\ {}0,\kern10.5em {d}_{ij}\in \left[{\left\Vert R\right\Vert}_{\sigma },+\infty \right)\end{array}} $$
(29)

where ϖβ > 0 is a designing parameter, and dmin represents the minimum local distance.

Hence, the total potential functions for the crowded UUV swarm are given by

$$ V\left(p,q\right)=\frac{1}{2}\sum \limits_i\sum \limits_{j\ne i}{\psi}_{ij}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right) $$
(30)

where ψij = ψα + ψβ.

To simplify the presentation, (23) can be given by

$$ {u}_i^p(t)=\sum \limits_{j\in {N}_i^3}{\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right) $$
(31)
  1. (ii)

    \( {u}_i^q \) represents the velocity alignment using consensus-based protocol

$$ {u}_i^q(t)=\sum \limits_{j\in {N}_i^3}{a}_{ij}(q)\left({q}_j-{q}_i\right) $$
(32)

where aij > 0 is an element of adjacency matrix.

  1. (iii)

    \( {u}_i^f \) represents the proposed following interaction, which is utilized to control individuals tracking the informed agent and make the velocity consensus. Its expression is of form as

$$ {u}_i^f(t)=\Big\{{\displaystyle \begin{array}{l}{\sum}_{j\in {N}_i^3}{w}_j\left({\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {p}_i-{p}_i\right\Vert}_{\sigma}\right)+{\lambda}_1{a}_{ij}\left({q}_i-{q}_j\right)\right),\kern1.5em {\delta}_{ij}<{\varDelta}_i\\ {}{\lambda}_{2i}{\nabla}_{p_i}{\psi}_{ij}\left(\left\Vert {p}_i-{p}_j\right\Vert \right)+{\lambda}_2{a}_{ij}\left({q}_i-{q}_j\right),\kern5em {\delta}_{ij}\ge {\varDelta}_i\kern0.5em \end{array}} $$
(33)

where Δi is an empirical parameter determined by

(34)
$$ {\varPhi}_i=\frac{1}{\mid {N}_i\mid +1}\left\Vert \sum \limits_{j\in {N}_i^3\cup i},\frac{q_j}{\left\Vert {q}_j\right\Vert}\right\Vert $$
(35)

where ƛ is positive constant. Moreover, λ2i and λ2 are feedback gain coefficients, where λ2i is given by

$$ {\lambda}_{2i}=\Big\{{\displaystyle \begin{array}{l}{\varUpsilon}_q,\kern10.25em \left\Vert {q}_i\right\Vert \le {V}_{\mathrm{max}}\\ {}\frac{\varUpsilon_q{\varUpsilon}_m}{\left\Vert {q}_i\right\Vert +{\varUpsilon}_m-{V}_{\mathrm{max}}},\kern5.25em \left\Vert {q}_i\right\Vert >{V}_{\mathrm{max}}\kern0.5em \end{array}} $$
(36)

where ϒq and ϒm are positive constants, and Vmax is the maximum velocity given by Eq. (8).

  1. (iv)

    \( {u}_i^{dis} \) describes a composing of external disturbances and noises, which is given by

$$ {\dot{u}}_i^{dis}(t)+{T}_d{u}_i^{dis}(t)={K}_d{F}_d $$
(37)

where Td denotes constant matrix,Kd denotes gain matrix, and Fd denotes the largest amplitude of the white noise. It should be noted that \( {u}_i^{dis} \) is continuous and bounded.

Remark 9:

In control protocol Eq. ( 22 ), the individual characteristics represented by w j can generate an influence on forming the neighbor set \( {N}_i^3\left(t,\psi, k,\omega, {m}_c\right) \) , which is not similar to the existing models that all individuals are assumed to be an equal state. Thus, the proposed control protocol is a more realistic model.

Remark 10:

In control protocol Eq. (22), each individual in neighbors set is updated within each sampling period, rather than just tracking a fixed target to follow. Therefore, it is essentially different from the leader-following control protocol [9].

Remark 11:

In control protocol Eq. (22), if existing δij < Δi,\( {u}_i^f \) is determined by its all neighbors. if existing δij ≥ Δi,\( {u}_i^f \) is determined by the individual with largest interaction intensity. Thus, the control protocol in [40] can be regarded as a special case of our proposed control protocol.

Assumption 2:

In the G associated with \( {N}_i^3\left(t,\psi, k,\omega, {m}_c\right) \) , there exists a directed path from the informed agent to any other agent.

4.2 Stability analysis

Theorem 1:

Consider a crowded UUV swarm with dynamic model (1), if the initial graph G(0) is connected and initial state is from the LaSalle invariance principle, and Assumption 2 holds, such that all agents can achieve a self-organized cooperative control consensus under the proposed control protocol (22) ~ (37). Then, following statements hold:

  1. i)

    The velocities of all individuals will asymptotically converge to a consensus.

  2. ii)

    Almost every final configuration reaches in a local minimum.

  3. iii)

    No collisions occur for all t ≥ 0.

  4. iv)

    No splittings occur for all t ≥ 0.

Proof

Consider a Lyapunov-like energy function under δij < Δi

$$ Q\left(p,q\right)=\frac{1}{2}\sum \limits_{i=1}^N\left(U(p)+{\left({q}_i-{q}_j\right)}^T\left({q}_i-{q}_j\right)\right) $$
(38)

where

$$ U(p)=V\left(p,q\right)+{w}_j\sum \limits_{j={N}_i^3}{\psi}_{ij}\Big({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma } $$
(39)

Substituting (30) into (39), we can obtain

$$ U(p)=\left(1+{w}_j\right)\sum \limits_{j={N}_i^3}{\psi}_{ij}\Big({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma } $$
(40)

Let (pc, qc) be the center of mass (COM) of neighbour set, which is given by

$$ {p}_c=\frac{1}{\mathcal{N}}\sum \limits_{i=1}^{\mathcal{N}}{p}_i,{q}_c=\frac{1}{\mathcal{N}}\sum \limits_{i=1}^{\mathcal{N}}{q}_i $$
(41)

where \( \mathcal{N}={N}_i^3(t) \) is of convenience.

Let \( {\tilde{p}}_i={p}_i-{p}_c \) and \( {\tilde{q}}_i={q}_i-{q}_c \) describe the relative positions and velocities between agent i and (pc, qc), then \( {p}_i-{p}_j={\tilde{p}}_i-{\tilde{p}}_j={\tilde{p}}_{ij} \),\( {q}_i-{q}_j={\tilde{q}}_i-{\tilde{q}}_j={\tilde{q}}_{ij} \), and \( {u}_i^p(t) \) can be now written as

$$ {u}_i(t)=-\left(1+{w}_j\right)\sum \limits_{j\in \mathcal{N}}{\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)-{a}_{ij}{\tilde{q}}_{ij}-{\lambda}_1{w}_j\sum \limits_{j\in \mathcal{N}}{a}_{ij}{\tilde{q}}_{ij} $$
(42)

Then (38) is rewritten as

$$ Q\left(\tilde{p},\tilde{q}\right)=\frac{1}{2}\sum \limits_{i=1}^{\mathcal{N}}\Big(\left(1+{w}_i\right)\sum \limits_{j\in \mathcal{N}}{\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma }+{{\tilde{q}}_{ij}}^T{\tilde{q}}_{ij}\right) $$
(43)

According to the symmetry of ψij and the symmetric matrices of aij, then

$$ \frac{\partial {\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)}{\partial {\tilde{p}}_{ij}}=\frac{\partial {\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)}{\partial {\tilde{p}}_i}=-\frac{\partial {\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)}{\partial {\tilde{p}}_j} $$
(44)

Differentiating (43) gives

$$ \dot{Q}\left(\tilde{p},\tilde{q}\right)=\left(1+{w}_i\right)\sum \limits_{i=1}^{\mathcal{N}}{\nabla}_{{\tilde{p}}_{ij}}{\left(\sum \limits_{j\in \mathcal{N}}{\dot{\psi}}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)\right)}^{\mathrm{T}}{\tilde{q}}_{ij}+{{\tilde{q}}_{ij}}^T{u}_i $$
(45)

Substituting (42) into (45), we have

$$ {\displaystyle \begin{array}{l}\dot{Q}\left(\tilde{p},\tilde{q}\right)=\left(1+{w}_i\right)\sum \limits_{i=1}^{\mathcal{N}}{\nabla}_{{\tilde{p}}_{ij}}{\left(\sum \limits_{j\in \mathcal{N}}{\dot{\psi}}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)\right)}^{\mathrm{T}}{\tilde{q}}_{ij}+\sum \limits_{i=1}^{\mathcal{N}}{{\tilde{q}}_{ij}}^T\left[-\left(1+{w}_j\right)\sum \limits_{j\in \mathcal{N}}{\nabla}_{p_i}{\psi}_{ij}\left({\left\Vert {\tilde{p}}_{ij}\right\Vert}_{\sigma}\right)-{a}_{ij}{\tilde{q}}_{ij}-{\lambda}_1{w}_j\sum \limits_{j\in \mathcal{N}}{a}_{ij}{\tilde{q}}_{ij}\right]\\ {}=\sum \limits_{i=1}^{\mathcal{N}}{{\tilde{q}}_{ij}}^T\left(-{a}_{ij}{\tilde{q}}_{ij}-{\lambda}_1{w}_j\sum \limits_{j\in \mathcal{N}}{a}_{ij}{\tilde{q}}_{ij}\right)\\ {}=-{\tilde{q}}^T\left[L\left(\tilde{p}\right)+{a}_{ij}{I}_{\mathcal{N}}\otimes {I}_n\right]{\tilde{q}}^T\end{array}} $$
(46)

where \( \tilde{p}=\mathrm{col}{\left[{p}_1,{p}_2,{p}_{\mathcal{N}}\right]}^T\in {\mathbb{R}}^{\mathcal{N}\times n} \),\( \tilde{q}=\mathrm{col}{\left[{q}_1,{q}_2,{q}_{\mathcal{N}}\right]}^T\in {\mathbb{R}}^{\mathcal{N}\times n} \),⊗ is the Kronecker operator,\( L\left(\tilde{p}\right) \) is the Laplacian matrix of \( G\left(\tilde{p}\right) \), and \( {I}_{\mathcal{N}} \) denotes a \( \mathcal{N} \) dimensional unit vector. Due to the existing aij ∈ [0, 1], we can obtain

$$ \dot{Q}\left(\tilde{p},\tilde{q}\right)=-{\tilde{q}}^T\left[L\left(\tilde{p}\right)+{a}_{ij}{I}_{\mathcal{N}}\otimes {I}_n\right]{\tilde{q}}^T\le 0 $$
(47)

Thus, by virtual of Barbalat Lemma, it holds that

$$ \underset{t\to \infty }{\lim}\dot{Q}\left(\tilde{p},\tilde{q}\right)=0 $$
(48)

It is implied that \( Q\left(\tilde{p},\tilde{q}\right) \) is monotonic in (48). Let Ωc = {(p, q) : Q(p, q) ≤ Q0} is a compact set, existing Q0(p0, q0) = ψσ(0) > 0 is the initial energy. Moreover, it is easy to obtain that \( Q\left(\tilde{p},\tilde{q}\right)\le {Q}_0 \) is closed and bounded for any time internal t ∈ [0, t]. Therefore, the control system is asymptotically stable.

From the LaSalle invariance principle, all solutions \( \left(\tilde{p},\tilde{q}\right) \) of Eq. (22) starting in Ωccoverage to its largest invariant set given by

$$ S=\left\{\left(\tilde{p},\tilde{q}\right)\in {\mathbb{R}}^{\mathcal{N}\times n}:\dot{Q}\left(p,q\right)=0\right\} $$
(49)

It can be evident that \( \dot{Q}\left(\tilde{p},\tilde{q}\right)=0 \) and \( \tilde{q}=0 \), and we can deduce that velocities of all agents asymptotically match each other, i.e.,

$$ {q}_1={q}_2=\cdots ={q}_{\mathcal{N}} $$
(50)

Now, part i) and ii) have been proved. Subsequently, we prove part iii) by contradiction strategy. Assume existing an instant time t = t1 > 0, two individuals k and l collide, i.e., ‖pl(t1) − pk(t1)‖ ≤ D, where D is the minimum distance between two agents. For all t > 0, we obtain a smooth potential function,

$$ {\displaystyle \begin{array}{l}Q\left(p\left({t}_1\right)\right)=\frac{1}{2}\sum \limits_i\sum \limits_{j\ne i}{\psi}_{\sigma}\left({\left\Vert {p}_i-{p}_j\right\Vert}_{\sigma}\right)\\ {}={\psi}_{\sigma}\left({\left\Vert {p}_l\left({t}_1\right)-{p}_k\left({t}_1\right)\right\Vert}_{\sigma}\right)+\frac{1}{2}\sum \limits_{i\in \mathcal{N}\backslash \left\{k,l\right\}}\sum \limits_{j\in \mathcal{N}\backslash \left\{i,k,l\right\}}{\psi}_{\sigma}\left({\left\Vert {p}_i\left({t}_1\right)-{p}_j\left({t}_1\right)\right\Vert}_{\sigma}\right)\\ {}\ge {\psi}_{\sigma}\left({\left\Vert {p}_l\left({t}_1\right)-{p}_k\left({t}_1\right)\right\Vert}_{\sigma}\right)\end{array}} $$
(51)

Hence, existing Q(p(t1)) ≥ ψσ(0) = Q0. However, it is in contradiction with an inequality

$$ Q\left(p\left({t}_1\right)\right)\le {Q}_0 $$
(52)

Therefore, in terms of (51) and (52) in contradiction with the invariant principle, no two agents collide at any time t ≥ 0.

Furthermore, assume that G(t) switches at time tk,k = 1, 2, ⋯, and G(t) is a fixed graph on each time-interval [tk − 1, tk]. Note that Q0 is finite and the time derivative ofQ(t) in t ∈ [t0, t1] is (47), existing an inequality

$$ Q\left({t}_1\right)\le {Q}_0<\infty, \forall t\in \left[{t}_0,{t}_1\right) $$
(53)

Such that

$$ {\lim}_{d_{ij}\to R}\psi \left({d}_{ij}\right)=\infty $$
(54)

Clearly, it is implied that there is no edges will be lost before t1 and be added at switching t1. Similar to the t ∈ [t0, t1], time derivative Q(t) on each time-interval [tk − 1, tk], is also satisfied with (47). It can be given by

$$ Q\left({t}_k\right)\le Q\left({t}_{k-1}\right)\le {Q}_0<\infty, \forall t\in \left[{t}_{k-1},{t}_k\right),k=1,2\cdots $$
(55)

Thus no edges will be lost before tk and be added at switching tk. In addition, by virtue of the Assumption 2, G(t) can be guaranteed to keep connectivity for all t ≥ 0.

From the above proof, we can conclude that each agent in swarm can asymptotically converge to a consensus, and no collisions and splittings occur between any two agents. □.

Remark 12:

For case of δij ≥ Δi, its stability proof is similar to the case of δij < Δi. Due to space limitations, a more rigorous proof is omitted herein.

5 Simulation results

In this section, we will provide simulation results to demonstrate effectiveness and robustness of our proposed approach. We consider a crowed UUV swarm with N = 51. The initial positions are located in 50 × 50 m such that the initial graph is connected, and initial headings and velocities are chosen from [−π, π] rad and [0, 10]m/s, respectively. Without the loss of generality, the heading of each agent specifies the direction of velocity, and Vmax = 15m/s. The nonlinear UUV model can be seen in [27]. The following parameters remain fixed through all simulations: ϕ = 2π, ω = [−π/6, π/6],mc = 12,κ = 8,αij = 1.2,βij = 1.2,χij = 1.2, ϖα = 100, R = 12m, ϖβ > 100, dmin = 4m,ε = 0.5,ƛ = 0.5,ϒq = 1,ϒm = 0.2,Td =  diag [100, 100, 100],Kd =  diag [10, 5, 6],Fd = 1, and D = 0.5m.

All experiments are divided into two cases: without external stimuli signal and with external stimuli signal, which are conducted in MATLAB and C++ Platform. In addition, a computer with Intel(R) Core™i7 CPU3.20GHz is utilized for simulations.

5.1 Self-organized cooperative control without external stimuli signal

In the absence of external stimuli signal, assuming that a crowded UUV swarm performs a desired velocity [5, 5]Τ m/s. The experimental results are demonstrated to prove the asymptotic convergence of the proposed self-organized cooperative control in Figs.6, 7 and 8.

Fig. 6
figure 6

Flocking without external stimuli signal: (a) Initial pattern, (b) path trajectory, (c) final pattern and (d) relative distance

Fig. 7
figure 7

The velocities of all individuals at t = 60s. (a) velocity-x and (b) velocity-y

Fig. 8
figure 8

The average position error of all individuals at t = 60s. (a) position-x average error and (b) position-y average error

Figure 6 exhibits the flocking trajectories without external stimuli signal. Fig. 6a shows the initial pattern, in which each individual is marked by green circle with arrow, and the scale of arrow denotes the magnitude of initial velocity. Fig. 6b shows the path trajectories, it is clearly observed that the swarm can be self-organized to achieve the asymptotic convergence. Moreover, consensus and synchronicity can be simultaneously guaranteed as shown in Fig. 6c, in which all velocities are equal to[5, 5]Τ. Fig. 6d plots the relative distance dij of crowded UUV swarm, and dij satisfies D ≤ dij ≤ R. Hence, we can deduce a conclusion that without any collisions and splits occurring in self-organized motion.

Figure 7 plots velocities of all individuals including velocity-x and velocity-y, respectively. Due to great convergence of the proposed consensus, these velocities can be quickly converged at t = 10s, and flocking swarm can be synchronous to the desired velocity [5, 5]Τ at t = 50s. Additionally, one can be seen that the crowded swarm can keep a relatively stable distance without a larger change after t = 10s in Fig. 8. From all above results, all observations are in close agreement with our theoretical predictions, and it is implied that our approach can achieve a self-organized flocking without triggering any conflict and collision.

5.2 Self-organized cooperative control with external stimuli signal

In the presence of an external stimuli signal, assuming that the crowded UUV swarm performs a desired velocity [5, 5]Τ m/s, and an informed agent who moves on boundary of swarm changes its heading in responding to the stimuli signal, which is given by

$$ {u}_i^s(t)=\hslash \left({q}_i-{q}_i^s\right) $$
(56)

where  = 10 denotes the feedback gain, and \( {q}_i^s={\left[5,0\right]}^T \) is an expected velocity of agent i who is labeled as informed agent.

In order to verify the efficiency of our proposed control approach, self-organized fission/fusion method (SFF) [39] and single-informed-based distributed consensus (SDC) [40] are employed to perform the comparative experiments. The differences between the three methods are the neighbor number and following mechanism. It is noted that the external stimuli signal appears at t = 60s after flocking achieved, and comparison results are illustrated in Figs. 9, 10 and 11 and Table 1.

Fig. 9
figure 9

The results of SFF under an external stimuli signal. (a) Flocking trajectory and (b) Final pattern

Fig. 10
figure 10

The results of SDC under an external stimuli signal. (a) Flocking trajectory and (b) Final pattern

Fig. 11
figure 11

The results of our approach under an external stimuli signal. (a) Flocking trajectory and (b) Final pattern

Table 1 Comparisons of relative distances (The optimal result is in bold)

Figures 9, 10 and 11 present the results of comparisons among SFF, SDC and our proposed approach in the presence of an external stimuli signal, respectively. The Figs. 9a, 10a and 11a exhibit flocking trajectories, and Figs. 9b, 10b and 11b show their final patterns, respectively, in which the agent marked with red circle indicates an informed agent and the dotted line indicates its trajectory after t = 60s. The informed agent changes its heading from [5, 5]Τ to [5, 0]Τ at t = 60s, and its neighbors can synchronously change their original heading in Fig. 9. However, all neighbors eventually fail to track the informed agent, which leads to a splitting swarm into two sub-groups. The one sub-group is composed of the only informed agent who tracks the stimuli signal, and the other sub-group consisting 50 individuals moves in a direction between [5, 5]Τ and [5, 0]Τ. The similar results are provided in Fig. 10, where the swarm is also separated into two sub-groups. The only difference between Figs. 9 and 10 is that the direction of SDC with respect to the informed agent is smaller than that of SFF. The best performance of our proposed approach can be seen in Fig. 11, in which all neighbors accurately follow the informed agent and the velocity consensus can be guaranteed, such that the flocking pattern remains stable.

Additionally, the relative distances among crowded UUV swarm are considered after external stimuli signal appearing at t = 60s, as shown in Table 1. It is observed that the minimum and maximum relative distance dij of our proposed consensus are 1.1845 m and 11.9813 m, respectively, both of which satisfy D ≤ dij ≤ R, such that the objectives (9) and (10) can be guaranteed without any splits and collisions even under a stimuli signal. Furthermore, the minimum relative distances of other approaches are larger than the predefined collision distance D = 0.5m. Thus, the SFF and SDC can also achieve the avoiding collisions. However, maximum relative distances, 54.6522 m and 23.4991 m, are larger than the predefined sensing distance R = 12m, which indicates that communicating connections of SFF and SDC are broken due to splits occurring. Obviously, it is evident that our approach is more robust against fluctuations and disturbances in comparison to other approaches.

The reason why these differences are due to different the following mechanisms existed in the three approaches. Clearly, both the SFF and SDC employ the average mechanism, in which two limitations are exposed. (i) All individuals within sensing range are automatically activated to participate in local interaction, where how to optimize information transmission and computing is not fully considered. (ii) The averaged-synergy filters or dilutes the external stimuli signals, such that all heterogeneous propensities and their effects may be ignored. Conversely, our proposed approach can alleviate the above two limitations from two aspects. (i) Our following-interaction framework can ensure the minimum number and optimal distribution of neighborhood. (ii) Our adaptive computing strategy considers all individual differences via single-nearest-neighbor and multiple-nearest-neighbors synergies with nonlinear weights.

5.3 Discussion

In order to further verify the robustness of our proposed approach, three quantitative indicators such as mean heading (MH), orderness parameter (OP) and scale parameter (SP) are devised to demonstrate the comparative performances.

The MH is defined as

$$ {\varphi}_{MH}(t)=\frac{1}{N}\left|\sum \limits_{i=1}^N,{e}^{j{\vartheta}_i}\right| $$
(57)

where N represents the swarm size. Generally, a great φMH(t) value shows a worse flocking.

The OP is defined as:

$$ {\varphi}_{OP}=\frac{1}{N}\left|\sum \limits_{i=1}^N,\frac{q_i}{\left\Vert {q}_i\right\Vert}\right| $$
(58)

where ‖qi‖ represents the Euclidean norm of velocity vector of agent i. Apparently,φOP ∈ [0, 1], the swarm is orderness if φOP = 1. Conversely, it is unordered whenφOP = 0.

The SP is defined as

$$ {\varphi}_{SP}=\sqrt{\frac{1}{N}\sum \limits_{i=1}^N{\left\Vert {p}_i-{p}_c\right\Vert}^2} $$
(59)

where pc represents the COM given by Eq. (41). Obviously, a smaller φSP indicates a more cohesive distribution.

In particular, all simulations are repeated 50 times, and the results are demonstrated in Fig. 12.

Fig. 12
figure 12

Comparison results of three approaches: (a) MH, (b) OP and (c) SP

Figure 12a demonstrates curves of MH results. At t = 0, all results obtained from three approaches are divergent. At t = 10s, the values asymptotically converge to a constant due to desired head [5, 5]Τ followed by all UUVs. When an external stimuli signal \( {q}_i^s={\left[5,0\right]}^T \) appears t = 60s, this indicator significantly declines and slightly converges to its minimum at t = 65 s. Although SFF and SDC complete flocking, only our approach can simultaneously guarantee flocking and following behaviors. Therefore, it is implied that the tracking and synchronizing capabilities of our approach outperform SFF and SDC.

As shown in Fig. 12b, the OP result of our approach is sensitive to internal and external disturbances in comparison to other approaches, and greater convergence speed can be obtained to form an ordered flocking, which can be seen when the internal and external disturbances appear at t = 0 and t = 60s, respectively. Additionally, the OP result of our approach eventually converges to 1, which is higher than that of SFF and SDC. The reasons accounting for this result are insufficient information interactions and long convergence time in SFF and SDC. Consequently, it is implied that our approach has an excellent orderness.

Figure 12c illustrates SP results. In initial stage, there exists a fluctuation. After t = 10s, SP results gradually converge. Obviously, this indicator monotonously increases when individuals response to the external stimuli signal, and our approach is better than that of other two methods. The reason why these results occur is that there is no splits in our approach, but other two approaches separate the swarm into two sub-groups. Accordingly, it is clear that our designed approach is effective for UUV swarm in the presence of external stimuli signal.

Summarily, it is clear that our proposed control approach shows better robustness against fluctuations and disturbances in comparison to the SFF and SDC. It is also confirmed that the minimum number and optimal distribution of neighborhood can improve the speed of information dissemination in local interaction. Furthermore, the proposed approach can effectively achieve the swarm cohesion and collision avoidance.

6 Conclusions

In this paper, a novel bio-inspired self-organized cooperative control is developed to address behavior conflict and chain collision for crowded UUV swarm in the presence of fluctuations and disturbances. All simulation results prove that our proposed approach can obtain robust control performance without any collision and split occurring, in comparison to the existing methods via MH, OP and SP indicators. The primary contributions of this paper are summarized as follows:

(i) An optimized following interaction framework is devised by the topological interaction and visual interaction to improve the synchronous velocities and save the sensing costs, in which two constrains, i.e., limited view and crowded phenomenon, are simultaneously considered. This framework can ensure the minimum number and optimal distribution of the neighborhood for each agent in dynamic graph, instead of the previous fixed-number and fixed-distance studies.

(ii) An adaptive dynamic computing model is proposed by incorporating SNNF and WMNNF to establish an effective decision-making strategy for solving the problems of behavior conflict and chain collision, where the influence of each individual on sensitive behavior is synthesized by a nonlinear weight. Therefore, it is essentially a non-average mechanism unlike the traditional average-velocity mechanism.

(iii) By virtue of adaptive dynamic interaction topology, a cooperative control protocol based on the proposed following model and mathematics-based potential fields is designed to steer a self-organized flocking with two abilities of the connectivity-preserving and collision-avoiding. Furthermore, the sufficient condition is analyzed via Laypunov and LaSalle invariance principle. It is confirmed that our approach can be suitable for cooperative control of crowded UUV swarm.

For future works, the jointed effects of time delays and packet loss will be considered in this self-organized cooperative control approach, and the control performance can be further improved towards achieving the lager-scale heterogeneous UUV swarm.