Cooperative iterative learning for uncertain nonlinear agents in leaderless switching networks

doi:10.1016/j.automatica.2021.109692

Automatica

Volume 129, July 2021, 109692

https://doi.org/10.1016/j.automatica.2021.109692

Abstract

This paper addresses cooperative iterative learning tasks for leaderless networks of nonlinear agents, accounting for the effects of switching topologies, locally Lipschitz nonlinearities, initial state shifts, and external disturbances. A learning-based distributed algorithm is proposed that realizes the desired relative formation behaviors of leaderless networks while ensuring that all agents’ trajectories remain uniformly bounded. A Lyapunov-like analysis approach is introduced to ensure learning convergence at an exponential rate by leveraging the properties of products of stochastic matrices; the same approach can also be employed to develop input-to-state consensus results for discrete parameterized systems.

Introduction

Cooperative control of networks has attracted considerable attention owing to its potential applications in, e.g., biological systems (Olfati-Saber, 2006), multiple-vehicle systems (Fax & Murray, 2004), and sensor networks (Schenato & Fiorentin, 2011). Typically, a distributed protocol is designed for each agent by using the relative information between its nearest neighbors and itself such that the relative state deviations between agents asymptotically converge to specified values, which yields the relative formation behaviors of networks (Olfati-Saber, Fax, & Murray, 2007). In particular, if all specified values equal zero, then the relative formation behavior collapses to the consensus behavior, in which the states of agents asymptotically achieve agreement. For detailed explanations of cooperative control, see, e.g., Oh, Park, and Ahn (2015), Ren, Beard, and Atkins (2005), and the references therein.
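The protocol idea described above can be illustrated with a minimal sketch. It is a textbook-style discrete-time update and not the algorithm of this paper; the three-agent path graph, the offsets `d`, the step size `eps`, and the initial states are all assumptions chosen for illustration. Each agent moves according to the offset-corrected differences between its neighbors' states and its own, so that $x_{i} - x_{j}$ approaches $d_{i} - d_{j}$; setting all offsets to zero gives plain consensus.

```python
import numpy as np

# Assumed setup: 3 agents with scalar states on a fixed undirected path graph 0 - 1 - 2.
A = np.array([[0., 1., 0.],
              [1., 0., 1.],
              [0., 1., 0.]])      # adjacency weights a_ij (hypothetical)
d = np.array([0.0, 1.0, 3.0])    # desired offsets: x_i - x_j should approach d_i - d_j
eps = 0.3                        # step size, chosen below 1 / (maximum degree)

def step(x):
    """One update that uses only relative information between neighbors."""
    y = x - d                                  # offset-corrected states
    correction = A @ y - A.sum(axis=1) * y     # sum_j a_ij * (y_j - y_i)
    return x + eps * correction

x_init = np.array([5.0, -2.0, 0.5])
x = x_init.copy()
for _ in range(200):
    x = step(x)

relative = x[:, None] - x[None, :]   # achieved deviations x_i - x_j
target = d[:, None] - d[None, :]     # specified deviations d_i - d_j
print(np.round(relative - target, 9))
```

In this sketch the common value that the offset-corrected states settle on is determined by the initial states rather than prescribed in advance, which is the defining feature of a leaderless network.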

Motivated by the practical applications of satellite networks (Ahn, Moore, & Chen, 2010) and marching bands (Meng & Moore, 2016, 2017), in which networks operate in a repetitive manner and the objective is to maintain a high-precision relative formation behavior between agents within a finite time interval, cooperative iterative learning, a combined study of cooperative control and iterative learning control (ILC) for networks, has been explored (Ahn et al., 2010; Bu et al., 2018, 2019; Hui et al., 2020; Jin, 2016; Li et al., 2018; Meng & Moore, 2016, 2017; Shen & Xu, 2018; Yang et al., 2016). Typically, under a learning-based distributed algorithm, the input signal of each agent is corrected based on the relative formation deviations between its nearest neighbors and itself from the previous repetitions. In this way, the relative formation behavior is gradually achieved over the finite time interval as the iterations increase.
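The learning mechanism described above can be sketched as follows. This is a generic P-type update written for illustration only, not the algorithm proposed in this paper: the fixed path graph with Laplacian `L`, the offsets `d`, the gain `gamma`, the globally Lipschitz nonlinearity `f`, and the identical initial state in every iteration are all simplifying assumptions. After each repetition, every agent corrects its stored input at time $t$ using the disagreement with its neighbors observed at time $t + 1$.

```python
import numpy as np

# Assumed setup: 3 agents on a fixed undirected path graph 0 - 1 - 2 (Laplacian L).
L = np.array([[ 1., -1.,  0.],
              [-1.,  2., -1.],
              [ 0., -1.,  1.]])
d = np.array([0.0, 1.0, 3.0])     # desired offsets (hypothetical, time-invariant)
x0 = np.array([5.0, -2.0, 0.5])   # same initial state in every iteration (assumption)
T, K, gamma = 4, 300, 0.3         # horizon, number of iterations, learning gain

def f(x):
    """Illustrative globally Lipschitz nonlinearity (an assumption of this sketch)."""
    return np.sin(x)

def run_iteration(u):
    """Simulate x(t+1) = f(x(t)) + u(t) for t = 0..T-1; returns states of shape (T+1, n)."""
    x = np.empty((T + 1, x0.size))
    x[0] = x0
    for t in range(T):
        x[t + 1] = f(x[t]) + u[t]
    return x

def learn(u):
    """Run one repetition, then correct each input from neighbor disagreements."""
    x = run_iteration(u)
    y = x - d                          # offset-corrected states
    # Row t of (y[1:] @ L.T) holds sum_j a_ij * (y_i - y_j) evaluated at time t + 1.
    return u - gamma * (y[1:] @ L.T), x

u = np.zeros((T, x0.size))
for k in range(K):
    u, _ = learn(u)

x = run_iteration(u)
y = x[1:] - d
rel_err = y - y.mean(axis=1, keepdims=True)   # disagreement at t = 1..T
print(np.max(np.abs(rel_err)))
```

No reference trajectory appears anywhere in the update; the agents only reduce their mutual disagreement, and the trajectory they end up sharing (up to the offsets) emerges from the learning process.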

Most of the above-mentioned cooperative iterative learning problems fall into the leader–follower framework (Bu et al., 2018, 2019; Hui et al., 2020; Jin, 2016; Li et al., 2018; Shen & Xu, 2018; Yang et al., 2016). Benefiting from the specified reference trajectory of the leader agent, the analysis of the relative formation behaviors can be transformed into an ILC stability problem, in which the errors between the reference trajectory and the states of agents are analyzed directly. For this class of problems, various mature results and analysis techniques can be leveraged, such as the contraction mapping-based analysis approach in, e.g., Bu et al. (2018, 2019), Hui et al. (2020), and Yang et al. (2016), and the composite energy function-based Lyapunov-like approach in, e.g., Jin (2016), Li et al. (2018), and Shen and Xu (2018). However, in leaderless networks, the trajectories to which all agents will converge are unknown in advance. As a result, the behavior analysis problem of agents cannot be transformed into an ILC stability problem, and neither of these two methods can be applied, which makes the problem more challenging. For some classes of leaderless networks, e.g., sensor networks (Schenato & Fiorentin, 2011), results developed in the leader–follower framework may not be effective. Thus, it is of great significance to study cooperative iterative learning for leaderless networks (Meng & Moore, 2016, 2017).

Another critical problem of cooperative iterative learning concerns the network topologies. For a network with a fixed and quasi-strongly connected topology, some remarkable results on cooperative iterative learning have been given in Bu et al. (2018), Jin (2016), Li et al. (2018), and Shen and Xu (2018). When switching topologies are taken into account, the strict model repetitiveness, viewed as one of the basic assumptions of ILC, is no longer satisfied. In some recent studies of ILC, this assumption has been relaxed (see the ILC results of, e.g., Altın, Willems, Oomen, and Barton (2017), Chien et al. (2018), and Yu and Li (2017), which accommodate nonrepetitive plant models). However, the topology may switch from one to a totally different one. Thus, the essential model nonrepetitiveness arising from switching topologies cannot be modeled in the way used in, e.g., Altın et al. (2017), Chien et al. (2018), and Yu and Li (2017). In the study of cooperative iterative learning for networks with switching topologies in Yang et al. (2016), a severe requirement is imposed, namely that each of the possible topologies be strongly connected. To relax this requirement, the cooperative iterative learning results of, e.g., Bu et al. (2019), Hui et al. (2020), and Meng and Moore (2016, 2017) admit a joint quasi-strong connectivity condition, which has been verified to be the necessary condition for ensuring the relative formation behaviors of networks with switching topologies.
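The difference between requiring every topology to be connected and requiring connectivity only jointly can be made concrete with a small sketch. It assumes the common reading that a digraph is quasi-strongly connected when some node can reach every other node along directed edges, and that a collection of digraphs is jointly quasi-strongly connected when their union is; the paper's precise condition (for instance, how the switching intervals are chosen) is not reproduced here, and the helper names and edge sets are hypothetical.

```python
def reachable_from(root, n, edges):
    """Return the set of nodes reachable from root along directed edges (src, dst)."""
    adj = {v: [] for v in range(n)}
    for src, dst in edges:
        adj[src].append(dst)
    seen, stack = {root}, [root]
    while stack:
        v = stack.pop()
        for w in adj[v]:
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return seen

def is_quasi_strongly_connected(n, edges):
    """True if at least one node reaches all n nodes."""
    return any(len(reachable_from(r, n, edges)) == n for r in range(n))

def is_jointly_qsc(n, edge_sets):
    """True if the union of the given edge sets is quasi-strongly connected."""
    union = set().union(*edge_sets)
    return is_quasi_strongly_connected(n, union)

# Three topologies on 4 agents, each containing a single edge (hypothetical example).
G1, G2, G3 = {(0, 1)}, {(1, 2)}, {(2, 3)}
print(is_quasi_strongly_connected(4, G1))   # a single topology on its own
print(is_jointly_qsc(4, [G1, G2, G3]))      # the three topologies taken together
```

In this example none of the individual topologies lets one agent influence all the others, yet their union contains the directed path 0 → 1 → 2 → 3.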

From the perspective of agent dynamics, the existing results on cooperative iterative learning mainly concentrate on linear agents and a special class of nonlinear agents whose dynamics fulfill a globally Lipschitz condition (Bu et al., 2018, 2019; Hui et al., 2020; Meng & Moore, 2016; Yang et al., 2016). This is because these results are developed based on ILC, in which the globally Lipschitz nonlinearity is seen as one of the basic assumptions (Ahn, Chen, & Moore, 2007). Naturally, they may no longer be feasible if agents’ non-globally Lipschitz nonlinearities are considered, as shown in Meng and Moore (2020). To address this, some efforts have been made in, e.g., Meng and Moore (2017) so that agents subject to a class of non-globally Lipschitz nonlinearities can be accommodated. However, it has not been disclosed whether and how locally Lipschitz nonlinearities can be dealt with in the framework of cooperative iterative learning. In addition, robustness is also regarded as one of the most significant problems in cooperative iterative learning because the uncertainties that arise from initial state shifts and external disturbances are generally unavoidable for agents.
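As a simple illustration of the gap between the globally and locally Lipschitz conditions discussed above (an example chosen here, not taken from the paper), consider the scalar map $f(x) = x^{2}$. For any $x, y \in [-r, r]$ with $r > 0$,

$$| f(x) - f(y) | = | x + y | \, | x - y | \leq 2 r \, | x - y | ,$$

so $f$ is Lipschitz on every bounded interval, with a constant $2r$ that depends on the interval. No single constant works on all of $R$: taking $y = 0$ gives $| f(x) - f(0) | / | x - 0 | = | x |$, which is unbounded as $| x | \to \infty$. Hence $f$ is locally but not globally Lipschitz, and an analysis that relies on one global Lipschitz constant does not cover it unless the trajectories are first shown to stay in a bounded set.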

In this paper, we contribute to the cooperative iterative learning problem for leaderless networks by simultaneously handling switching topologies, locally Lipschitz nonlinear dynamics, initial state shifts, and external disturbances. Compared with the existing results of, e.g., Bu et al. (2018, 2019), Hui et al. (2020), Jin (2016), Li et al. (2018), Meng and Moore (2016, 2017), Shen and Xu (2018), and Yang et al. (2016), the main contributions are summarized in the following three aspects.

(1) A novel learning-based design of distributed algorithms is presented, in which the consistency between topology and relative formation error information is investigated. In contrast with, e.g., Meng and Moore (2016, 2017), the relative formation errors among neighboring agents that are determined by the current network topology are employed for the input updating of each agent.
(2) A new Lyapunov-like approach incorporating properties of stochastic matrices is leveraged to show input-to-state consensus results for discrete parameterized systems. Such an approach is rarely reported in ILC, where convergence analysis is generally carried out with contraction mapping methods (e.g., Bu et al., 2018, 2019; Hui et al., 2020; Yang et al., 2016). The construction of the Lyapunov function differs distinctly from that of the composite energy functions in, e.g., Jin (2016), Li et al. (2018), and Shen and Xu (2018), and thus the employed approach enriches the analysis tools of ILC for network systems.
(3) The relative formation behaviors for leaderless networks can be accomplished at an exponential rate, although the convergence analysis cannot be transformed into an ILC stability problem as done in, e.g., Bu et al. (2018, 2019), Hui et al. (2020), Jin (2016), Li et al. (2018), Shen and Xu (2018), and Yang et al. (2016). Moreover, the switching topologies and the locally Lipschitz nonlinear dynamics of agents, which violate two basic assumptions of ILC (strict model repetitiveness and globally Lipschitz nonlinearity, respectively), are well treated. These properties make our cooperative iterative learning results greatly extend those of, e.g., Bu et al. (2018, 2019), Hui et al. (2020), Jin (2016), Li et al. (2018), Meng and Moore (2016, 2017), Shen and Xu (2018), and Yang et al. (2016).

We organize this paper as follows. The problem description is given in Section 2. The main results, including the input-to-state consensus results for the discrete parameterized system and the exponential convergence of the relative formation error among agents, are given in Section 3, and the corresponding technical proofs are provided in Section 4. Conclusions are drawn in Section 5.

Notations

$R$, $R^{n}$, and $R^{m \times n}$ represent the sets of real numbers, $n$-dimensional real vectors, and $(m \times n)$-dimensional real matrices, respectively. We employ $[0, \infty) = \{x \in R : x \geq 0\}$, $Z_{+} = \{0, 1, 2, \dots\}$, $Z_{T} = \{0, 1, 2, \dots, T\}$, and $F_{n} = \{1, 2, \dots, n\}$, and let $t^{+} = t + 1$ for $t \in Z_{+}$, as well as $1_{n} = [1, 1, \dots, 1]^{T} \in R^{n}$. For a matrix (respectively, a vector) $A$, $\| A \|_{\infty}$ is the maximum row sum matrix norm (respectively, the $l_{\infty}$ norm). A matrix $A = [a_{i j}] \in R^{n \times n}$ is called a nonnegative matrix and denoted by $A \geq 0$ if $a_{i j} \geq 0$, $\forall i, j \in F_{n}$, and $A$ is called a stochastic matrix if $A \geq 0$ and $A 1_{n} = 1_{n}$. For two matrices $A$ and $B$, $A \otimes B$ is the Kronecker product of $A$ and $B$. For a sequence $\{A_{i} \in R^{n \times n}, i \in Z_{+}\}$, denote $\prod_{i = h}^{j} A_{i} = A_{j} A_{j - 1} \dots A_{h}$ if $j \geq h$ and $\prod_{i = h}^{j} A_{i} = I_{n}$ (the $(n \times n)$-dimensional identity matrix) if $j < h$. If a scalar function $λ : [0, \infty) \to [0, \infty)$ is continuous and strictly increasing, and fulfills $λ (0) = 0$ and $\lim_{x \to \infty} λ (x) = \infty$, then it is said to belong to the class $K_{\infty}$.
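The matrix notation above can be checked numerically with a short sketch. The helper names and the two example matrices are hypothetical; the sketch only mirrors the definitions of a stochastic matrix, the left-multiplying product convention $\prod_{i = h}^{j} A_{i} = A_{j} A_{j - 1} \dots A_{h}$, and the $\infty$-norm.

```python
import numpy as np

def is_stochastic(A, tol=1e-9):
    """True if A is nonnegative and every row sums to one (A 1_n = 1_n)."""
    A = np.asarray(A, dtype=float)
    rows_sum_to_one = np.allclose(A @ np.ones(A.shape[1]), 1.0, atol=tol)
    return bool(np.all(A >= -tol) and rows_sum_to_one)

def left_product(mats, h, j):
    """Compute prod_{i=h}^{j} A_i = A_j A_{j-1} ... A_h, or the identity if j < h."""
    P = np.eye(mats[0].shape[0])
    for i in range(h, j + 1):
        P = mats[i] @ P          # later factors multiply on the left
    return P

def inf_norm(A):
    """Maximum absolute row sum for a matrix, largest absolute entry for a vector."""
    A = np.asarray(A, dtype=float)
    if A.ndim == 1:
        return float(np.max(np.abs(A)))
    return float(np.max(np.sum(np.abs(A), axis=1)))

# Two hypothetical stochastic matrices.
A0 = np.array([[0.5, 0.5, 0.0],
               [0.0, 1.0, 0.0],
               [0.0, 0.3, 0.7]])
A1 = np.array([[1.0, 0.0, 0.0],
               [0.4, 0.6, 0.0],
               [0.0, 0.0, 1.0]])

P = left_product([A0, A1], 0, 1)     # equals A1 @ A0 under the convention above
print(is_stochastic(P), inf_norm(P))
```

The product of stochastic matrices is again stochastic, so its $\infty$-norm equals one; this closure property is what makes products of such matrices a convenient object for convergence analysis.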

Problem description

Consider a network with $n$ agents denoted by $v_{1}, v_{2}, \dots, v_{n}$. Let the agents evolve over time $t \in Z_{T}$ and iteration $k \in Z_{+}$ with the following dynamics: $\left\{\begin{aligned} x_{i, k} (t^{+}) & = f_{i} (x_{i, k} (t), t) + u_{i, k} (t) + w_{i, k} (t) \\ x_{i, k} (0) & : \| x_{i, k} (0) - x_{i 0} \|_{\infty} \leq ψ_{i} σ_{i}^{k} \\ w_{i, k} (t) & : \| w_{i, k} (t) - w_{i} (t) \|_{\infty} \leq ϑ_{i} (t) υ_{i}^{k} (t) \end{aligned}\right. , \quad i \in F_{n}$ where $x_{i, k} (t) \in R^{p}$, $u_{i, k} (t) \in R^{p}$, and $w_{i, k} (t) \in R^{p}$ are the state, input, and disturbance, respectively; $x_{i 0}$ and $w_{i} (t)$ are iteration-invariant vectors that represent the steady quantities of $x_{i, k} (0)$ and $w_{i, k} (t)$, respectively; and $ψ_{i} > 0$, $0 \leq σ_{i} < 1$, $ϑ_{i} (t) > 0$, and $0 \leq$

Main results

In this section, we utilize a 2-D dynamics analysis approach to simultaneously investigate the evolution of the state in the time direction and those of the input and the relative formation error in the iteration direction.

The evolution of the state along the time axis is given exactly by (1), and to explore the evolution of the relative formation error in the iteration direction, (1) can be applied to obtain $e_{i, k + 1} (t^{+}) = e_{i, k} (t^{+}) + u_{i, k} (t) - u_{i, k + 1} (t) + f_{i} (x_{i, k} (t), t) - f_{i} (x_{i, k + 1} (t), t) + w_{i, k} (t) - w_{i, k + 1} (t) .$ By

Technical proofs

Proof of Lemma 4

Result (1) can be verified from the definitions of the union and the composition of digraphs, and result (2) can be proved in a way similar to that of Cao et al. (2008, Proposition 3). Thus, the proof details are omitted here for simplicity. □

Proof of Lemma 5

Under condition (13), we can validate the following two facts:

(i) $Ω_{k} (t) \in I_{n}$ holds for all $k \in Z_{+}$ and $t \in Z_{T - 1}$;
(ii) $G_{k} (t^{+})$ is a subgraph of $G^{s} (Ω_{k} (t))$.

With the condition (A2) and the fact (ii), we can leverage the result (1) of Lemma 4 to $G^{s} (Ω_{k_{j} (t^{+})} (t))$ , $G^{s}$

Conclusions

In this paper, we have investigated the cooperative iterative learning problem for leaderless networks subject to topologies switching in both iteration and time directions, where the agents’ locally Lipschitz nonlinear dynamics have also been involved. By utilizing the properties of products of stochastic matrices and the digraphs induced by them, we have proposed a Lyapunov-like analysis approach to derive the ISC results for the discrete parameterized system. These motivate a powerful

Jingyao Zhang received the B.S. degree in information and computing science and the M.S. degree in mathematics from Beihang University (BUAA), Beijing, China, in 2015 and 2018, respectively. He is currently pursuing the Ph.D. degree with School of Automation Science and Electrical Engineering at Beihang University (BUAA).

References (22)

Altın, B., et al. (2017). Iterative learning control of iteration-varying systems via robust update laws with experimental implementation. Control Engineering Practice.
Jin, X. (2016). Adaptive iterative learning control for high-order nonlinear multi-agent systems consensus tracking. Systems & Control Letters.
Li, J., et al. (2018). Adaptive iterative learning protocol design for nonlinear multi-agent systems with unknown control direction. Journal of the Franklin Institute.
Meng, D., et al. (2016). Learning to cooperate: Networks of formation agents with switching topologies. Automatica.
Meng, D., et al. (2017). Robust cooperative learning control for directed networks with nonlinear dynamics. Automatica.
Oh, K.-K., et al. (2015). A survey of multi-agent formation control. Automatica.
Schenato, L., et al. (2011). Average TimeSynch: A consensus-based protocol for clock synchronization in wireless sensor networks. Automatica.
Shen, D., et al. (2018). Distributed learning consensus for heterogenous high-order nonlinear multi-agent systems with output constraints. Automatica.
Yang, S., et al. (2016). Iterative learning control with input sharing for multi-agent consensus tracking. Systems & Control Letters.
Ahn, H.-S., et al. (2007). Iterative learning control: Brief survey and categorization. IEEE Transactions on Systems, Man, and Cybernetics–Part C: Applications and Reviews.
Ahn, H.-S., et al. (2010). Trajectory-keeping in satellite formation flying via robust periodic learning control. International Journal of Robust and Nonlinear Control.

His current research interests include iterative learning control and nonlinear control. He was a co-recipient of the “Best Paper Award” from the IEEE 7th Data Driven Control and Learning Systems Conference in 2018.

Deyuan Meng received the B.S. degree in mathematics and applied mathematics from Ocean University of China (OUC), Qingdao, China, in June 2005, and the Ph.D. degree in control theory and control engineering from Beihang University (BUAA), Beijing, China, in July 2010. From November 2012 to November 2013, he was a Visiting Scholar with the Department of Electrical Engineering and Computer Science, Colorado School of Mines, Golden, CO, USA. He is currently a Full Professor with the Seventh Research Division and the School of Automation Science and Electrical Engineering, Beihang University (BUAA).

His current research interests include iterative learning control, data-driven control, and multi-agent systems.

^☆: This work was supported by the National Natural Science Foundation of China under Grants 61922007 and 61873013. The material in this paper was not presented at any conference. This paper was recommended for publication in revised form by Associate Editor Bert Tanner under the direction of Editor Christos G. Cassandras.

Brief Paper