Data-driven asymptotic stabilization for discrete-time nonlinear systems

doi:10.1016/j.sysconle.2013.11.003

Systems & Control Letters

Volume 64, February 2014, Pages 79-85

https://doi.org/10.1016/j.sysconle.2013.11.003

Abstract

In this paper, we propose a data-driven feedback controller design method based on the Lyapunov approach, which guarantees asymptotic stability of the closed loop and enlarges the estimate of its domain of attraction (DOA). First, sufficient conditions for a feedback controller to asymptotically stabilize the discrete-time nonlinear plant are proposed. That is, if a feedback controller belongs to an open set consisting of pairs of control input and state whose elements make the difference of a control Lyapunov function (CLF) negative-definite, then the controller asymptotically stabilizes the plant. Then, for a given CLF candidate, an algorithm that estimates this open set using only data is proposed. With the estimate, it is checked whether the candidate is a CLF. If it is, a feedback controller satisfying the sufficient conditions above is designed using only data. Finally, the estimate of the DOA for the closed loop is enlarged by finding an appropriate CLF from a CLF candidate set based on data. Because the controller is designed directly from data, the complexity of building a model and the modeling error are avoided.

Introduction

The Lyapunov approach provides a powerful framework for analyzing the stability of nonlinear dynamical systems as well as for designing feedback controllers that guarantee closed-loop stability [1], [2]. The synthesis typically relies on the co-design of a CLF and a state feedback controller. Historically, this has been an important but challenging problem for the general class of nonlinear systems [3]. The main bottleneck of these methods lies in the construction of the CLF. Moreover, the DOA, an invariant set characterizing the stabilizable region around the equilibrium, needs further investigation because global stabilization is difficult to achieve in practical applications [4]. A number of studies have addressed this problem. Their results can be divided into two categories: model-based and data-driven.

Among model-based approaches, a simple way to obtain a quadratic CLF is to solve the Riccati equation associated with the linearization of the nonlinear system, which often leads to a small DOA for the closed loop because of approximation errors. Based on sum-of-squares (SOS) programming [5], [6], a polynomial CLF can be constructed and an enlarged estimate of the DOA is obtained simultaneously [7], [8], [9]. However, SOS-based methods can handle only polynomial or rational systems. SOS programming has been extended to non-polynomial systems by variable transformation with algebraic constraints [10]. In [4], a fractional programming problem is formulated to construct a CLF for non-polynomial systems. The main disadvantage of model-based approaches is that a model of the nonlinear plant is a prerequisite and the model must be control-affine.

In fact, many plants are hard to model effectively. Designing controllers directly from data, bypassing the modeling step, is called the data-driven control approach; it is promising and has received much attention recently. Many data-driven control approaches exist, such as unfalsified control (UC) [11] and model-free adaptive control (MFAC) [12], [13]. The main ideas of these approaches differ considerably, but none of them requires a plant model and all of them use data directly.

Among data-driven approaches, the control Lyapunov measure approach [14] deals with a problem similar to the one in this paper. Instead of a point-wise notion of the CLF, a control Lyapunov measure is constructed in measure-theoretic terms for a nonlinear plant. This approach is data-driven, even though the authors do not describe it as such. In this approach, the model is used to generate data for computing the Markov matrix. The disadvantage of this approach is that it leads to the weaker notion of coarse stability, whereas our conclusion on stability is exact.

In this paper, we propose a data-driven asymptotic stabilization method for discrete-time nonlinear systems, in which a feedback controller that asymptotically stabilizes the plant is obtained directly from data and the DOA of the closed loop is enlarged. First, sufficient conditions for the feedback controller to asymptotically stabilize the plant are proposed. For a given CLF of the plant, if a feedback controller belongs to an open set consisting of pairs of control input and state of the plant whose elements make the difference of the CLF negative-definite, the feedback controller asymptotically stabilizes the plant. However, under traditional controller design frameworks, it is hard to obtain this set for general nonlinear plants. Then, based on a data set collected from the plant and a given CLF candidate, an estimate of the set is obtained. The idea of estimating the set, similar to set oriented numerical methods [15], is to cover the set by a finite number of cells that contain data points satisfying some specific conditions. From the estimate of the set, it is easy to check whether the candidate is a CLF. If it is, a feedback controller satisfying the sufficient conditions above is designed using data. Finally, the estimate of the DOA for the closed loop is enlarged by finding an appropriate CLF from a CLF candidate set based on data. An unconstrained nonlinear optimization problem, which can be solved by metaheuristic optimizers, is proposed to find the appropriate CLF. Our method uses data directly and bypasses the modeling step. Hence, the complexity of building a model and the modeling error are avoided.
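The cell-covering idea can be illustrated with a minimal sketch. It assumes recorded transitions $(x, u, x^{+})$, a user-supplied CLF candidate $V$, and a uniform grid over the joint input-state space. The function names, the cell size, the rule "keep a cell only if every data point in it decreases $V$", and the scalar plant used to generate the data are all illustrative assumptions; the specific cell conditions of the paper are not given in this excerpt, so this is not the authors' algorithm.

```python
import numpy as np


def lyapunov_difference(V, x, x_next):
    """Difference of V along one recorded transition: V(x(k+1)) - V(x(k))."""
    return V(x_next) - V(x)


def estimate_decrease_cells(transitions, V, cell_size):
    """Return the grid cells in (x, u) space whose data points all decrease V.

    transitions: iterable of (x, u, x_next) arrays taken from recorded data.
    Assumed rule: a cell is kept only if every data point falling in it
    satisfies V(x_next) - V(x) < 0.
    """
    all_negative = {}
    for x, u, x_next in transitions:
        if not np.any(x):
            continue  # skip the equilibrium, where the difference is zero
        z = np.concatenate([x, u])
        cell = tuple(int(i) for i in np.floor(z / cell_size))
        decreases = lyapunov_difference(V, x, x_next) < 0.0
        all_negative[cell] = all_negative.get(cell, True) and decreases
    return {cell for cell, ok in all_negative.items() if ok}


def plant(x, u):
    """Hypothetical scalar plant, used only to generate data for the demo."""
    return 1.2 * x + u


def V(x):
    """Quadratic CLF candidate V(x) = x^T x."""
    return float(x @ x)


states = [np.array([s]) for s in (-1.0, 1.0)]
inputs = [np.array([v]) for v in (-1.0, 0.0, 1.0)]
transitions = [(x, u, plant(x, u)) for x in states for u in inputs]

cells = estimate_decrease_cells(transitions, V, cell_size=0.5)
print(sorted(cells))
```

In this toy data set only the pairs $(x, u) = (1, -1)$ and $(-1, 1)$ decrease $V$, so only their two cells survive. The estimation routine never calls `plant`; it sees the recorded transitions only.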

This paper is organized as follows. In Section 2, the control problem is formulated. In Section 3, sufficient conditions for asymptotic stabilization and estimation of the DOA for the closed loop are introduced. In Section 4, the data-driven asymptotic stabilization is derived. In Section 5, the estimate of the DOA for the closed loop is enlarged by selecting an appropriate CLF from a CLF candidate set. Finally, in Section 6, conclusions are drawn and further work is summarized.

Notation: $R$ represents the set of real numbers. $R_{+}$ represents the set of positive real numbers. ${\bar{R}}_{+}$ represents $R_{+} \cup \{0\}$. $Z_{+}$ represents the set of positive integers. ${\bar{Z}}_{+}$ represents $Z_{+} \cup \{0\}$. $R^{n}$ represents the set of real vectors with $n$ elements. For a vector $x \in R^{n}$, $\| x \|$ represents $\sqrt{x^{T} x}$. For a vector $x \in R^{n}$, $x_{(i)}$ represents the $i$-th element of $x$, $i = 1, 2, \dots, n$. For a domain $X \subset R^{n}$, $m(X)$ represents the Lebesgue measure of $X$ (in Euclidean space, the volume of $X$).

Section snippets

Problem formulation

Consider the nonlinear discrete-time system

$$x(k + 1) = f(x(k), u(k)), \quad x(0) = x_{0}, \quad k \in {\bar{Z}}_{+}, \tag{1}$$

where $x(k) \in R^{n}$ is the state, $u(k) \in R^{m}$ is the control input, and $f : R^{n} \times R^{m} \to R^{n}$ is an unknown piecewise continuous function satisfying $f(0, 0) = 0$. The system is assumed to be asymptotically stabilizable at the origin.

Although $f$ is unknown, we have a data set $T = \{T_{i}, i = 1, 2, \dots, N\}$ collected from the plant (1) without measurement noise, where $T_{i} = \{x_{i}(0 : K_{i}), u_{i}(0 : K_{i} - 1)\}$ consists of the state trajectory and the control sequence, $x_{i}(0 : K_{i}) = (x_{i}(0), \dots, x_{i}(K_{i}))$ is the

Sufficient conditions for asymptotic stabilization and estimation of DOA for closed-loops

In this section, we first introduce sufficient conditions for estimating the DOA of nonlinear discrete-time systems without control input in Lemma 1. Since the theory for nonlinear discrete-time systems closely parallels the theory for nonlinear continuous-time systems, many of the results are similar [1]. However, for estimating the DOA with a Lyapunov function, the discrete-time result deviates markedly from its continuous-time counterpart, as illustrated in Remark 2. Then, sufficient

Data-driven asymptotic stabilization

The control problem formulated in Section 2 is solved in this section and the next section. In this section, for a given CLF candidate, we propose an algorithm to get an estimate $\hat{\Omega}_{V, f}$ of $\Omega_{V, f}$ using the data set. With $\hat{\Omega}_{V, f}$, it is easy to check whether the candidate is a CLF. If it is, then a feedback controller satisfying the conditions in Lemma 2 is designed based on $\hat{\Omega}_{V, f}$ and the data set. It should be noted that only data is used in the above procedure. The problem of finding

Enlarging estimate of DOA for closed-loop

In Section 4, we design an asymptotic stabilizer based on the data set. However, our control objective is not yet fully achieved. We want an estimate of the DOA for the closed loop that is as large as possible. This is done by finding an appropriate CLF from a CLF candidate set using the data set.
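A minimal sketch of such a search is given below, under assumptions that are not taken from the paper: the candidate set is the family of quadratic functions $V_P(x) = x^{T} P x$, the data contain several recorded inputs per sampled state, the samples cover a cube centered at the origin (with the origin itself excluded), and plain random search stands in for a metaheuristic optimizer. The names `certified_level`, `level_set_volume`, `candidate`, `enlarge_doa`, and `plant` are hypothetical. A level $\gamma$ is accepted only if every sampled state with $V_P(x) < \gamma$ has at least one recorded input that decreases $V_P$, and it is capped so that the level set stays inside the sampled cube.

```python
import math
import numpy as np


def certified_level(P, X, X_next):
    """Largest level gamma supported by the samples for V(x) = x^T P x.

    X: (M, n) sampled states, origin excluded. X_next: (M, J, n) successors
    of each state under J recorded inputs. Assumes the samples cover a cube
    centered at the origin, so the outermost samples bound the level set.
    """
    V = np.einsum("mi,ij,mj->m", X, P, X)
    V_next = np.einsum("mji,ik,mjk->mj", X_next, P, X_next)
    certified = (V_next < V[:, None]).any(axis=1)  # some input decreases V
    on_boundary = np.isclose(np.abs(X).max(axis=1), np.abs(X).max())
    limits = [V[on_boundary].min()]  # keep the level set inside the cube
    if not certified.all():
        limits.append(V[~certified].min())
    return float(min(limits))


def level_set_volume(P, gamma):
    """Lebesgue measure of the ellipsoid {x : x^T P x <= gamma}."""
    n = P.shape[0]
    unit_ball = math.pi ** (n / 2) / math.gamma(n / 2 + 1)
    return unit_ball * gamma ** (n / 2) / math.sqrt(np.linalg.det(P))


def candidate(theta, n, eps=1e-3):
    """Map an unconstrained vector to a positive-definite P = L L^T + eps I."""
    L = np.zeros((n, n))
    L[np.tril_indices(n)] = theta
    return L @ L.T + eps * np.eye(n)


def enlarge_doa(X, X_next, n_trials=200, seed=0):
    """Random search (a stand-in for a metaheuristic) over quadratic candidates."""
    rng = np.random.default_rng(seed)
    n = X.shape[1]
    best_P = np.eye(n)
    best_vol = level_set_volume(best_P, certified_level(best_P, X, X_next))
    for _ in range(n_trials):
        P = candidate(rng.normal(size=n * (n + 1) // 2), n)
        vol = level_set_volume(P, certified_level(P, X, X_next))
        if vol > best_vol:
            best_P, best_vol = P, vol
    return best_P, best_vol


def plant(x, u):
    """Hypothetical plant, used only to generate data for the demo."""
    return np.array([1.1 * x[0] + 0.2 * x[1], 0.9 * x[1] + 0.1 * x[0] ** 2 + u])


grid = np.linspace(-1.0, 1.0, 8)  # even number of points: origin not sampled
inputs = np.linspace(-1.5, 1.5, 13)
X = np.array([[a, b] for a in grid for b in grid])
X_next = np.array([[plant(x, u) for u in inputs] for x in X])

best_P, best_vol = enlarge_doa(X, X_next)
print(best_P, best_vol)
```

Parametrizing $P$ through a lower-triangular factor keeps the search unconstrained, which matches the unconstrained formulation mentioned in the introduction. The certification here holds only at the sampled states, so it is a heuristic check rather than a proof of invariance.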

According to Lemma 2, if the $\gamma$-level set $X_{V, \gamma}$ of a continuous positive-definite function $V : R^{n} \to R$ satisfies conditions (9), (11), then $X_{V, \gamma}$ is an invariant subset of the DOA. Based on this idea, the

Conclusion

In this paper, a feedback controller that asymptotically stabilizes the nonlinear plant is designed directly from data. In addition, the estimate of the DOA for the closed loop is enlarged. Because our method uses data directly, the complexity of building a model and the modeling error are avoided. From Lemma 2, we know that a state feedback controller asymptotically stabilizes the plant if it belongs to an open subset of the $u$-$x$ space ($u$ denotes the control input and $x$ denotes the state). In this

References (17)

Y. Yang et al., An iterative optimization approach to design of control Lyapunov function, J. Process Control (2012)
U. Topcu et al., Local stability analysis using simulations and sum-of-squares programming, Automatica (2008)
J. van Helvoort et al., Direct data-driven recursive controller unfalsification with analytic update, Automatica (2007)
J.-X. Xu et al., Notes on data-driven system approaches, Acta Automat. Sinica (2009)
W.M. Haddad et al., Nonlinear Dynamical Systems and Control: A Lyapunov-Based Approach (2008)
I. Karafyllis et al., Stability and Stabilization of Nonlinear Systems (2011)
E.D. Sontag, Control-Lyapunov function
A. Packard et al., Help on SOS [ask the experts], IEEE Control Syst. (2010)



^☆: This research was supported by the State Key Program (No. 60834001) and the Major Program of International Cooperation and Exchanges (No. 61120106009) of National Natural Science Foundation of China.
