Adaptive learning tracking for robot manipulators with varying trial lengths

https://doi.org/10.1016/j.jfranklin.2019.04.034Get rights and content

Abstract

This paper proposes adaptive iterative learning control schemes for robot manipulator systems with iteration-varying lengths. To prove the asymptotical convergence of the joint position tracking error along the iteration axis, this paper develops a new composite energy function based on the newly introduced auxiliary variables for the analysis. Moreover, the traditional assumption of identical initialization condition is relaxed to be arbitrarily varying and then an initial rectifying mechanism is introduced to tackle initial shift problem of robotic systems. Illustrative simulations on a two degree-of-freedom robot manipulator are provided to verify the theoretical results.

Introduction

Learning is a basic skill of human beings. That is, people keep trying and learning from failures, which makes them behave better and better in daily work and lives. Inspired by this learning concept, some scholars speculated on whether industrial machines could own certain learning ability to serve human better. In fact, this basic cognition directly motivates the introduction and developments of iterative learning control (ILC). This novel learning control method is very effective in dealing with repetitive industrial processes, where the operation intervals for each batch are finite and fixed, and the learning objectives are identical for all iterations [1], [2], [3], [4]. The iterative learning controller is simple in structure, and actually is a data-driven mode requiring little knowledge of system dynamics. Therefore, ILC has been widely used in modeling various industrial systems, such as robotic system [5], [6], [7], [8], [9], [10]. Through experiments while by learning and correction, the iterative learning controller can gradually improve its tracking performance and achieve accurate tracking as the iteration number increases.

In order to ensure the desired control or system information can be learned over iterations, a fundamental requirement for ILC is that the operation lengths should be iteration-invariant. However, this precondition is often violated for many practical applications as a result of unknown uncertainties or unpredictable factors. For example, while applying ILC in humanoid and biped walking robots, the gaits are divided into phases whose duration is different in every batch [11]. Another example can be found in analyzing the lab-scale gantry crane, where the iteration will terminate once the output exceeds the specified boundary, and hence the trial length is varying [12]. A third example can be seen in [13], where the authors investigated the functional electrical stimulation for drop foot treatment, and they found that a trial might end early due to safety considerations. All these application examples motivate us to consider the varying-length problem in ILC design and analysis.

There were several primary researches trying to resolve this issue. In [14], Seel et al. considered the linear system and first completed the proof of the monotonic convergence. It should be emphasized that no specific formulation of varying length was imposed in this paper. Li et al. proposed the random model of the iteration-varying trial length problem based on the introduced Bernoulli variable whose distribution is known to the controller in [15], [16], [17], where an iteration-averaging operator was provided to compensate for the missing information caused by the early termination of the batch. Afterwards, to derive strong convergence of the conventional P-type ILC scheme, Shen et al. [18], [19] provided new analysis techniques where the iteration-averaging operator was not involved. Additionally, [20] applied the iteration-moving-average operator and proposed two new update laws for linear systems. However, in these papers, the considered systems are all linear or globally Lipschitz nonlinear. In our recent works [21], [22], we develop adaptive ILC laws for continuous-time parametric nonlinear systems, where the nonlinear functions do not satisfy the globally Lipschitz continuous condition and the trial lengths vary randomly. It has been known that the multi-degree-of-freedom robot manipulator described by Lagrange equation is a very complex nonlinear system. Therefore, these results obtained in these studies may not be directly applicable to such systems while the kernel concept can be borrowed.

Moreover, another challenge with ILC on robotic systems is the well-known identical initialization conditions (i.i.c.), which matches the repetition requirement of ILC and has been widely used in most ILC literature [23], [24], [25], [26]. Specifically, i.i.c. requires that the system states be accurately reset to the desired values at the start of each iteration. This assumption is so strict that it may be difficult to realize due to various reasons. In practical applications, the initial state of the control system is usually not equal to the expected value or even iteration-varying. An alternative solution for this problem is to introduce some initial rectifying mechanism, whose essence is that the initial tracking error is introduced into the traditional ILC law for a small time interval [0, T1) with T1 can be arbitrarily small, to eliminate the influence of the initial error for the rest part [T1, T]. In [27], Sun et al. proposed this mechanism when studying the initial problem for a class of continuous-time nonlinear systems with time-delay. Subsequently, the rectifying mechanism was extended to the nonlinear systems with general high-order relative degree [28] and the SISO discrete-time nonlinear systems with high-order relative degree [29]. However, the initial states in [27], [28], [29] are usually fixed. Motivated by this, Li et al. [30] investigated a continuous-time affine multi-variable system and the initial conditions were relaxed to non-fixed case. Moreover, Jin also provided interesting results on the relaxation of initial state condition for learning control with output and state constraints [31], [32], [33], [34], where the initial state can be arbitrarily set. These results have inspired us to relax the identical initial condition for extending the possible application environments in this paper.

In particular, we consider the ILC problem for robot manipulator systems with random trial lengths. The manipulator system is of high-order multi-input-multi-output form, which cannot be covered by the existing results. To handle this system, we employ the back-stepping adaptive scheme, which can be further extended to more complicated nonlinear systems. To derive the asymptotical convergence, we define a new composite energy function (CEF), in which a virtual tracking error is introduced for the untrodden part of each iteration. To address the different initial states, we modify the original reference trajectory to guarantee the convergence property beyond a small initial time interval. The major contributions of this paper are summarized as follows:

  • We consider the adaptive learning control of robot manipulator systems, in which the operation length is randomly varying from trial to trial.

  • We apply the initial rectifying mechanism similar to [31], [32], [33], [34] and modify the reference trajectory to guarantee a precise tracking to the original objective beyond a small initial interval.

  • A novel CEF is defined with the help of newly introduced auxiliary variables to facilitate the convergence analysis.

The paper is organized as follows. In Section 2, the general formulation of the robot manipulator system and the iteration-varying trial length problem are given. Section 3 discusses the controller design and convergence analysis. The relaxation of the i.i.c. is addressed in Section 4. Illustrative simulations are given in Section 5. In the end, Section 6 concludes this paper.

Notations: R denotes the set of real numbers, Rn is the space of all n-dimensional vectors, and N is the set of positive integers. E[·], P[·], F[·] denote the mathematical expectation of a random variable, the probability of an event, and the probability distribution function of a random variable, respectively.

Section snippets

Problem formulation

Consider an n degree-of-freedom (DOF) robot manipulator model with uncertainties described byx˙1,k=x2,kD(x1,k)x˙2,k+C(x1,k,x2,k)x2,k+G(x1,k)+F(x1,k,x2,k,t)=τkwhere k denotes the iteration number. x1,kx1,k(t)Rn, x2,kx2,k(t)Rn, x˙2,kx˙2,k(t)Rn are the joint position, joint velocity and joint acceleration vectors, respectively. t ∈ [0, T], T > 0 represents the operation time in each iteration. D(x1,k)Rn×n is the inertia matrix, C(x1,k,x2,k)Rn×n is the centripetal-Coriolis matrix, G(x1,k)Rn

ILC design and convergence analysis

Define the state tracking error e1,k=x1,kx1,r and e2,k=x2,kx2,r for t ≤ Tk. To facilitate our discussions, we introduce a fictitious joint velocity tracking error asγk=x2,kσk,σk=x˙1,rμ1e1,kwhere μ1 > 0 is the feedback gain to be designed. According to assumption A 3, it can be derived that e1,k(0)=e2,k(0)=γk(0)=0.

The control scheme is constructed asτk=e1,kμ2γk+Ψkθksign(γk)dk,tTkwhere ΨkΨ(σ˙k,σk,x1,k,x2,k), dkd(x1,k,x2,k,t), μ2 > 0 is the feedback gain to be designed, sign( · ) is the

Extensions to uncertain initial states

In the previous section, the widely-used i.i.c. is assumed to ensure a precise tracking over the whole operation interval [0, Tk]. However, a perfect initial resetting may not be easy because of various situations. Actually, the initial state x1,k(0) and x2,k(0) may be both iteration varying in practical applications. Then, it is impossible for x1,k(t), x2,k(t) to track x1,r(t), x2,r(t) due to the initial tracking error. An alternative solution is to track the reference trajectory beyond a

Illustrative simulations

In this section, to verify the effectiveness of the control schemes we proposed above, simulations have been done on a two DOF robot manipulator described as Eq. (1) [10]. The joint position and velocity vectors are x1,k=[x11,k,x12,k]T and x2,k=[x21,k,x22,k]T, respectively. The inertia matrix D(x1,k)=[Dij]2×2 is given by D11=m1l12+m2(l12+l22+2l1l2cos(x12,k)), D12=D21=m2(l22+l1l2cos(x12,k)) and D22=m2l22. The centripetal-Coriolis matrix C(x1,k,x2,k)=[Cij]2×2 is given as C11=m2l1l2x22,ksinx12,k,

Conclusions

In this paper, we present the first attempt to apply ILC to robot manipulator systems in which the iteration lengths vary randomly. To demonstrate the convergence of the joint position tracking error, a virtual tracking error is defined for the missing section of the operation process and a novel CEF is introduced for the nonuniform trial length problem. We compensate the missing tracking error with the iteration ending value, which allows us to present an explicit difference between adjacent

References (35)

  • D.A. Bristow et al.

    A survey of iterative learning control: a learning-based method for high-performance tracking control

    IEEE Control Syst. Mag.

    (2006)
  • AhnH.S. et al.

    Iterative learning control: brief survey and categorization

    IEEE Trans. Syst. Man Cybern.-Part C

    (2007)
  • ShenD.

    Iterative learning control with incomplete information: a survey

    IEEE/CAA J. Autom. Sin.

    (2018)
  • S. Arimoto et al.

    Bettering operation of robots by learning

    J. Robot. Syst.

    (1984)
  • YuC. et al.

    Trajectory tracking of wheeled mobile robot by adopting iterative learning control with predictive, current, and past learning items

    Robotica

    (2015)
  • ZhaoY. et al.

    Calibration-based iterative learning control for path tracking of industrial robots

    IEEE Trans. Ind. Electron.

    (2015)
  • P. Sampson et al.

    Using functional electrical stimulation mediated by iterative learning control and robotics to improve arm movement for people with multiple sclerosis

    IEEE Trans. Neural Syst. Rehabil. Eng.

    (2016)
  • Cited by (0)

    This work was supported by National Natural Science Foundation of China (61673045, 11661016).

    View full text