Elsevier

Automatica

Volume 38, Issue 3, March 2002, Pages 433-443
Automatica

Subspace-based system identification: weighting and pre-filtering of instruments

https://doi.org/10.1016/S0005-1098(01)00235-7Get rights and content

Abstract

Subspace-based system identification is typically based on an estimate of the extended observability matrix. It is thus of great interest to investigate, and also optimize, the estimate of the observability matrix. Of special interest in this paper is the fact that the influence of certain weighting matrices is an unresolved issue in the literature on subspace identification. Here, an asymptotic analysis of the estimated observability matrix is presented. The main result of the analysis is that novel weighting matrices and pre-filters of instrumental variables are derived.

Introduction

Methods for Subspace-based State Space System Identification (4SID) have lately been suggested as alternatives to traditional system identification techniques, see e.g. Larimore (1983), Ljung (1999), VanOverschee and De Moor (1996), Verhaegen (1994) and Viberg (1995). The 4SID methods are attractive since they estimate a state-space realization directly from input–output data, without requiring canonical parameterizations and non-linear optimizations, which typically is the case for the Prediction Error Method (PEM), cf. Ljung (1999), Söderström and Stoica (1989). Subspace methods are often implemented using robust numerical tools such as the QR-factorization and the Singular Value Decomposition (SVD), which make them attractive from a numerical point of view.

Although 4SID methods have been studied for a while and several successful applications, also on real-world data, have been reported, there is still work to be done in terms of analysis and performance optimization. In the literature, several contributions in this direction have appeared, see e.g. Bauer and Jansson (2000), Jansson and Wahlberg (1996) and Viberg, Wahlberg, and Ottersten (1997). In Bauer and Jansson (2000) it was shown that the estimated transfer function, under a set of assumptions, asymptotically has a normal distribution assuming application of a MOESP-like (Multivariable Output-Error state SPace identification, cf. Verhaegen, 1994) 4SID algorithm. The expression for the variance of the estimated transfer function is however complicated and was not explicitly provided. In Viberg et al. (1997), an optimally weighted subspace fitting (WSF) procedure was proposed. The WSF approach is based on a parameterization of the left nullspace of the observability matrix. The optimality is in terms of asymptotically producing minimum-variance estimates of the parameters describing the left nullspace of the observability matrix, given an estimate of the observability matrix (i.e. the subspace estimate). In Viberg et al. (1997) it was not guaranteed that the applied subspace estimate was the best possible. Given this rather arbitrary subspace estimate, the parameters describing the left null-space of the observability matrix were however estimated in an optimal manner. An open issue here is then how to optimize the subspace estimate. In Viberg et al. (1997) also the asymptotic variance of the estimated poles was derived, assuming a “shift-invariant approach” for estimation of system poles. In Jansson and Wahlberg (1996), 4SID algorithms were derived using a reduced-rank weighted least squares approach, which produced natural choices of weighting matrices. Similar results were obtained in Katayama and Picci (1999), where a multi-stage least squares approach was studied. Although different approaches of analysis, it is interesting to note that the column weightings found in Jansson and Wahlberg (1996), Katayama and Picci (1999) and Verhaegen (1994) all correspond to the CVA weighting of Larimore (1983).

The available statistical results on 4SID methods are in conclusion rather complicated, and the influence of certain user-defined quantities is not fully understood. The objective of the work in the present paper is to carry out a statistical investigation of 4SID methods. We will only address parts of this difficult problem. The first step in 4SID methods is in general to estimate the observability matrix. The particular state-space realization that the estimated observability matrix belongs to is in general unknown, and only its range space is retrieved. The basic idea of the present paper is then to analyze the asymptotic variance of the estimated observability matrix, rather than to consider the more difficult problem of analyzing the variance of the estimated transfer function. From this analysis, the following issues are addressed:

The analysis of the present paper is based on a sub-optimal figure of merit that not necessarily is related to the variance of the estimated transfer function. To appreciate the obtained results on weightings and pre-filters, it is of course desirable to relate our findings to relevant system quantities. For this purpose we analyze available results on the asymptotic variance of the estimated poles, and we also relate these results to our findings. Several numerical examples illustrating the benefits of the new weighting and pre-filtering strategies are included in the paper.

Section snippets

Notations and general assumptions

Assume that the discrete-time linear time-invariant dynamical system under study can be described by the following nth order state-space model with p outputs and m inputs collected in y(t) and u(t), respectively:x(t+1)=Ax(t)+Bu(t)+w(t),y(t)=Cx(t)+Du(t)+v(t).Here, w(t) denotes the n-dimensional process noise, v(t) denotes the p-dimensional measurement noise, and x(t) denotes the n-dimensional state vector. The system matrices are of dimensions ARn×n,BRn×m,CRp×n, and DRp×m respectively. The

Analysis of Γ̂

The subspace estimate Γ̂ consists in general of components from both R{Γ} (the orthogonal complement of R{Γ}) and R{Γ}. On the other hand, only the part of Γ̂ that lies in R{Γ} actually is an error. With this in mind, we decompose Γ̂ asΓ̂=ΠΓΓ̂+ΠΓΓ̂,where ΠΓ denotes the orthogonal projection onto R{Γ}, and ΠΓ denotes the orthogonal projection onto R{Γ}, i.e.ΠΓ=Γ(ΓTΓ)−1ΓT,ΠΓ=IΠΓ.Instead of analyzing the properties of the “subspace error” ΠΓΓ̂, we have chosen to study the following

Results on weighting matrices

The purpose of this section is to derive weighting matrices, and to outline an algorithm for subspace identification. The starting point of the analysis lies in the asymptotic covariance matrix of εN.

Analysis of the variance of the pole estimate

The purpose of this section is to relate our findings to existing statistical results. For that purpose, recall the common “shift-invariant” approach for estimating A, see e.g. Viberg et al. (1997). Define the selection matricesJ1=[I(α−1)p0(α−1)p×p,],J2=[0(α−1)p×pI(α−1)p],where IK is the K×K identity matrix, and 0K×L denotes an K×L matrix with zeros. The shift-invariant structure of Γ implies thatJ1ΓA=J2Γ.Given Γ̂, an estimate of A can be found by solving (57) in the least-squares sense. In

Pre-filtering of p(t)

In the identification literature it has been noted that IV methods in general perform worse than the PEM. In order to make IV methods to perform as well as the PEM, a pre-filtering of the instruments is typically required, cf. Söderström and Stoica (1989, Chapter 8). Thus, considering IV-interpretations of 4SID methods, it is natural to investigate whether pre-filtering of the IV-vector p(t) can improve the subspace estimate. For simplicity, only scalar “pre-filters” are considered.

The W4SID algorithm

In this section, previously obtained results are illustrated by means of a couple of numerical examples. In our first example, we consider identification of the following simple ARMAX SISO system:(1−q−1+0.5q−2)y(t)=(1−0.5q−2)u(t)+(1+q−1+0.5q−2)e(t),where e(t) is white Gaussian noise. Three different identification algorithms are investigated:

Alg1:The PO-MOESP weighting (i.e. Wc=Ŵcoa).
Alg2:The W4SID algorithm.
Alg3:PEM, implemented using the ARMAX routine in Matlab's System Identification

Conclusions

We have presented an analysis on the asymptotic error of the estimated observability matrix. Once a particular figure of merit was defined, useful results on how to choose weighting matrices were found. A particularly important result is that we found a novel weighting matrix, which in our numerical examples showed superior accuracy compared to PO-MOESP. From the numerical examples, we draw the following conclusions:

  • The W4SID algorithm performs favorable to MOESP.

  • Pre-filtering of the

Tony Gustafsson was born in Värnamo, Sweden, in 1969. He received the M.S. degree in electrical engineering from Chalmers University of Technology, Sweden, in 1994, and in 1999 he received the Ph.D. degree in signal processing from the same university. From 1999 to 2000, he was a postdoctoral researcher at University of California San Diego. Presently he is a research engineer at Switchcore corporation, Göteborg, Sweden.

Cited by (30)

  • Variance reduction in Covariance Based Realization Algorithm with application to closed-loop data

    2020, Automatica
    Citation Excerpt :

    In addition, for MIMO state-space models, an optimal column weighting matrix was proposed for MOESP-like (Multivariable Output-Error state SPace identification) subspace methods in Gustafsson (2002) by using a similar idea from Stoica and Söderström (1983) and Viberg, Wahlberg, and Ottersten (1997). An optimal column weighting for CoBRA in an open-loop setting was then proposed theoretically in Hu and de Callafon (2017) based on the work in Gustafsson (2002) and Viberg et al. (1997). This paper analyzes a two stage CoBRA for closed-loop MIMO system identification, where the work from Hu and de Callafon (2017) is extended to the first stage.

View all citing articles on Scopus

Tony Gustafsson was born in Värnamo, Sweden, in 1969. He received the M.S. degree in electrical engineering from Chalmers University of Technology, Sweden, in 1994, and in 1999 he received the Ph.D. degree in signal processing from the same university. From 1999 to 2000, he was a postdoctoral researcher at University of California San Diego. Presently he is a research engineer at Switchcore corporation, Göteborg, Sweden.

This paper was originally presented at the 12th IFAC Symposium on System Identification, 2000, Santa Barbara, USA. This paper was recommended for publication in revised form by Associate Editor Brett Ninness under the direction of Editor Torsten Söderström. This work was completed while visiting University of California San Diego. Support by the Swedish Foundation for International Cooperation in Research and Higher Education, and Telefonaktiebolaget LM Ericsson is gratefully acknowledged.

View full text