Elsevier

Information Sciences

Volume 329, 1 February 2016, Pages 670-689
Information Sciences

Effectiveness of Bayesian filters: An information fusion perspective

https://doi.org/10.1016/j.ins.2015.09.041Get rights and content

Highlights

  • A fundamental issue concerned the effectiveness of the Bayesian filter is raised.

  • The observation-only (O2) inference is presented for dynamic state estimation.

  • The “probability of filter benefit” is defined and quantitatively analyzed.

  • Convincing simulations demonstrate that many filters can be easily ineffective.

Abstract

The general solution for dynamic state estimation is to model the system as a hidden Markov process and then employ a recursive estimator of the prediction–correction format (of which the best known is the Bayesian filter) to statistically fuse the time-series observations via models. The performance of the estimator greatly depends on the quality of the statistical mode assumed. In contrast, this paper presents a modeling-free solution, referred to as the observation-only (O2) inference, which infers the state directly from the observations. A Monte Carlo sampling approach is correspondingly proposed for unbiased nonlinear O2 inference. With faster computational speed, the performance of the O2 inference has identified a benchmark to assess the effectiveness of conventional recursive estimators where an estimator is defined as effective only when it outperforms on average the O2 inference (if applicable). It has been quantitatively demonstrated, from the perspective of information fusion, that a prior “biased” information (which inevitably accompanies inaccurate modelling) can be counterproductive for a filter, resulting in an ineffective estimator. Classic state space models have shown that a variety of Kalman filters and particle filters can easily be ineffective (inferior to the O2 inference) in certain situations, although this has been omitted somewhat in the literature.

Introduction

Dynamic state estimation has been a long-standing and vibrant area of research concerned with the sequential process of estimating a/multiple state(s) evolving over time based on noisy observations. It is the core of many fundamental problems including positioning, tracking, econometric forecasting, adaptive control, etc.

A “naïve” estimation solution is to infer the state directly from the noisy observations received in discrete time instants, hereafter referred to as the observation-only (O2) inference, which will be addressed in this paper. This is a computationally fast estimation method, providing accuracy that is completely dependent on the observation noise regardless of the state process (for which there is, therefore, no need to model it).

In contrast to the straightforward O2 inference, which provides only the point state-estimate, the prevailing solution that has been most investigated is to model the system as a hidden Markov process and employ a recursive estimator to statistically fuse the observations with models in real time. In this case, a two-step estimation paradigm must be adopted, including model identification based on data and filter design based on the identified model [41]. The optimal recursive state estimator in the Bayesian sense requires the complete posterior density of the state to be determined as a function of time. The posterior probability density function (PDF) can be analytically computed only for linear systems with additive Gaussian noises for which the known Kalman filter [24], [25] gives the optimal estimate (and some other special cases [9]). In the general case of nonlinear system or/and non-Gaussian noises, it is impossible to compute the exact form of the posterior PDF; instead, one has to resort to some form of approximation which can be parametric (e.g. Gaussian filters or Gaussian sum filters), non-parametric (e.g. Monte Carlo methods) or a mixture of both. An astonishing surge of various recursive filters/smoothers has been witnessed since [24], [25].

These recursive estimators, which have the Bayesian paradigm as the theoretically most elaborated base [26], perform well as long as the models used are accurate, having few disturbances, and that the approximation (required in nonlinear systems) is insignificant. Ideally, an optimality (e.g. Cramér–Rao lower bounds, CRLB [14], [51], [57]) can be reached if the physical world and the assumed model coincide perfectly. However, in most practical problems, accurate knowledge of the state process model (and noises) is often missing. The model of a real process may differ from the assumed model or the best available model for that process, leaving a difference we refer to as modeling error.

It has been well acknowledged in literature since at least [13], [21], [22] that modeling errors (and significant disturbances) can easily cause significant performance deteriorations or even failures of filters. Therefore, dealing with modeling errors has been a fundamental problem. This, however, is not a problem for the O2 inference as it is free of state process modeling. For recursive estimation, a large variety of strategies have been proposed to enhance the filtering performance including model assessment [11], adaptive filtering (e.g. [19], [34]), robust filtering (e.g. effective characteristics [7], particularly including detection and treatment of uncertain noise [45], outlier [38], abrupt motion [36], asynchronous observations [47], [40] and colored noises [53]), “direct” filtering [41], variable rate filtering [15] and finite impulse response filtering [29], just to name a few. Similar issues occur in Bayesian smoothers and predictors [1], [6], [18], [48] as well as other recursive estimators e.g. optimization-based estimator [27], [42], [46]. The situation will be much more complicated in the multi-target case of cluttered environments, see e.g. [3], [30], [31], [55]. We do not intend to detail these in this paper. However, we would like to point out that:

  • (1)

    While considerable efforts have been devoted to developing sophisticated recursive filters, the general effectiveness of these filters has remained elusive. Simply stated, it is rare to be asked whether the use of a filter will pay off when modeling errors (including outlier noise) occur or when too much approximation is triggered. This is primarily because a clear definition of the effectiveness for general filters is still missing. Such a definition would require a clear, efficient and engineer-friendly benchmark that is qualified to assess all filters in a consistent manner. The same holds for the work on smoothers and other recursive estimators.

  • (2)

    It has been demonstrated that the Bayesian inference can behave very badly if the model under consideration is erroneous e.g. [18]. More specifically simple deterministic methods outperform the Bayesian filter in a type of finite-state estimation [44] even when the model is properly set up. In any case, the quantitative analysis of the failure of filters is missing. This paper will thoroughly demonstrate that the O2 inference can outperform recursive filters in certain situations, thus indicating that filters do not always pay off.

In this paper, two primary contributions have been made with regard to these fundamental issues.

  • (1)

    The O2 inference is established as a benchmark, a bottom line, to assess the effectiveness of recursive estimators, including the Bayesian filter, where an estimator is defined as effective only when it can at minimum outperform the O2 inference on average in accuracy. For a nonlinear observation function, a bias is noticed in the O2 inference and, consequently, a Monte Carlo sampling-based debiasing approach is proposed.

  • (2)

    The effectiveness of the Bayesian filter of the prediction–correction format is quantitatively investigated from the information fusion perspective, and examples are evaluated on classic filtering models via simulation. Both theoretical studies and simulation results show that the O2 inference can easily outperform the filters in certain situations, more so than expected. This deserves particular attention for the application of any filter.

The remainder of the paper is organized as follows. The basic idea of the Bayesian filter and the O2 inference is given in Section 2. Section 3 investigates the effectiveness of the recursive filter from the general perspective of information fusion while Section 4 presents simulation results based on three representative problem models to demonstrate the theoretical findings. We conclude in Section 5.

Section snippets

A brief review of Bayesian filters

The dynamic state estimation, also referred to as the filtering problem, is generally modeled in the state space where the system being modeled is assumed to be a Markov process of hidden state. This can be formulated as a state space model (SSM) that is comprised of a state process equation and an observation equation as follows xt=ft(xt1,ut)yt=ht(xt,vt)where t indicates the time instant, xt denotes the state vector, yt denotes the observation (also called measurement) vector, and ut and vt

Probability of filter benefit

For simplicity, both the prior xp and the O2 inference xo are assumed to be subject to Gaussian in the 1-dimensional state space, either biased or unbiased with regard to the true state xT i.e.p(xo)=N(mo,δo2), p(xp)=N(mp,δp2). Here, we omit the reasons that cause the bias (to the prior or to the O2 inference) and are only concerned with how the bias that once occurred will affect the filtering result in different situations. The Bayesian filter fuses p(xo) and p(xp) obtaining the posteriorp(xf)=

Simulations

In this section, we will investigate the effectiveness of several known (extensions of) Kalman filters and particle filters based on two popular one-dimensional state space models, one with Gaussian state process noise and the other non-Gaussian, and a representative maneuvering target tracking case.

Conclusions

The observation-only (O2) inference is a straightforward, and probably the simplest, solution for dynamic state estimation. We have elaborated this method systematically and proposed a Monte Carlo sampling solution for unbiased nonlinear implementation. While the posterior CRLB provides a lower bound on the mean-square error of any “unbiased” estimator of the random parameter, the O2 takes a more practical approach by setting a higher bound on the mean error of any “effective” estimator

Acknowledgments

The authors acknowledge the insights of Prof./Dr. Yu-Chi (Larry) Ho, Huimin Chen, Xiao-Rong Li, Miodrag Bolicć, Petar Djuricć, Mahendra Mallick, Quan Pan, etc. on this work and the language checking by Deanna Garcia. This work is partly supported by European Commission: FP7-PEOPLE-2012-IRSES (ref. 318878) and MSCA-RISE-2014 (ref. 641794) and National Natural Science Foundation of China (No. 51475383) and Tiancheng Li's work has been supported by the Excellent Doctorate Foundation of

References (57)

  • H. Rezaei et al.

    Improved robust finite-horizon Kalman filtering for uncertain networked time-varying systems

    Inform. Sci.

    (2015)
  • M. Simandl et al.

    Advanced point-mass method for nonlinear state estimation

    Automatica

    (2006)
  • Y. Bar-Shalom et al.

    Tracking and Data Fusion: A Handbook of Algorithms

    (2011)
  • S. Bordonaro et al.

    Decorrelated unbiased converted measurement Kalman filter

    IEEE Trans. Aerosp. Electron. Syst.

    (2014)
  • F.S. Cattivelli et al.

    Diffusion strategies for distributed Kalman filtering and smoothing

    IEEE Trans. Autom. Control

    (2010)
  • L.A. Dalton et al.

    Intrinsically optimal Bayesian robust filtering

    IEEE Trans. Signal Process.

    (2014)
  • D. Dardari et al.

    Indoor tracking: theory, methods, and technologies

    IEEE Trans. Veh. Technol.

    (2015)
  • F. Daum

    Exact finite-dimensional nonlinear filters

    IEEE Trans. Autom. Control

    (1986)
  • F. Daum et al.

    Exact particle flow for nonlinear filters

  • P.M. Djuric et al.

    Assessment of nonlinear dynamic models by Kolmogorov–Smirnov statistics

    IEEE Trans. Signal Process.

    (2010)
  • J.A. Fessler

    Mean and variance of implicitly defined biased estimators (such as penalized maximum likelihood): applications to tomography

    IEEE Trans. Image Process.

    (1996)
  • B. Friedland

    Treatment of bias in recursive filtering

    IEEE Trans. Autom. Control

    (1969)
  • C. Fritsche et al.

    A fresh look at Bayesian Cramér–Rao bounds for discrete-time nonlinear filtering

  • S. Godsill et al.

    Models and algorithms for tracking of maneuvering objects using variable rate particle filters

    Proc. IEEE

    (2007)
  • N. Gordon et al.

    Novel approach to nonlinear/non-Gaussian Bayesian state estimation

    IEE Proc. F Radar Signal Process.

    (1993)
  • W. Greg et al.

    An Introduction to the Kalman Filter

    (2006)
  • P.D. Grünwald, T. van Ommen. Inconsistency of Bayesian inference for misspecified linear models, and a proposal for...
  • F. Gustafsson

    Adaptive Filtering and Change Detection

    (2000)
  • Cited by (62)

    • Adaptive Bayesian filtering based restoration of MR images

      2021, Biomedical Signal Processing and Control
    • Combining pre- and post-model information in the uncertainty quantification of non-deterministic models using an extended Bayesian melding approach

      2019, Information Sciences
      Citation Excerpt :

      There are many approaches in information fusion based on different uncertainty theories, such as Dempster-Shafer evidence theory [33], possibility theory [5], fuzzy set [6] and rough set [37]. Among all in-used information fusion methodologies, Bayesian-based approaches have drawn extensive attentions due to the ability to combine both subjective experts judgements and objective test data in a principled way to perform inference [21,39]. To integrate all available information in reliability analysis, Johnson et al. [17] developed a full-Bayesian approach for a multi-level system.

    View all citing articles on Scopus
    View full text