Effectiveness of Bayesian filters: An information fusion perspective

doi:10.1016/j.ins.2015.09.041

Information Sciences

Volume 329, 1 February 2016, Pages 670-689

https://doi.org/10.1016/j.ins.2015.09.041

Highlights

• A fundamental issue concerning the effectiveness of the Bayesian filter is raised.
• The observation-only (O₂) inference is presented for dynamic state estimation.
• The “probability of filter benefit” is defined and quantitatively analyzed.
• Simulations demonstrate that many filters can easily be ineffective.

Abstract

The general solution for dynamic state estimation is to model the system as a hidden Markov process and then employ a recursive estimator of the prediction–correction format (of which the best known is the Bayesian filter) to statistically fuse the time-series observations via models. The performance of the estimator depends greatly on the quality of the statistical model assumed. In contrast, this paper presents a modeling-free solution, referred to as the observation-only (O₂) inference, which infers the state directly from the observations. A Monte Carlo sampling approach is correspondingly proposed for unbiased nonlinear O₂ inference. Being computationally faster, the O₂ inference also provides a benchmark for assessing the effectiveness of conventional recursive estimators: an estimator is defined as effective only when it outperforms, on average, the O₂ inference (if applicable). It is quantitatively demonstrated, from the perspective of information fusion, that “biased” prior information (which inevitably accompanies inaccurate modeling) can be counterproductive for a filter, resulting in an ineffective estimator. Simulations on classic state space models show that a variety of Kalman filters and particle filters can easily be ineffective (inferior to the O₂ inference) in certain situations, although this has been somewhat overlooked in the literature.

Introduction

Dynamic state estimation is a long-standing and vibrant area of research concerned with sequentially estimating one or more states that evolve over time, based on noisy observations. It is at the core of many fundamental problems including positioning, tracking, econometric forecasting, adaptive control, etc.

A “naïve” estimation solution is to infer the state directly from the noisy observations received at discrete time instants, hereafter referred to as the observation-only (O₂) inference, which will be addressed in this paper. This is a computationally fast estimation method whose accuracy depends entirely on the observation noise, regardless of the state process (which, therefore, does not need to be modeled).
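As a rough illustration of why the O₂ accuracy depends only on the observation noise, the following Python sketch inverts an observation function sample by sample and compares two very different state trajectories. The linear observation function `h(x) = 2x`, the noise level, and both trajectories are assumptions chosen for illustration; they are not taken from the paper.

```python
import numpy as np


def o2_inference(observations, inverse_observation):
    """Map each observation to a state estimate independently.

    No state-process model is used: every time step is treated on its own.
    """
    return inverse_observation(np.asarray(observations))


def rmse(estimates, truth):
    """Root-mean-square error between estimates and the true states."""
    return float(np.sqrt(np.mean((estimates - truth) ** 2)))


rng = np.random.default_rng(0)
n_steps = 5000
obs_noise_std = 0.5  # assumed observation noise level

# Two hypothetical state trajectories with very different dynamics.
random_walk = np.cumsum(rng.normal(0.0, 1.0, n_steps))
sinusoid = 10.0 * np.sin(0.01 * np.arange(n_steps))

# The same noise realization is added to both observation sequences.
noise = rng.normal(0.0, obs_noise_std, n_steps)


def h(x):
    """Assumed linear observation function."""
    return 2.0 * x


def h_inv(y):
    """Inverse of the assumed observation function."""
    return y / 2.0


rmse_walk = rmse(o2_inference(h(random_walk) + noise, h_inv), random_walk)
rmse_sine = rmse(o2_inference(h(sinusoid) + noise, h_inv), sinusoid)
print(rmse_walk, rmse_sine)
```

Because the estimation error in this linear setting is just the observation noise divided by two, both trajectories yield essentially the same error. A nonlinear observation function would generally introduce a bias into the inverted estimate, which this sketch does not address.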

In contrast to the straightforward O₂ inference, which provides only a point estimate of the state, the prevailing and most investigated solution is to model the system as a hidden Markov process and employ a recursive estimator to statistically fuse the observations with models in real time. In this case, a two-step estimation paradigm must be adopted, consisting of model identification based on data and filter design based on the identified model [41]. The optimal recursive state estimator in the Bayesian sense requires the complete posterior density of the state to be determined as a function of time. The posterior probability density function (PDF) can be analytically computed only for linear systems with additive Gaussian noise, for which the well-known Kalman filter [24], [25] gives the optimal estimate (and for some other special cases [9]). In the general case of nonlinear systems and/or non-Gaussian noise, it is impossible to compute the exact form of the posterior PDF; instead, one has to resort to some form of approximation, which can be parametric (e.g. Gaussian filters or Gaussian sum filters), non-parametric (e.g. Monte Carlo methods) or a mixture of both. An astonishing surge of various recursive filters/smoothers has been witnessed since [24], [25].

These recursive estimators, which have the Bayesian paradigm as their most theoretically elaborated basis [26], perform well as long as the models used are accurate, disturbances are few, and the approximation (required in nonlinear systems) is insignificant. Ideally, optimality (e.g. the Cramér–Rao lower bound, CRLB [14], [51], [57]) can be reached if the physical world and the assumed model coincide perfectly. However, in most practical problems, accurate knowledge of the state process model (and noises) is missing. The model of a real process may differ from the assumed model or the best available model for that process, leaving a difference we refer to as modeling error.

It has been well acknowledged in the literature since at least [13], [21], [22] that modeling errors (and significant disturbances) can easily cause significant performance deterioration or even failure of filters. Therefore, dealing with modeling errors has been a fundamental problem. This, however, is not a problem for the O₂ inference as it is free of state process modeling. For recursive estimation, a large variety of strategies have been proposed to enhance the filtering performance, including model assessment [11], adaptive filtering (e.g. [19], [34]), robust filtering (e.g. effective characteristics [7], particularly including detection and treatment of uncertain noise [45], outliers [38], abrupt motion [36], asynchronous observations [47], [40] and colored noise [53]), “direct” filtering [41], variable rate filtering [15] and finite impulse response filtering [29], to name just a few. Similar issues occur in Bayesian smoothers and predictors [1], [6], [18], [48] as well as in other recursive estimators, e.g. optimization-based estimators [27], [42], [46]. The situation is much more complicated in the multi-target case with cluttered environments, see e.g. [3], [30], [31], [55]. We do not intend to detail these in this paper. However, we would like to point out that:

(1) While considerable efforts have been devoted to developing sophisticated recursive filters, the general effectiveness of these filters has remained elusive. Simply stated, it is rarely asked whether the use of a filter will pay off when modeling errors (including outlier noise) occur or when too much approximation is involved. This is primarily because a clear definition of effectiveness for general filters is still missing. Such a definition would require a clear, efficient and engineer-friendly benchmark that is qualified to assess all filters in a consistent manner. The same holds for the work on smoothers and other recursive estimators.
(2) It has been demonstrated that Bayesian inference can behave very badly if the model under consideration is erroneous, e.g. [18]. More specifically, simple deterministic methods outperform the Bayesian filter in a type of finite-state estimation [44] even when the model is properly set up. In any case, a quantitative analysis of the failure of filters is missing. This paper will thoroughly demonstrate that the O₂ inference can outperform recursive filters in certain situations, thus indicating that filters do not always pay off.

In this paper, two primary contributions have been made with regard to these fundamental issues.

(1) The O₂ inference is established as a benchmark, a bottom line, to assess the effectiveness of recursive estimators, including the Bayesian filter, where an estimator is defined as effective only when it can at minimum outperform the O₂ inference on average in accuracy. For a nonlinear observation function, a bias is noticed in the O₂ inference and, consequently, a Monte Carlo sampling-based debiasing approach is proposed.
(2) The effectiveness of the Bayesian filter of the prediction–correction format is quantitatively investigated from the information fusion perspective, and examples are evaluated on classic filtering models via simulation. Both theoretical studies and simulation results show that the O₂ inference can easily outperform the filters in certain situations, more so than expected. This deserves particular attention for the application of any filter.

The remainder of the paper is organized as follows. The basic idea of the Bayesian filter and the O₂ inference is given in Section 2. Section 3 investigates the effectiveness of the recursive filter from the general perspective of information fusion while Section 4 presents simulation results based on three representative problem models to demonstrate the theoretical findings. We conclude in Section 5.

Section snippets

A brief review of Bayesian filters

The dynamic state estimation problem, also referred to as the filtering problem, is generally modeled in the state space, where the system being modeled is assumed to be a Markov process with a hidden state. This can be formulated as a state space model (SSM) that comprises a state process equation and an observation equation as follows:
$x_{t} = f_{t}(x_{t-1}, u_{t})$
$y_{t} = h_{t}(x_{t}, v_{t})$
where $t$ indicates the time instant, $x_{t}$ denotes the state vector, $y_{t}$ denotes the observation (also called measurement) vector, and $u_{t}$ and $v_{t}$

Probability of filter benefit

For simplicity, both the prior $x_{p}$ and the O₂ inference $x_{o}$ are assumed to be Gaussian distributed in the 1-dimensional state space, either biased or unbiased with regard to the true state $x_{T}$, i.e. $p(x_{o}) = N(m_{o}, \delta_{o}^{2})$, $p(x_{p}) = N(m_{p}, \delta_{p}^{2})$. Here, we omit the reasons that cause the bias (in the prior or in the O₂ inference) and are only concerned with how a bias, once it has occurred, affects the filtering result in different situations. The Bayesian filter fuses $p(x_{o})$ and $p(x_{p})$, obtaining the posterior $p(x_{f}) =$
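To make the fusion step concrete, here is a minimal Python sketch of the standard product rule for two scalar Gaussian densities, followed by a Monte Carlo estimate of how often the fused estimate lands closer to the true state than the O₂ estimate alone. The function names, the parameter values, and in particular the "closer to the truth" criterion are assumptions made for illustration; the paper's formal definition of the probability of filter benefit may differ.

```python
import numpy as np


def fuse_gaussians(m_o, var_o, m_p, var_p):
    """Product of two scalar Gaussian densities, renormalized.

    Returns the mean and variance of the fused Gaussian.
    """
    var_f = var_o * var_p / (var_o + var_p)
    m_f = (m_o * var_p + m_p * var_o) / (var_o + var_p)
    return m_f, var_f


def probability_of_benefit(prior_bias, std_o, std_p,
                           x_true=0.0, n_samples=200_000, seed=0):
    """Fraction of draws in which fusion beats the O2 estimate alone.

    Illustrative proxy only: the O2 estimate is drawn unbiased around
    x_true, the prior is drawn around x_true + prior_bias, and "benefit"
    means the fused point estimate is closer to x_true.
    """
    rng = np.random.default_rng(seed)
    x_o = rng.normal(x_true, std_o, n_samples)
    x_p = rng.normal(x_true + prior_bias, std_p, n_samples)
    x_f, _ = fuse_gaussians(x_o, std_o ** 2, x_p, std_p ** 2)
    return float(np.mean(np.abs(x_f - x_true) < np.abs(x_o - x_true)))


# Hypothetical settings: equal spreads, with and without a prior bias.
p_unbiased = probability_of_benefit(prior_bias=0.0, std_o=1.0, std_p=1.0)
p_biased = probability_of_benefit(prior_bias=5.0, std_o=1.0, std_p=1.0)
print(p_unbiased, p_biased)
```

Under these assumed settings, an unbiased prior helps in most draws, whereas a prior biased by several standard deviations makes the fused estimate worse than the O₂ estimate in most draws, which is the qualitative effect the section analyzes.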

Simulations

In this section, we will investigate the effectiveness of several known (extensions of) Kalman filters and particle filters based on two popular one-dimensional state space models, one with Gaussian state process noise and the other non-Gaussian, and a representative maneuvering target tracking case.

Conclusions

The observation-only (O₂) inference is a straightforward, and probably the simplest, solution for dynamic state estimation. We have elaborated this method systematically and proposed a Monte Carlo sampling solution for unbiased nonlinear implementation. While the posterior CRLB provides a lower bound on the mean-square error of any “unbiased” estimator of the random parameter, the O₂ takes a more practical approach by setting a higher bound on the mean error of any “effective” estimator

Acknowledgments

The authors acknowledge the insights of Prof./Dr. Yu-Chi (Larry) Ho, Huimin Chen, Xiao-Rong Li, Miodrag Bolić, Petar Djurić, Mahendra Mallick, Quan Pan, etc. on this work and the language checking by Deanna Garcia. This work is partly supported by European Commission: FP7-PEOPLE-2012-IRSES (ref. 318878) and MSCA-RISE-2014 (ref. 641794) and National Natural Science Foundation of China (No. 51475383) and Tiancheng Li's work has been supported by the Excellent Doctorate Foundation of

References (57)

J. Ala-Luhtala et al., Gaussian filtering and variational approximations for Bayesian smoothing in continuous-discrete stochastic dynamic systems, Signal Process. (2015)
D.L. Alspach, Gaussian sum approximations in nonlinear filtering and control, Inform. Sci. (1974)
V.A. Bavdekar et al., Identification of process and measurement noise covariances for state and parameter estimation using extended Kalman filter, J. Process Control (2011)
M. Kárný, Recursive estimation of high-order Markov chains: approximation by finite mixtures, Inform. Sci. (2016)
W.H. Kwon et al., A receding horizon unbiased FIR filter for discrete-time state space models, Automatica (2002)
T. Li et al., Fight sample degeneracy and impoverishment in particle filters: a review of intelligent approaches, Expert Syst. Appl. (2014)
M.K. Lim et al., Refined particle swarm intelligence method for abrupt motion tracking, Inform. Sci. (2014)
M.M. Naushad Ali et al., Multiple object tracking with partial occlusion handling using salient feature points, Inform. Sci. (2014)
S.C. Patwardhan et al., Nonlinear Bayesian state estimation: a review of recent developments, Control Eng. Practice (2012)
W. Qi et al., Robust weighted fusion time-varying Kalman smoothers for multisensor system with uncertain noise variances, Inform. Sci. (2014)
H. Rezaei et al., Improved robust finite-horizon Kalman filtering for uncertain networked time-varying systems, Inform. Sci. (2015)
M. Simandl et al., Advanced point-mass method for nonlinear state estimation, Automatica (2006)
Y. Bar-Shalom et al., Tracking and Data Fusion: A Handbook of Algorithms (2011)
S. Bordonaro et al., Decorrelated unbiased converted measurement Kalman filter, IEEE Trans. Aerosp. Electron. Syst. (2014)
F.S. Cattivelli et al., Diffusion strategies for distributed Kalman filtering and smoothing, IEEE Trans. Autom. Control (2010)
L.A. Dalton et al., Intrinsically optimal Bayesian robust filtering, IEEE Trans. Signal Process. (2014)
D. Dardari et al., Indoor tracking: theory, methods, and technologies, IEEE Trans. Veh. Technol. (2015)
F. Daum, Exact finite-dimensional nonlinear filters, IEEE Trans. Autom. Control (1986)
F. Daum et al., Exact particle flow for nonlinear filters
P.M. Djuric et al., Assessment of nonlinear dynamic models by Kolmogorov–Smirnov statistics, IEEE Trans. Signal Process. (2010)
J.A. Fessler, Mean and variance of implicitly defined biased estimators (such as penalized maximum likelihood): applications to tomography, IEEE Trans. Image Process. (1996)
B. Friedland, Treatment of bias in recursive filtering, IEEE Trans. Autom. Control (1969)
C. Fritsche et al., A fresh look at Bayesian Cramér–Rao bounds for discrete-time nonlinear filtering
S. Godsill et al., Models and algorithms for tracking of maneuvering objects using variable rate particle filters, Proc. IEEE (2007)
N. Gordon et al., Novel approach to nonlinear/non-Gaussian Bayesian state estimation, IEE Proc. F Radar Signal Process. (1993)
W. Greg et al., An Introduction to the Kalman Filter (2006)
P.D. Grünwald, T. van Ommen, Inconsistency of Bayesian inference for misspecified linear models, and a proposal for...
F. Gustafsson, Adaptive Filtering and Change Detection (2000)

