Optimal instantaneous rigid motion estimation insensitive to local minima
Introduction
The instantaneous velocity or optic flow field encountered by a moving observer contains an enormous amount of information related to the three dimensional (3D) structure of the environment and to the presence and motion of independently moving objects. Knowledge of the egomotion or self-motion of the observer is a necessary prerequisite to obtain this valuable information. Since small observer motions can have large effects on the optic flow field, it is advisable to extract the egomotion parameters from the optic flow field itself. This, however, is non-trivial and an active topic of research.
The field has matured a lot over the years and a number of ‘optimal’ algorithms (unbiased and minimal variance of the estimates) have appeared [1], [2]. The error function of the optimal problem formulation is however highly nonlinear and contains a large number of local minima [3], [4], which renders these algorithms unreliable and hard to use in practical applications. The earlier approaches [5], [6], [7], [8], which operate on a linearization of the problem, are no valid alternative. Compared to optimal algorithms, they are extremely sensitive to noise [1], [2], [9] and the estimates they provide are unsuitable, even as initializations for the optimal methods.
As an alternative to the time-consuming process of repeatedly minimizing with different, random initializations and selection of the minimum-cost solution, we propose to regularize the error function. We reformulate the problem in such a way that the complexity of the error function (the likelihood that algorithms end up in local minima) is controlled by a single parameter. We propose a reweighting scheme that gradually increases the problem complexity during the minimization, until the optimal problem formulation is obtained. We demonstrate, both in simulation and on real data, that the proposed method retains the accuracy of optimal algorithms, but is much less sensitive to local minima. On the extensive set of data investigated, these improvements come at the cost of less than a doubling in computation time compared to previous optimal algorithms.
Section snippets
Problem statement
Under a static environment assumption, the motion of all points in space, relative to a coordinate system centered in the nodal point of the observer’s eye, is determined by the translational velocity, t = (tx, ty, tz)T, and rotational velocity, ω = (ωx, ωy, ωz)T, of the moving observer. The 3D velocity, v = (vx, vy, vz)T, of a point in space, x = (x, y, z)T, is then [10]Under perspective projection and assuming, without loss of generality, a focal length equal to unity, these 3D motion vectors are
Previous algorithms
A wide variety of egomotion-estimation methods have been proposed in the past. An important distinction can be made between the earlier approaches, which suffer from biased and/or widely varying estimates, and the more recent optimal algorithms.
Proposed method
As mentioned in the introduction, the optimal algorithms suffer heavily from local minima. These minima are due to singularities in the unweighted error function that arise from the normalization of the bilinear constraints (Eq. (6)) by ∥A(x)t∥. As a consequence, a singularity exists for each feature where t ∝ (x, y, 1)T. Under certain conditions, which are not uncommon in real-world optic flow fields, these singularities interact and influence larger regions of heading space [3], [4]. Optimal
Experiments
In this section, the proposed method is extensively compared to some of the algorithms discussed in Section 3. First, in Section 5.1, the algorithms are compared in terms of accuracy of the parameter estimates. This evaluation involves synthetic data only and is applied to both optimal and non-optimal algorithms. Next, in Section 5.2, the proposed method’s superior robustness to local minima as compared to other optimal algorithms is demonstrated. For this purpose, a synthetic problem is
Discussion
We have presented a novel method that reduces the sensitivity to local minima of optimal egomotion-estimation algorithms by gradually increasing the problem complexity during the optimization. We have demonstrated that the local minima encountered by these algorithms are related to the feature (or feature cluster) locations and, as such, their values can be arbitrary and unrelated to the true solution. This makes these algorithms hard to use in practical applications.
As a remedy, it has been
Acknowledgments
Thanks to Dr. Tong Zhang and Dr. Tina Y. Tian and coworkers for providing the source code of some of the egomotion algorithms used in this paper. Thanks also to Dr. Temujin Gautama for helpful suggestions on the manuscript. Karl Pauwels and Marc M. Van Hulle are supported by the Belgian Fund for Scientific Research—Flanders (G.0248.03, G.0234.04), the Flemish Regional Ministry of Education (Belgium) (GOA 2000/11), the Belgian Science Policy (IUAP P5/04), and the European Commission
References (15)
- et al.
Passive navigation
Computer Graphics and Image Processing
(1983) - et al.
Optimal structure from motion: local ambiguities and global estimates
International Journal of Computer Vision
(2000) - et al.
On the consistency of instantaneous rigid motion estimation
International Journal of Computer Vision
(2002) - et al.
Understanding the behavior of SFM algorithms: a geometric approach
International Journal of Computer Vision
(2003) The least-squares error for structure from infinitesimal motion
International Journal of Computer Vision
(2005)- et al.
Subspace methods for recovering rigid motion I: Algorithm and implementation
International Journal of Computer Vision
(1992) - et al.
Linear differential algorithm for motion recovery: a geometric approach
International Journal of Computer Vision
(2000)
Cited by (14)
Joint direct estimation of 3D geometry and 3D motion using spatio temporal gradients
2021, Pattern RecognitionCitation Excerpt :Zhang [4] considered a two-step iterative approach that estimates the translation separately from the rotation, using numerical Gauss-Newton approximation. Pauwels and van Halle [3] proposed a method that integrates depth cues into the minimization of the squared-distance image reprojection error. This method improved previous works by reducing the risk of falling into local minima.
A review and evaluation of methods estimating ego-motion
2012, Computer Vision and Image UnderstandingCitation Excerpt :Unlike their descriptive review, we presented methods in terms of constraints and optimization techniques, implemented and evaluated these methods. Using their definition of categories we included a motion parallax model [54,34], error minimization models [9,30,67,10,49], a template model [50,51], and a neural network model [36,35]. Here, we presented the model of Lappe and Rauschecker [36] in the context of probabilistic constraints evaluated within a Hough transform scheme.
Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes
2023, Proceedings of the IEEE International Conference on Computer VisionCalibration-free visual-inertial fusion with deep convolutional recurrent neural networks
2019, Proceedings of the 32nd International Technical Meeting of the Satellite Division of the Institute of Navigation, ION GNSS+ 2019Performance evaluation of ego-motion methods in static environment
2015, 2015 23rd Signal Processing and Communications Applications Conference, SIU 2015 - Proceedings