Video object tracking using adaptive Kalman filter

doi:10.1016/j.jvcir.2006.03.004

Journal of Visual Communication and Image Representation

Volume 17, Issue 6, December 2006, Pages 1190-1208

https://doi.org/10.1016/j.jvcir.2006.03.004 Get rights and content

Abstract

In this paper, a new video moving object tracking method is proposed. In initialization, a moving object selected by the user is segmented and the dominant color is extracted from the segmented target. In tracking step, a motion model is constructed to set the system model of adaptive Kalman filter firstly. Then, the dominant color of the moving object in HSI color space will be used as feature to detect the moving object in the consecutive video frames. The detected result is fed back as the measurement of adaptive Kalman filter and the estimate parameters of adaptive Kalman filter are adjusted by occlusion ratio adaptively. The proposed method has the robust ability to track the moving object in the consecutive frames under some kinds of real-world complex situations such as the moving object disappearing totally or partially due to occlusion by other ones, fast moving object, changing lighting, changing the direction and orientation of the moving object, and changing the velocity of moving object suddenly. The proposed method is an efficient video object tracking algorithm.

Introduction

The researches for segmenting, estimating, and tracking semantic objects in video have received great attention for the last few years [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26]. The moving object tracking is an important issue in video system, such as surveillance, sports reporting, video annotation, and traffic management system. However, if the background and moving objects vary dynamically, such situations will be complicated. In video analysis, we have to know the features of moving objects, such as color, texture, and shape, etc., so that the moving object can be detected and be tracked. There are some situations for video in a real world environment, including camera lens is fixed or not, multiple moving objects, rigid object, or non-rigid object, occlusion by the other objects, one or many cameras, full automatic or semi-automatic semantic object tracking, etc. According to the above discussion, to conclude, we will encounter the following three problems for tracking moving object.

■ Initial moving object segmenting problem.
■ Detection of moving object.
■ Tracking the moving target in occlusion.

In the initial moving object segmenting problem, the methods of [5], [9], [14] use the user’s assistance to appoint a moving object as target. This is called semi-automatic system. Or, in the environment of still background, [3], [8] apply the difference information of two consecutive frames to extract the moving objects. For the sake of simplicity, to solve this problem, most of methods assume that they are in the environment of still background in the initial state. Although the semi-automatic system simplifies the initial moving object problem, its applications are still popular in real systems.

About the detection of moving object, most of techniques base on the features of color, texture, shape, edge, motion, and shape. Generally, the color feature is frequently used [9], [14], [17], [20], since the human perception is sensitive to the color. However, the disadvantage of the single color feature for moving object detection is that it can only be used in the target object with uniformly distributed color. In addition, Jang and Choi [2] propose an active model to detect and track the moving object. However, the method [2] with the Kalman filter technique which predicts the detecting range to reduce the computational complexity can not be applied to solve the target in occlusion. Therefore, for tracking the moving target in occlusion, Jang and Choi in paper [1] propose the structural Kalman filter to estimate the motion information under a deteriorating condition as occlusion. The structural Kalman filter is a composite of two types of the Kalman filters: cell Kalman filters and relation Kalman filters. Their method partitions an object into several sub-regions and utilizes the relational information among sub-regions of a moving object. The cell Kalman filter estimates motion information of each sub-region of a target, and the relation Kalman filter is to estimate the relative relationship between two adjacent sub-regions. When a sub-region is judged not to be occluded, the cell Kalman filter of the sub-region is enough to estimate its motion information. If a sub-region is judged to be occluded, the relation Kalman filters of the adjacent sub-region are used to compensate the corrupted estimate of the cell Kalman filter. The compensation weighting is set by the degree of occlusion. The idea of the structural Kalman filter method is good, however, its applications are limited since the structural Kalman filter is very complicated and it is not easy to select the criteria or features to partition an object into sub-regions in the different real-world tracking application. In addition, the degree of occlusion needs the other model to judge. More importantly, a complex system is difficult to be expanded to keep tracking multiple objects.

Additionally, many works have been done by motion model based human recognition and estimating human body postures [21], [22], [23], [24], [25]. In general, the above methods offer resistance for the tracker to cope with partial occlusions, changing light condition, and object pose. However, these methods only adapt to human tracking, and more importantly, their computation for model estimation is expensive and the number of model parameters is usually large. And, these methods handle partial occlusions only, failing for severe or complete occlusions. Furthermore, according to the suggestions of the paper [17], for a large range of tracking applications where the object motion is rigid or the object is sufficiently distant from the camera, the tracking system will not need to separate model for the object shape since it can be determined from the object motion. And, tracking on object appearance rather than geometry is easier due to better identification power of appearance features.

In the paper [17], H.T. Nguyen and A.W.M. Smeulders think that the tracking algorithm needs to satisfy the two qualities: simplicity and robustness. Simplicity implies that the algorithm is easy to implement and has the minimum number parameters. Robustness implies the ability of the algorithm to track objects under different conditions which include: severe occlusions, lighting changes, object orientation changing, background clutter, a moving camera, zooms, and rotations. Therefore, in this paper, a simple and efficient method for tracking object using adaptive Kalman filter model is developed and its the robustness quality can be applied to a real-world environment. In the following section, the proposed method is shown. The experimental results are demonstrated in Section 3. Finally, the Section 4 brings to the conclusion and discussion.

Section snippets

The proposed method

In this paper, the initial moving object is selected by user. Because there may exist more than one moving objects in video, therefore the tracking target should be determined according to the user interested. After the selection, we initially segment the object in frame t by the difference of three consecutive frames, t − 1, t, t + 1, and then apply the region growing algorithm to segment the desired object. Afterwards, the dominant color feature of moving target is extracted from the RGB color

Experimental results

In this section, we make comparisons between the proposed adaptive Kalman filter method and the others including moving object detection method without Kalman filter involved and typical Kalman filter. About experimental tools, the PC with AMD XP 2400 processor, Window XP operation system, and Borland C++ Builder 6.0 are used. In addition, the video image consists of bitmap image sequence with 352 × 240 pixels per frame. Some kinds of experimental videos are used to evaluate the robust ability of

Conclusion and discussion

In this paper, an effective adaptive Kalman filter is proposed to track the moving object. In the proposed adaptive Kalman filter, the occlusion rate is used to adjust the error covariance of Kalman filter adaptively. The method can track the moving object in real-time. It successfully estimates the object’s position in some kinds of real-world situations such as the fast moving object, partial occlusion, long-lasting occlusion, changing lighting, changing the direction and orientation of the

Acknowledgments

The authors thank the anonymous reviewers for their valuable comments and suggestions. This work was supported by I-Shou University under Grant number ISU-94-01-16.

Shiuh-Ku Weng received the B.S. degree from the Feng Chia University in information engineering, Taiwan, in 1991 and both M.S. and Ph.D. degrees were obtained in electronic engineering from Chung Yuan Christian University, Taiwan, in 1993 and Chung Cheng Institute of Technology, National Defense University, Taiwan, in 1997, respectively. In 1999, he joined the faculty of the Department of Electrical Engineering at Chinese Naval Academy for four years. In 2003, he was invited to join the

References (28)

Dae-Sik Jang et al.
2D human body tracking with structural Kalman filter
Pattern Recognition
(2002)
Dae-Sik Jang et al.
Active models for tracking moving objects
Pattern Recognition
(2000)
Johnson I. Agbinya et al.
Multi-object tracking in video
Real-Time Imaging
(1999)
T.B. Moeslund et al.
A survey of computer vision-based human motion capture
Computer Vision and Image Understanding
(2001)
Shao-Yi Chien et al.
Efficient moving object segmentation algorithm using background registration technique
IEEE Transactions on Circuits and Systems for Video Technology
(2002)
Changick Kim et al.
Fast and automatic video object segmentation and tracking for content-based applications
IEEE Transactions on Circuits and Systems for Video Technology
(2002)
Dong Kwon Park et al.
Fast object tracking in digital video
IEEE Transactions on Consumer Electronics
(2000)
Greg Welch, Gary Bishop, An Introduction to the Kalman Filter, UNC-Chapel Hill, TR 95-041, March 11,...
Yiwei Wang, John F. Doherty, Robert E. Van Dyck, Moving object tracking in video, in: Proceedings of the 29th workshop...
Alan J. Lipton, Hironobu Fujiyoshi, Raju S. Patil, Moving target classification and tracking from real-time video,...

Roberta Piroddi et al.

Multiple-feature spatiotemporal segmentation of moving sequences using a rule-based approach

British Machine Vision Conference

(2002)

Volker Rehrmann, Object oriented motion estimation in color image sequences, in: ECCV ’98, vol. I, Lecture Notes of...

Michael Mason, Zoran Duric, Using histograms to detect and track objects in color video, Applied Imagery Pattern...

Georgi D. Borshukov et al.

Motion segmentation by multistage affine classification

IEEE Transactions on Image Processing

(1997)

Cited by (307)

Optimal estimation of broiler movement for commercial tracking
2023, Smart Agricultural Technology
Nowadays, video tracking has taken a considerable part in monitoring systems. It allows identifying and follow every object in the camera field over time. While most of these algorithms are rather well suited to regular movements (following cars, pedestrians), they are often limited in more complex situations (high variations in speed, low detection rate, frequent shape variation). This paper proposes three methods adapted to broilers tracking in commercial environment. Past movements analysis of known broilers enable to estimate their motions and therefore to predict their new position. New unidentified broilers positions are then compared to these predicted positions. Distances between these two sets of positions are then used in the Hungarian Algorithm to assign an ID to new detected broilers regarding their past positions. Our methods differentiate by the way they predict the future positions. Contrary to most methods, they do not seek perfect regularity of movements and can deal with low rate detection. The proposed methods showed better performances than existing one. Tests have been made at 21, 26, and 37 days of age. At 21 days, our best method produces up to 35% fewer errors than a method with no estimation of movement. At 26 days of age, displacement distances can be set to only 68% of the maximum recorded displacement while improving an average of 21% of tracking errors across all methods.
AMF-MSPF: A retrospective analysis with online object tracking algorithms
2023, Displays
Surveillance systems employ object tracking methods, a class of automatic data analysis algorithms, to accurately locate the target or its trajectory under a wide range of constraints in an uncontrolled environment. The best object tracking algorithm should be able to track under a range of constraints over long periods of time. Moreover, since object tracking algorithms are preliminary stages to these higher-level applications, they must be assigned a fraction of system resources to save computational power for more intensive higher-level decision-making tasks. Object tracking algorithms depend on a pipeline of components such as preprocessing stage, object detection, feature extraction/selection and fusion, and the search mechanism for object tracking. These multidimensional components are the tuning parameters for providing a balance between robustness, accuracy, and real-time performance of surveillance systems. In this review paper, the focus is on the detection-free class of tracking, where unlike the track-by-detection and end-to-end tracking, the object is initialized manually.
The first aim of this review study is: a literature review on the multi-dimensional research trends with a focus on improving online object-tracking algorithms, and the second part is an experimental evaluation of our previously proposed Adaptive Multi-Feature Framework for MSPF (AMF-MSPF). AMF-MSPF is an MSPF-based adaptive framework that works under an online feature selection mechanism. However, a limited sequence of videos was used from the CAVIAR and PETS datasets to evaluate the AMF-MSPF. Therefore, we felt that an extensive evaluation and comparison, with online state-of-the-art tracking methods in the detection-free category, are necessary on a richer dataset to thoroughly investigate the effectiveness of AMF-MSPF and highlight its pitfalls. Twenty sequences were selected from the Amsterdam Library of Ordinary Video (ALOV++), a popular dataset with the tracking community, were selected to compare the results of the AMF-MSPF with reference methods in the online category of tracking. ALOV++ is a publically available dataset that is popular for its broad constraints, and more importantly the results of the reference methods are also publically available on this dataset. While the extensive experimentation has demonstrated the effectiveness of the AMF-MSPF against many challenging scenarios, nevertheless, this experimental analysis has highlighted some performance measures where the AMF-MSPF framework fell short of achieving even moderately satisfactory results. This research paper points toward these observations to caution researchers and advise them on what future research should concentrate on while developing generalized real-time tracking systems that would be effective under many scenarios. This research would benefit researchers and engineers working on surveillance systems to identify and tune these parameters for performance gains.
A survey of detection-based video multi-object tracking
2022, Displays
Multiple Object Tracking (MOT) has emerged as a hot issue in the field of computer vision recently. MOT has academic and commercial potential in urban public security, military, self-driving and other fields. Although researchers have put forward different approaches to solve the problem, it is still challenging due to factors such as appearance mutation and severe object occlusion. This study makes a comprehensive research on the problem of detection-based video multi-object tracking. The main contributions of this review are as follows: (1) The important aspects of MOT system are studied, including the basic principles, main framework, technical difficulties, classification and evaluation; (2) This work review the classification and latest progress in object detection due to its close relevance to the field of MOT; (3) This work introduce a taxonomy based on the MOT problem on this basis, the original and improved versions of each classical tracking algorithm are discussed, analyzed and compared; (4) This work summarize the results of 40 algorithms on popular datasets to provide quantitative comparison and analysis of experimental results. Finally, the possible research directions for the future are put forward, by analyzing the commonness of several methods with excellent performance.
More robust object tracking via shape and motion cue integration
2022, Signal Processing
Most current trackers utilise an appearance model to localise the target object in each frame. However, such approaches often fail when there are similar looking distractor objects in the surrounding background. This paper promotes an approach that can be combined with many existing trackers to tackle this issue and improve tracking robustness. The proposed approach makes use of two additional cues to target location: shape cues which are exploited through offline training of the appearance model, and motion cues which are exploited online to predict the target object’s future position based on its history of past locations. Combining these additional mechanisms with the existing trackers SiamFC, SiamFC++, Super_DiMP and ARSuper_DiMP all resulted in an increase in the tracking accuracy compared to that achieved by the corresponding underlying tracker alone. When combined with ARSuper_DiMP the resulting tracker is shown to outperform all popular state-of-the-art trackers on three benchmark datasets (OTB-100, NFS, and LaSOT), and produce performance that is competitive with the state-of-the-art on the UAV123, Trackingnet, GOT-10K and VOT2020 datasets.
Multi-modal visual tracking: Review and experimental comparison
2024, Computational Visual Media
Enhancing Embedded Object Tracking: A Hardware Acceleration Approach for Real-Time Predictability
2024, Journal of Imaging

View all citing articles on Scopus

Chung-Ming Kuo received the B.S. degree from the Chinese Naval Academy, Kaohsiung, Taiwan, in 1982, the M.S. and Ph.D. degrees all in electrical engineering from Chung Cheng Institute of Technology, Taiwan, in 1988 and 1994, respectively. From 1988 to 1991, he was an instructor in the Department of Electrical Engineering at Chinese Naval Academy. Since January 1995, he has been an associate professor at the same institution. From 2000 to 2003, he was an associate professor in the Department of Information Engineering at I-Shou University. Since February 2004, he has been Professor at the same institution. His research interests include image/video compression, image/video feature extraction, understanding and retrieving, motion estimation and tracking, multimedia signal processing, and optimal estimation.

Shu-Kang Tu was born in Kaohsiung, Taiwan, R.O.C., in 1978. He received the B.E., and M.E. degree from the Department of Information Engineering, I-Shou University, Kaohsiung, R.O.C., in 2002, and 2004. He is working in Best Wise International Computing CO., LTD, currently. His research interests include video tracking and segmentation algorithms.

View full text

Video object tracking using adaptive Kalman filter

Abstract

Introduction

Section snippets

The proposed method

Experimental results

Conclusion and discussion

Acknowledgments

Pattern Recognition

Pattern Recognition

Real-Time Imaging

Computer Vision and Image Understanding

Efficient moving object segmentation algorithm using background registration technique

IEEE Transactions on Circuits and Systems for Video Technology

Fast and automatic video object segmentation and tracking for content-based applications

IEEE Transactions on Circuits and Systems for Video Technology

Fast object tracking in digital video

IEEE Transactions on Consumer Electronics

Multiple-feature spatiotemporal segmentation of moving sequences using a rule-based approach

British Machine Vision Conference

Motion segmentation by multistage affine classification

IEEE Transactions on Image Processing