Video object tracking using adaptive Kalman filter

https://doi.org/10.1016/j.jvcir.2006.03.004Get rights and content

Abstract

In this paper, a new video moving object tracking method is proposed. In initialization, a moving object selected by the user is segmented and the dominant color is extracted from the segmented target. In tracking step, a motion model is constructed to set the system model of adaptive Kalman filter firstly. Then, the dominant color of the moving object in HSI color space will be used as feature to detect the moving object in the consecutive video frames. The detected result is fed back as the measurement of adaptive Kalman filter and the estimate parameters of adaptive Kalman filter are adjusted by occlusion ratio adaptively. The proposed method has the robust ability to track the moving object in the consecutive frames under some kinds of real-world complex situations such as the moving object disappearing totally or partially due to occlusion by other ones, fast moving object, changing lighting, changing the direction and orientation of the moving object, and changing the velocity of moving object suddenly. The proposed method is an efficient video object tracking algorithm.

Introduction

The researches for segmenting, estimating, and tracking semantic objects in video have received great attention for the last few years [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26]. The moving object tracking is an important issue in video system, such as surveillance, sports reporting, video annotation, and traffic management system. However, if the background and moving objects vary dynamically, such situations will be complicated. In video analysis, we have to know the features of moving objects, such as color, texture, and shape, etc., so that the moving object can be detected and be tracked. There are some situations for video in a real world environment, including camera lens is fixed or not, multiple moving objects, rigid object, or non-rigid object, occlusion by the other objects, one or many cameras, full automatic or semi-automatic semantic object tracking, etc. According to the above discussion, to conclude, we will encounter the following three problems for tracking moving object.

  • ■ Initial moving object segmenting problem.

  • ■ Detection of moving object.

  • ■ Tracking the moving target in occlusion.

In the initial moving object segmenting problem, the methods of [5], [9], [14] use the user’s assistance to appoint a moving object as target. This is called semi-automatic system. Or, in the environment of still background, [3], [8] apply the difference information of two consecutive frames to extract the moving objects. For the sake of simplicity, to solve this problem, most of methods assume that they are in the environment of still background in the initial state. Although the semi-automatic system simplifies the initial moving object problem, its applications are still popular in real systems.

About the detection of moving object, most of techniques base on the features of color, texture, shape, edge, motion, and shape. Generally, the color feature is frequently used [9], [14], [17], [20], since the human perception is sensitive to the color. However, the disadvantage of the single color feature for moving object detection is that it can only be used in the target object with uniformly distributed color. In addition, Jang and Choi [2] propose an active model to detect and track the moving object. However, the method [2] with the Kalman filter technique which predicts the detecting range to reduce the computational complexity can not be applied to solve the target in occlusion. Therefore, for tracking the moving target in occlusion, Jang and Choi in paper [1] propose the structural Kalman filter to estimate the motion information under a deteriorating condition as occlusion. The structural Kalman filter is a composite of two types of the Kalman filters: cell Kalman filters and relation Kalman filters. Their method partitions an object into several sub-regions and utilizes the relational information among sub-regions of a moving object. The cell Kalman filter estimates motion information of each sub-region of a target, and the relation Kalman filter is to estimate the relative relationship between two adjacent sub-regions. When a sub-region is judged not to be occluded, the cell Kalman filter of the sub-region is enough to estimate its motion information. If a sub-region is judged to be occluded, the relation Kalman filters of the adjacent sub-region are used to compensate the corrupted estimate of the cell Kalman filter. The compensation weighting is set by the degree of occlusion. The idea of the structural Kalman filter method is good, however, its applications are limited since the structural Kalman filter is very complicated and it is not easy to select the criteria or features to partition an object into sub-regions in the different real-world tracking application. In addition, the degree of occlusion needs the other model to judge. More importantly, a complex system is difficult to be expanded to keep tracking multiple objects.

Additionally, many works have been done by motion model based human recognition and estimating human body postures [21], [22], [23], [24], [25]. In general, the above methods offer resistance for the tracker to cope with partial occlusions, changing light condition, and object pose. However, these methods only adapt to human tracking, and more importantly, their computation for model estimation is expensive and the number of model parameters is usually large. And, these methods handle partial occlusions only, failing for severe or complete occlusions. Furthermore, according to the suggestions of the paper [17], for a large range of tracking applications where the object motion is rigid or the object is sufficiently distant from the camera, the tracking system will not need to separate model for the object shape since it can be determined from the object motion. And, tracking on object appearance rather than geometry is easier due to better identification power of appearance features.

In the paper [17], H.T. Nguyen and A.W.M. Smeulders think that the tracking algorithm needs to satisfy the two qualities: simplicity and robustness. Simplicity implies that the algorithm is easy to implement and has the minimum number parameters. Robustness implies the ability of the algorithm to track objects under different conditions which include: severe occlusions, lighting changes, object orientation changing, background clutter, a moving camera, zooms, and rotations. Therefore, in this paper, a simple and efficient method for tracking object using adaptive Kalman filter model is developed and its the robustness quality can be applied to a real-world environment. In the following section, the proposed method is shown. The experimental results are demonstrated in Section 3. Finally, the Section 4 brings to the conclusion and discussion.

Section snippets

The proposed method

In this paper, the initial moving object is selected by user. Because there may exist more than one moving objects in video, therefore the tracking target should be determined according to the user interested. After the selection, we initially segment the object in frame t by the difference of three consecutive frames, t  1, t, t + 1, and then apply the region growing algorithm to segment the desired object. Afterwards, the dominant color feature of moving target is extracted from the RGB color

Experimental results

In this section, we make comparisons between the proposed adaptive Kalman filter method and the others including moving object detection method without Kalman filter involved and typical Kalman filter. About experimental tools, the PC with AMD XP 2400 processor, Window XP operation system, and Borland C++ Builder 6.0 are used. In addition, the video image consists of bitmap image sequence with 352 × 240 pixels per frame. Some kinds of experimental videos are used to evaluate the robust ability of

Conclusion and discussion

In this paper, an effective adaptive Kalman filter is proposed to track the moving object. In the proposed adaptive Kalman filter, the occlusion rate is used to adjust the error covariance of Kalman filter adaptively. The method can track the moving object in real-time. It successfully estimates the object’s position in some kinds of real-world situations such as the fast moving object, partial occlusion, long-lasting occlusion, changing lighting, changing the direction and orientation of the

Acknowledgments

The authors thank the anonymous reviewers for their valuable comments and suggestions. This work was supported by I-Shou University under Grant number ISU-94-01-16.

Shiuh-Ku Weng received the B.S. degree from the Feng Chia University in information engineering, Taiwan, in 1991 and both M.S. and Ph.D. degrees were obtained in electronic engineering from Chung Yuan Christian University, Taiwan, in 1993 and Chung Cheng Institute of Technology, National Defense University, Taiwan, in 1997, respectively. In 1999, he joined the faculty of the Department of Electrical Engineering at Chinese Naval Academy for four years. In 2003, he was invited to join the

References (28)

  • Dae-Sik Jang et al.

    2D human body tracking with structural Kalman filter

    Pattern Recognition

    (2002)
  • Dae-Sik Jang et al.

    Active models for tracking moving objects

    Pattern Recognition

    (2000)
  • Johnson I. Agbinya et al.

    Multi-object tracking in video

    Real-Time Imaging

    (1999)
  • T.B. Moeslund et al.

    A survey of computer vision-based human motion capture

    Computer Vision and Image Understanding

    (2001)
  • Shao-Yi Chien et al.

    Efficient moving object segmentation algorithm using background registration technique

    IEEE Transactions on Circuits and Systems for Video Technology

    (2002)
  • Changick Kim et al.

    Fast and automatic video object segmentation and tracking for content-based applications

    IEEE Transactions on Circuits and Systems for Video Technology

    (2002)
  • Dong Kwon Park et al.

    Fast object tracking in digital video

    IEEE Transactions on Consumer Electronics

    (2000)
  • Greg Welch, Gary Bishop, An Introduction to the Kalman Filter, UNC-Chapel Hill, TR 95-041, March 11,...
  • Yiwei Wang, John F. Doherty, Robert E. Van Dyck, Moving object tracking in video, in: Proceedings of the 29th workshop...
  • Alan J. Lipton, Hironobu Fujiyoshi, Raju S. Patil, Moving target classification and tracking from real-time video,...
  • Roberta Piroddi et al.

    Multiple-feature spatiotemporal segmentation of moving sequences using a rule-based approach

    British Machine Vision Conference

    (2002)
  • Volker Rehrmann, Object oriented motion estimation in color image sequences, in: ECCV ’98, vol. I, Lecture Notes of...
  • Michael Mason, Zoran Duric, Using histograms to detect and track objects in color video, Applied Imagery Pattern...
  • Georgi D. Borshukov et al.

    Motion segmentation by multistage affine classification

    IEEE Transactions on Image Processing

    (1997)
  • Cited by (307)

    View all citing articles on Scopus

    Shiuh-Ku Weng received the B.S. degree from the Feng Chia University in information engineering, Taiwan, in 1991 and both M.S. and Ph.D. degrees were obtained in electronic engineering from Chung Yuan Christian University, Taiwan, in 1993 and Chung Cheng Institute of Technology, National Defense University, Taiwan, in 1997, respectively. In 1999, he joined the faculty of the Department of Electrical Engineering at Chinese Naval Academy for four years. In 2003, he was invited to join the Department of Information Management at the same school. His research interests include video object tracking, image segmentation, optimal estimation, and signal detection.

    Chung-Ming Kuo received the B.S. degree from the Chinese Naval Academy, Kaohsiung, Taiwan, in 1982, the M.S. and Ph.D. degrees all in electrical engineering from Chung Cheng Institute of Technology, Taiwan, in 1988 and 1994, respectively. From 1988 to 1991, he was an instructor in the Department of Electrical Engineering at Chinese Naval Academy. Since January 1995, he has been an associate professor at the same institution. From 2000 to 2003, he was an associate professor in the Department of Information Engineering at I-Shou University. Since February 2004, he has been Professor at the same institution. His research interests include image/video compression, image/video feature extraction, understanding and retrieving, motion estimation and tracking, multimedia signal processing, and optimal estimation.

    Shu-Kang Tu was born in Kaohsiung, Taiwan, R.O.C., in 1978. He received the B.E., and M.E. degree from the Department of Information Engineering, I-Shou University, Kaohsiung, R.O.C., in 2002, and 2004. He is working in Best Wise International Computing CO., LTD, currently. His research interests include video tracking and segmentation algorithms.

    View full text