Human tracking from a mobile agent: Optical flow and Kalman filter arbitration

https://doi.org/10.1016/j.image.2011.06.005Get rights and content

Abstract

Tracking moving objects is one of the most important but problematic features of motion analysis and understanding. The Kalman filter (KF) has commonly been used for estimation and prediction of the target position in succeeding frames. In this paper, we propose a novel and efficient method of tracking, which performs well even when the target takes a sudden turn during its motion. The proposed method arbitrates between KF and Optical flow (OF) to improve the tracking performance. Our system utilizes a laser to measure the distance to the nearest obstacle and an infrared camera to find the target. The relative data is then fused with the Arbitrate OFKF filter to perform real-time tracking. Experimental results show our suggested approach is very effective and reliable for estimating and tracking moving objects.

Highlights

► An autonomous mobile robot is designed to track a human using an infrared camera. ► The proposed method arbitrates between Kalman filter and Optical flow to improve the tracking performance. ► The algorithm has been successfully tested in real-time tracking of a man in an indoor lab environment. ► Compare the results to other approaches such as particle filters.

Introduction

Human tracking using mobile robots serves a lot of attention because an automated system estimating and tracking moving objects has many potential applications in the field of surveillance and engineering [46]. The mobile robot target tracking mechanism essentially needs the motion analysis of the target behavior. A number of approaches on prediction and tracking are based on the traditional Kalman filter (KF) [10], [11], [12], [13], [14], [15]. In the KF approach, it is presumed that the behavior of a moving target could be characterized by a predefined model, and the models can be represented in terms of a state vector. In reality, however, these models fail to characterize the motion of moving targets accurately. As a result of this, KF fails to track a target, especially when there are occlusions caused by other objects or sudden changes in the trajectory of the motion [13]. In this paper, we propose an efficient method of tracking the target called Arbitrate OFKF, which will overcome the abovementioned problem and work successfully even when sudden changes in trajectory occur. This proposed method arbitrates the output of OF and KF depending on the trajectory of the previous motion. Our arbitration algorithm is such that the output, during sharp turns, will be taken from OF algorithm, and, in all other instances, from KF algorithm.

There are some visual restrictions in which any methods cannot be applicable, for instance in industrial applications, like the ones examined in visual environments and applications [48], airports with crowded conditions, industrial process monitoring [49] and environmental monitoring. We have exploited an infra-camera for visual content, in which the presented technique is applicable as other types of visual contents that can potentially solve these existing issues in [48], [49].

The paper is organized as follows. Section 2 introduces the related work in the tracking using a mobile robot. Section 3 explains the piecewise constant acceleration model for implementation of KF. In Section 4, a solution for improving KF based tracking using arbitration between KF and OF. Section 5 reports experimental results and considerations. Finally, conclusions and future work are illustrated in Section 6.

Section snippets

Mobile robot

Tracking people by the use of a mobile robot is an essential task for the coming generation of service and human-interaction robots. In the field of computer vision, the tracking of moving objects by mobile robot is a relatively hard problem. The system described in [1] uses a sample-based joint probabilistic data association filter for human tracking with a mobile robot. A fuzzy logic approach is used for navigation of the mobile robot [2] whereas the neurofuzzy-based approach for tracking

Traditional methods: Framing the model of target tracking

First, we choose a state vector to contain information about the position and the velocity of the man. We then update the state according to our model called the constant acceleration model, in which the value of acceleration is calculated from the last few states, using a Taylor series expansion of the present expansion. The state can thus be thought of as being updated through the equations:xk+1=Axk+Bdxkdtk+wk

And the output equations may be written asyk=Cxk+zk,where A, B and C are matrices of

Proposed pan-tilt operation

In the target region of interest (ROI) tracking, we need to find the required pan and tilt angle for the specific camera configuration as shown in Fig. 2 so that the camera head can rotate to track the target. In other words, this is the angle vector θ required to make image center coincident with the target centroid:θ=[θxθy],where the rotational angle (θx) with respect of the center of the image is the only angle associated with pan defined by ΔX. It is similar with (θy), when there is only

Arbitration of Optical flow and Kalman filter

We have applied the Optical flow as shown in the flow chart in Fig. 4 for the tracking process. We further propose the Arbitrate OF and KF algorithm to calculate the estimated value of the pan angle, which will be used for our prediction. In our proposed method, both KF and OF will predict the value of the pan angle but, depending on the situation, we will decide which value should be fed to the robot for tracking the target. If the man walks smoothly, then the pan angle value predicted by KF

Experimental results

This section compares the results obtained from algorithms using only KF, only OF, and finally the Arbitrate OFKF model developed in this paper. Since we have tracked a single point in the KF, we have used only one feature point in the OF for comparisons. A final comparison is made between Arbitrate OFKF and the particle filter at the end of the section. In the series of the experiments, we have specifically evaluated pan angle values.

Conclusion

In this paper, a novel tracking method is proposed, which is able to predict a target position very efficiently even if the target object turns suddenly during its motion. The proposed method is based on an arbitration between OF and KF. It takes into consideration the trajectory of the target motion, and gives a much better result of tracking than individual OF or KF filters. In this paper, attention has been drawn to different scenarios where either the KF works better or the OF does. A

Acknowledgment

This study was supported in part by the School of Engineering at Virginia Commonwealth University (VCU) through the internship program between Indian Institute of Technology, Kharagpur and VCU. The first author also acknowledges the support of NSF Division of Electrical, Communications and Cyber Systems, CAREER Award #1054333.

References (49)

  • Y. Motai

    Visual-based human–robotic interaction for extracting salient features of an industrial object for an automated assembly system, Special Issue on Machine Vision

    International Journal of Computers in Industry

    (2005)
  • W. Schulz et al.

    People tracking with mobile robots using sample-based joint probabilistic data association filters

    International Journal of Robotics Research

    (2003)
  • A.M. Zhu et al.

    Neurofuzzy-based approach to mobile robot navigation in unknown environments

    IEEE Transactions on Systems Man and Cybernetics Part C—Applications and Reviews

    (2007)
  • S.J. Kim, W.K. Choi, H.T. Jeon, Intelligent robot control with personal digital assistants using fuzzy logic and neural...
  • J.B. Mbede et al.

    Fuzzy and recurrent neural network motion control among dynamic obstacles for robot manipulators

    Journal of Intelligent & Robotic Systems

    (2001)
  • E. Rimon et al.

    Exact robot navigation using artificial potential functions

    IEEE Transactions on Robotics and Automation

    (1992)
  • J. Borenstein et al.

    The vector field histogram—fast obstacle aviodance for mobile robots

    IEEE Transactions on Robotics and Automation

    (1991)
  • R. Polana, R. Nelson, Low level recognition of human motion (or how to get your man without finding his body parts),...
  • M. Mikic et al.

    Human body model acquisition and tracking using voxel data

    International Journal of Computer Vision

    (2003)
  • A.K. Rastogi et al.

    Design of a real-time tracking system for fast-moving objects

    IETE Journal of Research

    (1997)
  • X.D. Sun et al.

    Region of interest extraction and virtual camera control based on panoramic video capturing

    IEEE Transactions on Multimedia

    (2005)
  • X.P. Yun et al.

    Design, implementation, and experimental results of a quaternion-based Kalman filter for human body motion tracking

    IEEE Transactions on Robotics

    (2006)
  • Beymer,K. Konolige, Tracking people from a mobile platform, in: IJCAI-2001 Workshop on Reasoning with Uncertainty in...
  • D. Schulz et al.

    People tracking with mobile robots using sample-based joint probabilistic data association filters

    International Journal of Robotics Research

    (2003)
  • Cited by (50)

    • CNN and HOG based comparison study for complete occlusion handling in human tracking

      2020, Measurement: Journal of the International Measurement Confederation
      Citation Excerpt :

      It was aimed to distinguish pose change resulting from partial or complete occlusion. In a different study by Motai, et al. [45], KF and Optical Flow (OF) were used together for sudden maneuverable human tracking. Laser and infrared cameras were used for that study.

    • A novel fusion framework for robust human tracking by a service robot

      2017, Robotics and Autonomous Systems
      Citation Excerpt :

      Additionally, local feature descriptors are proven to be a good choice for image matching tasks on a mobile platform, where occlusions and missing objects can be handled [8,9]. These robust local feature descriptors are being increasingly used for visual object tracking applications [10,11,6,7]. Compared to SIFT and SURF which are computationally heavy, a number of light weight binary point features like BRIEF [12], BRISK [13], ORB [14], FREAK [15] and DAISY [16] have been discovered that are being used for applications such as place recognition in visual SLAM [17].

    View all citing articles on Scopus
    View full text