Entity Association Using Context for Wide Area Motion Imagery Target Tracking

Blasch, Erik; Liang, Pengpeng; Shi, Xinchu; Li, Peiyi; Ling, Haibin

doi:10.1007/978-3-319-28971-7_21

Erik Blasch⁶,
Pengpeng Liang⁷,
Xinchu Shi⁷,
Peiyi Li⁷ &
…
Haibin Ling⁷

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

1395 Accesses

Abstract

Entity estimation includes tracking the numbers and types of targets in a scene and is challenging in large area surveillance due to high target density, severe similar target ambiguity, and a low sensor frame rate. Moving vehicle detection from wide area aerial surveillance can be aided by context information. In this paper, we utilize the maximum consistency context (MCC) as spatiotemporal information to estimate multiple targets, and temporal context (TC) to capture the road information. For a candidate association, the MCC is defined as the most consistent association in its neighborhood. Such a maximum selection chooses the reliable neighborhood context information while filtering out noisy and distracting data. In contrast with previous methods to exploit road information, TC does not need to get the location of the road first or use the geographical information systems’ (GIS) information. We first use background subtraction to generate the candidates and then build MCC/TC based on the candidates that have been classified as positive by histograms of oriented gradient (HOG) with multiple kernel learning (MKL). For each positive candidate, a region around the candidate is divided into several subregions based on the direction of the candidate, then each subregion is divided into 12 bins with a fixed length; and finally the TC, a histogram, is built according to the positions of the positive candidates in eight consecutive frames. In order to benefit from both the appearance and context information, we use MKL to combine MCC/TC and HOG. We demonstrate the usefulness of context modeling on multi-target tracking using three challenging wide area motion imagery (WAMI) sequences using the publicly available Columbus Large Image Format (CLIF) 2006 dataset. Both quantitative and qualitative results show clearly the effectiveness of using MCC and TC information, in comparison with algorithms that use no context information. Likewise, the experiments demonstrate that the proposed MCC/TC are useful to remove the false positives that are away from the road and the combination of TC and HOG with MKL outperforms the use of TC or HOG only.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Hardcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

S. Pellegrini, A. Ess, K. Schindler, L. Van Gool, You’ll never walk alone: modeling social behavior for multi-target tracking, in IEEE International Conference on Computer Vision (2009) pp. 261–268
Google Scholar
J. Xiao, H. Cheng, F. Han, H. Sawhney, Geo-spatial aerial video processing for scene understanding and object tracking, in IEEE Conference on Computer Vision and Pattern Recognition (2008), pp. 1–8
Google Scholar
J. Xiao, H. Cheng, H.S. Sawhney, F. Han, Vehicle detection and tracking in wide field-of-view aerial video, in CVPR, 2010
Google Scholar
E. Blasch, G. Seetharaman, K. Palaniappan, H. Ling, G. Chen, Wide-area motion imagery (WAMI) exploitation tools for enhanced situation awareness, in Proceedings of IEEE Applied Imagery Pattern Recognition (AIPR) Workshop: Computer Vision: Time for Change, 2012
Google Scholar
J. Prokaj, X. Zhao, G.G. Medioni, Tracking many vehicles in wide area aerial surveillance, in CVPR Workshops (2012), pp. 37–43
Google Scholar
J. Prokaj, G. Medioni, Using 3D scene structure to improve tracking, in IEEE Conference on Computer Vision and Pattern Recognition (2011), pp. 1337–1344
Google Scholar
V. Reilly, H. Idrees, M. Shah, Detection and tracking of large number of targets in wide area surveillance, in European Conference on Computer Vision (2010), pp. 186–199
Google Scholar
H. Ling, Y. Wu, E. Blasch, G. Chen, L. Bai, Evaluation of visual tracking in extremely low frame rate wide area motion imagery, in Proceedings of the International Conference on Information Fusion (FUSION), 2011
Google Scholar
K. Palaniappan, F. Bunyak, P. Kumar, I. Ersoy, S. Jaeger, K. Ganguli, A. Haridas, J. Fraser, R.M. Rao, G. Seetharaman, Efficient feature extraction and likelihood fusion for vehicle tracking in low frame rate airborne video, in Proceedings of the International Conference on Information Fusion (FUSION), 2010
Google Scholar
X. Shi, H. Ling, E. Blasch, W. Hu, Context-driven moving vehicle detection in wide area motion imagery, in International Conference on Pattern Recognition (ICPR), 2012
Google Scholar
S. Ali, V. Reilly, M. Shah, Motion and appearance contexts for tracking and re-acquiring targets in aerial videos, in IEEE Conference on Computer Vision and Pattern Recognition (2007), pp. 1–6
Google Scholar
V. Reilly, H. Idrees, M. Shah, Detection and tracking of large number of targets in wide area surveillance, in ECCV (3), 2010
Google Scholar
J. Prokaj, M. Duchaineau, G. Medioni, Inferring tracklets for multi-object tracking, in Workshop of Aerial Video Processing Joint with IEEE CVPR, 2011
Google Scholar
F. Bunyak, K. Palaniappan, S.K. Nath, G. Seetharaman, Flux tensor constrained geodesic active contours with sensor fusion for persistent object tracking. J. Multimedia 2(4), 20–33 (2007)
Article Google Scholar
G. Heitz, D. Koller, Learning spatial context: using stuff to find things, in ECCV (1), 2008
Google Scholar
A. Jain, A. Gupta, L.S. Davis, Learning what and how of contextual models for scene labeling, in ECCV (4), 2010
Google Scholar
S.K. Divvala, D. Hoiem, J. Hays, A.A. Efros, M. Hebert, An empirical study of context in object detection, in CVPR, 2009
Google Scholar
H. Myeong, J.Y. Chang, K. M. Lee, Learning object relationships via graph-based context model, in CVPR, 2012
Google Scholar
C. Galleguillos, B. McFee, S.J. Belongie, G.R.G. Lanckriet, Multi-class object localization by combining local contextual interactions, in CVPR, 2010
Google Scholar
Z. Niu, G. Hua, X. Gao, Q. Tian, Context aware topic model for scene recognition, in CVPR, 2012
Google Scholar
J. Porway, K. Wang, B. Yao, S.C. Zhu, A hierarchical and contextual model for aerial image understanding, in CVPR, 2008
Google Scholar
A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora, S. Belongie, Objects in context, in ICCV, 2007
Google Scholar
N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, in CVPR (1), 2005
Google Scholar
A. Jain, S.V.N. Vishwanathan, M. Varma, Spg-gmkl: generalized multiple kernel learning with a million kernels, in KDD, 2012
Google Scholar
M. Varma, B.R. Babu, More generality in efficient multiple kernel learning, in ICML, 2009
Google Scholar
M. Varma, D. Ray, Learning the discriminative power invariance trade-off, in ICCV, 2007
Google Scholar
M. G¨onen, E. Alpaydin, Multiple kernel learning algorithms. J. Mach. Learn. Res. 12, 2211–2268 (2011)
Google Scholar
O. Mendoza-Schrock, J.A. Patrick, E. Blasch, Video image registration evaluation for a layered sensing environment, in Proceedings of IEEE National Aerospace Electronics Conference (NAECON), 2009
Google Scholar
Y. Bar-Shalom, T. Fortmann, Tracking and Data Association (Academic Press, 1988)
Google Scholar
D. Reid, An algorithm for tracking multiple targets. TAC 24(6), 843–854 (1979)
Google Scholar
M. Yang, Y. Wu, G. Hua, Context-aware visual tracking. IEEE Trans. Pattern Anal. Mach. Intell. 31(7), 1195–1209 (2009)
Article Google Scholar
H. Grabner, J. Matas, L. Van Gool, P. Cattin, Tracking the invisible: learning where the object might be, in IEEE Conference on Computer Vision and Pattern Recognition (2010), pp. 1285–1292
Google Scholar
T. Zhao, R. Nevatia, Car detection in low resolution aerial image, in ICCV, 2001
Google Scholar
X. Shi, H. Ling, E. Blasch, W. Hu, Context-driven moving vehicle detection in wide area motion imagery, in International Conference on Pattern Recognition (ICPR), 2012
Google Scholar
J. Xiao, H. Cheng, H.S. Sawhney, F. Han, Vehicle detection and tracking in wide field-of-view aerial video, in CVPR, 2010
Google Scholar
P. Liang, G. Teodoro, H. Ling, E. Blasch, G. Chen, L. Bai, Multiple kernel learning for vehicle detection in wide area motion imagery, in International Conference on Information Fusion (FUSION), 2012
Google Scholar
K. Palaniappan, F. Bunyak, P. Kumar, I. Ersoy, S. Jaeger, K. Ganguli, A. Haridas, J. Fraser, R.M. Rao, G.S. Seetharaman, Efficient feature extraction and likelihood fusion for vehicle tracking in low frame rate airborne video, in Proceedings of the International Conference on Information Fusion (FUSION), 2010
Google Scholar
R. Pelapur, S. Candemir, F. Bunyak, M. Poostchi, G. Seetharaman, K. Palaniappan, Persistent target tracking using likelihood fusion in wide-area and full motion video sequences, in Proceedings of the International Conference on Information Fusion (FUSION), 2012
Google Scholar
H. Ling, Y. Wu, E. Blasch, G. Chen, L. Bai, Evaluation of visual tracking in extremely low frame rate wide area motion imagery, in Proceedings of the International Conference on Information Fusion (FUSION), 2011
Google Scholar
E. Blasch, G. Seetharaman, S. Suddarth, K. Palaniappan, G. Chen, H. Ling, A. Basharat, Summary of methods in wide-area motion imagery (WAMI), in Proceedings of SPIE, vol. 9089, 2014
Google Scholar
X. Shi, P. Li, W. Hu, E. Blasch, H. Ling, Using maximum consistency context for multiple target association in wide area traffic scenes, in Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013
Google Scholar
W.-S. Zheng, S. Gong, T. Xiang, Quantifying and transferring contextual information in object detection. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 762–777 (2012)
Article Google Scholar
Y. Ding, J. Xiao, Contextual boost for pedestrian detection, in CVPR, 2012
Google Scholar
Z. Song, Q. Chen, Z. Huang, Y. Hua, S. Yan, Contextualizing object detection and classification, in CVPR, 2011
Google Scholar
P.F. Felzenszwalb, R.B. Girshick, D.A. McAllester, D. Ramanan, Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Article Google Scholar
M. Andriluka, S. Roth, B. Schiele, People-tracking-by-detection and people-detection-by-tracking, in IEEE Conference on Computer Vision and Pattern Recognition (2008), pp. 1–8
Google Scholar
M.D. Breitenstein, F. Reichlin, B. Leibe, E. Koller-Meier, L. Van Gool, Online multiperson tracking-by-detection from a single, uncalibrated camera. IEEE Trans. Pattern Anal. Mach. Intell. 33(9), 1820–1833 (2011)
Article Google Scholar
H. Bay, T. Tuytelaars, L. Van Gool, Surf: speeded up robust features, in European Conference on Computer Vision (2006), pp. 404–417
Google Scholar
N. Dalal, B. Triggs, Histograms of oriented gradients for human detection. IEEE Conf. on Comput. Vis. Pattern Recogn. 1, 886–893 (2005)
Google Scholar
H.W. Kuhn, The hungarian method for the assignment problem. Naval Res. Logistics Quart. 2(1–2), 83–97 (1955)
Article MathSciNet MATH Google Scholar
M.J. Swain, D.H. Ballard, Color indexing. Int. J. Comput. Vis. 7(1), 11–32 (1991)
Article Google Scholar
C. Huang, B. Wu, R. Nevatia, Robust Object Tracking by Hierarchical Association of Detection Responses (2008), pp. 788–801
Google Scholar
C.H. Kuo, C. Huang, R. Nevatia, Multi-target tracking by on-line learned discriminative appearance models, in IEEE Conference on Computer Vision and Pattern Recognition (2010), pp. 685–692
Google Scholar
D.G. Lowe, Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
M. Muja, D.G. Lowe, Fast approximate nearest neighbors with automatic algorithm configuration, in International Conference on Computer Vision Theory and Application, 2009
Google Scholar
M.A. Fischler, R.C. Bolles, Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Article MathSciNet Google Scholar
E.C. Cho, S.S. Iyengar, G. Seetharaman, R. Holyer, M. Lybanon, Velocity Vectors for Features of Sequential Oceanographic Images
Google Scholar
AFRL: Columbus large image format (clif) 2006, https://www.sdms.afrl.af.mil/index.php?collection=clif2006
O. Mendoza-Schrock, J.A. Patrick, E. Blasch, Video image registration evaluation for a layered sensing environment, in Proceedings of IEEE National Aerospace Electronics Conference (NAECON), 2009
Google Scholar
C.-C. Chang, C.-J. Lin, LIBSVM: a library for support vector machines, ACM Trans. Intell. Syst. Technol. 2, 27:1–27:27 (2011), software available at http://www.csie.ntu.edu.tw/cjlin/libsvm
Google Scholar

Download references

Acknowledgments

This work is partly supported by the Air Force Office of Scientific Research (AFOSR) under the Dynamic Data Driven Application Systems program and the Air Force Research Lab. The work was supported in part by NSF Grants IIS-1218156 and IIS-1350521.

Author information

Authors and Affiliations

Air Force Research Lab, Rome, USA
Erik Blasch
Department of Computer and Information Sciences, Temple University, Philadelphia, USA
Pengpeng Liang, Xinchu Shi, Peiyi Li & Haibin Ling

Authors

Erik Blasch
View author publications
You can also search for this author in PubMed Google Scholar
Pengpeng Liang
View author publications
You can also search for this author in PubMed Google Scholar
Xinchu Shi
View author publications
You can also search for this author in PubMed Google Scholar
Peiyi Li
View author publications
You can also search for this author in PubMed Google Scholar
Haibin Ling
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Erik Blasch .

Editor information

Editors and Affiliations

Department Mathematics and Computer Science, University of Udine, Udine, Italy
Lauro Snidaro
University Carlos III, Department of Computer Science and Engg. University Carlos III, Colmenarejo, Spain
Jesús García
Center Multisource Info. Fusion & Dept, Univ. at Buffalo, of Industrial and Engg Center Multisource Info. Fusion & Dept, Buffalo, New York, USA
James Llinas
Information Directorate, Air Force Research Laboratory Information Directorate, Rome, New York, USA
Erik Blasch

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Blasch, E., Liang, P., Shi, X., Li, P., Ling, H. (2016). Entity Association Using Context for Wide Area Motion Imagery Target Tracking. In: Snidaro, L., García, J., Llinas, J., Blasch, E. (eds) Context-Enhanced Information Fusion. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-28971-7_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-28971-7_21
Published: 26 May 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28969-4
Online ISBN: 978-3-319-28971-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics