research-article

Cross-domain traffic scene understanding by motion model transfer

Authors:

Timothy HospedalesAuthors Info & Claims

ARTEMIS '13: Proceedings of the 4th ACM/IEEE international workshop on Analysis and retrieval of tracked events and motion in imagery stream

Pages 77 - 86

https://doi.org/10.1145/2510650.2510657

Published: 21 October 2013 Publication History

Abstract

This paper proposes a novel framework for cross-domain traffic scene understanding. Existing learning-based outdoor wide-area scene interpretation models suffer from requiring long term data collection in order to acquire statistically sufficient model training samples for every new scene. This makes installation costly, prevents models from being easily relocated, and from being used in UAVs with continuously changing scenes. In contrast, our method adopts a geometrical matching approach to relate motion models learned from a database of source scenes (source domains) with a handful sparsely observed data in a new target scene (target domain). This framework is capable of online ''sparse-shot'' anomaly detection and motion event classification in the unseen target domain, without the need for extensive data collection, labelling and offline model training for each new target domain. That is, trained models in different source domains can be deployed to a new target domain with only a few unlabelled observations and without any training in the new target domain. Crucially, to provide cross-domain interpretation without risk of dramatic negative transfer, we introduce and formulate a scene association criterion to quantify transferability of motion models from one scene to another. Extensive experiments show the effectiveness of the proposed framework for cross-domain motion event classification, anomaly detection and scene association.

References

[1]

D. H. Douglas and T. K. Peucker. Algorithms for the reduction of the number of points required to represent a digitized line or its caricature. Cartographica: The International Journal for Geographic Information and Geovisualization, 10(2):112--122, Oct. 1973.

[2]

Federal Highway Administration. Next generation simulation (ngsim) dataset. http://ops.fhwa.dot.gov/trafficanalysistools/ngsim.htm.

[3]

A. Goshtasby. Image registration by local approximation methods. Image Vision Comput., 6(4):255--261, 1988.

Digital Library

[4]

J. Hershey and P. Olsen. Approximating the kullback leibler divergence between gaussian mixture models. In ICASSP, 2007.

[5]

T. Hospedales, S. Gong, and T. Xiang. Video behaviour mining using a dynamic topic model. International Journal of Computer Vision, 98:303--323, 2012.

Digital Library

[6]

W. Hu, X. Xiao, Z. Fu, D. Xie, T. Tan, and S. J. Maybank. A system for learning statistical motion patterns. IEEE Transaction on Pattern Analysis and Machine Intelligence, 28(9):1450--1464, 2006.

Digital Library

[7]

S. Khokhar, I. Saleemi, and M. Shah. Similarity invariant classification of events by kl divergence minimization. In ICCV, 2011.

Digital Library

[8]

J. F. Kooij, G. Englebienne, and D. M. Gavrila. A non-parametric hierarchical model to discover behavior dynamics from tracks. In ECCV, 2012.

Digital Library

[9]

H. Kuhn. The Hungarian method for the assignment problem. Naval research logistics quarterly, 2(1--2):83--97, 1955.

[10]

J. Li, S. Gong, and T. Xiang. Learning behavioural context. International Journal of Computer Vision, 97(3):276--304, 2012.

Digital Library

[11]

J. Lin. Divergence measures based on the shannon entropy. IEEE Transactions on Information Theory, 37(1):145--151, 1991.

Digital Library

[12]

R. Mehran, A. Oyama, and M. Shah. Abnormal crowd behavior detection using social force model. In CVPR, 2009.

[13]

B. T. Morris and M. M. Trivedi. Learning and classification of trajectories in dynamic scenes: A general framework for live video analysis. In AVSS, 2008.

Digital Library

[14]

B. T. Morris and M. M. Trivedi. A survey of vision-based trajectory learning and analysis for surveillance. IEEE Transaction on Circuits and Systems for Video Technology, 18(8):1114--1127, 2008.

Digital Library

[15]

J. Nocedal and S. Wright. Numerical optimization. Springer-Verlag, 2nd edition, 2006.

[16]

S. J. Pan and Q. Yang. A survey on transfer learning. IEEE Transsaction on Knowledge and Data Engineering, 22(10):1345--1359, Oct. 2010.

Digital Library

[17]

J. Prokaj, X. Zhao, and G. G. Medioni. Tracking many vehicles in wide area aerial surveillance. In CVPR Workshops, 2012.

[18]

I. Saleemi, L. Hartung, and M. Shah. Scene understanding by statistical modeling of motion patterns. In CVPR, 2010.

[19]

A. Sodemann, M. Ross, and B. Borghetti. A review of anomaly detection in automated surveillance. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 42(6):1257--1272, 2012.

Digital Library

[20]

O. Tuzel, F. Porikli, and P. Meer. Region covariance: A fast descriptor for detection and classification. In ECCV, 2006.

Digital Library

[21]

X. Wang, X. Ma, and E. Grimson. Unsupervised activity perception by hierarchical bayesian models. In CVPR, 2007.

[22]

L. Zelnik-Manor and P. Perona. Self-tuning spectral clustering. In NIPS, 2004.

Digital Library

[23]

G. Zen, E. Ricci, and N. Sebe. Exploiting sparse representations for robust analysis of noisy complex video scenes. In ECCV, 2012.

Digital Library

Cited By

Zheng QHe ZLiang CChen JLin CTao D(2020)Transferring fashion to surveillance with weak labelsNeural Computing and Applications10.1007/s00521-020-05528-935:18(13021-13035)Online publication date: 23-Nov-2020
https://doi.org/10.1007/s00521-020-05528-9
Xu XHospedales TGong S(2017)Discovery of Shared Semantic Spaces for Multiscene Video Query and SummarizationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2016.253271927:6(1353-1367)Online publication date: Jun-2017
https://doi.org/10.1109/TCSVT.2016.2532719
Ashok Kumar PVaidehi V(2017)A transfer learning framework for traffic video using neuro-fuzzy approachSādhanā10.1007/s12046-017-0705-x42:9(1431-1442)Online publication date: 4-Aug-2017
https://doi.org/10.1007/s12046-017-0705-x

Index Terms

Cross-domain traffic scene understanding by motion model transfer
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
  2. Computer graphics
    1. Animation
      1. Motion capture
      2. Motion processing
    2. Image manipulation

Recommendations

Spectral domain-transfer learning
KDD '08: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining

Traditional spectral classification has been proved to be effective in dealing with both labeled and unlabeled data when these data are from the same domain. In many real world applications, however, we wish to make use of the labeled data from one ...
Cross-Domain Traffic Scene Understanding by Integrating Deep Learning and Topic Model
Understanding cross-domain traffic scenarios from multicamera surveillance network is important for environmental perception. Most of existing methods select the source domain which is most similar to the target domain by comparing entire domains for ...
Deep Transfer Learning for Cross-domain Activity Recognition
ICCSE'18: Proceedings of the 3rd International Conference on Crowd Science and Engineering

Human activity recognition plays an important role in people's daily life. However, it is often expensive and time-consuming to acquire sufficient labeled activity data. To solve this problem, transfer learning leverages the labeled samples from the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ARTEMIS '13: Proceedings of the 4th ACM/IEEE international workshop on Analysis and retrieval of tracked events and motion in imagery stream

October 2013

94 pages

ISBN:9781450323932

DOI:10.1145/2510650

General Chairs:
Anastasios Doulamis
Technical University of Crete/National Technical University of Athens, Greece
,
Marco Bertini
University of Florence, Italy
,
Nikolaos Doulamis
National Technical University of Athens, Greece
,
Jordi Gonzàlez
Universitat Autònoma de Barcelona, Spain
,
Program Chair:
Athanasios Voulodimos
National Technical University of Athens, Greece

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2013

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MM '13

Sponsor:

SIGMM

MM '13: ACM Multimedia Conference

October 21, 2013

Barcelona, Spain

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
124
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 23 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Zheng QHe ZLiang CChen JLin CTao D(2020)Transferring fashion to surveillance with weak labelsNeural Computing and Applications10.1007/s00521-020-05528-935:18(13021-13035)Online publication date: 23-Nov-2020
https://doi.org/10.1007/s00521-020-05528-9
Xu XHospedales TGong S(2017)Discovery of Shared Semantic Spaces for Multiscene Video Query and SummarizationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2016.253271927:6(1353-1367)Online publication date: Jun-2017
https://doi.org/10.1109/TCSVT.2016.2532719
Ashok Kumar PVaidehi V(2017)A transfer learning framework for traffic video using neuro-fuzzy approachSādhanā10.1007/s12046-017-0705-x42:9(1431-1442)Online publication date: 4-Aug-2017
https://doi.org/10.1007/s12046-017-0705-x

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents