skip to main content
10.1145/2643188.2643191acmotherconferencesArticle/Chapter ViewAbstractPublication PagessccgConference Proceedingsconference-collections
research-article

Cheap rendering vs. costly annotation: rendered omnidirectional dataset of vehicles

Published:28 May 2014Publication History

ABSTRACT

Detection of vehicles in traffic surveillance needs good and large training datasets in order to achieve competitive detection rates. We are showing an approach to automatic synthesis of custom datasets, simulating various major influences: viewpoint, camera parameters, sunlight, surrounding environment, etc. Our goal is to create a competitive vehicle detector which "has not seen a real car before." We are using Blender as the modeling and rendering engine. A suitable scene graph accompanied by a set of scripts was created, that allows simple configuration of the synthesized dataset. The generator is also capable of storing rich set of metadata that are used as annotations of the synthesized images. We synthesized several experimental datasets, evaluated their statistical properties, as compared to real-life datasets. Most importantly, we trained a detector on the synthetic data. Its detection performance is comparable to a detector trained on state-of-the-art real-life dataset. Synthesis of a dataset of 10,000 images takes only several hours, which is much more efficient, compared to manual annotation, let aside the possibility of human error in annotation.

References

  1. Agarwal, S., Awan, A., and Roth, D. 2004. Learning to detect objects in images via a sparse, part-based representation. IEEE PAMI 26, 1475--1490. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bileschi, S. M. 2006. StreetScenes: Towards scene understanding in still images. PhD thesis, MIT. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Caraffi, C., Vojir, T., Trefny, J., Sochman, J., and Matas, J. 2012. A system for real-time detection and tracking of vehicles from a single car-mounted camera. In ITS Conference, 975--982.Google ScholarGoogle Scholar
  4. Dalal, N., and Triggs, B. 2005. Histograms of oriented gradients for human detection. In IEEE CVPR, 886--893. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dollár, P., Appel, R., Belongie, S., and Perona, P. 2014. Fast feature pyramids for object detection. IEEE PAMI.Google ScholarGoogle Scholar
  6. Fei-Fei, L., Fergus, R., and Perona, P. 2007. Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. Comput. Vis. Image Underst. 106, 59--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Felzenszwalb, P. F., Girshick, R. B., McAllester, D., and Ramanan, D. 2010. Object detection with discriminatively trained part based models. IEEE PAMI 32, 1627--1645. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Geiger, A., Lenz, P., Stiller, C., and Urtasun, R. 2013. Vision meets robotics: The KITTI dataset. IJRR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Griffin, G., Holub, A., and Perona, P. 2007. Caltech-256 object category dataset. Tech. Rep. CNS-TR-2007-001, California Institute of Technology.Google ScholarGoogle Scholar
  10. Gupte, S., Masoud, O., Martin, R., and Papanikolopoulos, N. 2002. Detection and classification of vehicles. IEEE Transactions on Intelligent Transportation Systems 3, 37--47. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kumar, P., Sengupta, K., and Lee, A. 2002. A comparative study of different color spaces for foreground and shadow detection for traffic monitoring system. In ITSC, 100--105.Google ScholarGoogle Scholar
  12. Li, J., and Zhang, Y. 2013. Learning SURF cascade for fast and accurate object detection. In IEEE CVPR, 3468--3475. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Morris, B., and Trivedi, M. 2008. Learning, modeling, and classification of vehicle track patterns from live video. IEEE Transactions on Intelligent Transportation Systems 9, 425--437. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Niknejad, H., Takeuchi, A., Mita, S., and McAllester, D. 2012. On-road multivehicle tracking using deformable object model and particle filter with improved likelihood estimation. IEEE Tran. ITS 13, 748--758.Google ScholarGoogle Scholar
  15. Papageorgiou, C., and Poggio, T. 2000. A trainable system for object detection. International Journal Computer Vision. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Russell, B. C., Torralba, A., Murphy, K. P., and Freeman, W. T. 2008. LabelMe: A database and web-based tool for image annotation. IJCV 77, 157--173. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., and Blake, A. 2011. Real-time human pose recognition in parts from single depth images. In IEEE CVPR, 1297--1304. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Sivaraman, S., and Trivedi, M. 2013. Looking at vehicles on the road: A survey of vision-based vehicle detection, tracking, and behavior analysis. IEEE Tran. ITS 14, 1773--1795.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Viola, P., and Jones, M. J. 2004. Robust real-time face detection. International Journal Computer Vision 57, 137--154. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yuan, Q., Thangali, A., Ablavsky, V., and Sclaroff, S. 2011. Learning a family of detectors via multiplicative kernels. IEEE PAMI 33, 514--530. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Zhou, B., Cao, J., Zeng, X., and Wu, H. 2010. Adaptive traffic light control in wireless sensor network-based intelligent transportation system. In Vehicular Technology Conference.Google ScholarGoogle Scholar

Index Terms

  1. Cheap rendering vs. costly annotation: rendered omnidirectional dataset of vehicles

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      SCCG '14: Proceedings of the 30th Spring Conference on Computer Graphics
      May 2014
      105 pages
      ISBN:9781450330701
      DOI:10.1145/2643188

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 May 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate42of81submissions,52%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader