skip to main content
10.1145/2425836.2425912acmotherconferencesArticle/Chapter ViewAbstractPublication PagesivcnzConference Proceedingsconference-collections
poster

3D object tracking with a high-resolution GPU based real-time stereo

Authors Info & Claims
Published:26 November 2012Publication History

ABSTRACT

Stereo correspondence algorithms, which are fast enough for real-time use, require hardware assistance and inevitably trade some matching accuracy for speed. A cloud of 3D points thus produced by our previously reported GPU accelerated implementation of a dynamic programming correspondence algorithm is noisy and contains artifacts, which hinder tracking accuracy. We have augmented this implementation with modules for re-projection and filtering. A fast clustering procedure based upon a set of simple volume rules identifies candidate objects. An opportunistic tagging system tracks objects through occlusions. Kalman filtering predicts positions in the next frame. These steps reduce the effects of dynamic programming streaks in the depth maps. Experiments with synthetic and real-world video sequences confirmed the accuracy in tracking multiple objects (e.g. humans) in various environments.

References

  1. S. Asano, T. Maruyama, and Y. Yamaguchi. Performance comparison of FPGA, GPU and CPU in image processing. In Proc. Int. Conf. on Field Programmable Logic and Applications, pages 126--131, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  2. L. Cai, L. He, Y. Xu, Y. Zhao, and X. Yang. Multi-object detection and tracking by stereo vision. Pattern Recognition, 43(12): 4028--4041, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Darrell, G. Gordon, M. Harville, and J. Woodfill. Integrated person tracking using stereo, color, and pattern detection. International Journal of Computer Vision, 37(2): 175--185, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. G. Gimel'farb. Probabilistic regularisation and symmetry in binocular dynamic programming stereo. Pattern Recognition Letters, 23(4): 431--442, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. L. Gong and Y. H. Yang. Real-time stereo matching using orthogonal reliability-based dynamic programming. IEEE Trans. on Image Processing, 16(3): 879--884, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Harville. Stereo person tracking with adaptive plan-view templates of height and occupancy statistics. Image and Vision Computing, 22(2): 127--142, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  7. M. Himmelsbach, A. Müller, T. Lüttel, and H. Wünsche. Lidar-based 3d object perception. In Proceedings of 1st International Workshop on Cognition for Technical Systems, 2008.Google ScholarGoogle Scholar
  8. R. Kalarot and J. Morris. Comparison of FPGA and GPU implementations of real-time stereo vision. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on, pages 9--15. IEEE, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  9. R. Kalarot and J. Morris. Implementation of symmetric dynamic programming stereo matching algorithm using CUDA. In Proc. 16th Korea-Japan Joint Workshop on Frontiers of Computer Vision. FCV, 2010.Google ScholarGoogle Scholar
  10. R. Kalarot, J. Morris, D. Berry, and J. Dunning. Analysis of real-time stereo vision algorithms on GPU. In International Conference Image and Vision Computing New Zealand (IVCNZ), pages 179--184, 2011.Google ScholarGoogle Scholar
  11. R. Kalarot, J. Morris, and G. Gimel'farb. Performance analysis of multi-resolution symmetric dynamic programming stereo on GPU. In 25th International Conference Image and Vision Computing New Zealand (IVCNZ), pages 1--7. IEEE, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  12. K. Khoshelham and S. Elberink. Accuracy and resolution of kinect depth data for indoor mapping applications. Sensors, 12(2): 1437--1454, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  13. K. Lee, B. Kalyan, S. Wijesoma, M. Adams, F. Hover, and N. Patrikalakis. Tracking random finite objects using 3d-lidar in marine environments. In Proceedings of the 2010 ACM Symposium on Applied Computing, pages 1282--1287. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. V. Lepetit and P. Fua. Monocular model-based 3D tracking of rigid objects, volume 1. Now Publishers Inc., Hanover, MA, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Muñoz-Salinas, M. García-Silvente, and R. Medina Carnicer. Adaptive multi-modal stereo people tracking without background modelling. Journal of Visual Communication and Image Representation, 19(2): 75--91, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Muñoz-Salinas, R. Medina-Carnicer, F. Madrid-Cuevas, and A. Carmona-Poyato. People detection and tracking with multiple stereo cameras using particle filters. Journal of Visual Communication and Image Representation, 20(5): 339--350, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Obdržálek, G. Kurillo, J. Han, T. Abresch, R. Bajcsy, et al. Real-time human pose detection and tracking for tele-rehabilitation in virtual reality. Studies in Health Technology and Informatics, 173: 320, 2012.Google ScholarGoogle Scholar
  18. S. Park and H. Jeong. Real-time stereo vision fpga chip with low error rate. In Proc. Int. Conf. on Multimedia and Ubiquitous Engineering, pages 751--756, Washington, DC, USA, 2007. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. L. Spinello, M. Luber, and K. Arras. Tracking people in 3d using a bottom-up top-down detector. In 2011 IEEE International Conference on Robotics and Automation (ICRA), pages 1304--1310. IEEE, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  20. A. Yilmaz, O. Javed, and M. Shah. Object tracking: A survey. ACM Computing Surveys (CSUR), 38(4), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. 3D object tracking with a high-resolution GPU based real-time stereo

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Other conferences
              IVCNZ '12: Proceedings of the 27th Conference on Image and Vision Computing New Zealand
              November 2012
              547 pages
              ISBN:9781450314732
              DOI:10.1145/2425836

              Copyright © 2012 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 26 November 2012

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • poster

              Acceptance Rates

              Overall Acceptance Rate55of74submissions,74%
            • Article Metrics

              • Downloads (Last 12 months)0
              • Downloads (Last 6 weeks)0

              Other Metrics

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader