Abstract
We consider the problem of rapidly detecting objects in static images or videos. The task consists in locating and identifying objects of interest. With the progress of affordable high computing hardware, we propose to analyse and evaluate the deformable part model on the Graphics Processing Unit. We do not take any prior assumptions on the scene and location of the objects. We provide a fast implementation and analyse the different modules of the state-of-the-art detector. Our implementation allows to accelerate both training and testing. While maintaining comparable classification performance, we report a speed-up of \(\times\)10.6 using a standard GPU card compared to a baseline implemented in C++ on a single core and \(\times\)5 compared to a multi-core OpenMP (8 threads) implementation.
Similar content being viewed by others
References
Viola, P., Jones, M., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: International Conference on Computer Vision, IEEE Press (2003)
Ess, A., Leibe, B., Schindler, K., van Gool, L.: A mobile vision system for robust multi-person tracking. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’08), IEEE Press (2008)
Gavrila, D.M., Munder, S.: Multi-cue pedestrian detection and tracking from a moving vehicle. Int J Comput Vision 73(1):41–59. doi:10.1007/s11263-006-9038-7 (2007)
Chum, O., Perdoch, M., Matas, J.: Geometric min-hashing: finding a (thick) needle in a haystack. In: Computer Vision and Pattern Recognition, pp. 17–24 (2009)
Letessier, P., Buisson, O., Joly, A., Boujemaa, N.: Scalable mining of small visual objects. In: ACM Multimedia (2012)
Viola, P.A., Jones, M.J.: Rapid object detection using a boosted cascade of simple features. In: Computer Vision and Pattern Recognition, IEEE Computer Society (2001)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Schmid, C., Soatto, S., Tomasi, C. (eds.) International Conference on Computer Vision & Pattern Recognition, vol. 2, pp. 886–893 (2005)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010a)
Cho, H., Rybski, P.E., Bar-Hillel, A., Zhang, W.: Real-time pedestrian detection with deformable part models. In: Intelligent Vehicles Symposium (2012)
Sudowe, P., Leibe, B. Efficient use of geometric constraints for sliding-window object detection in video. In: International Conference on Computer Vision Systems (ICVS’11) (2011)
Pedersoli, M., Gonzalez, J., Hu, X., Roca, X.: Towards a real-time pedestrian detection based only on vision. J. Intell. Transp. Syst. (2012)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A.: Cascade object detection with deformable part models. In: CVPR (2010)
Lampert, C.H., Blaschko, M.B., Hofmann, T.: Efficient subwindow search: a branch and bound framework for object localization. IEEE Trans Pattern Anal Mach Intell (2009)
Alexe, B., Deselaers, T., Ferrari, V.: What is an object? In: CVPR (2010)
Dollár, P., Belongie, S., Perona, P. The fastest pedestrian detector in the west. In: BMVC (2010)
Prisacariu, V., Reid, I.: fastHOG—a real-time GPU implementation of HOG. Department of Engineering Science, Oxford University, Tech. rep. (2009)
Benenson, R., Mathias, M., Timofte, R., Van Gool, L. Pedestrian detection at 100 frames per second. In: CVPR (2012)
Song, H.O., Zickler, S., Althoff, T., Girshick, R., Fritz, M., Geyer, C., Felzenszwalb, P., Darrell, T.: Sparselet models for efficient multiclass object detection. In: European Conference on Computer Vision (ECCV) (2012)
Cuda, C.: Best practices guide. http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html (2013)
Nvidia cuda home page. http://www.nvidia.com/object/cuda_home_new.html (2013)
Opencv home page. http://opencv.org/ (2013)
Danielsson, P.E.: Euclidean distance mapping 14(3), 227–248 (1980)
Grevera, G.J.: Distance transform algorithms and their implementation and evaluation. In: Deformable Models, pp 33–60 (2007)
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A. The PASCAL visual object classes challenge 2007 (VOC2007) Results. http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html (2007)
Prisacariu, V., Reid, I. fastHOG—a real-time GPU implementation of HOG. Tech. Rep. 2310/09. Department of Engineering Science, Oxford University (2009)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gadeski, E., Fard, H.O. & Le Borgne, H. GPU deformable part model for object recognition. J Real-Time Image Proc 14, 279–291 (2018). https://doi.org/10.1007/s11554-014-0447-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-014-0447-5