Skip to main content

Advertisement

Log in

Multiple human tracking in drone image

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Object tracking, especially human tracking is one of the challenging research problems in computer vision. Although the performance has gained some positive changes recently, there is still room for improvement. In this paper, we introduce an approach for human detection and tracking using Convolution Neural Network (CNN) and Hungarian Algorithm (HA). A CNN is used to localize multiple human beings from frame to frame in a video stream. This deep CNN is known as Faster R-CNN which achieved the state of the art performance in object detection problem. In the tracking process, we solve the data association problem in visual tracking using HA. A detected person will be assigned to a tracklet based on the data distribution in the video frame. The experimental results show that our system can deal with the videos captured from different scenarios in near real-time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Babenko B, Yang MH, Belongie S (2011) Robust object tracking with online multiple instance learning. IEEE Trans Pattern Anal Mach Intell 33(8):1619–1632

    Article  Google Scholar 

  2. Broida TJ, Chellappa R (1986) Estimation of object motion parameters from noisy images. IEEE Trans Pattern Anal Mach Intell (PAMI) PAMI-8(1):90–99

    Article  Google Scholar 

  3. Del Moral P (1996) Nonlinear filtering: interacting particle solution. Markov Process Relat Fields 2(4):555–580

    MathSciNet  MATH  Google Scholar 

  4. Girshick R (2015) Fast R-CNN. In: IEEE International Conference on Computer Vision (ICCV)

  5. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  6. Grest D, Koch R (2004) Realtime multi-camera person tracking for immersive environments. In: IEEE 6th workshop on multimedia signal processing, pp 387–390

  7. Hare S, Saffari A, Torr PHS (2011) Struck: structured output tracking with Kernels. In: International Conference on Computer Vision (ICCV)

  8. Hare S, Golodetz S, Saffari A (2016) Struck: structured output tracking with kernels. IEEE Trans Pattern Anal Mach Intell (PAMI) 38(10):2096–2109

    Article  Google Scholar 

  9. Harville M, Gordon G, Woodfill J (2001) Foreground segmentation using adaptive mixture models in color and depth. In: IEEE workshop on detection and recognition of events in video, pp 3–11

  10. Henriques JF, Caseiro R, Martins P, Batista J (2012) Exploiting the circulant structure of tracking-by-detection with Kernels. In: ECCV, pp 702–715

  11. Jia X, Lu H, Yang M-H (2012) Visual tracking via adaptive structural local sparse appearance model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  12. Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick R, Guadarrama S, Darrell T (2014) Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM international conference on multimedia, pp 675–678

  13. Kristan M, Matas J, Leonardis A, Vojir T, Pflugfelder R, Fernandez G, Nebehay G, Porikli F, Cehovin L (2016) A novel performance evaluation methodology for single-target trackers. In: IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)

  14. Margrit B, Zheng W (2016) Data association for multi-object visual tracking. Synth Lect Comput Vis 6:1–120

    Google Scholar 

  15. Munkres J (1957) Algorithms for assignment and transportation problems. J Soc Ind Appl Math 5(1):32–38

    Article  MathSciNet  Google Scholar 

  16. Munoz-Salinas R, Aguirre E, Garcia-Silvente M (2007) People detection and tracking using stereo vision and color. Image Vis Comput 25(6):995–1007

    Article  Google Scholar 

  17. Nguyen HT, Smeulders AW (2006) Robust tracking using foreground-background texture discrimination. Int J Comput Vis 68(3):277–294

    Article  Google Scholar 

  18. Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems. MIT Press, Cambridge, pp 91–99

  19. Wu Y, Lim J, Yang M-H (2013) Online object tracking: a benchmark. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  20. Zhang S, Benenson R, Omran M, Hosang J, Schiele B (2016) How far are we from solving pedestrian detection? In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1259–1267

  21. Zhong W, Lu H, Yang M-H (2012) Robust object tracking via sparsity-based collaborative model. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Download references

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (NRF-2018R1A1A1A05022526), (NRF-2017R1A4A1015559) and (NRF-2015R1D1A3A01019642).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to In Seop Na.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nguyen, H.D., Na, I.S., Kim, S.H. et al. Multiple human tracking in drone image. Multimed Tools Appl 78, 4563–4577 (2019). https://doi.org/10.1007/s11042-018-6141-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-018-6141-z

Keywords

Navigation