Skip to main content

Advertisement

Log in

SHORT: Segmented histogram technique for robust real-time object recognition

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Object recognition is a broad area that covers several topics including face recognition, gesture recognition, human gait recognition, traffic road signs recognition, among many others. Object recognition plays a vital role in several real-time applications such as video surveillance, traffic analysis, security systems, and image retrieval. This work introduces a novel, real-time object recognition approach, namely “SHORT”: segmented histogram object recognition technique. “SHORT” implements segmentation technique applied on the histogram of selected vectors of an image to identify similar image(s) in a database. The proposed technique performance was evaluated by means of two different image databases, namely the Yale Faces and Traffic Road Signs. The robustness was also assessed by applying different levels of distortion on both databases using Gaussian noise and blur, and testing distortion impact on recognition rates. Additionally, the efficiency was evaluated by comparing the recognition execution time of the proposed technique with another well-known recognition algorithm called “Eigenfaces”. The experimental results revealed successful recognition on clear and distorted objects. Moreover, “SHORT” performed 4.5X faster than the “Eigenfaces” algorithm under the same conditions. Furthermore, the “SHORT” algorithm was implemented on FPGA hardware by exploiting data parallelism to improve the execution performance. The results showed that the FPGA hardware version is 28X faster than the “Eigenfaces” algorithm, which makes “SHORT” a robust and practical solution for real-time applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Notes

  1. http://www.digilentinc.com/

  2. http://www.vision.caltech.edu/Image_Datasets/Caltech101/Caltech101.html

  3. http://vision.cs.princeton.edu/projects/2010/SUN/

References

  1. Xilinx. Vivado design suite - hlx editions (2016)

  2. Bateux Q, Marchand E (2017) Histograms-based visual servoing. IEEE Robot Autom Lett 2(1):80–87

    Article  Google Scholar 

  3. Beheshti I, Maikusa N, Matsuda H, Demirel H, Anbarjafari G (2017) Histogram-based feature extraction from individual gray matter similarity-matrix for alzheimers disease classification. Journal of Alzheimer’s Disease, (Preprint), pp 1–12

  4. Bonny T, Affan Zidan M, Salama KN (2010) An adaptive hybrid multiprocessor technique for bioinformatics sequence alignment. In: 2010 5th cairo international biomedical engineering conference, pp 112–115

  5. Bonny T, Debsi RA, Majzoub S, Elwakil AS (2019) Hardware optimized fpga implementations of high-speed true random bit generators based on switching-type chaotic oscillators. Circ Syst Signal Process 38(3):1342–1359

    Article  Google Scholar 

  6. Bonny T, Elwakil AS (2018) Fpga realizations of high-speed switching-type chaotic oscillators using compact vhdl codes. Nonlinear Dyn 93(2):819–833

    Article  Google Scholar 

  7. Bonny T, Henno S (2018) Image edge detectors under different noise levels with fpga implementations. J Circ Syst Comput 27(13):1850209

    Article  Google Scholar 

  8. Bonny T, Rabie T, Hafez AHA (2018) Multiple histogram-based face recognition with high speed fpga implementation. Multimedia Tools and Applications

  9. Cha S-H, Srihari SN (2002) On measuring the distance between histograms. Pattern Recogn 35(6):1355–1370

    Article  MATH  Google Scholar 

  10. Cinaroglu I, Bastanlar Y (2016) A direct approach for object detection with catadioptric omnidirectional cameras Signal. Image Video Process 10(2):413–420

    Article  Google Scholar 

  11. Demirel H, Anbarjafari G (2008) Pose invariant face recognition using probability distribution functions in different color channels. IEEE Signal Process Lett 15:537–540

    Article  Google Scholar 

  12. Déniz O, Bueno G, Salido J, De la Torre F (2011) Face recognition using histograms of oriented gradients. Pattern Recogn Lett 32(12):1598–1603

    Article  Google Scholar 

  13. Georgescu D (2011) A real-time face recognition system using eigenfaces. J Mob Embedded Distrib Syst 3(4):193–204

    Google Scholar 

  14. Georghiades A, Belhumeur PN, Kriegman DJ (1997) Yale face database. In: Center for computational Vision and Control at Yale University, pp 2

  15. Gross R, Matthews I, Baker Simon (2004) Appearance-based face recognition and light-fields. IEEE Trans Pattern Anal Mach Intell 26(4):449–465

    Article  Google Scholar 

  16. Inc. Xilinx. AXI Reference Guide, volume 14. Xilinx (2012)

  17. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105

  18. Kshirsagar VP, Baviskar MR, Gaikwad ME (2011) Face recognition using eigenfaces. In: 2011 3rd international conference on computer research and development (ICCRD). IEEE, vol 2, pp 302–306

  19. Li F-F, Fergus R, Perona P (2006) One-shot learning of object categories. IEEE Trans Pattern Anal Mach Intell 28(4):594–611

    Article  Google Scholar 

  20. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  21. Masoud O, Papanikolopoulos NP (2001) A novel method for tracking and counting pedestrians in real-time using a single camera. IEEE Trans Veh Technol 50 (5):1267–1278

    Article  Google Scholar 

  22. Or-El R, Rosman G, Wetzler A, Kimmel R, Bruckstein AM (2015) Rgbd-fusion: Real-time high precision depth recovery. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5407–5416

  23. Rabie T (2017) Training-less color object recognition for autonomous robotics. Inf Sci 418:218–241

    Article  Google Scholar 

  24. Romera-Paredes B, Torr PHS (2015) An embarrassingly simple approach to zero-shot learning. In: ICML, pp 2152–2161

  25. Rothganger F, Lazebnik S, Schmid C, Ponce J (2006) 3d object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints. Int J Comput Vis 66(3):231–259

    Article  Google Scholar 

  26. Schiele B, Crowley JL (2000) Recognition without correspondence using multidimensional receptive field histograms. Int J Comput Vis 36(1):31–50

    Article  Google Scholar 

  27. Sengar SS, Mukhopadhyay S (2017) Moving object detection based on frame difference and w4. SIViP, pp 1–8

  28. Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2013) Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229

  29. Song J, Gao L, Nie F, Shen HT, Yan Y, Sebe N (2016) Optimized graph learning using partial tags and multiple features for image and video annotation. IEEE Trans Image Process 25(11):4999–5011

    Article  MathSciNet  MATH  Google Scholar 

  30. Song J, Guo Y, Gao L, Li X, Hanjalic A, Shen HT (2018) From deterministic to generative: Multi-modal stochastic rnns for video captioning. IEEE Transactions on Neural Networks and Learning Systems

  31. Song J, Zhang H, Li X, Gao L, Wang M, Hong R (2018) Self-supervised video hashing with hierarchical binary auto-encoder. IEEE Trans Image Process 27 (7):3210–3221

    Article  MathSciNet  MATH  Google Scholar 

  32. Sun Z, Bebis G, Miller R (2004) Object detection using feature subset selection. Pattern Recogn 37(11):2165–2176

    Article  Google Scholar 

  33. Swain M, Ballard D (1991) Color Indexing. Inter J Comput Vis 7:11x–32

    Article  Google Scholar 

  34. Tompson J, Stein M, Lecun Y, Perlin K (2014) Real-time continuous pose recovery of human hands using convolutional networks. ACM Trans Graph (TOG) 33 (5):169

    Article  Google Scholar 

  35. Traffic road sign database (2016)

  36. Turk M (2013) Over twenty years of eigenfaces. ACM Trans Multimed Comput Commun Appl (TOMM) 9(1s):45

    Google Scholar 

  37. Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3 (1):71–86

    Article  Google Scholar 

  38. Wang X, Gao L, Wang P, Sun X, Liu X (2018) Two-stream 3-d convnet fusion for action recognition in videos with arbitrary size and length. Trans Multi 20 (3):634–644

    Article  Google Scholar 

  39. Xilinx Inc. 7 Series FPGAs Overview, volume 1. Xilinx (2014)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Talal Bonny.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bonny, T., Rabie, T., Baziyad, M. et al. SHORT: Segmented histogram technique for robust real-time object recognition. Multimed Tools Appl 78, 25781–25806 (2019). https://doi.org/10.1007/s11042-019-07826-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-07826-4

Keywords

Navigation