Abstract
Object recognition is a broad area that covers several topics including face recognition, gesture recognition, human gait recognition, traffic road signs recognition, among many others. Object recognition plays a vital role in several real-time applications such as video surveillance, traffic analysis, security systems, and image retrieval. This work introduces a novel, real-time object recognition approach, namely “SHORT”: segmented histogram object recognition technique. “SHORT” implements segmentation technique applied on the histogram of selected vectors of an image to identify similar image(s) in a database. The proposed technique performance was evaluated by means of two different image databases, namely the Yale Faces and Traffic Road Signs. The robustness was also assessed by applying different levels of distortion on both databases using Gaussian noise and blur, and testing distortion impact on recognition rates. Additionally, the efficiency was evaluated by comparing the recognition execution time of the proposed technique with another well-known recognition algorithm called “Eigenfaces”. The experimental results revealed successful recognition on clear and distorted objects. Moreover, “SHORT” performed 4.5X faster than the “Eigenfaces” algorithm under the same conditions. Furthermore, the “SHORT” algorithm was implemented on FPGA hardware by exploiting data parallelism to improve the execution performance. The results showed that the FPGA hardware version is 28X faster than the “Eigenfaces” algorithm, which makes “SHORT” a robust and practical solution for real-time applications.
















Similar content being viewed by others
References
Xilinx. Vivado design suite - hlx editions (2016)
Bateux Q, Marchand E (2017) Histograms-based visual servoing. IEEE Robot Autom Lett 2(1):80–87
Beheshti I, Maikusa N, Matsuda H, Demirel H, Anbarjafari G (2017) Histogram-based feature extraction from individual gray matter similarity-matrix for alzheimers disease classification. Journal of Alzheimer’s Disease, (Preprint), pp 1–12
Bonny T, Affan Zidan M, Salama KN (2010) An adaptive hybrid multiprocessor technique for bioinformatics sequence alignment. In: 2010 5th cairo international biomedical engineering conference, pp 112–115
Bonny T, Debsi RA, Majzoub S, Elwakil AS (2019) Hardware optimized fpga implementations of high-speed true random bit generators based on switching-type chaotic oscillators. Circ Syst Signal Process 38(3):1342–1359
Bonny T, Elwakil AS (2018) Fpga realizations of high-speed switching-type chaotic oscillators using compact vhdl codes. Nonlinear Dyn 93(2):819–833
Bonny T, Henno S (2018) Image edge detectors under different noise levels with fpga implementations. J Circ Syst Comput 27(13):1850209
Bonny T, Rabie T, Hafez AHA (2018) Multiple histogram-based face recognition with high speed fpga implementation. Multimedia Tools and Applications
Cha S-H, Srihari SN (2002) On measuring the distance between histograms. Pattern Recogn 35(6):1355–1370
Cinaroglu I, Bastanlar Y (2016) A direct approach for object detection with catadioptric omnidirectional cameras Signal. Image Video Process 10(2):413–420
Demirel H, Anbarjafari G (2008) Pose invariant face recognition using probability distribution functions in different color channels. IEEE Signal Process Lett 15:537–540
Déniz O, Bueno G, Salido J, De la Torre F (2011) Face recognition using histograms of oriented gradients. Pattern Recogn Lett 32(12):1598–1603
Georgescu D (2011) A real-time face recognition system using eigenfaces. J Mob Embedded Distrib Syst 3(4):193–204
Georghiades A, Belhumeur PN, Kriegman DJ (1997) Yale face database. In: Center for computational Vision and Control at Yale University, pp 2
Gross R, Matthews I, Baker Simon (2004) Appearance-based face recognition and light-fields. IEEE Trans Pattern Anal Mach Intell 26(4):449–465
Inc. Xilinx. AXI Reference Guide, volume 14. Xilinx (2012)
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Kshirsagar VP, Baviskar MR, Gaikwad ME (2011) Face recognition using eigenfaces. In: 2011 3rd international conference on computer research and development (ICCRD). IEEE, vol 2, pp 302–306
Li F-F, Fergus R, Perona P (2006) One-shot learning of object categories. IEEE Trans Pattern Anal Mach Intell 28(4):594–611
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Masoud O, Papanikolopoulos NP (2001) A novel method for tracking and counting pedestrians in real-time using a single camera. IEEE Trans Veh Technol 50 (5):1267–1278
Or-El R, Rosman G, Wetzler A, Kimmel R, Bruckstein AM (2015) Rgbd-fusion: Real-time high precision depth recovery. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5407–5416
Rabie T (2017) Training-less color object recognition for autonomous robotics. Inf Sci 418:218–241
Romera-Paredes B, Torr PHS (2015) An embarrassingly simple approach to zero-shot learning. In: ICML, pp 2152–2161
Rothganger F, Lazebnik S, Schmid C, Ponce J (2006) 3d object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints. Int J Comput Vis 66(3):231–259
Schiele B, Crowley JL (2000) Recognition without correspondence using multidimensional receptive field histograms. Int J Comput Vis 36(1):31–50
Sengar SS, Mukhopadhyay S (2017) Moving object detection based on frame difference and w4. SIViP, pp 1–8
Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2013) Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229
Song J, Gao L, Nie F, Shen HT, Yan Y, Sebe N (2016) Optimized graph learning using partial tags and multiple features for image and video annotation. IEEE Trans Image Process 25(11):4999–5011
Song J, Guo Y, Gao L, Li X, Hanjalic A, Shen HT (2018) From deterministic to generative: Multi-modal stochastic rnns for video captioning. IEEE Transactions on Neural Networks and Learning Systems
Song J, Zhang H, Li X, Gao L, Wang M, Hong R (2018) Self-supervised video hashing with hierarchical binary auto-encoder. IEEE Trans Image Process 27 (7):3210–3221
Sun Z, Bebis G, Miller R (2004) Object detection using feature subset selection. Pattern Recogn 37(11):2165–2176
Swain M, Ballard D (1991) Color Indexing. Inter J Comput Vis 7:11x–32
Tompson J, Stein M, Lecun Y, Perlin K (2014) Real-time continuous pose recovery of human hands using convolutional networks. ACM Trans Graph (TOG) 33 (5):169
Traffic road sign database (2016)
Turk M (2013) Over twenty years of eigenfaces. ACM Trans Multimed Comput Commun Appl (TOMM) 9(1s):45
Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3 (1):71–86
Wang X, Gao L, Wang P, Sun X, Liu X (2018) Two-stream 3-d convnet fusion for action recognition in videos with arbitrary size and length. Trans Multi 20 (3):634–644
Xilinx Inc. 7 Series FPGAs Overview, volume 1. Xilinx (2014)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bonny, T., Rabie, T., Baziyad, M. et al. SHORT: Segmented histogram technique for robust real-time object recognition. Multimed Tools Appl 78, 25781–25806 (2019). https://doi.org/10.1007/s11042-019-07826-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-07826-4