Skip to main content
Log in

Object detection and recognition using contour based edge detection and fast R-CNN

  • 1220: Visual and Sensory Data Processing for Real Time Intelligent Surveillance System
  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Object detection is a technique of computer vision whose primary intent is to detect objects. The objects can be detected from any image or video feeds. Now a day’s object detection is extensively applied in video surveillance systems, human tracking, and self-driving cars. This paper presented a novel object detection approach that uses only wireframe-based features. The wireframe of the image is identified by using Cellular logical array processing. This technique can determine the visual and geometric features of the image. This paper focuses on a deep neural network framework to detect the target object in the image. Fast R-CNN is used for the detection of objects. The detection speed is fast because only the wireframe of the image is obtained first and then fed into the Fast RCNN model for detection and classification purposes. The performance of the proposed methodology is evaluated on PASCAL VOC, example-based synthesis dataset and real-time dataset. The proposed methodology gives mean average precision (mAP) 89.4%, 91.33% and 88.1% on PASCAL VOC, example-based and real-time dataset. The experimental analysis demonstrated that our proposed detection method achieves better results than the other state of art methods. The approach is helpful to detect the 2D and 3D objects as well.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

Data availability

Data and other related material are available.

Code availability

Implementation of algorithm is available.

References

  1. Ablavatski A, Lu S, Cai J (2017) Enriched deep recurrent visual attention model for multiple object recognition. IEEE Winter Conference on Applications of Computer Vision (WACV), pp 971–978

  2. Alahi A, Goel K, Ramanathan V, Robicquet A, Fei-Fei L, Savarese S (2016) Social LSTM: human trajectory prediction in crowded spaces. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp 961–971

  3. Araki R, Onishi T, Hirakawa T, Yamashita T, Fujiyoshi H (2020) MT-DSSD: deconvolutional single shot detector using multi task learning for object detection, segmentation, and grasping detection. IEEE International Conference on Robotics and Automation (ICRA), pp 10487–10493

  4. Baimukashev D, Zhilisbayev A, Kuzdeuov A, Oleinikov A, Fadeyev D, Makhataeva Z, Varol HA (2015) Deep learning based object recognition using physically-realistic synthetic depth scenes. Mach Learn Knowl Extr 1(3):883–903

    Article  Google Scholar 

  5. Bhuvaneswari R, Subban R (2018) Novel object detection and recognition system based on points of interest selection and SVM classification. Cogn Syst Res 52(1):985–994

    Article  Google Scholar 

  6. Cao C, Liu X, Yang Y et al (2016) Look and think twice: capturing top-down visual attention with feedback convolutional neural networks. IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, pp 2956–2964

  7. Chen C, Wang T, Li D, Hong J (2020) Repetitive assembly action recognition based on object detection and pose estimation. J Manuf Syst 55(1):325–333

    Article  Google Scholar 

  8. Chen X, Li H, Wu Q, Ngan KN, Xu L (2020) High-quality R-CNN object detection using multi-path detection calibration network. IEEE Trans Circuits Syst Video Technol 31:715–727

    Article  Google Scholar 

  9. Condat R, Rogozan A, Bensrhair A (2020) GFD-Retina: gated fusion double RetinaNet for multimodal 2D road object detection. 23rd IEEE International Conference on Intelligent Transportation Systems (ITSC), pp 1–6

  10. Elaraby AF, Hamdy A, Rehan M (2018) A Kinect-based 3D object detection and recognition system with enhanced depth estimation algorithm. IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), pp 247–252

  11. Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multi-scale, deformable part model, vol 2. CVPR, New Jersy, p 7

    Google Scholar 

  12. Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645

    Article  Google Scholar 

  13. Hu Q, Paisitkriangkrai S, Shen C, van den Hengel A, Porikli F (2015) Fast detection of multiple objects in traffic scenes with a common detection framework. IEEE Trans Intell Transp Syst 17(4):1002–1014

    Article  Google Scholar 

  14. Ibrahem H, Salem ADA, Kang H-S (2021) Real-time weakly supervised object detection using center-of-features localization. IEEE Access 9:8742–38756

    Article  Google Scholar 

  15. Jiang M, Deng C, Pan Z, Wang L, Sun X (2018) Multiobject tracking in videos based on LSTM and deep reinforcement learning. Complexity 2018:1–12

    Google Scholar 

  16. Kim JU, Ro YM (2019) Attentive layer separation for object classification and object localization in object detection. IEEE International Conference on Image Processing (ICIP), pp 3995–3999

  17. Min W, Zhang Y, Li J, Xu S (2018) Recognition of pedestrian activity based on dropped object detection. Signal Process 144(1):238–252

    Article  Google Scholar 

  18. Raghunandan A, Mohana, Raghav P, Aradhya HVR (2018) Object detection algorithms for video surveillance applications. 2018 International Conference on Communication and Signal Processing (ICCSP), pp 0563–0568

  19. Raja R, Kumar S, Mahmood MR (2020) Color Object Detection Based Image Retrieval using ROI Segmentation with Multi-Feature Method. Wirel Pers Commun 112:169–192

    Article  Google Scholar 

  20. Ramík DM (2014) A machine learning based intelligent vision system for autonomous object detection and recognition. Appl Intell 40(2):358–375

    Article  Google Scholar 

  21. Rani S, Lakhwani K, Kumar S (2021) Three dimensional wireframe model of medical and complex images using cellular logic array processing techniques. Proceedings of the 12th International Conference on Soft Computing and Pattern Recognition, Advances in Intelligent Systems and Computing, pp 1–11

  22. Sharma V, Mir RN (2019) Saliency guided faster-RCNN (SGFr-RCNN) model for object detection and recognition. J King Saud Univ Comput Inform Sci 1:1–13

    Google Scholar 

  23. Soebhakti H, Prayoga S, Fatekha RA, Fashla MB (2019) The real-time object detection system on mobile soccer robot using YOLO v3. 2nd IEEE International Conference on Applied Engineering (ICAE), pp 1–6

  24. Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vision 57(2):137–154

    Article  Google Scholar 

  25. Yang Y, Raman D (2011) Articulated pose estimation with flexible mixtures of parts. CVPR, New Jersy, pp 1385–1392

    Google Scholar 

  26. Yang X, Wu T, Zhang L, Yang D, Wang N, Song B, Gao X (2018) CNN with Spatio-temporal information for fast suspicious object detection and recognition in THz security images. Signal Process 160(1):202–214

    Google Scholar 

  27. Yang X, Wu T, Zhang L, Yang D, Wang N, Song B, Gao X (2019) CNN with Spatio-temporal information for fast suspicious object detection and recognition in THz security images. Signal Process 1:202–214

    Article  Google Scholar 

  28. Yao C, Kong Y, Feng L, Jin B, Si H (2020) Contour-aware recurrent cross constraint network for salient object detection. IEEE Access 8(1):218739–218751

    Article  Google Scholar 

  29. Yao X, Feng X, Han J, Cheng G, Guo L (2020) Automatic weakly supervised object detection from high spatial resolution remote sensing images via dynamic curriculum learning. IEEE Trans Geosci Remote Sens 59(1):675–685

    Article  Google Scholar 

  30. Zhang J, Chen S, Hou Y (2020) Accurate object detection with relation module on improved R-FCN. IEEE International Conference on Chinese Automation Congress (CAC), pp 7131–7135

  31. Zhang S, Wang C, Chan SC, Wei X, Ho CH (2014) New object detection, tracking, and recognition approach for video surveillance over camera network. IEEE Sens J 15(5):2679–2691

    Article  Google Scholar 

  32. Zhang S, Wen L, Bian X, Lei Z, Li SZ (2018) Single-shot refinement neural network for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4203–4212

  33. Zhao S, Liu Y, Han Y, Hong R, Hu Q, Tian Q (2018) Pooling the convolutional layers in deep ConvNets for video action recognition. IEEE Trans Circuits Syst Video Technol 28(8):1–14

    Article  Google Scholar 

  34. Zhu K, Chen Z, Peng Y, Zhang L (2019) Mobile edge assisted literal multi-dimensional anomaly detection of in-vehicle network using LSTM. IEEE Trans Veh Technol 68(5):4275–4284

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shilpa Rani.

Ethics declarations

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Consent to participate

We are voluntarily agreed to participate in this research study.

Consent for publication

I Shilpa Rani give my consent for information about myself to be published in Journal.

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rani, S., Ghai, D. & Kumar, S. Object detection and recognition using contour based edge detection and fast R-CNN. Multimed Tools Appl 81, 42183–42207 (2022). https://doi.org/10.1007/s11042-021-11446-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-11446-2

Keywords

Navigation