Skip to main content
Log in

Automated image and video object detection based on hybrid heuristic-based U-net segmentation and faster region-convolutional neural network-enabled learning

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Object detection is one of the major areas of computer vision, which adopts machine learning approaches in diverse contributions. Nowadays, the machine learning field has been directed through Deep Neural Networks (DNNs) that takes eminent features of progressions in data availability and computing power. In all the cases, the quality of images and videos are biased and noisy, and thus, the distributions of data are also considered as imbalanced and disturbed. Different techniques are developed for solving the abovementioned challenges, which are mostly considered based on deep learning and computer vision. Though, traditional algorithms constantly offer poor detection for dense and small objects and yet fail the detection of objects through random geometric transformations. One of the categories of deep learning called Convolutional Neural Network (CNN) is famous and well-matched method for image-related tasks, in which the network is trained for discovering the numerous features like colour differences, corners, and edges in the images and videos that are combined into more complex shapes. This proposal intends to develop improved object detection in images and videos with the advancements of deep learning models. The three main phases of the proposed object detection model are (a) pre-processing, (b) segmentation, and (c) detection. Once the pre-processing of the image is performed by median filtering approach, the adaptive U-Net segmentation is performed for the object segmentation using the newly proposed Sun Flower-Deer Hunting Optimization Algorithm (SF-DHOA). The maximization of segmentation accuracy and dice coefficient is considered as the main objective of the proposed segmentation. The hybrid meta-heuristic algorithm termed SF-DHOA is proposed with Sun Flower Optimization (SFO) and Deer Hunting Optimization Algorithm (DHOA), which is used for optimally tuning the U-Net by optimizing the encoder depth and the number of epoch. Further, the detection is performed by the modified Faster Region-Convolutional Neural Network (Faster-RCNN), in which the optimization of number of epoch is performed by hybrid SF-DHOA algorithm with the intention of minimizing the error and training loss function. The performance of the proposed algorithm is evaluated, and the proposed algorithm shows high improvement when compared to existing deep learning-based algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Azzam R, Kemouche MS, Aouf N, Richardson M (2016) Efficient visual object detection with spatially global gaussian mixture models and uncertainties. J Vis Commun Image Represent 36:90–106

    Article  Google Scholar 

  2. Bonyadi MR, Michalewicz Z (2016) Analysis of stability, local convergence, and transformation sensitivity of a variant of the particle swarm optimization algorithm. IEEE Trans Evol Comput 20(3):370–385

    Article  Google Scholar 

  3. Bouwmans T (2014) Traditional and recent approaches in background modeling for foreground detection: An overview. Comput Sci Rev 11–12:31–66

    Article  MATH  Google Scholar 

  4. Brammya G, Praveena S, Ninu Preetha NS, Ramya R, Rajakumar BR, Binu D Deer hunting optimization algorithm: a new nature-inspired Meta-heuristic paradigm. 24 May 2019.

  5. Cao W, Yuan J, He Z, Zhang Z, He Z (2018) Fast deep neural networks with knowledge guided training and predicted regions of interests for real-time video object detection. IEEE Access 6:8990–8999

    Article  Google Scholar 

  6. Cuevas C, Yáñez EM, García N (2016) Labeled dataset for integral evaluation of moving object detection algorithms: LASIESTA. Comput Vis Image Understand 152:103–117

    Article  Google Scholar 

  7. Dai J, Li Y, He K, Sun J (2016) R-FCN: object detection via region-based fully convolutional networks

  8. Fan M, Li Y, Zheng S, Peng W, Tang W, Li L (2019) Computer-aided detection of mass in digital breast tomosynthesis using a faster region-based convolutional neural network. Methods 166:103–111

    Article  Google Scholar 

  9. Felzenszwalb PF, Girshick RB, Mcallester D, Ramanan D (2009) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1–20

    Google Scholar 

  10. Gomes GF, da Cunha SS Jr, Ancelotti AC Jr (2019) A sunflower optimization (SFO) algorithm applied to damage identification on laminated composite plates. Eng Comput 35:619–626

    Article  Google Scholar 

  11. Goyette N, Jodoin P-M, Porikli F, Konrad J, Ishwar P (2014) A novel video dataset for change detection benchmarking. IEEE Trans Image Process 23(11):4663–4679

    Article  MathSciNet  MATH  Google Scholar 

  12. Hambarde P, Talbar S, Mahajan A, Chavan S, Thakur M, Sable N (2020) Prostate lesion segmentation in MR images using radiomics based deeply supervised U-net. Biocybern Biomed Eng 40(4):1421–1435

    Article  Google Scholar 

  13. Han J, Zhang D, Cheng G, Liu N, Xu D (2018) Advanced deep-learning techniques for salient and category-specific object detection: a survey. IEEE Signal Process Mag 35(1):84–100

    Article  Google Scholar 

  14. Hu W-C, Chen C-H, Chen T-Y, Huang D-Y, Wu Z-C (2015) Moving object detection and tracking from video captured by moving camera. J Vis Commun Image Represent 30:164–180

    Article  Google Scholar 

  15. Hu Q, Paisitkriangkrai S, Shen C, van den Hengel A, Porikli F (2016) Fast detection of multiple objects in traffic scenes with a common detection framework. IEEE Trans Intell Transp Syst 17(4):1002–1014

    Article  Google Scholar 

  16. Hu Z, Yang D, Zhang K, Chen Z (2020) Object tracking in satellite videos based on convolutional regression network with appearance and motion features. IEEE J Sel Top Appl Earth Obs Remote Sens 13:783–793

    Article  Google Scholar 

  17. Huang L, Yan P, Li G, Wang Q, Lin L (2019) Attention embedded Spatio-temporal network for video salient object detection. IEEE Access 7:166203–166213

    Article  Google Scholar 

  18. Kim J-Y, Ha J-E (2020) Foreground Objects Detection Using a Fully Convolutional Network With a Background Model Image and Multiple Original Images. IEEE Access 8:159864–159878

    Article  Google Scholar 

  19. Kim JH, Kim B, Roy PP, Jeong D (2019) Efficient Facial Expression Recognition Algorithm Based on Hierarchical Deep Neural Network Structure. IEEE Access 7:41273–41285

    Article  Google Scholar 

  20. Manne R, Kantheti S, Kantheti S (2020) Classification of Skin cancer using deep learning,Convolutional Neural Networks -Opportunities and vulnerabilities. Int J Mod Trends Sci Technol 6(11):101–108

    Article  Google Scholar 

  21. Marsaline Beno M, Valarmathi IR, Swamy SM, Rajakumar BR (2014) Threshold prediction for segmenting tumour from brain MRI scans. Int J Imaging Syst Technol 24(2):129–137

    Article  Google Scholar 

  22. Murthy MYB, Koteswararao A, Babu MS (2021) Adaptive fuzzy deformable fusion and optimized CNN with ensemble classification for automated brain tumor diagnosis. Biomed Eng Lett

  23. Nirmala Sreedharan NP, Ganesan B, Raveendran R, Sarala P, Dennis B, Boothalingam R (2018) Grey Wolf optimisation-based feature selection and classification for facial emotion recognition. IET Biometrics 7(5):490–499

    Article  Google Scholar 

  24. Patil PW, Murala S (2019) MSFgNet: a novel compact end-to-end deep network for moving object detection. IEEE Trans Intell Transp Syst 20(11):4066–4077

    Article  Google Scholar 

  25. Reddy V, Sanderson C, Lovell BC (2013) Improved foreground detection via block-based classifier cascade with probabilistic decision integration. IEEE Trans Circuits Syst Video Technol 23(1):83–93

    Article  Google Scholar 

  26. Rhee PK, Erdenee E, Kyun SD, Ahmed MU, Jin S (2017) Active and semi-supervised learning for object detection with imperfect data. Cogn Syst Res 45:109–123

    Article  Google Scholar 

  27. Rodriguez-Ramos A, Rodriguez-Vazquez J, Sampedro C, Campoy P (2020) Adaptive Inattentional framework for video object detection with reward-conditional training. IEEE Access 8:124451–124466

    Article  Google Scholar 

  28. St-Charles P-L, Bilodeau G-A, Bergevin R (2015) SuBSENSE:Auniversal change detection method with local adaptive sensitivity. IEEE Trans Image Process 24(1):359–373

    Article  MathSciNet  MATH  Google Scholar 

  29. Tsang S, Kao B, Yip KY, Ho W, Lee SD (2011) Decision trees for uncertain data. IEEE Trans Knowl Data Eng 23(1):64–78

    Article  Google Scholar 

  30. Uçar A, Demir Y, Güzeliş C (2017) Object recognition and detection with deep learning for autonomous driving applications. Simulation 93(9):759–769

    Article  Google Scholar 

  31. Unnisa N, Tatineni M (2021) Adaptive deep learning strategy with Red Deer algorithm for Sparse Channel estimation and hybrid precoding in millimeter wave massive MIMO-OFDM systems. Wirel Pers Commun 122:3019–3051

    Article  Google Scholar 

  32. Varadarajan S, Miller P, Zhou H (2015) Region-based mixture of gaussians modelling for foreground detection in dynamic scenes. Pattern Recogn 48(11):3488–3503

    Article  MATH  Google Scholar 

  33. Wang S, Pan H, Zhang C, Tian Y (2014) Rgb-d image-based detection of stairs, pedestrian crosswalks and traffic signs. J Vis Commun Image Represent 25(2):263–272

    Article  Google Scholar 

  34. Wang K, Lin L, Yan X, Chen Z, Zhang D, Zhang L (2018) Cost-Effective Object Detection: Active Sample Mining With Switchable Selection Criteria. IEEE Trans Neural Networks Learn Syst PP:1–17

    Google Scholar 

  35. Wu J, Yang H (2015) Linear regression-based efficient SVM learning for large-scale classification. IEEE Trans Neural Netw Learn Syst 26(10):2357–2369

    Article  MathSciNet  Google Scholar 

  36. Yousif H, Yuan J, Kays R, He Z (2018) Object detection from dynamic scene using joint background modeling and fast deep learning classification. J Vis Commun Image Represent 55:802–815

    Article  Google Scholar 

  37. Yu H, Guo D, Yan Z, Fud L, Simmons J, Przybyla CP, Wang S (2020) Weakly supervised easy-to-hard learning for object detection in image sequences. Neurocomputing 398:71–82

    Article  Google Scholar 

  38. Zhang C, Kim J (2020) Video object detection with two-path convolutional LSTM pyramid. IEEE Access 8:151681–151691

    Article  Google Scholar 

  39. Zhang Z, He Z, Cao G, Cao W (2016) Animal detection from highly cluttered natural scenes using spatiotemporal object region proposals and patch verification. IEEE Trans Multimed 18(10):2079–2092

    Article  Google Scholar 

  40. Zhao W, Ma W, Jiao L, Chen P, Yang S, Hou B (2019) Multi-Scale Image Block-Level F-CNN for Remote Sensing Images Object Detection. IEEE Access 7:43607–43621

    Article  Google Scholar 

  41. Zhu Y, Huang C (2012) An improved median filtering algorithm for image noise reduction. Phys Procedia 25:609–616

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rajashekar Reddy Palle.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This paper does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Palle, R.R., Boda, R. Automated image and video object detection based on hybrid heuristic-based U-net segmentation and faster region-convolutional neural network-enabled learning. Multimed Tools Appl 82, 3459–3484 (2023). https://doi.org/10.1007/s11042-022-13216-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-13216-0

Keywords

Navigation