Automated image and video object detection based on hybrid heuristic-based U-net segmentation and faster region-convolutional neural network-enabled learning

Palle, Rajashekar Reddy; Boda, Ravi

doi:10.1007/s11042-022-13216-0

Automated image and video object detection based on hybrid heuristic-based U-net segmentation and faster region-convolutional neural network-enabled learning

Published: 07 July 2022

Volume 82, pages 3459–3484, (2023)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Rajashekar Reddy Palle¹ &
Ravi Boda¹

435 Accesses
1 Altmetric
Explore all metrics

Abstract

Object detection is one of the major areas of computer vision, which adopts machine learning approaches in diverse contributions. Nowadays, the machine learning field has been directed through Deep Neural Networks (DNNs) that takes eminent features of progressions in data availability and computing power. In all the cases, the quality of images and videos are biased and noisy, and thus, the distributions of data are also considered as imbalanced and disturbed. Different techniques are developed for solving the abovementioned challenges, which are mostly considered based on deep learning and computer vision. Though, traditional algorithms constantly offer poor detection for dense and small objects and yet fail the detection of objects through random geometric transformations. One of the categories of deep learning called Convolutional Neural Network (CNN) is famous and well-matched method for image-related tasks, in which the network is trained for discovering the numerous features like colour differences, corners, and edges in the images and videos that are combined into more complex shapes. This proposal intends to develop improved object detection in images and videos with the advancements of deep learning models. The three main phases of the proposed object detection model are (a) pre-processing, (b) segmentation, and (c) detection. Once the pre-processing of the image is performed by median filtering approach, the adaptive U-Net segmentation is performed for the object segmentation using the newly proposed Sun Flower-Deer Hunting Optimization Algorithm (SF-DHOA). The maximization of segmentation accuracy and dice coefficient is considered as the main objective of the proposed segmentation. The hybrid meta-heuristic algorithm termed SF-DHOA is proposed with Sun Flower Optimization (SFO) and Deer Hunting Optimization Algorithm (DHOA), which is used for optimally tuning the U-Net by optimizing the encoder depth and the number of epoch. Further, the detection is performed by the modified Faster Region-Convolutional Neural Network (Faster-RCNN), in which the optimization of number of epoch is performed by hybrid SF-DHOA algorithm with the intention of minimizing the error and training loss function. The performance of the proposed algorithm is evaluated, and the proposed algorithm shows high improvement when compared to existing deep learning-based algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hybrid Metaheuristic-Based Thresholding and Faster Region-Convolutional Neural Network for Object Detection in Images

Human object detection: An enhanced black widow optimization algorithm with deep convolution neural network

Article 02 August 2021

Tools, techniques, datasets and application areas for object detection in an image: a review

Article 23 April 2022

References

Azzam R, Kemouche MS, Aouf N, Richardson M (2016) Efficient visual object detection with spatially global gaussian mixture models and uncertainties. J Vis Commun Image Represent 36:90–106
Article Google Scholar
Bonyadi MR, Michalewicz Z (2016) Analysis of stability, local convergence, and transformation sensitivity of a variant of the particle swarm optimization algorithm. IEEE Trans Evol Comput 20(3):370–385
Article Google Scholar
Bouwmans T (2014) Traditional and recent approaches in background modeling for foreground detection: An overview. Comput Sci Rev 11–12:31–66
Article MATH Google Scholar
Brammya G, Praveena S, Ninu Preetha NS, Ramya R, Rajakumar BR, Binu D Deer hunting optimization algorithm: a new nature-inspired Meta-heuristic paradigm. 24 May 2019.
Cao W, Yuan J, He Z, Zhang Z, He Z (2018) Fast deep neural networks with knowledge guided training and predicted regions of interests for real-time video object detection. IEEE Access 6:8990–8999
Article Google Scholar
Cuevas C, Yáñez EM, García N (2016) Labeled dataset for integral evaluation of moving object detection algorithms: LASIESTA. Comput Vis Image Understand 152:103–117
Article Google Scholar
Dai J, Li Y, He K, Sun J (2016) R-FCN: object detection via region-based fully convolutional networks
Fan M, Li Y, Zheng S, Peng W, Tang W, Li L (2019) Computer-aided detection of mass in digital breast tomosynthesis using a faster region-based convolutional neural network. Methods 166:103–111
Article Google Scholar
Felzenszwalb PF, Girshick RB, Mcallester D, Ramanan D (2009) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1–20
Google Scholar
Gomes GF, da Cunha SS Jr, Ancelotti AC Jr (2019) A sunflower optimization (SFO) algorithm applied to damage identification on laminated composite plates. Eng Comput 35:619–626
Article Google Scholar
Goyette N, Jodoin P-M, Porikli F, Konrad J, Ishwar P (2014) A novel video dataset for change detection benchmarking. IEEE Trans Image Process 23(11):4663–4679
Article MathSciNet MATH Google Scholar
Hambarde P, Talbar S, Mahajan A, Chavan S, Thakur M, Sable N (2020) Prostate lesion segmentation in MR images using radiomics based deeply supervised U-net. Biocybern Biomed Eng 40(4):1421–1435
Article Google Scholar
Han J, Zhang D, Cheng G, Liu N, Xu D (2018) Advanced deep-learning techniques for salient and category-specific object detection: a survey. IEEE Signal Process Mag 35(1):84–100
Article Google Scholar
Hu W-C, Chen C-H, Chen T-Y, Huang D-Y, Wu Z-C (2015) Moving object detection and tracking from video captured by moving camera. J Vis Commun Image Represent 30:164–180
Article Google Scholar
Hu Q, Paisitkriangkrai S, Shen C, van den Hengel A, Porikli F (2016) Fast detection of multiple objects in traffic scenes with a common detection framework. IEEE Trans Intell Transp Syst 17(4):1002–1014
Article Google Scholar
Hu Z, Yang D, Zhang K, Chen Z (2020) Object tracking in satellite videos based on convolutional regression network with appearance and motion features. IEEE J Sel Top Appl Earth Obs Remote Sens 13:783–793
Article Google Scholar
Huang L, Yan P, Li G, Wang Q, Lin L (2019) Attention embedded Spatio-temporal network for video salient object detection. IEEE Access 7:166203–166213
Article Google Scholar
Kim J-Y, Ha J-E (2020) Foreground Objects Detection Using a Fully Convolutional Network With a Background Model Image and Multiple Original Images. IEEE Access 8:159864–159878
Article Google Scholar
Kim JH, Kim B, Roy PP, Jeong D (2019) Efficient Facial Expression Recognition Algorithm Based on Hierarchical Deep Neural Network Structure. IEEE Access 7:41273–41285
Article Google Scholar
Manne R, Kantheti S, Kantheti S (2020) Classification of Skin cancer using deep learning,Convolutional Neural Networks -Opportunities and vulnerabilities. Int J Mod Trends Sci Technol 6(11):101–108
Article Google Scholar
Marsaline Beno M, Valarmathi IR, Swamy SM, Rajakumar BR (2014) Threshold prediction for segmenting tumour from brain MRI scans. Int J Imaging Syst Technol 24(2):129–137
Article Google Scholar
Murthy MYB, Koteswararao A, Babu MS (2021) Adaptive fuzzy deformable fusion and optimized CNN with ensemble classification for automated brain tumor diagnosis. Biomed Eng Lett
Nirmala Sreedharan NP, Ganesan B, Raveendran R, Sarala P, Dennis B, Boothalingam R (2018) Grey Wolf optimisation-based feature selection and classification for facial emotion recognition. IET Biometrics 7(5):490–499
Article Google Scholar
Patil PW, Murala S (2019) MSFgNet: a novel compact end-to-end deep network for moving object detection. IEEE Trans Intell Transp Syst 20(11):4066–4077
Article Google Scholar
Reddy V, Sanderson C, Lovell BC (2013) Improved foreground detection via block-based classifier cascade with probabilistic decision integration. IEEE Trans Circuits Syst Video Technol 23(1):83–93
Article Google Scholar
Rhee PK, Erdenee E, Kyun SD, Ahmed MU, Jin S (2017) Active and semi-supervised learning for object detection with imperfect data. Cogn Syst Res 45:109–123
Article Google Scholar
Rodriguez-Ramos A, Rodriguez-Vazquez J, Sampedro C, Campoy P (2020) Adaptive Inattentional framework for video object detection with reward-conditional training. IEEE Access 8:124451–124466
Article Google Scholar
St-Charles P-L, Bilodeau G-A, Bergevin R (2015) SuBSENSE:Auniversal change detection method with local adaptive sensitivity. IEEE Trans Image Process 24(1):359–373
Article MathSciNet MATH Google Scholar
Tsang S, Kao B, Yip KY, Ho W, Lee SD (2011) Decision trees for uncertain data. IEEE Trans Knowl Data Eng 23(1):64–78
Article Google Scholar
Uçar A, Demir Y, Güzeliş C (2017) Object recognition and detection with deep learning for autonomous driving applications. Simulation 93(9):759–769
Article Google Scholar
Unnisa N, Tatineni M (2021) Adaptive deep learning strategy with Red Deer algorithm for Sparse Channel estimation and hybrid precoding in millimeter wave massive MIMO-OFDM systems. Wirel Pers Commun 122:3019–3051
Article Google Scholar
Varadarajan S, Miller P, Zhou H (2015) Region-based mixture of gaussians modelling for foreground detection in dynamic scenes. Pattern Recogn 48(11):3488–3503
Article MATH Google Scholar
Wang S, Pan H, Zhang C, Tian Y (2014) Rgb-d image-based detection of stairs, pedestrian crosswalks and traffic signs. J Vis Commun Image Represent 25(2):263–272
Article Google Scholar
Wang K, Lin L, Yan X, Chen Z, Zhang D, Zhang L (2018) Cost-Effective Object Detection: Active Sample Mining With Switchable Selection Criteria. IEEE Trans Neural Networks Learn Syst PP:1–17
Google Scholar
Wu J, Yang H (2015) Linear regression-based efficient SVM learning for large-scale classification. IEEE Trans Neural Netw Learn Syst 26(10):2357–2369
Article MathSciNet Google Scholar
Yousif H, Yuan J, Kays R, He Z (2018) Object detection from dynamic scene using joint background modeling and fast deep learning classification. J Vis Commun Image Represent 55:802–815
Article Google Scholar
Yu H, Guo D, Yan Z, Fud L, Simmons J, Przybyla CP, Wang S (2020) Weakly supervised easy-to-hard learning for object detection in image sequences. Neurocomputing 398:71–82
Article Google Scholar
Zhang C, Kim J (2020) Video object detection with two-path convolutional LSTM pyramid. IEEE Access 8:151681–151691
Article Google Scholar
Zhang Z, He Z, Cao G, Cao W (2016) Animal detection from highly cluttered natural scenes using spatiotemporal object region proposals and patch verification. IEEE Trans Multimed 18(10):2079–2092
Article Google Scholar
Zhao W, Ma W, Jiao L, Chen P, Yang S, Hou B (2019) Multi-Scale Image Block-Level F-CNN for Remote Sensing Images Object Detection. IEEE Access 7:43607–43621
Article Google Scholar
Zhu Y, Huang C (2012) An improved median filtering algorithm for image noise reduction. Phys Procedia 25:609–616
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of ECE, Koneru Lakshmaiah Educational Foundation (KLEF), Hyderabad, Telangana, India
Rajashekar Reddy Palle & Ravi Boda

Authors

Rajashekar Reddy Palle
View author publications
You can also search for this author inPubMed Google Scholar
Ravi Boda
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Rajashekar Reddy Palle.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This paper does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Palle, R.R., Boda, R. Automated image and video object detection based on hybrid heuristic-based U-net segmentation and faster region-convolutional neural network-enabled learning. Multimed Tools Appl 82, 3459–3484 (2023). https://doi.org/10.1007/s11042-022-13216-0

Download citation

Received: 15 July 2021
Revised: 22 January 2022
Accepted: 11 May 2022
Published: 07 July 2022
Issue Date: January 2023
DOI: https://doi.org/10.1007/s11042-022-13216-0

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automated image and video object detection based on hybrid heuristic-based U-net segmentation and faster region-convolutional neural network-enabled learning

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Hybrid Metaheuristic-Based Thresholding and Faster Region-Convolutional Neural Network for Object Detection in Images

Human object detection: An enhanced black widow optimization algorithm with deep convolution neural network

Tools, techniques, datasets and application areas for object detection in an image: a review

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now