Abstract
Object detection is one of the major areas of computer vision and draws on machine learning approaches in diverse ways. The machine learning field is now driven by Deep Neural Networks (DNNs), which take advantage of progress in data availability and computing power. In all cases, the images and videos are biased and noisy, and thus the data distributions are also considered imbalanced and disturbed. Different techniques, mostly based on deep learning and computer vision, have been developed to address these challenges. However, traditional algorithms consistently detect dense and small objects poorly and fail to detect objects under random geometric transformations. The Convolutional Neural Network (CNN), one category of deep learning model, is a well-known and well-suited method for image-related tasks: the network is trained to discover features such as colour differences, corners, and edges in images and videos, which are then combined into more complex shapes. This proposal intends to develop improved object detection in images and videos using advances in deep learning models. The proposed object detection model has three main phases: (a) pre-processing, (b) segmentation, and (c) detection. Once the image has been pre-processed by median filtering, object segmentation is performed by an adaptive U-Net tuned with the newly proposed Sun Flower-Deer Hunting Optimization Algorithm (SF-DHOA). The main objective of the proposed segmentation is to maximize the segmentation accuracy and the Dice coefficient. SF-DHOA is a hybrid meta-heuristic algorithm that combines Sun Flower Optimization (SFO) and the Deer Hunting Optimization Algorithm (DHOA); it tunes the U-Net by optimizing the encoder depth and the number of epochs. Further, detection is performed by a modified Faster Region-Convolutional Neural Network (Faster-RCNN), in which the number of epochs is optimized by the hybrid SF-DHOA algorithm with the intention of minimizing the error and the training loss function. The performance of the proposed algorithm is evaluated, and it shows a marked improvement over existing deep learning-based algorithms.
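Two of the building blocks named in the abstract, median filtering for pre-processing and the Dice coefficient as a segmentation objective, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `median_filter` and `dice_coefficient`, the 3x3 default window, and the edge-replication border handling are all assumptions chosen for illustration, and the code uses plain Python lists rather than an image library.

```python
from statistics import median


def median_filter(image, size=3):
    """Apply a size x size median filter to a 2-D grayscale image (list of rows).

    Assumption: border pixels are handled by replicating the nearest edge pixel.
    """
    height, width = len(image), len(image[0])
    radius = size // 2
    filtered = [[0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            window = []
            for dr in range(-radius, radius + 1):
                for dc in range(-radius, radius + 1):
                    # Clamp coordinates so the window stays inside the image.
                    rr = min(max(r + dr, 0), height - 1)
                    cc = min(max(c + dc, 0), width - 1)
                    window.append(image[rr][cc])
            filtered[r][c] = median(window)
    return filtered


def dice_coefficient(predicted, target):
    """Dice coefficient 2|A and B| / (|A| + |B|) for two binary (0/1) masks."""
    intersection = sum(
        p * t
        for pred_row, target_row in zip(predicted, target)
        for p, t in zip(pred_row, target_row)
    )
    total = sum(map(sum, predicted)) + sum(map(sum, target))
    # Assumed convention: two empty masks agree perfectly.
    return 1.0 if total == 0 else 2.0 * intersection / total


# Usage: a single impulse-noise pixel is removed by the filter.
noisy = [[10, 10, 10], [10, 255, 10], [10, 10, 10]]
clean = median_filter(noisy)

# Usage: overlap between a predicted mask and a ground-truth mask.
score = dice_coefficient([[1, 1], [0, 0]], [[1, 0], [0, 0]])
```

In a pipeline like the one described, a score such as `score` would be the quantity a hyperparameter search tries to maximize when choosing settings like the encoder depth and the number of epochs; the search procedure itself (SF-DHOA) is not sketched here.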








Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This paper does not contain any studies with human participants or animals performed by any of the authors.
About this article
Cite this article
Palle, R.R., Boda, R. Automated image and video object detection based on hybrid heuristic-based U-net segmentation and faster region-convolutional neural network-enabled learning. Multimed Tools Appl 82, 3459–3484 (2023). https://doi.org/10.1007/s11042-022-13216-0
DOI: https://doi.org/10.1007/s11042-022-13216-0