Abstract
The primary role in many computer vision applications is text or character recognition in scenes. Under generic conditions, scene text recognition is the most complicated and open research challenge, and numerous scene techniques have been implemented to address this problem. Existing methods encountered a number of challenges during scene character recognition, including complex backgrounds, noise, blur, non-uniform lighting, local distortion, and different fonts. Hence, we present Bayesian interactive search algorithm (BISA) with AdaBoost-based convolutional neural network (BISA with AdaBoost-CNN) for scene character recognition to tackle the former issues. The word to consecutive conversion and scene character recognition are the two key components in the proposed work. At first, the HOG and SIFT feature descriptors are extracted in word to consecutive conversion. Next, the Bayesian interactive search algorithm (BISA) is utilized to enhance the presentation of AdaBoost-based convolutional neural network (BISA with AdaBoost-CNN) for scene character recognition. Experimentally, different kinds of evaluation measures are used thereby the implementation works handled in MATLAB software. The proposed BISA with AdaBoost-CNN outperforms higher recognition accuracy than other existing approaches.









Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Sheng B, Xiao Fu, Sha L, Sun L (2020) Deep Spatial-Temporal model based cross-scene action recognition using commodity WiFi. IEEE Internet Things J 7(4):3592–3601
Wang C, Peng G, De Baets B (2020) Deep feature fusion through adaptive discriminative metric learning for scene recognition. Inf Fusion 63:1–12
Sherly LTA, Jaya T (2021) Improved firefly algorithm-based optimized convolution neural network for scene character recognition. Signal Image Video Process 1–9
Oybek D, Abdusalomov A, Mukhriddin M, Oybek D, Utkir K, Taeg KW (2020) Automatic salient object extraction based on locally adaptive thresholding to generate tactile graphics. Appl Sci 10(10):3350
Kaliyar RK, Anurag G, Pratik N (2019) Multiclass fake news detection using ensemble machine learning. In: 2019 IEEE 9th international conference on advanced computing (IACC). IEEE, 2019, pp 103–107
Chandio AA, Asikuzzaman M, Pickering M, Leghari M (2020) Cursive-text: A comprehensive dataset for end-to-end Urdu text recognition in natural scene images. Data Brief 31:105749
Sherly LA, Jaya T (2021) Improved firefly algorithm-based optimized convolution neural network for scene character recognition. Signal Image Video Process 1–9
Gowthul Alam MM, Baulkani S (2019) Geometric structure information based multi-objective function to increase fuzzy clustering performance with artificial and real-life data. Soft Comput 23(4):1079–1098
Hassan BA (2020) CSCF: a chaotic sine cosine firefly algorithm for practical application problems. Neural Comput Appl 1–20
Kavitha RS (2021) IOT and context-aware learning-based optimal neural network model for real-time health monitoring. Trans Emerg Telecommun Technol 32(1):e4132
Rejeesh MR (2019) Interest point based face recognition using adaptive neuro fuzzy inference system. Multimed Tools Appl 78(16):22691–22710
Sundararaj V (2016) An efficient threshold prediction scheme for wavelet based ECG signal noise reduction using variable step size firefly algorithm. Int J Intell Eng Syst 9(3):117–126
Sundararaj V (2019) Optimised denoising scheme via opposition-based self-adaptive learning PSO algorithm for wavelet-based ECG signal noise reduction. Int J Biomed Eng Technol 31(4):325
Sundararaj V, Anoop V, Dixit P, Arjaria A, Chourasia U, Bhambri P, Rejeesh MR, Sundararaj R (2020) CCGPA-MPPT: cauchy preferential crossover-based global pollination algorithm for MPPT in photovoltaic system. Prog Photovolt Res Appl 28(11):1128–1145
Vinu S (2019) Optimal task assignment in mobile cloud computing by queue based ant-bee algorithm. Wirel Pers Commun 104(1):173–197
Jose J, Gautam N, Tiwari M, Tiwari T, Suresh A, Sundararaj V, Rejeesh MR (2021) An image quality enhancement scheme employing adolescent identity search algorithm in the NSST domain for multimodal medical image fusion. Biomed Signal Process Control 66:102480
Eltay M, Zidouri A, Ahmad I (2020) Exploring deep learning approaches to recognize handwritten arabic texts. IEEE Access 8:89882–89898
Guo Q, Wang F, Lei J, Dan Tu, Li G (2016) Convolutional feature learning and Hybrid CNN-HMM for scene number recognition. Neurocomputing 184:78–90
Jaderberg M, Karen S, Andrea V, Andrew Z (2014) Synthetic data and artificial neural networks for natural scene text recognition. http://www.ee.surrey.ac.uk/CVSSP/demos/chars74k/
Wang Y, Shi C, Wang C, Xiao B, Qi C (2017) Multi-order co-occurrence activations encoded with fisher vector for scene character recognition. Pattern Recogn Lett 97:69–76
Chen X, Tianwei W, Yuanzhi Z, Lianwen J, Canjie L (2020) Adaptive embedding gate for attention-based scene text recognition. Neurocomputing 381:261–271
Guo Q, Lei J, Tu D, Li G (2014) Reading numbers in natural scene images with convolutional neural networks. In: Proceedings 2014 IEEE international conference on security, pattern analysis, and cybernetics (SPAC). IEEE, pp 48–53
Mortazavi A, Vedat T, Ayhan N (2018) Interactive search algorithm: a new hybrid metaheuristic optimization algorithm. Eng Appl Artif Intell 71:275–292
Mortazavi A (2021) Bayesian interactive search algorithm: a new probabilistic swarm intelligence tested on mathematical and structural optimization problems. Adv Eng Softw 155:102994
Haseena KS, Anees S, Madheswari N (2014) Power optimization using EPAR protocol in MANET. Int J Innov Sci Eng Technol 1(6)
Azath M, Banu RW, Madheswari AN (2011) Improving fairness in network traffic by controlling congestion and unresponsive flows. In: International conference on network security and applications. Springer, Berlin, Heidelberg, pp 356–363
Liu Y et al (2016) Exponential stability of Markovian jumping Cohen-Grossberg neural networks with mixed mode-dependent timedelays. Neurocomputing 177:409–415
Du B, Liu Y, Abbas IA (2016) Existence and asymptotic behaviorresults of periodic solution for discrete-time neutral-type neural networks. J Frankl Inst 353(2):448–461
Abouelmagd EI et al (2014) Reduction the secular solution to periodic solution in the generalized restricted three-body problem. Astrophys Space Sci 350(2):495–505
Afif M, Riadh A, Yahia S, Mohamed A (2020) Deep learning based application for indoor scene recognition. Neural Process Lett 1–11
Su B, Shijian Lu (2017) Accurate recognition of words in scenes without character segmentation using recurrent neural network. Pattern Recogn 63:397–405
Zhang Z, Wang H, Liu S, Xiao B (2018) Deep contextual stroke pooling for scene character recognition. IEEE Access 6:16454–16463
Lin Q, Canjie L, Lianwen J, Songxuan L (2021) STAN: a sequential transformation attention-based network for scene text recognition. Pattern Recognit 111:107692
Wang Q, Huang Ye, Jia W, He X, Blumenstein M, Lyu S, Yue Lu (2020) FACLSTM: ConvLSTM with focused attention for scene text recognition. Sci China Inf Sci 63(2):1–14
Graves A, Liwicki M, Fernández S, Bertolami R, Bunke H, Schmidhuber J (2008) A novel connectionist system for unconstrained handwriting recognition. IEEE Trans Pattern Anal Mach Intell 31(5):855–868
Gers Felix A, Jurgen S, Cummins F (2000) Learning to forget: continual prediction with LSTM. Neural Comput 12(10):2451–2471
Hastie T, Rosset S, Zhu Ji, Zou H (2009) Multi-class adaboost. Stat Interface 2(3):349–360
Taherkhani A, Georgina C, Martin McGinnity T (2020) AdaBoost-CNN: an adaptive boosting algorithm for convolutional neural networks to classify multi-class imbalanced datasets using transfer learning. Neurocomputing 404:351–366
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105
Gülcü A, Zeki K (2020) Hyper-parameter selection in convolutional neural networks using microcanonical optimization algorithm. IEEE Access 8:52528–52540
Wang Y, Zhang H, Zhang G (2019) cPSO-CNN: An efficient PSO-based algorithm for fine-tuning hyper-parameters of convolutional neural networks. Swarm Evol Comput 49:114–123
Netzer Y, Tao W, Adam C, Alessandro B, Bo W, Andrew YN (2011) Reading digits in natural images with unsupervised feature learning
Mishra A, Alahari K, Jawahar CV (2012) Scene text recognition using higher order language priors. In: BMVC-British Machine Vision Conference. BMVAs
Karatzas D, Faisal S, Seiichi U, Masakazu I, Gomez i Bigorda L, Sergi RM, Joan M, David FM, Jon AA, Lluis Pere De Las H (2013) ICDAR 2013 robust reading competition. In: 2013 12th international conference on document analysis and recognition, pp 1484–1493. IEEE
Shi C-Z, Gao S, Liu M-T, Qi C-Z, Wang C-H, Xiao B-H (2015) Stroke detector and structure based models for character recognition: a comparative study. IEEE Trans Image Process 24(12):4952–4964
Gao S, Chunheng W, Baihua X, Cunzhao S, Zhong Z (2014) Stroke bank: a high-level representation for scene character recognition. In: 2014 22nd international conference on pattern recognition. IEEE, pp 2909–2913
Gao S, Wang C, Xiao B, Shi C, Zhou W, Zhang Z (2014) Learning co-occurrence strokes for scene character recognition based on spatiality embedded dictionary. In: 2014 IEEE international conference on image processing (ICIP). IEEE, pp 5956–5960
Shi C, Wang Y, Jia F, He K, Wang C, Xiao B (2017) Fisher vector for scene character recognition: a comprehensive evaluation. Pattern Recognit 72:1–14
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author(s) declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sherly, L.T.A., Jaya, T. An efficient indoor scene character recognition using Bayesian interactive search algorithm-based adaboost-CNN classifier. Neural Comput & Applic 33, 15345–15356 (2021). https://doi.org/10.1007/s00521-021-06161-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06161-w