Abstract
Big data refers to large data sets, drawn from myriad sources, that are analyzed computationally. Processing such huge volumes of data poses various challenges. To handle the issues that arise with large-scale databases, a MapReduce framework is employed, which provides a robust and simple infrastructure for huge datasets. This paper proposes a novel Elastic Collision Seeker Optimization based Faster R-CNN (ECSO-FRCNN) classifier for efficient big data classification. The proposed ECSO-FRCNN classifier is capable of handling missing attributes, supports incremental learning, and improves training performance effectively. Because the proposed technique deals with large data samples, it necessitates the inclusion of the MapReduce framework. Adopting the MapReduce design in big data classification protects the classification results from problems such as data redundancy, misclassification, and storage issues. The proposed method is evaluated on three standard datasets, namely the skin segmentation dataset, the mushroom dataset, and the localization dataset, collected from the University of California, Irvine (UCI) Machine Learning Repository. Finally, extensive experimental analysis is carried out over various parameters to demonstrate the efficiency of the system.
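As a rough illustration of how a MapReduce design can support classification over partitioned data, the following minimal Python sketch computes per-class summaries in a map step and merges them in a reduce step. It is not the ECSO-FRCNN classifier: a simple nearest-centroid rule is swapped in as a stand-in, and all function names (`map_partition`, `reduce_stats`, `centroids`, `classify`) and the toy data are hypothetical, chosen only for this example.

```python
from functools import reduce


def map_partition(rows):
    """Map step: per-class feature sums and counts for one data partition."""
    stats = {}
    for features, label in rows:
        sums, count = stats.get(label, ([0.0] * len(features), 0))
        sums = [s + x for s, x in zip(sums, features)]
        stats[label] = (sums, count + 1)
    return stats


def reduce_stats(a, b):
    """Reduce step: merge two partial per-class summaries."""
    merged = dict(a)
    for label, (sums, count) in b.items():
        if label in merged:
            prev_sums, prev_count = merged[label]
            merged[label] = (
                [p + s for p, s in zip(prev_sums, sums)],
                prev_count + count,
            )
        else:
            merged[label] = (sums, count)
    return merged


def centroids(stats):
    """Turn merged sums and counts into one mean vector per class."""
    return {
        label: [s / count for s in sums]
        for label, (sums, count) in stats.items()
    }


def classify(point, cents):
    """Assign the class whose centroid is closest (squared Euclidean distance)."""
    return min(
        cents,
        key=lambda label: sum((p - c) ** 2 for p, c in zip(point, cents[label])),
    )


# Toy data: two partitions, as if stored on two separate nodes (assumed values).
partitions = [
    [((0.0, 0.0), "a"), ((10.0, 10.0), "b")],
    [((2.0, 2.0), "a"), ((12.0, 12.0), "b")],
]

# Each partition is summarized independently, then the summaries are merged.
model = centroids(reduce(reduce_stats, map(map_partition, partitions)))
print(classify((9.0, 9.0), model))
```

Because each partition is reduced to a small summary before merging, no single step needs the full dataset in memory, which is the property the MapReduce design relies on for large data samples.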
Data and material availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Acknowledgements
The authors are grateful to all respondents who participated in this study and to the data collectors for their contribution.
Author information
Contributions
All authors who participated in data analysis, drafting or revising the manuscript gave approval of the final version to be published.
Ethics declarations
Conflicts of interest
The authors declare that they have no conflicts of interest.
Human and animal rights
This article does not contain any studies with human or animal subjects performed by any of the authors.
Informed consent
Consent was secured from all of the respondents who participated in the study.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chidambaram, S., Cyril, C.P.D. & Ganesh, S.S. An efficient big data classification using elastic collision seeker optimization based faster R-CNN. Neural Comput & Applic 35, 19651–19668 (2023). https://doi.org/10.1007/s00521-023-08707-6
DOI: https://doi.org/10.1007/s00521-023-08707-6