Abstract
Recently, healthcare data consist of an enormous amount of information, which is challenging to maintain by manual methods. Due to the development of big data in the communities of biomedical and health care, accurate study of the medical data helps the recognition of the disease in early stage, patient care and community services. It mainly focuses on predicting and exploring the conditions due to some significant effects on health which are on the increase in multiple cities. The existing system in the medical field cannot extract complete information from the chronic disease database. It is complicated for the healthcare practitioner to analyze and diagnose constant disease since it plays a challenging task. This paper presents a modified artificial neural network (ANN) classifier technique with a MapReduce framework for the prediction of disease. For preprocessing, min–max normalization is carried out to enhance the accuracy of system. This MapReduce is applied for providing a feasible framework in predictive programming algorithms for the map and reduce functions. This is a simple programming interface, which helps in efficiently solving predictive problems. The primary intention of the proposed system is to analyze accurate, fast and optimal results on chronic disease datasets. It increases the throughput and redundancy in cases of retrieving the vast data. Thus, integrating a modified ANN classifier with a reduced framework is useful in providing better outcomes. The experimental results over chronic diabetic dataset prove that the proposed artificial neural network with MapReduce structure is capable of predicting the precision, sensitivity and specificity level modified on comparing with other existing deep neural network approaches.
Similar content being viewed by others
Change history
20 December 2022
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s00500-022-07758-6
References
Abdul-Aziz A, Salama AS, Riad AM, Mahmoud AN (2018) A machine learning model for predicting of chronic kidney disease based internet of things and cloud computing in smart cities. Lect Notes Intell Transp Infrastruct. https://doi.org/10.1007/978-3-030-01560-2_5
Anderson UD, Gram M, Åkerström B, Hansson SR (2015) First trimester prediction of preeclampsia. Curr Hypertens Rep. https://doi.org/10.1007/s11906-015-0584-
Arbabshirani MR, Plis S, Sui J, Calhoun VD (2017) Single subject prediction of brain disorders in neuroimaging: Promises and pitfalls. NeuroImage 145:137–165. https://doi.org/10.1016/j.neuroimage.2016.02.079
Bayati M, Bhaskar S, Montanari A (2015) A low-cost method for multiple disease prediction. AMIA Annu Symp Proc 2015:329–338
Becht E, McInnes L, Healy J, Dutertre C-A, Kwok IWH, Ng LG, Newell EW (2018) Dimensionality reduction for visualizing single-cell data using UMAP. Nat Biotechnol. https://doi.org/10.1038/nbt.4314…
Brisimi TS, Xu T, Wang T, Dai W, Adams WG, Paschalidis IC (2018) Predicting diabetic chronic disease hospitalizations from electronic health records: an interpretable classification approach. Proc IEEE 106(4):690–707. https://doi.org/10.1109/jproc.2017.2789319
Desai MPS, Agarwal M (2015) Twitter word frequency count using hadoop components. J Eng Technol 02:1–8
García-Gil D, Luengo J, García S, Herrera F (2018) Enabling smart data: noise filtering in big data classification. Inf Sci. https://doi.org/10.1016/j.ins.2018.12.002
Gaur S (2017) Comparative analysis between GA, KNN and hybrid algorithm to optimize the classification of fuzzy KNN. In: 46th ISTE annual national convention and national conference 2017 International Journal of Advance Research and Innovation (ISSN 2347–3258)
Guan Z, Lv Z, Du X, Wu L, Guizani M (2019) Achieving data utility-privacy tradeoff in the internet of medical things: a machine learning approach. Fut Gener Comput Syst. https://doi.org/10.1016/j.future.2019.01.058
Ismail A, Shehab A, El-Henawy IM (2018) Healthcare analysis in smart big data analytics: reviews, challenges and recommendations. Lect Notes Intell Transp Infrastruct. https://doi.org/10.1007/978-3-030-01560-2_2
John J, Ashok S (2018) Process framework for modeling multivariate time series data. Adv Intell Syst Comput. https://doi.org/10.1007/978-981-13-0514-6_56
Khan A, Uddin S, Srinivasan U (2018a) Comorbidity network for diabetic chronic disease: a novel approach to understanding type 2 diabetes progression. Int J Med Inf 115:1–9. https://doi.org/10.1016/j.ijmedinf.2018.04.001
Khan S, Khan A, Maqsood M, Aadil F, Ghazanfar MA (2018b) Optimized gabor feature extraction for mass classification using cuckoo search for big data E-healthcare. J Grid Comput. https://doi.org/10.1007/s10723-018-9459-x
Koch F, Koster A, Riaño D, Montagna S, Schumacher M, Ten Teije A, Wiratunga N (eds) (2019) Artificial intelligence in health. Lect Notes Comput Sci. https://doi.org/10.1007/978-3-030-12738-1
Koti MS, Alamma BH (2018) Predictive analytics techniques using big data for healthcare databases. Smart Innov Syst Technol. https://doi.org/10.1007/978-981-13-1927-3_71
Kriplani H, Patel B, Roy S (2019) Prediction of chronic kidney diseases using deep artificial neural network technique. Lect Notes Comput Vis Biomech. https://doi.org/10.1007/978-3-030-04061-1_18
Kumar NMS, Eswari T, Sampath P, Lavanya S (2015) Predictive methodology for diabetic data analysis in big data. Procedia Comput Sci 50:203–208. https://doi.org/10.1016/j.procs.2015.04.069
Lakshmanaprabu SK, Shankar K, Ilayaraja M, Nasir AW, Vijayakumar V, Chilamkurti N (2019) Random forest for big data classification in the internet of things using optimal features. Int J Mach Learn Cybern. https://doi.org/10.1007/s13042-018-00916-z
Lokeswari YV, Jacob SG, Ramadoss R (2018) Parallel prediction algorithms for heterogeneous data: a case study with real-time big datasets. Adv Big Data Cloud Comput. https://doi.org/10.1007/978-981-13-1882-5_46
Mac Parthalain N, Jensen R, Diao R, Khan S, Khan A, Maqsood M, Aadil F, Ghazanfar MA (2018) Optimized gabor feature extraction for mass classification using cuckoo search for big data E-healthcare. J Grid Comput. https://doi.org/10.1007/s10723-018-9459-xhttps://doi.org/10.1109/tfuzz.2019.2921935
Marinescu R (2004) Detection strategies: metrics-based rules for detecting design flaws. In: 20th IEEE international conference on software maintenance, 2004. Proceedings. https://doi.org/10.1109/icsm.2004.1357820
Mehta T, Mangla N, Gurgaon G (2016) A survey paper on big data analytics using map reduce and hive on hadoop framework. Int J Recent Adv Eng Technol 4(2):112–118
Pham T, Tran T, Phung D, Venkatesh S (2017) Predicting healthcare trajectories from medical records: a deep learning approach. J Biomed Inform 69:218–229. https://doi.org/10.1016/j.jbi.2017.04.001
Rajathi GI, Jiji W (2019) Chronic liver disease classification using hybrid whale optimization with simulated annealing and ensemble classifier, Received: 16 November 2018; Accepted: 25 December 2018; Published: 2 January 2019
Ramage ER, Fini NA, Lynch EA, Patterson A, Said CM, English C (2019) Supervised exercise delivered via telehealth in real-time to manage chronic conditions in adults: a protocol for a scoping review to inform future research in stroke survivors. BMJ Open 9(3):e027416. https://doi.org/10.1136/bmjopen-2018-027416
Reddy ZP, Pavan Kumar PNVS (2016) Comparing the word count execution time in hadoop and spark. IJISET Int J Innov Sci Eng Tech 3(10):2348–7968 (October 2016 ISSN (Online) Impact Factor (2015)—4.332)
Sahoo PK, Mohapatra SK, Wu S-L (2016) Analyzing Healthcare big data with prediction for future health condition. IEEE Access 4:9786–9799. https://doi.org/10.1109/access.2016.2647619
Shakeel PM, Baskar S, Dhulipala VRS, Jaber MM (2018) Cloud-based framework for the diagnosis of diabetes mellitus using K-means clustering. Health Inf Sci Syst. https://doi.org/10.1007/s13755-018-0054-0
Siuly S, Zhang Y (2016) Medical big data: neurological diseases diagnosis through medical data analysis. Data Sci Eng 1(2):54–64. https://doi.org/10.1007/s41019-016-0011-3
Triguero I, García-Gil D, Maillo J, Luengo J, García S, Herrera F (2018) Transforming big data into smart data: an insight into the use of k-nearest neighbour's algorithm to obtain quality data. Wiley Interdiscip Rev Data Min Knowl Discov. https://doi.org/10.1002/widm.1289
Varatharajan R, Monogram G, Priyan MK (2017) A big data classification approach using LDA with an enhanced SVM method for ECG signals in cloud computing. Multimed Tools Appl 77(8):10195–10215. https://doi.org/10.1007/s11042-017-5318-1
Wu JM-T, Tsai M-H, Huang YZ, Hafizul Islam SK, Hassan MM, Alelaiwi A, Fortino G (2019) Applying an ensemble convolution neural network with a SavitzkyGolay filter to construct a phonocardiogram prediction model. Appl Soft Comput. https://doi.org/10.1016/j.asoc.2019.01.019
Xu X, Liang T, Zhu J, Zheng D, Sun T (2018) Review of classical dimensionality reduction and sample selection methods for large-scale data processing. Neurocomputing. https://doi.org/10.1016/j.neucom.2018.02.100
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All authors state that there is no conflict of interest
Ethical approval
Humans and animals are not involved in this research work. We used our own data.
Additional information
Communicated by V. Loia.
Communicated by V. Loia.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s00500-022-07758-6"
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ramani, R., Vimala Devi, K. & Ruba Soundar, K. RETRACTED ARTICLE: MapReduce-based big data framework using modified artificial neural network classifier for diabetic chronic disease prediction. Soft Comput 24, 16335–16345 (2020). https://doi.org/10.1007/s00500-020-04943-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-020-04943-3