Abstract
Innovation and adoption of new technologies in health care industries produce vast data every day. The diverse data in health care include clinical data, health history, and genetic data. Also, real-time monitoring in health care generates huge data, and efficiently examining these big data is a challenging task. Analysis of health care data becomes more important so that proper medications can be provided and issues can be reduced by taking proper precautions based on the history. Data analysis becomes efficient because of automation, however, due to data integrity, data diversity, and inconsistency, the performance gets lagged. Various machine learning models are introduced to handle big data management; however, researchers are still working to attain a better model with improved accuracy. So with the objective to attain maximum classification accuracy, fuzzy c means clustering and generative adversarial network are employed in this research work for health care data clustering and classification. Benchmark lung cancer dataset and Arrhythmia dataset are used in the experimentation. The proposed model exhibits the maximum accuracy of 97.8% for dataset 1 and 98.6% for dataset 2 compared to existing techniques like support vector machine, decision tree, and random forest algorithms.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Enquiries about data availability should be directed to the authors.
References
AD. Alahmar; R Benlamri, (2020) SNOMED CT-based standardized e-clinical pathways for enabling big data analytics in healthcare. IEEE Access 8:92765–92775
AJ. Boddy; W Hurst; M Mackay; A elRhalibi, (2019) Density-based outlier detection for safeguarding electronic patient record systems. IEEE Access 7:40285–40294
AliAbbas S, Aslam A, UrRehman A, ArshadAbbasi W, Arif S, Kazmi SZH (2020) K-means and K-medoids: cluster analysis on birth data collected in City Muzaffarabad, Kashmir. IEEE Access 8:151847–151855
Al-Zahrani FA (2020) Evaluating the usable-security of healthcare software through unified technique of fuzzy logic, ANP and TOPSIS. IEEE Access 8:109905–109916
Bahri S, Zoghlami N, Abed M, Tavares JMRS (2019) BIG DATA for healthcare: a survey. IEEE Access 7:7397–7408
Bindhu V, Ranganathan G (2021) Hyperspectral image processing in internet of things model using clustering algorithm. J ISMAC 3(02):163–175
Cai Q, Wang H, Li Z, Liu X (2019) A survey on multimodal data-driven smart healthcare systems: approaches and applications. IEEE Access 7:133583–133599
Chen JIZ, Hengjinda P (2021) Enhanced dragonfly algorithm based K-medoid clustering model for VANET. J ISMAC 3(01):50–59
Chen JIZ, Zong JI (2021) Automatic vehicle license plate detection using K-means clustering algorithm and CNN. J Electric Eng Autom 3(1):15–23
Chen M, Hao Y, Hwang K, Wang L, Wang L (2017) Disease prediction by machine learning over big data from healthcare communities. IEEE Access 5:8869–8879
Elhoseny M, Ramírez-González G, Abu-Elnasr OM, Shawkat SA, Arunkumar N, Farouk A (2018) Secure medical data transmission model for IoT-based healthcare systems. IEEE Access 6:20596–20608
Forkan ARM, Khalil I, Ibaida A, Tari Z (2017) BDCaM: big data for context-aware monitoring—a personalized knowledge discovery framework for assisted healthcare. IEEE Trans Cloud Comput 5(4):628–641
Galletta A, Carnevale L, Bramanti A, Fazio M (2019) An innovative methodology for big data visualization for telemedicine. IEEE Trans Industr Inf 15(1):490–497
Gao Y, Sun C, Li R, Li Q, Cui L, Gong B (2018) An efficient fraud identification method combining manifold learning and outliers detection in mobile healthcare services. IEEE Access 6:60059–60068
Ghorbani R, Ghousi R, Makui A, Atashi A (2020) A new hybrid predictive model to predict the early mortality risk in intensive care units on a highly imbalanced dataset. IEEE Access 8:141066–141079
Guo W, Ge W, Cui L, Li H, Kong L (2019) An interpretable disease onset predictive model using crossover attention mechanism from electronic health records. IEEE Access 7:134236–134244
Harerimana G, Jang B, Kim JW, Park HK (2018) Health big data analytics: a technology survey. IEEE Access 6:65661–65678
https://www.kaggle.com/bulentesen/cardiac-arrhythmia-database
Huang H, Gong T, Ye N, Wang R, Dou Y (2017) Private and secured medical data transmission and analysis for wireless sensing healthcare system. IEEE Trans Industr Inf 13(3):1227–1237
Jin H, Luo Y, Pg Li; J Mathew, (2019) A review of secure and privacy-preserving medical data sharing. IEEE Access 7:61656–61669
Kim J-C, Chung K (2020) Multi-modal stacked denoising autoencoder for handling missing data in healthcare big data. IEEE Access 8:104933–104943
Kumar S, Singh M (2019) Big data analytics for healthcare industry: impact, applications, and tools. Big Data Mining and Analytics 2(1):48–57
Li J, Tan X, Xu X, Wang F (2019) efficient mining template of predictive temporal clinical event patterns from patient electronic medical records. IEEE J Biomed Health Inform 23(5):2138–2147
Liu K, Chen Z, Wu J, Tan Y, Wang L, Yan Y, Zhang H, Long J (2019) Big medical data decision-making intelligent system exploiting fuzzy inference logic for prostate cancer in developing countries. IEEE Access 7:2348–2363
MofijulIslam MD, AbdurRazzaque MD, MehediHassan M, NagyIsmail W, Song B (2017) Mobile cloud-based big healthcare data processing in smart cities. IEEE Access 5:11887–11899
Muhammed T, Mehmood R, Albeshri A, Katib I (2018) UbeHealth: a personalized ubiquitous cloud and edge-enabled networked healthcare system for smart cities. IEEE Access 6:32258–32285
Nazir S, Nawaz M, Adnan A, Shahzad S, Asadi S (2019) Big data features, applications, and analytics in cardiology—a systematic literature review. IEEE Access 7:143742–143771
Nazir S, Khan S, Khan HU, Ali S, García-Magariño I, Atan RB, Nawaz M (2020) A comprehensive analysis of healthcare big data management, analytics and scientific programming. IEEE Access 8:95714–95733
Se Mohan; C Thirumalai; G Srivastava, (2019) Effective heart disease prediction using hybrid machine learning techniques. IEEE Access 7:81542–81554
Sun Y, Zhang D (2019) Diagnosis and analysis of diabetic retinopathy based on electronic health records. IEEE Access 7:86115–86120
Xu C, Wang N, Zhu L, Sharif K, Zhang C (2019) Achieving searchable and privacy-preserving data sharing for cloud-assisted e-healthcare system. IEEE Internet Things J 6(5):8345–8356
Funding
The authors have not disclosed any funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All authors state that there is no conflict of interest.
Humans and animals
Humans and animals are not involved in this research work. We used our own data.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Purandhar, N., Ayyasamy, S. & Siva Kumar, P. Classification of clustered health care data analysis using generative adversarial networks (GAN). Soft Comput 26, 5511–5521 (2022). https://doi.org/10.1007/s00500-022-07026-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-022-07026-7