Investigating the Optimal Parameterization of Deep Neural Network and Synthetic Data Workflow for Imbalance Liver Disorder Dataset Classification

Diana, Nova Eka; Ahmad, Andi Batari; Mahardika, Zwasta Pribadi

doi:10.1007/978-3-030-36056-6_9

Nova Eka Diana¹⁸,
Andi Batari Ahmad¹⁸ &
Zwasta Pribadi Mahardika¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 978))

Included in the following conference series:

International Conference on Soft Computing and Data Mining

785 Accesses
1 Citations

Abstract

DNN (Deep neural network) has emerged as one of the standard methods to create a classification model. The most common issue affecting DNN performance is the class-imbalanced distribution dataset. This research designed two workflows for generating synthetic dataset using SMOTE algorithm, SDS-1, and SDS-2 dataset. We further investigated the optimal DNN parameters that generate the best optimum performance over those datasets. We used Indian Liver Patient Dataset (ILPD) from the oldest source, UCI Machine Learning Repository, with a total of 583 records, consist of 416 positives and 167 negatives data. We measured the DNN performance using sensitivity and F-score metric following the nature of the medical domain that mainly focused on identifying a particular disease. The experiment results revealed that DNN model with the learning rate of 1E-1, TanH activation function, Xavier weighting, the epoch of 40, and the hidden layers of 10, delivered the best sensitivity and F-score value, 98.40% and 99.18%, respectively. The results suggested that the workflow for generating the class-balanced dataset will leverage the DNN performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Asrani SK, Devarbhavi H, Eaton J, Kamath PS (2019) Burden of liver diseases in the world. J Hepatol 70:151–171
Article Google Scholar
World Health Organization (2018) World health statistics 2018: monitoring health for the SDGs. Sustainable Development Goals, Geneva
Google Scholar
Patel OP, Tiwari A (2015) Liver disease diagnosis using quantum-based binary neural network learning algorithm. In: Proceedings of fourth international conference on soft computing for problem solving, advances in intelligent systems and computing, vol 336. Springer, New Delhi, pp 425—434
Google Scholar
Abdar M, Yen NY, Hung JCS (2018) Improving the diagnosis of liver disease using multilayer perceptron neural network and boosted decision trees. J Med Biol Eng 38(6):953–965
Article Google Scholar
Wu CC et al (2019) Prediction of fatty liver disease using machine learning algorithms. Comput Methods Programs Biomed 170:23–29
Article Google Scholar
Hassan TM, Elmogy M, Sallam ES (2017) Diagnosis of focal liver diseases based on deep learning technique for ultrasound images. Arab J Sci Eng 42(8):3127–3140
Article Google Scholar
Das A, Rajendra Acharya U, Panda SS, Sabut S (2019) Deep learning based liver cancer detection using watershed transform and Gaussian mixture model techniques. Cogn Syst Res 54:165–175
Article Google Scholar
Lee T, Kim J, Uh Y, Lee H (2019) Deep neural network for estimating low density lipoprotein cholesterol. Clin Chim Acta 489:35–40
Article Google Scholar
Kannadasan K, Edla DR, Kuppili V (2018) Type 2 diabetes data classification using stacked autoencoders in deep neural networks. Clin Epidemiol Glob Health
Google Scholar
Singaravel S, Suykens J, Geyer P (2018) Deep-learning neural-network architectures and methods: using component based models in building-design energy prediction. Adv Eng Inform 38:81–90
Article Google Scholar
Aung SWY, Khaing SS, Aung ST (2019) Multi-label land cover indices classification of satellite images using deep learning. In: ICBDL 2018: big data analysis and deep learning applications, vol 744. Springer, Singapore, pp 94–103
Google Scholar
Chemali E, Kollmeyer P, Preindl M, Emadi A (2018) State-of-charge estimation of Li-ion batteries using deep neural networks: a machine learning approach. J Power Sour 400:242–255
Article Google Scholar
Bazrafkan S, Thavalengal S, Corcoran P (2018) An end to end deep neural network for iris segmentation in unconstrained scenarios. Neural Netw 106:79–95
Article Google Scholar
Zhang L, Zhang C, Gao R, Yang R, Song Q (2016) Using the SMOTE technique and hybrid features to predict the types of ion channel-targeted conotoxins. J Theoret Biol 403:75–84
Article MathSciNet Google Scholar
Guo H, Zhou J, Wu C-A (2018) Imbalanced learning based on data-partition and SMOTE. Information 9:238–250
Article Google Scholar
Raghuwanshi BS, Shukla S (2019) SMOTE based class-specific extreme learning machine for imbalanced learning. Knowl-Based Syst (2019)
Google Scholar
Maldonado S, Lopez J, Vairetti C (2019) An alternative SMOTE oversampling strategy for high-dimensional datasets. Appl Soft Comput 76:380–389
Article Google Scholar
Douzas G, Bacao F, Last F (2018) Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE. Inf Sci 465:1–20
Article Google Scholar
Goodfellow I et al (2016) Deep learning (Adaptive Computation and Machine Learning Series). The MIT Press
Google Scholar

Download references

Acknowledgments

The authors wish to thank Universitas YARSI for funding this research (No. 183/INT/UM/WRII/UY/VIII/2016).

Author information

Authors and Affiliations

Informatics Department, Faculty of Information Technology, Universitas YARSI, Jakarta, 10510, Indonesia
Nova Eka Diana & Andi Batari Ahmad
Faculty of Medicine, Universitas YARSI, Jakarta, 10510, Indonesia
Zwasta Pribadi Mahardika

Authors

Nova Eka Diana
View author publications
You can also search for this author in PubMed Google Scholar
Andi Batari Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Zwasta Pribadi Mahardika
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nova Eka Diana .

Editor information

Editors and Affiliations

Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Batu Pahat, Johor, Malaysia
Rozaida Ghazali
Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Batu Pahat, Johor, Malaysia
Nazri Mohd Nawi
Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Batu Pahat, Johor, Malaysia
Mustafa Mat Deris
School of Information Technology, Deakin University, Geelong Waurn Ponds Campus, VIC, Australia
Jemal H. Abawajy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Diana, N.E., Ahmad, A.B., Mahardika, Z.P. (2020). Investigating the Optimal Parameterization of Deep Neural Network and Synthetic Data Workflow for Imbalance Liver Disorder Dataset Classification. In: Ghazali, R., Nawi, N., Deris, M., Abawajy, J. (eds) Recent Advances on Soft Computing and Data Mining. SCDM 2020. Advances in Intelligent Systems and Computing, vol 978. Springer, Cham. https://doi.org/10.1007/978-3-030-36056-6_9

Download citation

DOI: https://doi.org/10.1007/978-3-030-36056-6_9
Published: 05 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36055-9
Online ISBN: 978-3-030-36056-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics