Skip to main content

Investigating the Optimal Parameterization of Deep Neural Network and Synthetic Data Workflow for Imbalance Liver Disorder Dataset Classification

  • Conference paper
  • First Online:
Recent Advances on Soft Computing and Data Mining (SCDM 2020)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 978))

Included in the following conference series:

Abstract

DNN (Deep neural network) has emerged as one of the standard methods to create a classification model. The most common issue affecting DNN performance is the class-imbalanced distribution dataset. This research designed two workflows for generating synthetic dataset using SMOTE algorithm, SDS-1, and SDS-2 dataset. We further investigated the optimal DNN parameters that generate the best optimum performance over those datasets. We used Indian Liver Patient Dataset (ILPD) from the oldest source, UCI Machine Learning Repository, with a total of 583 records, consist of 416 positives and 167 negatives data. We measured the DNN performance using sensitivity and F-score metric following the nature of the medical domain that mainly focused on identifying a particular disease. The experiment results revealed that DNN model with the learning rate of 1E-1, TanH activation function, Xavier weighting, the epoch of 40, and the hidden layers of 10, delivered the best sensitivity and F-score value, 98.40% and 99.18%, respectively. The results suggested that the workflow for generating the class-balanced dataset will leverage the DNN performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Asrani SK, Devarbhavi H, Eaton J, Kamath PS (2019) Burden of liver diseases in the world. J Hepatol 70:151–171

    Article  Google Scholar 

  2. World Health Organization (2018) World health statistics 2018: monitoring health for the SDGs. Sustainable Development Goals, Geneva

    Google Scholar 

  3. Patel OP, Tiwari A (2015) Liver disease diagnosis using quantum-based binary neural network learning algorithm. In: Proceedings of fourth international conference on soft computing for problem solving, advances in intelligent systems and computing, vol 336. Springer, New Delhi, pp 425—434

    Google Scholar 

  4. Abdar M, Yen NY, Hung JCS (2018) Improving the diagnosis of liver disease using multilayer perceptron neural network and boosted decision trees. J Med Biol Eng 38(6):953–965

    Article  Google Scholar 

  5. Wu CC et al (2019) Prediction of fatty liver disease using machine learning algorithms. Comput Methods Programs Biomed 170:23–29

    Article  Google Scholar 

  6. Hassan TM, Elmogy M, Sallam ES (2017) Diagnosis of focal liver diseases based on deep learning technique for ultrasound images. Arab J Sci Eng 42(8):3127–3140

    Article  Google Scholar 

  7. Das A, Rajendra Acharya U, Panda SS, Sabut S (2019) Deep learning based liver cancer detection using watershed transform and Gaussian mixture model techniques. Cogn Syst Res 54:165–175

    Article  Google Scholar 

  8. Lee T, Kim J, Uh Y, Lee H (2019) Deep neural network for estimating low density lipoprotein cholesterol. Clin Chim Acta 489:35–40

    Article  Google Scholar 

  9. Kannadasan K, Edla DR, Kuppili V (2018) Type 2 diabetes data classification using stacked autoencoders in deep neural networks. Clin Epidemiol Glob Health

    Google Scholar 

  10. Singaravel S, Suykens J, Geyer P (2018) Deep-learning neural-network architectures and methods: using component based models in building-design energy prediction. Adv Eng Inform 38:81–90

    Article  Google Scholar 

  11. Aung SWY, Khaing SS, Aung ST (2019) Multi-label land cover indices classification of satellite images using deep learning. In: ICBDL 2018: big data analysis and deep learning applications, vol 744. Springer, Singapore, pp 94–103

    Google Scholar 

  12. Chemali E, Kollmeyer P, Preindl M, Emadi A (2018) State-of-charge estimation of Li-ion batteries using deep neural networks: a machine learning approach. J Power Sour 400:242–255

    Article  Google Scholar 

  13. Bazrafkan S, Thavalengal S, Corcoran P (2018) An end to end deep neural network for iris segmentation in unconstrained scenarios. Neural Netw 106:79–95

    Article  Google Scholar 

  14. Zhang L, Zhang C, Gao R, Yang R, Song Q (2016) Using the SMOTE technique and hybrid features to predict the types of ion channel-targeted conotoxins. J Theoret Biol 403:75–84

    Article  MathSciNet  Google Scholar 

  15. Guo H, Zhou J, Wu C-A (2018) Imbalanced learning based on data-partition and SMOTE. Information 9:238–250

    Article  Google Scholar 

  16. Raghuwanshi BS, Shukla S (2019) SMOTE based class-specific extreme learning machine for imbalanced learning. Knowl-Based Syst (2019)

    Google Scholar 

  17. Maldonado S, Lopez J, Vairetti C (2019) An alternative SMOTE oversampling strategy for high-dimensional datasets. Appl Soft Comput 76:380–389

    Article  Google Scholar 

  18. Douzas G, Bacao F, Last F (2018) Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE. Inf Sci 465:1–20

    Article  Google Scholar 

  19. Goodfellow I et al (2016) Deep learning (Adaptive Computation and Machine Learning Series). The MIT Press

    Google Scholar 

Download references

Acknowledgments

The authors wish to thank Universitas YARSI for funding this research (No. 183/INT/UM/WRII/UY/VIII/2016).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nova Eka Diana .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Diana, N.E., Ahmad, A.B., Mahardika, Z.P. (2020). Investigating the Optimal Parameterization of Deep Neural Network and Synthetic Data Workflow for Imbalance Liver Disorder Dataset Classification. In: Ghazali, R., Nawi, N., Deris, M., Abawajy, J. (eds) Recent Advances on Soft Computing and Data Mining. SCDM 2020. Advances in Intelligent Systems and Computing, vol 978. Springer, Cham. https://doi.org/10.1007/978-3-030-36056-6_9

Download citation

Publish with us

Policies and ethics