Skip to main content

Medical Diagnosis for Incomplete and Imbalanced Data

  • Conference paper
  • First Online:
Intelligent Data Engineering and Analytics

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 266))

Abstract

The process of identifying a disease that a patient is affected by with the help of signs, test reports, and symptoms is known as diagnosis. Deep learning plays a major role in automated diagnosis in the medical field. The efficiency of the automated diagnosis system depends on how well the data provided for training is and how it is used to train the system. The data is subject to data quality concerns like its accuracy, completeness, consistency, and data balance. Additionally, significantly, in reality, clinical data is created solely from many useful and important attributes, rather than the complete patient data. But, in the real world, data is of poor quality due to various reasons, for example, data validity, respectability, fulfillment, exactness, and so on. Specifically, in the medical domain also, the data is imbalanced and incomplete. So, in this project, we propose a multi-instance neural network to predict the disease based on the patients’ existing and reliable data. The proposed approach is planned to be tested with the imbalanced dataset named the Western Medicine (WM) and Disease Symptom Prediction. The proposed multi-instance neural network architecture predicts the disease with high accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ghahramani, Z., Jordan, M.I.: Supervised learning from incomplete data via EM approach. Adv. Neural Inf. Process. Syst. (2016)

    Google Scholar 

  2. Fung, G., et al.: Multiple instances learning for computer aided diagnosis. Adv. Neural Inf. Process. Syst. 19 (2007): 425

    Google Scholar 

  3. Belarouci, S., Chikh, M.A.: Medical imbalanced data classification. Adv. Sci. Technol. Eng. Syst. J. (2017)

    Google Scholar 

  4. Mehrabani-Zeinabad, K., et al.: An efficient and effective model to handle missing data in classification. BioMed Res. Int. (2020)

    Google Scholar 

  5. Lin, W.-C., et al.: Clustering-based under sampling in class-imbalanced data. Inf. Sci. 409:17–26 (2017)

    Google Scholar 

  6. D’Addabbo, A., Maglietta, R.: Parallel selective sampling method for imbalanced and large data classification. Pattern Recogn. Lett. 62, 61–67 (2015)

    Article  Google Scholar 

  7. Wang, S., Yao, X.: Diversity analysis on imbalanced data sets by using ensemble models. In: 2009 IEEE Symposium on Computational Intelligence and Data Mining. IEEE (2009)

    Google Scholar 

  8. Yan, Y., et al.: Deep multi-instance learning with dynamic pooling. In: Asian Conference on Machine Learning. PMLR (2018)

    Google Scholar 

  9. Zeyuan, W., et al.: Attention-Based multi-instance neural network for medical diagnosis from incomplete and low-quality data

    Google Scholar 

  10. Ilse, M., Tomczak, J.M., Welling, M.: Attention-based deep multiple instance learning. arXiv preprint arXiv:1802.04712 (2018)

  11. Khalilia, M., et al.: Predicting disease risks from highly imbalanced data using random forest. 29 July 2011

    Google Scholar 

  12. Fotouhi, S., et al.: A comprehensive data level analysis for cancer diagnosis on imbalanced data. J. Biomed. Inf. 90 (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sribhashyam, S., Koganti, S., Vineela, M.V., Kalyani, G. (2022). Medical Diagnosis for Incomplete and Imbalanced Data. In: Satapathy, S.C., Peer, P., Tang, J., Bhateja, V., Ghosh, A. (eds) Intelligent Data Engineering and Analytics. Smart Innovation, Systems and Technologies, vol 266. Springer, Singapore. https://doi.org/10.1007/978-981-16-6624-7_49

Download citation

Publish with us

Policies and ethics