Selection of Important Attributes for Medical Diagnosis Systems

Part of the book series: Lecture Notes in Computer Science (TRS, volume 4400)

Abstract

The success of machine learning algorithms usually depends on the quality of the dataset they operate on. For datasets containing noisy, inadequate, or irrelevant information, these algorithms may produce less accurate results. A common pre-processing step in data mining is therefore the selection of highly predictive attributes. In this case study, we select subsets of attributes from medical data using filter feature selection algorithms. To validate the algorithms, we induce decision rules from the selected subsets of attributes and compare classification accuracy on both training and test datasets. Additionally, the medical relevance of the selected attributes is checked with the help of domain experts.
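
As a rough illustration of what a filter feature selection step can look like, the sketch below ranks discrete attributes by information gain and keeps the top k. The abstract does not state which filter criteria the chapter uses, so information gain is only an assumed stand-in here. The function names (`entropy`, `information_gain`, `select_top_k`), the attribute names, and the four toy records are all hypothetical and are not taken from the chapter's medical data.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(rows, labels, attr):
    """Reduction in label entropy after splitting on one discrete attribute."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    remainder = sum(len(g) / len(labels) * entropy(g)
                    for g in groups.values())
    return entropy(labels) - remainder


def select_top_k(rows, labels, k):
    """Filter step: score each attribute independently of any classifier,
    then keep the k highest-scoring attributes."""
    attrs = list(rows[0].keys())
    ranked = sorted(attrs,
                    key=lambda a: information_gain(rows, labels, a),
                    reverse=True)
    return ranked[:k]


# Hypothetical toy records (assumption: NOT data from the chapter).
rows = [
    {"chest_pain": "yes", "smoker": "no", "ward": "A"},
    {"chest_pain": "yes", "smoker": "yes", "ward": "B"},
    {"chest_pain": "no", "smoker": "yes", "ward": "A"},
    {"chest_pain": "no", "smoker": "no", "ward": "B"},
]
labels = ["sick", "sick", "healthy", "healthy"]

selected = select_top_k(rows, labels, k=1)
print(selected)
```

In a validation setup like the one the abstract describes, the selected subset would then be passed to a rule induction step, and classification accuracy would be compared on training and test data. That part is omitted here because the abstract does not give enough detail to sketch it faithfully.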

Editor information

James F. Peters Andrzej Skowron Victor W. Marek Ewa Orłowska Roman Słowiński Wojciech Ziarko

Copyright information

© 2007 Springer Berlin Heidelberg

About this chapter

Cite this chapter

Ilczuk, G., Wakulicz-Deja, A. (2007). Selection of Important Attributes for Medical Diagnosis Systems. In: Peters, J.F., Skowron, A., Marek, V.W., Orłowska, E., Słowiński, R., Ziarko, W. (eds) Transactions on Rough Sets VII. Lecture Notes in Computer Science, vol 4400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71663-1_5

  • DOI: https://doi.org/10.1007/978-3-540-71663-1_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71662-4

  • Online ISBN: 978-3-540-71663-1

  • eBook Packages: Computer Science, Computer Science (R0)
