Skip to main content

Mining the Most Interesting Patterns from Multiple Phenotypes Medical Data

  • Conference paper
Rough Sets and Current Trends in Computing (RSCTC 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4259))

Included in the following conference series:

  • 1261 Accesses

Abstract

Mining the most interesting patterns from multiple phenotypes medical data poses a great challenge for previous work, which only focuses on bi-phenotypes (such as abnormal vs. normal) medical data. Association rule mining can be applied to analyze such dataset, whereas most rules generated are either redundancy or no sense. In this paper, we define two interesting patterns, namely VP (an acronym for “Vital Pattern”) and PP (an acronym for “Protect Pattern”), based on a statistical metric. We also propose a new algorithm called MVP that is specially designed to discover such two patterns from multiple phenotypes medical data. The algorithm generates useful rules for medical researchers, from which a clearly causal graph can be induced. The experiment results demonstrate that the proposed method enables the user to focus on fewer rules and assures that the survival rules are all interesting from the viewpoint of medical domain. The classifier build on the rules generated by our method outperforms existing classifiers.

The work was supported by the “fifteen” tackle key problem project of National Science and Technology Department under grant no.2004BA721A05.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD 1993 Conference, pp. 207–216 (1993)

    Google Scholar 

  2. Decker, K.M., Focardi, S.: Technology overview: A report on data mining, technical report cscs tr-95-02. In Swiss Scientific Computing Center (1995)

    Google Scholar 

  3. Li, J., Wong, L.: Using rules to analyse bio-medical data: A comparison between C4.5 and PCL. In: Dong, G., Tang, C., Wang, W. (eds.) WAIM 2003. LNCS, vol. 2762, pp. 254–265. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  4. Zhou, Z., Jiang, Y.: Medical diagnosis with c4.5 rule preceded by artificial neural network ensemble. IEEE Transactions on Information Technology in Biomedicine 7, 37–42 (2003)

    Article  Google Scholar 

  5. Brossette, J.S.E., Sprague, A.P., Moser, S.A.: Associarion rules and data mining in hospital infection control and public health surveillance. Journal of American Medical Informatics Association 5(4), 373–381 (1998)

    Google Scholar 

  6. Ohsaki, M., Sato, Y., Yokoi, H., Yamaguchi, T.: A rule discovery support system for sequential medical data in the case study of a chronic hepatitis dataset. In: Proc of the ECML/PKDD 2003 Discovery Challenge Workshop, pp. 154–165 (2003)

    Google Scholar 

  7. Liu, B., Hsu, W., Ma, Y.: Pruning and summarizing the discovered associations. In: Proc. of 5th Intl. Conf. on Knowledge Discovery and Data Mining (1999)

    Google Scholar 

  8. Kamber, M., Shinghal, R.: Evalusting the interestingness of characteristic rules. In: Proc. of 2nd Intl. Conf. on Knowledge Discovery and Data Mining, pp. 263–266 (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yin, Y., Zhang, B., Zhao, Y., Wang, G. (2006). Mining the Most Interesting Patterns from Multiple Phenotypes Medical Data. In: Greco, S., et al. Rough Sets and Current Trends in Computing. RSCTC 2006. Lecture Notes in Computer Science(), vol 4259. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11908029_72

Download citation

  • DOI: https://doi.org/10.1007/11908029_72

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-47693-1

  • Online ISBN: 978-3-540-49842-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics