Skip to main content

Mining for Mutually Exclusive Gene Expressions

  • Conference paper
  • 2122 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6040))

Abstract

Association rules mining is a popular task that involves the discovery of co-occurences of items in transaction databases. Several extensions of the traditional association rules mining model have been proposed so far, however, the problem of mining for mutually exclusive items has not been investigated. Such information could be useful in various cases in many application domains like bioinformatics (e.g. when the expression of a gene excludes the expression of another) In this paper, we address the problem of mining pairs and triples of genes, such that the presence of one excludes the presence of the other. First, we provide a concise review of the literature, then we define this problem, we propose a probability-based evaluation metric, and finally a mining algorithm that we apply on gene expression data gaining new biological insights.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules Between Sets of Items in Large Databases. In: Proceedings of the ACM SIGMOD Conference on Management of Data, pp. 207–216 (1993)

    Google Scholar 

  2. Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules in Large Databases. In: Proceedings of the International Conference on Very Large Databases, pp. 487–499 (1994)

    Google Scholar 

  3. Alves, A., Zagoruiko, N., Okun, O., Kutnenko, O., Borisova, I.: Predictive Analysis of Gene Expression Data from Human SAGE Libraries. In: Proceedings of the ECML/PKDD Discovery Challenge Workshop, Porto, Portugal, pp. 60–71 (2005)

    Google Scholar 

  4. Becquet, C., Blachon, S., Jeudy, B., Boulicaut, J.F., Gandrillon, O.: Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data. Genome Biology 3(12) (2002)

    Google Scholar 

  5. Berberidis, C., Tzanis, G., Vlahavas, I.: Mining for Contiguous Frequent Itemsets in Transaction Databases. In: Proceedings of the IEEE 3rd International Workshop on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (2005)

    Google Scholar 

  6. Chen, X., Petrounias, I.: Discovering Temporal Association Rules: Algorithms, Language and System. In: Proceedings of the 16th International Conference on Data Engineering (2000)

    Google Scholar 

  7. Gamberoni, G., Storari, S.: Supervised and unsupervised learning techniques for profiling SAGE results. In: Proceedings of the ECML/PKDD Discovery Challenge Workshop, Pisa, Italy, pp. 121–126 (2004)

    Google Scholar 

  8. Gandrillon, O.: Guide to the gene expression data. In: Proceedings of the ECML/PKDD Discovery Challenge Workshop, Pisa, Italy, pp. 116–120 (2004)

    Google Scholar 

  9. Gasmi, G., Hamrouni, T., Abdelhak, S., Ben Yahia, S., Mephu Nguifo, E.: Extracting Generic Basis of Association Rules from SAGE Data. In: Proceedings of the ECML/PKDD Discovery Challenge Workshop, Porto, Portugal, pp. 84–89 (2005)

    Google Scholar 

  10. Han, J., Fu, Y.: Discovery of Multiple-Level Association Rules from Large Databases. In: Proceedings of the 21st International Conference on Very Large Databases, pp. 420–431 (1995)

    Google Scholar 

  11. Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 1–12 (2000)

    Google Scholar 

  12. Koperski, K., Han, J.: Discovery of Spatial Association Rules in Geographic Information Databases. In: Proceedings of the 4th International Symposium on Large Spatial Databases, pp. 47–66 (1995)

    Google Scholar 

  13. Lin, H.-T.: Li. L. “Analysis of SAGE Results with Combined Learning Techniques”. In: Proceedings of the ECML/PKDD Discovery Challenge Workshop, Porto, Portugal, pp. 102–113 (2005)

    Google Scholar 

  14. Martinez, R., Christen, R., Pasquier, C., Pasquier, N.: Exploratory Analysis of Cancer SAGE Data. In: Proceedings of the ECML/PKDD Discovery Challenge Workshop, Porto, Portugal, pp. 72–77 (2005)

    Google Scholar 

  15. Ng, R.T., Sander, J., Sleumer, M.C.: Hierarchical cluster analysis of SAGE data for cancer profiling. In: Proceedings of Workshop on Data Mining in Bioinformatics, pp. 65–72 (2001)

    Google Scholar 

  16. Rioult, F.: Mining strong emerging patterns in wide SAGE data. In: Proceedings of the ECML/PKDD Discovery Challenge Workshop, Pisa, Italy, pp. 484–487 (2004)

    Google Scholar 

  17. Savasere, A., Omiecinski, E., Navathe, S.B.: Mining for Strong Negative Associations in a Large Database of Customer Transactions. In: Proceedings of the 14th International Conference on Data Engineering, pp. 494–502 (1998)

    Google Scholar 

  18. Srikant, R., Agrawal, R.: Mining Generalized Association Rules. In: Proceedings of the 21st VLDB Conference, pp. 407–419 (1995)

    Google Scholar 

  19. Teng, C.M.: Learning form Dissociations. In: Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery, pp. 11–20 (2002)

    Google Scholar 

  20. Thomas, S., Sarawagi, S.: Mining Generalized Association Rules and Sequential Patterns Using SQL Queries. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, pp. 344–348 (1998)

    Google Scholar 

  21. Tung, A.K.H., Lu, H., Han, J., Feng, L.: Efficient Mining of Intertransaction Association Rules. IEEE Transactions on Knowledge and Data Engineering 15(1), 43–56 (2003)

    Article  Google Scholar 

  22. Tzanis, G., Berberidis, C.: Mining for Mutually Exclusive Items in Transaction Databases. International Journal of Data Warehousing and Mining 3(3) (2007)

    Google Scholar 

  23. Tzanis, G., Berberidis, C., Vlahavas, I.: On the Discovery of Mutually Exclusive Items in a Market Basket Database. In: Proceedings of the 2nd ADBIS Workshop on Data Mining and Knowledge Discovery, Thessaloniki, Greece, September 6 (2006)

    Google Scholar 

  24. Tzanis, G., Vlahavas, I.: Mining High Quality Clusters of SAGE Data. In: Proceedings of the 2nd VLDB Workshop on Data Mining in Bioinformatics, Vienna, Austria (2007)

    Google Scholar 

  25. Tzanis, G., Vlahavas, I.: Accurate Classification of SAGE Data Based on Frequent Patterns of Gene Expression. In: Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2007), Patras, Greece, October 29-31. IEEE, Los Alamitos (2007)

    Google Scholar 

  26. Velculescu, V.E., Zhang, L., Vogelstein, B., Kinzler, K.W.: Serial analysis of gene expression. Science 270(5235), 484–487 (1995)

    Article  Google Scholar 

  27. Wu, X., Zhang, C., Zhang, S.: Efficient Mining of both Positive and Negative Association Rules. ACM Transactions on Information Systems 22(3), 381–405 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tzanis, G., Vlahavas, I. (2010). Mining for Mutually Exclusive Gene Expressions. In: Konstantopoulos, S., Perantonis, S., Karkaletsis, V., Spyropoulos, C.D., Vouros, G. (eds) Artificial Intelligence: Theories, Models and Applications. SETN 2010. Lecture Notes in Computer Science(), vol 6040. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12842-4_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12842-4_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12841-7

  • Online ISBN: 978-3-642-12842-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics