Abstract
This paper presents research on discovering a similarity relation in multidimensional bioinformatic data. In particular, it discusses the use of a Rules-based Similarity model to define similarity in microarray datasets. The Rules-based Similarity model is a rough set extension of the feature contrast model proposed by Amos Tversky. Its main aim is to achieve high accuracy in a case-based classification task while simulating the way humans perceive similar objects. The similarity relation derived from the Rules-based Similarity model is suitable for gene expression profiling, because the rules naturally indicate the groups of genes whose activation or inactivation is relevant in the considered context. Experiments conducted on several microarray datasets show that this model of similarity can capture higher-level dependencies in data and may be used successfully in cases where the standard distance-based approach turns out to be ineffective.
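The feature contrast idea behind the model can be illustrated with a minimal sketch. It is not the paper's algorithm: it only shows Tversky's general contrast form, in which similarity grows with shared features and shrinks with features unique to either object. Treating "features" as the names of rules that an expression profile satisfies is an assumption made here for illustration, and the rule names, gene names, thresholds, and weights (theta, alpha, beta) are all hypothetical.

```python
from typing import Callable, Dict, FrozenSet

# A profile maps (hypothetical) gene names to expression levels.
Profile = Dict[str, float]
# A rule is modelled here as a simple predicate over a profile (an assumption).
Rule = Callable[[Profile], bool]


def matched_rules(profile: Profile, rules: Dict[str, Rule]) -> FrozenSet[str]:
    """Return the names of all rules that the profile satisfies."""
    return frozenset(name for name, rule in rules.items() if rule(profile))


def contrast_similarity(
    a: FrozenSet[str],
    b: FrozenSet[str],
    theta: float = 1.0,
    alpha: float = 0.5,
    beta: float = 0.5,
) -> float:
    """Tversky-style contrast: reward common features, penalise distinctive ones.

    Feature salience is taken to be plain set cardinality, which is a
    simplification chosen for this sketch.
    """
    common = len(a & b)
    only_a = len(a - b)
    only_b = len(b - a)
    return theta * common - alpha * only_a - beta * only_b


# Hypothetical rules over three made-up genes.
rules: Dict[str, Rule] = {
    "g1_high": lambda p: p["g1"] > 1.0,
    "g2_low": lambda p: p["g2"] < 0.2,
    "g3_high": lambda p: p["g3"] > 2.0,
}

x: Profile = {"g1": 1.5, "g2": 0.1, "g3": 0.3}
y: Profile = {"g1": 1.2, "g2": 0.9, "g3": 0.4}

fx = matched_rules(x, rules)
fy = matched_rules(y, rules)
score = contrast_similarity(fx, fy)
print(sorted(fx), sorted(fy), score)
```

Because the comparison is made on satisfied rules rather than on raw expression values, two profiles can be judged similar even when their Euclidean distance is large, which is the kind of situation the abstract contrasts with the distance-based approach.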
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Janusz, A. (2010). Discovering Rules-Based Similarity in Microarray Data. In: Hüllermeier, E., Kruse, R., Hoffmann, F. (eds) Computational Intelligence for Knowledge-Based Systems Design. IPMU 2010. Lecture Notes in Computer Science, vol 6178. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14049-5_6
DOI: https://doi.org/10.1007/978-3-642-14049-5_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14048-8
Online ISBN: 978-3-642-14049-5
eBook Packages: Computer Science, Computer Science (R0)