Abstract
Prediction of gene function from expression profiles is an intriguing problem that has been attempted with both unsupervised clustering and supervised learning methods. By the incorporation of prior knowledge concerning gene function, supervised methods avoid some of the problems with clustering. However, even supervised methods ignore the fact that the functional classes associated with genes are typically organized in an ontology. Hence, we introduce a new supervised method for learning in such an ontology. It is tested on both an artificial data set and a data set containing measurements from human fibroblast cells. We also give an approach for measuring the classification performance in an ontology.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
The Gene Ontology Consortium. Gene ontology: tool for the unification of biology. Nature Genetics, 25(1):25–29, 2000.
M. P. S. Brown, W. N. Grundy, D. Lin, N. Cristianini, C. W. Sugnet, T. S. Furey, M. Ares, Jr., and D. Haussler. Knowledge-based analysis of microarray gene expression data by using support vector machines. PNAS, 97(1):262–267, 2000.
Peter Clark and Tim Niblett. The CN2 induction algorithm. Machine Learning, 3(4):261–283, 1989.
Thomas G. Dietterich. Ensemble methods in machine learning. In Proc. of MCS-2000, LNCS 1857, pp. 1–15.
M. B. Eisen, P. T. Spellman, P. O. Brown, and D. Botstein. Cluster analysis and display of genome-wide expression patterns. PNAS, 95:14863–14868, 1998.
T. R. Hvidsten, J. Komorowski, A. K. Sandvik, and A. Lægreid. Predicting gene function from gene expressions and ontologies. In Proc. of PSB-2001, pp. 299–310.
W. R. Iyer, M. B. Eisen, D. T. Ross, G. Schuler, T. Moore, J. C. F. Lee, J. M. Trent, L. M. Staudt, J. Hudson, M. S. Boguski, D. Lashkari, D. Shalon, D. Botstein, and P..O. Brown. The transcriptional program in the response of human fibroblasts to serum. Science, 283:83–87, 1999.
R. S. Michalski. A theory and methodology of inductive learning. In Michalski, Carbonell, and Mitchell (eds), Machine Learning: An Artificial Intelligence Approach, vol. 1, pp. 83–129. Morgan Kaufmann, 1983.
H. Shatkay, S. Edwards, W. J. Wilbur, and M. Boguski. Genes, themes and microarrays: Using information retrieval for large-scale gene analysis. In Proc. of ISMB-2000, pp. 317–328.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Midelfart, H., Lægreid, A., Komorowski, J. (2001). Classification of Gene Expression Data in an Ontology. In: Crespo, J., Maojo, V., Martin, F. (eds) Medical Data Analysis. ISMDA 2001. Lecture Notes in Computer Science, vol 2199. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45497-7_28
Download citation
DOI: https://doi.org/10.1007/3-540-45497-7_28
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42734-6
Online ISBN: 978-3-540-45497-7
eBook Packages: Springer Book Archive