Abstract
Recognizing patient samples with gene expression profiles is used to cancer diagnosis and therapy. In the high dimensional, huge redundant and noisy gene expression data the cancerogenic factor’s locality is studied. Using gene feature transformation a relative space to a cancer is built and a least spread space with least energy to the cancer is extracted. And it is proven that the cancer is able to be recognized in the least spread space and a cancer classification with least spread space (CCLSS) is proposed. In the Leukemia dataset and Colon dataset the correlation between the recognition rate and the rank of least spread space is explored, then the optimal least spread spaces to AML/ALL and to tumor colon tissue (TCT)/normal colon tissue (NCT) are extracted. At last using LOOCV the experiments with different classification algorithms are conducted and the results show CCLSS makes better precision than traditional classification algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kuramochi, M., Karypis, G.: Gene classification using expression profiles:a feasibility study. International Journal on Artificial Intelligence Tools 14(4), 641–660 (2005)
Lu, X.G., et al.: Using Most Similarity Tree Based Clustering to Select the Top Most Discriminating Genes for Cancer Detection. In: Rutkowski, L., et al. (eds.) ICAISC 2006. LNCS (LNAI), vol. 4029, pp. 931–940. Springer, Heidelberg (2006)
Parsons, L., Haque, E., Liu, H.: Subspace clustering for high dimensional data: a review. SIGKDD Explorations 6(1), 90–105 (2004)
Kasabov, N.: Evolving Connectionist Systems, Methods and Applications in Bioinformatics. Brain Study and Intelligent Machines. Springer, Heidelberg (2002)
Shipp, M.A., et al.: Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning. Nature Medicine 8, 68–74 (2002)
Golub, T.R., et al.: Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Veer, L.J.V.T., et al.: Gene expression profiling predicts clinical outcome of breast cancer. Nature 415, 530–536 (2002)
Cho, S.B., Won, H.H.: Machine Learning in DNA Microarray Analysis for Cancer Classification. In: Proc. of the First Asia-Pacific Bioinformatics Conference(APBC 2003), pp. 189–198 (2003)
Conde, L., et al.: Unsupervised Reduction of the Dimensionality Followed by Supervised Learning with a Perceptron Improves the Classification of Conditions in DNA Microarray Gene Expression Data. In: Bourlard, H., et al. (eds.) Neural Networks for Signal Processing XII, pp. 77–86. IEEE Press, New York (2002)
Raychaudhur, S., Stuar, J.M., Altma, R.B.: Principal Components Analysis to Summarize Microarray Experiments: Application to Sporulation Time Series. In: Pacific Symposium on Biocomputing Honolulu, Hawaii, pp. 452–463 (2000)
Li, Y.X., Ruan, X.G.: Feature Selection for Cancer Classification Based on Support Vector machine. Journal of Computer Research and Development 42(10), 1796–1801 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Lu, X., Lin, Y., Wang, H., Zhou, S., Li, X. (2007). A Novel Relative Space Based Gene Feature Extraction and Cancer Recognition. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_77
Download citation
DOI: https://doi.org/10.1007/978-3-540-71701-0_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71700-3
Online ISBN: 978-3-540-71701-0
eBook Packages: Computer ScienceComputer Science (R0)