A Comparative Study of Two Novel Predictor Set Scoring Methods

Ooi, Chia Huey; Chetty, Madhu

doi:10.1007/11508069_56

A Comparative Study of Two Novel Predictor Set Scoring Methods

Chia Huey Ooi¹⁹ &
Madhu Chetty¹⁹

Conference paper

1306 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3578))

Abstract

Due to the large number of genes measured in a typical microarray dataset, feature selection plays an essential role in tumor classification. In turn, relevance and redundancy are key components in determining the optimal predictor set. However, a third component – the relative weights given to the first two also assumes an equal, if not greater importance in feature selection. Based on this third component, we developed two novel feature selection methods capable of producing high, unbiased classification accuracy in multiclass microarray dataset. In an in-depth analysis comparing the two methods, the optimal values of the relative weights are also estimated.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hall, M.A., Smith, L.A.: Practical feature subset selection for machine learning. In: McDonald, C. (ed.) Proc. of the 21st Australasian Computer Science Conference, pp. 181–191. Springer, Singapore (1998)
Google Scholar
Ding, C., Peng, H.: Minimum Redundancy Feature Selection from Microarray Gene Expression Data. In: Proc. 2nd IEEE Computational Systems Bioinformatics Conference, pp. 523–529. IEEE Computer Society, Los Alamitos (2003)
Google Scholar
Knijnenburg, T.A.: Selecting relevant and non-redundant features in microarray classification applications. M.Sc. Thesis. Faculty of Electrical Engineering, Mathematics, and Computer Science (EEMCS) of the Delft University of Technology (2004), http://ict.ewi.tudelft.nl/pub/marcel/Knij05b.pdf
Dudoit, S., Fridlyand, J., Speed, T.: Comparison of discrimination methods for the classification of tumors using gene expression data. JASA 97, 77–87 (2002)
MATH MathSciNet Google Scholar
Yu, L., Liu, H.: Efficiently Handling Feature Redundancy in High-Dimensional Data. In: Domingos, P., Faloutsos, C., Senator, T., Kargupta, H., Getoor, L. (eds.) Proc. of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 685–690. ACM Press, New York (2003)
Chapter Google Scholar
Ambroise, C., McLachlan, G.J.: Selection bias in gene extraction on the basis of microarray gene-expression data. Proc. Natl. Acad. Sci. 99, 6562–6566 (2002)
Article MATH Google Scholar
Platt, J.C., Cristianini, N., Shawe-Taylor, J.: Large Margin DAGs for Multiclass Classification. In: Advances in Neural Information Processing Systems (NIPS), vol. 12, pp. 547–553 (2000)
Google Scholar
Ramaswamy, S., Tamayo, P., Rifkin, R., Mukherjee, S., Yeang, C.H., Angelo, M., Ladd, C., Reich, M., Latulippe, E., Mesirov, J.P., Poggio, T., Gerald, W., Loda, M., Lander, E.S., Golub, T.R.: Multi-class cancer diagnosis using tumor gene expression signatures. Proc. Natl. Acad. Sci. 98, 15149–15154 (2001)
Article Google Scholar
Rifkin, R., Mukherjee, S., Tamayo, P., Ramaswamy, S., Yeang, C.H., Angelo, M., Reich, M., Poggio, T., Lander, E.S., Golub, T.R., Mesirov, J.P.: An Analytical Method for Multiclass Molecular Cancer Classification. SIAM Review 45(4), 706–723 (2003)
Article MATH MathSciNet Google Scholar
Linder, R., Dew, D., Sudhoff, H., Theegarten, D., Remberger, K., Poppl, S.J., Wagner, M.: The Subsequent Artificial Neural Network (SANN) Approach Might Bring More Classificatory Power To ANN-based DNA Microarray Analyses. Bioinformatics Advance Access. Published on July 29, Bioinformatics (2004), doi:10.1093/bioinformatics/bth441
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing and Information Technology, Monash University, Churchill, VIC 3842, Australia
Chia Huey Ooi & Madhu Chetty

Authors

Chia Huey Ooi
View author publications
You can also search for this author in PubMed Google Scholar
Madhu Chetty
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electrical Engineering, University of Queensland, 4072, Australia
Marcus Gallagher
, POB 30031, FL 32503-1031, Pensacola
James P. Hogan
Faculty of Information Technology, Queensland University of Technology, Box 2434, Q 4001, Brisbane, Australia
Frederic Maire

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ooi, C.H., Chetty, M. (2005). A Comparative Study of Two Novel Predictor Set Scoring Methods. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_56

Download citation

DOI: https://doi.org/10.1007/11508069_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26972-4
Online ISBN: 978-3-540-31693-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics