Using Fuzzy Patterns for Gene Selection and Data Reduction on Microarray Data

  • Conference paper
Intelligent Data Engineering and Automated Learning – IDEAL 2006 (IDEAL 2006)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4224)

Abstract

The advent of DNA microarray technology has supplied a large volume of data to fields such as machine learning and data mining. Intelligent support is essential for managing and interpreting this information. A well-known constraint specific to microarray data is that the number of genes is large compared with the small number of available experiments. In this context, the ability to design methods capable of overcoming the current limitations of state-of-the-art algorithms is crucial to the development of successful applications. In this paper we demonstrate how a supervised fuzzy pattern algorithm can be used to perform DNA microarray data reduction over real data. Our method can be used to gain biologically significant insights into meaningful genes and thereby improve on previously successful techniques. Experimental results on acute myeloid leukemia diagnosis show the effectiveness of the proposed approach.

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Díaz, F., Fdez-Riverola, F., Glez-Peña, D., Corchado, J.M. (2006). Using Fuzzy Patterns for Gene Selection and Data Reduction on Microarray Data. In: Corchado, E., Yin, H., Botti, V., Fyfe, C. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2006. IDEAL 2006. Lecture Notes in Computer Science, vol 4224. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11875581_129

  • DOI: https://doi.org/10.1007/11875581_129

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-45485-4

  • Online ISBN: 978-3-540-45487-8

  • eBook Packages: Computer Science, Computer Science (R0)
