Skip to main content

Unsupervised Gene Selection and Clustering Using Simulated Annealing

  • Conference paper
Book cover Fuzzy Logic and Applications (WILF 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3849))

Included in the following conference series:

Abstract

When applied to genomic data, many popular unsupervised explorative data analysis tools based on clustering algorithms often fail due to their small cardinality and high dimensionality. In this paper we propose a wrapper method for gene selection based on simulated annealing and unsupervised clustering. The proposed approach, even if computationally intensive, permits to select the most relevant features (genes), and to rank their relevance, allowing to improve the results of clustering algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)

    MATH  Google Scholar 

  2. Bickel, D.R.: Robust cluster analysis of microarray gene expression data with the number of cluster determined biologically. Bioinformatics 19(7), 818–824 (2003)

    Article  Google Scholar 

  3. Blum, A., Langley, P.: Selection of Relevant Features and Examples in Machine Learning. Artificial Intelligence 97(1-2), 245–271 (1997)

    Article  MATH  MathSciNet  Google Scholar 

  4. Bolshakova, N., Azuaje, F.: Cluster validation techniques for genome expression data Source. Signal Processing 83, 825–833 (2003)

    Article  MATH  Google Scholar 

  5. Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. Wiley, New York (1973)

    MATH  Google Scholar 

  6. Dy, J.G., Brodley, C.E., Kak, A., Broderick, L.S., Aisen, A.M.: Unsupervised Feature Selection Applied to Content-Based Retrieval of Lung Images. IEEE Trans. Pattern Analysis and Machine Intelligence 25(3), 373–378 (2003)

    Article  Google Scholar 

  7. Golub, T., Slonim, D., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J., Coller, H., Loh, M., Downing, J., Caligiuri, M., Bloomfield, C., Lander, E.: Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)

    Article  Google Scholar 

  8. Jornsten, R., Yu, B.: Simultaneous gene clustering and subset selection for sample classification via MDL. Bioinformatics 19(8), 1100–1109 (2003)

    Article  Google Scholar 

  9. Kirkpatrick, S., Gelatt, C.D., Vecchi, M.P.: Optimization by simulated annealing. Science 220, 661–680 (1983)

    Article  MathSciNet  Google Scholar 

  10. Kohavi, R., John, G.: Wrappers for Feature Subset Selection. Artificial Intelligence 97(1-2), 273–324 (1997)

    Article  MATH  Google Scholar 

  11. Law, M.H., Figueiredo, M.A.T., Jain, A.K.: Simultaneous Feature Selection in and Clustering Using Mixture Models. IEEE Trans. Pattern Analysis and Machine Intelligence 28(9) (2004)

    Google Scholar 

  12. Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N., Teller, A.H., Teller, E.: Equation of state calculations for fast computing machines. Journal of Chemical Physics 21, 1087–1092 (1953)

    Article  Google Scholar 

  13. Mitra, P., Murthy, C.A.: Unsupervised Feature Selection Using Feature Similarity. IEEE Trans. Pattern Analysis and Machine Intelligence 24(3), 301–312 (2002)

    Article  Google Scholar 

  14. Mumey, B., Showe, L., Showe, M.: A Combinatorial Approach to Clustering Gene Expession Data. Bioinformatics (2003)

    Google Scholar 

  15. Wang, Q., Shen, Y., Zhang, Y., Zhang, J.: A quantitative method for evaluating the performances of hyperspectral image fusion. IEEE Trans. Instrumentation and Measurement 52, 1041–1047 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Filippone, M., Masulli, F., Rovetta, S. (2006). Unsupervised Gene Selection and Clustering Using Simulated Annealing. In: Bloch, I., Petrosino, A., Tettamanzi, A.G.B. (eds) Fuzzy Logic and Applications. WILF 2005. Lecture Notes in Computer Science(), vol 3849. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11676935_28

Download citation

  • DOI: https://doi.org/10.1007/11676935_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32529-1

  • Online ISBN: 978-3-540-32530-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics