Skip to main content

Hierarchical Classification with Dynamic-Threshold SVM Ensemble for Gene Function Prediction

  • Conference paper
Advanced Data Mining and Applications (ADMA 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6441))

Included in the following conference series:

  • 3026 Accesses

Abstract

The paper proposes a novel hierarchical classification approach with dynamic-threshold SVM ensemble. At training phrase, hierarchical structure is explored to select suit positive and negative examples as training set in order to obtain better SVM classifiers. When predicting an unseen example, it is classified for all the label classes in a top-down way in hierarchical structure. Particulary, two strategies are proposed to determine dynamic prediction threshold for different label class, with hierarchical structure being utilized again. In four genomic data sets, experiments show that the selection policies of training set outperform existing two ones and two strategies of dynamic prediction threshold achieve better performance than the fixed thresholds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ashburner, M., Ball, C., Blake, J., Botstein, D.: Gene ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000)

    Article  Google Scholar 

  2. Barbedo, J.G.A., Lopes, A.: Automatic genre classification of musical signals. In: EURASIP Journal on Advances in Signal Processing 2007 (2007)

    Google Scholar 

  3. Barutcuoglu, Z., Schapire, R.E., Troyanskaya, O.G.: Hierarchical multi-label prediction of gene function. Bioinformatics 22(7), 830–836 (2006)

    Article  Google Scholar 

  4. Bradley, A.P.: Use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 1145–1159 (1997)

    Article  Google Scholar 

  5. Burred, J.J., Lerch, A.: A hierarchical approach to automatic musical genre classification. In: Proc. Of the 6 th Int. Conf. on Digital Audio Effects, pp. 8–11 (2003)

    Google Scholar 

  6. Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, SIGIR, pp. 78–87. ACM, Washington (2004)

    Google Scholar 

  7. Clare, A.: Machine learning and data mining for yeast functional genomics. Ph.D. thesis, Department of Computer Science University of Wales Aberystwyth (2003)

    Google Scholar 

  8. Daphne, K., Mehran, S.: Hierarchically classifying documents using very few words. In: ICML 1997: Proceedings of the Fourteenth International Conference on Machine Learning, pp. 170–178. Morgan Kaufmann Publishers Inc., San Francisco (1997)

    Google Scholar 

  9. Guan, Y., Myers, C.L., Hess, D.C., Barutcuoglu, Z., Caudy, A.A., Troyanskaya, O.G.: Predicting gene function in a hierarchical context with an ensemble of classifiers. Genome Biology 9, S3 (2008)

    Google Scholar 

  10. Kiritchenko, S., Matwin, S., Nock, R., Famili, A.F.: Learning and evaluation in the presence of class hierarchies: Application to text categorization. In: Lamontagne, L., Marchand, M. (eds.) Canadian AI 2006. LNCS (LNAI), vol. 4013, pp. 397–408. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  11. Lanckriet, G., Deng, M., Cristianini, M., Jordan, M., Noble, W.: Kernel-based data fusion and its application to protein function prediction in yeast. In: Pac. Symp. Biocomput., pp. 300–311 (2004)

    Google Scholar 

  12. Mewes, H.W., Heumann, K., Kaps, A., Mayer, K., Pfeiffer, F., Stocker, S., Frishman, D.: Mips:a database for genomes and protein sequences. Nucleic Acids Res. 30(1), 31–34 (2002)

    Article  Google Scholar 

  13. Rousu, J., Saunders, C., Szedmak, S., Shawe-Taylor, J.: Kernel-based learning of hierarchical multilabel classification models? Journal of Machine Learning Research 7, 1601–1626 (2006)

    MathSciNet  MATH  Google Scholar 

  14. Tsoumakas, G., Katakis, I.: Multi-label classification: An overview. Int. J. Data Warehousing and Mining 2007, 1–13 (2007)

    Google Scholar 

  15. Vens, C., Struyf, J., Schietgat, L., Dzeroski, S., Blockeel, H.: Decision trees for hierarchical multi-label classification. Achine Learning 73(2), 85–214 (2008)

    Google Scholar 

  16. Xiao, Z., Dellandrea, E., Dou, W., Chen, L.: Hierarchical classification of emotional speech. Tech. rep., LIRIS UMR 5205 CNRS/INSA de Lyon/Universite Claude Bernard Lyon 1/Universit Lumiere Lyon 2/Ecole Centrale de Lyon (2007)

    Google Scholar 

  17. Zhao, X.M., Wang, Y., Chen, L., Aihara, K.: Gene function prediction using labeled and unlabeled data. BMC Bioinformatics 9(1), 57 (2008)

    Article  Google Scholar 

  18. Zhao, X., Li, X., Chen, L., Aihara, K.: Protein classification with imbalanced data. Proteins: Structure, Function, and Bioinformatics 70(4), 1125–1132 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, Y., Li, Z., Hu, X., Liu, J. (2010). Hierarchical Classification with Dynamic-Threshold SVM Ensemble for Gene Function Prediction. In: Cao, L., Zhong, J., Feng, Y. (eds) Advanced Data Mining and Applications. ADMA 2010. Lecture Notes in Computer Science(), vol 6441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17313-4_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17313-4_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17312-7

  • Online ISBN: 978-3-642-17313-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics