Abstract
The paper proposes a novel hierarchical classification approach with dynamic-threshold SVM ensemble. At training phrase, hierarchical structure is explored to select suit positive and negative examples as training set in order to obtain better SVM classifiers. When predicting an unseen example, it is classified for all the label classes in a top-down way in hierarchical structure. Particulary, two strategies are proposed to determine dynamic prediction threshold for different label class, with hierarchical structure being utilized again. In four genomic data sets, experiments show that the selection policies of training set outperform existing two ones and two strategies of dynamic prediction threshold achieve better performance than the fixed thresholds.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ashburner, M., Ball, C., Blake, J., Botstein, D.: Gene ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000)
Barbedo, J.G.A., Lopes, A.: Automatic genre classification of musical signals. In: EURASIP Journal on Advances in Signal Processing 2007 (2007)
Barutcuoglu, Z., Schapire, R.E., Troyanskaya, O.G.: Hierarchical multi-label prediction of gene function. Bioinformatics 22(7), 830–836 (2006)
Bradley, A.P.: Use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 1145–1159 (1997)
Burred, J.J., Lerch, A.: A hierarchical approach to automatic musical genre classification. In: Proc. Of the 6 th Int. Conf. on Digital Audio Effects, pp. 8–11 (2003)
Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, SIGIR, pp. 78–87. ACM, Washington (2004)
Clare, A.: Machine learning and data mining for yeast functional genomics. Ph.D. thesis, Department of Computer Science University of Wales Aberystwyth (2003)
Daphne, K., Mehran, S.: Hierarchically classifying documents using very few words. In: ICML 1997: Proceedings of the Fourteenth International Conference on Machine Learning, pp. 170–178. Morgan Kaufmann Publishers Inc., San Francisco (1997)
Guan, Y., Myers, C.L., Hess, D.C., Barutcuoglu, Z., Caudy, A.A., Troyanskaya, O.G.: Predicting gene function in a hierarchical context with an ensemble of classifiers. Genome Biology 9, S3 (2008)
Kiritchenko, S., Matwin, S., Nock, R., Famili, A.F.: Learning and evaluation in the presence of class hierarchies: Application to text categorization. In: Lamontagne, L., Marchand, M. (eds.) Canadian AI 2006. LNCS (LNAI), vol. 4013, pp. 397–408. Springer, Heidelberg (2006)
Lanckriet, G., Deng, M., Cristianini, M., Jordan, M., Noble, W.: Kernel-based data fusion and its application to protein function prediction in yeast. In: Pac. Symp. Biocomput., pp. 300–311 (2004)
Mewes, H.W., Heumann, K., Kaps, A., Mayer, K., Pfeiffer, F., Stocker, S., Frishman, D.: Mips:a database for genomes and protein sequences. Nucleic Acids Res. 30(1), 31–34 (2002)
Rousu, J., Saunders, C., Szedmak, S., Shawe-Taylor, J.: Kernel-based learning of hierarchical multilabel classification models? Journal of Machine Learning Research 7, 1601–1626 (2006)
Tsoumakas, G., Katakis, I.: Multi-label classification: An overview. Int. J. Data Warehousing and Mining 2007, 1–13 (2007)
Vens, C., Struyf, J., Schietgat, L., Dzeroski, S., Blockeel, H.: Decision trees for hierarchical multi-label classification. Achine Learning 73(2), 85–214 (2008)
Xiao, Z., Dellandrea, E., Dou, W., Chen, L.: Hierarchical classification of emotional speech. Tech. rep., LIRIS UMR 5205 CNRS/INSA de Lyon/Universite Claude Bernard Lyon 1/Universit Lumiere Lyon 2/Ecole Centrale de Lyon (2007)
Zhao, X.M., Wang, Y., Chen, L., Aihara, K.: Gene function prediction using labeled and unlabeled data. BMC Bioinformatics 9(1), 57 (2008)
Zhao, X., Li, X., Chen, L., Aihara, K.: Protein classification with imbalanced data. Proteins: Structure, Function, and Bioinformatics 70(4), 1125–1132 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Y., Li, Z., Hu, X., Liu, J. (2010). Hierarchical Classification with Dynamic-Threshold SVM Ensemble for Gene Function Prediction. In: Cao, L., Zhong, J., Feng, Y. (eds) Advanced Data Mining and Applications. ADMA 2010. Lecture Notes in Computer Science(), vol 6441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17313-4_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-17313-4_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17312-7
Online ISBN: 978-3-642-17313-4
eBook Packages: Computer ScienceComputer Science (R0)