Abstract
Algorithm which has taken the third place in “JRS 2012 Data Mining Competition” among 126 participants is described. The competition was related to the problem of predicting topical classification of scientific publications in a field of biomedicine. The presented algorithm is a combination (blend) of simple classification algorithms: a linear classifier, a k-NN classifier and two SVMs. We build the combination using special estimation matrices. It proves again that combinations have significantly better performance compared to their individual members.
This work was supported by the Russian Foundation for Basic Research, project 12-07-00187; by the President of the Russian Federation, project MD-757.2011.9. The author is also grateful to the organizers of “JRS 2012 Data Mining Competition” for running the interesting competition. Finally, we want to thank all the active participants of the challenge for their efforts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
National Library of Medicine: PubMed Central (PMC): An Archive for Literature from Life Sciences Journals. In: McEntyre, J., Ostell, J. (Eds.): The NCBI Handbook, http://www.ncbi.nlm.nih.gov/books/NBK21087/
National Library of Medicine: Introduction to MeSH (2012), http://www.nlm.nih.gov/mesh/introduction.html
van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworth (1979)
Bauer, E., Kohavi, R.: An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants. Machine Learning 36(1-2), 105–139 (1999)
Breiman, L.: Bagging predictors. Machine Learning 24, 123–140 (1996)
Freund, Y., Schapire, R.E.: A short introduction to boosting. Journal of Japanese Society for Artificial Intelligence 14(5), 771–780 (1999)
Zhuravlev, Y.I.: An Algebraic Approach to Recognition and Classification Problems. In: Problems of Cybernetics, vol. 33, pp. 5–68. Nauka, Moscow (1978); Hafner (1986)
Zhuravlev, Yu, I.: Correct Algorithms over Sets of Incorrect (Heuristic) Algorithms: Part II. Kibernetika 6, 21–27 (1977)
D’yakonov, A.: Two Recommendation Algorithms Based on Deformed Linear Combinations. In: Proc. of ECML-PKDD 2011 Discovery Challenge Workshop, pp. 21–28 (2011), http://ceur-ws.org/Vol-770/paper5.pdf
Manning, C., Raghavan, P., Schutze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)
http://en.wikipedia.org/wiki/Cross-validation_(statistics)
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2(3) (2011), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Cortes, C., Vapnik, V.: Support-Vector Networks. Machine Learning, 20 (1995)
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: LIBLINEAR: A Library for Large Linear Classification. Journal of Machine Learning Research 9, 1871–1874 (2008), http://www.csie.ntu.edu.tw/~cjlin/liblinear
Haykin, S.: Neural Networks: A Comprehensive Foundation. Prentice Hall (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
D’yakonov, A. (2012). A Blending of Simple Algorithms for Topical Classification. In: Yao, J., et al. Rough Sets and Current Trends in Computing. RSCTC 2012. Lecture Notes in Computer Science(), vol 7413. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32115-3_51
Download citation
DOI: https://doi.org/10.1007/978-3-642-32115-3_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32114-6
Online ISBN: 978-3-642-32115-3
eBook Packages: Computer ScienceComputer Science (R0)