Skip to main content

A Blending of Simple Algorithms for Topical Classification

  • Conference paper
Rough Sets and Current Trends in Computing (RSCTC 2012)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7413))

Included in the following conference series:

Abstract

Algorithm which has taken the third place in “JRS 2012 Data Mining Competition” among 126 participants is described. The competition was related to the problem of predicting topical classification of scientific publications in a field of biomedicine. The presented algorithm is a combination (blend) of simple classification algorithms: a linear classifier, a k-NN classifier and two SVMs. We build the combination using special estimation matrices. It proves again that combinations have significantly better performance compared to their individual members.

This work was supported by the Russian Foundation for Basic Research, project 12-07-00187; by the President of the Russian Federation, project MD-757.2011.9. The author is also grateful to the organizers of “JRS 2012 Data Mining Competition” for running the interesting competition. Finally, we want to thank all the active participants of the challenge for their efforts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://tunedit.org/challenge/JRS12Contest

  2. National Library of Medicine: PubMed Central (PMC): An Archive for Literature from Life Sciences Journals. In: McEntyre, J., Ostell, J. (Eds.): The NCBI Handbook, http://www.ncbi.nlm.nih.gov/books/NBK21087/

  3. National Library of Medicine: Introduction to MeSH (2012), http://www.nlm.nih.gov/mesh/introduction.html

  4. van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworth (1979)

    Google Scholar 

  5. Bauer, E., Kohavi, R.: An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants. Machine Learning 36(1-2), 105–139 (1999)

    Article  Google Scholar 

  6. Breiman, L.: Bagging predictors. Machine Learning 24, 123–140 (1996)

    MathSciNet  MATH  Google Scholar 

  7. Freund, Y., Schapire, R.E.: A short introduction to boosting. Journal of Japanese Society for Artificial Intelligence 14(5), 771–780 (1999)

    Google Scholar 

  8. Zhuravlev, Y.I.: An Algebraic Approach to Recognition and Classification Problems. In: Problems of Cybernetics, vol. 33, pp. 5–68. Nauka, Moscow (1978); Hafner (1986)

    Google Scholar 

  9. Zhuravlev, Yu, I.: Correct Algorithms over Sets of Incorrect (Heuristic) Algorithms: Part II. Kibernetika 6, 21–27 (1977)

    Google Scholar 

  10. D’yakonov, A.: Two Recommendation Algorithms Based on Deformed Linear Combinations. In: Proc. of ECML-PKDD 2011 Discovery Challenge Workshop, pp. 21–28 (2011), http://ceur-ws.org/Vol-770/paper5.pdf

  11. http://en.wikipedia.org/wiki/Linear_regression

  12. Manning, C., Raghavan, P., Schutze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)

    Google Scholar 

  13. http://en.wikipedia.org/wiki/Cross-validation_(statistics)

    Google Scholar 

  14. Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2(3) (2011), http://www.csie.ntu.edu.tw/~cjlin/libsvm

  15. Cortes, C., Vapnik, V.: Support-Vector Networks. Machine Learning, 20 (1995)

    Google Scholar 

  16. http://www.mathworks.com

  17. Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: LIBLINEAR: A Library for Large Linear Classification. Journal of Machine Learning Research 9, 1871–1874 (2008), http://www.csie.ntu.edu.tw/~cjlin/liblinear

    MATH  Google Scholar 

  18. Haykin, S.: Neural Networks: A Comprehensive Foundation. Prentice Hall (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

D’yakonov, A. (2012). A Blending of Simple Algorithms for Topical Classification. In: Yao, J., et al. Rough Sets and Current Trends in Computing. RSCTC 2012. Lecture Notes in Computer Science(), vol 7413. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32115-3_51

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32115-3_51

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32114-6

  • Online ISBN: 978-3-642-32115-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics