Abstract
This paper deals with issues of traditional one-layered book classification systems and employs the complementary attribute of various classifiers to propose a two layered book classification system using voting strategy. Moreover, the collection of dissertations from a university library and books from an electronic bookstore are used as the training and testing corpus. The classification codes of dissertations and books are employed as the gold standard as well. Each dissertation contains various components such as title, authors, table of contents, abstract or cited papers et al. To understand the classification effect of all the combinations of components, various combinations are studied as well and the best combination is recommended. The features extracted from abstracts and table of content are found to be most useful for document classification. On the other hand, to obtain the best classification performance, the combination of classifiers for a two-layered book classification system is studied and the best combination was also recommended as well.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Chen, K.H., Lo, S.C., Lin, C.J.: The Investigation of the Consistency of Subject Cataloging for Academic Journal Articles of Library and Information Science. In: Proceedings of Information and Communication Conference, Taipei, pp. 125–142 (2002) (Chinese)
Chen, S.Y., Yeh, J.Y., Hwang, M.J., Lin, X.J., Ke, H.R., Yang, W.P.: Automatic Book Classification Method combined with Support Vector Machine and Metadata. International Journal of Advanced Information Technologies (IJAIT) 3(1), 2–21 (2009) (Chinese)
AL-Nabi, D.L.A., Ahmed, S.: Survey on Classification Algorithms for Data Mining(Comparison and Evaluation). Computer Engineering and Intelligent Systems 4(8), 18–24 (2013)
Farbrizio, S.: Machine Learning in Automated Text Categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Huang, C.M.: A Neural Network Approach to Automatic Classification of Thesis Documents.,Report of National Science Council (NSC 89-2416-H-224-053), Taiwan (2002) (Chinese)
Huang, J.H.: A Study of Book Title Feature Extraction Based on the Automatic Classification -An Example of BibliographyAutomatically Classified System.Unpublished Master Thesis, Department of Library and Information Science of Fu-Jen Catholic University, Taipei (2008) (Chinese)
Larson, R.R.: Experiments in Automatic Library of Congress Classification. Journal of the American Society for Information Science 43(2), 130–148 (1992)
Magdy, W., Darwish, K.: Book Search: Indexing the Valuable Parts. In: Proceedings of the 2008 Workshop on Research Advances in Large Digital Book Repositories, pp. 53–56 (2008)
Tokkola, K.: Discriminative Features for Document Classification. In: Proceedings of the 16th International Conference on Pattern Recognition (ICPR 2002), vol. 1, pp. 472–475 (2002)
Wang, et al.: Complementary classification approaches for protein sequences. Protein Engineering 9(5), 381–386 (1996)
Wu, et al.: Top 10 algorithms in data mining. Knowledge and Information Systems 14(1), 1–37 (2008)
Yi, K.: Challenges in Automatic Classification using Library Classification Schemes. In: Proceedings of World Library and Information Congress: 72nd IFLA General Conference and Council, pp. 1–14 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Kuo, JJ. (2014). An Automatic Library Data Classification System Using Layer Structure and Voting Strategy. In: Tuamsuk, K., Jatowt, A., Rasmussen, E. (eds) The Emergence of Digital Libraries – Research and Practices. ICADL 2014. Lecture Notes in Computer Science, vol 8839. Springer, Cham. https://doi.org/10.1007/978-3-319-12823-8_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-12823-8_29
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12822-1
Online ISBN: 978-3-319-12823-8
eBook Packages: Computer ScienceComputer Science (R0)