Skip to main content

An Automatic Library Data Classification System Using Layer Structure and Voting Strategy

  • Conference paper
  • 2008 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8839))

Abstract

This paper deals with issues of traditional one-layered book classification systems and employs the complementary attribute of various classifiers to propose a two layered book classification system using voting strategy. Moreover, the collection of dissertations from a university library and books from an electronic bookstore are used as the training and testing corpus. The classification codes of dissertations and books are employed as the gold standard as well. Each dissertation contains various components such as title, authors, table of contents, abstract or cited papers et al. To understand the classification effect of all the combinations of components, various combinations are studied as well and the best combination is recommended. The features extracted from abstracts and table of content are found to be most useful for document classification. On the other hand, to obtain the best classification performance, the combination of classifiers for a two-layered book classification system is studied and the best combination was also recommended as well.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chen, K.H., Lo, S.C., Lin, C.J.: The Investigation of the Consistency of Subject Cataloging for Academic Journal Articles of Library and Information Science. In: Proceedings of Information and Communication Conference, Taipei, pp. 125–142 (2002) (Chinese)

    Google Scholar 

  2. Chen, S.Y., Yeh, J.Y., Hwang, M.J., Lin, X.J., Ke, H.R., Yang, W.P.: Automatic Book Classification Method combined with Support Vector Machine and Metadata. International Journal of Advanced Information Technologies (IJAIT) 3(1), 2–21 (2009) (Chinese)

    Google Scholar 

  3. AL-Nabi, D.L.A., Ahmed, S.: Survey on Classification Algorithms for Data Mining(Comparison and Evaluation). Computer Engineering and Intelligent Systems 4(8), 18–24 (2013)

    Google Scholar 

  4. Farbrizio, S.: Machine Learning in Automated Text Categorization. ACM Computing Surveys 34(1), 1–47 (2002)

    Article  Google Scholar 

  5. Huang, C.M.: A Neural Network Approach to Automatic Classification of Thesis Documents.,Report of National Science Council (NSC 89-2416-H-224-053), Taiwan (2002) (Chinese)

    Google Scholar 

  6. Huang, J.H.: A Study of Book Title Feature Extraction Based on the Automatic Classification -An Example of BibliographyAutomatically Classified System.Unpublished Master Thesis, Department of Library and Information Science of Fu-Jen Catholic University, Taipei (2008) (Chinese)

    Google Scholar 

  7. Larson, R.R.: Experiments in Automatic Library of Congress Classification. Journal of the American Society for Information Science 43(2), 130–148 (1992)

    Article  Google Scholar 

  8. Magdy, W., Darwish, K.: Book Search: Indexing the Valuable Parts. In: Proceedings of the 2008 Workshop on Research Advances in Large Digital Book Repositories, pp. 53–56 (2008)

    Google Scholar 

  9. Tokkola, K.: Discriminative Features for Document Classification. In: Proceedings of the 16th International Conference on Pattern Recognition (ICPR 2002), vol. 1, pp. 472–475 (2002)

    Google Scholar 

  10. Wang, et al.: Complementary classification approaches for protein sequences. Protein Engineering 9(5), 381–386 (1996)

    Article  Google Scholar 

  11. Wu, et al.: Top 10 algorithms in data mining. Knowledge and Information Systems 14(1), 1–37 (2008)

    Article  Google Scholar 

  12. Yi, K.: Challenges in Automatic Classification using Library Classification Schemes. In: Proceedings of World Library and Information Congress: 72nd IFLA General Conference and Council, pp. 1–14 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Kuo, JJ. (2014). An Automatic Library Data Classification System Using Layer Structure and Voting Strategy. In: Tuamsuk, K., Jatowt, A., Rasmussen, E. (eds) The Emergence of Digital Libraries – Research and Practices. ICADL 2014. Lecture Notes in Computer Science, vol 8839. Springer, Cham. https://doi.org/10.1007/978-3-319-12823-8_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12823-8_29

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12822-1

  • Online ISBN: 978-3-319-12823-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics