Abstract
In this paper, we propose a new document classification model which utilizes background knowledge gathered by ontologies for document representation. A document is represented using a set of ontology concepts that are acquired by exact matching technique and through identification and extraction of new terms which can be semantically related to these concepts. In addition, a new concept weighting scheme composed of concept relevance and importance is employed by the model to compute weight of concepts. We conducted experiments to test the model and the obtained results showed that a considerable improvement of classification performance is achieved by using our proposed model.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Camous, F., Blott, S., Smeaton, A.F.: Ontology-based MEDLINE document classification. In: Hochreiter, S., Wagner, R. (eds.) BIRD 2007. LNCS, vol. 4414, pp. 439–452. Springer, Heidelberg (2007). doi:10.1007/978-3-540-71233-6_34
Dinh, D., Tamine, L.: Biomedical concept extraction based on combining the content-based and word order similarities. In: ACM Symposium on Applied Computing, pp. 1159–1163 (2011)
Sy, M.-F., Ranwez, S., Montmain, J., Regnault, A., Crampes, M., Ranwez, V.: User centered and ontology based information retrieval system for life sciences. BMC Bioinform. 13(1) (2012)
Kastrati, Z., Imran, A., Yayilgan, S.: SEMCON - a semantic and contextual objective metric for enriching domain ontology concepts. Int. J. Semant. Web Inf. Syst. 12(2), 1–24 (2016)
Kastrati, Z., Imran, A., Yayilgan, S.Y.: An improved concept vector space model for ontology based classification. In: 11th International Conference on Signal Image Technology & Internet Systems, pp. 240–245 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Kastrati, Z., Yayilgan, S.Y. (2017). Improving Document Classification Effectiveness Using Knowledge Exploited by Ontologies. In: Frasincar, F., Ittoo, A., Nguyen, L., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2017. Lecture Notes in Computer Science(), vol 10260. Springer, Cham. https://doi.org/10.1007/978-3-319-59569-6_52
Download citation
DOI: https://doi.org/10.1007/978-3-319-59569-6_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59568-9
Online ISBN: 978-3-319-59569-6
eBook Packages: Computer ScienceComputer Science (R0)