Abstract
As more and more knowledge and information becomes available through computers, a critical capability of systems supporting knowledge management is the classification of documents into categories that are meaningful to the user. In a step beyond the use of keywords, we developed a system that analyzes the sentences contained in unstructured or semi-structured documents, and utilizes an ontology reflecting the domain knowledge for a semantic classification of the documents. An experimental system has been implemented for the analysis of small documents in combination with a limited ontology; an extension to larger sets of documents and extended ontologies, together with an application to practical tasks, is the focus of ongoing work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kim, H.J., Lee, S.G.: A semi-supervised document clustering technique for information organization. In: Proc. of the ninth international conference on Information and knowledge management, McLean, Virginia (2000)
Gruber, T.: A translation approach to portable ontology specifications. Knowledge Acquisition, An International Journal of Knowledge Acquisition for Knowledge-Based Systems, 5(2) (June 1993)
Pan, X.S.: A context-based free text interpreter, California Polytechnic State University San Luis Obispo Master’s Thesis - Computer Science Department (August 2002)
Sleator, D., Temperley, D.: Parsing English with a Link Grammar, Carnegie Mellon University Computer Science technical report CMU-CS-91-196 (1991)
Melcuk, I.: Dependency Syntax: Theory and Practice. State University of New York Press, New York (1988)
Temperley, D., Sleator, D., Lafferty, J.: An Introduction to the Link Grammar Parser, Technical report, Available (March 1999), http://www.link.cs.cmu.edu/link/
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1999)
Miller, G.: Wordnet: An Online Lexical Database. Int’l J. Lexicography 3(4), 235–312 (1990)
Hahn, J., Subramani, M.R.: A framework of knowledge management systems: issues and challenges for theory and practice. In: Proc. of the twenty first international conference on Information systems (December 2000)
Kim, H.J., Lee, S.G.: A.I. and computational logic: An effective document clustering method using user-adaptable distance metrics. In: Proc. of the 2002 ACM symposium on Applied computing (March 2002)
Minsky, M.: The society of Mind, p. 266. Simon and Schuster, New York (1985)
Cole, R., Mariani, J., Uszkoreit, H., Varile, G., Zaenen, A., Zampolli, A.: Survey of the State of the Art in Human Language Technology, p. 109. Cambridge University Press, Cambridge (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cheng, C.K., Pan, X., Kurfess, F. (2004). Ontology-Based Semantic Classification of Unstructured Documents. In: Nürnberger, A., Detyniecki, M. (eds) Adaptive Multimedia Retrieval. AMR 2003. Lecture Notes in Computer Science, vol 3094. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25981-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-540-25981-7_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22163-0
Online ISBN: 978-3-540-25981-7
eBook Packages: Springer Book Archive