Abstract
We present a knowledge-rich software agent, ContextExplicator, which mediates between the Web and the user’s information or knowledge needs. It provides a method for incremental knowledge-level management (i.e., knowledge discovery, acquisition and representation) for heterogeneous information in the Web.
In ContextExplicator, the incremental knowledge management works through iterative negotiations with the human user:
-
1
Automatic Word-Sense Disambiguation and Induction. General knowledge (e.g., from a lexicon) and previously discovered knowledge support the sense-disambiguation & sense-induction of a word in the given documents, resulting in an improved and refined organization of previously discovered knowledge,
-
2
Interactive Specialization of Query Criteria. At a given moment, the user can reduce certain semantic ambiguities of previously discovered knowledge by selecting one of the context-words which are suggested by ContextExplicator to discriminate between sets of retrieved documents. The selected context-word is also used to direct the discovery of new knowledge in the given documents, and
-
3
Visualization of the Discovered Knowledge. The discovered knowledge is represented in a conceptual lattice. Each lattice-node represents a single word-sense or a conjunction of senses of multiple words. To each node the respectively identified documents are associated. Each web-document is multi-classified into relevant word-sense clusters (lattice nodes), according to the occurrences of specific word-senses in the respective web-document. As a conceptual lattice allows the user to navigate the word-sense clusters and the classified web-documents with multi-level abstractions (i.e., super-/sub-lattice nodes), it provides a flexible scheme for managing knowledge and web-documents in a scalable way.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Barwise, J., Seligman, J.: The rights and wrongs of natural regularity. Ridgeview (1994)
Billhardt, H., Borrajo, D., Maojo, V.: A context vector model for information retrieval. JASIST 53(3), 236–249 (2002)
Bollmann-Sdorra, P., Raghavan, V.V.: On the necessity of term dependence in a query space for weighted retrieval. JASIS 49(13), 1161–1168 (1998)
Christopher Stokoe, M.P.O., Tait, J.: Word sense disambiguation in information retrieval revisited. In: The Proceedings of the 26th SIGIR Conference, pp. 159–166. ACM Press, New York (2003)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. JASIS 41(6), 391–407 (1990)
Fan, W., Gordon, M.D., Pathak, P.: Discovery of context-specific ranking functions for effective information retrieval using genetic programming. IEEE Transactions on Knowledge and Data Engineering 16(4), 523–527 (2004)
Joyce Yue Chai, A.W.B.: The use of word sense disambiguation in an information extraction system. In: AAAI/IAAI 1998, pp. 850–855. AAAI / MIT (1998)
Mihalcea, R., Moldovan, D.: Semantic indexing using wordnet senses. In: ACL Workshop, ACL (2000)
Rungsawang, A.: Dsir: the first trec-7 attempt. In: Proceedings of the Seventh Text REtrieval Conference (TREC 1998), National Institute of Standards and Technology (NIST), pp. 366–372 (1998)
Sanderson, M.: Retrieving with good sense. Information Retrieval 2(1), 49–69 (2000)
Voorhees, E.M.: Using wordnet to disambiguate word senses for text retrieval. In: The Proceedings of the 20th SIGIR Conference, pp. 171–180. ACM Press, New York (1993)
Wordnet (2004), http://www.cogsci.princeton.edu/~wn/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yoo, S.Y., Hoffmann, A. (2005). Knowledge-Level Management of Web Information. In: Zhang, Y., Tanaka, K., Yu, J.X., Wang, S., Li, M. (eds) Web Technologies Research and Development - APWeb 2005. APWeb 2005. Lecture Notes in Computer Science, vol 3399. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31849-1_59
Download citation
DOI: https://doi.org/10.1007/978-3-540-31849-1_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25207-8
Online ISBN: 978-3-540-31849-1
eBook Packages: Computer ScienceComputer Science (R0)