Abstract
Named entities are ubiquitous in documents on the web and in other document repositories. The information that a human user associates with the named entities occurring in a document often suffices to derive a simplified picture, or fingerprint, of its contents. More generally, background knowledge about named entities makes documents easier to understand. To use this kind of information in automated document processing, resources are needed that make the information implicitly carried by named entities explicit and formalize it in an appropriate way. We describe the systematics and architecture of an experimental resource that contains a thematic-geographic-temporal hierarchy for classifying named entities, positions named entities of various kinds with respect to the hierarchy, lists synonyms, and gives formal descriptions of these entities and their relations. The resource is intended to offer a general basis for semantic annotation, indexing, retrieval, querying, browsing and hyperlinking of (semi-)textual web documents, structured documents and flat texts.
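The kind of resource outlined in the abstract can be pictured with a small Python sketch. It is only an illustration under stated assumptions, not the authors' design: the class names `Node`, `Entity` and `Resource`, the method names, and the sample data are all hypothetical, and the "fingerprint" is reduced here to a count of thematic categories reached from the entities mentioned in a document.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Node:
    """One node of a thematic, geographic, or temporal hierarchy (hypothetical)."""
    name: str
    parent: Optional["Node"] = None

    def path(self) -> List[str]:
        """Return the node names from the root down to this node."""
        names = []
        node = self
        while node is not None:
            names.append(node.name)
            node = node.parent
        return names[::-1]


@dataclass
class Entity:
    """A named entity positioned in all three hierarchies, with synonyms."""
    canonical: str
    theme: Node
    place: Node
    period: Node
    synonyms: Set[str] = field(default_factory=set)


class Resource:
    """Maps surface forms (canonical names and synonyms) to entities."""

    def __init__(self) -> None:
        self._index: Dict[str, Entity] = {}

    def add(self, entity: Entity) -> None:
        # Index the entity under its canonical name and every synonym.
        for name in {entity.canonical, *entity.synonyms}:
            self._index[name.lower()] = entity

    def lookup(self, surface: str) -> Optional[Entity]:
        return self._index.get(surface.lower())

    def fingerprint(self, mentions: List[str]) -> Counter:
        """Count thematic categories (with ancestors) over known mentions."""
        counts: Counter = Counter()
        for mention in mentions:
            entity = self.lookup(mention)
            if entity is not None:
                counts.update(entity.theme.path())
        return counts


# Illustrative sample data (assumed for this sketch, not taken from the paper).
science = Node("Science")
physics = Node("Physics", parent=science)
europe = Node("Europe")
germany = Node("Germany", parent=europe)
century_20 = Node("20th century")

resource = Resource()
resource.add(Entity("Albert Einstein", theme=physics, place=germany,
                    period=century_20, synonyms={"Einstein"}))

# Mentions found in a document; unknown strings are simply ignored.
print(resource.fingerprint(["Einstein", "Albert Einstein", "some unknown name"]))
```

Keeping one parent pointer per node means that each entity's position in a hierarchy yields its broader categories directly, which is what lets a handful of recognised names summarise a document. A real resource would also need the formal descriptions and relations between entities that the abstract mentions; those are left out of this sketch.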
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schulz, K.U., Weigel, F. (2003). Systematics and Architecture for a Resource Representing Knowledge about Named Entities. In: Bry, F., Henze, N., Małuszyński, J. (eds) Principles and Practice of Semantic Web Reasoning. PPSWR 2003. Lecture Notes in Computer Science, vol 2901. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24572-8_13
DOI: https://doi.org/10.1007/978-3-540-24572-8_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20582-1
Online ISBN: 978-3-540-24572-8
eBook Packages: Springer Book Archive