Wie man mit der Wikipedia semantische Verfahren verbessern kann

Gillmeier, Stephan; Hengartner, Urs; Pedrazzini, Sandro

doi:10.1007/BF03340439

Published: 13 January 2014

Volume 47, pages 70–80 (2010)

HMD Praxis der Wirtschaftsinformatik

Abstract

Automatically assigning topics to arbitrary documents is one of the most demanding tasks in computational linguistics. To be technically feasible at all, it presupposes a certain "understanding" of the text. Such methods usually rely on large, manually built, thematically pre-sorted databases. This work uses the "database" Wikipedia together with statistical data analysis, exploiting its semantic structures to automatically identify suitable topics for documents and then assign them. In addition, a further method shows how the retrieval of similar documents can be improved.

Author information

Authors and Affiliations

Canoo Engineering AG, Kirschgartenstr. 5, CH-4051, Basel
Stephan Gillmeier, Urs Hengartner & Sandro Pedrazzini

Corresponding author

Correspondence to Stephan Gillmeier.

About this article

Cite this article

Gillmeier, S., Hengartner, U. & Pedrazzini, S. Wie man mit der Wikipedia semantische Verfahren verbessern kann. HMD 47, 70–80 (2010). https://doi.org/10.1007/BF03340439

Issue Date: February 2010
