Abstract
We describe how a feature-based semantic lexicon can be automatically extended using large, unstructured text corpora. Experiments are carried out using the lexicon HaGenLex and the Wortschatz corpus. The semantic classes of nouns are determined via the adjectives that modify them. It turns out to be reasonable to combine several classifiers for single attributes into one for complex semantic classes. The method is evaluated thoroughly and possible improvements are discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BIEMANN, C., BORDAG, S., HEYER, G., QUASTHOFF, U. and WOLFF, C. (2004): Language-independent Methods for Compiling Monolingual Lexical Data. In: Proceedings of CicLING 2004. LNCS 2945, Springer, Berlin, 215–228.
BIEMANN, C. and OSSWALD, R. (2005): Automatische Erweiterung eines semantikbasierten Lexikons durch Bootstrapping auf großen Korpora. In: B. Fisseni, H.-C. Schmitz, B. Schröder and P. Wagner (Eds.): Sprachtechnologie, mobile Kommunikation und linguistische Ressourcen — Beiträge zur GLDV-Tagung 2005 in Bonn. Peter Lang, Frankfurt am Main, 15–27.
BORDAG, S. (2003): Sentence Co-Occurrences as Small-World-Graphs: A Solution to Automatic Lexical Disambiguation. In: Proceedings of CicLING 2003. LNCS 2588, Springer, Berlin, 329–333.
DEMPSTER, A.P., LAIRD, N.M. and RUBIN, D.B. (1977): Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B, 39(1):1–38.
HARRIS, Z. (1968): Mathematical Structures of Language. John Wiley & Sons, New York.
HARTRUMPF, S., HELBIG, H. and OSSWALD, R. (2003): The Semantically Based Computer Lexicon HaGenLex — Structure and Technological Environment. Traitement automatique des langues, 44(2), 81–105.
HELBIG, H. (2001): Die semantische Struktur natürlicher Sprache: Wissensrepräsentation mit MultiNet. Springer, Berlin
MILLER, G.A. and CHARLES, W.G. (1991): Contextual correlates of semantic similarity. Language and Cognitive Processes, 6(1):1–28.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer Berlin · Heidelberg
About this paper
Cite this paper
Biemann, C., Osswald, R. (2006). Automatic Extension of Feature-based Semantic Lexicons via Contextual Attributes. In: Spiliopoulou, M., Kruse, R., Borgelt, C., Nürnberger, A., Gaul, W. (eds) From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31314-1_39
Download citation
DOI: https://doi.org/10.1007/3-540-31314-1_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31313-7
Online ISBN: 978-3-540-31314-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)