Skip to main content

Integrating Lexical Resources Through an Aligned Lemma List

  • Chapter
Linked Data in Linguistics

Abstract

This paper presents the modelling of a common meta-index for large modern and historical lexical resources of the DWDS project. Four dictionaries of the German language are part of DWDS: (1) eWDG2, a digital version of the Wörterbuch der deutschen Gegenwartssprache (WDG, 1962-1977); (2) DWDSWB, a new and extended edition of the WDG (started in 2010); (3) EtymWB, a digital version of Wolfgang Pfeifer’s Etymologisches Wörterbuch des Deutschen (1989); (4) 1DWB, a digital version of the first edition of Grimm’s Deutsches Wörterbuch (1832-1961). Due to the different lexicographical principles and traditions employed for these resources as well as the different historical periods covered, such a meta-index cannot be modelled as a simple list of 1:1-correspondences between entries across different dictionaries. In order to model the occurring phenomena such as graphematic headword variance, homography, semantic change and differences in the semantic entry structure a more complex typed link structure is required.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Behrens L (2002) Structuring of word meaning II: Aspects of polysemy. In: Cruse DA, Hundsnurscher F, Job M, Lutzeier PR (eds) Lexikologie – Lexicology. Ein internationales Handbuch zur Natur und Struktur von Wörtern und Wortschätzen, vol 1, de Gruyter, Berlin, pp 319–337

    Google Scholar 

  • DWB (1854–1961) Deutsches Wörterbuch. Hirzel, Leipzig

    Google Scholar 

  • Dückert J (ed) (1987) Das Grimmsche Wörterbuch. Untersuchungen zur lexikographischen Methodologie. Hirzel, Leipzig

    Google Scholar 

  • Geyken A (2007) The DWDS corpus: A reference corpus for the German language of the twentieth century. In: Fellbaum C (ed) Idioms and collocations: Corpus-based linguistic and lexicographic studies, Research in corpus and discourse, Continuum, London, pp 23–40

    Google Scholar 

  • Geyken A, Didakowski J, Siebert A (2009) Generation of word profiles for large German corpora. In: Kawaguchi Y, Minegishi M, Durand J (eds) Corpus analysis and variation in linguistics, Studies in Linguistics, vol 1, John Benjamins, pp 141–157

    Google Scholar 

  • Herold A (2011) Retrodigitalisierung und Modellierung des Wörterbuchs der deutschen Gegenwartssprache. In: Krafft A, Spiegel C (eds) Sprachliche Förderung und Weiterbildung – transdisziplinär, no. 51 in Forum Angewandte Linguistik, Peter Lang, Frankfurt (M.), Berlin

    Google Scholar 

  • Jurish B (2010) More than words. Using token context to improve canonicalization of historical German. JLCL 25(1):23–40

    Google Scholar 

  • Kempcke G (2001) Polysemie oder Homonymie? Zur Praxis der Bedeutungsgliederung in den Wörterbuchartikeln synchronischer einsprachiger Wörterbücher der Deutschen Sprache. Lexicographica 17:61–68

    Google Scholar 

  • Klein W (2004) Vom Wörterbuch zum Digitalen Lexikalischen System. Zeitschrift für Literaturwissenschaft und Linguistik 136:10–55

    Google Scholar 

  • Klein W, Geyken A (2010) Das digitale Wörterbuch der deutschen Sprache (DWDS). Lexicographica 26:79–93

    Google Scholar 

  • Kunze C, Lemnitzer L (2010) Lexical-semantic and conceptual relations in GermaNet. In: Storjohann P (ed) Lexical-semantic relations: Theoretical and practical perspectives, no. 28 in Lingvisticæ Investigationes Supplementa, John Benjamins, Amsterdam, pp 163–183

    Google Scholar 

  • Pfeifer W (1989) Etymologisches Wörterbuch des Deutschen. Akademie-Verlag, Berlin

    Google Scholar 

  • Schmidt H (2004) Das Deutsche Wörterbuch. Gebrauchsanweisung. In: Bartz HW, Burch T, Christmann R, Gärtner K, Hildenbrandt V, Schares T, Wegge K (eds) Deutsches Wörterbuch. Elektronische Ausgabe der Erstbearbeitung von Jacob Grimm und Wilhelm Grimm, Zweitausendeins, Frankfurt (M.), pp 25–64

    Google Scholar 

  • Sokirko A (2003) DDC – a search engine for linguistically annotated corpora. In: Proceedings of Dialog 2003, Protvino (Russia)

    Google Scholar 

  • WDG (1962–1977) Wörterbuch der deutschen Gegenwartssprache. Akademie-Verlag, Berlin

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Axel Herold .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Herold, A., Lemnitzer, L., Geyken, A. (2012). Integrating Lexical Resources Through an Aligned Lemma List. In: Chiarcos, C., Nordhoff, S., Hellmann, S. (eds) Linked Data in Linguistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28249-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28249-2_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28248-5

  • Online ISBN: 978-3-642-28249-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics