Skip to main content
Log in

Automatic analysis of inflectional morphems in German nouns

An algorithm for automatic indexing

  • Published:
Acta Informatica Aims and scope Submit manuscript

Summary

The underlying algorithm can be used for automatic indexing. It is supposed to analyze every inflected German noun automatically by reducing it to its proper standard form and by distinguishing every inflected noun from non-inflected ones. The algorithm starts out with two presuppositions:

  1. 1.

    Nouns first of all must be considered to be the key words for automatic indexing as they contain the most relevant information of all parts of speech, and as they have statistically the greatest frequency.

  2. 2.

    Since in German we use capital letters for every noun we are able to distinguish homographs of parts of speech to a large extent.

The testwords are to be looked up in an automatic dictionary and after the complete analysis we get either the correct analysis of the testword or the result that the testword is not contained in the dictionary. The algorithm uses a maximum of four pieces of information with every word: inflectional group singular (1–9), inflectional group plural (1–10); information about the umlaut in the plural or/and the change of β to S S both depending on the inflectional system. The algorithm is supposed to give the following results:

  1. 1.

    The correct analysis of every German noun including foreign words, loan words and exceptions.

  2. 2.

    Automatic recognition of inflected and non-inflected noun homographs.

  3. 3.

    The impossibility of wrong analyses.

  4. 4.

    The algorithm works with a minimum of dictionary operations.

  5. 5.

    The dictionary can be relatively small according to the number of special terms which the specialist considers necessary for his field or for his special task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Abbreviations

INF:

INFINITVE

PS:

PERSON

PRÄS:

PRESENT TENSE

SG:

SINGULAR

PL:

PLURAL

NOM:

NOMINATIVE

GEN:

GENITIVE

DAT:

DATIVE

ACC:

ACCUSATIVE

FLS:

INFLECTIONAL GROUP SINGULAR

FLP:

INFLECTIONAL GROUP PLURAL

UML:

UMLAUT DEPENDENT ON INFLEXION

KW:

CHANGE OF ß — SS

KTABS:

TABLE OF CASE COMBINATION SINGULAR

KTABP:

TABLE OF CASE COMBINATION PLURAL

KKZS:

ENCODED NUMBER OF CASE COMBINATION SINGULAR

KKZP:

ENCODED NUMBER OF CASE COMBINATION PLURAL

NAG:

COMBINATION OF NOM, GEN, ACC

References

  • Admoni, V. G.: Stroj sovremennogo nemeckogo jazyka (Der deutsche Sprachbau), 2. Auflage. Moskau, Leningrad 1966.

    Google Scholar 

  • Braun, S., Schott, G.: Studien zur automatischen syntaktischen Analyse russischer Texte, TU-Berichte, Nr. 6910, München 1969.

  • Braun, S., Schott, G., Reber, N.: Studien zur automatischen syntaktischen Analyse russischer Texte, Teil II, TU-Berichte, Nr. 7008, München 1970.

  • Erben, J.: Deutsche Grammatik. Ein Leitfaden, Fischer-Bücherei Nr. 904, Hamburg 1969.

  • Griesbach, H., Schulz, D.: Grammatik der deutschen Sprache. München 1962.

  • Heringer, H.-J.: Deutsche Syntax, Göschen-Reihe Nr. 1246/1246a, Berlin 1970.

  • Jude, W. K.: Deutsche Grammatik, 3. Auflage. Braunschweig-München 1954.

  • Klein, W., Rath, R.: Automatische Lemmatisierung, Arbeitsbericht Nr. 10. Linguistische Arbeiten des Germanistischen Instituts und des Instituts für Angewandte Mathematik der Universität des Saarlandes, Saarbrücken, Juni 1971.

    Google Scholar 

  • Lehmann, W. P.: Stachowitz, R., Research in German-English Machine Translation on Syntactic Level, RADC-TR-69-368, vol. II, Final Technial Report, Austin, Texas 1970.

  • Maas, D.: Homographie und maschinelle Sprachübersetzung = Beiträge zur Linguistik und Informationsverarbeitung, Heft 21 (1971), p. 7–35.

  • Mater, E.: Rückläufiges Wörterbuch der deutschen Gegenwartssprache. Leipzig 1965.

  • Mel'čuk, I. A.: Morfologičeskij analiz pri mašinnom perevode = Problemy kibernetiki, vyp. 6 (1961), p. 207–276

  • Salton, G.: Automatic Information, Organization and Retrieval, McGraw Hill 1968.

  • Zimmermann, H. H.: Zur Auflösung von Mehrdeutigkeiten bei einer maschinellen Analyse des Deutschen = Beiträge zur Linguistik und Informationsverarbeitung, Heft 21 (1971), p. 36–49

Download references

Author information

Authors and Affiliations

Authors

Additional information

The German version of the text is available at the Mathematisches Institut TUM, D-8000 München 2, Arcisstr. 21, Bericht Nr. 7210.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Schott, G. Automatic analysis of inflectional morphems in German nouns. Acta Informatica 1, 360–374 (1972). https://doi.org/10.1007/BF00289515

Download citation

  • Received:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00289515

Keywords

Navigation