Morphological typology of languages for IR

Ari Pirkola (Department of Information Studies, University of Tampere, PO Box 607, 33101 Tampere, Finland)

Journal of Documentation

ISSN: 0022-0418

Article publication date: 1 June 2001

Abstract

This paper presents a morphological classification of languages from the IR perspective. Linguistic typology research has shown that the morphological complexity of every language in the world can be described by two variables: the index of synthesis and the index of fusion. These variables provide a theoretical basis for IR research that deals with morphological issues. A common theoretical framework is needed in particular because of the increasing significance of cross‐language retrieval research and of CLIR systems that process different languages. The paper elaborates the linguistic morphological typology for the purposes of IR research. It studies how the indexes of synthesis and fusion could be used as practical tools in mono‐ and cross‐lingual IR research. The need for semantic and syntactic typologies is discussed. The paper also reviews studies conducted in different languages on the effects of morphology and stemming in IR.

Citation

Pirkola, A. (2001), "Morphological typology of languages for IR", Journal of Documentation, Vol. 57 No. 3, pp. 330-348. https://doi.org/10.1108/EUM0000000007085

Publisher

MCB UP Ltd

Copyright © 2001, MCB UP Limited
