Skip to main content
Log in

Text Indexation with INTEX

  • Published:
Computers and the Humanities Aims and scope Submit manuscript

Abstract

INTEX is a linguistic development environment that includes large-coverage dictionaries and grammars, and parses texts of several million words in real time. INTEX has tools to create and maintain large-coverage lexical resources as well as morphological and syntactic grammars. Dictionaries and grammars are applied to texts in order to locate morphological, lexical and syntactic patterns, remove ambiguities, and tag simple and compound words. INTEX can build lemmatized concordances and indices of large texts with respect to all types of Finite State patterns. INTEX is used as a corpus processor, to analyze literary, journalistic and technical texts. I describe here the subset of tools used to perform advanced search requests on large texts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Max, Silberztein. Dictionnaires électroniques et analyse automatique de textes: le système INTEX. Masson: Paris, 1993.

    Google Scholar 

  • Emmanuel, Roche and Schabes Yves (Eds.). Finite State Language Processing. The MIT Press: Cambridge, Massachusetts, 1997.

    Google Scholar 

  • Blandine, Courtois and Silberztein Max (Eds.). Les dictionnaires électroniques. Langue française 87, Larousse: Paris, 1990.

    Google Scholar 

  • Maurice, Gross. Une grammaire locale de l'expression des sentiments. Langue française 105, Larousse: Paris, 1995.

    Google Scholar 

  • Eric, Laporte. Experiments in Lexical Disambiguation Using Local Grammars. In Papers in Computational Lexicography (COMPLEX). Eds. F. Kiefer, G. Kiss, and J. Pajzs, Budapest: Research Institute for Linguistics, Hungarian Academy of Sciences, 1994, pp. 163–172.

    Google Scholar 

  • Denis, Maurel. Reconnaissance de séquences de mots par automate. Thèse de doctorat en informatique, LADL, Université Paris 7: Paris, 1989.

    Google Scholar 

  • Dominique, Revuz. “Minimization of Acyclic Deterministic Automata in Linear Time”. Theoretical Computational Science 92 (1992), 181–189.

    Article  Google Scholar 

  • Emmanuel, Roche. Analyse syntaxique transformationnelle du français par transducteurs et lexique-grammaire. Thèse de doctorat en Informatique, Université Paris 7: Paris, 1993.

    Google Scholar 

  • Max, Silberztein (Ed.). Proceedings of the First INTEX User's Workshop. LADL: Paris, 1996.

    Google Scholar 

  • Cédrick, Fairon (Ed.). Proceedings of the Second INTEX User's Workshop. LADL: Paris,, 1999 forthcoming.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Silberztein, M. Text Indexation with INTEX. Computers and the Humanities 33, 265–280 (1999). https://doi.org/10.1023/A:1002493406213

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1002493406213

Navigation