Abstract
INTEX is a linguistic development environment that includes large-coverage dictionaries and grammars, and parses texts of several million words in real time. INTEX has tools to create and maintain large-coverage lexical resources as well as morphological and syntactic grammars. Dictionaries and grammars are applied to texts in order to locate morphological, lexical and syntactic patterns, remove ambiguities, and tag simple and compound words. INTEX can build lemmatized concordances and indices of large texts with respect to all types of finite-state patterns. INTEX is used as a corpus processor to analyze literary, journalistic and technical texts. I describe here the subset of tools used to perform advanced search requests on large texts.
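To make the idea of a lemmatized concordance concrete, here is a minimal sketch in Python. It is not INTEX code and does not reflect INTEX's dictionary format or its finite-state machinery. The dictionary `TOY_DICTIONARY` and the functions `tokenize`, `lemma_of` and `lemmatized_concordance` are hypothetical names introduced only for illustration. The sketch assumes a plain lookup table from inflected forms to lemmas, and returns every occurrence of a lemma together with its left and right context.

```python
import re

# Hypothetical toy dictionary mapping inflected forms to lemmas.
# This is an assumption for illustration; it is not INTEX's dictionary format.
TOY_DICTIONARY = {
    "parses": "parse",
    "parsed": "parse",
    "parsing": "parse",
    "texts": "text",
    "grammars": "grammar",
}


def tokenize(text):
    """Return (token, start, end) triples for each alphabetic word."""
    return [(m.group(0), m.start(), m.end())
            for m in re.finditer(r"[A-Za-z]+", text)]


def lemma_of(token):
    """Look up the lemma of a token; unknown words are their own lemma."""
    lowered = token.lower()
    return TOY_DICTIONARY.get(lowered, lowered)


def lemmatized_concordance(text, lemma, width=20):
    """Collect (left context, matched form, right context) for a lemma."""
    lines = []
    for token, start, end in tokenize(text):
        if lemma_of(token) == lemma:
            left = text[max(0, start - width):start]
            right = text[end:end + width]
            lines.append((left, token, right))
    return lines


if __name__ == "__main__":
    sample = "The system parses texts. It parsed one text while parsing another."
    for left, hit, right in lemmatized_concordance(sample, "parse"):
        print(f"{left:>20} [{hit}] {right}")
```

Searching for the lemma `parse` retrieves the forms "parses", "parsed" and "parsing" in a single request, which is the practical benefit of indexing by lemma rather than by surface form. A real system would replace the lookup table with large-coverage dictionaries and the single-lemma test with a finite-state pattern.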
Cite this article
Silberztein, M. Text Indexation with INTEX. Computers and the Humanities 33, 265–280 (1999). https://doi.org/10.1023/A:1002493406213
DOI: https://doi.org/10.1023/A:1002493406213