Abstract
This paper presents an experiment to develop natural-language tools to improve the quality of documents. These softwares are using finite-state automata enriched with notions of proximity, optionality and contextual information. They are called bi-directional because they need to parse a sequence not only from the left to right-hand side of a sentence, but on both sides of a word. This method improves efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abeillé A.: Les nouvelles syntaxes. Ed. A. Colin (Paris), (1993)
Abrial J.-R.: The B-book: assigning programs to meanings. Cambridge University Press (Cambridge) (1996)
Aho A., Sethi R., and Ullman J.: Compilers Principles, Techniques, and Tools. Ed. Addison-Wesley, (1986)
ANSSC (1967).: Code of good practices for the documentation of digital computer programs. American Nuclear Society Standards Committee, (1967)
Chomsky N. and Miller G.: Introduction to the formal analysis of natural languages. in Luce R.D., Bush R. & Galanter E., Handbook of Mathematical Psychology Wiley (New York), (1963) 269–322
Grefenstette G. and Tapanainen P.: What is a word, what is a sentence ? Problems of tokenization. COMPLEX (Budapest), (1994)
Gross M.: Méthodes en syntaxe. Ed. Hermann (Paris), (1975)
Gross M. and Perrin D. Electronic dictionaries and automata in computational linguistics. Lecture Notes in Computer Science 377, Springer Verlag (Berlin) (1989)
Huot H.: Sur la notion de racine. TAL, vol bf 35num 2, (1994) 46–76
Karttunen L., Chanod J.-P., Grefenstette G. and Schiller A.: Regular expressions for language engineering. Natural Language Engineering, vol. 2num 4, (1996) 305–328
Kozlowska-Heuchin R.: L’analyse de la notion d’exigence dans un document technique en vue d’une extraction de connaissance automatisée. Bulag (actes de FRAC-TAL’97, Besançon), (1997) 227–234
Lippold B., Pomian J., Henry J.Y. and Elsensohn O.: AVIS: une méthode d’analyse de la cohérence des documents. Proceedings of the European Safety and Reliability conference (La Baule), (1994) 888–899
Lippold B. and Pomian J.: AVIS: une méthode d’analyse de la cohérence des documents techniques. Proceedings of the Fifteenth International Conference IA 95-Language Engineering 95 (Montpellier), (1995) 301–312
Lippold B., Kozlowska-Heuchin R. and Poibeau T.: NTK.FOCUS: un outil d’aide á la rédaction des documents techniques. JST-FRANCIL’97 (Avignon), (1997) 369–374
Maurel D.: Reconnaissance de séquences de mots par automates, adverbes de date du français. Thése de doctorat en informatique, Université de Paris 7 (1989)
Poibeau T. and Maurel D.: A la fin de: preposition ou determinant complexe dans les adverbiaux de temps ? Cahiers de grammaire, num 20, (1995) 101–111
Pomian J.: Mémoire d’entreprise. Sapientia (Ivry), (1996).
Revuz D.: Dictionnaires et lexiques Méthodes et algorithmes. Thése de doctorat en informatique, Université de Paris 7, (1991)
Roche E. and Schabes Y.: Finite State Language Processing. MIT Press (Cambridge), (1997)
Silberztein M.: Dictionnaires électroniques et analyse automatique des textes. Ed. Masson (Paris), (1993)
Vogel C.: Génie cognitif. Ed. Masson (Paris), (1988)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Poibeau, T. (1999). Bi-directional Automata to Extract Complex Phrases from Texts. In: Champarnaud, JM., Ziadi, D., Maurel, D. (eds) Automata Implementation. WIA 1998. Lecture Notes in Computer Science, vol 1660. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48057-9_10
Download citation
DOI: https://doi.org/10.1007/3-540-48057-9_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66652-3
Online ISBN: 978-3-540-48057-0
eBook Packages: Springer Book Archive