Language Understanding Using n-multigram Models

Hurtado, Lluís; Segarra, Encarna; García, Fernando; Sanchis, Emilio

doi:10.1007/978-3-540-30228-5_19


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3230)

Included in the following conference series:

International Conference on Natural Language Processing (in Spain)


Abstract

In this work, we present an approach to language understanding that uses corpus-based statistical language models built on multigrams. Assuming that meanings can be assigned to segments of words, n-multigram modeling is well suited to modeling sequences of segments that have semantic information associated with them. The approach has been applied to speech understanding within a dialogue system that answers queries about train timetables in Spanish. Experimental results are also reported.

Work partially funded by CICYT under project TIC2002-04103-C03-03, Spain.
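
The segmentation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the simplest case, a 1-multigram model in which segments are treated as independent, whereas an n-multigram model would condition each segment on the previous n-1 segments. The function name `best_segmentation`, the `max_len` limit, and the toy probabilities in `TOY_PROBS` are all hypothetical and chosen only for illustration; the English example sentence stands in for the Spanish train-timetable queries of the actual task.

```python
import math


def best_segmentation(words, seg_logprob, max_len=3):
    """Viterbi search for the most likely segmentation of `words`.

    Assumes a 1-multigram model: the score of a segmentation is the
    sum of the log-probabilities of its segments. `seg_logprob` maps
    tuples of words to log-probabilities (hypothetical values here).
    """
    n = len(words)
    best = [-math.inf] * (n + 1)  # best[i]: best log-prob of words[:i]
    back = [0] * (n + 1)          # back[i]: start index of the last segment
    best[0] = 0.0
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            seg = tuple(words[start:end])
            if seg not in seg_logprob:
                continue  # segment unseen in the (toy) model
            score = best[start] + seg_logprob[seg]
            if score > best[end]:
                best[end] = score
                back[end] = start
    if best[n] == -math.inf:
        return None, -math.inf  # no segmentation is covered by the model
    segments = []
    i = n
    while i > 0:
        segments.append(tuple(words[back[i]:i]))
        i = back[i]
    segments.reverse()
    return segments, best[n]


# Hypothetical segment probabilities, invented for this example only.
TOY_PROBS = {
    ("i", "want"): 0.2,
    ("to", "go"): 0.2,
    ("to", "valencia"): 0.15,
    ("i",): 0.05,
    ("want",): 0.05,
    ("to",): 0.1,
    ("go",): 0.05,
    ("valencia",): 0.1,
}
TOY_LOGPROBS = {seg: math.log(p) for seg, p in TOY_PROBS.items()}

segments, score = best_segmentation(
    "i want to go to valencia".split(), TOY_LOGPROBS
)
print(segments, math.exp(score))
```

In an understanding setting, each recovered segment would then be mapped to a semantic label (for example, a destination); that mapping is outside this sketch.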



Author information

Authors and Affiliations

Departament de Sistemes Informàtics i Computació (DSIC), Universitat Politècnica de València (UPV), Camí de Vera s/n, 46022, València, Spain
Lluís Hurtado, Encarna Segarra, Fernando García & Emilio Sanchis


Editor information

Editors and Affiliations

Department of Software and Computing Systems, University of Alicante, Spain
José Luis Vicedo
Natural Language Processing and Information Systems Group, Department of Software and Computing Systems, University of Alicante, Spain
Patricio Martínez-Barco
Grupo de investigación del Procesamiento del Lenguaje y Sistemas de Información, Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante, Alicante, Spain
Rafael Muñoz
Departamento de Lenguajes y Sistemas Informáticos, Carretera de San Vicente del Raspeig, Universidad de Alicante, 03690 San Vicente del Raspeig, Alicante, Spain
Maximiliano Saiz Noeda


About this paper

Cite this paper

Hurtado, L., Segarra, E., García, F., Sanchis, E. (2004). Language Understanding Using n-multigram Models. In: Vicedo, J.L., Martínez-Barco, P., Muñoz, R., Saiz Noeda, M. (eds) Advances in Natural Language Processing. EsTAL 2004. Lecture Notes in Computer Science, vol 3230. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30228-5_19


DOI: https://doi.org/10.1007/978-3-540-30228-5_19
Published: 20 October 2004
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23498-2
Online ISBN: 978-3-540-30228-5
eBook Packages: Springer Book Archive
