Generalization of Discriminative Approaches for Speech Language Understanding in a Multilingual Context

Jabaian, Bassam; Lefèvre, Fabrice; Besacier, Laurent

doi:10.1007/978-3-642-39593-2_11

Bassam Jabaian²²,
Fabrice Lefèvre²² &
Laurent Besacier²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7978))

Included in the following conference series:

International Conference on Statistical Language and Speech Processing

2659 Accesses

Abstract

Probabilistic approaches are now widespread in the various applications of natural language processing and elicitation of a particular approach usually depends on the task at hand. Targeting multilingual interpretation of speech, this paper presents a comparison between the state-of-the-art methods used for machine translation and speech understanding. This comparison justifies our proposition of a unified framework to perform a joint decoding which translates a sentence and assigns semantic tags to this translation in the same process. The decoding is achieved using a cascade of finite-state transducers allowing to compose translation and understanding hypothesis graphs. This representation is favorable as it can be generalized to allow rich transmission of information between the components of a human-machine vocal interface.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Allauzen, C., Riley, M., Schalkwyk, J., Skut, W., Mohri, M.: OpenFst: A general and efficient weighted finite-state transducer library. In: Holub, J., Žďárek, J. (eds.) CIAA 2007. LNCS, vol. 4783, pp. 11–23. Springer, Heidelberg (2007)
Chapter Google Scholar
Anoop Deoras, R.S.G.T., Hakkani-Tur, D.: Joint decoding for speech recognition and semantic tagging. In: INTERSPEECH (2012)
Google Scholar
Bonneau-Maynard, H., Rosset, S., Ayache, C., Kuhn, A., Mostefa, D.: Semantic annotation of the french media dialog corpus. In: EUROSPEECH (2005)
Google Scholar
Brown, P.F., Pietra, S.D., Pietra, V.J.D., Mercer, R.L.: The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19(2), 263–311 (1993)
Google Scholar
Crego, J.M., Mariño, J.B.: Improving statistical mt by coupling reordering and decoding. Machine Translation 20(3), 199–215 (2006)
Article Google Scholar
Crego, J.M., Yvon, F., Mariño, J.B.: Ncode: an open source bilingual n-gram smt toolkit. The Prague Bulletin of Mathematical Linguistics 96, 49–58 (2011)
Article Google Scholar
Gascó i Mora, G., Sánchez Peiró, J.A.: Part-of-speech tagging based on machine translation techniques. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds.) IbPRIA 2007. LNCS, vol. 4477, pp. 257–264. Springer, Heidelberg (2007)
Chapter Google Scholar
Hahn, S., Dinarelli, M., Raymond, C., Lefèvre, F., Lehnen, P., De Mori, R., Moschitti, A., Ney, H., Riccardi, G.: Comparing stochastic approaches to spoken language understanding in multiple languages. IEEE Transactions in Audio, Speech and Language Processing 19(6), 1569–1583 (2010)
Article Google Scholar
Hakkani-Tür, D.Z., Béchet, F., Riccardi, G., Tür, G.: Beyond asr 1-best: Using word confusion networks in spoken language understanding. In: Computer Speech and Language, pp. 495–514 (2006)
Google Scholar
Jabaian, B.: Systèmes de compréhension et de traduction de la parole: vers une approche unifiée dans le cadre de la portabilité multilingue des systèmes de dialogue. Ph.D. thesis, CERI - Universitré d’Avignon, Avignon (2012)
Google Scholar
Jabaian, B., Besacier, L., Lefèvre, F.: Investigating multiple approaches for slu portability to a new language. In: INTERSPEECH (2010)
Google Scholar
Jabaian, B., Besacier, L., Lefèvre, F.: Combination of stochastic understanding and machine translation systems for language portability of dialogue systems. In: ICASSP (2011)
Google Scholar
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., et al.: Moses: Open source toolkit for statistical machine translation. In: ACL (2007)
Google Scholar
Koehn, P., Och, F.J., Marcu, D.: Statistical phrase-based translation. In: HLT-NAACL (2003)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML (2001)
Google Scholar
Lavergne, T., Cappé, O., Yvon, F.: Practical very large scale CRFs. In: ACL (2010)
Google Scholar
Lavergne, T., Crego, J.M., Allauzen, A., Yvon, F.: From n-gram-based to crf-based translation models. In: WSMT (2011)
Google Scholar
Liang, P., Taskar, B., Klein, D.: Alignment by agreement. In: HLT-NAACL (2006)
Google Scholar
Macherey, K., Bender, O., Ney, H.: Application of statistical machine translation approaches to spoken language understanding. In: IEEE ICASSP (2009)
Google Scholar
Macherey, K., Och, F.J., Ney, H.: Natural language understanding using statistical machine translation. In: INTERSPEECH (2001)
Google Scholar
Mariño, J.B., Banchs, R.E., Crego, J.M., de Gispert, A., Lambert, P., Fonollosa, J.A.R., Costa-jussà, M.R.: N-gram-based machine translation. Computational Linguistic 32(4), 527–549 (2006)
Article MATH Google Scholar
Och, F.: Minimum error rate training in statistical machine translation. In: ACL (2003)
Google Scholar
Och, F.J., Ney, H.: Discriminative training and maximum entropy models for statistical machine translation. In: ACL (2002)
Google Scholar
Papineni, K., Roukos, S., Ward, T., Zhu, W.: Bleu: a method for automatic evaluation of machine translation. In: ACL (2002)
Google Scholar
Rama, T., Singh, A., Kolachina, S.: Modeling letter-to-phoneme conversion as a phrase based statistical machine translation problem with minimum error rate training. In: HLT-NAACL (2009)
Google Scholar
Ramshaw, L., Marcus, M.: Text chunking using transformation-based learning. In: The Workshop on Very Large Corpora (1995)
Google Scholar
Riedmiller, M., Braun, H.: A direct adaptive method for faster backpropagation learning: The RPROP algorithm. In: ICNN (1993)
Google Scholar
Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: NMLP (1994)
Google Scholar
Servan, C., Raymond, C., Béchet, F., Nocera, P.: Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA. In: INTERSPEECH (2006)
Google Scholar
Stolcke, A.: Srilm-an extensible language modeling toolkit. In: ICASSP (2002)
Google Scholar
Tür, G., Wright, J.H., Gorin, A.L., Riccardi, G., Hakkani-Tür, D.Z.: Improving spoken language understanding using word confusion networks. In: INTERSPEECH (2002)
Google Scholar
Turian, J.P., Wellington, B., Melamed, I.D.: Scalable discriminative learning for natural language parsing and translation. In: NIPS (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

LIA, University of Avignon, Avignon, France
Bassam Jabaian & Fabrice Lefèvre
LIG, University Joseph Fourrier, Grenoble, France
Laurent Besacier

Authors

Bassam Jabaian
View author publications
You can also search for this author in PubMed Google Scholar
Fabrice Lefèvre
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Besacier
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Research Group on Mathematical Linguistics, Universitat Rovira i Virgili, Avinguda Catalunya, 35, 43002, Tarragona, Spain
Adrian-Horia Dediu & Carlos Martín-Vide &
Research Institute for Information and Language Processing, Research Group in Computational Linguistics, University of Wolverhampton, WV1 1SB, Wolverhampton, UK
Ruslan Mitkov
Fakultät für Informatik, Institut für Wissens- und Sprachverarbeitung, Otto-von-Guericke-Universität Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
Bianca Truthe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jabaian, B., Lefèvre, F., Besacier, L. (2013). Generalization of Discriminative Approaches for Speech Language Understanding in a Multilingual Context. In: Dediu, AH., Martín-Vide, C., Mitkov, R., Truthe, B. (eds) Statistical Language and Speech Processing. SLSP 2013. Lecture Notes in Computer Science(), vol 7978. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39593-2_11

Download citation

DOI: https://doi.org/10.1007/978-3-642-39593-2_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39592-5
Online ISBN: 978-3-642-39593-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics