On the Use of N-Gram Transducers for Dialogue Annotation

Tamarit, Vicent; Martínez-Hinarejos, Carlos-D.; Benedí, José-Miguel

doi:10.1007/978-1-4419-7934-6_11

Vicent Tamarit⁵,
Carlos-D. Martínez-Hinarejos⁵ &
José-Miguel Benedí⁵

480 Accesses
3 Citations
1 Altmetric

Abstract

The implementation of dialogue systems is one of the most interesting applications of language technologies. Statistical models can be used in this implementation, allowing for a more flexible approach than when using rules defined by a human expert. However, statistical models require large amounts of dialogues annotated with dialogue-function labels (usually Dialogue Acts), and theannotation process is hard and time-consuming. Consequently, the use of other statistical models to obtain faster annotations is really interesting for the development of dialogue systems. In this work we compare two statistical models for dialogue annotation, a more classical Hidden Markov Model (HMM) based model and the new N-gram Transducers (NGT) model. This comparison is performed on two corpora of different nature, the well-known SwitchBoard corpus and the DIHANA corpus. The results show that the NGT model produces a much more accurate annotation that the HMM-based model (even 11% less error in the SwitchBoard corpus).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, Valencia, Spain
Vicent Tamarit, Carlos-D. Martínez-Hinarejos & José-Miguel Benedí

Authors

Vicent Tamarit
View author publications
You can also search for this author in PubMed Google Scholar
Carlos-D. Martínez-Hinarejos
View author publications
You can also search for this author in PubMed Google Scholar
José-Miguel Benedí
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vicent Tamarit .

Editor information

Editors and Affiliations

Fak. Ingenieurwissenschaften und, Elektrotechnik, Universität Ulm, Albert-Einstein-Allee 43, Ulm, 89081, Germany
Wolfgang Minker
Technology (POSTECH), Dept. Computer Science & Engineering, Pohang University of Science &, San 31, Hyoja-dong, Pohang, Kyungbuk, 790-784, Korea, Republic of (South Korea)
Gary Geunbae Lee
Communications Technology, National Institute of Information and, Kyoto, 69121, Japan
Satoshi Nakamura
Multilingual and Multimedia Information, CNRS, Orsay, 91403, France
Joseph Mariani

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tamarit, V., Martínez-Hinarejos, CD., Benedí, JM. (2011). On the Use of N-Gram Transducers for Dialogue Annotation. In: Minker, W., Lee, G., Nakamura, S., Mariani, J. (eds) Spoken Dialogue Systems Technology and Design. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-7934-6_11

Download citation

DOI: https://doi.org/10.1007/978-1-4419-7934-6_11
Published: 01 November 2010
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-7933-9
Online ISBN: 978-1-4419-7934-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics