Input segmentation of spontaneous speech in JANUS: A speech-to-speech translation system

Lavie, Alon; Gates, Donna; Coccaro, Noah; Levin, Lori

doi:10.1007/3-540-63175-5_39

Alon Lavie¹,
Donna Gates¹,
Noah Coccaro¹ &
…
Lori Levin¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1236))

Included in the following conference series:

Workshop on Dialogue Processing in Spoken Language Systems

109 Accesses
1 Citations

Abstract

JANUS is a multi-lingual speech-to-speech translation system designed to facilitate communication between two parties engaged in a spontaneous conversation in a limited domain. In this paper we describe how multi-level segmentation of single utterance turns improves translation quality and facilitates accurate translation in our system. We define the basic dialogue units that are handled by our system, and discuss the cues and methods employed by the system in segmenting the input utterance into such units. Utterance segmentation in our system is performed in a multi-level incremental fashion, partly prior and partly during analysis by the parser. The segmentation relies on a combination of acoustic, lexical, semantic and statistical knowledge sources, which are described in detail in the paper. We also discuss how our system is designed to disambiguate among alternative possible input segmentations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R. Hausser. Principles of Computational Morphology. Technical Report, Laboratory for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 1989.
Google Scholar
A. Lavie. An Integrated Heuristic Scheme for Partial Parse Evaluation, Proceedings of the 32nd Annual Meeting of the ACL (ACL-94), Las Cruces, New Mexico, June 1994.
Google Scholar
A. Lavie and M. Tomita. GLR^* — An Efficient Noise Skipping Parsing Algorithm for Context Free Grammars, Proceedings of the third International Workshop on Parsing Technologies (IWPT-93), Tilburg, The Netherlands, August 1993.
Google Scholar
L. Levin, D. Evans, and D. Gates. The ALICE System: A Workbench for Learning and Using Language. Computer Assisted Language Instruction Consortium (CALICO) Journal, Autumn 1991, 27–56.
Google Scholar
L. Mayfield, M. Gavaldà, Y-H. Seo, B. Suhm, W. Ward, A. Waibel. Parsing Real Input in JANUS: a Concept-Based Approach. In Proceedings of TMI 95.
Google Scholar
C. P. Rosé, B. Di Eugenio, L. S. Levin, and C. Van Ess-Dykema. Discourse processing of dialogues with multiple threads. In Proceedings of ACL'95, Boston, MA, 1995.
Google Scholar
M. Seligman, J. Hosaka, and H. Singer: “Pause Units” and Analysis of Sponta.-neous Japanese Dialogues: Preliminary Studies This volume, 1997.
Google Scholar
S. M. Shieber. An Introduction to Unification-Based Approaches to Grammar, CSLI Lecture Notes, No. 4, 1986.
Google Scholar
M. Tomita and E. H. Nyberg 3rd. Generation Kit and Transformation Kit, Version 3.2: User's Manual. Technical Report CMU-CMT-88-MEMO, Carnegie Mellon University, Pittsburgh, PA, October 1988.
Google Scholar
M. Woszczyna, N. Aoki-Waibel, F. D. Buo, N. Coccaro, K. Horiguchi, T. Kemp, A. Lavie, A. McNair, T. Polzin, I. Rogina, C. P. Rosé, T. Schultz, B. Suhm, M. Tomita, and A. Waibel. JANUS-93: Towards Spontaneous Speech Translation. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'94), 1994.
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Machine Translation, Carnegie Mellon University, 5000 Forbes Ave., 15213, Pittsburgh, PA, USA
Alon Lavie, Donna Gates, Noah Coccaro & Lori Levin

Authors

Alon Lavie
View author publications
You can also search for this author in PubMed Google Scholar
Donna Gates
View author publications
You can also search for this author in PubMed Google Scholar
Noah Coccaro
View author publications
You can also search for this author in PubMed Google Scholar
Lori Levin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Elisabeth Maier Marion Mast Susann LuperFoy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lavie, A., Gates, D., Coccaro, N., Levin, L. (1997). Input segmentation of spontaneous speech in JANUS: A speech-to-speech translation system. In: Maier, E., Mast, M., LuperFoy, S. (eds) Dialogue Processing in Spoken Language Systems. DPSLS 1996. Lecture Notes in Computer Science, vol 1236. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63175-5_39

Download citation

DOI: https://doi.org/10.1007/3-540-63175-5_39
Published: 04 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63175-0
Online ISBN: 978-3-540-69206-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics