Incremental Speech Translation: A Layered Chart Approach

Amtrup, Jan W.

doi:10.1007/978-3-642-72283-7_15

Jan W. Amtrup²

Part of the book series: Informatik aktuell ((INFORMAT))

116 Accesses

Abstract

Human speech understanding works incrementally. We begin to process acoustic input before the speaker’s utterance has ended. A system capable of performing sophisticated communication in a natural dialogue or simultaneous interpreting, has to work incrementally, too. The architecture of such a system should be modular, uniform and integrated. We present an architectural framework that suits these three requirements by implementing layered charts, a multi-purpose data structure intended to represent several competing hypotheses about linguistic content of utterance intervals based on hyper graphs. We demonstrate the feasibility by presenting results from an actual interpreting system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Recognising Conversational Speech: What an Incremental ASR Should Do for a Dialogue System and How to Get There

Incremental Unit Networks for Distributed, Symbolic Multimodal Processing and Representation

AlignTool: The automatic temporal alignment of spoken utterances in German, Dutch, and British English for psycholinguistic purposes

Article 29 January 2018

References

Steven Abney. Partial Parsing via Finite-State Cascades. In Proceedings of the ESSLLI ’96 Robust Parsing Workshop, 1996.
Google Scholar
Jan Alexandersson, Norbert Reithinger, and Elisabeth Maier. Insights into the Dialogue Processing of Verbmobil. In Proc. of the 5th Conference on Applied Natural Language Processing, Washington, D.C., 1997.
Google Scholar
Jan W. Amtrup. ICE: A Communication Environment for Natural Language Processing. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA97), Las Vegas, NV, July 1997.
Google Scholar
Jan W. Amtrup. Layered Charts for Speech Translation. In Proceedings of the Seventh International Conference on Theoretical and Methodological Issues in Machine Translation, TMI ’97, Santa Fe, NM, July 1997.
Google Scholar
Jan W. Amtrup. Perspectives for Incremental MT with Charts. In Christa Hauenschild and Susanne Heizmann, editors, Machine Translation and Translation Theory. Perspectives of Cooperation, Text, Translation, Computational Processing (TTCP), number 1. Mouton de Gruyter, 1997.
Google Scholar
Jan W. Amtrup and Volker Weber. Time Mapping with Hypergraphs. In Proc. of the 17 th COLING, Montreal, Canada, 1998.
Google Scholar
Xavier Aubert and Hermann Ney. Large Vocabulary Continuous Speech Recognition Using Word Graphs. In ICASSP 95, 1995.
Google Scholar
Christian Boitet and Mark Seligman. The “Whiteboard” Architecture: A Way to Integrate Heterogeneous Components of NLP systems. In COLING-94: The 15th International Conference on Computational Linguistics, Kyoto, Japan, 1994.
Google Scholar
Thomas Bub, Wolfgang Wahlster, and Alex Waibel. Verbmobil: The Combination of Deep and Shallow Processing for Spontaneous Speech Translation. In Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 1/7a1–1/74, Munich, Germany, 1997.
Google Scholar
Bob Carpenter. The Logic of Typed Feature Structures. Tracts in Theoretical Computer Science. Cambridge University Press, Cambridge, 1992.
Book Google Scholar
David Carter et al. Translation Methodology in the Spoken Language Translator: An Evaluation. In ACL Workshop on Spoken Language Translation, 1997.
Google Scholar
G. V. Chernov. Message redundancy and message anticipation in simultaneous interpretation. In Lambert and Moser-Mercer, editors, Bridging the Gap: Empirical Research in Simultaneous Interpretation, pages 139–153. John Benjamins, 1994.
Google Scholar
Michael Finke, Maria Lapata, Alon Lavie, Lori Levin, Laura Mayfield Tomokiyo, Thomas Polzin, Klaus Ries, Alex Waibel, and Klaus Zechner. Clarity: Inferring Discourse Structure from Speech. In Proceedings of the AAAI 98 Spring Symposium: Applying Machine Learning to Discourse Processing, pages 23–32, Stanford, CA, 1998.
Google Scholar
Wolfgang Finkler and Anne Schauder. Effects of Incremental Output on Incremental Natural Language Generation. In Proc. of the 10 th ECAI, pages 505–507, Vienna, Austria, August 1992.
Google Scholar
D. Gerver. Empirical studies of simultaneous interpretation: A review and a model. In R.W. Brislin, editor, Translation: Applicatons and Research, pages 165–207. Gardner Press, New York, 1997.
Google Scholar
Günther Görz, Marcus Kesseler, Jörg Spilker, and Hans Weber. Research on Architectures for Integrated Speech/Language Systems in Verbmobil. In Proc. of the 16 th COLING, pages 484–489, Copenhagen, Denmark, August 1996.
Google Scholar
Andreas Hauenstein and Hans Weber. An Investigation of Tightly Coupled Speech Language Interfaces Using an Unification Grammar. In Proceedings of the Workshop on Integration of Natural Language and Speech Processing at AAAI’94, pages 42–50, Seattle, WA, 1994.
Google Scholar
Kai Huebener, Uwe Jost, and Henrik Heine. Speech Recognition for Spontaneously Spoken German Dialogs. In ICSLP96, Philadelphia, 1996.
Google Scholar
Susanne J. Jekat. Automatic Interpretation of Dialogue Acts. In Christa Hauenschild and Susanne Heizmann, editors, Machine Translation and Translation Theory. Perspectives of Cooperation, Text, Translation, Computational Processing (TTCP), number 1. Mouton de Gruyter, 1997.
Google Scholar
Martin Kay. Algorithmic Schemata and Data Structures in Syntactic Processing. Technical Report CSL-80–12, Xerox Palo Alto Research Center, Palo Alto, 1980.
Google Scholar
Martin Kay. Chart generation. In Proc. of the 34 th ACL, pages 200–204, Santa Cruz, CA, June 1996.
Google Scholar
Hiroaki Kitano. Speech-to-Speech Translation: A Massively Parallel Memory-Based Approach. Kluwer Academic Publishers, Boston, 1994.
MATH Google Scholar
Marc Light. CHUMP: Partial Parsing and Underspecified Representations. In Proceedings of the ECAI-96 Workshop: Corpus-Oriented Semantic Analysis, 1996.
Google Scholar
W.D. Marslen-Wilson. Functional Parallelism in Spoken Word Recognition. Cognition, 25: 71–102, 1987.
Article Google Scholar
W.D Marslen-Wilson and A. Welsh. Processing Interactions during Word Recognition in Continuous Speech. Cognitive Psychology, 10: 29–63, 1978.
Article Google Scholar
Heinrich Niemann, Elmar Nöth, Andreas Kiessling, Ralf Kompe, and Anton Batliner. Prosodic Processing and its Use in Verbmobil. In Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, 1997.
Google Scholar
Michael Niv. A Computational Model of Syntactic Processing: Ambiguity Resolution from Interpretation. PhD thesis, Univ. of Pennsylvania, 1993.
Google Scholar
Manny Rayner and David Carter. Hybrid Language Processing in the Spoken Language Translator. In Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Munich, Germany, 1997. http://www.cam.sri.com/tr/crc064/paper.ps.Z.
Google Scholar
Shuly Wintner and Nissim Francez. Parsing with Typed Feature Structures. In Proceedings of the 4 ^th International Workshop on Parking Technologies (IWPT95), pages 273–287, Prague, September 1995. Charles University.
Google Scholar
Klaus Zechner and Alex Waibel. Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition. In COLING98P, COLING98L, 1998.
Google Scholar
P. Zwitserlood. The Locus of Effects of Sentential-Semantic Context in Spoken-Word Processing. Cognition, 32: 25–64, 1989.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Computing Research Lab, New Mexico State University, Las Cruces, NM, USA
Jan W. Amtrup

Authors

Jan W. Amtrup
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institut für Wissens- und Sprachverarbeitung Fakultät für Informatik, Otto-von-Guericke-Universität Magdeburg, Universitätsplatz 2, D-39106, Magdeburg, Germany
Jürgen Dassow & Rudolf Kruse &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Amtrup, J.W. (1998). Incremental Speech Translation: A Layered Chart Approach. In: Dassow, J., Kruse, R. (eds) Informatik ’98. Informatik aktuell. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-72283-7_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-72283-7_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64938-0
Online ISBN: 978-3-642-72283-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics