Abstract
Computers have become an essential part of modern life, providing services in a multiplicity of ways. Access to these services, however, comes at a price: human attention is bound and directed toward a technical artifact in a human-machine interaction setting at the expense of time and attention for other humans. This paper explores a new class of computer services that support human-human interaction and communication implicitly and transparently. Computers in the Human Interaction Loop (CHIL), require consideration of all communication modalities, multimodal integration and more robust performance. We review the technologies and several CHIL services providing human-human support. Among them, we specifically highlight advanced computer services for cross-lingual communication.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Stiefelhagen, R., Bernardin, K., Bowers, R., Garafolo, J., Mostefa, D., Soundararajan, P.: The CLEAR 2006 Evaluation. In: Stiefelhagen, R., Garofolo, J. (eds.) CLEAR 2006. LNCS, vol. 4122, Springer, Heidelberg (2007)
Fiscus, J., Ajot, J., Michel, M., Garofolo, J.: The rich transcription 2006 spring meeting recognition evaluation. In: Renals, S., Bengio, S., Fiscus, J.G. (eds.) MLMI 2006. LNCS, vol. 4299, Springer, Heidelberg (2006)
Canton-Ferrer, C., Casas, J.R., Pardàs, M.: Human Model and Motion Based 3D Action Recognition in Multiple View Scenarios. In: EUSIPCO, Firenze (September 2006)
Lanz, O.: Approximate Bayesian Multibody Tracking. IEEE Trans. PAMI 28(9) (September 2006)
Stiefelhagen, R., Bernardin, K., Ekenel, H.K., McDonough, J., Nickel, K., Voit, M., Wölfel, M.: Audio-Visual Perception of a Lecturer in a Smart Seminar Room. Signal Processing 86(12) (December 2006)
Wölfel, M., Nickel, K., McDonough, J.: Microphone array driven speech recognition: Influence of localization on the word error rate. In: Renals, S., Bengio, S. (eds.) MLMI 2005. LNCS, vol. 3869, Springer, Heidelberg (2006)
Maganti, H.K., Gatica-Perez, D.: Speaker Localization for Microphone Array-Based ASR: The Effects of Accuracy on Overlapping Speech. In: ICMI, Banff, Canada (November 2006)
Wojek, C., Nickel, K., Stiefelhagen, R.: Activity Recognition and Room-Level Tracking in an Office Environment. In: Proc. of the IEEE Intl. Conference on Multisensor Fusion and Integration for Intelligent Systems, Heidelberg, Germany (2006)
Stiefelhagen, R., Yang, J., Waibel, A.: Modeling Focus of Attention for Meeting Indexing. In: ACM Multimedia, Orlando, Florida (October 1999)
Voit, M., Stiefelhagen, R.: Tracking Head Pose and Focus of Attention with Multiple Far-field Cameras. In: ICMI, Banff, Canada (November 2006)
CHIL – Computers in the Human Interaction Loop, http://chil.server.de
VACE – Video Analysis and Content Extraction, http://www.ic-arda.org
TRECVID – TREC Video Retrieval Evaluation, http://www-nlpir.nist.gov/projects/t01v/
PETS – Performance Evaluation of Tracking and Surveillance, http://www.pets2006.net/
ETISEO – Video Understanding Evaluation, http://www.silogic.fr/etiseo
D2.2 Functional Requirements & CHIL Cooperative Information System Software Design, Part 2, Cooperative Information System Software Design, http://chil.server.de
Waibel, A., Bett, M., Finke, M., Stiefelhagen, R.: Meeting browser: Tracking and summarizing meetings. In: Proceedings of the Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia, pp. 281–286 (1998)
Bouamrane, M.-M., Luz, S.: Meeting browsing. Multimedia Systems 12(4-5), 439–457 (2006)
Wang, Q.Y., Battocchi, A., Graziola, I., Pianesi, F., Tomasini, D., Zancanaro, M., Nass, C.: The Role of Psychological Ownership and Ownership Markers in Collaborative Working Environment. In: ICMI, Banff, Canada (2006)
Danninger, M., Kluge, T., Stiefelhagen, R.: MyConnector – Analysis of Context Cues to Predict Human Availability for Communication. In: ICMI, Banff, Canada (2006)
Neumann, J., Casas, J.R., Macho, D., Ruiz, J.: Multimodal Integration of Sensor Networks. In: Proc. of AIAI, Athens, Greece, pp. 312–323 (2006)
Waibel, A., Jain, A.N., McNair, A.E., Saito, H., Hauptmann, A.G., Tebelskis, J.: JANUS: A Speech-to-speech Translation Using Connectionist and Symbolic Processing Strategies. In: Proc. of ICASSP 1991, pp. 793–796 (May 1991)
Morimoto, T., Takezawa, T., Yato, F., Sagayama, S., Tashiro, T., Nagata, M., Kurematsu, A.: ATR’s speech translation system: ASURA. In: Proc. 3rd European Conf. on Speech Communication and Technology, pp. 1291–1294 (September 1993)
Hsiao, R., Venugopal, A., Köhler, T., Zhang, Y., Charoenpornsawat, P., Zollmann, A., Vogel, S., Black, A.W., Schultz, T., Waibel, A.: Optimizing Components for Handheld Two-way Speech Translation for English-Iraqi Arabic System. In: Proceedings of Interspeech (2006)
Gauvain, J.L.: Speech transcription: general presentation of existing technologies within TC-Star. In: TC-Star Review Workshop, May 28-30, 2007, Luxembourg (2007)
Ney, H.: TC-Star: Statistical MT of Text and Speech. In: TC-Star Review Workshop, May 28-30, 2007, Luxembourg (2007)
Choukri, K.: Importance of the Evaluation of Human-Language Technologies. In: TC-Star Review Workshop, May 28-30, 2007, Luxembourg (2007)
Kolss, M., Zhao, B., Vogel, S., Hildebrand, A., Niehues, J., Venugopal, A., Zhang, Y.: The ISL Statistical Machine Translation System for the TC-STAR Spring 2006 Evaluation. In: Proc. of the TC-STAR Workshop on Speech-to-Speech Translation, Barcelona, Spain (June 2006)
Fügen, C., Kolss, M., Paulik, M., Waibel, A.: Open Domain Speech Translation: From Seminars and Speeches to Lectures. In: Proc. of the TC-STAR Workshop on Speech-to-Speech Translation, Barcelona, Spain (2006)
Fiscus, J., Ajot, J.: The Rich Transcription 2007 Speech-To-Text (STT) and Speaker Attributed STT (SASTT) Results. In: The Rich Transcription 2007 Meeting Recognition (2007)
Olszewski, D., Prasetyo, F., Linhard, K.: Steerable Highly Directional Audio Beam Louspeaker. In: Proc. of the Interspeech, Lisboa, Portugal (September 2006)
Schultz, T.: Multilinguale Spracherkennung - Kombination akustischer Modelle zur Portierung auf neue Sprachen. PhD thesis, Universität Karlsruhe (June 2000)
Eck, M., Vogel, S., Waibel, A.: Low Cost Portability for Statistical Machine Translation based on N-gram Frequency and TF-IDF. In: Proc. of IWSLT, Pittsburgh, PA (October 2005)
Gavalda, M., Waibel, A.: Growing semantic grammars. In: Proceedings of the COLING/ACL, Montreal, Canada (1998)
Paulik, M., Stüker, S., Fügen, C., Schultz, T., Schaaf, T., Waibel, A.: Speech Translation Enhanced Automatic Speech Recognition. In: ASRU, Cancun, Mexico (December 2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Waibel, A., Bernardin, K., Wölfel, M. (2007). Computer-Supported Human-Human Multilingual Communication. In: Lungarella, M., Iida, F., Bongard, J., Pfeifer, R. (eds) 50 Years of Artificial Intelligence. Lecture Notes in Computer Science(), vol 4850. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77296-5_25
Download citation
DOI: https://doi.org/10.1007/978-3-540-77296-5_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77295-8
Online ISBN: 978-3-540-77296-5
eBook Packages: Computer ScienceComputer Science (R0)