ABSTRACT
Blue Herd is an IBM Research project investigating automated captioning for videoconferences. Today, videoconference participants connect through a variety of devices: personal computers, mobile devices, and multi-participant meeting rooms. Blue Herd is charged with studying automated real-time captioning in that context. This poster explains the system that was developed for personal computers and describes our experiments to include mobile devices and multi-participant meeting rooms.
Index Terms
Blue herd: automated captioning for videoconferences