skip to main content
10.1145/2207016.2207053acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesw4aConference Proceedingsconference-collections
research-article

Enhancing learning accessibility through fully automatic captioning

Authors Info & Claims
Published:16 April 2012Publication History

ABSTRACT

The simple act of listening or of taking notes while attending a lesson may represent an insuperable burden for millions of people with some form of disabilities (e.g., hearing impaired, dyslexic and ESL students). In this paper, we propose an architecture that aims at automatically creating captions for video lessons by exploiting advances in speech recognition technologies. Our approach couples the usage of off-the-shelf ASR (Automatic Speech Recognition) software with a novel caption alignment mechanism that smartly introduces unique audio markups into the audio stream before giving it to the ASR and transforms the plain transcript produced by the ASR into a timecoded transcript.

References

  1. Unesco Report 2005 - The quality imperative. Global Monitoring Report, 2005. {on-line} Available at http://www.unesco.orgGoogle ScholarGoogle Scholar
  2. M. Xu, S. Yan, T-S. Chua, R. Hong, M. Wang. Dynamic captioning: video accessibility enhancement for hearing impairment. In Proc of the ACM Multimedia Conference, pp. 421--430, New York, NY, USA, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. L. Jelinek, D. Jackson. Television literacy: comprehension of program content using closed captions for the deaf. Journal of Deaf Stud. Deaf Educ., Vol. 6, N. 1, pp. 43--53, 2001.Google ScholarGoogle Scholar
  4. T. Garza. Evaluating the use of captioned video materials in advanced foreign language learning. Foreign Language Annals, Vol. 24, N. 3, pp. 239--258, May 1991.Google ScholarGoogle ScholarCross RefCross Ref
  5. S. Tsuboi, N. Shimogori, T. Ikeda. Automatically generated captions: will they help non-native speakers communicate in english? In Proc of Intercultural collaboration conference, ICIC '10, pp. 79--86, New York, NY, USA, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. Penn, E. Toms, D. James, C. Munteanu, R. Baecker. The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives. In Proc of the SIGCHI conference on Human Factors in computing systems, CHI '06, pp. 493--502, New York, NY, USA, 2006. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Wald. Crowdsourcing correction of speech recognition captioning errors. In Proc of the W4A Conference, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Knight, K. C. Almeroth. Fast caption alignment for automatic indexing of audio. International Journal of Multimedia Data Engineering and Management, Vol. 1, N. 2, pp. 1--17. June 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Enhancing learning accessibility through fully automatic captioning

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            W4A '12: Proceedings of the International Cross-Disciplinary Conference on Web Accessibility
            April 2012
            189 pages
            ISBN:9781450310192
            DOI:10.1145/2207016

            Copyright © 2012 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 16 April 2012

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate171of371submissions,46%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader