Enhancing learning accessibility through fully automatic captioning

ABSTRACT
The simple act of listening to or taking notes during a lesson can be an insuperable burden for millions of people with disabilities or language difficulties (e.g., hearing-impaired, dyslexic, and ESL students). In this paper, we propose an architecture that automatically creates captions for video lessons by exploiting advances in speech recognition technology. Our approach couples off-the-shelf ASR (Automatic Speech Recognition) software with a novel caption alignment mechanism: unique audio markups are inserted into the audio stream before it is fed to the ASR engine, and the plain transcript produced by the ASR is then transformed into a timecoded transcript by locating the recognized markups.
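The alignment idea described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes the markups injected into the audio are recognized by the ASR as unique tokens (here the hypothetical names `MARKER1`, `MARKER2`, ...) whose injection times are known, so the plain transcript can be split at those tokens and each segment anchored to the timeline.

```python
# Hypothetical sketch of marker-based caption alignment: unique audio
# markups are injected at known offsets before running the ASR; the
# recognized marker tokens then anchor the transcript to the timeline.
# Marker names and timings below are illustrative assumptions.

MARKER_TIMES = {"MARKER1": 0.0, "MARKER2": 5.0, "MARKER3": 10.0}

def timecode_transcript(transcript, marker_times):
    """Split an ASR transcript at recognized marker tokens and attach
    the known injection times as caption segment boundaries.

    Returns a list of (start, end, text) tuples; the final segment has
    end=None because no closing marker follows it.
    """
    segments = []
    current_start = None
    words = []
    for token in transcript.split():
        if token in marker_times:
            # A marker closes the previous segment and opens a new one.
            if current_start is not None and words:
                segments.append((current_start, marker_times[token], " ".join(words)))
            current_start = marker_times[token]
            words = []
        else:
            words.append(token)
    if current_start is not None and words:
        segments.append((current_start, None, " ".join(words)))
    return segments

captions = timecode_transcript(
    "MARKER1 welcome to the lesson MARKER2 today we study captions MARKER3 thanks",
    MARKER_TIMES,
)
# Each caption segment now carries the timecodes of the surrounding markers.
```

In a real pipeline the segment boundaries would be refined (e.g., by interpolating per-word timings within each segment), but the core mapping from plain transcript to timecoded transcript follows this pattern.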