Abstract
The broadcasting and the Internet are important parts of modern society that a life without media is now unimaginable. However, hearing impaired people have difficulty in understanding media content due to the loss of audio information. If subtitles are available, subtitling with video can be helpful. In this paper, we propose a dynamic subtitle authoring method based on audio analysis for the hearing impaired. We analyze the audio signal and explore a set of audio features that include STE, ZCR, Pitch and MFCC. Using these features, we align the subtitle with the speech and match extracted speech features to subtitle as different text colors, sizes and thicknesses. Furthermore, it highlights the text via aligning them with the voice and tagging the speaker ID using the speaker recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
BSeries, B.T.: Accessibility to broadcasting services for persons with disabilities (2011)
Abrahamian, S.: N. T. S. C. In: EIA-608 and EIA-708 closed captioning (2006)
Boyd, J., Vader, E.A.: Captioned television for the deaf. Am. Ann. Hearing Impaired 117(1), 32–37 (1972)
Hong, R., et al.: Dynamic captioning: video accessibility enhancement for hearing impairment. In: Proceedings of the International Conference on Multimedia. ACM (2010)
Seto, S., et al.: Subtitle system visualizing non-verbal expressions in voice for hearing impaired-Ambient Font. In: Proceeding of the 10th Asia-Pacific Industrial Engineering and Management Systems (2010)
Ververidis, D., Kotropoulos, C.: Emotional speech recognition: Resources, features, and methods. Speech Communication 48(9), 1162–1181 (2006)
Jalil, M., Butt, F.A., Malik, A.: Short-time energy, magnitude, zero crossing rate and autocorrelation measurement for discriminating voiced and unvoiced segments of speech signals. In: International Conference on Technological Advances in Electrical, Electronics and Computer Engineering (2013)
Hess, W.: Pitch Determination of Speech Signals. Springer (1983)
Hasan, M.R., Jamil, M., Rabbani, M.G., Rahman, M.S.: Speaker identification using mel frequency cepstral coefficients (2004)
https://instruct1.cit.cornell.edu/courses/ece576/FinalProjects/f2008/pae26_jsc59/pae26_jsc59/
Kim, N.: A Study on Multimedia Application Service using DTV Closed Caption Data. Journal of Broadcast Engineering (2009)
Peter, O.L.: Making Television Accessible. Report published by the International Tele-communications Union, in collaboration with The Global Initiative for Inclusive Information and Communication Technologies. ITU. Media accessibility 101 (2011)
Maryon, E.: The Science of Tone-Color. CC Birchard & Co., Boston (1924)
Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10(1) (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Lim, W., Jang, I., Ahn, C. (2014). Dynamic Subtitle Authoring Method Based on Audio Analysis for the Hearing Impaired. In: Miesenberger, K., Fels, D., Archambault, D., Peňáz, P., Zagler, W. (eds) Computers Helping People with Special Needs. ICCHP 2014. Lecture Notes in Computer Science, vol 8547. Springer, Cham. https://doi.org/10.1007/978-3-319-08596-8_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-08596-8_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08595-1
Online ISBN: 978-3-319-08596-8
eBook Packages: Computer ScienceComputer Science (R0)