Support System for Lecture Captioning Using Keyword Detection by Automatic Speech Recognition

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9759)

Abstract

We propose a support system for lecture captioning. The system detects the keywords of a lecture and presents them to captionists. With this support, captionists can follow what the instructor said even when they cannot understand the keywords themselves, and they can input a keyword rapidly by pressing the corresponding function key. The system detects keywords by automatic speech recognition (ASR). To improve the keyword detection rate, we adapt the ASR language model using web documents. We collected 2,700 web documents, which include 1.2 million words and 5,800 sentences. In an experiment on keyword detection in a real lecture, the system with the adapted language model achieved a higher F-measure (0.957) than the base language model (0.871).
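For readers unfamiliar with the metric, the F-measure reported above combines precision and recall of the detected keywords. The following is a minimal sketch, assuming the balanced F-measure (F1) computed over keyword occurrences; the abstract does not state the exact scoring protocol, and the function name and sample keywords are hypothetical.

```python
from collections import Counter


def keyword_f_measure(reference, detected):
    """Return (precision, recall, F-measure) for detected keywords.

    Assumption: balanced F-measure (F1) over keyword occurrences.
    The paper's exact scoring protocol is not given in the abstract.
    """
    ref = Counter(reference)
    det = Counter(detected)
    # Multiset intersection: an occurrence counts as correct only as
    # many times as it appears in both lists.
    true_pos = sum((ref & det).values())
    n_det = sum(det.values())
    n_ref = sum(ref.values())
    precision = true_pos / n_det if n_det else 0.0
    recall = true_pos / n_ref if n_ref else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical example: keywords in the lecture vs. keywords found by ASR.
reference = ["phoneme", "HMM", "n-gram", "perplexity"]
detected = ["phoneme", "HMM", "n-gram", "bigram"]
precision, recall, f_measure = keyword_f_measure(reference, detected)
```

Here three of the four detected keywords are correct and three of the four reference keywords are found, so precision, recall, and F-measure all equal 0.75.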

Author information

Corresponding author

Correspondence to Naofumi Ikeda.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Ikeda, N., Takeuchi, Y., Matsumoto, T., Kudo, H., Ohnishi, N. (2016). Support System for Lecture Captioning Using Keyword Detection by Automatic Speech Recognition. In: Miesenberger, K., Bühler, C., Penaz, P. (eds) Computers Helping People with Special Needs. ICCHP 2016. Lecture Notes in Computer Science, vol 9759. Springer, Cham. https://doi.org/10.1007/978-3-319-41267-2_53

  • DOI: https://doi.org/10.1007/978-3-319-41267-2_53

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41266-5

  • Online ISBN: 978-3-319-41267-2

  • eBook Packages: Computer Science, Computer Science (R0)
