Skip to main content

Automatic Summarization of Activities Depicted in Instructional Videos by Use of Speech Analysis

  • Conference paper
Ambient Assisted Living and Daily Activities (IWAAL 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8868))

Included in the following conference series:

Abstract

Existing activity recognition based assistive living solutions have adopted a relatively rigid approach to modelling activities. To address the deficiencies of such approaches, a goal-oriented solution has been proposed that will offer a method of flexibly modelling activities. This approach does, however, have a disadvantage in that the performance of goals may vary hence requiring differing video clips to be associated with these variations. In order to address this shortcoming, the use of rich metadata to facilitate automatic sequencing and matching of appropriate video clips is necessary. This paper introduces a mechanism of automatically generating rich metadata which details the actions depicted in video files to facilitate matching and sequencing. This mechanism was evaluated with 14 video files, producing annotations with a high degree of accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. De Luca, d.E., Bonacci, S., Giraldi, G.: Aging populations: the health and quality of life of the elderly. Clin. Ter. 162, e13 (2011)

    Google Scholar 

  2. Acampora, G., Cook, D.J., Rashidi, P., Vasilakos, A.V.: A Survey on Ambient Intelligence in Health Care. Proc. IEEE. Inst. Electr. Electron. Eng. 101, 2470–2494 (2013)

    Article  Google Scholar 

  3. Chen, L., Hoey, J., Nugent, C.D., Cook, D.J., Yu, Z.: Sensor-Based Activity Recognition. IEEE Trans. Syst. Man, Cybern. Part C (Applications Rev.), 1–19 (2012)

    Google Scholar 

  4. Lapointe, J., Bouchard, B., Bouchard, J.: Smart homes for people with Alzheimer’s disease: adapting prompting strategies to the patient’s cognitive profile. In: Proc. 5th Int. Conf. PErvasive Technol. Relat. to Assist. Environ., vol. 3 (2012)

    Google Scholar 

  5. Chan, M., Estève, D., Escriba, C., Campo, E.: A review of smart homes- present state and future challenges. Comput. Methods Programs Biomed. 91, 55–81 (2008)

    Article  Google Scholar 

  6. Cook, D.J., Das, S.K.: How smart are our environments? An updated look at the state of the art. Pervasive Mob. Comput. 3, 53–73 (2007)

    Article  Google Scholar 

  7. Rafferty, J., Chen, L., Nugent, C.: Ontological Goal Modelling for Proactive Assistive Living in Smart Environments. Ubiquitous Computing and Ambient Intelligence. In: Context-Awareness and Context-Driven Interaction, pp. 262–269 (2013)

    Google Scholar 

  8. Mihailidis, A., Boger, J.N., Craig, T., Hoey, J.: The COACH prompting system to assist older adults with dementia through handwashing: an efficacy study. BMC Geriatr 8, 28 (2008)

    Article  Google Scholar 

  9. Filippova, K., Hall, K.: Improved video categorization from text metadata and user comments. In: Proc. 34th Int. ACM SIGIR Conf. Res. Dev. Inf. Retr., SIGIR 2011, pp. 835–842 (2011)

    Google Scholar 

  10. Papadopoulos, D.P., Kalogeiton, V.S., Chatzichristofis, S.A., Papamarkos, N.: Automatic summarization and annotation of videos with lack of metadata information. Expert Syst. Appl. 40, 5765–5778 (2013)

    Article  Google Scholar 

  11. Ballan, L., Bertini, M., Bimbo, A., Seidenari, L., Serra, G.: Event detection and recognition for semantic annotation of video. Multimed. Tools Appl. 51, 279–302 (2010)

    Article  Google Scholar 

  12. McCloskey, S., Davalos, P.: Activity detection in the wild using video metadata. Pattern Recognit, 3140–3143 (2012)

    Google Scholar 

  13. Perea-Ortega, J.M., Montejo-Ráez, A., Martín-Valdivia, M.T., Ureña-López, L.A.: Semantic tagging of video ASR transcripts using the web as a source of knowledge. Comput. Stand. Interfaces. 35, 519–528 (2013)

    Article  Google Scholar 

  14. Lawton, M., Brody, E.: Instrumental Activities of Daily Living Scale, IADL (1988)

    Google Scholar 

  15. Rafferty, J., Nugent, C., Chen, L., Qi, J., Dutton, R., Zirk, A., Boye, L.T., Kohn, M., Hellman, R.: NFC based provisioning of instructional videos to assist with instrumental activities of daily living. Engineering in Medicine and Biology Society (2014)

    Google Scholar 

  16. Mehla, R., Aggarwal, R.: Automatic Speech Recognition: A Survey. Int. J. Adv. Res. Comput. Sci. Electron. Eng. 3, 45–53 (2014)

    Google Scholar 

  17. Google: Google Speech API, http://www.google.com/speech-api/v1/recognize

  18. Dice, L.R.: Measures of the amount of ecologic association between species. Ecology 26, 297–302 (1945)

    Article  Google Scholar 

  19. Chen, W., Ananthakrishnan, S.: ASR error detection in a conversational spoken language translation system. In: 2013 IEEE Int. Conf. Acoust. Speech Signal Process (ICASSP), pp. 7418–7422 (2013)

    Google Scholar 

  20. SIL: American English Homophones, http://www-01.sil.org/linguistics/wordlists/english/

  21. Princeton University: About WordNet, http://wordnet.princeton.edu

  22. Brett Spell: Java API for WordNet Searching (JAWS), http://lyle.smu.edu/~tspell/jaws/index.html

  23. Apache: Lucene, http://lucene.apache.org

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Rafferty, J., Nugent, C.D., Liu, J., Chen, L. (2014). Automatic Summarization of Activities Depicted in Instructional Videos by Use of Speech Analysis. In: Pecchia, L., Chen, L.L., Nugent, C., Bravo, J. (eds) Ambient Assisted Living and Daily Activities. IWAAL 2014. Lecture Notes in Computer Science, vol 8868. Springer, Cham. https://doi.org/10.1007/978-3-319-13105-4_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13105-4_20

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13104-7

  • Online ISBN: 978-3-319-13105-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics