Automatic Summarization of Activities Depicted in Instructional Videos by Use of Speech Analysis

Rafferty, Joseph; Nugent, Chris D.; Liu, Jun; Chen, Liming

doi:10.1007/978-3-319-13105-4_20

Joseph Rafferty¹⁹,
Chris D. Nugent¹⁹,
Jun Liu¹⁹ &
…
Liming Chen²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8868))

Included in the following conference series:

International Workshop on Ambient Assisted Living

Abstract

Existing activity recognition based assistive living solutions have adopted a relatively rigid approach to modelling activities. To address the deficiencies of such approaches, a goal-oriented solution has been proposed that will offer a method of flexibly modelling activities. This approach does, however, have a disadvantage in that the performance of goals may vary hence requiring differing video clips to be associated with these variations. In order to address this shortcoming, the use of rich metadata to facilitate automatic sequencing and matching of appropriate video clips is necessary. This paper introduces a mechanism of automatically generating rich metadata which details the actions depicted in video files to facilitate matching and sequencing. This mechanism was evaluated with 14 video files, producing annotations with a high degree of accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Automatic Metadata Generation Through Analysis of Narration Within Instructional Videos

Article 08 August 2015

A Mechanism for Nominating Video Clips to Provide Assistance for Instrumental Activities of Daily Living

An approach to provide dynamic, illustrative, video-based guidance within a goal-driven smart home

Article 27 October 2016

References

De Luca, d.E., Bonacci, S., Giraldi, G.: Aging populations: the health and quality of life of the elderly. Clin. Ter. 162, e13 (2011)
Google Scholar
Acampora, G., Cook, D.J., Rashidi, P., Vasilakos, A.V.: A Survey on Ambient Intelligence in Health Care. Proc. IEEE. Inst. Electr. Electron. Eng. 101, 2470–2494 (2013)
Article Google Scholar
Chen, L., Hoey, J., Nugent, C.D., Cook, D.J., Yu, Z.: Sensor-Based Activity Recognition. IEEE Trans. Syst. Man, Cybern. Part C (Applications Rev.), 1–19 (2012)
Google Scholar
Lapointe, J., Bouchard, B., Bouchard, J.: Smart homes for people with Alzheimer’s disease: adapting prompting strategies to the patient’s cognitive profile. In: Proc. 5th Int. Conf. PErvasive Technol. Relat. to Assist. Environ., vol. 3 (2012)
Google Scholar
Chan, M., Estève, D., Escriba, C., Campo, E.: A review of smart homes- present state and future challenges. Comput. Methods Programs Biomed. 91, 55–81 (2008)
Article Google Scholar
Cook, D.J., Das, S.K.: How smart are our environments? An updated look at the state of the art. Pervasive Mob. Comput. 3, 53–73 (2007)
Article Google Scholar
Rafferty, J., Chen, L., Nugent, C.: Ontological Goal Modelling for Proactive Assistive Living in Smart Environments. Ubiquitous Computing and Ambient Intelligence. In: Context-Awareness and Context-Driven Interaction, pp. 262–269 (2013)
Google Scholar
Mihailidis, A., Boger, J.N., Craig, T., Hoey, J.: The COACH prompting system to assist older adults with dementia through handwashing: an efficacy study. BMC Geriatr 8, 28 (2008)
Article Google Scholar
Filippova, K., Hall, K.: Improved video categorization from text metadata and user comments. In: Proc. 34th Int. ACM SIGIR Conf. Res. Dev. Inf. Retr., SIGIR 2011, pp. 835–842 (2011)
Google Scholar
Papadopoulos, D.P., Kalogeiton, V.S., Chatzichristofis, S.A., Papamarkos, N.: Automatic summarization and annotation of videos with lack of metadata information. Expert Syst. Appl. 40, 5765–5778 (2013)
Article Google Scholar
Ballan, L., Bertini, M., Bimbo, A., Seidenari, L., Serra, G.: Event detection and recognition for semantic annotation of video. Multimed. Tools Appl. 51, 279–302 (2010)
Article Google Scholar
McCloskey, S., Davalos, P.: Activity detection in the wild using video metadata. Pattern Recognit, 3140–3143 (2012)
Google Scholar
Perea-Ortega, J.M., Montejo-Ráez, A., Martín-Valdivia, M.T., Ureña-López, L.A.: Semantic tagging of video ASR transcripts using the web as a source of knowledge. Comput. Stand. Interfaces. 35, 519–528 (2013)
Article Google Scholar
Lawton, M., Brody, E.: Instrumental Activities of Daily Living Scale, IADL (1988)
Google Scholar
Rafferty, J., Nugent, C., Chen, L., Qi, J., Dutton, R., Zirk, A., Boye, L.T., Kohn, M., Hellman, R.: NFC based provisioning of instructional videos to assist with instrumental activities of daily living. Engineering in Medicine and Biology Society (2014)
Google Scholar
Mehla, R., Aggarwal, R.: Automatic Speech Recognition: A Survey. Int. J. Adv. Res. Comput. Sci. Electron. Eng. 3, 45–53 (2014)
Google Scholar
Google: Google Speech API, http://www.google.com/speech-api/v1/recognize
Dice, L.R.: Measures of the amount of ecologic association between species. Ecology 26, 297–302 (1945)
Article Google Scholar
Chen, W., Ananthakrishnan, S.: ASR error detection in a conversational spoken language translation system. In: 2013 IEEE Int. Conf. Acoust. Speech Signal Process (ICASSP), pp. 7418–7422 (2013)
Google Scholar
SIL: American English Homophones, http://www-01.sil.org/linguistics/wordlists/english/
Princeton University: About WordNet, http://wordnet.princeton.edu
Brett Spell: Java API for WordNet Searching (JAWS), http://lyle.smu.edu/~tspell/jaws/index.html
Apache: Lucene, http://lucene.apache.org

Download references

Author information

Authors and Affiliations

School of Computing and Mathematics, University of Ulster, UK
Joseph Rafferty, Chris D. Nugent & Jun Liu
School of Computer Science and Informatics, De Montfort University, UK
Liming Chen

Authors

Joseph Rafferty
View author publications
You can also search for this author in PubMed Google Scholar
Chris D. Nugent
View author publications
You can also search for this author in PubMed Google Scholar
Jun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Liming Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, University of Warwick, CV4 7AL, Coventry, UK
Leandro Pecchia
School of Computer Science and Informatics, De Montfort University, The Gateway, LE1 9BH, Leicester, UK
Liming Luke Chen
Institute, School of Computing and Mathematics, University of Ulster, Computer Science Research, Jordanstown Campus, Shore Road, BT37 0QB, Newtownabbey, UK
Chris Nugent
Escuela Superior de Informática, Castilla La Mancha University, Paseo de la Universidad 4, 13071, Ciudad, Real, Spain
José Bravo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rafferty, J., Nugent, C.D., Liu, J., Chen, L. (2014). Automatic Summarization of Activities Depicted in Instructional Videos by Use of Speech Analysis. In: Pecchia, L., Chen, L.L., Nugent, C., Bravo, J. (eds) Ambient Assisted Living and Daily Activities. IWAAL 2014. Lecture Notes in Computer Science, vol 8868. Springer, Cham. https://doi.org/10.1007/978-3-319-13105-4_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-13105-4_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13104-7
Online ISBN: 978-3-319-13105-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics