Abstract
Existing assistive living solutions based on activity recognition have adopted a relatively rigid approach to modelling activities. To address the deficiencies of such approaches, a goal-oriented solution has been proposed that offers a flexible method of modelling activities. This approach does, however, have a disadvantage: the way a goal is performed may vary, so different video clips must be associated with each variation. To address this shortcoming, rich metadata is needed to facilitate automatic matching and sequencing of appropriate video clips. This paper introduces a mechanism for automatically generating rich metadata that details the actions depicted in video files, in order to facilitate matching and sequencing. The mechanism was evaluated with 14 video files and produced annotations with a high degree of accuracy.
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Rafferty, J., Nugent, C.D., Liu, J., Chen, L. (2014). Automatic Summarization of Activities Depicted in Instructional Videos by Use of Speech Analysis. In: Pecchia, L., Chen, L.L., Nugent, C., Bravo, J. (eds) Ambient Assisted Living and Daily Activities. IWAAL 2014. Lecture Notes in Computer Science, vol 8868. Springer, Cham. https://doi.org/10.1007/978-3-319-13105-4_20
DOI: https://doi.org/10.1007/978-3-319-13105-4_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13104-7
Online ISBN: 978-3-319-13105-4
eBook Packages: Computer Science (R0)