Recognition of Listener’s Nodding by LSTM Based on Movement of Facial Keypoints and Speech Intonation

Yamashita, Takayoshi; Nakagawa, Maya; Fujiyoshi, Hironobu; Haikawa, Yuji

doi:10.1007/978-3-030-23528-4_22

Recognition of Listener’s Nodding by LSTM Based on Movement of Facial Keypoints and Speech Intonation

Takayoshi Yamashita⁸,
Maya Nakagawa⁸,
Hironobu Fujiyoshi⁸ &
…
Yuji Haikawa⁹

Conference paper
First Online: 06 July 2019

1971 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1033))

Abstract

Communication between humans and robots is crucial to achieve successful cooperation in real-life scenarios. The robot must understand not only linguistic expressions, but also non-linguistic expressions such as nodding and gestures. In this research, we examine whether a listener nods in response to a speaker’s utterance. Our proposed method judges nodding based on the movement of the listener’s facial keypoints and the speaker’s speech intonation. The proposed method achieves approximately 84.4% recognition accuracy when we input the movement and intonation simultaneously. This improves nodding recognition accuracy by 8.8% over movement only approach. This result indicates that the movement of the listener’s facial keypoints and the speaker’s intonation are important information in nodding recognition.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Kobayashi, N., et al.: Quantitative evaluation of infant behavior and mother infant interaction. Early Dev. Parent. 1(1), 23–31 (1992)
Article MathSciNet Google Scholar
Graves, A., et al.: Speech recognition with deep recurrent neural networks. In: Acoustics, Speech and Signal Processing (ICASSP), pp. 6645–6649 (2013)
Google Scholar
Hochreiter, S., et al.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Simonyan, K., et al.: Two-stream convolutional networks for action recognition in videos. NIPS (2014)
Google Scholar
Wu, L., et al.: In Vivo evaluation of wearable head impact sensors. Ann. Biomed. Eng. 44(4), 1234–45 (2015)
Article Google Scholar
King, D.E.: Dlib-ml: a machine learning toolkit. J. Mach. Learn. Res. 10, 1755–1758 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Chubu University, Kasugai, Japan
Takayoshi Yamashita, Maya Nakagawa & Hironobu Fujiyoshi
Honda Research Institute Japan, Wako, Japan
Yuji Haikawa

Authors

Takayoshi Yamashita
View author publications
You can also search for this author in PubMed Google Scholar
Maya Nakagawa
View author publications
You can also search for this author in PubMed Google Scholar
Hironobu Fujiyoshi
View author publications
You can also search for this author in PubMed Google Scholar
Yuji Haikawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Takayoshi Yamashita .

Editor information

Editors and Affiliations

University of Crete and Foundation for Research and Technology – Hellas (FORTH), Heraklion, Crete, Greece
Constantine Stephanidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yamashita, T., Nakagawa, M., Fujiyoshi, H., Haikawa, Y. (2019). Recognition of Listener’s Nodding by LSTM Based on Movement of Facial Keypoints and Speech Intonation. In: Stephanidis, C. (eds) HCI International 2019 - Posters. HCII 2019. Communications in Computer and Information Science, vol 1033. Springer, Cham. https://doi.org/10.1007/978-3-030-23528-4_22

Download citation

DOI: https://doi.org/10.1007/978-3-030-23528-4_22
Published: 06 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-23527-7
Online ISBN: 978-3-030-23528-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics