Multimodal Analysis of Client Persuasion in Consulting Interactions: Toward Understanding Successful Consulting

Amari, Yasushi; Okada, Shogo; Matsumoto, Maiko; Sadamitsu, Kugatsu; Nakamoto, Atsushi

doi:10.1007/978-3-030-77685-5_3

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12775)

Included in the following conference series:

International Conference on Human-Computer Interaction

Abstract

With the goal of analyzing successful consulting processes, this research aims to develop a model that uses multimodal features to recognize when a client is persuaded by a consultant. Such a model enables us to analyze the utterances that highly skilled professional consultants use to persuade clients. For this purpose, we first collected a multimodal counseling interaction corpus, including audio and manually transcribed spoken dialogue content, from dialogue sessions between a professional beauty counselor and five clients. Second, we developed a recognition model for persuasion labels by training a machine learning model on acoustic and linguistic features extracted from the corpus, framing recognition as a binary classification task. The experimental results show that persuasion was recognized with an accuracy of 0.697 and an F1-score of 0.661 using a bidirectional LSTM.
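As a reading aid for the two reported metrics, the following is a minimal sketch of how accuracy and F1-score are computed for a binary task in which 1 means "persuaded" and 0 means "not persuaded". The function name `accuracy_and_f1` and the toy labels are hypothetical and are not taken from the paper's corpus. The sketch also assumes F1 is computed for the positive class, which the abstract does not specify.

```python
from typing import Sequence, Tuple


def accuracy_and_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (accuracy, positive-class F1) for binary labels (1 = persuaded)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and equally long")

    # Count the four confusion-matrix cells.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    accuracy = (tp + tn) / len(y_true)
    # F1 = 2TP / (2TP + FP + FN); defined as 0.0 when there are no positives at all.
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


if __name__ == "__main__":
    # Hypothetical utterance-level labels, for illustration only.
    y_true = [1, 0, 1, 1, 0, 0, 1, 0]
    y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
    acc, f1 = accuracy_and_f1(y_true, y_pred)
    print(f"accuracy={acc:.3f}, f1={f1:.3f}")
```

Because F1 ignores true negatives, it can differ noticeably from accuracy when the two classes are imbalanced, which is one reason both figures are commonly reported together.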

Notes

1. https://www.audeering.com/opensmile/

Acknowledgements

We sincerely thank Be·Fine Co., Ltd. and Ms. Teruko Kobayashi, the professional beauty counselor.

Author information

Authors and Affiliations

Japan Advanced Institute of Science and Technology, Nomi, Ishikawa, Japan
Yasushi Amari & Shogo Okada
Future Corporation, Shinagawa-ku, Tokyo, Japan
Maiko Matsumoto, Kugatsu Sadamitsu & Atsushi Nakamoto

Corresponding author

Correspondence to Shogo Okada.

Editor information

Editors and Affiliations

Department of Computer Science, Towson University, Towson, MD, USA
Gabriele Meiselwitz

About this paper

Cite this paper

Amari, Y., Okada, S., Matsumoto, M., Sadamitsu, K., Nakamoto, A. (2021). Multimodal Analysis of Client Persuasion in Consulting Interactions: Toward Understanding Successful Consulting. In: Meiselwitz, G. (eds) Social Computing and Social Media: Applications in Marketing, Learning, and Health. HCII 2021. Lecture Notes in Computer Science, vol 12775. Springer, Cham. https://doi.org/10.1007/978-3-030-77685-5_3

DOI: https://doi.org/10.1007/978-3-030-77685-5_3
Published: 03 July 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77684-8
Online ISBN: 978-3-030-77685-5
eBook Packages: Computer Science, Computer Science (R0)
