Multimodal Analysis of Client Persuasion in Consulting Interactions: Toward Understanding Successful Consulting

  • Conference paper
Social Computing and Social Media: Applications in Marketing, Learning, and Health (HCII 2021)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12775)

Abstract

This research aims to understand successful consulting processes through multimodal analysis by developing a model that uses multimodal features to recognize when a client is persuaded by a consultant. Such a model enables us to analyze how the utterances of highly skilled professional consultants persuade clients. For this purpose, we first collected a multimodal counseling interaction corpus comprising audio and manually transcribed spoken dialogue from sessions between a professional beauty counselor and five clients. Second, we developed a recognition model for persuasion labels by training machine learning models on acoustic and linguistic features extracted from the corpus, treating recognition as a binary classification task. The experimental results show that a bidirectional LSTM recognized persuasion with an accuracy of 0.697 and an F1-score of 0.661.
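
As a rough illustration of how the two reported metrics relate to a binary persuasion label, the following sketch computes accuracy and F1-score from predicted and reference labels. The function name `binary_metrics`, the convention that 1 means "persuaded", and the toy labels are assumptions made for this example only; they are not taken from the paper, and the abstract does not state whether its F1-score is computed for the positive class or averaged over classes.

```python
def binary_metrics(y_true, y_pred):
    """Return (accuracy, F1) for binary labels, treating 1 as the positive class.

    Assumption for illustration: 1 = "client persuaded", 0 = "not persuaded".
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    accuracy = (tp + tn) / len(y_true)
    # F1 = 2TP / (2TP + FP + FN); defined as 0.0 when there are no positives at all.
    denom = 2 * tp + fp + fn
    f1 = (2 * tp / denom) if denom else 0.0
    return accuracy, f1


# Toy labels, invented purely to exercise the function.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
accuracy, f1 = binary_metrics(y_true, y_pred)
print(f"accuracy={accuracy:.3f}, F1={f1:.3f}")
```

Reporting both numbers is useful because accuracy alone can look favorable when one label dominates, whereas F1 reflects how well the positive (persuaded) class is recovered.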

Notes

  1. https://www.audeering.com/opensmile/

Acknowledgements

We sincerely thank Be·Fine Co., Ltd. and Ms. Teruko Kobayashi, the professional beauty counselor.

Author information

Corresponding author

Correspondence to Shogo Okada.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Amari, Y., Okada, S., Matsumoto, M., Sadamitsu, K., Nakamoto, A. (2021). Multimodal Analysis of Client Persuasion in Consulting Interactions: Toward Understanding Successful Consulting. In: Meiselwitz, G. (eds) Social Computing and Social Media: Applications in Marketing, Learning, and Health. HCII 2021. Lecture Notes in Computer Science, vol 12775. Springer, Cham. https://doi.org/10.1007/978-3-030-77685-5_3

  • DOI: https://doi.org/10.1007/978-3-030-77685-5_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-77684-8

  • Online ISBN: 978-3-030-77685-5

  • eBook Packages: Computer Science, Computer Science (R0)
