research-article

Predicting User Intents and Satisfaction with Dialogue-based Conversational Recommendations

Authors:
Wanling Cai

Hong Kong Baptist University, Hong Kong, China

Hong Kong Baptist University, Hong Kong, China
View Profile

,
Li Chen

Hong Kong Baptist University, Hong Kong, China

Hong Kong Baptist University, Hong Kong, China
View Profile

UMAP '20: Proceedings of the 28th ACM Conference on User Modeling, Adaptation and PersonalizationJuly 2020Pages 33–42https://doi.org/10.1145/3340631.3394856

Published:13 July 2020Publication History

UMAP '20: Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization

Pages 33–42

ABSTRACT

To develop a multi-turn dialogue-based conversational recommender system (DCRS), it is important to predict users' intents behind their utterances and their satisfaction with the recommendation, so as to allow the system to incrementally refine user preference model and adjust its dialogue strategy. However, little work has investigated these issues so far. In this paper, we first contribute with two hierarchical taxonomies for classifying user intents and recommender actions respectively based on grounded theory. We then define various categories of feature considering content, discourse, sentiment, and context to predict users' intents and satisfaction by comparing different machine learning methods. The experimental results for user intent prediction task show that some models (such as XGBoost and SVM) can perform well in predicting user intents, and incorporating context features into the prediction model can significantly boost the performance. Our empirical study also demonstrates that leveraging dialogue behavior features (i.e., including both user intents and recommender actions) can achieve good results in predicting user satisfaction.

Supplemental Material

3340631.3394856.mp4

Supplemental Video

mp4

46.7 MB

Download

Available for Download

vtt

3340631.3394856.vtt (21.5 KB)

References

Charu C. Aggarwal and ChengXiang Zhai. 2012. Mining Text Data .Springer Science & Business Media.Google ScholarDigital Library
A. Bhargava, A. Celikyilmaz, D. Hakkani-Tür, and R. Sarikaya. 2013. Easy Contextual Intent Prediction and Slot Detection. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '13). 8337--8341.Google Scholar
Sumit Bhatia and Prasenjit Mitra. 2012. Classifying User Messages For Managing Web Forum Data. In WebDB. 13--18.Google Scholar
Andrei Broder. 2002. A Taxonomy of Web Search. SIGIR Forum, Vol. 36, 2 (2002), 3--10. http://doi.acm.org/10.1145/792550.792552Google ScholarDigital Library
Wanling Cai and Li Chen. 2019. Towards a Taxonomy of User Feedback Intents for Conversational Recommendations. In Proceedings of ACM RecSys 2019 Late-breaking Results co-located with the 13th ACM Conference on Recommender Systems.Google Scholar
Joyce Yue Chai, Malgorzata Budzikowska, Veronika Horvath, Nicolas Nicolov, Nanda Kambhatla, and Wlodek Zadrozny. 2001. Natural Language Sales Assistant - A Web-Based Dialog System for Online Sales. In Proceedings of the Thirteenth Conference on Innovative Applications of Artificial Intelligence Conference (IAAI '01). 19--26. http://dl.acm.org/citation.cfm?id=645453.653001Google ScholarDigital Library
Hongshen Chen, Xiaorui Liu, Dawei Yin, and Jiliang Tang. 2017. A Survey on Dialogue Systems: Recent Advances and New Frontiers. ACM SIGKDD Explorations Newsletter, Vol. 19, 2 (2017), 25--35. http://doi.acm.org/10.1145/3166054.3166058Google ScholarDigital Library
Li Chen and Pearl Pu. 2006. Evaluating Critiquing-based Recommender Agents. In Proceedings of the 21st National Conference on Artificial Intelligence - Volume 1 (AAAI '06). 157--162. http://dl.acm.org/citation.cfm?id=1597538.1597564Google ScholarDigital Library
Li Chen and Pearl Pu. 2012. Critiquing-based Recommenders: Survey and Emerging Trends. User Modeling and User-Adapted Interaction, Vol. 22, 1--2 (2012), 125--150. http://dx.doi.org/10.1007/s11257-011--9108--6Google ScholarDigital Library
Konstantina Christakopoulou, Alex Beutel, Rui Li, Sagar Jain, and Ed H. Chi. 2018. Q&R: A Two-Stage Approach Toward Interactive Recommendation. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '18). 139--148. http://doi.acm.org/10.1145/3219819.3219894Google Scholar
Konstantina Christakopoulou, Filip Radlinski, and Katja Hofmann. 2016. Towards Conversational Recommender Systems. In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16). 815--824. http://doi.acm.org/10.1145/2939672.2939746Google ScholarDigital Library
Jacob Cohen. 1960. A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, Vol. 20, 1 (1960), 37--46.Google ScholarCross Ref
Jane F Dye, Irene M Schatz, Brian A Rosenberg, and Susanne T Coleman. 2000. Constant Comparison Method: A Kaleidoscope of Data. The Qualitative Report, Vol. 4, 1 (2000), 1--10.Google Scholar
Klaus-Peter Engelbrech, Florian Gödde, Felix Hartard, Hamed Ketabdar, and Sebastian Möller. 2009. Modeling User Satisfaction with Hidden Markov Model. In Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL '09). 170--177. http://dl.acm.org/citation.cfm?id=1708376.1708402Google ScholarDigital Library
Elena V. Epure, Dario Compagno, Camille Salinesi, Rebecca Deneckere, Marko Bajec, and Slavko vZ itnik. 2018. Process Models of Interrelated Speech Intentions from Online Health-Related Conversations. Artificial Intelligence in Medicine, Vol. 91 (2018), 23--38.Google ScholarCross Ref
Barney G. Glaser. 1998. Doing Grounded Theory: Issues and Discussions .Sociology Press.Google Scholar
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. The MIT Press.Google ScholarDigital Library
Jonathan Grudin and Richard Jacques. 2019. Chatbots, Humbots, and the Quest for Artificial General Intelligence. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19). 1--11. https://doi.org/10.1145/3290605.3300439Google ScholarDigital Library
Seyyed Hadi Hashemi, Kyle Williams, Ahmed El Kholy, Imed Zitouni, and Paul A. Crook. 2018. Measuring User Satisfaction on Smart Speaker Intelligent Assistants Using Intent Sensitive Query Embeddings. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18). 1183--1192. http://doi.acm.org/10.1145/3269206.3271802Google Scholar
Ryuichiro Higashinaka, Yasuhiro Minami, Kohji Dohsaka, and Toyomi Meguro. 2010. Modeling User Satisfaction Transitions in Dialogues from Overall Ratings. In Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL '10). 18--27. http://dl.acm.org/citation.cfm?id=1944506.1944510Google ScholarDigital Library
C.J. Hutto and Eric Gilbert. 2014. VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text. In International AAAI Conference on Web and Social Media (ICWSM '14). https://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8109Google Scholar
Jie Kang, Kyle Condiff, Shuo Chang, Joseph A. Konstan, Loren Terveen, and F. Maxwell Harper. 2017. Understanding How People Use Natural Language to Ask for Recommendations. In Proceedings of the Eleventh ACM Conference on Recommender Systems (RecSys '17). 229--237. http://doi.acm.org/10.1145/3109859.3109873Google Scholar
Tsuneo Kato, Atsushi Nagai, Naoki Noda, Ryosuke Sumitomo, Jianming Wu, and Seiichi Yamamoto. 2017. Utterance Intent Classification of a Spoken Dialogue System with Efficiently Untied Recursive Autoencoders. In Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue. 60--64. https://www.aclweb.org/anthology/W17--5508Google ScholarCross Ref
Diane Kelly. 2009. Methods for Evaluating Interactive Information Retrieval Systems with Users. Foundations and Trends in Information Retrieval, Vol. 3, 1--2 (2009), 1--224. https://doi.org/10.1561/1500000012Google ScholarDigital Library
Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP '14). 1746--1751. https://www.aclweb.org/anthology/D14--1181Google ScholarCross Ref
Raymond Li, Samira Ebrahimi Kahou, Hannes Schulz, Vincent Michalski, Laurent Charlin, and Chris Pal. 2018. Towards Deep Conversational Recommendations. In Advances in Neural Information Processing Systems 32. 9748--9758. http://papers.nips.cc/paper/8180-towards-deep-conversational-recommendations.pdfGoogle Scholar
Bing Liu, Minqing Hu, and Junsheng Cheng. 2005. Opinion Observer: Analyzing and Comparing Opinions on the Web. In Proceedings of the 14th International Conference on World Wide Web (WWW '05). 342--351. https://doi.org/10.1145/1060745.1060797Google ScholarDigital Library
Chunxi Liu, Puyang Xu, and Ruhi Sarikaya. 2015. Deep Contextual Language Understanding in Spoken Dialogue Systems. In Sixteenth Annual Conference of the International Speech Communication Association.Google Scholar
Yang Liu, Kun Han, Zhao Tan, and Yun Lei. 2017. Using Context Information for Dialog Act Classification in DNN Framework. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP '17). 2170--2178. https://www.aclweb.org/anthology/D17--1231Google ScholarCross Ref
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Introduction to Information Retrieval. Cambridge University Press.Google Scholar
Mary L McHugh. 2012. Interrater Reliability: the Kappa Statistic. Biochemia Medica, Vol. 22, 3 (2012), 276--282.Google ScholarCross Ref
Rishabh Mehrotra, Imed Zitouni, Ahmed Hassan Awadallah, Ahmed El Kholy, and Madian Khabsa. 2017. User Interaction Sequences for Search Satisfaction Prediction. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17). 165--174. http://doi.acm.org/10.1145/3077136.3080833Google ScholarDigital Library
Lian Meng and Minlie Huang. 2018. Dialogue Intent Classification with Long Short-Term Memory Networks. In National CCF Conference on Natural Language Processing and Chinese Computing (NLPCC '18). Springer, 42--50.Google Scholar
Andrew Olney, Max Louwerse, Eric Matthews, Johanna Marineau, Heather Hite-Mitchell, and Arthur Graesser. 2003. Utterance Classification in AutoTutor. In Proceedings of the HLT-NAACL 03 Workshop on Building Educational Applications Using Natural Language Processing - Volume 2 (HLT-NAACL-EDUC '03). 1--8. https://doi.org/10.3115/1118894.1118895Google ScholarDigital Library
Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. Glove: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP '14). 1532--1543. https://www.aclweb.org/anthology/D14--1162Google ScholarCross Ref
Bilih Priyogi. 2019. Preference Elicitation Strategy for Conversational Recommender System. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (WSDM '19). 824--825. http://doi.acm.org/10.1145/3289600.3291604Google ScholarDigital Library
Chen Qu, Liu Yang, W. Bruce Croft, Yongfeng Zhang, Johanne R. Trippas, and Minghui Qiu. 2019. User Intent Prediction in Information-seeking Conversations. In Proceedings of the 2019 Conference on Human Information Interaction and Retrieval (CHIIR '19). 25--33. http://doi.acm.org/10.1145/3295750.3298924Google ScholarDigital Library
Dimitrios Rafailidis and Yannis Manolopoulos. 2019. The Technological Gap Between Virtual Assistants and Recommendation Systems. CoRR, Vol. abs/1901.00431 (2019). http://arxiv.org/abs/1901.00431Google Scholar
Jesse Read, Bernhard Pfahringer, Geoff Holmes, and Eibe Frank. 2011. Classifier Chains for Multi-label Classification. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. 254--269.Google Scholar
Daniel E. Rose and Danny Levinson. 2004. Understanding User Goals in Web Search. In Proceedings of the 13th International Conference on World Wide Web (WWW '04). 13--19. http://doi.acm.org/10.1145/988672.988675Google ScholarDigital Library
Hideo Shimazu. 2001. ExpertClerk: Navigating Shoppers' Buying Process with the Combination of Asking and Proposing. In Proceedings of the 17th International Joint Conference on Artificial Intelligence - Volume 2 (IJCAI '01). 1443--1448. http://dl.acm.org/citation.cfm?id=1642194.1642287Google ScholarDigital Library
Mohammad S. Sorower. 2010. A Literature Survey on Algorithms for Multi-label Learning. Oregon State University, Corvallis, Vol. 18 (2010), 1--25.Google Scholar
Andreas Stolcke, Noah Coccaro, Rebecca Bates, Paul Taylor, Carol Van Ess-Dykema, Klaus Ries, Elizabeth Shriberg, Daniel Jurafsky, Rachel Martin, and Marie Meteer. 2000. Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech. Computational Linguistics, Vol. 26, 3 (2000), 339--373. https://doi.org/10.1162/089120100561737Google ScholarDigital Library
Ming Sun, Yun-Nung Chen, and Alexander I. Rudnicky. 2015. Understanding User's Cross-domain Intentions in Spoken Dialog Systems. In NIPS workshop on Machine Learning for SLU and Interaction (NIPS-SLU).Google Scholar
Yueming Sun and Yi Zhang. 2018. Conversational Recommender System. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '18). 235--244. http://doi.acm.org/10.1145/3209978.3210002Google ScholarDigital Library
Dinoj Surendran and Gina-Anne Levow. [n.d.]. Dialog Act Tagging with Support Vector Machines and Hidden Markov Models. In Ninth International Conference on Spoken Language Processing (ICSLP-Interspeech '06).Google Scholar
Cynthia A. Thompson, Mehmet H. Göker, and Pat Langley. 2004. A Personalized System for Conversational Recommendations. Journal of Artificial Intelligence Research, Vol. 21, 1 (2004), 393--428. http://dl.acm.org/citation.cfm?id=1622467.1622479Google ScholarCross Ref
Kyle Williams and Imed Zitouni. 2017. Does That Mean You'Re Happy?: RNN-based Modeling of User Interaction Sequences to Detect Good Abandonment. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM '17). 727--736. http://doi.acm.org/10.1145/3132847.3133035Google ScholarDigital Library
Zhao Yan, Nan Duan, Peng Chen, Ming Zhou, Jianshe Zhou, and Zhoujun Li. 2017. Building Task-oriented Dialogue Systems for Online Shopping. In Thirty-First AAAI Conference on Artificial Intelligence (AAAI '17). https://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14261Google Scholar
Xiang Zhang, Junbo Zhao, and Yann LeCun. 2015. Character-level Convolutional Networks for Text Classification. In Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 1 (NIPS '15). 649--657. http://dl.acm.org/citation.cfm?id=2969239.2969312Google ScholarDigital Library
Yongfeng Zhang, Xu Chen, Qingyao Ai, Liu Yang, and W. Bruce Croft. 2018. Towards Conversational Search and Recommendation: System Ask, User Respond. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18). 177--186. http://doi.acm.org/10.1145/3269206.3271776Google ScholarDigital Library

Index Terms

Predicting User Intents and Satisfaction with Dialogue-based Conversational Recommendations
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. HCI design and evaluation methods
      1. User models
      2. User studies
2. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Recommender systems

Recommendations

User Intent Prediction in Information-seeking Conversations
CHIIR '19: Proceedings of the 2019 Conference on Human Information Interaction and Retrieval

Conversational assistants are being progressively adopted by the general population. However, they are not capable of handling complicated information-seeking tasks that involve multiple turns of information exchange. Due to the limited communication ...
Read More
Understanding and Predicting User Satisfaction with Conversational Recommender Systems
User satisfaction depicts the effectiveness of a system from the user’s perspective. Understanding and predicting user satisfaction is vital for the design of user-oriented evaluation methods for conversational recommender systems (CRSs). Current ...
Read More
A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Recently, spoken dialogue systems have been widely deployed in a variety of applications, serving a huge number of end-users. A common issue is that the errors resulting from noisy utterances, semantic misunderstandings, or lack of knowledge make it ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
UMAP '20: Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization
July 2020
426 pages
ISBN:9781450368612
DOI:10.1145/3340631
Editors:
Tsvi Kuflik
University of Haifa, Israel
,
Ilaria Torre
University of Genoa, Italy
,
Robin Burke
University of Colorado, Boulder, USA
,
Cristina Gena
University of Turin, Italy
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 July 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Badges
- Best Student Paper
Author Tags
dialogue-based conversational recommender systems
intent taxonomy
user intent prediction
user satisfaction prediction
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate162of633submissions,26%
Upcoming Conference
UMAP '24

Sponsor:

sigchi

sigchi

32nd ACM Conference on User Modeling, Adaptation and Personalization

July 1 - 4, 2024

Cagliari , Italy
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 21
  Total Citations
  View Citations
- 878
  Total Downloads
- Downloads (Last 12 months)155
- Downloads (Last 6 weeks)26
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Predicting User Intents and Satisfaction with Dialogue-based Conversational Recommendations

UMAP '20: Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization

ABSTRACT

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

User Intent Prediction in Information-seeking Conversations

Understanding and Predicting User Satisfaction with Conversational Recommender Systems

A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS