skip to main content
10.1145/3340531.3412683acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

aDMSCN: A Novel Perspective for User Intent Prediction in Customer Service Bots

Published: 19 October 2020 Publication History

Abstract

As one of the core components of customer service bot, User Intent Prediction (UIP) aims at predicting users? intents (usually represented as predefined user questions) before they ask, and has been widely applied in real applications. However, when developing a machine learning system for this problem, two critical issues, i.e., the problem of feature drift and class imbalance, may emerge and seriously deprave the system performance. Moreover, various scenarios may arise due to business demands, making the aforementioned problems much more severe. To address these two problems, we propose an attention-based Deep Multi-instance Sequential Cross Network (aDMSCN) to deal with the UIP task. On the one hand,the UIP task can be subtly formalized as multi-instance learning(MIL) task with an attention-based method proposed to alleviate the influences of feature drift. To the best of our knowledge, this is the first attempt to model the problem from a MIL perspective.On the other hand, a ratio-sensitive loss is also developed in our model, which can mitigate the negative impact of class imbalance. Extensive experiments on both offline real-world datasets and on-line A/B testing show that our proposed framework significantly out performs other state-of-art methods for the UIP task.

Supplementary Material

MP4 File (3340531.3412683.mp4)
An introduction to a novel perspective when dealing with user intent prediction for practical online service bots.

References

[1]
Jaume Amores. 2013. Multiple instance classification: Review, taxonomy and comparative study. Artificial intelligence, Vol. 201 (2013), 81--105.
[2]
Jean Paul Barddal, Heitor Murilo Gomes, Fabrício Enembreck, and Bernhard Pfahringer. [n.d.]. A survey on feature drift adaptation: Definition, benchmark, challenges and future directions. Journal of Systems & Software ([n.,d.]), S0164121216301030.
[3]
Yoshua Bengio, Réjean Ducharme, Pascal Vincent, and Christian Jauvin. 2003. A neural probabilistic language model. Journal of machine learning research, Vol. 3, Feb (2003), 1137--1155.
[4]
Nitesh V Chawla, Kevin W Bowyer, Lawrence O Hall, and W Philip Kegelmeyer. 2002. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, Vol. 16 (2002), 321--357.
[5]
Cen Chen, Chilin Fu, Xu Hu, Xiaolu Zhang, Jun Zhou, Xiaolong Li, and Forrest Sheng Bao. 2019 a. Reinforcement Learning for User Intent Prediction in Customer Service Bots. In Proceedings of the 42Nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'19). ACM, New York, NY, USA, 1265--1268. https://doi.org/10.1145/3331184.3331370
[6]
Cen Chen, Xiaolu Zhang, Sheng Ju, Chilin Fu, Caizhi Tang, Jun Zhou, and Xiaolong Li. 2019 b. AntProphet: an Intention Mining System behind Alipay's Intelligent Customer Service Bot. 6497--6499. https://www.ijcai.org/proceedings/2019/935
[7]
Cen Chen, Peilin Zhao, Longfei Li, Jun Zhou, Xiaolong Li, and Minghui Qiu. 2017. Locally Connected Deep Learning Framework for Industrial-scale Recommender Systems. In WWW.
[8]
Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, and Mustafa Ispir. 2016. Wide & deep learning for recommender systems. In Proceedings of the 1st workshop on deep learning for recommender systems. ACM, 7--10. https://doi.org/10/gfwc7n
[9]
Ofer Dekel, Ohad Shamir, and Lin Xiao. 2010. Learning to classify with missing and corrupted features. Machine learning, Vol. 81, 2 (2010), 149--178.
[10]
Mukund Deshpande and George Karypis. 2004. Item-based top-n recommendation algorithms. ACM Transactions on Information Systems (TOIS), Vol. 22, 1 (2004), 143--177.
[11]
Thomas G. Dietterich, Richard H. Lathrop, and Tomás Lozano-Pérez. 1997. Solving the multiple instance problem with axis-parallel rectangles. Artificial intelligence, Vol. 89, 1--2 (1997), 31--71. https://doi.org/10/b738d7
[12]
Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, and Xiuqiang He. 2017. DeepFM: A Factorization-Machine based Neural Network for CTR Prediction. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization, Melbourne, Australia, 1725--1731. https://doi.org/10.24963/ijcai.2017/239
[13]
Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th International ACM SIGIR conference on Research and Development in Information Retrieval. ACM, 355--364. https://doi.org/10/gfvsvd
[14]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780. https://doi.org/10/bxd65w
[15]
Maximilian Ilse, Jakub M. Tomczak, and Max Welling. 2018. Attention-based deep multiple instance learning. In International Conference on Machine Learning.
[16]
Yuchin Juan, Yong Zhuang, Wei-Sheng Chin, and Chih-Jen Lin. 2016. Field-aware factorization machines for CTR prediction. In Proceedings of the 10th ACM Conference on Recommender Systems. ACM, 43--50. https://doi.org/10/gf5psn
[17]
Diederik Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. In International Conference on Learning Representations.
[18]
Yehuda Koren. 2009. Collaborative filtering with temporal dynamics. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 447--456. https://doi.org/10/dhw5qt
[19]
Jianxun Lian, Xiaohuan Zhou, Fuzheng Zhang, Zhongxia Chen, Xing Xie, and Guangzhong Sun. 2018. xDeepFM: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 1754--1763. https://doi.org/10/gf3nkw
[20]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision. 2980--2988. https://doi.org/10/gf226d
[21]
Maxime Oquab, Léon Bottou, Ivan Laptev, and Josef Sivic. 2014. Weakly supervised object recognition with convolutional neural networks. In Proc. of NIPS.
[22]
Chen Qu, Liu Yang, W. Bruce Croft, Yongfeng Zhang, Johanne R. Trippas, and Minghui Qiu. 2019. User intent prediction in information-seeking conversations. In Proceedings of the 2019 Conference on Human Information Interaction and Retrieval. ACM, 25--33.
[23]
Steffen Rendle. 2010. Factorization machines. In 2010 IEEE International Conference on Data Mining. IEEE, 995--1000. https://doi.org/10/bgfknx
[24]
Ying Shan, T. Ryan Hoens, Jian Jiao, Haijing Wang, Dong Yu, and J. C. Mao. 2016. Deep crossing: Web-scale modeling without manually crafted combinatorial features. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 255--262.
[25]
Jie Shen, Ying Gao, Cang Chen, and HaiPing Gong. 2012. A rank-based Prediction Algorithm of Learning User's Intention. Physics Procedia, Vol. 24 (2012), 1742--1748.
[26]
Weiping Song, Chence Shi, Zhiping Xiao, Zhijian Duan, Yewen Xu, Ming Zhang, and Jian Tang. 2019. AutoInt: Automatic Feature Interaction Learning via Self-Attentive Neural Networks. In CIKM.
[27]
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2818--2826.
[28]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.
[29]
Ruoxi Wang, Bin Fu, Gang Fu, and Mingliang Wang. 2017. Deep & cross network for ad click predictions. In Proceedings of the ADKDD'17. ACM, 12. https://doi.org/10/gf2wst
[30]
Jun Xiao, Hao Ye, Xiangnan He, Hanwang Zhang, Fei Wu, and Tat-Seng Chua. 2017. Attentional factorization machines: Learning the weight of feature interactions via attention networks. arXiv preprint arXiv:1708.04617 (2017).
[31]
Weinan Zhang, Tianming Du, and Jun Wang. 2016. Deep learning over multi-field categorical data. In European conference on information retrieval. Springer, 45--57. https://doi.org/10/gf5psm
[32]
Ya-Lin Zhang and Zhi-Hua Zhou. 2017. Multi-instance Learning with Key Instance Shift. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17). 3441--3447. http://dl.acm.org/citation.cfm?id=3172077.3172370
[33]
Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, and Kun Gai. [n.d.]. Deep Interest Evolution Network for Click-Through Rate Prediction.
[34]
Guorui Zhou, Xiaoqiang Zhu, Chenru Song, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, and Kun Gai. 2018. Deep Interest Network for Click-Through Rate Prediction. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '18). ACM, New York, NY, USA, 1059--1068. https://doi.org/10/gfx98v
[35]
Zhi-Hua Zhou, Kai Jiang, and Ming Li. 2005. Multi-instance learning based web mining. Applied Intelligence, Vol. 22, 2 (2005), 135--147.

Cited By

View all
  • (2023)ALT: An Automatic System for Long Tail Scenario Modeling2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00231(3017-3030)Online publication date: Apr-2023
  • (2022)FinBrain 2.0: when finance meets trustworthy AI金融大脑2.0:当金融遇到可信人工智能Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.220003923:12(1747-1764)Online publication date: 30-Sep-2022
  • (2022)Design possibilities and challenges of DNN models: a review on the perspective of end devicesArtificial Intelligence Review10.1007/s10462-022-10138-z55:7(5109-5167)Online publication date: 1-Oct-2022
  • Show More Cited By

Index Terms

  1. aDMSCN: A Novel Perspective for User Intent Prediction in Customer Service Bots

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
    October 2020
    3619 pages
    ISBN:9781450368599
    DOI:10.1145/3340531
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 October 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. multiple instancelearning
    2. recommender system
    3. user intent prediction

    Qualifiers

    • Research-article

    Conference

    CIKM '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)ALT: An Automatic System for Long Tail Scenario Modeling2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00231(3017-3030)Online publication date: Apr-2023
    • (2022)FinBrain 2.0: when finance meets trustworthy AI金融大脑2.0:当金融遇到可信人工智能Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.220003923:12(1747-1764)Online publication date: 30-Sep-2022
    • (2022)Design possibilities and challenges of DNN models: a review on the perspective of end devicesArtificial Intelligence Review10.1007/s10462-022-10138-z55:7(5109-5167)Online publication date: 1-Oct-2022
    • (2021)A Classification Based Ensemble Pruning Framework with Multi-metric ConsiderationIntelligent Systems and Applications10.1007/978-3-030-82193-7_44(650-667)Online publication date: 4-Aug-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media