Abstract
This paper presents the task of deeply analyzing user requests: the situation in ordering bots where users input an utterance, the bots would hopefully extract its full product descriptions and then parse them to recognize each product information (PI). This information is useful to help bots better understand user requests, and act upon a much wider range of actions. We model it as a two-layer sequence labeling problem and apply CRFs to solve the task. We investigate two different feature settings, which are manually designed and automatically learnt from neural models of LSTM and CNN, to build good CRF models. In designing features, we propose additional ones based on Brown clustering to enhance the performance of CRF models. To verify the effectiveness, we build a corpus in the retail domain to conduct extensive experiments. The results show that automatically learnt features are very effective and commonly yield better performance than manually designed features. In both settings, adding the information of tags in one layer can also boost the performance of the other layer. Overall, we achieve the best performance with the F-measure of 93.08% in recognizing full product descriptions, and the F-measure of 92.97% in recognizing PI. To our knowledge, this is the first attempt towards understanding user utterances in the context of building Vietnamese ordering bots.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
via another intent detection module.
- 2.
Selling online requires companies to collect clear basic PI that consumers can actually understand. Without PI, e.g. the name of the product, price and product category, the product could not be found and sold online at all.
- 3.
part-of-speech labels are not used here because current Vietnamese pos taggers did not yield good performance on social media texts.
- 4.
References
Bach, N.X., Minh, N.L., Oanh, T.T., Shimazu, A.: A two-phase framework for learning logical structures of paragraphs in legal articles. ACM Trans. Asian Lang. Inf. Process. (ACM TALIP) 12(1), 1–32 (2013)
Bikel, D.M., Schwartz, R.L., Weischedel, R.M.: An algorithm that learns what’s in a name. Mach. Learn. J. 34(1–3), 211–231 (1999)
Borthwick: Maximum entropy approach to named entity recognition. Ph.D. thesis, New York University (1999)
Brixey, J., et al.: SHIHbot: a Facebook chatbot for sexual health information on HIV/AIDS. In: 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 370–373. Association for Computational Linguistics (2017)
Brown, P., deSouza, P., Mercer, R., Pietra, V., Lai, J.: Class-based n-gram models of natural language. J. Comput. Linguist. 18(4), 467–479 (1992)
Cui, L., Huang, S., Wei, F., Tan, C., Duan, C., Zhou, M.: Superagent: a customer service chatbot for e-commerce websites. In: Proceedings of ACL 2017, System Demonstrations, pp. 97–102. Association for Computational Linguistics (2017)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. J. Neural Comput. 9(8), 1735–1780 (1997)
Huang, P.S., He, X., Gao, J.F., Deng, L., Acero, A., Heck, L.: Learning deep structured semantic models for web search using clickthrough data. In: Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, pp. 2333–2338. ACM, New York (2013)
Ji, Z.J., Lu, Z.D., Li, H.: An information retrieval approach to short text conversation. arXiv:1408.6988 [cs.IR] (2014)
Lafferty, J.D., McCallum, A., Perera, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: 18th International Conference on Machine Learning, pp. 282–289. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 260–270. Association for Computational Linguistics, San Diego (2016)
LeCun, Y., Bengio, Y.: Convolutional networks for images, speech, and time series. In: The Handbook of Brain Theory and Neural Networks, pp. 255–258. MIT Press, Cambridge (1998)
Li, W., McCallum, A.: Rapid development of Hindi named entity recognition using conditional random fields and feature induction. ACM Trans. Asian Lang. Inf. Process. (TALIP) 2(3), 290–294 (2003)
Ngo, T.-L., et al.: Identifying user intents in vietnamese spoken language commands and its application in smart mobile voice interaction. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, T.-P. (eds.) ACIIDS 2016. LNCS (LNAI), vol. 9621, pp. 190–201. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49381-6_19
Nguyen, C.T., Tran, T.O., Phan, X.H., Ha, Q.T.: Named entity recognition in Vietnamese free-text and web documents using conditional random fields. In: The 8th Conference on Some selection problems of Information Technology and Telecommunication, HaiPhong, Vietnam (2005)
Qiu, M., et al.: AliMe chat: a sequence to sequence and rerank based chatbot engine. In: Annual Meeting of the Association for Computational Linguistics (2017)
Sang, E.F., Veenstra, J.: Representing text chunks. In: Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics, pp. 173–179. Association for Computational Linguistics (1999)
Sobhana, N.V., Mitra, P., Ghosh, S.K.: Conditional random field based named entity recognition in geological text. Int. J. Comput. Appl. 1(3), 119–122 (2010)
Tran, P.-N., Ta, V.-D., Truong, Q.-T., Duong, Q.-V., Nguyen, T.-T., Phan, X.-H.: Named entity recognition for Vietnamese spoken texts and its application in smart mobile voice interaction. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, T.-P. (eds.) ACIIDS 2016. LNCS (LNAI), vol. 9621, pp. 170–180. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49381-6_17
Yan, Z., et al.: DocChat: an information retrieval approach for chatbot engines using unstructured documents. In: The Proceedings of ACL (2016)
Yan, Z., Duan, N., Chen, P., Zhou, M., Zhou, J., Li., Z.: Building task-oriented dialogue systems for online shopping. In: Proceedings of AAAI, pp. 4618–4626 (2017)
Zhu, Q., Li, X., Conesa, A., Pereira, C.: GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text. J. Bioinform. 18 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Tran, O.T., Luong, T.C. (2018). Towards Understanding User Requests in AI Bots. In: Geng, X., Kang, BH. (eds) PRICAI 2018: Trends in Artificial Intelligence. PRICAI 2018. Lecture Notes in Computer Science(), vol 11012. Springer, Cham. https://doi.org/10.1007/978-3-319-97304-3_66
Download citation
DOI: https://doi.org/10.1007/978-3-319-97304-3_66
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97303-6
Online ISBN: 978-3-319-97304-3
eBook Packages: Computer ScienceComputer Science (R0)