Skip to main content

Towards Understanding User Requests in AI Bots

  • Conference paper
  • First Online:
PRICAI 2018: Trends in Artificial Intelligence (PRICAI 2018)

Abstract

This paper presents the task of deeply analyzing user requests: the situation in ordering bots where users input an utterance, the bots would hopefully extract its full product descriptions and then parse them to recognize each product information (PI). This information is useful to help bots better understand user requests, and act upon a much wider range of actions. We model it as a two-layer sequence labeling problem and apply CRFs to solve the task. We investigate two different feature settings, which are manually designed and automatically learnt from neural models of LSTM and CNN, to build good CRF models. In designing features, we propose additional ones based on Brown clustering to enhance the performance of CRF models. To verify the effectiveness, we build a corpus in the retail domain to conduct extensive experiments. The results show that automatically learnt features are very effective and commonly yield better performance than manually designed features. In both settings, adding the information of tags in one layer can also boost the performance of the other layer. Overall, we achieve the best performance with the F-measure of 93.08% in recognizing full product descriptions, and the F-measure of 92.97% in recognizing PI. To our knowledge, this is the first attempt towards understanding user utterances in the context of building Vietnamese ordering bots.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    via another intent detection module.

  2. 2.

    Selling online requires companies to collect clear basic PI that consumers can actually understand. Without PI, e.g. the name of the product, price and product category, the product could not be found and sold online at all.

  3. 3.

    part-of-speech labels are not used here because current Vietnamese pos taggers did not yield good performance on social media texts.

  4. 4.

    https://github.com/standfordnlp/GloVe.

References

  1. Bach, N.X., Minh, N.L., Oanh, T.T., Shimazu, A.: A two-phase framework for learning logical structures of paragraphs in legal articles. ACM Trans. Asian Lang. Inf. Process. (ACM TALIP) 12(1), 1–32 (2013)

    Article  Google Scholar 

  2. Bikel, D.M., Schwartz, R.L., Weischedel, R.M.: An algorithm that learns what’s in a name. Mach. Learn. J. 34(1–3), 211–231 (1999)

    Article  Google Scholar 

  3. Borthwick: Maximum entropy approach to named entity recognition. Ph.D. thesis, New York University (1999)

    Google Scholar 

  4. Brixey, J., et al.: SHIHbot: a Facebook chatbot for sexual health information on HIV/AIDS. In: 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 370–373. Association for Computational Linguistics (2017)

    Google Scholar 

  5. Brown, P., deSouza, P., Mercer, R., Pietra, V., Lai, J.: Class-based n-gram models of natural language. J. Comput. Linguist. 18(4), 467–479 (1992)

    Google Scholar 

  6. Cui, L., Huang, S., Wei, F., Tan, C., Duan, C., Zhou, M.: Superagent: a customer service chatbot for e-commerce websites. In: Proceedings of ACL 2017, System Demonstrations, pp. 97–102. Association for Computational Linguistics (2017)

    Google Scholar 

  7. Hochreiter, S., Schmidhuber, J.: Long short-term memory. J. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  8. Huang, P.S., He, X., Gao, J.F., Deng, L., Acero, A., Heck, L.: Learning deep structured semantic models for web search using clickthrough data. In: Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, pp. 2333–2338. ACM, New York (2013)

    Google Scholar 

  9. Ji, Z.J., Lu, Z.D., Li, H.: An information retrieval approach to short text conversation. arXiv:1408.6988 [cs.IR] (2014)

  10. Lafferty, J.D., McCallum, A., Perera, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: 18th International Conference on Machine Learning, pp. 282–289. Morgan Kaufmann Publishers Inc., San Francisco (2001)

    Google Scholar 

  11. Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 260–270. Association for Computational Linguistics, San Diego (2016)

    Google Scholar 

  12. LeCun, Y., Bengio, Y.: Convolutional networks for images, speech, and time series. In: The Handbook of Brain Theory and Neural Networks, pp. 255–258. MIT Press, Cambridge (1998)

    Google Scholar 

  13. Li, W., McCallum, A.: Rapid development of Hindi named entity recognition using conditional random fields and feature induction. ACM Trans. Asian Lang. Inf. Process. (TALIP) 2(3), 290–294 (2003)

    Article  Google Scholar 

  14. Ngo, T.-L., et al.: Identifying user intents in vietnamese spoken language commands and its application in smart mobile voice interaction. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, T.-P. (eds.) ACIIDS 2016. LNCS (LNAI), vol. 9621, pp. 190–201. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49381-6_19

    Chapter  Google Scholar 

  15. Nguyen, C.T., Tran, T.O., Phan, X.H., Ha, Q.T.: Named entity recognition in Vietnamese free-text and web documents using conditional random fields. In: The 8th Conference on Some selection problems of Information Technology and Telecommunication, HaiPhong, Vietnam (2005)

    Google Scholar 

  16. Qiu, M., et al.: AliMe chat: a sequence to sequence and rerank based chatbot engine. In: Annual Meeting of the Association for Computational Linguistics (2017)

    Google Scholar 

  17. Sang, E.F., Veenstra, J.: Representing text chunks. In: Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics, pp. 173–179. Association for Computational Linguistics (1999)

    Google Scholar 

  18. Sobhana, N.V., Mitra, P., Ghosh, S.K.: Conditional random field based named entity recognition in geological text. Int. J. Comput. Appl. 1(3), 119–122 (2010)

    Google Scholar 

  19. Tran, P.-N., Ta, V.-D., Truong, Q.-T., Duong, Q.-V., Nguyen, T.-T., Phan, X.-H.: Named entity recognition for Vietnamese spoken texts and its application in smart mobile voice interaction. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, T.-P. (eds.) ACIIDS 2016. LNCS (LNAI), vol. 9621, pp. 170–180. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49381-6_17

    Chapter  Google Scholar 

  20. Yan, Z., et al.: DocChat: an information retrieval approach for chatbot engines using unstructured documents. In: The Proceedings of ACL (2016)

    Google Scholar 

  21. Yan, Z., Duan, N., Chen, P., Zhou, M., Zhou, J., Li., Z.: Building task-oriented dialogue systems for online shopping. In: Proceedings of AAAI, pp. 4618–4626 (2017)

    Google Scholar 

  22. Zhu, Q., Li, X., Conesa, A., Pereira, C.: GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text. J. Bioinform. 18 (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Oanh Thi Tran .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tran, O.T., Luong, T.C. (2018). Towards Understanding User Requests in AI Bots. In: Geng, X., Kang, BH. (eds) PRICAI 2018: Trends in Artificial Intelligence. PRICAI 2018. Lecture Notes in Computer Science(), vol 11012. Springer, Cham. https://doi.org/10.1007/978-3-319-97304-3_66

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-97304-3_66

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97303-6

  • Online ISBN: 978-3-319-97304-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics