Skip to main content

Clustering Approach to Topic Modeling in Users Dialogue

  • Conference paper
  • First Online:
Intelligent Systems and Applications (IntelliSys 2020)

Abstract

This article describes an algorithm for clustering messages from user dialogues. We focus on the fact that the quality of clustering is significantly affected by the number of user questions included in the analyzed subset. The technique was tested on dialogues of Telecom domain, each dialogue can include one to eight questions. The algorithm involves the use of basic and additional methods of data preprocessing, methods of feature extraction, data augmentation, dimensionality reduction method, comparative analysis of the application of clustering methods. The article presents a comparison results of the bag-of-word model, agglomerative clustering and k-means clustering on the sets with different number of users questions. It is shown that the best cluster results are obtained when only the first user questions are included in the analyzed subset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hiraoka, T., Tsuchida, M., Watanabe, Y.: Deep Reinforcement Learning for Inquiry Dialog Policies with Logical Formula Embeddings (2017)

    Google Scholar 

  2. Liu, H., Lin, T., Sun, H., Lin, W., Chang, C.-W., Zhong, T., Rudnicky, A.: RubyStar: A Non-Task-Oriented Mixture Model Dialog System (2017)

    Google Scholar 

  3. Koltsov, S., Pashakhin, S., Dokuka, S.: A full-cycle methodology for news topic modeling and user feedback research. In: Staab, S., Koltsova, O., Ignatov, D. (eds.) Social Informatics. SocInfo 2018. LNCS, vol. 11185, pp. 308–321. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01129-1_19

  4. Sanandres, E., Llanos, R., Madariaga, C.: Topic Modeling of Twitter Conversations (2018)

    Google Scholar 

  5. Shilkina, N., Maltseva, A., Makhnytkina, O., Titova, M., Gubernatorova, E., Katsko, I., Mirzabalaeva, F., Shusharina, S.: Social media as a display of students’ communication culture: case of educational, professional and labor verbal markers analysis. In: Communications in Computer and Information Science, pp. 384–397 (2019)

    Google Scholar 

  6. Liu, L., Huang, H., Gao, Y., Zhang, Y., Wei, X.: Neural variational correlated topic modeling. In: The World Wide Web Conference, pp. 1142–1152 (2019)

    Google Scholar 

  7. Ram, A., Prasad, R., Khatri, C., Venkatesh, A.: Conversational AI: the science behind the alexa prize (2017)

    Google Scholar 

  8. Boteanu, A., Chernova, S.: Modeling topics in user dialog for interactive tablet media. In: AAAI Workshop, pp. 2–8 (2012)

    Google Scholar 

  9. Hisano, R.: Learning topic models by neighborhood aggregation. In: Twenty-Eighth International Joint Conference on Artificial Intelligence IJCAI 2019, pp. 2498–2505 (2019)

    Google Scholar 

  10. Akhtar, N., Beg, M., Javed, H.: Topic modelling with fuzzy document representation. In: Advances in Computing and Data Sciences, pp. 577–587 (2019)

    Google Scholar 

  11. Dieng, A., Ruiz, F., Blei, D.: The Dynamic Embedded Topic Model (2019)

    Google Scholar 

  12. Nugmanova, A., Smirnov, A., Lavrentyeva, G., Chernykh, I.: Strategy of the negative sampling for training retrieval-based dialogue systems. In: IEEE International Conference on Pervasive Computing and Communications Workshops, pp. 844–848 (2019)

    Google Scholar 

  13. Zhang, P., Wang, S., Li, D., Li, X., Xu, Z.: Combine topic modeling with semantic embedding: embedding enhanced topic model. IEEE Trans. Knowl. Data Eng. 1 (2019)

    Google Scholar 

  14. Mao, Q., Feng, B., Pan, S.: A Bayesian nonparametric topic model for user interest modeling. In: Conference: 2014 IEEE 17th International Conference on Computational Science and Engineering, pp. 527–534 (2014)

    Google Scholar 

  15. Mähr, M., Hoffmann, H., Zetti, D.: Topic modelling and explorative search. In: Conference: Workshop DARIAH-CH (2018)

    Google Scholar 

  16. Korshunova, I., Xiong, H., Fedoryszak, M., Theis, L.: Discriminative Topic Modeling with Logistic LDA (2019)

    Google Scholar 

  17. Tkachenko, M., Lauw, H.: CompareLDA: a topic model for document comparison. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 7112–7119 (2019)

    Google Scholar 

  18. Yang, Y., Wang, F., Jiang, F., Jin, S., Xu, J.: A topic model for hierarchical documents. In: 1st IEEE International Conference on Data Science in Cyberspace (2016)

    Google Scholar 

  19. Gerlach, M., Peixoto, T., Altmann, E.: A network approach to topic models. Sci. Adv. (2018)

    Google Scholar 

  20. Pfeifer, D., Leidner, J.: Topic Grouper: An Agglomerative Clustering Approach to Topic Modeling (2019)

    Google Scholar 

  21. Iwata, T., Hirao, T., Ueda, N.: Topic models for unsupervised cluster matching. IEEE Trans. Knowl. Data Eng. 1 (2017)

    Google Scholar 

  22. Krasnashchok, K., Cherif, A.: Coherence regularization for neural topic models. In: Advances in Neural Networks (2019)

    Google Scholar 

  23. Nan, F., Ding, R., Nallapati, R., Xiang, B.: Topic Modeling with Wasserstein Autoencoders (2019)

    Google Scholar 

  24. Khatri, C., Goel, R., Hedayatnia, B., Metanillou, A., Venkatesh, A., Gabriel, R., Mandal, A.: Contextual topic modeling for dialog systems. In: Conference: 2018 IEEE Spoken Language Technology Workshop (SLT), pp. 892–899 (2018)

    Google Scholar 

  25. Ma, Y., Fosler-Lussier, E.: Detecting ‘Request Alternatives’ user dialog acts from dialog context. In: Situated Dialog in Speech-Based Human-Computer Interaction (2016)

    Google Scholar 

Download references

Acknowledgments

This work was partially financially supported by the Government of the Russian Federation (Grant 08-08).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to E. Feldina or O. Makhnytkina .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Feldina, E., Makhnytkina, O. (2021). Clustering Approach to Topic Modeling in Users Dialogue. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. IntelliSys 2020. Advances in Intelligent Systems and Computing, vol 1251. Springer, Cham. https://doi.org/10.1007/978-3-030-55187-2_44

Download citation

Publish with us

Policies and ethics