Abstract
In the recent years, a lot of methods have been proposed for detection of topicality of user discussions. Recently, the scholars have suggested approaches to tracing topicality evolution, including dynamic topic modeling. However, these approaches are overwhelmingly limited by representation of topics via lists of top words, which only hint to possible contents of topics and does not allow for real mapping of opinion cumulation [1]. We suggest a methodology for discussion mapping that combines neural-network-based encoding of user posts, HDBSCAN-based topic modeling, and abstractive summarization to map large-scale online discussions and trace bifurcation points in opinion cumulation. We test the proposed method on a mid-range dataset on climate change from Reddit and show how discussions may be summarized in a feasible and easily accessible way. Among the rest, we show that the bifurcation points in topicality are often followed by growth of a given topic, which may in future allow for predicting discussion outbursts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bodrunova, S.S.: Practices of cumulative deliberation: a meta-review of the recent research findings. In: Chugunov, A.V., Janssen, M., Khodachek, I., Misnikov, Y., Trutnev, D. (eds.) EGOSE 2021. CCIS, vol. 1529, pp. 89–104. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-04238-6_8
Vaswani, A., et al.: Attention is All You Need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Angelov, D.: Top2Vec: distributed representations of topics (2020). arXiv preprint arXiv:2008.09470
Grootendorst, M.: BERTopic: neural topic modeling with a class-based TF-IDF procedure (2022). arXiv preprint arXiv:2203.05794
Gupta, S., Gupta, S.K.: Abstractive summarization: an overview of the state of the art. Expert Syst. Appl. 121, 49–65 (2019)
Reimers, N., Gurevych, I.: Making monolingual sentence embeddings multilingual using knowledge distillation (2020). arXiv preprint arXiv:2004.09813
Henderson, M., et al.: A repository of conversational datasets (2019). arXiv preprint arXiv:1904.06472
McInnes, L., Healy, J., Melville, J.: UMAP: uniform manifold approximation and projection for dimension reduction (2018). arXiv preprint arXiv:1802.03426
McInnes, L., Healy, J., Astels, S.: HDBSCAN: hierarchical density based clustering. J. Open Source Softw. 2(11), 205 (2017)
Guo, M., et al.: LongT5: efficient text-to-text transformer for long sequences (2021). arXiv preprint arXiv:2112.07916
pszemraj/long-t5-tglobal-base-16384-book-summary \(\cdot \) Hugging Face. https://huggingface.co/pszemraj/long-t5-tglobal-base-16384-book-summary. Accessed 4 Feb 2023
Raffel, C., et al.: Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21(1), 5485–5551 (2020)
Blekanov, I.S., Tarasov, N., Bodrunova, S.S.: Transformer-based abstractive summarization for reddit and Twitter: single posts vs. comment pools in three languages. Future Internet 14(3), 69 (2022)
Acknowledgements
This research has been supported in full by Russian Science Foundation, grant 21-18-00454 (2021–2023).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Blekanov, I.S., Tarasov, N., Bodrunova, S.S., Sergeev, S.L. (2023). Mapping Opinion Cumulation: Topic Modeling-Based Dynamic Summarization of User Discussions on Social Networks. In: Coman, A., Vasilache, S. (eds) Social Computing and Social Media. HCII 2023. Lecture Notes in Computer Science, vol 14025. Springer, Cham. https://doi.org/10.1007/978-3-031-35915-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-35915-6_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-35914-9
Online ISBN: 978-3-031-35915-6
eBook Packages: Computer ScienceComputer Science (R0)