Abstract
Local topic detection from spatio-temporal social media (e.g. Twitter) plays an important role in many applications, therefore, it has attracted a surge of research attention in recent years. However, most existing studies consider time and location separately, they often assume tweets are correlated as long as their time is adjacent or their location is neighboring. But in reality, only tweets posted at both adjacent time and neighboring location tend to talk about the same local topic. To address this issue, we propose a network based embedding model to capture the correlation between time and location, and jointly model spatio-temporal information and semantic information together. This embedding model can ensure that the generated keyword embeddings are semantically coherent and spatio-temporally close. Based on the keyword embeddings, we present a novel topic model to obtain high-quality local topics. This topic model presumes that each tweet is represented by only one topic and takes the background mode of words into consideration to address the concise and noisy problem of Twitter. The experiments demonstrate that the effectiveness and efficiency of our method have been improved significantly compared to the state-of-the-art existing methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Batmanghelich, K., Saeedi, A., Narasimhan, K., Gershman, S.: Nonparametric spherical topic modeling with word embeddings. In: ACL (2). The Association for Computer Linguistics (2016)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Chang, B., Park, Y., Park, D., Kim, S., Kang, J.: Content-aware hierarchical point-of-interest embedding model for successive POI recommendation. In: IJCAI, pp. 3301–3307. ijcai.org (2018)
Das, R., Zaheer, M., Dyer, C.: Gaussian LDA for topic models with word embeddings. In: ACL (1), pp. 795–804. The Association for Computer Linguistics (2015)
Ding, R., Nallapati, R., Xiang, B.: Coherence-aware neural topic modeling. In: EMNLP, pp. 830–836. Association for Computational Linguistics (2018)
Giannakopoulos, K., Chen, L.: Incremental and adaptive topic detection over social media. In: Pei, J., Manolopoulos, Y., Sadiq, S., Li, J. (eds.) DASFAA 2018. LNCS, vol. 10827, pp. 460–473. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91452-7_30
Gopal, S., Yang, Y.: Von mises-fisher clustering models. In: ICML. JMLR Workshop and Conference Proceedings, vol. 32, pp. 154–162. JMLR.org (2014)
Liao, L., He, X., Zhang, H., Chua, T.: Attributed social network embedding. IEEE Trans. Knowl. Data Eng. 30(12), 2257–2270 (2018)
Liu, H., Ge, Y., Zheng, Q., Lin, R., Li, H.: Detecting global and local topics via mining Twitter data. Neurocomputing 273, 120–132 (2018)
Liu, Y., Ester, M., Hu, B., Cheung, D.W.: Spatio-temporal topic models for check-in data. In: ICDM, pp. 889–894. IEEE Computer Society (2015)
Mahdaouy, A.E., El Alaoui, S.O., Gaussier, É.: Improving Arabic information retrieval using word embedding similarities. Int. J. Speech Technol. 21(1), 121–136 (2018)
Rosner, F., Hinneburg, A., Röder, M., Nettling, M., Both, A.: Evaluating topic coherence measures. CoRR abs/1403.6397 (2014)
Sizov, S.: GeoFolk: latent spatial semantics in web 2.0 social media. In: WSDM, pp. 281–290. ACM (2010)
Xu, G., Meng, Y., Chen, Z., Qiu, X., Wang, C., Yao, H.: Research on topic detection and tracking for online news texts. IEEE Access 7, 58407–58418 (2019)
Yan, X., Guo, J., Lan, Y., Cheng, X.: A biterm topic model for short texts. In: WWW, pp. 1445–1456. International World Wide Web Conferences Steering Committee/ACM (2013)
Yin, Z., Cao, L., Han, J., Zhai, C., Huang, T.S.: Geographical topic discovery and comparison. In: WWW, pp. 247–256. ACM (2011)
Zhang, C., et al.: TrioVecEvent: embedding-based online local event detection in geo-tagged tweet streams. In: KDD, pp. 595–604. ACM (2017)
Zhang, C., et al.: Regions, periods, activities: uncovering urban dynamics via cross-modal representation learning. In: WWW, pp. 361–370. ACM (2017)
Zhao, W.X., et al.: Comparing twitter and traditional media using topic models. In: Clough, P., et al. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20161-5_34
Acknowledgement
This work is supported by the National Key Research and Development Program of China, and National Natural Science Foundation of China (No. U163620068).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, J., Gao, N., Zhang, Y., Tu, C. (2019). Local Topic Detection Using Word Embedding from Spatio-Temporal Social Media. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Communications in Computer and Information Science, vol 1143. Springer, Cham. https://doi.org/10.1007/978-3-030-36802-9_67
Download citation
DOI: https://doi.org/10.1007/978-3-030-36802-9_67
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36801-2
Online ISBN: 978-3-030-36802-9
eBook Packages: Computer ScienceComputer Science (R0)