Abstract
We present a methodology for identifying user communities on Twitter, by defining a number of similarity metrics based on their shared content, following relationships and interactions. We then introduce a novel method based on latent Dirichlet allocation to extract user clusters discussing interesting local topics and propose a methodology to eliminate trivial topics. In order to evaluate the methodology, we experiment with a real-world dataset created using the Twitter Searching API.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ding, Y.: Community detection: Topological vs. topical. Journal of Informetrics 5(4), 498–514 (2011)
Li, D., Ding, Y., Shuai, X., Bollen, J., Tang, J., Chen, S., Zhu, J., Rocha, G.: Adding community and dynamic to topic models. Journal of Informetrics 6(2), 237–253 (2012)
Ruan, Y., Fuhry, D., Parthasarathy, S.: Efficient community detection in large networks using content and links. In: 22nd International Conference on World Wide Web, pp. 1089–1098 (2013)
Lim, K.H., Datta, A.: Tweets Beget Propinquity: Detecting Highly Interactive Communities on Twitter Using Tweeting Links. IEEE/WIC/ACM International Conference on Web Intelligence 1, 214–221 (2012)
Zhao, Z., Feng, S., Wang, Q., Huang, J.Z., Williams, G.J., Fan, J.: Topic oriented community detection through social objects and link analysis in social networks. Knowledge-Based Systems 26, 164–173 (2012)
Goel, A., Sharma, A., Wang, D., Yin, Z.: Discovering similar users on twitter. In: 11th Workshop on Mining and Learning with Graphs, Chicago, Illinois (2013)
Zhang, Y., Wu, Y., Yang, Q.: Community Discovery in Twitter Based on User Interests. Journal of Computational Information Systems 8(3), 991–1000 (2012)
Xie, J., Kelley, S., Szymanski, B.K.: Overlapping Community Detection in Networks: The State-of-the-art and Comparative Study. ACM Computing Surveys 45(4), 1–35 (2013)
Yang, J., McAuley, J.J., Leskovec, J.: Community detection in networks with node attributes. In: IEEE 13th International Conference on Data Mining, pp. 1151–1156 (2013)
Salton, G., Wong, A., Yang, C.S.: A Vector Space Model for Automatic Indexing. Communications of the ACM 18(11), 613–620 (1975)
Frey, B.J., Dueck, D.: Clustering by Passing Messages Between Data Points. Science 315, 972–976 (2007)
Sander, J., Qin, X., Lu, Z., Niu, N., Kovarsky, A.: Automatic extraction of clusters from hierarchical clustering representations. 7th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining. LNCS, vol. 2637, pp. 75–87. Springer, Heidelberg (2003)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet Allocation. Journal of Machine Learning Research 3, 993–1022 (2003)
Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Communications in Statistics-Simulation and Computation 3(1), 1–27 (1974)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Vathi, E., Siolas, G., Stafylopatis, A. (2015). Mining Interesting Topics in Twitter Communities. In: Núñez, M., Nguyen, N., Camacho, D., Trawiński, B. (eds) Computational Collective Intelligence. Lecture Notes in Computer Science(), vol 9329. Springer, Cham. https://doi.org/10.1007/978-3-319-24069-5_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-24069-5_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24068-8
Online ISBN: 978-3-319-24069-5
eBook Packages: Computer ScienceComputer Science (R0)