Abstract
Visualization is an important but tough way to make sense of large scale dataset. In this paper, we propose a graph based method to visualize microblog data. In our scheme, the graph is constructed using the content similarities between data which is more robust than the widely used data relationships. Given a targeted dataset, we first adopt a duplicates removal strategy to reduce the size of the data and a subset is randomly sampled for visualization. Then a multilevel graph layout with a heat map is applied to generate an interactive interface which allows users to move on and scale the layout. In this way, different granularities of summarization information can be immediately presented to users when a certain area is specified in the interface; meanwhile more detailed knowledge on the selected area can be shown in nearly real time by leveraging a hash based microblog retrieval approach. Experiments are conducted on a Brand-Social-Net dataset which contains 3,000,000 microblogs and the experimental results show that, with our visualization method, some meaningful patterns of dataset can be found easily.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
A Chinese word segmentation python library.
References
Doyle, M., Smeaton, A.F., Bermingham, A.: TriVis: visualising multivariate data from sentiment analysis (2014)
Gansner, E., Hu, North, S.: Visualizing streaming text data with dynamic maps (2012). arXiv preprint arXiv:1206.3980
Paulovich, F.V., Minghim, R.: Hipp: A novel hierarchical point placement strategy and its application to the exploration of document collections. IEEE Trans. Visual. Comput. Graph. 14(6), 1229–1236 (2008)
James, A., Van Ham, F., Krishnan, N.: Ask-graphview: A large scale graph visualization system. IEEE Trans. Visual. Comput. Graph. 12(5), 669–676 (2006)
Gansner, E.R., Hu, Y., Kobourov, S.: GMap: visualizing graphs and clusters as maps. In: 2010 IEEE Pacific Visualization Symposium (PacificVis). IEEE (2010)
Gretarsson, B., et al.: Smallworlds: visualizing social recommendations. Comput. Graph. Forum 29(3), 833–842 (2010). Blackwell Publishing Ltd
Jing, L., Yu, X., Wan, W.: Visualization research of the tweet diffusion in the microblog network. In: 2014 International Conference on Audio, Language and Image Processing (ICALIP). IEEE (2014)
Changbo, W., et al.: Analyzing internet topics by visualizing microblog retweeting. J. Vis. Lang. Comput. 28, 122–133 (2015)
Ren, D., et al.: WeiboEvents: a crowd sourcing weibo visual analytic system. In: 2014 IEEE Pacific Visualization Symposium (PacificVis). IEEE (2014)
Nan, C., et al.: Facetatlas: multifaceted visualization for rich text corpora. IEEE Trans. Visual. Comput. Graphics 16(6), 1172–1181 (2010)
Datar, M., et al.: Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the Twentieth Annual Symposium on Computational Geometry. ACM (2004)
Charikar, M.S.: Similarity estimation techniques from rounding algorithms. In: Proceedings of the Thiry-Fourth Annual ACM Symposium on Theory of Computing. ACM (2002)
Andreas, N.: Modularity clustering is force-directed layout. Phys. Rev. E 79(2), 026102 (2009)
Manku, G.S., Jain, A., Das Sarma, A.: Detecting near-duplicates for web crawling. In: Proceedings of the 16th International Conference on World Wide Web. ACM (2007)
Walshaw, C.: A multilevel algorithm for force-directed graph drawing. In: Marks, J. (ed.) GD 2000. LNCS, vol. 1984, pp. 171–182. Springer, Heidelberg (2001)
Yue, G., et al.: Brand data gathering from live social media streams. In: Proceedings of International Conference on Multimedia Retrieval. ACM (2014)
Acknowledgements
This work was partially supported by National Natural Science Funds of China (61173104, 61472059, 61428202).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Guan, Y., Meng, K., Li, H. (2015). Graph Based Visualization of Large Scale Microblog Data. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9315. Springer, Cham. https://doi.org/10.1007/978-3-319-24078-7_46
Download citation
DOI: https://doi.org/10.1007/978-3-319-24078-7_46
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24077-0
Online ISBN: 978-3-319-24078-7
eBook Packages: Computer ScienceComputer Science (R0)