Skip to main content

Graph Based Visualization of Large Scale Microblog Data

  • Conference paper
  • First Online:
Advances in Multimedia Information Processing -- PCM 2015 (PCM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9315))

Included in the following conference series:

  • 1876 Accesses

Abstract

Visualization is an important but tough way to make sense of large scale dataset. In this paper, we propose a graph based method to visualize microblog data. In our scheme, the graph is constructed using the content similarities between data which is more robust than the widely used data relationships. Given a targeted dataset, we first adopt a duplicates removal strategy to reduce the size of the data and a subset is randomly sampled for visualization. Then a multilevel graph layout with a heat map is applied to generate an interactive interface which allows users to move on and scale the layout. In this way, different granularities of summarization information can be immediately presented to users when a certain area is specified in the interface; meanwhile more detailed knowledge on the selected area can be shown in nearly real time by leveraging a hash based microblog retrieval approach. Experiments are conducted on a Brand-Social-Net dataset which contains 3,000,000 microblogs and the experimental results show that, with our visualization method, some meaningful patterns of dataset can be found easily.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    www.weibo.com

  2. 2.

    A Chinese word segmentation python library.

References

  1. Doyle, M., Smeaton, A.F., Bermingham, A.: TriVis: visualising multivariate data from sentiment analysis (2014)

    Google Scholar 

  2. Gansner, E., Hu, North, S.: Visualizing streaming text data with dynamic maps (2012). arXiv preprint arXiv:1206.3980

    Google Scholar 

  3. Paulovich, F.V., Minghim, R.: Hipp: A novel hierarchical point placement strategy and its application to the exploration of document collections. IEEE Trans. Visual. Comput. Graph. 14(6), 1229–1236 (2008)

    Article  Google Scholar 

  4. James, A., Van Ham, F., Krishnan, N.: Ask-graphview: A large scale graph visualization system. IEEE Trans. Visual. Comput. Graph. 12(5), 669–676 (2006)

    Article  Google Scholar 

  5. Gansner, E.R., Hu, Y., Kobourov, S.: GMap: visualizing graphs and clusters as maps. In: 2010 IEEE Pacific Visualization Symposium (PacificVis). IEEE (2010)

    Google Scholar 

  6. Gretarsson, B., et al.: Smallworlds: visualizing social recommendations. Comput. Graph. Forum 29(3), 833–842 (2010). Blackwell Publishing Ltd

    Article  Google Scholar 

  7. Jing, L., Yu, X., Wan, W.: Visualization research of the tweet diffusion in the microblog network. In: 2014 International Conference on Audio, Language and Image Processing (ICALIP). IEEE (2014)

    Google Scholar 

  8. Changbo, W., et al.: Analyzing internet topics by visualizing microblog retweeting. J. Vis. Lang. Comput. 28, 122–133 (2015)

    Article  Google Scholar 

  9. Ren, D., et al.: WeiboEvents: a crowd sourcing weibo visual analytic system. In: 2014 IEEE Pacific Visualization Symposium (PacificVis). IEEE (2014)

    Google Scholar 

  10. Nan, C., et al.: Facetatlas: multifaceted visualization for rich text corpora. IEEE Trans. Visual. Comput. Graphics 16(6), 1172–1181 (2010)

    Article  Google Scholar 

  11. Datar, M., et al.: Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the Twentieth Annual Symposium on Computational Geometry. ACM (2004)

    Google Scholar 

  12. Charikar, M.S.: Similarity estimation techniques from rounding algorithms. In: Proceedings of the Thiry-Fourth Annual ACM Symposium on Theory of Computing. ACM (2002)

    Google Scholar 

  13. Andreas, N.: Modularity clustering is force-directed layout. Phys. Rev. E 79(2), 026102 (2009)

    Google Scholar 

  14. Manku, G.S., Jain, A., Das Sarma, A.: Detecting near-duplicates for web crawling. In: Proceedings of the 16th International Conference on World Wide Web. ACM (2007)

    Google Scholar 

  15. Walshaw, C.: A multilevel algorithm for force-directed graph drawing. In: Marks, J. (ed.) GD 2000. LNCS, vol. 1984, pp. 171–182. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  16. Yue, G., et al.: Brand data gathering from live social media streams. In: Proceedings of International Conference on Multimedia Retrieval. ACM (2014)

    Google Scholar 

Download references

Acknowledgements

This work was partially supported by National Natural Science Funds of China (61173104, 61472059, 61428202).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haojie Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Guan, Y., Meng, K., Li, H. (2015). Graph Based Visualization of Large Scale Microblog Data. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9315. Springer, Cham. https://doi.org/10.1007/978-3-319-24078-7_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24078-7_46

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24077-0

  • Online ISBN: 978-3-319-24078-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics