Measuring and Visualizing Interest Similarity between Microblog Users

Tang, Jiayu; Liu, Zhiyuan; Sun, Maosong

doi:10.1007/978-3-642-38562-9_49

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7923)

Included in the following conference series:

International Conference on Web-Age Information Management

Abstract

Microblog users share their life status and opinions via microposts, which usually reflect their interests. Measuring interest similarity between microblog users has thus received increasing attention from both academia and industry. In this paper, we design a novel framework for measuring and visualizing user interest similarity. The framework consists of four components: (1) Interest representation. We extract keywords from microposts to represent user interests. (2) Interest similarity computation. Based on the interest keywords, we design a ranking framework for measuring the interest similarity. (3) Interest similarity visualization. We propose an integrated word cloud scenario to provide a novel visual representation of user interest similarity. (4) Annotation data collection. We design an interactive game for microblog users to collect user annotations, which are used as the training dataset for our similarity measuring method. We carry out experiments on Sina Weibo, the largest microblogging service in China, and get encouraging results.
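
As a rough illustration of the first two components, the sketch below builds a keyword profile per user and compares two profiles. It is not the method of the paper: the abstract describes a learned ranking framework trained on game-collected annotations, whereas this sketch substitutes plain cosine similarity over keyword counts. The function names (`keyword_profile`, `cosine_similarity`, `shared_keywords`) and the whitespace tokenizer are assumptions made for the example; Chinese microposts would need a word segmenter first.

```python
from collections import Counter
from math import sqrt


def keyword_profile(microposts, stopwords=frozenset()):
    """Toy interest profile (assumption): lowercase whitespace tokens -> counts."""
    counts = Counter()
    for post in microposts:
        for token in post.lower().split():
            if token not in stopwords:
                counts[token] += 1
    return counts


def cosine_similarity(profile_a, profile_b):
    """Cosine similarity between two sparse keyword-count vectors."""
    shared = set(profile_a) & set(profile_b)
    dot = sum(profile_a[k] * profile_b[k] for k in shared)
    norm_a = sqrt(sum(v * v for v in profile_a.values()))
    norm_b = sqrt(sum(v * v for v in profile_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty profile is treated as similar to nothing
    return dot / (norm_a * norm_b)


def shared_keywords(profile_a, profile_b):
    """Keywords both users mention, weighted by the smaller of the two counts."""
    return {k: min(profile_a[k], profile_b[k])
            for k in set(profile_a) & set(profile_b)}


# Hypothetical users and posts, invented for the example.
alice = keyword_profile(["football match tonight", "love football"])
bob = keyword_profile(["football transfer news"])
similarity = cosine_similarity(alice, bob)
common = shared_keywords(alice, bob)
```

The `shared_keywords` output is the kind of input a joint word cloud of two users could be drawn from, with the weight controlling font size; the actual layout proposed in the paper is not reproduced here.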

Author information

Authors and Affiliations

State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Jiayu Tang, Zhiyuan Liu & Maosong Sun

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China
Jianyong Wang
Management Science and Information Systems Department, Rutgers, the State University of New Jersey, 1, Washington Park, 07102, Newark, NJ, USA
Hui Xiong
Department of Information Engineering, Nagoya University, 464-8601, Nagoya, Japan
Yoshiharu Ishikawa
Department of Computer Science, Hong Kong Baptist University, Hong Kong
Jianliang Xu
School of Information Science and Engineering, Yanshan University, Qinhuangdao, China
Junfeng Zhou

About this paper

Cite this paper

Tang, J., Liu, Z., Sun, M. (2013). Measuring and Visualizing Interest Similarity between Microblog Users. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds) Web-Age Information Management. WAIM 2013. Lecture Notes in Computer Science, vol 7923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38562-9_49

DOI: https://doi.org/10.1007/978-3-642-38562-9_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38561-2
Online ISBN: 978-3-642-38562-9
eBook Packages: Computer Science, Computer Science (R0)
