Skip to main content

Towards Understanding URL Resources in Recent Sina Weibo

  • Conference paper
  • First Online:
Web Information Systems Engineering – WISE 2016 (WISE 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10041))

Included in the following conference series:

  • 1264 Accesses

Abstract

With the rapid development of Internet, micro-blog service has become the fastest growing Internet application, where URLs play an important role in the social network. However, the studies on analyzing the URL resources especially for Chinese micro-blog system are extremely scarce. In this paper, we construct a corpus which contains the dissemination and classification information about URLs in Sina Weibo. Then we focus on the typical questions who publishes the URLs, what the URLs point to and how the URLs are disseminated and answer all the questions above by analyzing a recent Sina Weibo corpus. We find that verified users tend to publish about twice the amount of URLs as non-verfied users; Video URLs are more easily to disseminate in Sina Weibo. Our findings provide insights on downstream IR applications such as search engine and recommender systems effectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://aminer.org/billboard/Influencelocality.

  2. 2.

    Sina API. http://open.weibo.com/.

  3. 3.

    http://www.dmoz.org/.

References

  1. Suh, B., Hong, L., Pirolli, P., et al.: Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. In: 2010 IEEE Second International Conference on Social computing (socialcom), pp. 177–184. IEEE (2010)

    Google Scholar 

  2. Liu, Y., Kliman-Silver, C., Mislove, A.: The tweets they are a-changin: evolution of twitter users and behavior. In: International AAAI Conference on Weblogs and Social Media (ICWSM), vol. 13, p. 55 (2014)

    Google Scholar 

  3. Antoniades, D., Polakis, I., Kontaxis, G., et al.: we. b: The web of short URLs. In: Proceedings of the 20th International Conference on World Wide Web, pp. 715–724. ACM (2011)

    Google Scholar 

  4. Kandylas, V., Dasdan, A.: The utility of tweeted URLs for web search. In: Proceedings of the 19th International Conference on World Wide Web, pp. 1127–1128. ACM (2010)

    Google Scholar 

  5. Wang, Y., Tao, H., Cao, J., Wu, Z.: Understanding user behavior through URL analysis in sina tweets. In: Huang, Z., Liu, C., He, J., Huang, G. (eds.) WISE 2013. LNCS, vol. 8182, pp. 98–108. Springer, Heidelberg (2014). doi:10.1007/978-3-642-54370-8_9

    Chapter  Google Scholar 

  6. Odijk, D., White, R.W., Hassan Awadallah, A., et al.: Struggling and success in web search. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 1551–1560. ACM (2015)

    Google Scholar 

  7. Ritter, A., Etzioni, O., Clark, S.: Open domain event extraction from twitter. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1104–1112. ACM (2012)

    Google Scholar 

  8. Spina, D., Gonzalo, J., Amigó, E.: Discovering filter keywords for company name disambiguation in twitter. Expert Syst. Appl. 40(12), 4986–5003 (2013)

    Article  Google Scholar 

  9. Cheng, X., Dale, C., Liu, J.: Statistics and social network of youtube videos. In: 16th International Workshop on Quality of Service, IWQoS 2008, pp. 229–238. IEEE (2008)

    Google Scholar 

  10. Lee, S., Kim, J.: Warningbird: a near real-time detection system for suspicious urls in twitter stream. IEEE Trans. Dependable Secure Comput. 10(3), 183–195 (2013)

    Article  Google Scholar 

  11. Klien, F., Strohmaier, M.: Short links under attack:geographical analysis of spam in a URL shortener network. In: Proceedings of the 23rd ACM Conference on Hypertext and Social Media, pp. 83–88. ACM (2012)

    Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (grant Nos. 61402466 and 61572494) and the Strategic Priority Research Program of the Chinese Academy of Sciences (grant No. XDA06030200).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peng Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Wan, Y., Li, P., Li, R., Zhou, M., Ye, Y., Wang, B. (2016). Towards Understanding URL Resources in Recent Sina Weibo. In: Cellary, W., Mokbel, M., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2016. WISE 2016. Lecture Notes in Computer Science(), vol 10041. Springer, Cham. https://doi.org/10.1007/978-3-319-48740-3_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48740-3_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48739-7

  • Online ISBN: 978-3-319-48740-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics