Skip to main content

Can Cross-Lingual Information Cascades Be Predicted on Twitter?

  • Conference paper
  • First Online:
Social Informatics (SocInfo 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10539))

Included in the following conference series:

Abstract

Social network services (SNSs) have provided many opportunities for sharing information and knowledge in various languages due to their international popularity. Understanding the information flow between different countries and languages on SNSs can not only provide better insights into global connectivity and sociolinguistics, but is also beneficial for practical applications such as globally-influential event detection and global marketing. In this study, we characterized and attempted to detect influential cross-lingual information cascades on Twitter. With a large-scale Twitter dataset, we conducted statistical analysis of the growth and language distribution of information cascades. Based on this analysis, we propose a feature-based model to detect influential cross-lingual information cascades and show its effectiveness in predicting the growth and language distribution of cascades in the early stage.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://dev.twitter.com/overview/api.

  2. 2.

    https://github.com/shuyo/language-detection.

  3. 3.

    Liblinear: https://www.csie.ntu.edu.tw/~cjlin/liblinear/.

  4. 4.

    Multi-output regression: http://scikit-learn.org/stable/modules/multiclass.html.

  5. 5.

    https://en.wikipedia.org/wiki/Help:Interlanguage_links.

References

  1. Bakshy, E., Karrer, B., Adamic, L.A.: Social influence and the diffusion of user-created content. In: Proceedings of the 10th ACM conference on Electronic commerce, pp. 325–334. ACM (2009)

    Google Scholar 

  2. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)

    MATH  Google Scholar 

  3. Borge-Holthoefer, J., Baños, R.A., González-Bailón, S., Moreno, Y.: Cascading behaviour in complex socio-technical networks. J. Complex Netw. 1(1), 3–24 (2013)

    Article  Google Scholar 

  4. Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, pp. 925–936. ACM (2014)

    Google Scholar 

  5. Eleta, I., Golbeck, J.: Bridging languages in social networks: how multilingual users of twitter connect language communities? Proc. Am. Soc. Inf. Sci. Technol. 49(1), 1–4 (2012)

    Article  Google Scholar 

  6. Goel, S., Watts, D.J., Goldstein, D.G.: The structure of online diffusion networks. In: Proceedings of the 13th ACM Conference on Electronic Commerce, pp. 623–638. ACM (2012)

    Google Scholar 

  7. Halavais, A.: National borders on the world wide web. New Media Soc. 2(1), 7–28 (2000)

    Article  Google Scholar 

  8. Hale, S.A.: Global connectivity and multilinguals in the twitter network. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 833–842. ACM (2014)

    Google Scholar 

  9. Herring, S.C., Paolillo, J.C., Ramos-Vielba, I., Kouper, I., Wright, E., Stoerger, S., Scheidt, L.A., Clark, B.: Language networks on livejournal. In: 40th Annual Hawaii International Conference on System Sciences, 2007, HICSS 2007, p. 79. IEEE (2007)

    Google Scholar 

  10. Hong, L., Convertino, G., Chi, E.H.: Language matters in twitter: a large scale study. In: Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM 2011) (2011)

    Google Scholar 

  11. Kupavskii, A., Umnov, A., Gusev, G., Serdyukov, P.: Predicting the audience size of a tweet. In: Proceedings of the 7th International AAAI Conference on Weblogs and Social Media (2013)

    Google Scholar 

  12. Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web, pp. 591–600. ACM (2010)

    Google Scholar 

  13. Leskovec, J., McGlohon, M., Faloutsos, C., Glance, N.S., Hurst, M.: Patterns of cascading behavior in large blog graphs. In: Proceedings of the 2007 SIAM International Conference on Data Mining, vol. 7, pp. 551–556. SIAM (2007)

    Google Scholar 

  14. Papalexakis, E., Doğruöz, A.S.: Understanding multilingual social networks in online immigrant communities. In: Proceedings of the 24th International Conference on World Wide Web, pp. 865–870. ACM (2015)

    Google Scholar 

  15. Reed, M.: Who owns ellen’s oscar selfie: deciphering rights of attribution concerning user generated content on social media. J. Marshall Rev. Intell. Prop. L. 14, 564 (2014)

    Google Scholar 

  16. Townsend, L.: How much has the ice bucket challenge achieved? BBC News Mag. (2014)

    Google Scholar 

Download references

Acknowledgments

This work was supported by the Research and Development on Real World Big Data Integration and Analysis program of RIKEN, and the Ministry of Education, Culture, Sports, Science, and Technology, JAPAN.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongshan Jin .

Editor information

Editors and Affiliations

A List of Features Used for Learning

A List of Features Used for Learning

Root user features

Whether a user is verified number of friends/followers/followees

Number of listed/statues/favorites

Number of original/total tweets

Number of reshares

Number of reshared tweets

Resharer features

Ratio of k resharers who are verified

Average/max number of friends of k resharers

Average/max number of followers of k resharers

Average/max number of listed of k resharers

Average/max number of statues of k resharers

Average/max number of favorites of k resharers

Average/max number of original tweets of k resharers

Average/max number of total tweets of k resharers

Average/max number of reshares of k resharers

Average/max number of reshared tweets of k resharers

Content features

Language of root tweet

Whether a hashtag/mention/url is contained

Topic distribution of the root tweet

Structural features

Out-degree of root user and kth resharers

In-degree of root user and kth reshares

Number of common followers between the root user and kth resharers

Total number of unique followers of the root user and k resharers

Ratio of k resharers who are not first-degree connections of the root user

Temporal features

Time interval between the root user and kth resharers

Time interval between \(k-1th\) resharers and kth resharers

Average time interval between first half of reshares

Average time interval between second half of reshares

Language features

Main language of root user

Whether a root user is a multilingual user

Usage rate of main language of the root user

Whether the follower community of the root user is multilingual

Whether the followee community of the root user is multilingual

Language distribution of tweets of the root user

Main language distribution of followers of the root user

Main language distribution of followees of the root user

Cross-lingual ratio of k resharers

Ratio of k resharers who are multilingual users

Ratio of k resharers whose follower community are multilingual users

Ratio of k resharers whose followee community are multilingual users

Average main language distribution of k resharers’ tweets

Average main language distribution of k resharers’ followers

Average main language distribution of k resharers’ followees

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Jin, H., Toyoda, M., Yoshinaga, N. (2017). Can Cross-Lingual Information Cascades Be Predicted on Twitter?. In: Ciampaglia, G., Mashhadi, A., Yasseri, T. (eds) Social Informatics. SocInfo 2017. Lecture Notes in Computer Science(), vol 10539. Springer, Cham. https://doi.org/10.1007/978-3-319-67217-5_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67217-5_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67216-8

  • Online ISBN: 978-3-319-67217-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics