skip to main content
10.1145/2637748.2638428acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesi-knowConference Proceedingsconference-collections
research-article

Predicting the stability of user interaction ties in Twitter

Published: 16 September 2014 Publication History

Abstract

In this paper, we analyze the stability of user interaction within Twitter focusing on link decay prediction: for a tweet created by one user mentioning another user we study the task of predicting the decay of the corresponding interaction link over time. For this task, we employ the history of timestamped mention interactions between both users as time series features. We also tackle the problem of efficiently balancing a large dataset with a skewed class distribution, which arises naturally in our context. The proposed impurity-based supervised sampling (ISS) approach balances the data in one pass by removing trivial training data of the overrepresented class. Our approach is evaluated using the well known Twitter dump of 2009 [25]. We show, that ISS outperforms down-sampling with regard to the resulting predictor performance.

References

[1]
G. E. P. Box and G. Jenkins. Time Series Analysis, Forecasting and Control. Holden-Day, Incorporated, 1990.
[2]
M. L. Calle and V. Urrea. Letter to the editor: Stability of random forest importance measures. Briefings in Bioinformatics, 2010.
[3]
N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer. SMOTE: Synthetic Minority Over-sampling Technique. Journal of Art. Int. Research, 16:321--357, 2002.
[4]
R. I. M. Dunbar. Coevolution of neocortical size, group size and language in humans. Behavioral and Brain Sciences, 16(4):681--735, 1993.
[5]
Y.-S. Jeong, M. K. Jeong, and O. A. Omitaomu. Weighted Dynamic Time Warping for Time Series Classification. Pattern Recogn., 44(9):2231--2240, Sept. 2011.
[6]
A. Jovic, K. Brkic, and N. Bogunovic. Decision Tree Ensembles in Biomedical Time-Series Classification. In Pattern Recognition, volume 7476 of Lecture Notes in Computer Science, pages 408--417. Springer, 2012.
[7]
D. Kempe, J. Kleinberg, and E. Tardos. Maximizing the Spread of Influence through a Social Network. In Proc. KDD, pages 137--146. ACM Press, 2003.
[8]
E. Keogh and C. A. Ratanamahatana. Exact Indexing of Dynamic Time Warping. Knowl. Inf. Syst., 7(3):358--386, 2005.
[9]
F. Kivran-Swaine, P. Govindan, and M. Naaman. The Impact of Network Structure on Breaking Ties in Online Social Networks: Unfollowing on Twitter. In Proc. CHI, pages 1101--1104, New York, NY, USA, 2011. ACM Press.
[10]
M. Kubat and S. Matwin. Addressing the Curse of Imbalanced Training Sets: One-Sided Selection. In Proc. ICML, pages 179--186. Morgan Kaufmann, 1997.
[11]
H. Kwak, H. Chun, and S. Moon. Fragile Online Relationship: A First Look at Unfollow Dynamics in Twitter. In Proc. CHI, pages 1091--1100, New York, NY, USA, 2011. ACM Press.
[12]
J. Lehmann, B. Gonçalves, J. J. Ramasco, and C. Cattuto. Dynamical Classes of Collective Attention in Twitter. In Proc. WWW, pages 251--260, New York, NY, USA, 2012. ACM Press.
[13]
D. Liben-Nowell and J. Kleinberg. The link-prediction problem for social networks. J. Am. Soc. Inf. Sci. Technol., 58(7):1019--1031, May 2007.
[14]
D. Lindsay and S. Cox. Effective Probability Forecasting for Time Series Data Using Standard Machine Learning Techniques. In Pattern Recognition and Data Mining, volume 3686 of Lecture Notes in Computer Science, pages 35--44. Springer, 2005.
[15]
G. Louppe, L. Wehenkel, A. Sutera, and P. Geurts. Understanding Variable Importances in Forests of Randomized Trees. In NIPS, pages 431--439, 2013.
[16]
B.-E. Macek and M. Atzmueller. Visualizing the Impact of Time Series Data for Predicting User Interactions. In Proc. ASONAM, pages 1477--1478, New York, NY, USA, 2013. ACM Press.
[17]
M. McPherson, L. Smith-Lovin, and J. M. Cook. Birds of a Feather: Homophily in Social Networks. Annual Review of Sociology, 27(1):415--444, 2001.
[18]
F. Mitzlaff, M. Atzmueller, D. Benz, A. Hotho, and G. Stumme. User-Relatedness and Community Structure in Social Interaction Networks. CoRR/abs, 1309.3888, 2013.
[19]
F. Mitzlaff, M. Atzmueller, G. Stumme, and A. Hotho. Semantics of User Interaction in Social Media. In Complex Networks (Proc. CompleNet 2013), pages 13--25. Springer, Heidelberg, Germany, 2013.
[20]
T. Raeder, O. Lizardo, D. Hachen, and N. V. Chawla. Predictors of Short-Term Decay of Cell Phone Contacts in a Large Scale Communication Network. Social Networks, 33(4):245--257, 2011.
[21]
D. M. Romero, B. Meeder, and J. Kleinberg. Differences in the Mechanics of Information Diffusion Across Topics: Idioms, Political Hashtags, and Complex Contagion on Twitter. In Proc. WWW, pages 695--704, New York, NY, USA, 2011. ACM Press.
[22]
C. Scholz, M. Atzmueller, A. Barrat, C. Cattuto, and G. Stumme. New Insights and Methods For Predicting Face-To-Face Contacts. In Proc. ICWSM, Palo Alto, CA, USA, 2013. AAAI Press.
[23]
C. Scholz, M. Atzmueller, and G. Stumme. On the Predictability of Human Contacts: Influence Factors and the Strength of Stronger Ties. In Proc. SocialCom, 2012.
[24]
D. Wilson and T. Martinez. Reduction Techniques for Instance-Based Learning Algorithms. Machine Learning, 38(3):257--286, 2000.
[25]
J. Yang and J. Leskovec. Patterns of Temporal Variation in Online Media. In Proc. WSDM, pages 177--186, New York, NY, USA, 2011. ACM Press.
[26]
H. Zhang, M. Korayem, E. You, and D. J. Crandall. Beyond Co-Occurrence: Discovering and Visualizing Tag Relationships From Geo-Spatial and Temporal Similarities. In Proc. WSDM, pages 33--42, New York, NY, USA, 2012. ACM Press.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
i-KNOW '14: Proceedings of the 14th International Conference on Knowledge Technologies and Data-driven Business
September 2014
262 pages
ISBN:9781450327695
DOI:10.1145/2637748
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 September 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. class balancing
  2. link decay
  3. link prediction
  4. machine learning
  5. social network analysis
  6. user interactions

Qualifiers

  • Research-article

Conference

i-KNOW '14

Acceptance Rates

i-KNOW '14 Paper Acceptance Rate 25 of 73 submissions, 34%;
Overall Acceptance Rate 77 of 238 submissions, 32%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 88
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media