skip to main content
10.1145/2487575.2487685acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Estimating sharer reputation via social data calibration

Published: 11 August 2013 Publication History

Abstract

Online social networks have become important channels for users to share content with their connections and diffuse information. Although much work has been done to identify socially influential users, the problem of finding "reputable" sharers, who share good content, has received relatively little attention. Availability of such reputation scores can be useful or various applications like recommending people to follow, procuring high quality content in a scalable way, creating a content reputation economy to incentivize high quality sharing, and many more. To estimate sharer reputation, it is intuitive to leverage data that records how recipients respond (through clicking, liking, etc.) to content items shared by a sharer. However, such data is usually biased --- it has a selection bias since the shared items can only be seen and responded to by users connected to the sharer in most social networks, and it has a response bias since the response is usually influenced by the relationship between the sharer and the recipient (which may not indicate whether the shared content is good). To correct for such biases, we propose to utilize an additional data source that provides unbiased goodness estimates for a small set of shared items, and calibrate biased social data through a novel multi-level hierarchical model that describes how the unbiased data and biased data are jointly generated according to sharer reputation scores. The unbiased data also provides the ground truth for quantitative evaluation of different methods. Experiments based on such ground-truth data show that our proposed model significantly outperforms existing methods that estimate social influence using biased social data.

References

[1]
D. Agarwal, B.-C. Chen, and P. Elango. Explore/exploit schemes for web content optimization. In ICDM, 2009.
[2]
A. Anderson, D. P. Huttenlocher, J. M. Kleinberg, and J. Leskovec. Discovering value from community activity on focused question answering sites: a case study of stack overflow. In KDD, 2012.
[3]
A. Anderson, D. P. Huttenlocher, J. M. Kleinberg, and J. Leskovec. Effects of user similarity in social media. In WSDM, 2012.
[4]
S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 1998.
[5]
M. Cha, H. Haddadi, F. Benevenuto, and P. K. Gummadi. Measuring user influence in twitter: The million follower fallacy. In ICWSM, 2010.
[6]
B.-C. Chen, A. Dasgupta, X. Wang, and J. Yang. Vote calibration in community question-answering systems. In SIGIR, 2012.
[7]
B.-C. Chen, J. Guo, B. L. Tseng, and J. Yang. User reputation in a comment rating environment. In KDD, 2011.
[8]
D. J. Crandall, D. Cosley, D. P. Huttenlocher, J. M. Kleinberg, and S. Suri. Feedback effects between similarity and social influence in online communities. In KDD, 2008.
[9]
C. Danescu-Niculescu-Mizil, M. Gamon, and S. T. Dumais. Mark my words!: linguistic style accommodation in social media. In WWW, 2011.
[10]
K. El-Arini, U. Paquet, R. Herbrich, J. V. Gael, and B. A. y Arcas. Transparent user models for personalization. In KDD, 2012.
[11]
R.-E. Fan, K.-W. Chang, C.-J. Hsieh, X.-R. Wang, and C.-J. Lin. Liblinear: A library for large linear classification. JMLR, 2008.
[12]
W.-S. Hwang, H.-J. Lee, S.-W. Kim, and M. Lee. On using category experts for improving the performance and accuracy in recommender systems. In CIKM, 2012.
[13]
U. Kang, S. Papadimitriou, J. Sun, and H. Tong. Centralities in large networks: Algorithms and observations. In SDM, 2011.
[14]
E. Katz and P. Lazarsfeld. Personal influence: the part played by people in the flow of mass communications. Foundations of communications research. 1955.
[15]
D. Kempe, J. M. Kleinberg, and É. Tardos. Maximizing the spread of influence through a social network. In KDD, 2003.
[16]
Y. Kim and K. Shim. Twitobi: A recommendation system for twitter using probabilistic modeling. In ICDM, 2011.
[17]
J. M. Kleinberg. Authoritative sources in a hyperlinked environment. JACM, 1999.
[18]
P. Li, J. X. Yu, H. Liu, J. He, and X. Du. Ranking individuals and groups by influence propagation. In PAKDD, 2011.
[19]
B. Liu and L. Zhang. A survey of opinion mining and sentiment analysis. In Mining Text Data. 2012.
[20]
L. Liu, J. Tang, J. Han, M. Jiang, and S. Yang. Mining topic-level influence in heterogeneous networks. In CIKM, 2010.
[21]
A. S. Maiya and T. Y. Berger-Wolf. Online sampling of high centrality individuals in social networks. In PAKDD, 2010.
[22]
A. Pal and S. Counts. Identifying topical authorities in microblogs. In WSDM, 2011.
[23]
B. A. Prakash and C. Faloutsos. Understanding and managing cascades on large graphs. PVLDB, 2012.
[24]
M. Sachan, D. Contractor, T. A. Faruquie, and L. V. Subramaniam. Using content and interactions for discovering communities in social networks. In WWW, 2012.
[25]
D. Sáez-Trumper, G. Comarela, V. A. F. Almeida, R. A. Baeza-Yates, and F. Benevenuto. Finding trendsetters in information networks. In KDD, 2012.
[26]
T. Sakai, D. Ishikawa, N. Kando, Y. Seki, K. Kuriyama, and C.-Y. Lin. Using graded-relevance metrics for evaluating community qa answer selection. In WSDM, 2011.
[27]
Y. R. Tausczik and J. W. Pennebaker. Participation in an online mathematics community: differentiating motivations to add. In CSCW, 2012.
[28]
G. Wang, Y. Zhao, X. Shi, and P. S. Yu. Magnet community identification on social networks. In KDD, 2012.
[29]
D. J. Watts and P. S. Dodds. Influentials, networks, and public opinion formation. Journal of Consumer Research, 2007.
[30]
J. Weng, E.-P. Lim, J. Jiang, and Q. He. Twitterrank: finding topic-sensitive influential twitterers. In WSDM, 2010.

Cited By

View all
  • (2020)Node Trust: an effective method to detect non-overlapping community in social networksModern Physics Letters B10.1142/S021798492150036635:01(2150036)Online publication date: 7-Dec-2020
  • (2017)An Influence Propagation View of PageRankACM Transactions on Knowledge Discovery from Data10.1145/304694111:3(1-30)Online publication date: 21-Mar-2017
  • (2016)A learning-based model for predicting information diffusion in social networks: Case of Twitter2016 International Conference on Control, Decision and Information Technologies (CoDIT)10.1109/CoDIT.2016.7593657(752-757)Online publication date: Apr-2016
  • Show More Cited By

Index Terms

  1. Estimating sharer reputation via social data calibration

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2013
    1534 pages
    ISBN:9781450321747
    DOI:10.1145/2487575
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 August 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. influential users
    2. sharer reputation

    Qualifiers

    • Research-article

    Conference

    KDD' 13
    Sponsor:

    Acceptance Rates

    KDD '13 Paper Acceptance Rate 125 of 726 submissions, 17%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Node Trust: an effective method to detect non-overlapping community in social networksModern Physics Letters B10.1142/S021798492150036635:01(2150036)Online publication date: 7-Dec-2020
    • (2017)An Influence Propagation View of PageRankACM Transactions on Knowledge Discovery from Data10.1145/304694111:3(1-30)Online publication date: 21-Mar-2017
    • (2016)A learning-based model for predicting information diffusion in social networks: Case of Twitter2016 International Conference on Control, Decision and Information Technologies (CoDIT)10.1109/CoDIT.2016.7593657(752-757)Online publication date: Apr-2016
    • (2016)Modeling Information Diffusion via Reputation EstimationProceedings, Part I, 27th International Conference on Database and Expert Systems Applications - Volume 982710.1007/978-3-319-44403-1_9(136-150)Online publication date: 5-Sep-2016

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media