Estimating sharer reputation via social data calibration

Authors:

Bee-Chung Chen,

Deepak Agarwal

KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 59 - 67

https://doi.org/10.1145/2487575.2487685

Published: 11 August 2013

Abstract

Online social networks have become important channels for users to share content with their connections and diffuse information. Although much work has been done to identify socially influential users, the problem of finding "reputable" sharers, who share good content, has received relatively little attention. Such reputation scores can be useful for various applications, such as recommending people to follow, procuring high-quality content in a scalable way, creating a content reputation economy to incentivize high-quality sharing, and many more. To estimate sharer reputation, it is intuitive to leverage data that records how recipients respond (through clicking, liking, etc.) to content items shared by a sharer. However, such data is usually biased: it has a selection bias, since in most social networks the shared items can only be seen and responded to by users connected to the sharer, and it has a response bias, since the response is usually influenced by the relationship between the sharer and the recipient (which may not indicate whether the shared content is good). To correct for such biases, we propose to utilize an additional data source that provides unbiased goodness estimates for a small set of shared items, and to calibrate the biased social data through a novel multi-level hierarchical model that describes how the unbiased data and biased data are jointly generated according to sharer reputation scores. The unbiased data also provides the ground truth for quantitative evaluation of different methods. Experiments based on such ground-truth data show that our proposed model significantly outperforms existing methods that estimate social influence using biased social data.
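The calibration idea can be illustrated with a toy sketch. This is not the multi-level hierarchical model proposed in the paper, whose details the abstract does not give; it is a much simpler two-step stand-in, and every name in it (`fit_line`, `calibrated_reputation`, `prior_strength`, the input dictionaries) is a hypothetical choice made for illustration. It assumes each sharer has a biased social response rate, that a few sharers also have a small unbiased sample of rated items, and that a linear map from the biased rate to unbiased goodness is a reasonable first approximation.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:
        return my, 0.0  # no spread in x: fall back to a constant
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - b * mx, b


def calibrated_reputation(social, unbiased, prior_strength=5.0):
    """Toy calibration (an assumption, not the paper's model).

    social:   {sharer: (responses, impressions)}   biased social data
    unbiased: {sharer: (good_items, rated_items)}  small unbiased sample
    Returns {sharer: score in [0, 1]}.
    """
    rate = {s: r / n for s, (r, n) in social.items() if n > 0}
    # Step 1: learn how the biased response rate maps to unbiased goodness,
    # using only sharers present in both sources.
    both = [s for s in unbiased if s in rate and unbiased[s][1] > 0]
    if not both:
        raise ValueError("need at least one sharer in both data sources")
    a, b = fit_line([rate[s] for s in both],
                    [unbiased[s][0] / unbiased[s][1] for s in both])
    scores = {}
    for s, x in rate.items():
        prior = min(1.0, max(0.0, a + b * x))  # calibrated guess, clamped
        good, rated = unbiased.get(s, (0, 0))
        # Step 2: shrink the sharer's own unbiased rate toward the prior;
        # sharers with no unbiased data receive the prior itself.
        scores[s] = (good + prior_strength * prior) / (rated + prior_strength)
    return scores


if __name__ == "__main__":
    social = {"a": (10, 100), "b": (50, 100), "c": (30, 100)}
    unbiased = {"a": (1, 10), "b": (9, 10)}
    print(calibrated_reputation(social, unbiased))
```

In this sketch a sharer with no unbiased ratings (such as `c` above) is scored purely from the calibrated social signal, while sharers with unbiased ratings are pulled toward their own observed goodness rate as `rated` grows relative to `prior_strength`.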

Index Terms

Estimating sharer reputation via social data calibration
1. Information systems
  1. Information systems applications
    1. Data mining

Published In

KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

August 2013

1534 pages

ISBN:9781450321747

DOI:10.1145/2487575

Editors:
Rayid Ghani, University of Chicago
Ted E. Senator, SAIC
Paul Bradley, MethodCare, Inc.
Rajesh Parekh, Groupon
Jingrui He, Stevens Institute of Technology

General Chairs:
Robert L. Grossman, University of Chicago and Open Data Group
Ramasamy Uthurusamy, General Motors Corporation (retired)

Program Chairs:
Inderjit S. Dhillon, University of Texas
Yehuda Koren, Google

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

KDD '13: The 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 11 - 14, 2013

Chicago, Illinois, USA

Acceptance Rates

KDD '13 Paper Acceptance Rate 125 of 726 submissions, 17%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Cited By

Sheng J, Li Q, Wang B, Guan W, Dai J, Hu J, Zuo H (2020). Node Trust: an effective method to detect non-overlapping community in social networks. Modern Physics Letters B, 35:01 (2150036). Online publication date: 7-Dec-2020.
https://doi.org/10.1142/S0217984921500366

Liu Q, Xiang B, Yuan N, Chen E, Xiong H, Zheng Y, Yang Y (2017). An Influence Propagation View of PageRank. ACM Transactions on Knowledge Discovery from Data, 11:3 (1-30). Online publication date: 21-Mar-2017.
https://dl.acm.org/doi/10.1145/3046941

Hoang B, Chelghoum K, Kacem I (2016). A learning-based model for predicting information diffusion in social networks: Case of Twitter. 2016 International Conference on Control, Decision and Information Technologies (CoDIT) (752-757). Online publication date: Apr-2016.
https://doi.org/10.1109/CoDIT.2016.7593657

Hoang B, Chelghoum K, Kacem I (2016). Modeling Information Diffusion via Reputation Estimation. Proceedings, Part I, 27th International Conference on Database and Expert Systems Applications - Volume 9827 (136-150). Online publication date: 5-Sep-2016.
https://dl.acm.org/doi/10.1007/978-3-319-44403-1_9
