research-article

WMEgo: Willingness Maximization for Ego Network Data Extraction in Online Social Networks

Authors:
Bay-Yuan Hsu

National Taipei University, New Taipei City, Taiwan Roc

National Taipei University, New Taipei City, Taiwan Roc
View Profile

,
Chih-Ya Shen

National Tsing Hua University, Hsinchu, Taiwan Roc

National Tsing Hua University, Hsinchu, Taiwan Roc
View Profile

,
Ming-Yi Chang

National Tsing Hua University, Hsinchu, Taiwan Roc

National Tsing Hua University, Hsinchu, Taiwan Roc
View Profile

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge ManagementOctober 2020Pages 515–524https://doi.org/10.1145/3340531.3411867

Published:19 October 2020Publication History

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

Pages 515–524

ABSTRACT

The data of egocentric networks (ego networks) are very important for evaluating and validating the algorithms and machine learning approaches in Online Social Networks (OSNs). Nevertheless, obtaining the ego network data from OSNs is not a trivial task. Conventional manual approaches are time-consuming, and only a small number of users would agree to contribute their data. This is because there are two important factors that should be considered simultaneously for this data acquisition task: i) users' willingness to contribute their data, and ii) the structure of the ego network. However, addressing the above two factors to obtain the more complete ego network data has not received much research attention. Therefore, in this paper, we make our first attempt to address this issue by proposing a new research problem, named Willingness Maximization for Ego Network Extraction in Online Social Networks (WMEgo), to identify a set of ego networks from the OSN such that the willingness of the users to contribute their data is maximized. We prove that WMEgo is NP-hard and propose a 1/2*(1 1/e)-approximation algorithm, named Ego Network Identification with Maximum Willingness (EIMW). We conduct an evaluation study with 672 volunteers to validate the proposed WMEgo and EIMW, and perform extensive experiments on multiple real datasets to demonstrate the effectiveness and efficiency of our approach.

Supplemental Material

3340531.3411867.mp4

mp4

52.7 MB

Download

References

Nesreen K Ahmed, Nick Duffield, Theodore L Willke, and Ryan A Rossi. 2017. On sampling from massive graph streams. Proceedings of the VLDB Endowment, Vol. 10, 11 (2017), 1430--1441.Google ScholarDigital Library
Nesreen K Ahmed, Jennifer Neville, and Ramana Kompella. 2014. Network sampling: From static to streaming graphs. ACM Transactions on Knowledge Discovery from Data (TKDD), Vol. 8, 2 (2014), 7.Google ScholarDigital Library
Norbert Blenn, Christian Doerr, Bas Van Kester, and Piet Van Mieghem. 2012. Crawling and detecting community structure in online social networks using local information. In International Conference on Research in Networking. Springer, 56--67.Google ScholarDigital Library
Francesco Bonchi, Arijit Khan, and Lorenzo Severini. 2019. Distance-generalized Core Decomposition. In Proceedings of the 2019 International Conference on Management of Data. ACM, 1006--1023.Google ScholarDigital Library
Nicholas A Christakis and James H Fowler. 2007. The spread of obesity in a large social network over 32 years. New England journal of medicine, Vol. 357, 4 (2007), 370--379.Google Scholar
Maximilien Danisch, T-H Hubert Chan, and Mauro Sozio. 2017. Large scale density-friendly graph decomposition via convex programming. In Proceedings of the 26th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 233--242.Google ScholarDigital Library
Abir De, Isabel Valera, Niloy Ganguly, Sourangshu Bhattacharya, and Manuel Gomez Rodriguez. 2016. Learning and forecasting opinion dynamics in social networks. In Advances in Neural Information Processing Systems. 397--405.Google Scholar
Yixiang Fang, Reynold Cheng, Siqiang Luo, and Jiafeng Hu. 2016. Effective community search for large attributed graphs. Proceedings of the VLDB Endowment, Vol. 9, 12 (2016), 1233--1244.Google ScholarDigital Library
Uriel Feige, David Peleg, and Guy Kortsarz. 2001. The dense k-subgraph problem. Algorithmica, Vol. 29, 3 (2001), 410--421.Google ScholarDigital Library
Minas Gjoka, Maciej Kurant, Carter T Butts, and Athina Markopoulou. 2011. Practical recommendations on crawling online social networks. IEEE Journal on Selected Areas in Communications, Vol. 29, 9 (2011), 1872--1892.Google ScholarCross Ref
Reinhard Heckel, Michail Vlachos, Thomas Parnell, and Celestine Dünner. 2017. Scalable and interpretable product recommendations via overlapping co-clustering. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE). IEEE, 1033--1044.Google ScholarCross Ref
Stephen T Hedetniemi and Renu C Laskar. 1991. Bibliography on domination in graphs and some basic definitions of domination parameters. In Annals of Discrete Mathematics. Vol. 48. Elsevier, 257--277.Google Scholar
Bay-Yuan Hsu, Yi-Feng Lan, and Chih-Ya Shen. 2018. On automatic formation of effective therapy groups in social networks. IEEE Transactions on Computational Social Systems, Vol. 5, 3 (2018), 713--726.Google ScholarCross Ref
Bay-Yuan Hsu, Chia-Lin Tu, Ming-Yi Chang, and Chih-Ya Shen. 2019. On crawling community-aware online social network data. In Proceedings of the 30th ACM Conference on Hypertext and Social Media. 265--266.Google ScholarDigital Library
Xin Huang, Hong Cheng, Lu Qin, Wentao Tian, and Jeffrey Xu Yu. 2014. Querying k-truss community in large and dynamic graphs. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, 1311--1322.Google ScholarDigital Library
Xin Huang, Laks VS Lakshmanan, Jeffrey Xu Yu, and Hong Cheng. 2015. Approximate closest community search in networks. Proceedings of the VLDB Endowment, Vol. 9, 4 (2015), 276--287.Google ScholarDigital Library
Nadin Kökciyan and Pinar Yolum. 2016. Priguard: A semantic approach to detect privacy violations in online social networks. IEEE Transactions on Knowledge and Data Engineering, Vol. 28, 10 (2016), 2724--2737.Google ScholarDigital Library
Pankaj Kumar and Akbar Zaheer. 2019. Ego-network stability and innovation in alliances. Academy of Management Journal, Vol. 62, 3 (2019), 691--716.Google ScholarCross Ref
Ricky Laishram, Jeremy D Wendt, and Sucheta Soundarajan. 2019. Crawling the Community Structure of Multiplex Networks. In AAAI.Google Scholar
Jure Leskovec, Kevin J Lang, Anirban Dasgupta, and Michael W Mahoney. 2009. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters. Internet Mathematics, Vol. 6, 1 (2009), 29--123.Google ScholarCross Ref
Jure Leskovec and Julian J Mcauley. 2012. Learning to discover social circles in ego networks. In Advances in neural information processing systems. 539--547.Google Scholar
Rong-Hua Li, Lu Qin, Jeffrey Xu Yu, and Rui Mao. 2015. Influential community search in large networks. Proceedings of the VLDB Endowment, Vol. 8, 5 (2015), 509--520.Google ScholarDigital Library
Xiang-Yang Li, Chunhong Zhang, Taeho Jung, Jianwei Qian, and Linlin Chen. 2016. Graph-based privacy-preserving data publication. In IEEE INFOCOM 2016-The 35th Annual IEEE International Conference on Computer Communications. IEEE, 1--9.Google ScholarDigital Library
Ye Li, Chaofeng Sha, Xin Huang, and Yanchun Zhang. 2018. Community detection in attributed graphs: an embedding approach. In Thirty-Second AAAI Conference on Artificial Intelligence.Google ScholarCross Ref
Peter J. Mucha, Thomas Richardson, Kevin Macon, Mason A. Porter, and Jukka-Pekka Onnela. 2010. Community structure in time-dependent, multiscale, and multiplex networks. Science, Vol. 328, 5980 (2010), 876--878.Google Scholar
Leonardo FR Ribeiro, Pedro HP Saverese, and Daniel R Figueiredo. 2017. Struc2vec: Learning node representations from structural identity. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 385--394.Google ScholarDigital Library
J Niels Rosenquist, James H Fowler, and Nicholas A Christakis. 2011. Social network determinants of depression. Molecular psychiatry, Vol. 16, 3 (2011), 273.Google Scholar
Ahmet Erdem Sariyüce and Ali Pinar. 2016. Fast hierarchy construction for dense subgraphs. Proceedings of the VLDB Endowment, Vol. 10, 3 (2016), 97--108.Google ScholarDigital Library
David R Schaefer, Olga Kornienko, and Andrew M Fox. 2011. Misery does not love company: Network selection mechanisms and depression homophily. American Sociological Review, Vol. 76, 5 (2011), 764--785.Google ScholarCross Ref
Chih-Ya Shen, CP Kankeu Fotsing, Yi-Shin Chen, Wang-chien Lee, et al. 2018. On organizing online soirees with live multi-streaming. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018. AAAI press, 151--159.Google ScholarCross Ref
Chih-Ya Shen, Hong-Han Shuai, De-Nian Yang, Yi-Feng Lan, Wang-Chien Lee, Philip S Yu, and Ming-Syan Chen. 2015. Forming online support groups for internet and behavior related addictions. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 163--172.Google ScholarDigital Library
Chih-Ya Shen, De-Nian Yang, Wang-Chien Lee, and Ming-Syan Chen. 2016. Spatial-proximity optimization for rapid task group deployment. ACM Transactions on Knowledge Discovery from Data (TKDD), Vol. 10, 4 (2016), 1--36.Google Scholar
Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems, Vol. 42, 1 (2015), 181--213.Google ScholarDigital Library
Shaozhi Ye, Juan Lang, and Felix Wu. 2010. Crawling online social graphs. In Proceedings of the 2010 12th International Asia-Pacific Web Conference. 236--242.Google ScholarDigital Library
Daokun Zhang, Jie Yin, Xingquan Zhu, and Chengqi Zhang. 2017a. User profile preserving social network embedding. In IJCAI International Joint Conference on Artificial Intelligence.Google ScholarCross Ref
Fan Zhang, Ying Zhang, Lu Qin, Wenjie Zhang, and Xuemin Lin. 2017b. When engagement meets similarity: efficient (k, r)-core computation on social networks. Proceedings of the VLDB Endowment, Vol. 10, 10 (2017), 998--1009.Google ScholarDigital Library
Dingyuan Zhu, Peng Cui, Daixin Wang, and Wenwu Zhu. 2018. Deep variational network embedding in wasserstein space. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2827--2836.Google ScholarDigital Library

Index Terms

WMEgo: Willingness Maximization for Ego Network Data Extraction in Online Social Networks
1. Information systems
  1. Information systems applications
    1. Collaborative and social computing systems and tools
      1. Social networking sites
  2. World Wide Web
    1. Web applications
      1. Social networks
    2. Web mining
      1. Data extraction and integration

Recommendations

Ego network structure in online social networks and its impact on information diffusion

In the last few years, Online Social Networks (OSNs) attracted the interest of a large number of researchers, thanks to their central role in the society. Through the analysis of OSNs, many social phenomena have been studied, such as the viral diffusion ...
Read More
Ego-net digger: a new way to study ego networks in online social networks
HotSocial '12: Proceedings of the First ACM International Workshop on Hot Topics on Interdisciplinary Social Networks Research

The vast proliferation of Online Social Networks (OSN) is generating many new ways to interact and create social relationships with others. While substantial results have been obtained in anthropology literature describing the properties of human social ...
Read More
Towards a characterization of egocentric networks in online social networks
OTM'11: Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems

Online Social Networks (OSNs) are more and more establishing as one of the key means to create and enforce social relationships between individuals. While substantial results have been obtained in the anthropology literature describing the properties of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
October 2020
3619 pages
ISBN:9781450368599
DOI:10.1145/3340531
General Chairs:
Mathieu d'Aquin
DSI, Insight, NUI Galway, Ireland
,
Stefan Dietze
GESIS, Cologne, Germany, Heinrich-Heine-University Düsseldorf, Germany, L3S Research Center, Germany
,
Program Chairs:
Claudia Hauff
TU Delft, The Netherlands
,
Edward Curry
DSI, Insight, NUI Galway, Ireland
,
Philippe Cudre Mauroux
eXascale, University of Fribourg, Switzerland
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 19 October 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
crawling
ego networks
online social networks
willingness
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,861of8,427submissions,22%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 10
  Total Citations
  View Citations
- 272
  Total Downloads
- Downloads (Last 12 months)31
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

WMEgo: Willingness Maximization for Ego Network Data Extraction in Online Social Networks

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Ego network structure in online social networks and its impact on information diffusion

Ego-net digger: a new way to study ego networks in online social networks

Towards a characterization of egocentric networks in online social networks