skip to main content
10.1145/3340531.3411867acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

WMEgo: Willingness Maximization for Ego Network Data Extraction in Online Social Networks

Authors Info & Claims
Published:19 October 2020Publication History

ABSTRACT

The data of egocentric networks (ego networks) are very important for evaluating and validating the algorithms and machine learning approaches in Online Social Networks (OSNs). Nevertheless, obtaining the ego network data from OSNs is not a trivial task. Conventional manual approaches are time-consuming, and only a small number of users would agree to contribute their data. This is because there are two important factors that should be considered simultaneously for this data acquisition task: i) users' willingness to contribute their data, and ii) the structure of the ego network. However, addressing the above two factors to obtain the more complete ego network data has not received much research attention. Therefore, in this paper, we make our first attempt to address this issue by proposing a new research problem, named Willingness Maximization for Ego Network Extraction in Online Social Networks (WMEgo), to identify a set of ego networks from the OSN such that the willingness of the users to contribute their data is maximized. We prove that WMEgo is NP-hard and propose a 1/2*(1 1/e)-approximation algorithm, named Ego Network Identification with Maximum Willingness (EIMW). We conduct an evaluation study with 672 volunteers to validate the proposed WMEgo and EIMW, and perform extensive experiments on multiple real datasets to demonstrate the effectiveness and efficiency of our approach.

Skip Supplemental Material Section

Supplemental Material

3340531.3411867.mp4

mp4

52.7 MB

References

  1. Nesreen K Ahmed, Nick Duffield, Theodore L Willke, and Ryan A Rossi. 2017. On sampling from massive graph streams. Proceedings of the VLDB Endowment, Vol. 10, 11 (2017), 1430--1441.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Nesreen K Ahmed, Jennifer Neville, and Ramana Kompella. 2014. Network sampling: From static to streaming graphs. ACM Transactions on Knowledge Discovery from Data (TKDD), Vol. 8, 2 (2014), 7.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Norbert Blenn, Christian Doerr, Bas Van Kester, and Piet Van Mieghem. 2012. Crawling and detecting community structure in online social networks using local information. In International Conference on Research in Networking. Springer, 56--67.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Francesco Bonchi, Arijit Khan, and Lorenzo Severini. 2019. Distance-generalized Core Decomposition. In Proceedings of the 2019 International Conference on Management of Data. ACM, 1006--1023.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Nicholas A Christakis and James H Fowler. 2007. The spread of obesity in a large social network over 32 years. New England journal of medicine, Vol. 357, 4 (2007), 370--379.Google ScholarGoogle Scholar
  6. Maximilien Danisch, T-H Hubert Chan, and Mauro Sozio. 2017. Large scale density-friendly graph decomposition via convex programming. In Proceedings of the 26th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 233--242.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Abir De, Isabel Valera, Niloy Ganguly, Sourangshu Bhattacharya, and Manuel Gomez Rodriguez. 2016. Learning and forecasting opinion dynamics in social networks. In Advances in Neural Information Processing Systems. 397--405.Google ScholarGoogle Scholar
  8. Yixiang Fang, Reynold Cheng, Siqiang Luo, and Jiafeng Hu. 2016. Effective community search for large attributed graphs. Proceedings of the VLDB Endowment, Vol. 9, 12 (2016), 1233--1244.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Uriel Feige, David Peleg, and Guy Kortsarz. 2001. The dense k-subgraph problem. Algorithmica, Vol. 29, 3 (2001), 410--421.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Minas Gjoka, Maciej Kurant, Carter T Butts, and Athina Markopoulou. 2011. Practical recommendations on crawling online social networks. IEEE Journal on Selected Areas in Communications, Vol. 29, 9 (2011), 1872--1892.Google ScholarGoogle ScholarCross RefCross Ref
  11. Reinhard Heckel, Michail Vlachos, Thomas Parnell, and Celestine Dünner. 2017. Scalable and interpretable product recommendations via overlapping co-clustering. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE). IEEE, 1033--1044.Google ScholarGoogle ScholarCross RefCross Ref
  12. Stephen T Hedetniemi and Renu C Laskar. 1991. Bibliography on domination in graphs and some basic definitions of domination parameters. In Annals of Discrete Mathematics. Vol. 48. Elsevier, 257--277.Google ScholarGoogle Scholar
  13. Bay-Yuan Hsu, Yi-Feng Lan, and Chih-Ya Shen. 2018. On automatic formation of effective therapy groups in social networks. IEEE Transactions on Computational Social Systems, Vol. 5, 3 (2018), 713--726.Google ScholarGoogle ScholarCross RefCross Ref
  14. Bay-Yuan Hsu, Chia-Lin Tu, Ming-Yi Chang, and Chih-Ya Shen. 2019. On crawling community-aware online social network data. In Proceedings of the 30th ACM Conference on Hypertext and Social Media. 265--266.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Xin Huang, Hong Cheng, Lu Qin, Wentao Tian, and Jeffrey Xu Yu. 2014. Querying k-truss community in large and dynamic graphs. In Proceedings of the 2014 ACM SIGMOD international conference on Management of data. ACM, 1311--1322.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Xin Huang, Laks VS Lakshmanan, Jeffrey Xu Yu, and Hong Cheng. 2015. Approximate closest community search in networks. Proceedings of the VLDB Endowment, Vol. 9, 4 (2015), 276--287.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Nadin Kökciyan and Pinar Yolum. 2016. Priguard: A semantic approach to detect privacy violations in online social networks. IEEE Transactions on Knowledge and Data Engineering, Vol. 28, 10 (2016), 2724--2737.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Pankaj Kumar and Akbar Zaheer. 2019. Ego-network stability and innovation in alliances. Academy of Management Journal, Vol. 62, 3 (2019), 691--716.Google ScholarGoogle ScholarCross RefCross Ref
  19. Ricky Laishram, Jeremy D Wendt, and Sucheta Soundarajan. 2019. Crawling the Community Structure of Multiplex Networks. In AAAI.Google ScholarGoogle Scholar
  20. Jure Leskovec, Kevin J Lang, Anirban Dasgupta, and Michael W Mahoney. 2009. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters. Internet Mathematics, Vol. 6, 1 (2009), 29--123.Google ScholarGoogle ScholarCross RefCross Ref
  21. Jure Leskovec and Julian J Mcauley. 2012. Learning to discover social circles in ego networks. In Advances in neural information processing systems. 539--547.Google ScholarGoogle Scholar
  22. Rong-Hua Li, Lu Qin, Jeffrey Xu Yu, and Rui Mao. 2015. Influential community search in large networks. Proceedings of the VLDB Endowment, Vol. 8, 5 (2015), 509--520.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Xiang-Yang Li, Chunhong Zhang, Taeho Jung, Jianwei Qian, and Linlin Chen. 2016. Graph-based privacy-preserving data publication. In IEEE INFOCOM 2016-The 35th Annual IEEE International Conference on Computer Communications. IEEE, 1--9.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Ye Li, Chaofeng Sha, Xin Huang, and Yanchun Zhang. 2018. Community detection in attributed graphs: an embedding approach. In Thirty-Second AAAI Conference on Artificial Intelligence.Google ScholarGoogle ScholarCross RefCross Ref
  25. Peter J. Mucha, Thomas Richardson, Kevin Macon, Mason A. Porter, and Jukka-Pekka Onnela. 2010. Community structure in time-dependent, multiscale, and multiplex networks. Science, Vol. 328, 5980 (2010), 876--878.Google ScholarGoogle Scholar
  26. Leonardo FR Ribeiro, Pedro HP Saverese, and Daniel R Figueiredo. 2017. Struc2vec: Learning node representations from structural identity. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 385--394.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J Niels Rosenquist, James H Fowler, and Nicholas A Christakis. 2011. Social network determinants of depression. Molecular psychiatry, Vol. 16, 3 (2011), 273.Google ScholarGoogle Scholar
  28. Ahmet Erdem Sariyüce and Ali Pinar. 2016. Fast hierarchy construction for dense subgraphs. Proceedings of the VLDB Endowment, Vol. 10, 3 (2016), 97--108.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. David R Schaefer, Olga Kornienko, and Andrew M Fox. 2011. Misery does not love company: Network selection mechanisms and depression homophily. American Sociological Review, Vol. 76, 5 (2011), 764--785.Google ScholarGoogle ScholarCross RefCross Ref
  30. Chih-Ya Shen, CP Kankeu Fotsing, Yi-Shin Chen, Wang-chien Lee, et al. 2018. On organizing online soirees with live multi-streaming. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018. AAAI press, 151--159.Google ScholarGoogle ScholarCross RefCross Ref
  31. Chih-Ya Shen, Hong-Han Shuai, De-Nian Yang, Yi-Feng Lan, Wang-Chien Lee, Philip S Yu, and Ming-Syan Chen. 2015. Forming online support groups for internet and behavior related addictions. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 163--172.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Chih-Ya Shen, De-Nian Yang, Wang-Chien Lee, and Ming-Syan Chen. 2016. Spatial-proximity optimization for rapid task group deployment. ACM Transactions on Knowledge Discovery from Data (TKDD), Vol. 10, 4 (2016), 1--36.Google ScholarGoogle Scholar
  33. Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems, Vol. 42, 1 (2015), 181--213.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Shaozhi Ye, Juan Lang, and Felix Wu. 2010. Crawling online social graphs. In Proceedings of the 2010 12th International Asia-Pacific Web Conference. 236--242.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Daokun Zhang, Jie Yin, Xingquan Zhu, and Chengqi Zhang. 2017a. User profile preserving social network embedding. In IJCAI International Joint Conference on Artificial Intelligence.Google ScholarGoogle ScholarCross RefCross Ref
  36. Fan Zhang, Ying Zhang, Lu Qin, Wenjie Zhang, and Xuemin Lin. 2017b. When engagement meets similarity: efficient (k, r)-core computation on social networks. Proceedings of the VLDB Endowment, Vol. 10, 10 (2017), 998--1009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Dingyuan Zhu, Peng Cui, Daixin Wang, and Wenwu Zhu. 2018. Deep variational network embedding in wasserstein space. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2827--2836.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. WMEgo: Willingness Maximization for Ego Network Data Extraction in Online Social Networks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
          October 2020
          3619 pages
          ISBN:9781450368599
          DOI:10.1145/3340531

          Copyright © 2020 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 19 October 2020

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader