Abstract
With the widespread popularity of social networking services (SNS) such as Twitter and Facebook, people are increasingly accustomed to sharing feelings, experiences, and knowledge with each other on the Internet. The high accessibility of these websites allows information to spread across social media more quickly and widely, drawing more and more people into this so-called social stream environment. As a result, organizing user relationships has become increasingly important and necessary. In this study, we aim to discover potential and dynamic user correlations from organized social streams, in accordance with users’ current interests and needs, in order to assist the collaborative information seeking process. We develop a heuristic approach to building a Dynamically Socialized User Networking (DSUN) model, and define a set of measures (such as interest degree and popularity degree) and concepts (such as complementary tie, weak tie, and strong tie) to discover and represent users’ current profiles and dynamic correlations. The corresponding algorithms are also developed. Finally, the architecture of the functional modules is presented, and experimental results from an application of the proposed model are demonstrated and discussed.
Acknowledgments
This work was partly supported by the 2012, 2013, and 2014 Waseda University Grants for Special Research Projects, Nos. 2012B-215, 2013A-6395, 2013B-207, and 2014K-6214.
Cite this article
Zhou, X., Jin, Q. A heuristic approach to discovering user correlations from organized social stream data. Multimed Tools Appl 76, 11487–11507 (2017). https://doi.org/10.1007/s11042-014-2153-5
DOI: https://doi.org/10.1007/s11042-014-2153-5