Skip to main content
Log in

A heuristic approach to discovering user correlations from organized social stream data

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Recently, with the widespread popularity of SNS (Social Network Service), such as Twitter, Facebook, people are increasingly accustomed to sharing feeling, experience and knowledge with each other on Internet. The high accessibility of these web sites has allowed the information to be spread across the social media more quickly and widely, which leads to more and more populations being engaged into this so-called social stream environment. All these make the organization of user relationships become increasingly important and necessary. In this study, we try to discover the potential and dynamical user correlations using those organized social streams in accordance with users’ current interests and needs, in order to assist the collaborative information seeking process. We develop a heuristic approach to build a Dynamically Socialized User Networking (DSUN) model, and define a set of measures (such as interest degree, and popularity degree) and concepts (such as complementary tie, weak tie, and strong tie), to discover and represent users’ current profiling and dynamical correlations. The corresponding algorithms are developed respectively. Finally, the architecture of the functional modules is presented, and the experiment results are demonstrated and discussed based on an application of the proposed model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Notes

  1. http://nislab.human.waseda.ac.jp/statusnet/.

References

  1. Aggarwal C, Han J, Wang J, Yu PS (2003) A framework for clustering evolving data streams. In: Proc. 2003 Int. Conf. on Very Large Data Bases, Berlin, Germany, pp 81–92

  2. Aiello LM, Barrat A, Schifanella R, Cattuto C, Markines B, Menczer F (2012) Friendship prediction and homophily in social media. ACM Trans Web (TWEB) 6(2), article 9

  3. Bickmore T, Schulman D (2012) Empirical validation of an accommodation theory-based model of user-agent relationship. In: Yukiko N, Michael N, Ana P, Marilyn W (eds) Proc. IVA’12 (12th international conference on Intelligent Virtual Agents). Springer-Verlag, Berlin, pp 390–403

    Google Scholar 

  4. Black A, Mascaro C, Gallagher M, Goggins SP (2012) Twitter zombie: architecture for capturing, socially transforming and analyzing the Twittersphere. In: Proc. GROUP ’12 (17th ACM international conference on Supporting group work), ACM, New York, pp 229–238

  5. Byun C, Kim Y, Lee H, Kim KK (2012) Automated Twitter data collecting tool and case study with rule-based analysis. In: Proc. IIWAS ‘12 (14th International Conference on Information Integration and Web-based Applications & Services), ACM, New York, pp 196–204

  6. Carpineto C, Osiński S, Romano G, Weiss D (2009) A survey of web clustering engines. ACM Comput Surv (CSUR) 41(3)

  7. Chen H, Zhou XK, Man HF, Wu Y, Ahmed AU, Jin Q (2010) A framework of organic streams: integrating dynamically diversified contents into ubiquitous personal study. In: 2nd International Symposium on Multidisciplinary Emerging Networks and Systems, Xi’an

  8. Cogan P, Andrews M, Bradonjic M, Kennedy WS, Sala A, Tucci G (2012) Reconstruction and analysis of Twitter conversation graphs. In: Proc. HotSocial ’12 (1st ACM international workshop on Hot topics on interdisciplinary social networks research). ACM, New York, pp 25–31

  9. Ed H (2009) Chi: information seeking can be social. Computer 42(3):42–46

    Article  Google Scholar 

  10. Gaber MM, Zaslavsky A, Krishnaswamy S (2005) Mining data streams: a review. ACM SIGMOD Rec Arch 34(2):18–26

  11. Gama S, Barata G, Gonçalves D, Prada R, Paiva A (2012) A model for social regulation of user-agent relationships. In: Yukiko N, Michael N, Ana P, Marilyn W (eds) Proc. IVA’12 (12th international conference on intelligent virtual agents). Springer-Verlag, Berlin, pp 319–326

    Google Scholar 

  12. Guha S, Meyerson A, Mishra N, Motwani R, O’Callaghan L (2003) Clustering data streams: theory and practice. TKDE Spec Issue Clust 15(3):515–528

    Google Scholar 

  13. Guha S, Mishra N, Motwani R, O’Callaghan L (2000) Clustering data streams. In: Proc. The annual symposium on foundations of computer science. IEEE Press, California

  14. Gupta C, Wang S, Ari I, Hao M, Dayal U, Mehta A, Marwah M, Sharma R (2009) CHAOS: a data stream analysis architecture for enterprise applications. In: Proc. CEC ’09 (2009 I.E. Conference on Commerce and Enterprise Computing), IEEE Computer Society, Washington, DC, pp 33–40

  15. Han J, Chen Y, Dong G, Pei J, Wah BW, Wang J, Cai YD (2005) Stream cube: an architecture for multidimensional analysis of data streams. Distrib Parallel Database 18(2):173–197

    Article  Google Scholar 

  16. Hashemi S, Yang Y, Mirzamomen Z, Kangavari M (2009) Adapted one-versus-all decision trees for data stream classification. IEEE Trans Knowl Data Eng 21(5):624–637

    Article  Google Scholar 

  17. Hu B, Jamali M, Ester M (2012) Learning the strength of the factors influencing user behavior in online social networks. In: Proc. ASONAM (IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining), pp 368–375

  18. Huang J, Martin P, Powley W, Bird P, Abrashkevich D (2010) Lightweight problem determination in DBMSs using data stream analysis techniques. In: Müller HA, Arthur R, Kark AW (eds) Proc. CASCON ’10 (2010 Conference of the Center for Advanced Studies on Collaborative Research). IBM Corp, Riverton, pp 199–211

    Chapter  Google Scholar 

  19. Junco R, Heiberger G, Loken E (2011) The effect of Twitter on college student engagement and grades. J Comput Assist Learn 27(2):119–132

    Article  Google Scholar 

  20. Kendall L, Hartzler A, Klasnja P, Pratt W (2011) Descriptive analysis of physical activity conversations on Twitter. In: CHI EA ’11 (CHI ’11 extended abstracts on human factors in computing systems. ACM, New York, pp 1555–1560

  21. Kirsten AJ (2011) The effect of Twitter posts on students’ perceptions of instructor credibility. Learn Media Technol 36(1):21–38

    Article  Google Scholar 

  22. Leroy V, Cambazoglu BB, Bonchi F (2010) Cold start link prediction. In: Proc. KDD ’10(16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining), ACM, New York, pp 393–402

  23. Leskovec J, Horvitz E (2008) Planetary-scale views on a large instant-messaging network. In: Proc. WWW’08 (17th International Conference on the World Wide Web), Beijing, China, pp 915–924

  24. Li R, Wang S, Chang KCC (2012) Multiple location profiling for users and relationships from social network and content. Proc VLDB Endow 5(11):1603–1614

    Article  Google Scholar 

  25. Masud MM, Al-Khateeb TM, Hamlen KW, Gao J, Khan L, Han J, Thuraisingham B (2008) Cloud-based malware detection for evolving data streams. ACM Trans Manag Inf Syst (TMIS) 2(3), article 16

  26. Mitzlaff F, Benz D, Stumme G, Hotho A (2010) Visit me, click me, be my friend: an analysis of evidence networks of user relationships in bibsonomy. In: Proc. HT ’10 (21st ACM Conference on Hypertext and Hypermedia), ACM, New York, pp 265–270

  27. Musselle C (2012) Rethinking concepts of the dendritic cell algorithm for multiple data stream analysis. In: Coello CA, Greensmith J, Krasnogor N, Liò P, Nicosia G (eds) Proc. ICARIS’12 (11th international conference on Artificial Immune Systems). Springer-Verlag, Berlin, pp 246–259

    Chapter  Google Scholar 

  28. Ordonez C (2003) Clustering binary data streams with K-means. In: Proc. 8th ACM SIGMOD workshop on research issues in data mining and knowledge discovery. ACM, San Diego, pp 12–19

  29. Pervin N, Fang F, Datta A, Dutta K, Vandermeer D (2013) Fast, scalable, and context-sensitive detection of trending topics in microblog post streams. ACM Trans Manag Inf Syst (TMIS) 3(4), article 19

  30. Shin H, Xu Z, Kim EY (2008) Discovering and browsing of power users by social relationship analysis in large-scale online communities. In: Proc. WI-IAT ’08 (2008 IEEE/WIC/ACMInternational Conference on Web Intelligence and Intelligent Agent Technology), IEEE Computer Society, Washington, DC, vol 1, pp 105–111

  31. Signorini A, Segre AM, Polgreen PM (2011) The use of Twitter to track levels of disease activity and public concern in the US during the influenza a H1N1 pandemic. PLoS One 6:5

    Article  Google Scholar 

  32. Vosecky J, Jiang D, Limosa WN (2013) A system for geographic user interest analysis in Twitter. In: Proc. EDBT ’13 (16th International Conference on Extending Database Technology). ACM, New York, pp 709–712

  33. Wang X, Wei F, Liu X, Zhou M, Zhang M (2011) Topic sentiment analysis in twitter: a graph-based Hashtag sentiment classification approach. In: Berendt B, de Vries A, Fan W, Macdonald C, Ounis I, Ruthven I (eds) Proc. CIKM ’11 (20th ACM international conference on Information and knowledge management). ACM, New York, pp 1031–1040

    Google Scholar 

  34. Xiang R, Neville J, Rogati M (2010) Modeling relationship strength in online social networks. In: Proc. WWW’10 (the 19th International Conference on World Wide Web), ACM, NewYork, pp 981–990

  35. Yang CC, Yang H, Tang X, Jiang L (2012) Identifying implicit relationships between social media users to support social commerce. In: Proc. ICEC ’12 (14th Annual International Conference on Electronic Commerce), ACM, New York, pp 41–47

  36. Zhou XK, Chen H, Jin Q, Yong JM (2011) Generating associative ripples of relevant information from a variety of data streams by throwing a heuristic stone. In: ACM ICUIMC 2011 (5th International Conference on Ubiquitous Information Management and Communication), Seoul, Korea

  37. Zhou XK, Jin Q (2011) Dynamical user networking and profiling based on activity streams for enhanced social learning. In: Proc. ICWL’11 (10th International Conference on Web-based Learning), Lecture Notes in Computer Science, Springer, Hong Kong, vol 7048, pp 219–225

  38. Zhou XK, Jin Q (2012) User correlation discovery and dynamical profiling based on social streams. In: Proc. AMT 2012 (2012 International Conference on Active Media Technology), Lecture Notes in Computer Science/Lecture Notes in Artificial Intelligence, Springer, Macao, vol 7669, pp 53–62

Download references

Acknowledgments

The work has been partly supported by 2012, 2013 and 2014 Waseda University Grants for Special Research Project No. 2012B-215, No. 2013A-6395, No. 2013B-207, and No. 2014K-6214.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qun Jin.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, X., Jin, Q. A heuristic approach to discovering user correlations from organized social stream data. Multimed Tools Appl 76, 11487–11507 (2017). https://doi.org/10.1007/s11042-014-2153-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-014-2153-5

Keywords

Navigation