Abstract
Communities on the Internet are highly self-organizing, dynamic, and ubiquitous. One objective of peers in such a community is sharing common interests, even when compromising privacy. This paper presents a model for peers on the Internet that allows them to discover their common interests in terms of sets of frequently visited URLs. This model assists online learning by automatically presenting users with URLs related to what they are currently browsing, thus saving users’ time searching for additional information and helping to educate them on the current topic. To implement the model and collect test data, FireShare was developed as a plugin for the popular Web browser Firefox. Data was collected and analyzed on the number of discovered frequently visited URL sets, relevancy of mined association rules, and the overhead FireShare imposes on a network. While FireShare favorably validated the proposed model, analysis of the submitted test data shows high potential for success with future versions.
Similar content being viewed by others
References
Agrawal R, Imielinski T, Swami AN (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD International conference on management of data, May 1993. ACM Press, New York, pp 207–216
Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6): 962–969
Bayardo RJ Jr, Agrawal R (1999) Mining the most interesting rules. In: Proceedings of the 5th ACM SIGKDD international conference on knowledge discovery and data mining, New York, NY, USA, 1999. ACM Press, pp 145–154
Berners-Lee T (2000) What the semantic Web can represent. http://www.w3.org/DesignIssues/RDFnot.html
Blood R (2004) How blogging software reshapes the online community. Commun ACM 47(12): 53–55
Brusilovsky P (2004) Knowledgetree: a distributed architecture for adaptive e-learning. In: Proceedings of the thirteenth international World Wide Web conference, May 2004, pp 104–113
Claypool M, Brown D, Le P, Waseda M (2001) Inferring user interest. IEEE Internet Comput 5(6): 32–39
Cohen E, Fiat A, Kaplan H (2003) A case for associative peer to peer overlays. SIGCOMM Comput Commun Rev 33(1): 95–100
da Silva JC, Giannella C, Bhargava R, Kargupta H, Klusch M (2005) Distributed data mining and agents. Eng Appl Artif Intell 18: 791–807
Dang XH, Ng W-K, Ong K-L (2008) Online mining of frequent sets in data streams with error guarantee. Knowl Inf Syst 16(2): 245–258
Datta S, Bhaduri K, Giannella C, Wolff R, Kargupta H (2006) Distributed data mining in peer-to-peer networks. IEEE Internet Comput 10(4): 18–26
Del.icio.us. Social bookmarking. http://del.icio.us. Accessed March 2007
Digg. All News, Videos & Images. http://digg.com. Accessed September 2007
Grossman L (2006) Time’s person of the year: you. Time Magazine 168(26), December 2006
Gündüz Ş, Özsu MT (2003) A Web page prediction model based on click-stream tree representation of user behavior. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining, New York, NY, USA. ACM Press, pp 535–540
Halpin H, Robu V, Shepherd H (2007) The complex dynamics of collaborative tagging. In: WWW ’07: Proceedings of the 16th international conference on World Wide Web, New York, NY, USA. ACM Press, pp 211–220
Herlocker JL, Konstan JA, Riedl J (2000) Explaining collaborative filtering recommendations. In: CSCW ’00: Proceedings of the 2000 ACM conference on computer supported cooperative work, New York, NY, USA. ACM Press, pp 241–250
Ivncsy R, Vajk I (2006) Frequent pattern mining in Web log data. Acta Polytech Hung J Appl Sci Budapest Tech Hungary Special Issue Comput Intell 3(1): 77–90
Johnson S (2007) ExtraLife—Scott Johnson’s Comics, Podcasts, Blog, Artwork, Humor and MORE! http://myextralife.com. Accessed September 2007
ki Leung CW, fai Chan SC, lai Chung F (2006) A collaborative filtering framework based on fuzzy association rules and multiple-level similarity. Knowl Inf Syst 10(3): 357–381
Korthaus A, Hildenbrand T (2003) Creating a Java- and CORBA-based enterprise knowledge grid using topic maps. In: Proceedings of the workshop on knowledge grid and grid intelligence, pp 207–218
Lam C-M, Zhang X-F, Cheung WK (2004) Mining local data sources for learning global cluster models. In: WI ’04: Proceedings of the 2004 IEEE/WIC/ACM international conference on Web intelligence, Washington, DC, USA. IEEE Computer Society, pp 748–751
Lerman K (2006) Social networks and social information filtering on digg. http://www.citebase.org/abstract?id=oai:arXiv.org:cs/0612046
Lim S, Ko Y (2006) A comparative study of Web resource mining algorithms for one-stop learning. Int J Web Inf Syst 2(2): 77–84
Lisha G, Junzhou L (2006) Performance analysis of a p2p-based voip software. In: AICT-ICIW ’06: Proceedings of the advanced international conference on telecommunications and international conference on Internet and Web applications and services, p 11, Washington, DC, USA. IEEE Computer Society
Mobasher B, Dai H, Luo T, Nakagawa M (2001) Effective personalization based on association rule discovery from Web usage data. In: Proceedings of the 3rd international workshop on Web information and data management, New York, NY, USA. ACM Press, pp 9–15
Mobasher B, Dai H, Luo T, Sun Y, Zhu J (2000) Integrating Web usage and content mining for more effective personalization. In: EC-WEB ’00: Proceedings of the First International Conference on Electronic Commerce and Web Technologies, London, UK, Springer-Verlag, pp 165–176
Mozilla Foundation. About JavaScript. http://developer.mozilla.org/en/docs/About_JavaScript. Accessed September 2007
Mozilla Foundation. Extensions. http://developer.mozilla.org/en/docs/Extensions. Accessed September 2007
Mozilla Foundation. FireFox—Rediscover the Web. http://www.mozilla.com/en-US/firefox/. Accessed September 2007
Mozilla Foundation. Xml user interface language. http://www.mozilla.org/projects/xul/. Accessed September 2007
Muhlestein D, Lim S (2007) A common interest share model for on-line peer communities. In: Proceedings of the 2007 international conference on multimedia and ubiquitous engineering (MUE). IEEE Computer Society, pp 83–88
Online Publishers Association. Internet activity index. http://www.online-publishers.org/page.php/prmID/421. Accessed October 2007
Pierrakos D, Paliouras G, Papatheodorou C, Spyropoulos CD (2003) Web usage mining as a tool for personalization: a survey. User Model User-Adapted Interact 13(4): 311–372
Porter MF (1997) An algorithm for suffix stripping. Morgan Kaufmann Publishers Inc., San Francisco, pp 313–316
PostgreSQL Global Development Group. Postgresql: The world’s most advanced open source database. http://www.postgresql.org/. Accessed September 2007
Python Software Foundation. Python programming language—official website. http://python.org/. Accessed September 2007
Reddit. reddit.com: what’s new online. http://reddit.com. Accessed September 2007
Salton G, Wong A, Yang CS (1975) A vector space model for automatic indexing. Commun ACM 18(11): 613–620
Schmidt A, Winterhalter C (2003) User context aware delivery of e-learning material: approach and architecture. J Univers Comput Sci 10(1): 28–36
Schuster A, Wolff R (2004) Communication-efficient distributed mining of association rules. Data Min Knowl Discov 8(2): 171–196
Shyu M-L, Haruechaiyasak C, Chen S-C (2006) Mining user access patterns with traversal constraint for predicting Web page requests. Knowl Inf Syst 10(4): 515–528
Silvestri C, Orlando S (2005) Distributed approximate mining of frequent patterns. In: SAC ’05: Proceedings of the 2005 ACM symposium on applied computing, New York, NY, USA. ACM Press, pp 529–536
Skype. Internet calls. http://skype.com. Accessed September 2007
Sripanidkulchai K, Maggs B, Zhang H (2003) Efficient content location using interest-based locality in peer-to-peer systems. In: INFOCOM ’03: twenty-second annual joint conference of the IEEE computer and communications societies, vol 3. IEEE Computer Society, pp 2166–2176
Stojanovic L, Staab S, Studer R (2001) Elearning based on the semantic Web. In: WebNet2001— Proceedings of the world conference on the WWW and Internet, pp 23–27
Tan A-H, Ong H-L, Pan H, Ng J, Li Q-X (2004) Towards personalised Web intelligence. Knowl Inf Syst 6(5): 595–616
Technorati. Who says what. Right now. http://technorati.com. Accessed March 2007
Wolff R, Schuster A (2003) Association rule mining in peer-to-peer systems. In: ICDM ’03: proceedings of the third IEEE international conference on data mining, Washington, DC, USA. IEEE Computer Society, p 363
Yanbe Y, Jatowt A, Nakamura S, Tanaka K (2007) Can social bookmarking enhance search in the Web? In: JCDL ’07: proceedings of the 2007 conference on digital libraries, New York, NY, USA. ACM Press, pp 107–116
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Muhlestein, D., Lim, S. Online learning with social computing based interest sharing. Knowl Inf Syst 26, 31–58 (2011). https://doi.org/10.1007/s10115-009-0265-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-009-0265-4