skip to main content
10.1145/2661829.2662026acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

"Strength Lies in Differences": Diversifying Friends for Recommendations through Subspace Clustering

Published:03 November 2014Publication History

ABSTRACT

Nowadays, WWW brings overwhelming variety of choices to consumers. Recommendation systems facilitate the selection by issuing recommendations to them. Recommendations for users, or groups, are determined by considering users similar to the users in question. Scanning the whole database for locating similar users, though, is expensive. Existing approaches build cluster models by employing full-dimensional clustering to find sets of similar users. As the datasets we deal with are high-dimensional and incomplete, full-dimensional clustering is not the best option. To this end, we explore the fault-tolerant subspace clustering approach. We extend the concept of fault tolerance to density-based subspace clustering, and to speed up our algorithms, we introduce the significance threshold for considering only promising dimensions for subspace extension. Moreover, as we potentially receive a multitude of users from subspace clustering, we propose a weighted ranking approach to refine the set of like-minded users. Our experiments on real movie datasets show that the diversification of the similar users that the subspace clustering approaches offer results in better recommendations compared to traditional collaborative filtering and full-dimensional clustering approaches.

References

  1. N. Agarwal, E. Haque, H. Liu, and L. Parsons. Research paper recommender systems: A subspace clustering approach. In WAIM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Agrawal, J. Gehrke, D. Gunopulos, and P. Raghavan. Automatic subspace clustering of high dimensional data for data mining applications. In SIGMOD, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Amer-Yahia, S. B. Roy, A. Chawla, G. Das, and C. Yu. Group recommendation: Semantics and efficiency. PVLDB, 2(1):754--765, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Boutet, A.-M. Kermarrec, D. A. Frey, R. Guerraoui, and A. Jegou. Whatsup: A decentralized instant news recommender. In IPDPS, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. W. Dong, M. Charikar, and K. Li. Efficient k-nearest neighbor graph construction for generic similarity measures. In WWW, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD, 1996.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Günnemann, E. Müller, S. Raubach, and T. Seidl. Flexible fault tolerant subspace clustering for data with missing values. In ICDM, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. K. Kailing, H.-P. Kriegel, and P. Kröger. Density-connected subspace clustering for high-dimensional data. In SIAM, 2004.Google ScholarGoogle Scholar
  9. J. A. Konstan, B. N. Miller, D. Maltz, J. L. Herlocker, L. R. Gordon, and J. Riedl. Grouplens: Applying collaborative filtering to usenet news. Commun. ACM, 40(3):77--87, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. G. Koutrika, B. Bercovitz, and H. Garcia-Molina. Flexrecs: expressing and combining exible recommendations. In SIGMOD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H.-P. Kriegel, P. Kröger, and A. Zimek. Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering. ACM Trans. Knowl. Discov. Data, 3(1):1:1--1:58, Mar. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. X. Li and T. Murata. Using multidimensional clustering based collaborative filtering approach improving recommendation diversity. In Web Intelligence/IAT Workshops, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. J. Mooney and L. Roy. Content-based book recommending using learning for text categorization. In ACM DL, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. Ntoutsi, K. Stefanidis, K. Nørvåg, and H.-P. Kriegel. Fast group recommendations by applying user clustering. In ER, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. K. Rausch, E. Ntoutsi, K. Stefanidis, and H.-P. Kriegel. Exploring subspace clustering for recommendations. In SSDBM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. K. Stefanidis, M. Drosou, and E. Pitoura. You May Also Like results in relational databases. In PersDB, 2009.Google ScholarGoogle Scholar
  17. K. Stefanidis, E. Ntoutsi, M. Petropoulos, K. Nørvåg, and H.-P. Kriegel. A framework for modeling, computing and presenting time-aware recommendations. T. Large-Scale Data- and Knowledge-Centered Systems, 10:146--172, 2013.Google ScholarGoogle Scholar
  18. L. Xiang, Q. Yuan, S. Zhao, L. Chen, X. Zhang, Q. Yang, J. Sun, and J. Sun. Temporal recommendation on graphs via long- and short-term preference fusion. In KDD, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. "Strength Lies in Differences": Diversifying Friends for Recommendations through Subspace Clustering

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
      November 2014
      2152 pages
      ISBN:9781450325981
      DOI:10.1145/2661829

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 November 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CIKM '14 Paper Acceptance Rate175of838submissions,21%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader