Distributed architecture for k-nearest neighbors recommender systems

World Wide Web

Abstract

Collaborative filtering is one of the most popular recommendation techniques. Although the quality of recommendations has improved significantly in recent years, most approaches still suffer from poor efficiency and scalability. In this paper, we study several factors that affect the performance of a k-Nearest Neighbors algorithm, and we propose a distributed architecture that significantly improves both throughput and response time. Two techniques for distributing the recommender system, user partition and item partition, are proposed and evaluated using a simulation model. We find that user partition is generally better, with a faster response time and higher throughput.
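As a rough illustration of what user partition could look like, the sketch below splits the users across two hypothetical nodes, lets each node compute a local top-k neighbor list for the active user, and has a broker merge those lists. It is a minimal sketch under stated assumptions, not the architecture evaluated in the paper: the cosine similarity measure, the function names, the toy ratings, and the assignment of users to nodes are all invented for the example.

```python
import heapq
import math


def cosine(a, b):
    """Cosine similarity between two sparse rating dicts (item -> rating)."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[i] * b[i] for i in common)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def local_top_k(partition, active_id, active_ratings, k):
    """Work done by one node: score only the users stored in its partition."""
    scored = (
        (cosine(active_ratings, r), u)
        for u, r in partition.items()
        if u != active_id
    )
    return heapq.nlargest(k, scored)


def user_partition_knn(partitions, active_id, active_ratings, k):
    """Work done by the broker: merge the per-node lists into a global top-k."""
    candidates = []
    for partition in partitions:
        candidates.extend(local_top_k(partition, active_id, active_ratings, k))
    return heapq.nlargest(k, candidates)


# Toy data (assumption): four users rating three items.
ratings = {
    "ann": {"i1": 5, "i2": 3},
    "bob": {"i1": 5, "i2": 3, "i3": 1},
    "cat": {"i2": 4, "i3": 5},
    "dan": {"i1": 1, "i3": 5},
}

# Hypothetical assignment of users to two nodes.
partitions = [
    {u: ratings[u] for u in ("ann", "bob")},
    {u: ratings[u] for u in ("cat", "dan")},
]

neighbors = user_partition_knn(partitions, "ann", ratings["ann"], k=2)
```

Because every node returns its own k best candidates, the merged list matches what a single machine holding all users would compute. Under item partition, by contrast, each node would hold only some of the items, so a similarity score would have to be assembled from partial sums contributed by every node.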

Author information

Corresponding author

Correspondence to Vreixo Formoso.

About this article

Cite this article

Formoso, V., Fernández, D., Cacheda, F. et al. Distributed architecture for k-nearest neighbors recommender systems. World Wide Web 18, 997–1017 (2015). https://doi.org/10.1007/s11280-014-0298-9

  • DOI: https://doi.org/10.1007/s11280-014-0298-9
