Skip to main content
Log in

Efficient continuous KNN join processing for real-time recommendation

  • Original Article
  • Published:
Personal and Ubiquitous Computing Aims and scope Submit manuscript

Abstract

Along with the sustainable and rapid accumulation of user-generated contents in social networking websites, how to push a certain content to the corresponding interested users, named recommendation for short, has successfully received wide attention. Considering the continuous updated contents and the constant changing of users’ interests, recommendation is expected to be completed immediately to send the most fresh content to appropriate users after the corresponding new contents have become available. In other words, recommendation needs to meet real-time requirements to fit the content-consumption behavior of users. In the traditional recommendation system field, the corresponding attributes of users and contents could be characterized by feature vectors in a certain high-dimensional space, subsequently the recommendation problem could also be converted into how to obtain the K appropriate contents for each user, which could be called kNN join. Due to the massive, high-dimensional, and continuously updated contents, the corresponding recommendation based on traditional kNN join (continuously updating the kNN join results) will be undoubtedly faced with unacceptable computational costs. In this paper, we propose a locality-sensitive hashing (LSH)–based index called LSHI, which is built on user set to find the specific users who might be affected by the updated contents efficiently. With the help of LSHI, the recommendation lists of the affected users could be adjusted accordingly and the holistic effectiveness of the recommendation (for all users) could be guaranteed simultaneously. Finally, extensive experiments have been conducted to demonstrate the superiorities of our proposed method in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Abbar S, Amer-Yahia S, Indyk P, Mahabadi S (2013) Real-time recommendation of diverse related articles. In: WWW, pp 1–12

  2. Agarwal D, Chen BC, Elango P (2010) Fast online learning through offline initialization for time-sensitive recommendation. In: KDD, pp 703–712

  3. Apaza RG, Cervantes EV, Quispe LC, Luna JO (2014) Online courses recommendation based on LDA. In: SIMBig, pp 42–48

  4. Böhm C, Krebs F (2004) The k-nearest neighbour join: turbo charging the KDD process. KAIS 6 (6):728–749

    Google Scholar 

  5. Bohm C, Ooi B, Plant C, Yan Y (2007) Efficiently processing continuous k-nn queries on data streams. In: ICDE, pp 156–165

  6. Breese JS, Heckerman D, Kadie C (1998) Empirical analysis of predictive algorithms for collaborative filtering. In: UAI, pp 43–52

  7. Brusilovsky P, Kobsa A, Nejdl W (2007) The adaptive web. In: LNCS, pp 325–326

  8. Chua TS, Tang J, Hong R, Li H, Luo Z, Zheng YT (2009) NUS-WIDE: a real-world web image database from National University of Singapore. In: ACM International Conference on Image and Video Retrieval

  9. Diaz-Aviles E, Drumond L, Schmidt-Thieme L, Nejdl W (2012) Real-time top-n recommendation in social streams. In: RecSys, pp 59–66

  10. Gionis A, Indyk P, Motwani R, et al. (1999) Similarity search in high dimensions via hashing. In: VLDB, vol 99, pp 518–529

  11. Kataria S, Agarwal A (2015) Distributed representations for content-based and personalized tag recommendation. In: 2015 IEEE International Conference on Data Mining Workshop (ICDMW)

  12. Le QV, Mikolov T (2014) Distributed representations of sentences and documents. ICML

  13. Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of the Seventh IEEE International Conference on Computer Vision, vol 2, pp 1150–1157

  14. Melville P, Sindhwani V (2010) Recommender systems. In: Encyclopedia of machine learning, pp 829–838

  15. Mitchell J, Lapata M (2010) Composition in distributional models of semantics. Cognit Sci 34(8):1388–1429

    Article  Google Scholar 

  16. Novak CL, Shafer SA (1992) Anatomy of a color histogram. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 599–605

  17. Pazzani MJ, Billsus D (2007) Content-based recommendation systems. In: The adaptive web, pp 325–341

  18. Rajaraman A, Ullman J (2012) Mining of massive datasets, vol 77

  19. Salton G, Buckley C (1988) Term weighting approaches in automatic text retrieval. Inf Process Manag 24 (5):513–523

    Article  Google Scholar 

  20. Tok WH, Bressan S (2013) Progressive and approximate join algorithms on data streams. In: Advanced query processing, pp 157–185

  21. Xia C, Lu H, Ooi BC, Hu J (2004) Gorder: an efficient method for knn join processing. In: VLDB, vol 30, pp 756–767

  22. Yang C, Yu X, Liu Y (2014) Continuous knn join processing for real-time recommendation. In: ICDM, pp 640–649

  23. Yu C, Cui B, Wang S, Su J (2007) Efficient index-based knn join processing for high-dimensional data. Inf Softw Technol 49(4):332–344

    Article  Google Scholar 

  24. Yu C, Zhang R, Huang Y, Xiong H (2010) High-dimensional knn joins with incremental updates. Geoinformatica 14(1):55–82

    Article  Google Scholar 

Download references

Funding

This study was financially supported by the National Natural Science Foundation of China (61402263, 91546203), the National High Technology Research and Development Program of China (2014AA01A302), the special funds of the Taishan Scholar Construction Project, the Independent Innovation Projects of Shandong Province (2014ZZCX08102, 2014ZZCX03409, 2014CGZH1106), the Science & Technology Development Projects of Shandong Province (2014GGX101028), and the Key Research and Development Program of Shandong Province (2015GGX101009).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Yujun Li or Xueqing Li.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hu, Y., Yang, C., Zhan, P. et al. Efficient continuous KNN join processing for real-time recommendation. Pers Ubiquit Comput 25, 1001–1011 (2021). https://doi.org/10.1007/s00779-019-01282-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00779-019-01282-5

Keywords

Navigation