Skip to main content
Log in

Semi-supervised collaborative filtering ensemble

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

Collaborative filtering (CF) plays a central role in recommender systems, but often suffers from the data sparsity issue that dramatically degrades the recommendation performance. In this paper, we propose a Semi-Supervised Ensemble Filtering (SSEF) method to improve the recommendation performance by assembling three popular CF techniques in a co-training framework. Concretely, SSEF first initializes three weak predictors with labeled examples by three different CF algorithms independently. Two predictors generated by neighborhood methods are then merged, along with the remaining one generated by latent factor model, serve as two base recommenders, each of which labels the unlabeled examples for the other recommender during the co-training process. To exploit unlabeled data safely, the labeling confidence is estimated by validating the influence of the pseudo-labeled examples on the labeled ones. The final prediction is made by blending the outputs from the three predictors enhanced with unlabeled data. Extensive experiments on three public benchmarks demonstrate the effectiveness of the proposed SSEF by comparing to a number of state-of-the-art CF techniques, including semi-supervised, ensemble, and side-information based solutions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. Without loss of generality, we rescale all ratings in the interval [0, 1].

  2. Pearson correlation coefficient (2) is different from adjusted cosine similarity (4), though their equations are very similar. Concretely, the former conducts decentering on rows of rating matrix while the later does that on columns.

  3. We do not run trust-based algorithms on ML-100K, because the trust links are not available in such a dataset, as described in Table 2

References

  1. Bell, R. M., Koren, Y.: Lessons from the netflix prize challenge. SIGKDD Explor. Newsl. 9(2), 75–79 (2007)

    Article  Google Scholar 

  2. Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: COLT, pp 92–100 (1998)

  3. Bobadilla, J., Ortega, F., Hernando, A., Gutierrez, A.: Recommender systems survey. Knowl.-Based Syst. 46, 109–132 (2013)

    Article  Google Scholar 

  4. Cacheda, F., Formoso, V.: Comparison of collaborative filtering algorithms:limitations of current techniques and proposals for scalable, high-performance recommender systems. ACM T. Web 5(1), 1–33 (2011)

    Article  Google Scholar 

  5. Cui, W., Wu, J.: Actively semi-supervised collaborative filtering. In: BESC, pp 1–6 (2019)

  6. Dacrema, M. F., Cremonesi, P., Jannach, D.: Are we really making much progress? A worrying analysis of recent neural recommendation approaches. In: RecSys, pp 101–109 (2019)

  7. Fan, W., Ma, Y., Li, Q., He, Y., Zhao, E., Tang, J., Yin, D.: Graph neural networks for social recommendation. In: WWW, pp 417–426 (2019)

  8. Grover, A., Leskovec, J.: Node2vec: Scalable feature learning for networks. In: SIGKDD, pp 855–864 (2016)

  9. Guo, G., Zhang, J., Yorke-Smith, N.: Trustsvd: collaborative filtering with both the explicit and implicit influence of user trust and of item ratings. In: AAAI, pp 123–129 (2015)

  10. He, X., Liao, L., Zhang, H., Nie, L., Hu, X., Chua, T.: Neural collaborative filtering. In: WWW, pp 173–182 (2017)

  11. He, X., Du, X., Wang, X., Tian, F., Tang, J., Chua, T.: Outer product-based neural collaborative filtering. In: IJCAI, pp 2227–2233 (2018)

  12. Herlocker, J. L., Konstan, J. A., Borchers, A., Riedl, J.: An algorithmic framework for performing collaborative filtering. In: SIGIR, pp 230–237 (1999)

  13. Kang, W., Fang, C., Wang, Z., Mcauley, J.: Visually-aware fashion recommendation and design with generative image models. In: ICDM, pp 207–216 (2017)

  14. Kim, D., Park, C., Oh, J., Lee, S., Yu, H.: Convolutional matrix factorization for document context-aware recommendation. In: RecSys, pp 233–240 (2016)

  15. Kipf, T. N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: ICLR (2017)

  16. Koren, Y.: Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: SIGKDD, pp 426–434 (2008)

  17. Koren, Y.: The bellkor solution to the netflix grand prize. Tech. rep. Netflix prize documentation (2009)

  18. Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8), 30–37 (2009)

    Article  Google Scholar 

  19. Krogh, A., Vedelsby, J.: Neural network ensembles, cross validation, and active learning. In: NIPS, pp 231–238 (1994)

  20. Li, M., Zhou, Z.: Setred: self-training with editing. In: PAKDD, pp 611–621 (2005)

  21. Li, Y., Zhou, Z.: Towards making unlabeled data never hurt. IEEE Trans. Pattern Anal. 37(1), 175–188 (2015)

    Article  Google Scholar 

  22. Li, F., Xu, G., Cao, L.: Two-level matrix factorization for recommender systems. Neural Comput. Appl. 27, 2267–2278 (2016)

    Article  Google Scholar 

  23. Liu, C., Zhou, C., Wu, J., Hu, Y., Guo, L.: Social recommendation with an essential preference space. In: AAAI, pp 346–353 (2018)

  24. Ma, H., Yang, H., Lyu, M. R., King, I.: Sorec: social recommendation using probabilistic matrix factorization. In: CIKM, pp 931–940 (2008)

  25. Ma, H., Zhou, D., Liu, C., Lyu, M. R., King, I.: Recommender systems with social regularization. In: WSDM, pp 287–296 (2011)

  26. Nigam, K., Ghani, R.: Analyzing the effectiveness and applicability of co-training. In: CIKM, pp 86–93 (2000)

  27. Quang, T. N., Phuong, N. D., et al.: Collaborative filtering by co-training method. In: KSE, pp 273–285 (2015)

  28. Ren, Y., Li, G., Zhang, J., Zhou, W.: The efficient imputation method for neighborhood-based collaborative filtering. In: CIKM, pp 684–693 (2012)

  29. Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Item-based collaborative filtering recommendation algorithms. In: WWW, pp 285–295 (2001)

  30. Shi, Y., Larson, M., Hanjalic, A.: Collaborative filtering beyond the user-item matrix:a survey of the state of the art and future challenges. ACM Comput. Surv. 47(1), 1–45 (2014)

    Article  Google Scholar 

  31. van den Berg, R., Kipf, T. N., Welling, M.: Graph convolutional matrix completion. In: SIGKDD (2018)

  32. Wang, J., De Vries, A. P., Reinders, M. J. T.: Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In: SIGIR, pp 501–508 (2006)

  33. Wang, H., Zhang, F., Zhao, M., Li, W., Xie, X., Guo, M.: Multi-task feature learning for knowledge graph enhanced recommendation. In: WWW, pp 2000–2010 (2019)

  34. Wen, Y., Guo, L., Chen, Z., Ma, J.: Network embedding based recommendation method in social networks. In: WWW, pp 11–12 (2018)

  35. Yang, C., Bai, L., Zhang, C., Yuan, Q., Han, J.: Bridging collaborative filtering and semi-supervised learning: a neural approach for poi recommendation. In: SIGKDD, pp 1245–1254 (2017)

  36. Zhang, J., Pu, P.: A recursive prediction algorithm for collaborative filtering recommender systems. In: RecSys, pp 57–64 (2007)

  37. Zhang, M., Tang, J., Zhang, X., Xue, X.: Addressing cold start in recommender systems: a semi-supervised co-training algorithm. In: SIGKDD, pp 73–82 (2014)

  38. Zhang, D., Yin, J., Zhu, X., Zhang, C.: Network representation learning: a survey. IEEE Trans. Big Data 6(1), 3–28 (2020)

    Article  Google Scholar 

  39. Zhou, Z.: When semi-supervised learning meets ensemble learning. Front. Electr. Electron. Eng. China 6(1), 6–16 (2011)

    Article  Google Scholar 

  40. Zhou, Z. H., Li, M.: Semi-supervised regression with co-training. In: IJCAI, pp 908–913 (2005)

  41. Zhou, Z., Li, M.: Semi-supervised learning by disagreement. Knowl. Inf. Syst. 24(3), 415–439 (2010)

    Article  MathSciNet  Google Scholar 

  42. Zhu, X.: Semi-supervised learning literature survey. Tech. rep. Computer Sciences, University of Wisconsin-Madison (2006)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jun Wu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported in part by the National Natural Science Foundation of China (Grants No. 61671048 and U1934220 )and the Fundamental Research Funds for the Central Universities (Grant No. 2019JBM316).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, J., Sang, X. & Cui, W. Semi-supervised collaborative filtering ensemble. World Wide Web 24, 657–673 (2021). https://doi.org/10.1007/s11280-021-00866-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-021-00866-7

Keywords

Navigation