Skip to main content

Missing Data Modeling with User Activity and Item Popularity in Recommendation

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11292))

Abstract

User feedback such as movie watching history, ratings and consumptions of products, is valuable for improving the performance of recommender systems. However, only a few interactions between users and items can be observed in implicit data. The missing of a user-item entry is caused by two reasons: the user didn’t see the item (in most cases); or the user saw but disliked it. Separating these two cases leads to modeling missing interactions at a finer granularity, which is helpful in understanding users’ preferences more accurately. However, the former case has not been well-studied in previous work. Most existing studies resort to assign a uniform weight to the missing data, while such a uniform assumption is invalid in real-world settings. In this paper, we propose a novel approach to weight the missing data based on user activity and item popularity, which is more effective and flexible than the uniform-weight assumption. Experimental results based on 2 real-world datasets (Movielens, Flixster) show that our approach outperforms 3 state-of-the-art models including BPR, WMF, and ExpoMF.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://grouplens.org/datasets/movielens/1m/.

  2. 2.

    http://www.sfu.ca/~sja25/datasets/.

  3. 3.

    https://www.librec.net.

  4. 4.

    https://github.com/benanne/wmf.

  5. 5.

    https://github.com/dawenl/expo-mf.

References

  1. Cremonesi, P., Koren, Y., Turrin, R.: Performance of recommender algorithms on top-N recommendation tasks. In: Proceedings of the Fourth ACM Conference on Recommender Systems, pp. 39–46. ACM (2010)

    Google Scholar 

  2. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. Ser. B (Methodol.) 39, 1–38 (1977)

    MathSciNet  MATH  Google Scholar 

  3. Devooght, R., Kourtellis, N., Mantrach, A.: Dynamic matrix factorization with priors on unknown values. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 189–198. ACM (2015)

    Google Scholar 

  4. He, X., Gao, M., Kan, M.Y., Liu, Y., Sugiyama, K.: Predicting the popularity of web 2.0 items based on user comments. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 233–242. ACM (2014)

    Google Scholar 

  5. He, X., Liao, L., Zhang, H., Nie, L., Hu, X., Chua, T.S.: Neural collaborative filtering. In: Proceedings of the 26th International Conference on World Wide Web, pp. 173–182. International World Wide Web Conferences Steering Committee (2017)

    Google Scholar 

  6. He, X., Zhang, H., Kan, M.Y., Chua, T.S.: Fast matrix factorization for online recommendation with implicit feedback. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 549–558. ACM (2016)

    Google Scholar 

  7. Hu, Y., Koren, Y., Volinsky, C.: Collaborative filtering for implicit feedback datasets. In: Eighth IEEE International Conference on Data Mining, 2008. ICDM 2008, pp. 263–272. IEEE (2008)

    Google Scholar 

  8. Koren, Y.: Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 426–434. ACM (2008)

    Google Scholar 

  9. Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8) (2009)

    Article  Google Scholar 

  10. Liang, D., Charlin, L., McInerney, J., Blei, D.M.: Modeling user exposure in recommendation. In: Proceedings of the 25th International Conference on World Wide Web, pp. 951–961. International World Wide Web Conferences Steering Committee (2016)

    Google Scholar 

  11. Ning, X., Karypis, G.: SLIM: sparse linear methods for top-n recommender systems. In: 2011 IEEE 11th International Conference on Data Mining (ICDM), pp. 497–506. IEEE (2011)

    Google Scholar 

  12. Pan, R., et al.: One-class collaborative filtering. In: Eighth IEEE International Conference on Data Mining. ICDM 2008, pp. 502–511. IEEE (2008)

    Google Scholar 

  13. Pilászy, I., Zibriczky, D., Tikk, D.: Fast als-based matrix factorization for explicit and implicit feedback datasets. In: Proceedings of the Fourth ACM Conference on Recommender Systems, pp. 71–78. ACM (2010)

    Google Scholar 

  14. Rendle, S.: Factorization machines. In: 2010 IEEE 10th International Conference on Data Mining (ICDM), pp. 995–1000. IEEE (2010)

    Google Scholar 

  15. Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: BPR: Bayesian personalized ranking from implicit feedback. In: Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, pp. 452–461. AUAI Press (2009)

    Google Scholar 

  16. Salakhutdinov, R., Mnih, A.: Probabilistic matrix factorization. In: NIPS, vol. 1, pp. 1–2 (2007)

    Google Scholar 

  17. Steck, H.: Training and testing of recommender systems on data missing not at random. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 713–722. ACM (2010)

    Google Scholar 

  18. Volkovs, M., Yu, G.W.: Effective latent models for binary feedback in recommender systems. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 313–322. ACM (2015)

    Google Scholar 

Download references

Acknowledgments

We thank the anonymous reviewers for their valuable comments and suggestions. This work is supported by the Natural Science Foundation of China under Grant No.: 61672311 and 61532011.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Min Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, C., Zhang, M., Liu, Y., Ma, S. (2018). Missing Data Modeling with User Activity and Item Popularity in Recommendation. In: Tseng, YH., et al. Information Retrieval Technology. AIRS 2018. Lecture Notes in Computer Science(), vol 11292. Springer, Cham. https://doi.org/10.1007/978-3-030-03520-4_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-03520-4_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-03519-8

  • Online ISBN: 978-3-030-03520-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics