Skip to main content
Log in

Towards better news article recommendation

With the help of user comments

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

News media platforms publish articles about daily events letting their users comment on them, and forming interesting discussions in almost real-time. To keep users always active and interested, media platforms need an effective recommender system to bring up new articles that match user interests. In this article, we show that we can improve the quality of recommendation by exploiting valuable information provided by user comments. This information reveals aspects not directly tackled by the news article on which they have been posted. We call such aspects latent aspects. We demonstrate how these latent aspects can make a crucial difference in the accuracy of future recommendation. The challenge in detecting them is due to the noisy nature of user comments. To support our claim, we propose a novel news recommendation system that (1) enriches the description of news articles by latent aspects extracted from user comments, (2) deals with noisy comments by proposing a model for user comments ranking, and (3) proposes a diversification model to remove redundancies and provide a wide coverage of aspects. We have tested our approach using large collections of real user activities in four news Web sites, namely The INDEPENDENT, The Telegraph, CNN and Al-Jazeera. The results show that our approach outperforms baseline approaches achieving a significantly higher accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4

Similar content being viewed by others

Notes

  1. www.cnn.com

  2. www.independent.co.uk

  3. We have experimentally set λ=0.5

References

  1. Abbar, S., Amer-Yahia, S., Indyk, P., Mahabadi, S.: Real-time recommendation of diverse related articles. In: Proceedings of the 22nd international conference on world wide Web, pp 1–12. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva (2013)

  2. Abel, F., Gao, Q., Houben, G. -J., Tao, K.: Analyzing user modeling on twitter for personalized news recommendations. In: Proceedings of the 19th international conference on user modeling, adaption, and personalization, pp 1–12. Springer, Berlin (2011)

  3. Aker, A., Kurtic, E., Balamurali, A., Paramita, M., Barker, E., Hepple, M., Gaizauskas, R: A graph-based approach to topic clustering for online comments to news. In: Advances in information retrieval, pp 15–29. Springer (2016)

  4. Bansal, T., Das, M., Bhattacharyya, C: Content driven user profiling for commentworthy recommendations of news and blog articles. In: Proceedings of the 9th acm conference on recommender systems, pp 195–202. ACM, New York (2015)

  5. Chen, J., Nairn, R., Nelson, L., Bernstein, M., Chi, E.: Short and tweet: Experiments on recommending content from information streams. In: Proceedings of the sigchi conference on human factors in computing systems, pp 1185–1194. ACM, New York (2010)

  6. Danescu-Niculescu-Mizil, C., Kossinets, G., Kleinberg, J., Lee, L.: How opinions are received by online communities: a case study on amazon.com helpfulness votes. In: Proceedings of the 18th international conference on world wide Web, pp 141–150. ACM, New York (2009)

  7. Ganesan, K., Zhai, C.: Opinion-based entity ranking. Inf. Retr. 15(2), 116–150 (2012)

    Article  Google Scholar 

  8. Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: Proceedings of the 18th international conference on world wide Web, pp 381–390. ACM, New York (2009a)

  9. Gollapudi, S., Sharma, A.: An axiomatic approach for result diversification. In: Proceedings of the 18th international conference on world wide Web, pp 381–390. ACM, New York (2009b)

  10. Hassin, R., Rubinstein, S., Tamir, A.: Approximation algorithms for maximum dispersion. Oper. Res. Lett. 21, 133–137 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  11. Hu, M., Sun, A., Lim, E. -P.: Comments-oriented document summarization: Understanding documents with readers’ feedback. In: Proceedings of the 31st annual international acm sigir conference on research and development in information retrieval, pp 291–298. ACM, New York (2008)

  12. Kant, R., Sengamedu, S.H., Kumar, K.S.: Comment spam detection by sequence mining. In: Proceedings of the fifth acm international conference on Web search and data mining, pp 183–192. ACM, New York (2012)

  13. Kim, S., Pantel, P., Chklovski, T., Pennacchiotti, M.: Automatically assessing review helpfulness. In: Proceedings of the 2006 conference on empirical methods in natural language processing, pp 423–430 (2006)

  14. Korte, B., Hausmann, D.: An analysis of the greedy heuristic for independence systems. Ann. Discret. Math. 2, 65–74 (1978)

    Article  MathSciNet  MATH  Google Scholar 

  15. Li, Q., Wang, J., Chen, Y.P., Lin, Z.: User comments for news recommendation in forum-based social media. Inform. Sci. 180(24), 4929–4939 (2010a)

  16. Li, Q., Wang, J., Chen, Y.P., Lin, Z: User comments for news recommendation in forum-based social media. Inf. Sci 180(24), 4929–4939 (2010b)

  17. Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th acm conference on information and knowledge management, pp 375–384. ACM, New York (2009)

  18. Litvak, M., Matz, L.: Smartnews: Bringing order into comments chaos. In: Proceedings of the international conference on knowledge discovery and information retrieval, kdir, vol. 13 (2013)

  19. Liu, C. -Y., Chen, M. -S., Tseng, C. -Y.: Incrests: Towards real-time incremental short text summarization on comment streams from social network services. IEEE Trans. Knowl. Data Eng. 27, 2986–3000 (2015)

    Article  Google Scholar 

  20. Liu, Y., Huang, X., An, A., Yu, X.: Modeling and predicting the helpfulness of online reviews. In: Eighth ieee international conference on data mining, 2008. icdm’08, pp 443–452 (2008)

  21. Llewellyn, C., Grover, C., Oberlander, J: Summarizing newspaper comments. In: Icwsm (2014)

  22. Marcheggiani, D, Täckström, O., Esuli, A., Sebastiani, F: Hierarchical multilabel conditional random fields for aspect-oriented opinion mining. In: Advances in information retrieval, pp 273–285. Springer (2014)

  23. Meguebli, Y., Kacimi, M., Doan, B.-L., Popineau, F: Unsupervised approach for identifying users? political orientations. In: Advances in information retrieval, pp 507–512. Springer (2014)

  24. Persing, I., Ng, V: Vote prediction on comments in social polls. In: Emnlp, pp 1127–1138 (2014)

  25. Phelan, O., McCarthy, K., Smyth, B.: Using twitter to recommend real-time topical news. In: Proceedings of the third acm conference on recommender systems, pp 385–388. ACM, New York (2009)

  26. Popescu, A. -M., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of the conference on human language technology and empirical methods in natural language processing, pp 339–346. Association for Computational Linguistics, Stroudsburg (2005)

  27. Rayana, S., Akoglu, L.: Collective opinion spam detection: Bridging review networks and metadata. In: Proceedings of the 21th acm sigkdd international conference on knowledge discovery and data mining, pp 985–994. ACM, New York (2015)

  28. Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized ranking from implicit feedback. In: Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence, pp 452–461. AUAI Press, Arlington (2009)

  29. San Pedro, J., Yeh, T., Oliver, N.: Leveraging user comments for aesthetic aware image search reranking. In: Proceedings of the 21st international conference on world wide Web, pp 439–448. ACM, New York (2012)

  30. Shmueli, E., Kagian, A., Koren, Y., Lempel, R.: Care to comment?: Recommendations for commenting on news stories. In: Proceedings of the 21st international conference on world wide Web, pp 429–438. ACM, New York (2012)

  31. Terra, E., Clarke, C.L.A.: Frequency estimates for statistical word similarity measures. In: Proceedings of the 2003 conference of the north american chapter of the association for computational linguistics on human language technology, vol. 1, pp 165–172. Association for Computational Linguistics, Stroudsburg (2003)

  32. Tsagkias, M., Weerkamp, W., de Rijke, M.: Predicting the volume of comments on online news stories. In: Proceedings of the 18th acm conference on information and knowledge management, pp 1765–1768. ACM, New York (2009)

  33. Tsagkias, M., Weerkamp, W., de Rijke, M.: News comments: Exploring, modeling, and online prediction. In: Proceedings of the 32nd european conference on advances in information retrieval, pp 191–203. Springer, Berlin (2010)

  34. Tsur, O., Rappoport, A: Revrank: a fully unsupervised algorithm for selecting the most helpful book reviews. In: International aaai conference on weblogs and social media (2009)

  35. Wang, H., Lu, Y., Zhai, C.: Latent aspect rating analysis on review text data: a rating regression approach. In: Proceedings of the 16th acm sigkdd international conference on knowledge discovery and data mining, pp 783–792. ACM, New York (2010)

  36. Ye, J., Kumar, S., Akoglu, L.: Temporal opinion spam detection by multivariate indicative signals (2016). arXiv:1603.01929

  37. Yee, W.G., Yates, A., Liu, S., Frieder, O: Are Web user comments useful for search. In: Proceedings of LSDS-IR, pp 63–70 (2009)

  38. Zhao, L., Huang, M., Sun, J., Luo, H., Yang, X., Zhu, X.: Sentiment extraction by leveraging aspect-opinion association structure. In: Proceedings of the 24th acm international on conference on information and knowledge management, pp 343–352. ACM, New York (2015)

  39. Zhuang, L., Jing, F., Zhu, X. -Y.: Movie review mining and summarization. In: Proceedings of the 15th acm international conference on information and knowledge management, pp 43–50. ACM, New York (2006)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mouna Kacimi.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Meguebli, Y., Kacimi, M., Doan, BL. et al. Towards better news article recommendation. World Wide Web 20, 1293–1312 (2017). https://doi.org/10.1007/s11280-017-0436-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-017-0436-2

Keywords

Navigation