Skip to main content
Log in

Recommender system architecture based on Mahout and a main memory database

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

In this study, we propose a news recommendation system architecture using a main memory database (DB) and Mahout. The user’s news preference rate is calculated automatically based on the time the user spends reading news items and their length. While existing systems also infer the user’s preferred fields, our system adjusts the volume and ratio of news stories using these categories. We collect web pages accessed by the user on a smart device and classify them using a naive Bayes classifier to determine the user’s preferred news categories. Collaborative filtering is then used to search for related news items read by others and to recommend news in a ratio consistent with the user’s preferred fields. Using a main memory DB, recommendations are computed 2.1 times faster than with a traditional DB when recommending from among 100,000 items; further, the more data used for recommendations, the bigger the speed difference between the proposed and traditional systems becomes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Dong H, Zhu J, Tang Y, Xu C, Ding R, Chen L (2015) UBS: a novel news recommendation system based on user behavior sequence. In: International Conference on Knowledge Science, Engineering and Management. Springer, Berlin, pp 738–750

  2. Kim SD, Park MG (2009) An adaptation system based on personalized web content items for mobile devices. KSII Trans Internet Inf Syst 3(6):628–646

    Google Scholar 

  3. Liu J, Dolan P, Pedersen ER (2010) Personalized news recommendation based on click behavior. In: Proceedings of the 15th International Conference on Intelligent User Interfaces. ACM, New York, pp 31–40

  4. Ziny news (2016) http://www.zinynews.com/. Accessed 3 Mar 2016

  5. British Broadcasting Corporation (2016) http://www.bbc.com/news. Accessed 7 Apr 2016

  6. Times (2016) https://www.time.com. Accessed 9 Apr 2016

  7. Google (2016) https://news.google.co.kr/. Accessed 2 Apr 2016

  8. Baidu (2016) http://jian.news.baidu.com/. Accessed 7 Apr 2016

  9. Daum (2016) http://media.daum.net/. Accessed 8 Apr 2016

  10. Sridevi M, Rajeshwara Rao R, Varaprasad Rao M (2016) A survey on recommender system. Int J Comput Sci Inf Secur 14(5):265–272

    Google Scholar 

  11. Garcin F, Zhou K, Faltings B, Schickel V (2012) Personalized news recommendation based on collaborative filtering. In: Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology. IEEE Computer Society. 01, pp 437–441

  12. Zheng X, Lou Y, Xu Z, Yu Q, Lu L (2016) Tourism destination recommender system for the cold start problem. KSII Trans Internet Inf Syst 10(7):3192–3212

    Google Scholar 

  13. Lika B, Kolomvatsos K, Hadjiefthymiades S (2014) Facing the cold start problem in recommender systems. Expert Syst Appl 41:2065–2073

    Article  Google Scholar 

  14. Tavakolifard M, Gulla JA, Almeroth KC, Ingvaldesn JE, Nygreen G, Berg E (2013) Tailored news in the palm of your hand: a multi-perspective transparent approach to news recommendation. In: Proceedings of the 22nd International Conference on World Wide Web. ACM, New York, pp 305–308

  15. Bobadilla J, Ortega F, Hernando A, Gutiérrez A (2013) Recommender systems survey. Knowl Based Syst 46(2013):109–132

    Article  Google Scholar 

  16. Pazzani MJ, Billsus D (2007) Content-based recommendation systems. In: Brusilovsky P, Kobsa A, Nejdl W (eds) The Adaptive Web. Springer, Berlin, pp 325–341

  17. Lops P, De Gemmis M, Semeraro G (2011) Content-based recommender systems: State of the art and trends. In: Ricci F, et al. (eds) Recommender Systems Handbook, pp 73–105

  18. Cantador I, Bellogín A, Vallet D (2010) Content-based recommendation in social tagging systems. In: Proceedings of the Fourth ACM Conference on Recommender Systems. pp 237–240

  19. Aizawa K, Tancharoen D, Kawasaki S, Yamasaki T (2004) Efficient retrieval of life log based on context and content. In: Proceedings of the 1st ACM Workshop on Continuous Archival and Retrieval of Personal Experiences. ACM, New York, pp 22–31

  20. Zhu H, Chen E, Xiong H, Yu K, Cao H, Tian J (2015) Mining mobile user preferences for personalized context-aware recommendation. ACM Trans Intell Syst Technol (TIST) 5(4):58

    Google Scholar 

  21. Burke R (2007) Hybrid web recommender systems. In: Brusilovsky P, Kobsa A, Nejdl W (eds) The Adaptive Web. Springer, Berlin, pp 377–408

  22. Claypool M, Le P, Wased M, Brown D (2001) Implicit interest indicators. In: Proceedings of the 6th International Conference on Intelligent User Interfaces. ACM, New York, pp 33–40

  23. Kawai Y, Kumamoto T, Tanaka K (2007) Fair news reader: recommending news articles with different sentiments based on user preference. In: Bruno Apolloni B, Howlett RJ, Jain L (eds) Knowledge-Based Intelligent Information and Engineering Systems. Springer, Berlin, pp 612–622

  24. Lee HJ, Park SJ (2007) MONERS: a news recommender for the mobile web. Expert Syst Appl 32(1):143–150

    Article  Google Scholar 

  25. Brunato M, Battiti R (2003) PILGRIM: A location broker and mobility-aware recommendation system. In: Proceedings of IEEE PerCom. pp 265–272

  26. Yeung KF, Yang Y (2010) A proactive personalized mobile news recommendation system. In: Developments in E-Systems Engineering (DESE). pp 207–212. doi:10.1109/DeSE.2010.40

  27. Thelwall M (2001) A web crawler design for data mining. J Inf Sci 27(5):319–325

    Article  Google Scholar 

  28. Gu YH, Yoo SJ, Piao Z, Lin Y, Yan J, Park JH (2015) User preference analysis and visualization through the browser history of smart devices. In: Proceedings of the 2015 International Conference on Big Data Applications and Services. ACM, New York, pp 264–267

  29. Zhang W, Yoshida T, Tang X (2011) A comparative study of TF* IDF, LSI and multi-words for text classification. Expert Syst Appl 38(3):2758–2765

    Article  Google Scholar 

  30. Zhang ML, Pena JM, Robles V (2009) Feature selection for multi-label naive Bayes classification. Inf Sci 179(19):3218–3229

    Article  MATH  Google Scholar 

  31. Chang HY, Huang SC, Lai CC (2014) A personalized IPTV channel-recommendation mechanism based on the MapReduce framework. J Supercomput 69(1):225–247

    Article  Google Scholar 

  32. Li C, Lin M, Yang LT, Ding C (2014) Integrating the enriched feature with machine learning algorithms for human movement and fall detection. J Supercomput 67(3):854–865

    Article  Google Scholar 

  33. Li CH, Gondra I, Liu L (2012) An efficient parallel neural network-based multi-instance learning algorithm. J Supercomput 62(2):724–740

    Article  Google Scholar 

Download references

Acknowledgements

This research was partly supported by the Global IT Talent support program of MSIP/IITP. [IITP-2016-H0905-15-1005, Research on the Development of Automatic Personalized Big-Data Curation Technology with In-Memory Database] and ICT R&D program of MSIP/IITP [2014-0-00616, Building an Infrastructure of a Large Size Data Center].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Seong Joon Yoo.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Piao, Z., Yoo, S.J., Gu, Y.H. et al. Recommender system architecture based on Mahout and a main memory database. J Supercomput 74, 105–121 (2018). https://doi.org/10.1007/s11227-017-2111-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-017-2111-x

Keywords

Navigation