Skip to main content

A Personalized Recommendation Approach Based on Content Similarity Calculation in Large-Scale Data

  • Conference paper
  • First Online:
Algorithms and Architectures for Parallel Processing (ICA3PP 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9528))

Abstract

Recommendation algorithms are widely used to discover interesting content for users from massive data in many fields. However, with more diversification of user requirements, the recommended accuracy and efficiency become a serious concern for improving user satisfaction degree. In this paper, we redefine the concept of content similarity by combining search words with personalized search references and describing their dimensions, then propose the calculation method of content similarity by defining the Hamming distance among current keywords, classified items and historical keywords. Through the pretreatment of support vector data description (SVDD), we may find specific tendency from the personal preference of classified items and present the final recommendation results arranged from high similarity to low one. Simulation experiments show that our proposed approach improves recommendation performance over the other two classical algorithms by an average of 17.2 % and reduces the MAE by 6.3 % on our large-scale dataset. At the same time, our proposed approach has a better performance on recall rate and coverage rate, and user satisfaction degree is also improved at higher extent.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wang, J., Li, G., Feng, J.: Can we beat the prefix filtering?: an adaptive framework for similarity join and search. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 85–96. ACM (2012)

    Google Scholar 

  2. Kusumoto, M., Maehara, T., Kawarabayashi, K.I.: Scalable similarity search for simrank. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 325–336. ACM (2014)

    Google Scholar 

  3. Yang, B., Zhao, P.F.: Review of the art of recommendation algorithms. J. Shanxi Univ. (Nat Sci Ed) 34(3), 337–350 (2011)

    Google Scholar 

  4. Deng, D., Li, G., Feng, J.: A pivotal prefix based filtering algorithm for string similarity search. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 673–684. ACM (2014)

    Google Scholar 

  5. Bobadilla, J., Ortega, F., Hernando, A., Arroyo, Á.: A balanced memory-based collaborative filtering similarity measure. Int. J. Intell. Syst. 27(10), 939–946 (2012)

    Article  Google Scholar 

  6. Yin, H., Cui, B., Chen, L., Hu, Z., Huang, Z.: A temporal context-aware model for user behavior modeling in social media systems. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 1543–1554. ACM (2014)

    Google Scholar 

  7. Wei, S., Ye, N., Zhang, S., Huang, X., Zhu, J.: Collaborative filtering recommendation algorithm based on item clustering and global similarity. In: 2012 Fifth International Conference on Business Intelligence and Financial Engineering (BIFE), pp. 69–72. IEEE (2012)

    Google Scholar 

  8. Zhang, Y., Zhu, X., Shen, Q.: A recommendation model based on collaborative filtering and factorization machines for social networks. In: 2013 5th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT), pp. 110–114. IEEE (2013)

    Google Scholar 

  9. Lops, P., De Gemmis, M., Semeraro, G.: Content-based recommender systems: state of the art and trends. Recommender Systems Handbook, pp. 73–105. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  10. Zhu, G., Lin, X., Zhu, K., Zhang, W., Yu, J.X.: Treespan: efficiently computing similarity all-matching. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 529–540. ACM (2012)

    Google Scholar 

  11. Zhang, J., Pu, P.: A recursive prediction algorithm for collaborative filtering recommender systems. In: Proceedings of the 2007 ACM Conference on Recommender Systems, pp. 57–64. ACM (2007)

    Google Scholar 

  12. Pizzato, L., Rej, T., Chung, T., Koprinska, I., Kay, J.: Recon: a reciprocal recommender for online dating. In: Proceedings of the Fourth ACM Conference on Recommender Systems, pp. 207–214. ACM (2010)

    Google Scholar 

  13. Tian, B., Nan, L., Zheng, Q., Yang, L.: Customer credit scoring method based on the SVDD classification model with imbalanced dataset. In: Zaman, M., Liang, Y., Siddiqui, S.M., Wang, T., Liu, V., Lu, C. (eds.) CETS 2010. CCIS, vol. 113, pp. 46–60. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  14. Huigui, R., Shengxu, H., Chunhua, H., Jinxia, M.: User similarity-based collaborative filtering recommendation algorithm. J. Commun. 35(2), 16–24 (2014)

    Google Scholar 

  15. Yang, Y., Wang, X.R., Hu, Y.C.: Researches of collaborative filtering recommendation algorithm based on user and item clustering combination. J. Guangxi Univ. Technol. 4, 019 (2011)

    Google Scholar 

  16. Yanhong, L., Anrong, X., Xiyun, S.: New classification algorithm k-means clustering combined with svdd. Appl. Res. Comput. 27(3), 883–886 (2010)

    Google Scholar 

  17. Xiaopeng, H., Xianfeng, L., Jun, G., Ming, T.: Updated learning algorithm of support vector data description based on k-means clustering. Comput. Eng. 35(17), 184–186 (2009)

    Google Scholar 

  18. Xiao, L., Xiangru, M., Zelong, C., Xuchun, Z.: Comprehensive evaluation for network survivability based on support vector data description. Appl. Res. Comput. 30(3), 853–856 (2013)

    Google Scholar 

  19. Lu, J., Lin, C., Wang, W., Li, C., Wang, H.: String similarity measures and joins with synonyms. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 373–384. ACM (2013)

    Google Scholar 

  20. Shizhe, S.: Research on the locality sensitive hashing. Master’s thesis, Xidian University (2013)

    Google Scholar 

  21. Joachims, T.: Svmlight: support vector machine. SVM-Light Support Vector Machine, 19(4)U. University of Dortmund (1999). http:/svmlight.joachims.org/

  22. Chang, C.C., Lin, C.J.: Libsvm: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)

    Google Scholar 

  23. Wang, Z., Zhao, Z.-S., Zhang, C.: SVM-SVDD: a new method to solve data description problem with negative examples. In: Guo, C., Hou, Z.-G., Zeng, Z. (eds.) ISNN 2013, Part I. LNCS, vol. 7951, pp. 283–290. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  24. Herlocker, J.L., Konstan, J.A., Terveen, L.G., Riedl, J.T.: Evaluating collaborative filtering recommender systems. ACM Trans. Inf. Syst. (TOIS) 22(1), 5–53 (2004)

    Article  Google Scholar 

Download references

Acknowledgments

This work is partly supported by National Natural Science Foundation of China under under Grant No. 61273232, 61472131, 61272546, 61300218 and 61572181, by the Program for New Century Excellent Talents in University under Grant Number NCET-13-0785.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huigui Rong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Rong, H., Gong, L., Qin, Z., Hu, Y., Hu, C. (2015). A Personalized Recommendation Approach Based on Content Similarity Calculation in Large-Scale Data. In: Wang, G., Zomaya, A., Martinez, G., Li, K. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2015. Lecture Notes in Computer Science(), vol 9528. Springer, Cham. https://doi.org/10.1007/978-3-319-27119-4_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27119-4_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27118-7

  • Online ISBN: 978-3-319-27119-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics