Abstract
Recommendation algorithms are widely used to discover interesting content for users from massive data in many fields. However, with more diversification of user requirements, the recommended accuracy and efficiency become a serious concern for improving user satisfaction degree. In this paper, we redefine the concept of content similarity by combining search words with personalized search references and describing their dimensions, then propose the calculation method of content similarity by defining the Hamming distance among current keywords, classified items and historical keywords. Through the pretreatment of support vector data description (SVDD), we may find specific tendency from the personal preference of classified items and present the final recommendation results arranged from high similarity to low one. Simulation experiments show that our proposed approach improves recommendation performance over the other two classical algorithms by an average of 17.2 % and reduces the MAE by 6.3 % on our large-scale dataset. At the same time, our proposed approach has a better performance on recall rate and coverage rate, and user satisfaction degree is also improved at higher extent.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wang, J., Li, G., Feng, J.: Can we beat the prefix filtering?: an adaptive framework for similarity join and search. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 85–96. ACM (2012)
Kusumoto, M., Maehara, T., Kawarabayashi, K.I.: Scalable similarity search for simrank. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 325–336. ACM (2014)
Yang, B., Zhao, P.F.: Review of the art of recommendation algorithms. J. Shanxi Univ. (Nat Sci Ed) 34(3), 337–350 (2011)
Deng, D., Li, G., Feng, J.: A pivotal prefix based filtering algorithm for string similarity search. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 673–684. ACM (2014)
Bobadilla, J., Ortega, F., Hernando, A., Arroyo, Á.: A balanced memory-based collaborative filtering similarity measure. Int. J. Intell. Syst. 27(10), 939–946 (2012)
Yin, H., Cui, B., Chen, L., Hu, Z., Huang, Z.: A temporal context-aware model for user behavior modeling in social media systems. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 1543–1554. ACM (2014)
Wei, S., Ye, N., Zhang, S., Huang, X., Zhu, J.: Collaborative filtering recommendation algorithm based on item clustering and global similarity. In: 2012 Fifth International Conference on Business Intelligence and Financial Engineering (BIFE), pp. 69–72. IEEE (2012)
Zhang, Y., Zhu, X., Shen, Q.: A recommendation model based on collaborative filtering and factorization machines for social networks. In: 2013 5th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT), pp. 110–114. IEEE (2013)
Lops, P., De Gemmis, M., Semeraro, G.: Content-based recommender systems: state of the art and trends. Recommender Systems Handbook, pp. 73–105. Springer, Heidelberg (2011)
Zhu, G., Lin, X., Zhu, K., Zhang, W., Yu, J.X.: Treespan: efficiently computing similarity all-matching. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 529–540. ACM (2012)
Zhang, J., Pu, P.: A recursive prediction algorithm for collaborative filtering recommender systems. In: Proceedings of the 2007 ACM Conference on Recommender Systems, pp. 57–64. ACM (2007)
Pizzato, L., Rej, T., Chung, T., Koprinska, I., Kay, J.: Recon: a reciprocal recommender for online dating. In: Proceedings of the Fourth ACM Conference on Recommender Systems, pp. 207–214. ACM (2010)
Tian, B., Nan, L., Zheng, Q., Yang, L.: Customer credit scoring method based on the SVDD classification model with imbalanced dataset. In: Zaman, M., Liang, Y., Siddiqui, S.M., Wang, T., Liu, V., Lu, C. (eds.) CETS 2010. CCIS, vol. 113, pp. 46–60. Springer, Heidelberg (2010)
Huigui, R., Shengxu, H., Chunhua, H., Jinxia, M.: User similarity-based collaborative filtering recommendation algorithm. J. Commun. 35(2), 16–24 (2014)
Yang, Y., Wang, X.R., Hu, Y.C.: Researches of collaborative filtering recommendation algorithm based on user and item clustering combination. J. Guangxi Univ. Technol. 4, 019 (2011)
Yanhong, L., Anrong, X., Xiyun, S.: New classification algorithm k-means clustering combined with svdd. Appl. Res. Comput. 27(3), 883–886 (2010)
Xiaopeng, H., Xianfeng, L., Jun, G., Ming, T.: Updated learning algorithm of support vector data description based on k-means clustering. Comput. Eng. 35(17), 184–186 (2009)
Xiao, L., Xiangru, M., Zelong, C., Xuchun, Z.: Comprehensive evaluation for network survivability based on support vector data description. Appl. Res. Comput. 30(3), 853–856 (2013)
Lu, J., Lin, C., Wang, W., Li, C., Wang, H.: String similarity measures and joins with synonyms. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 373–384. ACM (2013)
Shizhe, S.: Research on the locality sensitive hashing. Master’s thesis, Xidian University (2013)
Joachims, T.: Svmlight: support vector machine. SVM-Light Support Vector Machine, 19(4)U. University of Dortmund (1999). http:/svmlight.joachims.org/
Chang, C.C., Lin, C.J.: Libsvm: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)
Wang, Z., Zhao, Z.-S., Zhang, C.: SVM-SVDD: a new method to solve data description problem with negative examples. In: Guo, C., Hou, Z.-G., Zeng, Z. (eds.) ISNN 2013, Part I. LNCS, vol. 7951, pp. 283–290. Springer, Heidelberg (2013)
Herlocker, J.L., Konstan, J.A., Terveen, L.G., Riedl, J.T.: Evaluating collaborative filtering recommender systems. ACM Trans. Inf. Syst. (TOIS) 22(1), 5–53 (2004)
Acknowledgments
This work is partly supported by National Natural Science Foundation of China under under Grant No. 61273232, 61472131, 61272546, 61300218 and 61572181, by the Program for New Century Excellent Talents in University under Grant Number NCET-13-0785.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Rong, H., Gong, L., Qin, Z., Hu, Y., Hu, C. (2015). A Personalized Recommendation Approach Based on Content Similarity Calculation in Large-Scale Data. In: Wang, G., Zomaya, A., Martinez, G., Li, K. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2015. Lecture Notes in Computer Science(), vol 9528. Springer, Cham. https://doi.org/10.1007/978-3-319-27119-4_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-27119-4_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27118-7
Online ISBN: 978-3-319-27119-4
eBook Packages: Computer ScienceComputer Science (R0)