Abstract
An important characteristic feature of recommender systems for web pages is the abundance of textual information in and about the items being recommended (web pages). To improve recommendations and enhance user experience, we propose to use automatic tag (keyword) extraction for web pages entering the recommender system. We present a novel tag extraction algorithm that employs semi-supervised classification based on a dataset consisting of pre-tagged documents and (for the most part) partially tagged documents whose tags are automatically mined from the content. We also compare several classification algorithms for tag extraction in this context.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Guy, I., Zwerdling, N., Ronen, I., Carmel, D., Uziel, E.: Social media recommendation based on people and tags. In: Proceedings of the 33rd Annual ACM SIGIR Conference, pp. 194–201 (2010)
Sen, S., Vig, J., Riedl, J.: Tagommenders: Connecting users to items through tags. In: 18th International World Wide Web Conference, p. 671 (April 2009)
Zhou, T.C., Ma, H., King, I., Lyu, M.R.: UserRec: A user recommendation framework in social tagging systems. In: Proceedings of the 24th AAAI Conference on Artificial Intelligence, pp. 1486–1491 (2010)
Sigurbjörnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: Proceedings of the 2nd ACM Conference on Recommender Systems, pp. 327–336 (2008)
Jäschke, R., Marinho, L., Hotho, A., Schmidt-Thieme, L., Stumme, G.: Tag recommendations in folksonomies. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) PKDD 2007. LNCS(LNAI), vol. 4702, pp. 506–514. Springer, Heidelberg (2007)
Rendle, S., Schmidt-Tieme, L.: Pairwise interaction tensor factorization for personalized tag recommendation. In: Proceedings of the 3rd ACM International Conference on Web Search and Data Mining, pp. 81–90 (2010)
Symeonidis, P., Nanopoulos, A., Manolopoulos, Y.: Tag recommendations based on tensor dimensionality reduction. In: Proceedings of the 2nd ACM Conference on Recommender Systems, pp. 43–50 (2008)
Rifkin, R.M., Yeo, G., Poggio, T.: Regularized least-squares classification. In: Advances in Learning Theory: Methods, Model and Applications. NATO Science Series III: Computer and Systems Sciences, vol. 1, pp. 131–154. IOS Press, Amsterdam (2011)
Keerthi, S.S., DeCoste, D.: A modified finite Newton method for fast solution of large scale linear SVMs. Journal of Machine Learning Research 6, 341–361 (2005)
Sindhwani, V., Keerthi, S.S.: Large scale semi-supervised linear svms. In: Proceedings of the 29th Annual ACM SIGIR Conference, pp. 477–484. ACM, New York (2006)
Illig, J., Hotho, A., Jäschke, R., Stumme, G.: A comparison of content-based tag recommendations in folksonomy systems. In: Wolff, K.E., Palchunov, D.E., Zagoruiko, N.G., Andelfinger, U. (eds.) KONT/KPP 2007. LNCS(LNAI), vol. 6581, pp. 136–149. Springer, Heidelberg (2011)
Fan, R.E., Lin, C.J.: A study on threshold selection for multi-label classification. Technical report, National Taiwan University (2007)
Medelyan, O., Frank, E., Witten, I.H.: Human-competitive tagging using automatic keyphrase extraction. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2009), vol. 3, pp. 1318–1327 (2009)
Turney, P.D.: Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL. In: Flach, P.A., De Raedt, L. (eds.) ECML 2001. LNCS(LNAI), vol. 2167, pp. 491–502. Springer, Heidelberg (2001)
Si, X., Sun, M.: Tag-LDA for scalable real-time tag recommendation. Journal of Computational Information Systems 6, 23–31 (2009)
Krestel, R., Fankhauser, P.: Personalized topic-based tag recommendation. Neural Computation 76(1), 61–70 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Leksin, V.A., Nikolenko, S.I. (2013). Semi-supervised Tag Extraction in a Web Recommender System. In: Brisaboa, N., Pedreira, O., Zezula, P. (eds) Similarity Search and Applications. SISAP 2013. Lecture Notes in Computer Science, vol 8199. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41062-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-41062-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41061-1
Online ISBN: 978-3-642-41062-8
eBook Packages: Computer ScienceComputer Science (R0)