ABSTRACT
Stack Overflow is one of the most popular online programming question and answer websites for developers around the world. Generally, developers need to provide tags for their posting. High-quality tags are expected to facilitate correct classification and efficient search. Unfortunately, tagging process is distributed and uncoordinated due to developers' understanding of their postings, English skills and preferences. Automatic tag recommendation becomes increasingly important for these information sites. In this paper, we propose SOTagRec, a novel tag recommendation approach combing convolutional neural network model and collaborative filtering method. By learning historical postings and their tags from existing information, SOTagRec can accurately infer tags for new postings. We have evaluated SOTagRec on Stackoverflow and compare with the state-of-the-art methods. Experiments Results show that SOTagRec achieves 81.7% and 88.7% respectively for Recall@5 and Recall@10, which outperforms the previous relevant methods.
- Xia, X., Lo, D., Wang, X., and Zhou, B. 2013. Tag recommendation in software information sites. In 2013 10th Working Conference on Mining Software Repositories (MSR), IEEE, pp. 287--296. Google ScholarDigital Library
- Zhou, P., Liu, J., Yang, Z., and Zhou, G. 2017. Scalable tag recommendation for software information sites. In 2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER), IEEE, pp. 272--282.Google Scholar
- Wang, S., Lo, D., Vasilescu, B., and Serebrenik, A. 2014. September). EnTagRec: An enhanced tag recommendation system for software information sites. In 2014 IEEE International Conference on Software Maintenance and Evolution (ICSME), IEEE, pp. 291--300.Google Scholar
- Wang, M., Ni, B., Hua, X. S., and Chua, T. S. 2012. Assistive tagging: A survey of multimedia tagging with human-computer joint exploration. ACM Comput. Surv. 44, 4, Article 25 (September 2012), 24 pages. Google ScholarDigital Library
- Mishne, G. 2006. AutoTag: a collaborative approach to automated tag assignment for weblog posts. In Proceedings of the 15th international conference on World Wide Web (WWW '06). ACM, New York, NY, USA, 953--954. Google ScholarDigital Library
- Wang, H., Shi, X., & Yeung, D. Y. 2015. Relational stacked denoising autoencoder for tag recommendation. In Twenty-ninth AAAI conference on artificial intelligence, pp. 3052--3058. Google ScholarDigital Library
- Si, X., and Sun, M. 2009. Tag-LDA for scalable real-time tag recommendation. JOURNAL OF INFORMATION & COMPUTATIONAL SCIENCE, 6(2), 1009--1016.Google Scholar
- Wang, H., Chen, B., & Li, W. J. 2013. Collaborative topic regression with social regularization for tag recommendation. In Twenty-Third International Joint Conference on Artificial Intelligence. Google ScholarDigital Library
- Saha, Avigit K., Ripon K. Saha, and Kevin A. Schneider. 2013. A discriminative model approach for suggesting tags automatically for stack overflow questions. In Proceedings of the 10th Working Conference on Mining Software Repositories, IEEE, pp. 73--76.Google Scholar
- Zhang, J., Sun, H., Tian, Y., and Liu, X. 2018. Semantically enhanced tag recommendation for software CQAs via deep learning. In Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings (ICSE '18). ACM, New York, NY, USA, 294--295. Google ScholarDigital Library
- Kim, Y. 2014. Convolutional neural networks for sentence classification. In Proc. Empirical Methods Natural Lang. Process., 2014, pp. 1746--1751.Google Scholar
- Kalchbrenner, N., Grefenstette, E., & Blunsom, P. 2014. A convolutional neural network for modelling sentences. arXiv preprint arXiv:1404.2188.Google Scholar
- Liu, P., Qiu, X., & Huang, X. 2016. Recurrent neural network for text classification with multi-task learning. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp. 2873--2879. Google ScholarDigital Library
- Lai, S., Xu, L., Liu, K., & Zhao, J. 2015. Recurrent convolutional neural networks for text classification. In Twenty-ninth AAAI conference on artificial intelligence. Google ScholarDigital Library
- Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pp. 3111--3119. Google ScholarDigital Library
- Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural Language Processing (Almost) from Scratch. J. Mach. Learn. Res. 12 (November 2011), 2493--2537.Google Scholar
- Baeza-Yates, R., and Ribeiro, B. D. A. N. 2011. Modern information retrieval. New York: ACM Press; Harlow, England: Addison-Wesley. Google ScholarDigital Library
- Loper, E., and Bird, S. 2002. NLTK: the natural language toolkit. arXiv preprint cs/0205028. Google ScholarDigital Library
- Willett, P. 2006. The Porter stemming algorithm: then and now. Program, 40(3), 219--223.Google ScholarCross Ref
- J Han, J., Pei, J., & Kamber, M. 2011. Data mining: concepts and techniques. Elsevier. Google ScholarDigital Library
Index Terms
- SOTagRec: A Combined Tag Recommendation Approach for Stack Overflow
Recommendations
TAGme: A Topical Folksonomy Based Collaborative Filtering for Tag Recommendation in Community Sites
MISNC '17: Proceedings of the 4th Multidisciplinary International Social Networks ConferenceCommunity Question Answering1 (CQA) sites allow users to share and exchange the knowledge on various fields. In recent years the CQA sites gain the huge popularity on the web. Searching the information is a very difficult task in CQA sites which is ...
A tag recommendation system for folksonomy
SWSM '09: Proceedings of the 2nd ACM workshop on Social web search and miningSocial bookmarking tools become more and more popular nowadays and tagging is used to organize information and allow users to recall or search the resources. Users need to type the tags whenever they post a resource, so that a good tag recommendation ...
Tag recommendation for social bookmarking: Probabilistic approaches
Principles and Practice of Multi-Agent SystemsTagging has become increasingly popular with the explosion of user-created content on the web. A 'tag' can be defined as a group of keywords that makes organizing, browsing and searching for content more efficient. Users apply tags to a variety of web-...
Comments