ABSTRACT
Discovering users' interests is essential in order to help them explore resources in large digital repositories. In particular, correctly identifying users' interests is commonly a good approach for organising information and providing personalised recommendations. We consider here the case of discovering users' research interests in Mendeley a research platform for scholarly article management and discovery. Prior work in this area has considered approaches such as matrix factorisation and text-based topic modelling for inferring topics of interest in recommendation scenarios. These approaches present several problems, such as little or no interpretability of the inferred topics and difficulty handling similarities in vocabulary in different research disciplines. We present an effective solution for extracting coherent and interpretable research topics that leverages the reference management data in Mendeley in a three-step approach: 1) a topic model based on the interactions between users and articles rather than article content, 2) keyword extraction to label the topics using article titles and author-declared keywords and 3) identifying the research interests of users based on the articles that they have added to their libraries. An evaluation comprised of a research interest prediction task and an article recommendation task shows the validity of our proposal in different research disciplines (clearly outperforming a text-based latent topic model) and provides further insights regarding the effects of number of latent topics in the model and the trade-off between recency and quantity of the users' libraries.
- Joeran Beel, Bela Gipp, Stefan Langer, and Corinna Breitinger. 2016. Research-paper recommender systems: a literature survey. International Journal on Digital Libraries 17, 4 (nov 2016), 305--338. DOI:http://dx.doi.org/10.1007/s00799-015-0156-0 Google ScholarDigital Library
- Joeran Beel, Stefan Langer, Marcel Genzmehr, and Andreas Nürnberger. 2013. Introducing Docear's Research Paper Recommender System. In Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '13). 459--460. DOI:http://dx.doi.org/10.1145/2467696.2467786 Google ScholarDigital Library
- David M Blei. 2012. Probabilistic topic models. Commun. ACM 55, 4 (2012), 77--84. https://doi.org/10.1145/2133806.2133826Google ScholarDigital Library
- David M Blei and John D Lafferty. 2009. Visualizing topics with multi-word expressions. arXivpreprint arXiv:0907.1013 (2009). https://arxiv.org/abs/0907.1013Google Scholar
- David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet Allocation. J. Mach. Learn. Res. 3 (March 2003), 993--1022. http://dl.acm.org/citation.cfm?id=944919.944937Google Scholar
- Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the American Society for Information Science 41, 6 (1990), 391--407. DOI:http://dx.doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9 Google ScholarCross Ref
- Thomas L Griffiths and Mark Steyvers. 2004. Finding scientific topics. Proceedings of the National academy of Sciences of the United States of America 101, Suppl 1 (2004), 5228--5235. DOI:http://dx.doi.org/10.1073/pnas.0307752101 Google ScholarCross Ref
- Asela Gunawardana and Guy Shani. 2009. A Survey of Accuracy Evaluation Metrics of Recommendation Tasks. J. Mach. Learn. Res. 10 (Dec. 2009), 2935--2962. http://dl.acm.org/citation.cfm?id=1577069.1755883Google Scholar
- Jonathan L. Herlocker, Joseph A. Konstan, and John Riedl. 2000. Explaining Collaborative Filtering Recommendations. In Proceedings of the 2000 ACM Conference on Computer Supported Cooperative Work (CSCW '00). ACM, New York, NY, USA, 241--250. DOI: http://dx.doi.org/10.1145/358916.358995 Google ScholarDigital Library
- Thomas Hofmann. 2004. Latent Semantic Models for Collaborative Filtering. ACM Trans. Inf. Syst. 22, 1 (Jan. 2004), 89--115. DOI:http://dx.doi.org/10.1145/963770.963774 Google ScholarDigital Library
- Pijitra Jomsri, Siripun Sanguansintukul, and Worasit Choochaiwattana. 2010. A Framework for Tag-Based Research Paper Recommender System: An IR Approach. In Proceedings of the 2010 IEEE 24th International Conference on Advanced Information Networking and Applications Workshops (WAINA '10). IEEE Computer Society, Washington, DC, USA, 103--108. DOI:http://dx.doi.org/10.1109/WAINA.2010.35 Google ScholarDigital Library
- Y. Koren, R. Bell, and C. Volinsky. 2009. Matrix Factorization Techniques for Recommender Systems. Computer 42, 8 (Aug 2009), 30--37. DOI: http://dx.doi.org/10.1109/MC.2009.263 Google ScholarDigital Library
- Onur Küçüktunç, Erik Saule, Kamer Kaya, and Ümit V. Çatalyürek. 2012. Recommendation on Academic Networks using Direction Aware Citation Analysis. CoRR abs/1205.1143 (2012). http://arxiv.org/abs/1205.1143Google Scholar
- Davide Magatti, Fabio Stella, Silvia Calegari, and Davide Ciucci. 2009. Automatic Labeling of Topics. Intelligent Systems Design and Applications, International Conference on 00 (2009), 1227--1232. DOI: http://dx.doi.org/doi.ieeecomputersociety.org/10.1109/ISDA.2009.165Google Scholar
- Andrew Kachites McCallum. 2002. MALLET: A Machine Learning for Language Toolkit. (2002). http://mallet.cs.umass.eduGoogle Scholar
- Sean M. McNee, Istvan Albert, Dan Cosley, Prateep Gopalkrishnan, Shyong K. Lam, Al Mamunur Rashid, Joseph A. Konstan, and John Riedl. 2002. On the Recommending of Citations for Research Papers. In Proceedings of the 2002 ACM Conference on Computer Supported Cooperative Work (CSCW '02). 116--125. DOI:http://dx.doi.org/10.1145/587078.587096 Google ScholarDigital Library
- Anna Ritchie, Simone Teufel, and Stephen Robertson. 2008. Using Terms from Citations for IR: Some First Results. In Proceedings of the IR Research, 30th European Conference on Advances in Information Retrieval (ECIR'08). 211--221. http://dl.acm.org/citation.cfm?id=1793274.1793303Google ScholarCross Ref
- M. Rossetti, F. Stella, and M. Zanker. 2013. Towards Explaining Latent Factors with Topic Models in Collaborative Recommender Systems. In 2013 24th International Workshop on Database and Expert Systems Applications. 162--167. DOI: http://dx.doi.org/10.1109/DEXA.2013.26 Google ScholarDigital Library
- Saül Vargas, Maya Hristakeva, and Kris Jack. 2016. Mendeley: Recommendations for Researchers. In Proceedings of the 10th ACM Conference on Recommender Systems - RecSys '16. 365--365. DOI:http://dx.doi.org/10.1145/2959100.2959116 Google ScholarDigital Library
- Chong Wang and David M. Blei. 2011. Collaborative Topic Modeling for Recommending Scientific Articles. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '11). ACM, New York, NY, USA, 448--456. DOI:http://dx.doi.org/10.1145/2020408.2020480 Google ScholarDigital Library
- Yuan Cao Zhang, Diarmuid Ó Séaghdha, Daniele Quercia, and Tamas Jambor. 2012. Auralist: Introducing Serendipity into Music Recommendation. In Proceedings of the Fifth ACM International Conference on Web Search and Data Mining (WSDM '12). 13--22. DOI:http://dx.doi.org/10.1145/2124295.2124300 Google ScholarDigital Library
Recommendations
Exploiting user interests for collaborative filtering: interests expansion via personalized ranking
CIKM '10: Proceedings of the 19th ACM international conference on Information and knowledge managementIn real applications, a given user buys or rates an item based on his/her interests. Learning to leverage this interest information is often critical for recommender systems. However, in existing recommender systems, the information about latent user ...
Extending a Tag-based Collaborative Recommender with Co-occurring Information Interests
UMAP '19: Proceedings of the 27th ACM Conference on User Modeling, Adaptation and PersonalizationCollaborative Filtering is largely applied to personalize item recommendation but its performance is affected by the sparsity of rating data. In order to address this issue, recent systems have been developed to improve recommendation by extracting ...
Identifying Grey Sheep Users By The Distribution of User Similarities In Collaborative Filtering
RIIT '17: Proceedings of the 6th Annual Conference on Research in Information TechnologyRecommender Systems have been successfully applied to alleviate the information overload problem and assist the process of decision making. Collaborative filtering, as one of the most popular recommendation algorithms, has been fully explored and ...
Comments