Abstract
The information explosion in today’s electronic world has created the need for information filtering techniques that help users filter out extraneous content to identify the right information they need to make important decisions. Recommender systems are one approach to this problem, based on presenting potential items of interest to a user rather than requiring the user to go looking for them. In this paper, we propose a recommender system that recommends research papers of potential interest to authors known to the CiteSeer database. For each author participating in the study, we create a user profile based on their previously published papers. Based on similarities between the user profile and profiles for documents in the collection, additional papers are recommended to the author. We introduce a novel way of representing the user profiles as trees of concepts and an algorithm for computing the similarity between the user profiles and document profiles using a tree-edit distance measure. Experiments with a group of volunteers show that our concept-based algorithm provides better recommendations than a traditional vector-space model based technique.
This research was supported in part by the National Science Foundation grant number 0454121: CRI: Collaborative: Next Generation CiteSeer.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bollacker, K., Lawrence, S., Giles, C.L.: Citeseer: an autonomous web agent for automatic retrieval and identification of interesting publications. In: Bollacker, K., Lawrence, S., Giles, C.L. (eds.) Agents 1998, 2nd International ACM Conference On Autonomous Agents, pp. 116–123. ACM Press, New York (1998)
Torres, R., McNee, S.M., Abel, M., Konstan, J.A., Riedl, J.: Enhancing digital libraries with techlens. In: Digital Libraries. Proceedings of the Joint ACM/IEEE Conference, June 2004, pp. 228–236 (2004)
Salton, G., Buckley, C.: Term weighting approaches in Automatic Text Retrieval. Information Processing and Management 24(5), 513–523 (1988)
Burke, R.: Hybrid recommender systems: survey and experiments. User Modeling and User-adapted Interaction 12(4), 331–370 (2002)
Huang, Z., Chung, W., Ong, T.-H., Chen, H.: A graph-based recommender system for digital library. In: Proc. Joint Conf. on Digital Libraries, Portland, pp. 65–73 (2002)
Chen, H., Ng, T.: An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation): symbolic branch-and-bound search vs. connectionist hopfield net activation. Journal of the American Society for Information Science 46(5), 348–369 (1995)
Balabanović, M., Shoham, Y.: Fab: content-based, collaborative recommendation. Communications of the ACM 40(3), 66–72 (1997)
Basu, C., Hirsh, H., Cohen, W., Nevill-Manning, C.: Technical paper recommendation: a study in combining multiple information sources. Journal of Artificial Intelligence Research (14), 231–252 (2001)
Cohen, W.: The WHIRL approach to information integration. In: Hearst, M. (ed.) Trends and Controversies, IEEE Intelligent Systems, pp. 20–23 (October 1998)
Billsus, D., Pazzani, M.J., Chen, J.: A learning agent for wireless news access. In: Proc. of the International Conference on Intelligent User Interfaces, pp. 33–36 (2000)
Si, L., Jin, R.: Flexible mixture model for collaborative filtering. In: Proc. 20th Int’l Conf. Machine Learning, August 2003, pp. 704–711 (2003)
Hofmann, T., Puzicha, J.: Latent class models for collaborative filtering. In: The Proceedings of IJCAI, pp. 688–693 (1999)
Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39(1), 1–38 (1977)
Hofmann, T., Puzicha, J.: Statistical models for co-occurrence data (Technical Report). Artificial Intelligence Laboratory Memo 1625, M.I.T (1998)
Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Application of dimensionality reduction in recommender systems-a case study. In: ACM WebKDD Wk. shop (2000)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 391–407 (1990)
Zhang, Y., Callan, J., Minka, T.: Novelty and redundancy detection in adaptive filtering. In: Proc. ACM SIGIR 2002, pp. 81–88 (2002)
Gauch, S., Speretta, M., Chandramouli, A., Micarelli, A.: User Profiles for Personalized Information Access. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) Adaptive Web 2007. LNCS, vol. 4321, pp. 54–89. Springer, Heidelberg (2007)
Geisler, G., McArthur, D., Giersch, S.: Developing recommendation services for a digital library with uncertain and changing data. In: Proceedings of the 1st ACM/IEEE-CS. JCDL 2001, pp. 199–200. ACM Press, New York (2001)
Schafer, J.B., Frankowski, D., Herlocker, J., Sen, S.: Collaborative Filtering Recommender Systems. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) Adaptive Web 2007. LNCS, vol. 4321, pp. 291–324. Springer, Heidelberg (2007)
CiteSeer.IST Scientific Literature Digital Library, http://citeseer.ist.psu.edu/
Pazzani, M.J., Billsus, D.: Content-Based Recommendation Systems. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) Adaptive Web 2007. LNCS, vol. 4321, pp. 325–341. Springer, Heidelberg (2007)
Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. On Knowledge and Data Engineering 17(6), 734–749 (2005)
Breese, J.S., Heckerman, D., Kadie, C.: Empirical analysis of predictive algorithms for collaborative filtering. In: Proc. 14th Conf. Uncertainty in AI, July 1998, pp. 43–52 (1998)
Lakkaraju, P., Gauch, S., Speretta, M.: Document Similarity Based on Concept Tree Distance. In: 19th International Conference on Hypertext and Hypermedia (Hypertext 2008), Pittsburgh, PA, June 19-21 (to appear, 2008)
The ACM Computing Classification System, http://acm.org/class/1998/
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chandrasekaran, K., Gauch, S., Lakkaraju, P., Luong, H.P. (2008). Concept-Based Document Recommendations for CiteSeer Authors. In: Nejdl, W., Kay, J., Pu, P., Herder, E. (eds) Adaptive Hypermedia and Adaptive Web-Based Systems. AH 2008. Lecture Notes in Computer Science, vol 5149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70987-9_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-70987-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70984-8
Online ISBN: 978-3-540-70987-9
eBook Packages: Computer ScienceComputer Science (R0)