Skip to main content

Concept-Based Document Recommendations for CiteSeer Authors

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5149))

Abstract

The information explosion in today’s electronic world has created the need for information filtering techniques that help users filter out extraneous content to identify the right information they need to make important decisions. Recommender systems are one approach to this problem, based on presenting potential items of interest to a user rather than requiring the user to go looking for them. In this paper, we propose a recommender system that recommends research papers of potential interest to authors known to the CiteSeer database. For each author participating in the study, we create a user profile based on their previously published papers. Based on similarities between the user profile and profiles for documents in the collection, additional papers are recommended to the author. We introduce a novel way of representing the user profiles as trees of concepts and an algorithm for computing the similarity between the user profiles and document profiles using a tree-edit distance measure. Experiments with a group of volunteers show that our concept-based algorithm provides better recommendations than a traditional vector-space model based technique.

This research was supported in part by the National Science Foundation grant number 0454121: CRI: Collaborative: Next Generation CiteSeer.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bollacker, K., Lawrence, S., Giles, C.L.: Citeseer: an autonomous web agent for automatic retrieval and identification of interesting publications. In: Bollacker, K., Lawrence, S., Giles, C.L. (eds.) Agents 1998, 2nd International ACM Conference On Autonomous Agents, pp. 116–123. ACM Press, New York (1998)

    Chapter  Google Scholar 

  2. Torres, R., McNee, S.M., Abel, M., Konstan, J.A., Riedl, J.: Enhancing digital libraries with techlens. In: Digital Libraries. Proceedings of the Joint ACM/IEEE Conference, June 2004, pp. 228–236 (2004)

    Google Scholar 

  3. Salton, G., Buckley, C.: Term weighting approaches in Automatic Text Retrieval. Information Processing and Management 24(5), 513–523 (1988)

    Article  Google Scholar 

  4. Burke, R.: Hybrid recommender systems: survey and experiments. User Modeling and User-adapted Interaction 12(4), 331–370 (2002)

    Article  MATH  Google Scholar 

  5. Huang, Z., Chung, W., Ong, T.-H., Chen, H.: A graph-based recommender system for digital library. In: Proc. Joint Conf. on Digital Libraries, Portland, pp. 65–73 (2002)

    Google Scholar 

  6. Chen, H., Ng, T.: An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation): symbolic branch-and-bound search vs. connectionist hopfield net activation. Journal of the American Society for Information Science 46(5), 348–369 (1995)

    Article  Google Scholar 

  7. Balabanović, M., Shoham, Y.: Fab: content-based, collaborative recommendation. Communications of the ACM 40(3), 66–72 (1997)

    Article  Google Scholar 

  8. Basu, C., Hirsh, H., Cohen, W., Nevill-Manning, C.: Technical paper recommendation: a study in combining multiple information sources. Journal of Artificial Intelligence Research (14), 231–252 (2001)

    MATH  Google Scholar 

  9. Cohen, W.: The WHIRL approach to information integration. In: Hearst, M. (ed.) Trends and Controversies, IEEE Intelligent Systems, pp. 20–23 (October 1998)

    Google Scholar 

  10. Billsus, D., Pazzani, M.J., Chen, J.: A learning agent for wireless news access. In: Proc. of the International Conference on Intelligent User Interfaces, pp. 33–36 (2000)

    Google Scholar 

  11. Si, L., Jin, R.: Flexible mixture model for collaborative filtering. In: Proc. 20th Int’l Conf. Machine Learning, August 2003, pp. 704–711 (2003)

    Google Scholar 

  12. Hofmann, T., Puzicha, J.: Latent class models for collaborative filtering. In: The Proceedings of IJCAI, pp. 688–693 (1999)

    Google Scholar 

  13. Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39(1), 1–38 (1977)

    MATH  MathSciNet  Google Scholar 

  14. Hofmann, T., Puzicha, J.: Statistical models for co-occurrence data (Technical Report). Artificial Intelligence Laboratory Memo 1625, M.I.T (1998)

    Google Scholar 

  15. Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Application of dimensionality reduction in recommender systems-a case study. In: ACM WebKDD Wk. shop (2000)

    Google Scholar 

  16. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 391–407 (1990)

    Article  Google Scholar 

  17. Zhang, Y., Callan, J., Minka, T.: Novelty and redundancy detection in adaptive filtering. In: Proc. ACM SIGIR 2002, pp. 81–88 (2002)

    Google Scholar 

  18. http://en.wikipedia.org/wiki/Kullback-Leibler_divergence

  19. Gauch, S., Speretta, M., Chandramouli, A., Micarelli, A.: User Profiles for Personalized Information Access. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) Adaptive Web 2007. LNCS, vol. 4321, pp. 54–89. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  20. Geisler, G., McArthur, D., Giersch, S.: Developing recommendation services for a digital library with uncertain and changing data. In: Proceedings of the 1st ACM/IEEE-CS. JCDL 2001, pp. 199–200. ACM Press, New York (2001)

    Google Scholar 

  21. Schafer, J.B., Frankowski, D., Herlocker, J., Sen, S.: Collaborative Filtering Recommender Systems. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) Adaptive Web 2007. LNCS, vol. 4321, pp. 291–324. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  22. CiteSeer.IST Scientific Literature Digital Library, http://citeseer.ist.psu.edu/

  23. Pazzani, M.J., Billsus, D.: Content-Based Recommendation Systems. In: Brusilovsky, P., Kobsa, A., Nejdl, W. (eds.) Adaptive Web 2007. LNCS, vol. 4321, pp. 325–341. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  24. Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. On Knowledge and Data Engineering 17(6), 734–749 (2005)

    Article  Google Scholar 

  25. Breese, J.S., Heckerman, D., Kadie, C.: Empirical analysis of predictive algorithms for collaborative filtering. In: Proc. 14th Conf. Uncertainty in AI, July 1998, pp. 43–52 (1998)

    Google Scholar 

  26. Lakkaraju, P., Gauch, S., Speretta, M.: Document Similarity Based on Concept Tree Distance. In: 19th International Conference on Hypertext and Hypermedia (Hypertext 2008), Pittsburgh, PA, June 19-21 (to appear, 2008)

    Google Scholar 

  27. The ACM Computing Classification System, http://acm.org/class/1998/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Wolfgang Nejdl Judy Kay Pearl Pu Eelco Herder

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chandrasekaran, K., Gauch, S., Lakkaraju, P., Luong, H.P. (2008). Concept-Based Document Recommendations for CiteSeer Authors. In: Nejdl, W., Kay, J., Pu, P., Herder, E. (eds) Adaptive Hypermedia and Adaptive Web-Based Systems. AH 2008. Lecture Notes in Computer Science, vol 5149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70987-9_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-70987-9_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-70984-8

  • Online ISBN: 978-3-540-70987-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics