Content-based collaborative information filtering: Actively learning to classify and recommend documents

  • Conference paper
Cooperative Information Agents II Learning, Mobility and Electronic Commerce for Information Discovery on the Internet (CIA 1998)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1435)

Abstract

The next generation of intelligent information systems will rely on cooperative agents that play a fundamental role in actively searching for and finding relevant information on behalf of their users in complex, open environments such as the Internet. Relevance, however, can be defined only for a specific user and in the context of a particular domain or topic. On the other hand, shared “social” information can be used to improve the retrieval of relevant information and to refine each agent's particular knowledge. In this paper, we combine both approaches by developing a new content-based filtering technique for learning up-to-date user profiles, which serves as the basis for a novel collaborative information-filtering algorithm. We demonstrate our approach through a system called RAAP (Research Assistant Agent Project), which supports collaborative research by classifying domain-specific information retrieved from the Web and recommending these “bookmarks” to other researchers with similar research interests.

Editor information

Matthias Klusch, Gerhard Weiß

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Joaquin, D., Naohiro, I., Tomoki, U. (1998). Content-based collaborative information filtering: Actively learning to classify and recommend documents. In: Klusch, M., Weiß, G. (eds) Cooperative Information Agents II Learning, Mobility and Electronic Commerce for Information Discovery on the Internet. CIA 1998. Lecture Notes in Computer Science, vol 1435. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0053686

  • DOI: https://doi.org/10.1007/BFb0053686

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64676-1

  • Online ISBN: 978-3-540-69109-9

  • eBook Packages: Springer Book Archive
