Skip to main content

Learning Semantic User Profiles from Text

  • Conference paper
Advanced Data Mining and Applications (ADMA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4093))

Included in the following conference series:

Abstract

This paper focuses on the problem of choosing a representation of documents that can be suitable to induce more advanced semantic user profiles, in which concepts are used instead of keywords to represent user interests. We propose a method which integrates a word sense disambiguation algorithm based on the WordNet IS-A hierarchy, with two machine learning techniques to induce semantic user profiles, namely a relevance feedback method and a probabilistic one. The document representation proposed, that we called Bag-Of-Synsets improves the classic Bag-Of-Words approach, as shown by an extensive experimental session.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Asnicar, F., Tasso, C.: ifweb: a prototype of user model-based intelligent agent for documentation filtering and navigation in the word wide web. In: Proc. of 1st Int. Workshop on adaptive systems and user modeling on the WWW (1997)

    Google Scholar 

  2. Balabanovic, M., Shoham, Y.: Fab: content-based, collaborative recommendation. Communications of the ACM 40(3), 66–72 (1997)

    Article  Google Scholar 

  3. Bloedhorn, S., Hotho, A.: Boosting for text classification with semantic features. In: Proc. of 10th ACM SIGKDD Intern. Conf. on Knowledge Discovery and Data Mining, Mining for and from the Semantic Web Workshop, pp. 70–87 (2004)

    Google Scholar 

  4. Degemmis, M., Lops, P., Ferilli, S., Mauro, N.D., Basile, T., Semeraro, G.: Learning user profiles from text in e-commerce. In: Li, X., Wang, S., Dong, Z.Y. (eds.) ADMA 2005. LNCS (LNAI), vol. 3584, pp. 370–381. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  5. Degemmis, M., Lops, P., Semeraro, G.: Wordnet-based word sense disambiguation for learning user profiles. In: Proc. of the 2nd European Web Mining Forum (2005)

    Google Scholar 

  6. Leacock, C., Chodorow, M.: Combining local context and WordNet similarity for word sense identification. In: Fellbaum, C. (ed.), pp. 305–332. MIT Press, Cambridge (1998)

    Google Scholar 

  7. Magnini, B., Strapparava, C.: Improving user modelling with content-based techniques. In: Bauer, M., Gmytrasiewicz, P.J., Vassileva, J. (eds.) UM 2001. LNCS (LNAI), vol. 2109, pp. 74–83. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  8. Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge (1984)

    Google Scholar 

  9. McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: Proceedings of the AAAI/ICML-98 Workshop on Learning for Text Categorization, pp. 41–48. AAAI Press, Menlo Park (1998)

    Google Scholar 

  10. Miller, G.A.: Wordnet: an on-line lexical database. International Journal of Lexicography 3(4), 235–244 (1990)

    Article  Google Scholar 

  11. Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)

    MATH  Google Scholar 

  12. Orkin, M., Drogin, R.: Vital Statistics. McGraw-Hill, New York (1990)

    Google Scholar 

  13. Pazzani, M., Billsus, D.: Learning and revising user profiles: The identification of interesting web sites. Machine Learning 27(3), 313–331 (1997)

    Article  Google Scholar 

  14. Rocchio, J.: Relevance feedback information retrieval. In: Salton, G. (ed.) The SMART retrieval system - experiments in automated document processing, pp. 313–323. Prentice-Hall, Englewood Cliffs (1971)

    Google Scholar 

  15. Scott, S., Matwin, S.: Text classification using wordnet hypernyms. In: COLING-ACL Workshop on usage of WordNet for in NLP Systems, pp. 45–51 (1998)

    Google Scholar 

  16. Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1) (2002)

    Google Scholar 

  17. Witten, I., Bell, T.: The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. IEEE Transactions on Information Theory 37(4) (1991)

    Google Scholar 

  18. Yao, Y.Y.: Measuring retrieval effectiveness based on user preference of documents. Journal of the American Society for Information Science 46(2), 133–145 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Degemmis, M., Lops, P., Semeraro, G. (2006). Learning Semantic User Profiles from Text. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science(), vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_73

Download citation

  • DOI: https://doi.org/10.1007/11811305_73

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-37025-3

  • Online ISBN: 978-3-540-37026-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics