Skip to main content

Textual Information Retrieval with User Profiles Using Fuzzy Clustering and Inferencing

  • Chapter
Intelligent Exploration of the Web

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 111))

Abstract

We present a fuzzy-logic based approach to construction and use of user profiles in web textual information retrieval. A classical user profile is a collection of terms extracted from the set of documents for a specific user or a group of users. We use a fuzzy representation for user profiles where each term in a profile is associated with a fuzzy membership value. The construction of user profiles is performed by a combination of fuzzy clustering and fuzzy inferencing, a new approach developed recently. We apply fuzzy clustering methods (such as fuzzy cmeans and fuzzy hierarchical clustering) to cluster documents relevant to a user. From the cluster centers (prototypes), a user profile is constructed which indicates the user’s general preference of various terms. Fuzzy logic rules are also extracted from the cluster centers or from the user profiles. The fuzzy rules specify the semantic correlation among query terms. The user profiles and the fuzzy rules are subsequently used to expand user queries for better retrieval performance. Additional nontopical information about the user can be added to personalize the retrieval process. Moreover, fuzzy clustering can be applied to profiles of many users to extract knowledge about different user groups. The extracted knowledge is potentially useful for personalized marketing on the web.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berzal F., Larsen H.L., Martin-Bautista M.J., Vila M.A., (2001), Computer with Words in Information Retrieval, Proc. of IFSA/NAFIPS International Conference, Vancouver, Canada, July 2001.

    Google Scholar 

  2. Bezdek J.C., (1980), A convergence theorem for the fuzzy ISODATA clustering algorithms, IEEE Transactions on Pattern Analysis and Machine Intelligence (2), 1980, pp. 1–8.

    Google Scholar 

  3. Chan P.K., (1999), Constructing Web User Profiles: A Non-invasive Learning Approach, International WEBKDD’99 Workshop, San Diego, CA, USA, pp. 39–55, Aug. 1999.

    Google Scholar 

  4. Chen J., Kundu S., (1996), A sound and complete fuzzy logic system using Zadeh’s implication operator, Foundations of Intelligent Systems: Lecture Notes in Computer Science 1079, 1996, pp. 233–242.

    Article  Google Scholar 

  5. Delgado M., Gomez-Skarmeta A.F., Vila M.A., (1996), On the Use of Hierarchical Clustering in Fuzzy Modeling, International Journal of Approximate Reasoning, 14, pp. 237–257, 1996.

    Article  MATH  Google Scholar 

  6. Fu Y., Sandhu K., Shih M-Y., (1999), A Generalization-Based Approach to Clustering of Web Usage Sessions, International WEBKDD’99 Workshop, San Diego, CA, USA, pp. 21–38, Aug. 1999.

    Google Scholar 

  7. Gomez-Skarmeta A.F., Delgado M., Vila M.A., (1999), About the Use of Fuzzy Clustering Techniques for Fuzzy Model Identification, Fuzzy Sets and Systems 106: pp. 194–216, 1999.

    Article  Google Scholar 

  8. Korfhage, R.R., (1997), Information Storage and Retrieval, New York: NY: John Wiley & Sons, 1997.

    Google Scholar 

  9. Kraft D.H. and Buell D.A., (1983), Fuzzy Sets and Generalized Boolean Retrieval Systems, International Journal of Man-Machine Studies, v. 19, 1983, pp. 45–56; reprinted in D. Dubois, H. Prade, and R. Yager, (Eds), Readings in Fuzzy Sets for Intelligent Systems, San Mateo, CA: Morgan Kaufmann Publishers, 1992.

    Google Scholar 

  10. Kraft D.H., Bordogna G., Pasi G., (1999), Fuzzy Set Techniques in Information Retrieval, In D. Dubois, H. Prade (Eds.), Handbook of Fuzzy Sets (Vol. 3): Approximate Reasoning and Information Systems. Kluwer Academic Publishers, The Neitherlands, pp. 469–510, 1999.

    Google Scholar 

  11. Kraft D.H., Chen J., (2000), Integrating and Extending Fuzzy clustering and inferencing to improve text retrieval performance, in Flexible Query Answering Systems: Recent Advances, Proceedings of the 4th International Conference on Flexible Query Answering Systems, Oct. 2000, Warsaw, Poland, Heidelberg, Germany: Physica-Verlag, pp. 386–395.

    Google Scholar 

  12. Martin-Bautists M.J., Vila M.A., Kraft D.H., Chen J., (2001), User Profiles in Web Retrieval, FLINT’2001, July 2001.

    Google Scholar 

  13. Martin-Bautista M.J., Vila M.A., Sanchez D., Larsen H.L., (2001), Intelligent filtering with genetic algorithms and fuzzy logic. In B. Bouchon-Meunier, J. Gutierrez-Rios, L. Magdalena, R.R. Yager (eds.) Technologies for Constructing Intelligent Systems. Springer-Verlag, 2001 (in press).

    Google Scholar 

  14. Martin-Bautista M.J., Vila M.A., Larsen H.L., (2000), Building adaptive user profiles by a genetic fuzzy classifier with feature selection. Proceedings of the IEEE Conference on Fuzzy Systems vol.1, pp. 308–312, San Antonio, Texas, 2000.

    Google Scholar 

  15. Martin-Bautista M.J., Vila M.A., Larsen H.L., (1999), A Fuzzy Genetic Algorithm Approach to An Adaptive Information Retrieval Agent, Journal of the American Society for Information Science, 50 (9), pp. 760–771, 1999.

    Article  Google Scholar 

  16. Masand B., Spiliopoulou M., (Eds.), (1999), Web Usage Analysis and User Profiling, International WEBKDD’99 Workshop, San Diego, CA, USA, Aug. 1999.

    Google Scholar 

  17. Nasraoui O., Frigui H., Krishnapuram R., Joshi A., (2000), Extracting Web User Profiles Using Relational Competitive Fuzzy Clustering, International Journal on Artificial Intelligence Tools, 9 (4), pp. 509–526, 2000.

    Article  Google Scholar 

  18. Pazzani M., Billsus D., (1997), Learning and revising User profiles: The identification of Interesting Web Sites, Machine Learning 27, pp. 313–331, 1997.

    Article  Google Scholar 

  19. Salton G., (1989), Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer, Reading, MA, Addison Wesley, 1989.

    Google Scholar 

  20. Srinivasan P., Ruiz M.E., Kraft D.H., Chen J., (2001), Vocabulary Mining for Information Retrieval: Rough Sets and Fuzzy Sets, Information Processing and Management, 37, pp. 15–38, 2001.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kraft, D.H., Chen, J., Martin-Bautista, M.J., Vila, MA. (2003). Textual Information Retrieval with User Profiles Using Fuzzy Clustering and Inferencing. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-7908-1772-0_10

  • Publisher Name: Physica, Heidelberg

  • Print ISBN: 978-3-7908-2519-0

  • Online ISBN: 978-3-7908-1772-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics