Abstract
We present a fuzzy-logic based approach to construction and use of user profiles in web textual information retrieval. A classical user profile is a collection of terms extracted from the set of documents for a specific user or a group of users. We use a fuzzy representation for user profiles where each term in a profile is associated with a fuzzy membership value. The construction of user profiles is performed by a combination of fuzzy clustering and fuzzy inferencing, a new approach developed recently. We apply fuzzy clustering methods (such as fuzzy cmeans and fuzzy hierarchical clustering) to cluster documents relevant to a user. From the cluster centers (prototypes), a user profile is constructed which indicates the user’s general preference of various terms. Fuzzy logic rules are also extracted from the cluster centers or from the user profiles. The fuzzy rules specify the semantic correlation among query terms. The user profiles and the fuzzy rules are subsequently used to expand user queries for better retrieval performance. Additional nontopical information about the user can be added to personalize the retrieval process. Moreover, fuzzy clustering can be applied to profiles of many users to extract knowledge about different user groups. The extracted knowledge is potentially useful for personalized marketing on the web.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Berzal F., Larsen H.L., Martin-Bautista M.J., Vila M.A., (2001), Computer with Words in Information Retrieval, Proc. of IFSA/NAFIPS International Conference, Vancouver, Canada, July 2001.
Bezdek J.C., (1980), A convergence theorem for the fuzzy ISODATA clustering algorithms, IEEE Transactions on Pattern Analysis and Machine Intelligence (2), 1980, pp. 1–8.
Chan P.K., (1999), Constructing Web User Profiles: A Non-invasive Learning Approach, International WEBKDD’99 Workshop, San Diego, CA, USA, pp. 39–55, Aug. 1999.
Chen J., Kundu S., (1996), A sound and complete fuzzy logic system using Zadeh’s implication operator, Foundations of Intelligent Systems: Lecture Notes in Computer Science 1079, 1996, pp. 233–242.
Delgado M., Gomez-Skarmeta A.F., Vila M.A., (1996), On the Use of Hierarchical Clustering in Fuzzy Modeling, International Journal of Approximate Reasoning, 14, pp. 237–257, 1996.
Fu Y., Sandhu K., Shih M-Y., (1999), A Generalization-Based Approach to Clustering of Web Usage Sessions, International WEBKDD’99 Workshop, San Diego, CA, USA, pp. 21–38, Aug. 1999.
Gomez-Skarmeta A.F., Delgado M., Vila M.A., (1999), About the Use of Fuzzy Clustering Techniques for Fuzzy Model Identification, Fuzzy Sets and Systems 106: pp. 194–216, 1999.
Korfhage, R.R., (1997), Information Storage and Retrieval, New York: NY: John Wiley & Sons, 1997.
Kraft D.H. and Buell D.A., (1983), Fuzzy Sets and Generalized Boolean Retrieval Systems, International Journal of Man-Machine Studies, v. 19, 1983, pp. 45–56; reprinted in D. Dubois, H. Prade, and R. Yager, (Eds), Readings in Fuzzy Sets for Intelligent Systems, San Mateo, CA: Morgan Kaufmann Publishers, 1992.
Kraft D.H., Bordogna G., Pasi G., (1999), Fuzzy Set Techniques in Information Retrieval, In D. Dubois, H. Prade (Eds.), Handbook of Fuzzy Sets (Vol. 3): Approximate Reasoning and Information Systems. Kluwer Academic Publishers, The Neitherlands, pp. 469–510, 1999.
Kraft D.H., Chen J., (2000), Integrating and Extending Fuzzy clustering and inferencing to improve text retrieval performance, in Flexible Query Answering Systems: Recent Advances, Proceedings of the 4th International Conference on Flexible Query Answering Systems, Oct. 2000, Warsaw, Poland, Heidelberg, Germany: Physica-Verlag, pp. 386–395.
Martin-Bautists M.J., Vila M.A., Kraft D.H., Chen J., (2001), User Profiles in Web Retrieval, FLINT’2001, July 2001.
Martin-Bautista M.J., Vila M.A., Sanchez D., Larsen H.L., (2001), Intelligent filtering with genetic algorithms and fuzzy logic. In B. Bouchon-Meunier, J. Gutierrez-Rios, L. Magdalena, R.R. Yager (eds.) Technologies for Constructing Intelligent Systems. Springer-Verlag, 2001 (in press).
Martin-Bautista M.J., Vila M.A., Larsen H.L., (2000), Building adaptive user profiles by a genetic fuzzy classifier with feature selection. Proceedings of the IEEE Conference on Fuzzy Systems vol.1, pp. 308–312, San Antonio, Texas, 2000.
Martin-Bautista M.J., Vila M.A., Larsen H.L., (1999), A Fuzzy Genetic Algorithm Approach to An Adaptive Information Retrieval Agent, Journal of the American Society for Information Science, 50 (9), pp. 760–771, 1999.
Masand B., Spiliopoulou M., (Eds.), (1999), Web Usage Analysis and User Profiling, International WEBKDD’99 Workshop, San Diego, CA, USA, Aug. 1999.
Nasraoui O., Frigui H., Krishnapuram R., Joshi A., (2000), Extracting Web User Profiles Using Relational Competitive Fuzzy Clustering, International Journal on Artificial Intelligence Tools, 9 (4), pp. 509–526, 2000.
Pazzani M., Billsus D., (1997), Learning and revising User profiles: The identification of Interesting Web Sites, Machine Learning 27, pp. 313–331, 1997.
Salton G., (1989), Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer, Reading, MA, Addison Wesley, 1989.
Srinivasan P., Ruiz M.E., Kraft D.H., Chen J., (2001), Vocabulary Mining for Information Retrieval: Rough Sets and Fuzzy Sets, Information Processing and Management, 37, pp. 15–38, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kraft, D.H., Chen, J., Martin-Bautista, M.J., Vila, MA. (2003). Textual Information Retrieval with User Profiles Using Fuzzy Clustering and Inferencing. In: Szczepaniak, P.S., Segovia, J., Kacprzyk, J., Zadeh, L.A. (eds) Intelligent Exploration of the Web. Studies in Fuzziness and Soft Computing, vol 111. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1772-0_10
Download citation
DOI: https://doi.org/10.1007/978-3-7908-1772-0_10
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2519-0
Online ISBN: 978-3-7908-1772-0
eBook Packages: Springer Book Archive