Abstract
We consider the problem of categorization of the textual documents which is relevant and challenging both from the point of view of theory and applications. We assume a perspective (cf. Zadrożny and Nowacka [28]) that the problem is seen as a sort of the basic information retrieval task, that is, of finding documents relevant to a given query. Specifically, we employ here some extension of a fuzzy logic based information retrieval model due to Nowacka, Kacprzyk and Zadrożny [21] in which the representation of documents and queries is based on Zadeh’s linguistic statements of the type X IS A and their matching is computed by pairs of the necessity and possibility measures. We show the use of interval valued fuzzy sets to implement the new method proposed. Moreover, these new concepts are proposed as tools to adapt an inductive learning method of Koriche and Quinqueton [15] for the purposes of text categorization in the case of imprecise (fuzzy) information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R., Ribeiro-Neto, B.: Modern information retrieval. ACM Press and Addison Wesley (1999)
Bieniek, K., Gola, M., Kacprzyk, J., Zadrożny, S.: An Approach to Use Possibility Theory in Information Retrieval. In: Proceedings of the 12th Zittau East-West Fuzzy Colloquium, Zittau, Germany, pp. 1–5 (2005)
Bordogna, G., Carrara, P., Pasi, G.: Fuzzy approaches to extend Boolean information retrieval. In: Bosc, P., Kacprzyk, J. (eds.) Fuzziness in Database Management Systems, pp. 231–274. Physica Verlag, Heidelberg (1995)
Cornelis, C., Deschrijver, G., Kerre, E.E.: Implication in intuitionistic fuzzy and interval-valued fuzzy set theory: construction, classification, application. International Journal of Approximate Reasoning 35, 55–95 (2004)
Deschrijver, G.: Arithmetic operators in interval-valued fuzzy set theory. Information Sciences 177, 2906–2924 (2007)
Deschrijver, G., Kerre, E.: On the relationship between some extensions of fuzzy set theory. Fuzzy Sets and Systems 133, 227–235 (2004)
Dubois, D., Prade, H.: Fuzzy Sets in Approximate Reasoning, part 1: Inference with possibility distributions. Fuzzy Sets and Systems 40, 143–202 (1991)
Ishibuchi, H., Tanaka, H.: Multiobjective Programming in Optimisation of the Interval Objective Function. European Journal of Operational Research 48, 219–225 (1990)
Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998)
Kacprzyk, J., Iwański, C.: Fuzzy logic with linguistic quantifiers in inductive learning. In: Zadeh, L.A., Kacprzyk, J. (eds.) Fuzzy Logic for the Management of Uncertainty, pp. 465–478. Wiley, New York (1992)
Kacprzyk, J., Szkatuła, G.: An algorithm for learning from erroneous and incorrigible examples. International Journal of Intelligent Systems 11, 565–582 (1996)
Kacprzyk, J., Szkatuła, G.: An inductive learning algorithm with a preanalysis of data. International Journal of Knowledge-Based Intelligent Engineering Systems 3, 135–146 (1999)
Kacprzyk, J., Nowacka, K., Zadrożny, S.: A Possibilistic-logic-based Information Retrieval Model with Various Term-weighting Approaches. In: Rutkowski, L., Tadeusiewicz, R., Zadeh, L.A., Żurada, J.M. (eds.) ICAISC 2006. LNCS (LNAI), vol. 4029, pp. 1120–1129. Springer, Heidelberg (2006)
Kacprzyk, J., Zadrożny, S., Nowacka, K.: An Experimental Comparison of Various Aggregation Operators in a Fuzzy Information Retrieval Model. In: North American Fuzzy Information Processing Society Annual Conference (NAFIPS 2008), New York, USA (2008) (submitted)
Koriche, F., Quinqueton, J.: Robust -DNF Learning via Inductive Belief Merging. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) ECML 2003. LNCS (LNAI), vol. 2837, pp. 229–240. Springer, Heidelberg (2003)
Koronacki, J., Raś, Z., Wierzchoń, S.T., Kacprzyk, J. (eds.): Advances in Machine Learning I. Springer, Heidelberg (2010)
Koronacki, J., Raś, Z., Wierzchoń, S.T., Kacprzyk, J. (eds.): Advances in Machine Learning II. Springer, Heidelberg (2010)
Lewis, D.D.: Reuters-21578, Distribution 1.0., http://www.daviddlewis.com/resources/testcollections/reuters21578
Mitchell, T.M.: Generalization as Search. Artificial Intelligence 18, 203–226 (1982)
Neumaier, A.: Clouds, fuzzy sets and probability intervals. Reliable Computing 10, 249–272 (2004)
Nowacka, K., Zadrożny, S., Kacprzyk, J.: A New Fuzzy Logic Based Information Retrieval Model. In: 12th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2008), Malaga, Spain (2008) (submitted)
Porter, M.F.: An Algorithm for Suffix Stripping. Program 14(3), 130–137 (1980)
Salton, G., Buckley, C.: Term Weighting Approaches in Automatic Text Retrieval. Information Processing and Management 24, 513–523 (1988)
Sebastiani, F.: A Tutorial on Automated Text Categorisation. In: Amandi, A., Zunino, A. (eds.) Proceedings of ASAI 1999, 1st Argentinian Symposium on Artificial Intelligence, Buenos Aires, Argentina, pp. 7–35 (1999)
Zadeh, L.A.: The Concept of a Linguistic Variable and its Application to Approximate Reasoning (Part I-III). Information Sciences 8(8,9), 199–249, 301–357, 43–80 (1975)
Zadeh, L.A.: Fuzzy Sets as a Basis for a Theory of Possibility. Fuzzy Sets and Systems 1, 3–28 (1978)
Zadrożny, S., Kacprzyk, J.: Computing with Words for Text Processing: An Approach to the Text Categorization. Information Sciences 176(4), 415–437 (2006)
Zadrożny, S., Nowacka, K.: Fuzzy Information Retrieval Model Revisited. Fuzzy Sets and Systems 160(15), 2173–2191 (2008)
Zadrożny, S., Kacprzyk, J.: An Extended Fuzzy Boolean Model of Information Retrieval Revisited. In: The 14th Annual IEEE International Conference on FUZZY Systems, Reno, USA, pp. 1020–1025 (2005)
Zadrożny, S., Nowacka, K., Kacprzyk, J.: A Concept of a Possibilistic Logic Based Information Retrieval Model. In: 11th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2006), Paris, France, pp. 992–999 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Zadrożny, S., Kacprzyk, J., Nowacka, K. (2010). Using Fuzzy and Interval-Valued Fuzzy Sets in Automatic Text Categorization Based on a Fuzzy Information Retrieval Model. In: Cornelis, C., Deschrijver, G., Nachtegael, M., Schockaert, S., Shi, Y. (eds) 35 Years of Fuzzy Set Theory. Studies in Fuzziness and Soft Computing, vol 261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16629-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-16629-7_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16628-0
Online ISBN: 978-3-642-16629-7
eBook Packages: EngineeringEngineering (R0)