Abstract
This paper presents a practical evaluation of a document retrieval method based on a certain textual fuzzy similarity measure. The similarity measure was originally introduced in [1] — cf. also [2], and later used in Internet-related applications [3,4]. Three textual databases of diverse level of freedom in the content of documents are used for experiments in the search. In other words, the relation of the documents within each group to the chosen topic is (according to the evaluating person) strong, average, and random. The results of the search coincide with intuition and confirm the expectation that methods based on similarity are advantageous as long as the database contains documents of a relatively well-defined topic.
Keywords
This work has partly been supported by the NATO Scientific Committee via the Spanish Ministry for Science and Technology; grantholder — P.S.Szczepaniak; host institution — Politechnical University, Madrid, Spain, 2002/2003.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Niewiadomski A. (2000): Appliance of fuzzy relations for text document comparing. Proceedings of the 5th Conference NNSC, Zakopane, Poland, pp. 347–352.
Niewiadomski A., Szczepaniak P.S. (2001): Intutionistic Fuzzy Relations in Approximate Text Comparison. Published in Polish: Intuicjonistyczne relacje rozmyte w przybliżonym porównywaniu tekstów. In: Chojcan J. Łeski J. (Eds.): Zbiory rozmyte i ich zastosowania. Silesian Technical University Press, Gliwice, Poland, pp. 271–282; ISBN 83-88000-64-0.
Niewiadomski A., Szczepaniak P.S., (2002). Fuzzy Similarity in E-Commerce Domains. In: Segovia J., Szczepaniak P.S., Niedzwiedzinski M. (Eds.) E-Commerce and Intelligent Methods. Physica-Verlag, A Springer-Verlag Company, Heidelberg, New York.
Szczepaniak P.S., Niewiadomski A. (2003). Internet Search Based on Text Intuitionistic Fuzzy Similarity. In: Szczepaniak P.S., Segovia J., Kacprzyk J., Zadeh L. (Eds.) Intelligent Exploration of the Web. Physica-Verlag, A Springer-Verlag Company, Heidelberg, New York.
Lebart L., Salem A., Berry L. (1998). Exploring Textual Data. Kluwer Academic Publisher.
Baeza-Yates R., Ribeiro-Neto B. (1999). Modern Information Retrieval. Addison Wesley, New York.
Ho T.B., Kawasaki S., Nguyen N.B. (2003). Documents Clustering using Tolerance Rough Set Model and Its Application to Information Retrieval. In: Szczepaniak P.S., Segovia J., Kacprzyk J., Zadeh L. (Eds.) Intelligent Exploration of the Web. Physica-Verlag, A Springer-Verlag Company, Heidelberg, New York.
Zadeh L. (1965). Fuzzy Sets. Information and Control, 8, pp. 338–353.
Pedrycz W., Gomide F. (1998): An Introduction to Fuzzy Sets; Analysis and Design. A Bradford Book, The MIT Press, Cambridge, Massachusetts and London, England.
SleepyCat Software, Inc. BerkeleyDB Documentation; http://www.sleepycat.com
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Szczepaniak, P.S., Gil, M. (2003). Practical Evaluation of Textual Fuzzy Similarity as a Tool for Information Retrieval. In: Menasalvas, E., Segovia, J., Szczepaniak, P.S. (eds) Advances in Web Intelligence. AWIC 2003. Lecture Notes in Computer Science, vol 2663. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44831-4_26
Download citation
DOI: https://doi.org/10.1007/3-540-44831-4_26
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40124-7
Online ISBN: 978-3-540-44831-0
eBook Packages: Springer Book Archive