Summary
Recent research on fuzzy quantification for information retrieval has proposed applying semi-fuzzy quantifiers to improve query languages. Fuzzy quantified sentences are useful because they allow additional restrictions to be imposed on the retrieval process, unlike more popular retrieval approaches, which lack the means to express information needs accurately. For instance, fuzzy quantification supplies a variety of methods for combining query terms, whereas extended Boolean models can only connect query terms through extended Boolean-like operators. Although recent works have reported some experiments validating these advantages, a comparison against state-of-the-art techniques has not yet been carried out. In this work we provide empirical evidence on the adequacy of fuzzy quantifiers for enhancing information retrieval systems. We show that our fuzzy approach is competitive with models such as the vector-space model with pivoted document-length normalization, which is at the heart of some high-performance web search systems. These empirical results strengthen previous theoretical works that suggested fuzzy quantification as an appropriate technique for modeling information needs. In this respect, we also demonstrate the connection between the retrieval framework based on the concept of semi-fuzzy quantifier and the seminal proposals for modeling linguistic statements through Ordered Weighted Averaging (OWA) operators.
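To make the OWA idea mentioned in the summary concrete, the following is a minimal sketch of a quantifier-guided OWA aggregation of query-term weights, assuming Yager's standard construction of OWA weights from a non-decreasing quantifier Q with Q(0) = 0 and Q(1) = 1. It is not the chapter's retrieval model: the function names, the `at_least_half` quantifier, and the sample term weights are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Sequence


def owa_weights(quantifier: Callable[[float], float], n: int) -> List[float]:
    """Derive n OWA weights from a quantifier Q on [0, 1].

    Uses Yager's construction w_i = Q(i/n) - Q((i-1)/n) for i = 1..n.
    Assumes Q is non-decreasing with Q(0) = 0 and Q(1) = 1, so the
    weights are non-negative and sum to 1.
    """
    return [quantifier(i / n) - quantifier((i - 1) / n) for i in range(1, n + 1)]


def owa(values: Sequence[float], weights: Sequence[float]) -> float:
    """Ordered weighted average: weights apply to values sorted descending."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    ordered = sorted(values, reverse=True)
    return sum(w * v for w, v in zip(weights, ordered))


def at_least_half(x: float) -> float:
    """Hypothetical quantifier: fully satisfied once half the terms match."""
    return min(1.0, 2.0 * x)


def identity(x: float) -> float:
    """Identity quantifier: the OWA reduces to the arithmetic mean."""
    return x


# Hypothetical weights of four query terms in one document.
term_weights = [0.0, 0.5, 1.0, 0.25]

score_half = owa(term_weights, owa_weights(at_least_half, len(term_weights)))
score_mean = owa(term_weights, owa_weights(identity, len(term_weights)))
```

With these inputs, `at_least_half` puts all the weight on the two best-matching terms, while the identity quantifier weighs every term equally. Changing the quantifier therefore changes how strictly the query terms must be satisfied together, which is the kind of flexibility in combining query terms that the summary refers to.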
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Losada, D.E., Díaz-Hermida, F., Bugarín, A. (2006). Semi-fuzzy Quantifiers for Information Retrieval. In: Herrera-Viedma, E., Pasi, G., Crestani, F. (eds) Soft Computing in Web Information Retrieval. Studies in Fuzziness and Soft Computing, vol 197. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31590-X_10
DOI: https://doi.org/10.1007/3-540-31590-X_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31588-9
Online ISBN: 978-3-540-31590-2
eBook Packages: Engineering, Engineering (R0)