Semi-fuzzy Quantifiers for Information Retrieval

  • Chapter
Soft Computing in Web Information Retrieval

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 197))

Summary

Recent research on fuzzy quantification for information retrieval has proposed applying semi-fuzzy quantifiers to improve query languages. Fuzzy quantified sentences are useful because they allow additional restrictions to be imposed on the retrieval process, unlike more popular retrieval approaches, which lack the means to express information needs accurately. For instance, fuzzy quantification supplies a variety of methods for combining query terms, whereas extended Boolean models can only connect query terms through Boolean-like operators. Although recent works have reported some experiments validating these advantages, a comparison against state-of-the-art techniques has not yet been carried out. In this work, we provide empirical evidence on the adequacy of fuzzy quantifiers for enhancing information retrieval systems. We show that our fuzzy approach is competitive with models such as the vector-space model with pivoted document-length normalization, which is at the heart of some high-performance web search systems. These empirical results strengthen previous theoretical works that suggested fuzzy quantification as an appropriate technique for modeling information needs. In this respect, we also demonstrate the connection between the retrieval framework based on the concept of semi-fuzzy quantifier and the seminal proposals for modeling linguistic statements through Ordered Weighted Averaging (OWA) operators.
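
As a rough illustration of the OWA side of that connection, the sketch below aggregates per-term matching scores with an OWA operator whose weights are derived from a relative quantifier Q via the standard quantifier-guided rule w_i = Q(i/n) - Q((i-1)/n). It is a minimal sketch under stated assumptions, not the chapter's retrieval model: the function names, the "at least half" quantifier, and the example scores are all hypothetical.

```python
from typing import Callable, List, Sequence


def owa_weights(quantifier: Callable[[float], float], n: int) -> List[float]:
    """Quantifier-guided OWA weights: w_i = Q(i/n) - Q((i-1)/n), i = 1..n."""
    return [quantifier(i / n) - quantifier((i - 1) / n) for i in range(1, n + 1)]


def owa(values: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum of the values after sorting them in descending order."""
    ordered = sorted(values, reverse=True)
    return sum(w * v for w, v in zip(weights, ordered))


# Hypothetical relative quantifiers (assumptions made for this illustration).
def exists(x: float) -> float:
    return 1.0 if x > 0 else 0.0      # all weight on the largest score


def for_all(x: float) -> float:
    return 1.0 if x >= 1 else 0.0     # all weight on the smallest score


def at_least_half(x: float) -> float:
    return min(1.0, 2.0 * x)          # saturates once half the terms match


# Hypothetical degrees to which four query terms match one document.
term_scores = [0.9, 0.1, 0.6, 0.0]
n = len(term_scores)

score_any = owa(term_scores, owa_weights(exists, n))          # behaves like max
score_all = owa(term_scores, owa_weights(for_all, n))         # behaves like min
score_half = owa(term_scores, owa_weights(at_least_half, n))  # in between
```

With these quantifiers the operator ranges from an OR-like aggregation (the maximum score) to an AND-like one (the minimum score), which is one way to read the claim that quantifiers offer more ways of combining query terms than Boolean-like connectives.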

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Losada, D.E., Díaz-Hermida, F., Bugarín, A. (2006). Semi-fuzzy Quantifiers for Information Retrieval. In: Herrera-Viedma, E., Pasi, G., Crestani, F. (eds) Soft Computing in Web Information Retrieval. Studies in Fuzziness and Soft Computing, vol 197. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31590-X_10

  • DOI: https://doi.org/10.1007/3-540-31590-X_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31588-9

  • Online ISBN: 978-3-540-31590-2

  • eBook Packages: Engineering, Engineering (R0)
