Skip to main content
Log in

A many-sorted theory proposal for information retrieval: axiomatization and semantics

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Logic-based models have been already proposed for information retrieval purpose. However, there is a need for new formalisms providing more generic frameworks. For this purpose, an information retrieval axiomatic theory is proposed in this paper, independently of any model. Our proposal which mainly relies on many-sorted logic allows to consider various sets in the domain of discourse that provides us a rich framework to model the different items such as documents, index terms, queries. The theory relies on a sound set of axioms driving the retrieval process as proof of theorems. As such the genericity consists of a main motivation; it will be proved that three classical information retrieval models, namely the Boolean model; the fuzzy-set-based extension of the Boolean model; and the vector space model, satisfy the proposed theory, establishing then its consistency. Beyond the genericity, the proposed approach may face concrete problems. Indeed, it is well known that the use of the classical settings of formal concept analysis theory for information retrieval does not allow disjunctions and negations in queries. For this purpose, this paper gives a characterization of these queries forms using appropriates theorems of the theory. Useful algebraic properties (i.e., isomorphisms) are then established for this end.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. Abdulahhad K (2014) Information retrieval (IR) modeling by logic and lattice. Application to conceptual IR. Thesis, Université de Grenoble

  2. Bookstein A (1980) Fuzzy requests: an approach to weighted boolean searches. J Am Soc Inf Sci 31(4):240–247

    Article  MathSciNet  Google Scholar 

  3. Bordogna G, Carrara P, Pasi G (1991) Query term weights as constraints in fuzzy information retrieval. Inf Process Manag 27(1):15–26

    Article  Google Scholar 

  4. Bruza PD, Huibers TWC (1994) Investigating aboutness axioms using information fields. In: Croft BW, van Rijsbergen CJ (ed) SIGIR ’94: proceedings of the seventeenth annual international ACM-SIGIR conference on research and development in information retrieval, organised by Dublin City University. Springer London, pp 112–121

  5. Carpineto C, Romano G (1998) Effective reformulation of boolean queries with concept lattices. In: Andreasen T, Christiansen H, Larsen HL (ed) Flexible query answering systems: third international conference, FQAS’98 Roskilde, Denmark, May 13–15, 1998 Proceedings. Springer, Berlin, Heidelberg, pp 83–94

  6. Carpineto C, Romano G (2004) Exploiting the potential of concept lattices for information retrieval with credo. J Univers Comput Sci 10(8):985–1013

    MATH  Google Scholar 

  7. Carpineto C, Romano G (2005) Using concept lattices for text retrieval and mining, In: Ganter B, Stumme G, Wille R (ed) Formal concept analysis, foundations and applications, vol 3626 of lecture notes in computer science. Springer, Berlin, pp 161–179

  8. Djouadi Y (2011) Extended galois derivation operators for information retrieval based on fuzzy formal concept lattice, In: Benferhat S, Grant J (ed) Scalable uncertainty management: 5th international conference, SUM 2011, Dayton, OH, USA, October 10–13, 2011, proceedings. Springer, Berlin, Heidelberg, pp 346–358

  9. Djouadi Y (2012) Généralisation des opérateurs de dérivation de galois en recherche d’information basée sur l’analyse formelle de concepts. In: Beigbeder M, Eglin V, Ragot N, Géry M (ed) CORIA, pp 373–386

  10. Djouadi Y, Prade H (2011) Possibility-theoretic extension of derivation operators in formal concept analysis over fuzzy lattices. Fuzzy Optim Decis Mak 10(4):287–309

    Article  MathSciNet  MATH  Google Scholar 

  11. Dubois D, de Saint Dupin, Cyr Bannay F, Prade H (2007) A possibility-theoretic view of formal concept analysis. Fundam Inf 75(1–4):195–213

    MathSciNet  MATH  Google Scholar 

  12. Dubois D, Prade H (2009) Possibility theory and formal concept analysis in information systems. In: Carvalho J, Dubois D, Kaymak U, Sousa J (ed) International fuzzy systems association world congress and conference of the European society for fuzzy logic and technology (IFSA-EUSFLAT). Lisbonne (Portugal), 20/07/09-24/07/09, pp 1021–1026

  13. Ducrou J, Eklund PW (2007) Searchsleuth: the conceptual neighbourhood of an web query. In: Eklund PW, Diatta J, Liquiere M (ed) Proceedings of the fifth international conference on concept lattices and their applications. Montpellier, France

  14. Düntsch I, Gediga G (2003) Approximation operators in qualitative data analysis. In: de Swart H, Orłowska E, Schmidt G, Roubens M (ed) Theory and applications of relational structures as knowledge instruments: COST Action 274, TARSKI. Revised Papers. Springer, Berlin, Heidelberg, pp 214–230

  15. Fuhr N (1995) Probabilistic datalog—a logic for powerful retrieval methods. In: Proceedings of the 18th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’95. ACM, New York, NY, USA, pp 282–290

  16. Ganter B, Wille R (1999) Formal concept analysis: mathematical foundations. Springer, Berlin

    Book  MATH  Google Scholar 

  17. Godin R, Missaoui R, April A (1993) Experimental comparison of navigation in a galois lattice with conventional information retrieval methods. Int J Man Mach Stud 38(5):747–767

    Article  Google Scholar 

  18. Godin R, Saunders E, Gecsei J (1986) Lattice model of browsable data spaces. Inf Sci 40(2):89–116

    Article  MATH  Google Scholar 

  19. Huibers TWC (1996) An axiomatic theory for information retrieval. Ph.D. thesis, Utrecht University, The Netherlands

  20. Koester B (2006) Conceptual knowledge retrieval with fooca: improving web search engine results with contexts and concept hierarchies. In: Perner P (ed) Advances in data mining. Applications in medicine, web mining, marketing, image and signal mining: 6th industrial conference on data mining, ICDM 2006, Leipzig, Germany, July 14–15, 2006 proceedings. Springer, Berlin, Heidelberg, pp 176–190

  21. Kraft DH, Buell DA (1983) Fuzzy sets and generalized boolean retrieval systems. Int J Man Mach Stud 19(1):45–56

    Article  Google Scholar 

  22. Lalmas M (1998) Logical models in information retrieval: introduction and overview. Inf Process Manag 34(1):19–33

    Article  Google Scholar 

  23. Lalmas M, Bruza PD (1998) The use of logic in information retrieval modelling. Knowl Eng Rev 13(3):263–295

    Article  Google Scholar 

  24. Lee JH (1994) Properties of extended boolean models in information retrieval. In: Croft BW, van Rijsbergen CJ, (ed) SIGIR ’94: proceedings of the seventeenth annual international ACM-SIGIR conference on research and development in information retrieval, organised by Dublin City University. Springer, London, pp 182–190

  25. Li Q, Guo L (2013) Formal query systems on contexts and a representation of algebraic lattices. Inf Sci 239:72–84

    Article  MathSciNet  MATH  Google Scholar 

  26. Losada DE, Barreiro A (2001) A logical model for information retrieval based on propositional logic and belief revision. Comput J 44(5):410

    Article  MATH  Google Scholar 

  27. Nauer E, Toussaint Y (2009) Crechaindo: an iterative and interactive web information retrieval system based on lattices. Int J Gen Syst 38(4):363–378

    Article  MATH  Google Scholar 

  28. Nie J (1989) An information retrieval model based on modal logic. Inf Process Manag 25(5):477–491

    Article  Google Scholar 

  29. Paice CD (1984) Soft evaluation of boolean search queries in information retrieval systems. Inf Technol Res Dev Appl 3(1):33–41

    Google Scholar 

  30. Priss U (2000) Lattice-based information retrieval. Knowl Organ 27:132–142

    Google Scholar 

  31. Radecki T (1979) Fuzzy set theoretical approach to document retrieval. Inf Process Manag 15(5):247–259

    Article  MATH  Google Scholar 

  32. Reiter R (1978) On closed world data bases. In: Gallaire H, Minker J (eds) Logic and data bases. Springer, Boston, pp 55–76

    Chapter  Google Scholar 

  33. Salton G (1968) Automatic information organization and retrieval. McGraw-Hill, New York

    Google Scholar 

  34. Salton G, Wong A, Yang CS (1975) A vector space model for automatic indexing. Commun ACM 18(11):613–620

    Article  MATH  Google Scholar 

  35. Salton G, Fox EA, Wu H (1983) Extended boolean information retrieval. Commun ACM 26(11):1022–1036

    Article  MathSciNet  MATH  Google Scholar 

  36. van Rijsbergen CJ (1986) A non-classical logic for information retrieval. Comput J 29(6):481–485

    Article  MATH  Google Scholar 

  37. Waller W, Kraft DH (1979) A mathematical model of a weighted boolean retrieval system. Inf Process Manag 15(5):235–245

    Article  MATH  Google Scholar 

  38. Wille R (1982) Restructuring lattice theory: an approach based on hierarchies of concepts. In: Rival I (ed) Ordered sets. Reidel, Dordrecht, pp 445–470

    Chapter  Google Scholar 

  39. Zadeh L (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst 1(1):3–28

    Article  MathSciNet  MATH  Google Scholar 

  40. Zerarga L, Djouadi Y (2012) Interval-valued fuzzy extension of formal concept analysis for information retrieval. In: Huang T, Zeng Z, Li C, Leung CS (ed) Neural information processing: 19th international conference, ICONIP 2012, Doha, Qatar, November 12–15, 2012, proceedings, Part I, Springer, Berlin, Heidelberg, pp 608–615

  41. Zerarga L, Djouadi Y (2013) Content-only querying structured contexts using formal concept analysis. In: 2013 11th International symposium on programming and systems (ISPS), pp 58–64

Download references

Acknowledgements

We thank anonymous reviewers for their constructive comments that helped us to highly improve the quality of the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Loutfi Zerarga.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zerarga, L., Djouadi, Y. A many-sorted theory proposal for information retrieval: axiomatization and semantics. Knowl Inf Syst 55, 113–139 (2018). https://doi.org/10.1007/s10115-017-1074-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-017-1074-9

Keywords

Navigation