Abstract
Logic-based models have been already proposed for information retrieval purpose. However, there is a need for new formalisms providing more generic frameworks. For this purpose, an information retrieval axiomatic theory is proposed in this paper, independently of any model. Our proposal which mainly relies on many-sorted logic allows to consider various sets in the domain of discourse that provides us a rich framework to model the different items such as documents, index terms, queries. The theory relies on a sound set of axioms driving the retrieval process as proof of theorems. As such the genericity consists of a main motivation; it will be proved that three classical information retrieval models, namely the Boolean model; the fuzzy-set-based extension of the Boolean model; and the vector space model, satisfy the proposed theory, establishing then its consistency. Beyond the genericity, the proposed approach may face concrete problems. Indeed, it is well known that the use of the classical settings of formal concept analysis theory for information retrieval does not allow disjunctions and negations in queries. For this purpose, this paper gives a characterization of these queries forms using appropriates theorems of the theory. Useful algebraic properties (i.e., isomorphisms) are then established for this end.
Similar content being viewed by others
References
Abdulahhad K (2014) Information retrieval (IR) modeling by logic and lattice. Application to conceptual IR. Thesis, Université de Grenoble
Bookstein A (1980) Fuzzy requests: an approach to weighted boolean searches. J Am Soc Inf Sci 31(4):240–247
Bordogna G, Carrara P, Pasi G (1991) Query term weights as constraints in fuzzy information retrieval. Inf Process Manag 27(1):15–26
Bruza PD, Huibers TWC (1994) Investigating aboutness axioms using information fields. In: Croft BW, van Rijsbergen CJ (ed) SIGIR ’94: proceedings of the seventeenth annual international ACM-SIGIR conference on research and development in information retrieval, organised by Dublin City University. Springer London, pp 112–121
Carpineto C, Romano G (1998) Effective reformulation of boolean queries with concept lattices. In: Andreasen T, Christiansen H, Larsen HL (ed) Flexible query answering systems: third international conference, FQAS’98 Roskilde, Denmark, May 13–15, 1998 Proceedings. Springer, Berlin, Heidelberg, pp 83–94
Carpineto C, Romano G (2004) Exploiting the potential of concept lattices for information retrieval with credo. J Univers Comput Sci 10(8):985–1013
Carpineto C, Romano G (2005) Using concept lattices for text retrieval and mining, In: Ganter B, Stumme G, Wille R (ed) Formal concept analysis, foundations and applications, vol 3626 of lecture notes in computer science. Springer, Berlin, pp 161–179
Djouadi Y (2011) Extended galois derivation operators for information retrieval based on fuzzy formal concept lattice, In: Benferhat S, Grant J (ed) Scalable uncertainty management: 5th international conference, SUM 2011, Dayton, OH, USA, October 10–13, 2011, proceedings. Springer, Berlin, Heidelberg, pp 346–358
Djouadi Y (2012) Généralisation des opérateurs de dérivation de galois en recherche d’information basée sur l’analyse formelle de concepts. In: Beigbeder M, Eglin V, Ragot N, Géry M (ed) CORIA, pp 373–386
Djouadi Y, Prade H (2011) Possibility-theoretic extension of derivation operators in formal concept analysis over fuzzy lattices. Fuzzy Optim Decis Mak 10(4):287–309
Dubois D, de Saint Dupin, Cyr Bannay F, Prade H (2007) A possibility-theoretic view of formal concept analysis. Fundam Inf 75(1–4):195–213
Dubois D, Prade H (2009) Possibility theory and formal concept analysis in information systems. In: Carvalho J, Dubois D, Kaymak U, Sousa J (ed) International fuzzy systems association world congress and conference of the European society for fuzzy logic and technology (IFSA-EUSFLAT). Lisbonne (Portugal), 20/07/09-24/07/09, pp 1021–1026
Ducrou J, Eklund PW (2007) Searchsleuth: the conceptual neighbourhood of an web query. In: Eklund PW, Diatta J, Liquiere M (ed) Proceedings of the fifth international conference on concept lattices and their applications. Montpellier, France
Düntsch I, Gediga G (2003) Approximation operators in qualitative data analysis. In: de Swart H, Orłowska E, Schmidt G, Roubens M (ed) Theory and applications of relational structures as knowledge instruments: COST Action 274, TARSKI. Revised Papers. Springer, Berlin, Heidelberg, pp 214–230
Fuhr N (1995) Probabilistic datalog—a logic for powerful retrieval methods. In: Proceedings of the 18th annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’95. ACM, New York, NY, USA, pp 282–290
Ganter B, Wille R (1999) Formal concept analysis: mathematical foundations. Springer, Berlin
Godin R, Missaoui R, April A (1993) Experimental comparison of navigation in a galois lattice with conventional information retrieval methods. Int J Man Mach Stud 38(5):747–767
Godin R, Saunders E, Gecsei J (1986) Lattice model of browsable data spaces. Inf Sci 40(2):89–116
Huibers TWC (1996) An axiomatic theory for information retrieval. Ph.D. thesis, Utrecht University, The Netherlands
Koester B (2006) Conceptual knowledge retrieval with fooca: improving web search engine results with contexts and concept hierarchies. In: Perner P (ed) Advances in data mining. Applications in medicine, web mining, marketing, image and signal mining: 6th industrial conference on data mining, ICDM 2006, Leipzig, Germany, July 14–15, 2006 proceedings. Springer, Berlin, Heidelberg, pp 176–190
Kraft DH, Buell DA (1983) Fuzzy sets and generalized boolean retrieval systems. Int J Man Mach Stud 19(1):45–56
Lalmas M (1998) Logical models in information retrieval: introduction and overview. Inf Process Manag 34(1):19–33
Lalmas M, Bruza PD (1998) The use of logic in information retrieval modelling. Knowl Eng Rev 13(3):263–295
Lee JH (1994) Properties of extended boolean models in information retrieval. In: Croft BW, van Rijsbergen CJ, (ed) SIGIR ’94: proceedings of the seventeenth annual international ACM-SIGIR conference on research and development in information retrieval, organised by Dublin City University. Springer, London, pp 182–190
Li Q, Guo L (2013) Formal query systems on contexts and a representation of algebraic lattices. Inf Sci 239:72–84
Losada DE, Barreiro A (2001) A logical model for information retrieval based on propositional logic and belief revision. Comput J 44(5):410
Nauer E, Toussaint Y (2009) Crechaindo: an iterative and interactive web information retrieval system based on lattices. Int J Gen Syst 38(4):363–378
Nie J (1989) An information retrieval model based on modal logic. Inf Process Manag 25(5):477–491
Paice CD (1984) Soft evaluation of boolean search queries in information retrieval systems. Inf Technol Res Dev Appl 3(1):33–41
Priss U (2000) Lattice-based information retrieval. Knowl Organ 27:132–142
Radecki T (1979) Fuzzy set theoretical approach to document retrieval. Inf Process Manag 15(5):247–259
Reiter R (1978) On closed world data bases. In: Gallaire H, Minker J (eds) Logic and data bases. Springer, Boston, pp 55–76
Salton G (1968) Automatic information organization and retrieval. McGraw-Hill, New York
Salton G, Wong A, Yang CS (1975) A vector space model for automatic indexing. Commun ACM 18(11):613–620
Salton G, Fox EA, Wu H (1983) Extended boolean information retrieval. Commun ACM 26(11):1022–1036
van Rijsbergen CJ (1986) A non-classical logic for information retrieval. Comput J 29(6):481–485
Waller W, Kraft DH (1979) A mathematical model of a weighted boolean retrieval system. Inf Process Manag 15(5):235–245
Wille R (1982) Restructuring lattice theory: an approach based on hierarchies of concepts. In: Rival I (ed) Ordered sets. Reidel, Dordrecht, pp 445–470
Zadeh L (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst 1(1):3–28
Zerarga L, Djouadi Y (2012) Interval-valued fuzzy extension of formal concept analysis for information retrieval. In: Huang T, Zeng Z, Li C, Leung CS (ed) Neural information processing: 19th international conference, ICONIP 2012, Doha, Qatar, November 12–15, 2012, proceedings, Part I, Springer, Berlin, Heidelberg, pp 608–615
Zerarga L, Djouadi Y (2013) Content-only querying structured contexts using formal concept analysis. In: 2013 11th International symposium on programming and systems (ISPS), pp 58–64
Acknowledgements
We thank anonymous reviewers for their constructive comments that helped us to highly improve the quality of the paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zerarga, L., Djouadi, Y. A many-sorted theory proposal for information retrieval: axiomatization and semantics. Knowl Inf Syst 55, 113–139 (2018). https://doi.org/10.1007/s10115-017-1074-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-017-1074-9