Abstract
Patent retrieval has emerged as an important application of information retrieval (IR). It is considered to be a complex search task because patent search requires an extended chain of reasoning beyond basic document retrieval. As logic-based IR is capable of modelling both document retrieval and decision-making, it can be seen as a suitable framework for modelling patent data and search strategies. In particular, we demonstrate logic-based modelling for semantic data in patent documents and retrieval strategies which are tailored to patent search and exploit more than just the text in the documents. Given the expressiveness of logic-based IR, however, there is an attendant compromise on issues of scalability and quality. To address these trade-offs we suggest how a parallelised architecture can ensure that logical IR scales in spite of its expressiveness.
Notes
References
Callan J (2000) Distributed information retrieval. In: Advances in information retrieval. Kluwer Academic, Dordrecht, pp 127–150
Fuhr N (1995) Probabilistic datalog—a logic for powerful retrieval methods. In: Fox E, Ingwersen P, Fidel R (eds) Proceedings of the 18th annual international ACM SIGIR conference on research and development in information retrieval. ACM, New York, pp 282–290
Fuhr N (1996) Optimum database selection in networked ir. In: SIGIR workshop on networked information retrieval
Klampanos IA, Azzam H, Roelleke T (2009) A case for probabilistic logic for scalable patent retrieval. In: CIKM workshop on patent information retrieval, pp 1–8
Klampanos IA, Wu H, Roelleke T, Azzam H (2010) Logic-based retrieval: Technology for content-oriented and analytical querying of patent data. In: IRFC, pp 100–119
Poole D (1993) Probabilistic horn abduction and Bayesian networks. Artif Intell 64(1):81–129
Richardson M, Domingos P (2006) Markov logic networks. Mach Learn 62(1–2):107–136
Roelleke T, Fuhr N (1998) Information retrieval with probabilistic datalog. In: Crestani F, Lalmas M, Rijsbergen CJ (eds) Uncertainty and logics—advanced models for the representation and retrieval of information. Kluwer Academic, Dordrecht
Roelleke T, Wu H, Wang J, Azzam H (2008) Modelling retrieval models in a probabilistic relational algebra with a new operator: The Relational Bayes. VLDB J 17(1):5–37
Sato T, Kameya Y (2001) Parameter learning of logic programs for symbolic-statistical modeling. J Artif Intell Res 15:391–454
Scholl M, Schek HJ (1990) A relational object model. In: Abiteboul S, Kanellakis P (eds) ICDT ’90. Springer, Berlin, pp 89–105
Stonebraker M, Moore D, Brown P (1998) Object-relational DBMSs: Tracking the next great wave. Morgan Kaufmann, San Francisco
Tait J, Lupu M, Berger H, Roda G, Dittenbach M, Pesenhofer A, Graf E, van Rijsbergen K (2009) Patent search: An important new test bed for ir. In: Aly R, Hauff C, den Hamer I, Hiemstra D, Huibers T, de Jong F (eds) 9th Dutch–Belgian information retrieval workshop (DIR 2009), TNO ICT, Delft, The Netherlands. Neslia Paniculata, Enschede
Wu H, Kazai G, Roelleke T (2008) Modelling anchor text retrieval in book search based on back-of-book index. In: SIGIR workshop on focused retrieval, pp 51–58
Acknowledgements
We would like to thank Matrixware Information Services GmbH and the Information Retrieval Facility (IRF) for supporting this work. We also would like to thank Helmut Berger for his management of the LSLR project. Finally, many thanks to the reviewers for their excellent suggestions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Azzam, H., Klampanos, I.A., Roelleke, T. (2011). Large-Scale Logical Retrieval: Technology for Semantic Modelling of Patent Search. In: Lupu, M., Mayer, K., Tait, J., Trippe, A. (eds) Current Challenges in Patent Information Retrieval. The Information Retrieval Series, vol 29. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19231-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-19231-9_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19230-2
Online ISBN: 978-3-642-19231-9
eBook Packages: Computer ScienceComputer Science (R0)