Abstract
This study explores the benefits of integrating knowledge representations in prior art patent retrieval. Key to the introduced approach is the utilization of human judgment available in the form of classifications assigned to patent documents. The paper first outlines in detail how a methodology for the extraction of knowledge from such an hierarchical classification system can be established. Further potential ways of integrating this knowledge with existing Information Retrieval paradigms in a scalable and flexible manner are investigated. Finally based on these integration strategies the effectiveness in terms of recall and precision is evaluated in the context of a prior art search task for European patents. As a result of this evaluation it can be established that in general the proposed knowledge expansion techniques are particularly beneficial to recall and, with respect to optimizing field retrieval settings, further result in significant precision gains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
The Cross-Language Evaluation Forum (CLEF)
Guidelines for Examination in the European Patent Office (December 2007)
Allan, J.: HARD track overview in TREC 2004: High accuracy retrieval from documents. In: Proceedings of the thirteenth Text REtrieval Conference (TREC 2004), no. Ldc, NIST, pp. 1–11 (2004)
Billerbeck, B., Zobel, J.: Document expansion versus query expansion for ad-hoc retrieval. In: Proceedings of the 10th Australasian Document Computing Symposium (2005)
Boldi, P., Vigna, S.: MG4J at TREC 2005. In: The Fourteenth Text REtrieval Conference (TREC 2005) Proceedings, number SP, Citeseer, vol. 500, p. 266 (2005)
Bordag, S.: Elements of Knowledge-free and Unsupervised lexical acquisition. PhD thesis (2007)
Cohen, P.: Information retrieval by constrained spreading activation in semantic networks. Information Processing & Management 23(4), 255–268 (1987)
Coley, J.: Knowledge, expectations, and inductive reasoning within conceptual hierarchies. Cognition 90(3), 217–253 (2004)
Crestani, F.: Application of spreading activation techniques in information retrieval. Artificial Intelligence Review 11(6), 453–482 (1997)
Croft, W.B.: Approaches to intelligent information retrieval. Information Processing & Management 23(4), 249–254 (1987)
Croft, W.: Combining approaches to information retrieval. Advances in information retrieval 7, 1–36 (2000)
Davis, R., Shrobe, H., Szolovits, P.: What is a knowledge representation? AI magazine 14(1), 17 (1993)
Fujii, A., Iwayama, M., Kando, N.: Overview of Patent Retrieval Task at NTCIR-5. In: Proceedings of NTCIR-5 Workshop Meeting (2005)
Furnas, G., Landauer, T., Gomez, L., Dumais, S.: The vocabulary problem in human-system communication. Communications of the ACM 30(11), 971 (1987)
Graf, E., Azzopardi, L.: A methodology for building a patent test collection for prior art search. In: Proceedings of the Second International Workshop on Evaluating Information Access, EVIA (2008)
Graf, E., Azzopardi, L., van Rijsbergen, K.: Automatically Generating Queries for Prior Art Search
Hutchins, W.: The concept of aboutness in subject indexing. Aslib Proceedings 30(5), 172–181 (1978)
Iwayama, M., Fujii, A., Kando, N.: Overview of Classification Subtask at NTCIR-6 Patent Retrieval Task. In: Proceedings of NTCIR-6 Workshop Meeting, pp. 366–372 (2007)
Jing, Y., Croft, W.: An association thesaurus for information retrieval. In: Proceedings of RIAO, vol. 94, Citeseer, pp. 146–160 (1994)
Kintsch, W.: The role of knowledge in discourse comprehension: a construction-integration model. Psychological review 95(2), 163–182 (1988)
Markman, E., Callanan, M.: An analysis of hierarchical classification, pp. 325–366. Erlbaum, Hillsdale (1980)
McCune, B., Tong, R., Dean, J., Shapiro, D.: RUBRIC: a system for rule-based information retrieval. Readings in information retrieval 9, 445 (1997)
Medin, D.L., Rips, L.J.: Concepts and categories: Memory, meaning, and metaphysics. Cambridge Univ. Press, Cambridge (2005)
Piroi, F., Roda, G., Zenz, V.: CLEF-IP 2009 Evaluation Summary (2009)
Robertson, S., Zaragoza, H., Taylor, M.: Simple BM25 extension to multiple weighted fields. In: CIKM 2004: Proceedings of the thirteenth ACM international conference on Information and knowledge management, pp. 42–49. ACM Press, New York (2004)
Roda, G., Tait, J., Piroi, F., Zenz, V.: CLEF-IP 2009: retrieval experiments in the Intellectual Property domain. In: CLEF working notes 2009 (2009)
Salton, G.: Associative document retrieval techniques using bibliographic information. Journal of the ACM (JACM) 10(4), 440–457 (1963)
Wharton, C., Kintsch, W.: An overview of construction-integration model. ACM SIGART Bulletin 2(4), 169–173 (1991)
Zaragoza, H., Craswell, N., Taylor, M., Saria, S., Robertson, S.: Microsoft Cambridge at TREC-13: Web and HARD tracks. In: Proceedings of TREC 2004, Citeseer (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Graf, E., Frommholz, I., Lalmas, M., van Rijsbergen, K. (2010). Knowledge Modeling in Prior Art Search. In: Cunningham, H., Hanbury, A., Rüger, S. (eds) Advances in Multidisciplinary Retrieval. IRFC 2010. Lecture Notes in Computer Science, vol 6107. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13084-7_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-13084-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13083-0
Online ISBN: 978-3-642-13084-7
eBook Packages: Computer ScienceComputer Science (R0)