ABSTRACT
Finding similar patents is a challenging task in patent information retrieval. A patent application is often a starting point to find similar inventions. Keyword search for similar patents requires significant domain expertise and may not fetch relevant results. We propose a novel representation for patents and use a two stage approach to find similar patents. Each patent is represented as an IPC class vector. Citation network of patents is used to propagate these vectors from a node (patent) to its neighbors (cited patents). Thus, each patent is represented as a weighted combination of its IPC information as well as of its neighbors. A query patent is represented as a vector using its IPC information and similar patents can be simply found by comparing this vector with vectors of patents in the corpus. Text based search is used to re-rank this solution set to improve precision. We experiment with two similarity measures and re-ranking strategies to empirically show that our representation is effective in improving both precision and recall of queries of CLEF-2011 dataset.
- Y.-L. Chen and Y.-T. Chiu. An ipc-based vector space model for patent retrieval. Inf. Process. Manage., 47:309--322, May 2011. Google ScholarDigital Library
- A. Fujii. Enhancing patent retrieval by citation analysis. In SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pages 793--794, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
- J. Gobeill, E. Pasche, D. Teodoro, and P. Ruch. Simple pre and post processing strategies for patent searching in clef intellectual property track 2009. In Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments, CLEF'09, pages 444--451, Berlin, Heidelberg, 2009. Springer-Verlag. Google ScholarDigital Library
- C. G. Harris, R. Arens, and P. Srinivasan. Comparison of ipc and uspc classification systems in patent prior art searches. In Proceedings of the 3rd international workshop on Patent information retrieval, PaIR '10, pages 27--32, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- C. G. Harris, S. Foster, R. Arens, and P. Srinivasan. On the role of classification in patent invalidity searches. In Proceeding of the 2nd international workshop on Patent information retrieval, PaIR '09, pages 29--32, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
- I.-S. Kang, S.-H. Na, J. Kim, and J.-H. Lee. Cluster-based patent retrieval. Inf. Process. Manage., 43(5):1173--1182, 2007. Google ScholarDigital Library
- J. Kim, I.-S. Kang, and J.-H. Lee. Cluster-based patent retrieval using international patent classification system. In Y. Matsumoto, R. Sproat, K.-F. Wong, and M. Zhang, editors, Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, volume 4285 of Lecture Notes in Computer Science, pages 205--212. Springer Berlin / Heidelberg, 2006. Google ScholarDigital Library
- H. Mase, T. Matsubayashi, Y. Ogawa, M. Iwayama, and T. Oshio. Proposal of two-stage patent retrieval method considering the claim structure. ACM Transactions on Asian Language Information Processing (TALIP), 4(2):190--206, 2005. Google ScholarDigital Library
Index Terms
- Patent search using IPC classification vectors
Recommendations
Comparison of IPC and USPC classification systems in patent prior art searches
PaIR '10: Proceedings of the 3rd international workshop on Patent information retrievalPatent classification systems are used to help scrutinize patent applications for possible violations of the novelty and non-obviousness/inventive steps of a patentability test. There are several different patent classification systems in use today, ...
Automatic query generation for patent search
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementPatent search is the task of finding relevant existing patents, which is an important part of the patent's examiner's process of validating a patent application. In this paper, we studied how to transform a query patent (the application) into search ...
Cluster-based patent retrieval using international patent classification system
ICCPOL'06: Proceedings of the 21st international conference on Computer Processing of Oriental Languages: beyond the orient: the research challenges aheadA patent collection provides a great test-bed for cluster-based information retrieval. International Patent Classification (IPC) system provides a hierarchical taxonomy with 5 levels of specificity. We regard IPC codes of patent applications as cluster ...
Comments