Abstract
At present, there is a lack of in-depth processing and indexing of Chinese patents in China, which makes the patent data retrieval inaccurate and incomplete, leading to duplication of applications and waste of resources. Aiming at the problem of lacking annotated patent data in Chinese patent indexing, this paper studies an incremental patent annotation method. By using co-training method, keyword extraction and list extraction can cooperate with each other and iteratively annotate the functional clauses, which achieves the effect of obtaining much more annotated data through a small quantity of training data. Experiment results indicate this method can gradually improve the recall without sacrificing much precision.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Zhang, J., Liu, M.-J., Zhai, D.-S.: Technology topic in RFID based on patent co-word analysis. Scientific Management Research, Oct. 2013
Gurry, F.: World intellectual property indicators—2011 Edition [EB/OL], 7 Apr 2014. http://www.wipo.int/export/sites/www/freepublications/en/intproperty/941/wipo_pub_941_2011.pdf
Fa-guo, Z.H.O.U., Ying-long, W.A.N.G., Bing-ru, Y.A.N.G., et al.: Research on key technologies of unstructured information extraction. Comput. Eng. Appl. 45(14), 1–6 (2009)
Parapatics, P., Dittenbach, M.: Patent claim decomposition for improved information extraction[M]. In: Parapatics, P., Dittenbach, M. (eds.) Current Challenges in Patent Information Retrieval, pp. 197–216. Springer, Berlin Heidelberg (2011)
Wang, P.-Y., Zhang, G.-P., Cai, D.-F.: An automatic generation method for patent keyword extraction template. Journal of ShenYang Institute of Aeronautical Engineering 27, 46–49 (2010)
Nanba, H., Kondo, T., Takezawa, T.: Hiroshima city university at NTCIR-7 patent mining task[C]. In: Proceedings of the 7th NTCIR Workshop Meeting, pp. 369–372, 2008
Gui J., Li P., Zhang C., et al. Integrating crf and rule method for knowledge extraction in patent mining task at NTCIR-8[C]. In: Proceedings of the 8th NTCIR Workshop Meeting, pp. 341–344, 2009
Sun Yan-ling, Liu Hua-bing, Wang Hai-hong, et al. Deep indexed chinese pharmaceutical patent database[j]. Chin. J. Med. Guide 10(1), 22–24 + 26 (2008)
Zhu, L., Lv, X., Xu, L.: Patent subject words extraction based on integrated strategy method[C]. In: International Symposium on Parallel & Distributed Computing (2017)
Bopei, Z., Yongping, D., Wenjian, M.: Efficacy word recognition based on hidden markov model [J]. Inf. Eng. 1(03), 81–89 (2015)
Guangpu, F., Xu, C., Zhiyong, P.: A rules and statistical learning based method for Chinese patent information extraction[C]. In: Eighth Web Information Systems & Applications Conference. IEEE Computer Society, 2011
Chen, Y., Zhou, R., Zhu, W., et al.: Mining patent knowledge for automatic keyword extraction. J. Comput. Res. Dev. 53(8), 1740–1752 (2016)
Acknowledgments
This work was supported by the Zhongnan University of Economics and Law (2722019JCT035, 2722019JCG074), the National Natural Science Foundation of China (61602518), and the Fundamental Research Funds for the Central Universities National Social Science Fund of China (NO:16CXW019).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, X., Zong, W., Deng, N., Liu, S., Li, Y. (2020). Incremental Patent Semantic Annotation Based on Keyword Extraction and List Extraction. In: Barolli, L., Hussain, F., Ikeda, M. (eds) Complex, Intelligent, and Software Intensive Systems. CISIS 2019. Advances in Intelligent Systems and Computing, vol 993. Springer, Cham. https://doi.org/10.1007/978-3-030-22354-0_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-22354-0_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22353-3
Online ISBN: 978-3-030-22354-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)