Abstract
Inventor name disambiguation is a task that distinguishes each unique inventor from all other inventor records in patent database. This task is essential for processing person name queries in order to get information related to certain inventor. We proposed a mixture approach that applies to the combination of supervised learning, stochastic record linkage and ruled-based method to determine whether each pair of inventor records are from same inventor or not. Our algorithm tested on the USPTO patent database disambiguated 12 million inventor records in 7 h. Evaluation is on labeled dataset from USPTO PatentsView inventor name disambiguation competition and showed our approach have an excellent output.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Huberty, M., Serwaah, A., Zachmann, G.: A flexible, scaleable approach to the international patent ‘name game’, Bruegel Working Paper (2014/10i), (2014)
Li, G., Lai, R., Amour, D., Doolin, D.M., Sun, Y., Torvik, V.I., Yu, A.Z., Fleming, L.: Disambiguation and co-authorship networks of the U.S. patent inventor database (1975–2010). Res. Policy 43(6), 941–955 (2014). doi:10.1016/j.respol.2014.01.012
Smalheiser, N.R., Torvik, V.I.: Author name disambiguation. Annu Rev Inform Sci 43(1), 1–43 (2009)
Ferreira, A.A., Gonçalves, M.A., Laender, A.H.F.: A brief survey of automatic methods for author name disambiguation. SIGMOD Rec. 41(2), 15–26 (2012)
Fleming, L., King III, C., Juda, A.I.: Small worlds and regional innovation. Org. Sci. 18(6), 938–954 (2007)
Ventura, S.L., Nugent, R., Fuchs, E.R.: Methods Matter: Rethinking Inventor Disambiguation with Classification & Labeled Inventor RecordsAcademy of Management Proceedings, 2013. Academy of Management, p 14537. (2013)
Ventura, S.L., Nugent, R., Fuchs, E.R.H.: Seeing the non-stars: (some) sources of bias in past disambiguation approaches and a new public tool leveraging labeled records. Res. Policy 44(9), 1672–1701 (2015). doi:10.1016/j.respol.2014.12.010
Sariyar, M., Borg, A.: The RecordLinkage package: detecting errors in data. The R Journal 2(2), 61–67 (2010)
Azoulay, P., Michigan, R., Sampat, B.N.: The anatomy of medical school patenting. New Engl J Med 357(20), 2049–2056 (2007). doi:10.1056/NEJMsa067417
Trajtenberg, M., Shiff, G.: Identification and mobility of Israeli patenting inventors Pinhas Sapir Center for Development, Tel Aviv University (2008)
Ge, C., Huang, K., Png, I.P.L.: Engineer/scientist careers patents, online profiles, and misclassification bias. Strateg. Manag. J. 37(1), 232–253 (2016). doi:10.1002/smj.2460
Bailey, J.: Evaluation approach and outcomes of the workshop, http://patentsview.org/data/presentations/Bailey_PV_Workshop.pptx
Acknowledgments
This research is supported by National Natural Science Foundations of China (Nos. 71403256 and 71303023), and also supported by National Key Technology R&D Program of China (No. 2013BAH21B00).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yang, GC., Liang, C., Jing, Z., Wang, DR., Zhang, HC. (2017). A Mixture Record Linkage Approach for US Patent Inventor Disambiguation. In: Park, J., Chen, SC., Raymond Choo, KK. (eds) Advanced Multimedia and Ubiquitous Engineering. FutureTech MUE 2017 2017. Lecture Notes in Electrical Engineering, vol 448. Springer, Singapore. https://doi.org/10.1007/978-981-10-5041-1_55
Download citation
DOI: https://doi.org/10.1007/978-981-10-5041-1_55
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-5040-4
Online ISBN: 978-981-10-5041-1
eBook Packages: EngineeringEngineering (R0)