Skip to main content

A Mixture Record Linkage Approach for US Patent Inventor Disambiguation

  • Conference paper
  • First Online:
Advanced Multimedia and Ubiquitous Engineering (FutureTech 2017, MUE 2017)

Abstract

Inventor name disambiguation is a task that distinguishes each unique inventor from all other inventor records in patent database. This task is essential for processing person name queries in order to get information related to certain inventor. We proposed a mixture approach that applies to the combination of supervised learning, stochastic record linkage and ruled-based method to determine whether each pair of inventor records are from same inventor or not. Our algorithm tested on the USPTO patent database disambiguated 12 million inventor records in 7 h. Evaluation is on labeled dataset from USPTO PatentsView inventor name disambiguation competition and showed our approach have an excellent output.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Huberty, M., Serwaah, A., Zachmann, G.: A flexible, scaleable approach to the international patent ‘name game’, Bruegel Working Paper (2014/10i), (2014)

    Google Scholar 

  2. Li, G., Lai, R., Amour, D., Doolin, D.M., Sun, Y., Torvik, V.I., Yu, A.Z., Fleming, L.: Disambiguation and co-authorship networks of the U.S. patent inventor database (1975–2010). Res. Policy 43(6), 941–955 (2014). doi:10.1016/j.respol.2014.01.012

    Article  Google Scholar 

  3. Smalheiser, N.R., Torvik, V.I.: Author name disambiguation. Annu Rev Inform Sci 43(1), 1–43 (2009)

    Article  Google Scholar 

  4. Ferreira, A.A., Gonçalves, M.A., Laender, A.H.F.: A brief survey of automatic methods for author name disambiguation. SIGMOD Rec. 41(2), 15–26 (2012)

    Article  Google Scholar 

  5. Fleming, L., King III, C., Juda, A.I.: Small worlds and regional innovation. Org. Sci. 18(6), 938–954 (2007)

    Article  Google Scholar 

  6. Ventura, S.L., Nugent, R., Fuchs, E.R.: Methods Matter: Rethinking Inventor Disambiguation with Classification & Labeled Inventor RecordsAcademy of Management Proceedings, 2013. Academy of Management, p 14537. (2013)

    Google Scholar 

  7. Ventura, S.L., Nugent, R., Fuchs, E.R.H.: Seeing the non-stars: (some) sources of bias in past disambiguation approaches and a new public tool leveraging labeled records. Res. Policy 44(9), 1672–1701 (2015). doi:10.1016/j.respol.2014.12.010

    Article  Google Scholar 

  8. Sariyar, M., Borg, A.: The RecordLinkage package: detecting errors in data. The R Journal 2(2), 61–67 (2010)

    Google Scholar 

  9. Azoulay, P., Michigan, R., Sampat, B.N.: The anatomy of medical school patenting. New Engl J Med 357(20), 2049–2056 (2007). doi:10.1056/NEJMsa067417

    Article  Google Scholar 

  10. Trajtenberg, M., Shiff, G.: Identification and mobility of Israeli patenting inventors Pinhas Sapir Center for Development, Tel Aviv University (2008)

    Google Scholar 

  11. Ge, C., Huang, K., Png, I.P.L.: Engineer/scientist careers patents, online profiles, and misclassification bias. Strateg. Manag. J. 37(1), 232–253 (2016). doi:10.1002/smj.2460

    Article  Google Scholar 

  12. Bailey, J.: Evaluation approach and outcomes of the workshop, http://patentsview.org/data/presentations/Bailey_PV_Workshop.pptx

Download references

Acknowledgments

This research is supported by National Natural Science Foundations of China (Nos. 71403256 and 71303023), and also supported by National Key Technology R&D Program of China (No. 2013BAH21B00).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hai-Chao Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Yang, GC., Liang, C., Jing, Z., Wang, DR., Zhang, HC. (2017). A Mixture Record Linkage Approach for US Patent Inventor Disambiguation. In: Park, J., Chen, SC., Raymond Choo, KK. (eds) Advanced Multimedia and Ubiquitous Engineering. FutureTech MUE 2017 2017. Lecture Notes in Electrical Engineering, vol 448. Springer, Singapore. https://doi.org/10.1007/978-981-10-5041-1_55

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-5041-1_55

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-5040-4

  • Online ISBN: 978-981-10-5041-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics