A Comparison of Patent Classifications with Clustering Analysis

Smith, Mick; Agrawal, Rajeev

doi:10.1007/978-3-319-26187-4_38

Mick Smith²⁰ &
Rajeev Agrawal²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9419))

Included in the following conference series:

International Conference on Web Information Systems Engineering

1490 Accesses

Abstract

There is an abundance of data and knowledge within any given patent. Through the use of textual mining and machine learning clustering techniques it is possible to discover meaningful associations throughout a corpus of patents. This research demonstrates that such relationships between USPTO patents exist. Through the use of k-means and k-medians clustering, the accuracy of the USPTO classes will be assessed. It will also be demonstrated that a more refined classification process would be beneficial to other areas of analysis and forecasting.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Automatics Tools and Methods for Patents Analysis: Efficient Methodology for Patent Document Clustering

Patent Document Clustering Using Dimensionality Reduction

Clustering the Patent Data Using K-Means Approach

References

Blake, C.: Text mining. Ann. Rev. Inf. Sci. Technol. 45(1), 121–155 (2011)
Article Google Scholar
Chen, Y.-L., Chang, Y.-C.: A three-phase method for patent classification. Inf. Process. Manage. 48, 1017–1030 (2012)
Article Google Scholar
Chernoff, H., Gillick, L.S., Hartigan, J.A.: k-Means algorithms. Encycl. Stat. Sci. 6, 3858–3859 (2006)
Google Scholar
Chou, L.-Y.: Knowledge discovery through bibiometrics and data mining: an example on marketing ethics. Int. J. Organ. Innov. 3, 106–139 (2011)
Google Scholar
Goswami, S., Shishodia, M.S.: A fuzzy based approach to text mining and document clustering. Int. J. Data Min. Knowl. Manage. Proc. 3(3), 43–52 (2013)
Article Google Scholar
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Elsevier, Waltham (2012)
Book Google Scholar
Hsu, C.C., Huang, Y.-P., Chang, K.-W.: Extended Naïve Bayes classifier for mixed data. Expert Syst. Appl. 35, 1080–1083 (2008)
Article Google Scholar
Jun, S., Park, S.S., Jang, D.S.: Technology forecasting using matrix mapping and patent clustering. Ind. Manage. Data Syst. 112, 786–807 (2011)
Article Google Scholar
Kang, I.-S., Na, S.-H., Kim, J., Lee, J.-H.: Cluster based patent retrieval. Inf. Process. Manage. 43, 1173–1182 (2007)
Article Google Scholar
Kasravi, K., Risov, M.: Patent mining - discovery of business value from patent repositories. In: Proceedings of the Fortieth Annual Hawaii International Conference on System Sciences, Waikoloa, Hawaii, USA (2007)
Google Scholar
Kim, J.-H., Choi, K.-S.: Patent document categorization based on semantic structural information. Inf. Process. Manage. 43, 1200–1215 (2007)
Article Google Scholar
Karmakar, S., Zhu, Y.: Mining collaboration through textual semantic interpretation. In: 2011 11th International Conference on Hybrid Intelligent Systems (HIS), pp. 728–733 (2011)
Google Scholar
Li, Y., Chung, S.M., Holt, J.D.: Text document clustering based on frequent word meaning sequences. Data Knowl. Eng. 64, 381–404 (2008)
Article Google Scholar
Maechler, M.: “Finding Groups in Data”: Cluster Analysis Extended Rousseeuw et al, Package “Cluster” (R Documentation). https://cran.r-project.org/web/packages/cluster/cluster.pdf. Accessed 21 July 2015
Ruffaldi, E., Sani, E., Bergamasco, M.: Visualizing perspectives and trends in robotics based on patent mining. In: 2010 IEEE International Conference on Robotics and Automation, Anchorage, Alaska, USA 3–8 May 2010
Google Scholar
Trappery, A.J.C., Hsu, F.-C., Trappery, C.V., Lin, C.-I.: Development of a patent document classification and search platform using a back-propagation network. Expert Syst. Appl. 31, 755–765 (2006)
Article Google Scholar
Trappey, C.V., Wu, H.-Y., Taghaboni-Dutta, F., Trappey, A.J.C.: Using patent data for technology forecasting: China RFID patent analysis. Adv. Eng. Inform. 25, 53–64 (2011)
Article Google Scholar
Tseng, Y.H., Lin, C.J., Lin, Y.I.: Text mining techniques for patent analysis. Inf. Process. Manage. 43, 1216–1247 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Technology, North Carolina A&T State University, 1601 E. Market Street, Greensboro, NC, 27411, USA
Mick Smith
Department of Computer Systems Technology, North Carolina A&T State University, 209 Price Hall, Greensboro, NC, 27411, USA
Rajeev Agrawal

Authors

Mick Smith
View author publications
You can also search for this author in PubMed Google Scholar
Rajeev Agrawal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mick Smith .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Jianyong Wang
Poznan University of Economics, Poznan, Poland
Wojciech Cellary
Florida Atlantic University, Boca Raton, Florida, USA
Dingding Wang
Victoria University, Melbourne, Victoria, Australia
Hua Wang
Florida International University, Miami, Florida, Florida, USA
Shu-Ching Chen
Florida International University, Miami, Florida, USA
Tao Li
Victoria University, Melbourne, Victoria, Australia
Yanchun Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Smith, M., Agrawal, R. (2015). A Comparison of Patent Classifications with Clustering Analysis. In: Wang, J., et al. Web Information Systems Engineering – WISE 2015. WISE 2015. Lecture Notes in Computer Science(), vol 9419. Springer, Cham. https://doi.org/10.1007/978-3-319-26187-4_38

Download citation

DOI: https://doi.org/10.1007/978-3-319-26187-4_38
Published: 18 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26186-7
Online ISBN: 978-3-319-26187-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics