research-article

Patent classification of the new invention using PLSA

Authors:
Ranjeet Kumar

IIIT-ALLAHABAD, Deoghat Jhalwa

IIIT-ALLAHABAD, Deoghat Jhalwa
View Profile

,
Shrishail Math

IIIT-ALLAHABAD, Deoghat Jhalwa

IIIT-ALLAHABAD, Deoghat Jhalwa
View Profile

,
R. C. Tripathi

IIIT-ALLAHABAD, Deoghat Jhalwa

IIIT-ALLAHABAD, Deoghat Jhalwa
View Profile

,
M. D. Tiwari

IIIT-ALLAHABAD, Deoghat Jhalwa

IIIT-ALLAHABAD, Deoghat Jhalwa
View Profile

IITM '10: Proceedings of the First International Conference on Intelligent Interactive Technologies and MultimediaDecember 2010Pages 222–225https://doi.org/10.1145/1963564.1963602

Published:27 December 2010Publication History

IITM '10: Proceedings of the First International Conference on Intelligent Interactive Technologies and Multimedia

Pages 222–225

ABSTRACT

In the current scenario of the world for Research and Development leading to patenting, content classification in accordance with the subject areas to which it belongs to is a challenging task. This is because today's R&D draws its novelty/newness not in one technical area but a unique combination of different technical areas. For example, a Typical ICT patent may be a composite effect for advancing the knowledge in some combination of Control Engg, Electronic Components, Databases Technology, Information retrieval methodology, Internet and Wireless technology, Speech, Signal, and Image Processing etc. In this paper, the work has been reported for the content classification for a newly drafted patent document using Probabilistic Latent Semantic Analysis technique. The probabilistic latent semantic analysis (PLSA) is used for automated indexing of the document by creating an indexer which tokenizes the documents and creates a proper generative model. Herein a singular value decomposition model is used for compacting the size of term document matrix and their co-occurrences in the matrix. The objective is to take up the large document corpora generated from the past patent document to categorize documents based on the concept generated model. The approach is illustrated and has been tested for by an example classification of the content for two typical US Patent Classes, and has been found to work well for them.

References

Atsushi Fujii, Makoto lwayama, Noriko kando, Introduction to the Special issue on patnet proceesing, Information Processing & Management, Science Direct, Volume 43, issue 5, September 2007 Google ScholarDigital Library
Kuei-Kuei Lai, Shiao-Jun, Wu, Using the Patent co-citation approach to establish a new patent classification system, Information Processing and Management International Journal (ACM) Volume 41, issue2, 2005 Google ScholarDigital Library
Andreea Moldovan, Radu Ioan Bot, Gert Wanka, Latent Semantic indexing for Patent Documents, International Journal of Applied mathematics and Computer Science, 2005, Vol, 15, No, 4, 551--560.Google Scholar
S. Deerwester, S. T. Dumais, G. W. Furnas, Landauer. T. K., and R. Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41, 1990.Google Scholar
T. Hofmann, J. Puzicha, and M. I. Jordan. Unsupervised learning from dyadic data. In Advances in Neural Information Processing Systems, volume 11. MIT Press, 1999 Google ScholarDigital Library
Yanhong Liang, Runhua Tan, Chaoyang Wang, Zhiguang, Computer aided Classification of Patents Oriented to TRIZ, proceeding of the 2009 IEEE. 978-1-4244-4870-8.Google Scholar
R. H. Tan, Q. Y. Tan, C. Y. Yuan, "Theory of Inventive Problem Solving (TRIZ)---The process, tools and developing trends of TRIZ ", Journal of Machine Design, vol.18, no. 7, 2001, pp7--11 (in Chinese)Google Scholar

Index Terms

Patent classification of the new invention using PLSA
1. Information systems
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Database theory
      1. Database query processing and optimization (theory)

Recommendations

Patent Mining: A Survey

Patent documents are important intellectual resources of protecting interests of individuals, organizations and companies. Different from general web documents, patent documents have a well-defined format including frontpage, description, nclaims, and ...
Read More
Comparison of IPC and USPC classification systems in patent prior art searches
PaIR '10: Proceedings of the 3rd international workshop on Patent information retrieval

Patent classification systems are used to help scrutinize patent applications for possible violations of the novelty and non-obviousness/inventive steps of a patentability test. There are several different patent classification systems in use today, ...
Read More
Searching in Cooperative Patent Classification: Comparison between keyword and concept-based search

International patent corpus is a gigantic source containing today about 80million of documents. Every patent is manually analyzed by patent officers and then classified by a specific code called Patent Class (PC). Cooperative Patent Classification CPC ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
IITM '10: Proceedings of the First International Conference on Intelligent Interactive Technologies and Multimedia
December 2010
355 pages
ISBN:9781450304085
DOI:10.1145/1963564
Editors:
M. D. Tiwari
IIIT Allahabad, India
,
R. C. Tripathi
IIIT Allahabad, India
,
Anupam Agrawal
IIIT Allahabad, India
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 27 December 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
PLSA
automated indexing
content classification
inventions
patent classification
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 259
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Patent classification of the new invention using PLSA

IITM '10: Proceedings of the First International Conference on Intelligent Interactive Technologies and Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Patent Mining: A Survey

Comparison of IPC and USPC classification systems in patent prior art searches

Searching in Cooperative Patent Classification: Comparison between keyword and concept-based search

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Patent classification of the new invention using PLSA

IITM '10: Proceedings of the First International Conference on Intelligent Interactive Technologies and Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Patent Mining: A Survey

Comparison of IPC and USPC classification systems in patent prior art searches

Searching in Cooperative Patent Classification: Comparison between keyword and concept-based search

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media