skip to main content
10.1145/3077136.3080698acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Applying Information Extraction for Patent Structure Analysis

Published: 07 August 2017 Publication History

Abstract

Patent engineers are spending significant time analyzing patent claim structures to grasp the range of technology covered or to compare similar patents in the same patent family. Though patent claims are the most important section in a patent, it is hard for a human to examine them. In this paper, we propose an information-extraction-based technique to grasp the patent claim structure. We confirmed that our approach is promising through empirical evaluation of entity mention extraction and the relation extraction method. We also built a preliminary interface to visualize patent structures, compare patents, and search similar patents.

References

[1]
Nadjet Bouayad-Agha, Gerard Casamayor, Gabriela Ferraro, Simon Mille, Vanesa Vidal, and Leo Wanner. 2009. Improving the Comprehension of Legal Documentation: The Case of Patent Claim. Proceedings of International Conference on Artificial Intelligence and Law (ICAIL 2009). 78--87.
[2]
Pedro Domingos and Daniel Lowd. 2009. Markov Logic: An Interface Layer for Artificial Intelligence. Morgan and Claypool Publishers.
[3]
Gabriela Ferraro, Hanna Suominen, and Jaume Nualart. 2014. Segmentation of patent claims for improving their readability. Proceedings of Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR 2014). 66--73.
[4]
Emily K. Mallory, Ce Zhang, Christopher Ré, and Russ B. Altman. 2015. Large-scale extraction of gene interactions from full-text literature using DeepDive. Bioinformatics (2015).
[5]
Christopher D. Manning, Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David McClosky. 2014. The Stanford CoreNLP Natural Language Processing Toolkit. Proceedings of ACL 2014. 55--60.
[6]
Mike Mintz, Steven Bills, Rion Snow, and Dan Jurafsky. 2009. Distant supervision for relation extraction without labeled data. Proceedings of ACL 2009. 1003--1011.
[7]
Masayuki Okamoto, Yuichi Miyamura, Ayana Yamamoto, Shuichi Toriyama, and Kentaro Takagi. 2016. Automatic Property Visualization for Material Survey Support. Proceedings of International Symposium on Semiconductor Manufacturing (ISSM 2016).
[8]
Peter Parapatics and Michael Dittenbach. 2009. Patent Claim Decomposition for Improved Information Extraction. Proceedings of International Workshop on Patent Information Retrieval (PaIR 2009). 33--36.
[9]
Shanan E. Peters, Ce Zhang, Miron Livny, and Christopher Ré. 2014. A Machine Reading System for Assembling Synthetic Paleontological Databases. PLoS ONE, Vol. 9, 12 (2014).
[10]
David Pressman. 2006. Patent It Yourself. Nolo, Berkeley, CA.
[11]
David V. Radack. 1995. Reading and Understanding Patent Claims. JOM, Vol. 47, 11 (1995), 69.
[12]
Christopher Ré, Amir Abbas Sadeghian, Zifei Shan, Jaeho Shin, Feiran Wang, Sen Wu, and Ce Zhang. 2014. Feature Engineering for Knowledge Base Construction. IEEE Data Engineering Bulletin Vol. 37, 3 (2014), 26--40.
[13]
Svetlana Sheremetyeva. 2003. Natural language analysis of patent claims. In Proceedings of ACL Workshop on Patent Corpus Processing (PATENT 2003). 66--73.
[14]
Svetlana Sheremetyeva. 2014. Automatic Text Simplification For Handling Intellectual Property. Proceedings of the Workshop on Automatic Text Simplification: Methods and Applications in the Multilingual Society. 41--52.
[15]
Jaeho Shin, Sen Wu, Feiran Wang, Christopher De Sa, Ce Zhang, and Christopher Ré. 2015. Incremental knowledge base construction using DeepDive. Proceedings of the VLDB Endowment Vol. 8, 11 (2015), 1310--1321.
[16]
Akihiro Shinmori and Manabu Okumura. 2004. Aligning Patent Claims with Detailed Descriptions for Readability. Working Notes of NTCIR-4.
[17]
Akihiro Shinmori, Manabu Okumura, Yuzo Marukawa, and Makoto Iwayama. 2003. Patent claim processing for readability: structure analysis and term explanation. Proceedings of ACL Workshop on Patent Corpus Processing (PATENT 2003). 56--65.
[18]
Mihai Surdeanu and Heng Ji. 2014. Overview of the English Slot Filling Track at the TAC2014 Knowledge Base Population Evaluation. Proceedings of TAC-KBP 2014 Workshop.
[19]
Ayana Yamamoto, Yuichi Miyamura, Kouta Nakata, and Masayuki Okamoto. 2017. Company Relation Extraction from Web News Articles for Analyzing Industry Structure. Proceedings of IEEE International Conference on Semantic Computing (ICSC 2017). 89--92.

Cited By

View all
  • (2024)An approach for enhancing and measuring information comprehensibility for engineering designers: applied to patent documentsArtificial Intelligence for Engineering Design, Analysis and Manufacturing10.1017/S089006042400007638Online publication date: 20-Sep-2024
  • (2023)Creating a Silver Standard for Patent SimplificationProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591657(1045-1055)Online publication date: 19-Jul-2023
  • (2023)Discovering new applications: Cross-domain exploration of patent documents using causal extraction and similarity analysisWorld Patent Information10.1016/j.wpi.2023.10223875(102238)Online publication date: Dec-2023
  • Show More Cited By

Index Terms

  1. Applying Information Extraction for Patent Structure Analysis

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
    August 2017
    1476 pages
    ISBN:9781450350228
    DOI:10.1145/3077136
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 August 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. information extraction
    2. patent analysis
    3. visualization

    Qualifiers

    • Short-paper

    Conference

    SIGIR '17
    Sponsor:

    Acceptance Rates

    SIGIR '17 Paper Acceptance Rate 78 of 362 submissions, 22%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 15 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)An approach for enhancing and measuring information comprehensibility for engineering designers: applied to patent documentsArtificial Intelligence for Engineering Design, Analysis and Manufacturing10.1017/S089006042400007638Online publication date: 20-Sep-2024
    • (2023)Creating a Silver Standard for Patent SimplificationProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591657(1045-1055)Online publication date: 19-Jul-2023
    • (2023)Discovering new applications: Cross-domain exploration of patent documents using causal extraction and similarity analysisWorld Patent Information10.1016/j.wpi.2023.10223875(102238)Online publication date: Dec-2023
    • (2022)MiDTD: A Simple and Effective Distillation Framework for Distantly Supervised Relation ExtractionACM Transactions on Information Systems10.1145/350391740:4(1-32)Online publication date: 11-Jan-2022
    • (2022)Summarization, simplification, and generationExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.117627205:COnline publication date: 1-Nov-2022
    • (2021)Extraction and Evaluation of Knowledge Entities from Scientific DocumentsJournal of Data and Information Science10.2478/jdis-2021-00256:3(1-5)Online publication date: 9-Aug-2021
    • (2021)Open Relation Extraction in Patent Claims with a Hybrid NetworkWireless Communications & Mobile Computing10.1155/2021/55472812021Online publication date: 28-Apr-2021
    • (2021)Text segmentation for patent claim simplification via Bidirectional Long‐Short Term Memory and Conditional Random FieldComputational Intelligence10.1111/coin.1245538:1(205-215)Online publication date: 14-May-2021
    • (2021)Method and dataset entity mining in scientific literature: A CNN + BiLSTM model with self-attentionKnowledge-Based Systems10.1016/j.knosys.2021.107621(107621)Online publication date: Oct-2021
    • (2021)A sequence labeling model for catchphrase identification from legal case documentsArtificial Intelligence and Law10.1007/s10506-021-09296-230:3(325-358)Online publication date: 30-Jul-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media