skip to main content
10.1145/1815330.1815380acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

Associating figures with descriptions for patent documents

Published: 09 June 2010 Publication History

Abstract

Patent document images maintained by the U.S. patent database have a specific format, in which figures and text descriptions are separated into different sections. This makes it difficult for users to refer to a figure while reading the description or vice versa. This paper introduces a method to associate figures with corresponding description paragraphs, and thus help to make patent documents more friendly for users to browse. In this method, after extracting individual figures out of the drawing section, figures and relevant descriptions are associated by evaluating the similarity between the text content of figures and description paragraphs using vector space model.

References

[1]
H. S. Baird. Background structure in document images. Document Image Analysis, pages 17--34, 1994.
[2]
M. de Berg, O. Cheong, M. van Kreveld, and M. Overmars. Computational Geometry: Algorithms and Applications, Chapter 9. Springer-Verlag, 2008.
[3]
R. Gonzalez and R. Woods. Digital Image Processing. Addison-Wesley Publishing Company, 1992.
[4]
L. Gorman. The document spectrum for page layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15:1162--1173, 1993.
[5]
A. and Iwata M. Kise, K. and Sato. Segmentation of page images using the area voronoi diagram. Computer Vision and Image Understanding, 70:370--382, 1998.
[6]
Linlin Li and Chew Lim Tan. A graphics image processing system. In The Eighth IAPR International Workshop on Document Analysis Systems, pages 455--462, 2008.
[7]
G. Nagy, S. Seth, and M. Viswanathan. A prototype document image analysis system for technical journals. Computer, 25:10--22, 1992.
[8]
G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Communications of the ACM, 18:613--620, 1975.
[9]
K. Y. Wong, R. G. Casey, and F. M. Wahl. Document analysis system. IBM Journal of Research and Development, pages 647--656, 1982.

Cited By

View all
  • (2024)The effect of nutritional risk management program on the growth and development of infants and toddlers with congenital heart disease after dischargeFrontiers in Pediatrics10.3389/fped.2024.141677812Online publication date: 27-Aug-2024
  • (2013)Patent RetrievalFoundations and Trends in Information Retrieval10.1561/15000000277:1(1-97)Online publication date: 20-Feb-2013
  • (2012)Patent images - a glass-encased toolProceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies10.1145/2362456.2362477(1-8)Online publication date: 5-Sep-2012

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graphics segmentation
  2. patent document processing

Qualifiers

  • Research-article

Funding Sources

Conference

DAS '10

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)The effect of nutritional risk management program on the growth and development of infants and toddlers with congenital heart disease after dischargeFrontiers in Pediatrics10.3389/fped.2024.141677812Online publication date: 27-Aug-2024
  • (2013)Patent RetrievalFoundations and Trends in Information Retrieval10.1561/15000000277:1(1-97)Online publication date: 20-Feb-2013
  • (2012)Patent images - a glass-encased toolProceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies10.1145/2362456.2362477(1-8)Online publication date: 5-Sep-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media