research-article

Analysis of whole-book recognition

Authors:
Pingping Xiu

Lehigh Univ., Bethlehem, PA

Lehigh Univ., Bethlehem, PA
View Profile

,
Henry S. Baird

Lehigh Univ., Bethlehem, PA

Lehigh Univ., Bethlehem, PA
View Profile

DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis SystemsJune 2010Pages 199–206https://doi.org/10.1145/1815330.1815356

Published:09 June 2010Publication History

DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems

Pages 199–206

ABSTRACT

Whole-book recognition is a document image analysis strategy that operates on the complete set of a book's page images, attempting to improve accuracy by automatic unsupervised adaptation. Our algorithm expects to be given initial iconic and linguistic models---derived from (generally errorful) OCR results and (generally incomplete) dictionaries---and then, guided entirely by evidence internal to the test set, the algorithm corrects the models yielding improved accuracy. We have found that successful corrections are often closely associated with "disagreements" between the models which can be detected within the test set by measuring cross entropy between (a) the posterior probability distribution of character classes (the recognition results from image classification alone), and (b) the posterior probability distribution of word classes (the recognition results from image classification combined with linguistic constraints). We report experiments on long passages (up to 180 pages) revealing that: (1) disagreements and error rates are strongly correlated; (2) our algorithm can drive down recognition error rates by nearly an order of magnitude; and (3) the longer the passage, the lower the error rate achievable. We also propose formal models for a book's text, for iconic and linguistic constraints, and for our whole-book recognition algorithm---and, using these, we rigorously prove sufficient conditions for the whole-book recognition strategy to succeed in the ways illustrated in the experiments.

References

British English Word Lists for Spell Checkers, Version 0.6, www.curlewcommunications.co.uk/wordlist.html.Google Scholar
S. A. Cook. The complexity of theorem proving procedures. In Proc. 3rd Ann. ACM Symp. on Theory of Computing, pages 151--158, 1971. Google ScholarDigital Library
T. Hong. Degraded Text Recognition Using Visual And Linguistic Context. PhD thesis, State University of New York at Buffalo, 1995. Google ScholarDigital Library
G. Nagy and H. S. Baird. A self-correcting 100-font classifier. In Proc., IS&T/SPIE Symp. on Electronic Imaging: Science & Technology, San Jose, CA, February 1994.Google Scholar
P. Sarkar. Style Consistency in Pattern Fields. PhD thesis, Rensselaer Polytechnic Institute, 2000. Google ScholarDigital Library
P. Sarkar. An iterative algorithm for optimal style conscious field classification. In Proc., IAPR 16th Int'l Conf. on Pattern Recognition (ICPR2002), volume 4, pages 40--43, 2002.Google ScholarCross Ref
P. Sarkar and G. Nagy. Style consistent classification of isogenous patterns. IEEE Trans. on PAMI, 27(1), January 2005. Google ScholarDigital Library
S. Veeramachaneni and G. Nagy. Style context with second order statistics. IEEE Trans. on PAMI, 27(1), January 2005. Google ScholarDigital Library
L. Vincent. Google Book Search: Document understanding on a massive scale. In Proceedings, IAPR 9th Int'l Conf. on Document Analysis and Recognition (ICDAR'07), Curitiba, BRAZIL, August 2007. Google ScholarDigital Library
P. Xiu and H. S. Baird. Towards whole-book recognition. In Proceedings., 8th IAPR Document Analysis Workshop (DAS'08), Nara, Japan, September 2008. Google ScholarDigital Library
P. Xiu and H. S. Baird. Whole book recognition using mutual-entropy-based model adaptation. In Proc., IS&T/SPIE Document Recognition & Retrieval XII Conf., San Jose, CA, January 2008.Google ScholarCross Ref
P. Xiu and H. S. Baird. Scaling-up whole book recognition. In Proceedings, IAPR 10th Int'l Conf. on Document Analysis and Recognition (ICDAR'09), Barcelona, Spain, July 2009. Google ScholarDigital Library
P. Xiu and H. S. Baird. Incorporating linguistic post-processing into whole-book recognition. In Proc., IS&T/SPIE Document Recognition & Retrieval XII Conf., San Jose, CA, January 2010.Google Scholar

Recommendations

Whole-Book Recognition

Whole-book recognition is a document image analysis strategy that operates on the complete set of a book's page images using automatic adaptation to improve accuracy. We describe an algorithm which expects to be initialized with approximate iconic and ...
Read More
Towards Whole-Book Recognition
DAS '08: Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems

We describe experimental results for unsupervised recognition of the textual contents of book-images using fully automatic mutual-entropy-based model adaptation. Each experiment starts with approximate {\it iconic} and{\it linguistic} models---derived ...
Read More
Scaling Up Whole-Book Recognition
ICDAR '09: Proceedings of the 2009 10th International Conference on Document Analysis and Recognition

We describe the results of large-scale experiments with algorithms for unsupervised improvement of recognition of book-images using fully automatic mutual-entropy-based model adaptation. Each experiment is initialized with an imperfect iconic model ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
General Chairs:
David Doermann
University of Maryland, College Park
,
Venu Govindaraju
University at Buffalo, SUNY
,
Daniel Lopresti
Lehigh University
,
Prem Natarajan
Raytheon BBN Technologies
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 9 June 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
adaptive classification
anytime algorithm
cross entropy
digital library
isogeny
whole-book recognition
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 113
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Analysis of whole-book recognition

DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems

ABSTRACT

References

Cited By

Recommendations

Whole-Book Recognition

Towards Whole-Book Recognition

Scaling Up Whole-Book Recognition