skip to main content
10.1145/1815330.1815371acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

Higher order MRF for foreground-background separation in multi-spectral images of historical manuscripts

Published: 09 June 2010 Publication History

Abstract

Multi-spectral imaging for the analysis and preservation of ancient documents has gained high attention in recent years. While readability enhancement is based on the multi-spectral image corpus, foreground-background separation still relies mainly on gray level or color images. In this paper we propose a foreground-background separation algorithm designed for multi-spectral images. The main contribution is the simultaneously utilization of spectral and spatial features. While spectral features incorporate the spectral components of the multi-spectral images, the spatial features are based on stroke properties. Higher order Markov Random Fields enables an efficient way to combine both features. To solve higher order energy functions, we introduce a new message update rule in the well known belief propagation algorithm based on a higher order potential function.

References

[1]
J. Banerjee, A. Namboodiri, and C. Jawahar. Contextual restoration of severely degraded document images. In International Conference on Computer Vision and Pattern Recognition, pages 517--524, Los Alamitos, CA, USA, 2009.
[2]
J. Besag. On the statistical analysis of dirty pictures. Journal of the Royal Statistical Society, 48(3):259--302, 1986.
[3]
J. Brauers, N. Schulte, and T. Aach. Multispectral Filter-Wheel Cameras: Geometric Distortion Model and Compensation Algorithms. IEEE Trans. on Image Processing, 17(2):2368--2380, December 2008.
[4]
H. Cao and V. Govindaraju. Preprocessing of Low-Quality Handwritten Documents Using Markov Random Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(7):1184--1194, 2009.
[5]
Y. T. Cui and Q. Huang. Character extraction of license plates from video. In International Conference on Computer Vision and Pattern Recognition, pages 502--507, Washington, DC, USA, 1997.
[6]
M. Diem, M. Lettner, and R. Sablatnig. Registration of multi-spectral manuscript images. In 8th International Symposium on Virtual Reality, Archaeology and Cultural Heritage, pages 133--140, Brighton, UK, 2007.
[7]
R. Duda, P. Hart, and D. Stork. Pattern Classification. Wiley-Interscience, New York, 2nd edition, 2001.
[8]
R. Easton, K. Knox, and W. Christens-Barry. Multispectral Imaging of the Archimedes Palimpsest. In 32nd Applied Image Pattern Recognition Workshop, pages 111--118, Washington, DC, October 2003.
[9]
R. Farrahi Moghaddam and M. Cheriet. RSLDI: Restoration of single-sided low-quality document images. Pattern Recognition Letters, 42(12):3355--3364, 2009.
[10]
P. F. Felzenszwalb and D. P. Huttenlocher. Efficient belief propagation for early vision. International Journal of Computer Vision, 70(1):41--54, 2006.
[11]
U. Garain, T. Paquet, and L. Heutte. On foreground - background separation in low quality document images. International Journal on Document Analysis and Recognition, 8(1):47--63, 2006.
[12]
B. Gatos, K. Ntirogiannis, and I. Pratikakis. Icdar 2009 document image binarization contest (dibco 2009). In 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, July 2009.
[13]
B. Gatos, I. Pratikakis, and I. Perantonis. Adaptive degraded document image binarization. Pattern Recognition, 39(3):317--327, 2006.
[14]
S. Geman and D. Geman. Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6(6):721--741, November 1984.
[15]
M. Gupta, S. Rajaram, N. Petrovic, and T. Huang. Models for patch based image restoration. In Computer Vision and Pattern Recognition Workshop (CVPR 2006), pages 17--17, June 2006.
[16]
X. He, R. S. Zemel, and M. A. Carreira-Perpinan. Multiscale conditional random fields for image labeling. International Conference on Computer Vision and Pattern Recognition, 2:695--702, 2004.
[17]
Z. Kato and T.-C. Pong. A markov random field image segmentation model for color textured images. Image and Vision Computing, 24:1103--1114, 2006.
[18]
P. Kohli, M. Kumar, and P. Torr. P3 & beyond: Move making algorithms for solving higher order functions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(9):1645--1656, 2009.
[19]
V. Kolmogorov and C. Rother. Comparison of energy minimization algorithms for highly connected graphs. In 9th European Conference on Computer Vision, pages 1--15, 2006.
[20]
J. G. Kuk, N. I. Cho, and K. M. Lee. Map-mrf approach for binarization of degraded document image. In International Conference on Image Processing, pages 2612--2615, San Diego, California, U.S.A., October 2008.
[21]
J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In Proceedings of the 18th International Conference on Machine Learning, pages 282--289, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc.
[22]
T. Lelore and F. Bouchara. Document image binarisation using markov field model. In International Conference on Document Analysis and Recognition, pages 551--555, Barcelona, Spain, 2009.
[23]
M. Lettner, F. Kleber, R. Sablatnig, and H. Miklas. Contrast Enhancement in Multispectral Images by Emphasizing Text Region. In 8th IAPR International Workshop on Document Analysis Systems, pages 225--232, Nara, Japan, September 2008.
[24]
M. Lettner and R. Sablatnig. Spatial and spectral based segmentation of text in multispectral images of ancient documents. In 10th International Conference on Document Analysis and Recognition (ICDAR 2009), pages 813--817, Barcelona, Spain, July 2009.
[25]
S. Li. Markov Random Field Modeling in Image Analysis. Springer, 3 edition, 2009.
[26]
H. Miklas, M. Gau, F. Kleber, M. Lettner, M. Vill, R. Sablatnig, M. Schreiner, M. Melcher, and G. Hammerschmid. St. Catherine's Monastery on Mount Sinai and the Balkan-Slavic Manuscript-Tradition. In H. Miklas and A. Miltenova, editors, Slovo: Towards a Digital Library of South Slavic Manuscripts, pages 13--36, 2008.
[27]
T. Obafemi-Ajayi, G. Agam, and O. Frieder. Efficient MRF approach to document image enhancement. In 19th International Conference on Pattern Recognition, pages 1--4, Tampa, Florida, USA, December 2008.
[28]
V. Pervouchine, G. Leedham, and K. Melikhov. Handwritten character skeletonisation for forensic document analysis. In Proceedings of the ACM Symposium on Applied Computing, pages 754--758, Santa Fe, New Mexico, USA, 2005.
[29]
B. Potetz and T. S. Lee. Efficient belief propagation for higher-order cliques using linear constraint nodes. Computer Vision and Image Understanding, 112(1):39--54, 2008.
[30]
K. Rapantzikos and C. Balas. Hyperspectral imaging: potential in non-destructive analysis of palimpsests. In IEEE International Conference on Image Processing, volume 2, pages 618--21, 2005.
[31]
J. Richards and X. Jia. Remote Sensing Digital Image Analysis: An Introduction. Springer, 1999.
[32]
S. Roth and M. J. Black. Fields of experts. International Journal of Computer Vision, 82(2):205--229, 2009.
[33]
J. Sauvola and M. Pietikainen. Adaptive document image binarization. Pattern Recognition, 33(2):225--236, 2000.
[34]
P. Shiel, M. Rehbein, and J. Keating. The ghost in the manuscript: Hyperspectral text recovery and segmentation. In M. Rehbein and P. S. und Torsten Schaßan, editors, Codicology and Palaeography in the Digital Age, chapter 2, pages 159--174. Norderstedt: Books on Demand, 2009.
[35]
R. Szeliski, R. Zabih, D. Scharstein, O. Veksler, V. Kolmogorov, A. Agarwala, M. Tappen, and C. Rother. A Comparative Study of Energy Minimization Methods for Markov Random Fields with Smoothness-Based Priors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6):1068--1080, 2008.
[36]
M. F. Tappen and W. T. Freeman. Comparison of graph cuts with belief propagation for stereo, using identical mrf parameters. In International Conference on Computer Vision and Pattern Recognition, page 900, Washington, DC, USA, 2003.
[37]
P. Thouin, Y. Du, and C. Chang. Low resolution expansion of gray scale text images using gibbs-markov random field model. In Symposium on Document Image Understanding Technology, pages 41--47, April 2001.
[38]
M. J. Wainwright, T. S. Jaakkola, and A. S. Willsky. Map estimation via agreement on trees: message-passing and linear programming. IEEE Trans. on Information Theory, 51:2005, 2005.
[39]
C. Wolf and D. S. Doermann. Binarization of low quality text using a markov random field model. In Internationa Conference on Pattern Recognition, Quebec, Canada, August 2002.
[40]
X. Wu and S. Shah. A Conditional Random Field Model for Cell Segmentation Using Multispectral Data. In Proceedings of MICCAI Workshop on Optical Tissue Image analysis in Microscopy, Histopathology and Endoscopy, London, September 2009.
[41]
J. S. Yedidia, W. T. Freeman, and Y. Weiss. Generalized belief propagation. In Advances in Neural Information Processing Systems, pages 689--695. MIT Press, 2000.
[42]
D.-Q. Zhang and S.-F. Chang. Learning to detect scene text using a higher-order mrf with belief propagation. In International Conference on Computer Vision and Pattern Recognition, pages 101--108, Washington, DC, USA, 2004.

Cited By

View all
  • (2021)Blind Decomposition of Multispectral Document Images Using Orthogonal Nonnegative Matrix FactorizationIEEE Transactions on Image Processing10.1109/TIP.2021.308826630(5997-6012)Online publication date: 2021
  • (2020)MSdB-NMF: MultiSpectral Document Image Binarization Framework via Non-Negative Matrix Factorization ApproachIEEE Transactions on Image Processing10.1109/TIP.2020.302361329(9099-9112)Online publication date: 2020
  • (2019)Blind Source Separation Based Framework for Multispectral Document Images Binarization2019 International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2019.00237(1476-1481)Online publication date: Sep-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Markov random fields
  2. document image analysis
  3. foreground-background separation
  4. multi-spectral images

Qualifiers

  • Research-article

Funding Sources

Conference

DAS '10

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Blind Decomposition of Multispectral Document Images Using Orthogonal Nonnegative Matrix FactorizationIEEE Transactions on Image Processing10.1109/TIP.2021.308826630(5997-6012)Online publication date: 2021
  • (2020)MSdB-NMF: MultiSpectral Document Image Binarization Framework via Non-Negative Matrix Factorization ApproachIEEE Transactions on Image Processing10.1109/TIP.2020.302361329(9099-9112)Online publication date: 2020
  • (2019)Blind Source Separation Based Framework for Multispectral Document Images Binarization2019 International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2019.00237(1476-1481)Online publication date: Sep-2019
  • (2018)MultiSpectral Image Binarization using GMMs2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)10.1109/ICFHR-2018.2018.00105(570-575)Online publication date: Aug-2018
  • (2015)Binarization of MultiSpectral Document ImagesProceedings, Part II, of the 16th International Conference on Computer Analysis of Images and Patterns - Volume 925710.1007/978-3-319-23117-4_10(109-120)Online publication date: 2-Sep-2015
  • (2013)Ground-Truth Estimation in Multispectral Representation SpaceProceedings of the 2013 12th International Conference on Document Analysis and Recognition10.1109/ICDAR.2013.45(190-194)Online publication date: 25-Aug-2013
  • (2011)Combining statistical and geometrical classifiers for text extraction in multispectral document imagesProceedings of the 2011 Workshop on Historical Document Imaging and Processing10.1145/2037342.2037359(98-105)Online publication date: 16-Sep-2011
  • (2011)Novel Data Representation for Text Extraction from Multispectral Historical Document ImagesProceedings of the 2011 International Conference on Document Analysis and Recognition10.1109/ICDAR.2011.43(172-176)Online publication date: 18-Sep-2011
  • (2011)Scale Space Binarization Using Edge Information Weighted by a Foreground EstimationProceedings of the 2011 International Conference on Document Analysis and Recognition10.1109/ICDAR.2011.238(1180-1184)Online publication date: 18-Sep-2011

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media