Automatic segmentation of digitalized historical manuscripts

Grana, Costantino; Borghesani, Daniele; Cucchiara, Rita

doi:10.1007/s11042-010-0561-8

Automatic segmentation of digitalized historical manuscripts

Published: 09 July 2010

Volume 55, pages 483–506, (2011)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Costantino Grana¹,
Daniele Borghesani¹ &
Rita Cucchiara¹

348 Accesses
21 Citations
Explore all metrics

Abstract

The artistic content of historical manuscripts provides a lot of challenges in terms of automatic text extraction, picture segmentation and retrieval by similarity. In particular this work addresses the problem of automatic extraction of meaningful pictures, distinguishing them from handwritten text and floral and abstract decorations. The proposed solution firstly employs a circular statistics description of a directional histogram in order to extract text. Then visual descriptors are computed over the pictorial regions of the page: the semantic content is distinguished from the decorative parts using color histograms and a novel texture feature called Gradient Spatial Dependency Matrix. The feature vectors are finally processed using an embedding procedure which allows increased performance in later SVM classification. Results for both feature extraction and embedding based classification are reported, supporting the effectiveness of the proposal on high resolution replicas of artistic manuscripts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Barbu A Learning real-time MRF inference for image denoising. In: Computer vision and pattern recognition
Bigun J, Bhattacharjee SK, Michel S (1996) In: Orientation radiograms for image retrieval: an alternative to segmentation, vol 3, pp 346–350
Bishop C (2006) Pattern recognition and machine learning. Springer
Chen N, Blostein D (2007) A survey of document image classification: problem statement, classifier architecture and performance evaluation. Int J Doc Anal Recog 10:1–16
Article MATH Google Scholar
Crouse MS, Nowak RD, Baraniuk RG (1998) Wavelet-based statistical signal processing using hidden Markov models. IEEE Trans Signal Process 46:886–902
Article MathSciNet Google Scholar
Diligenti M, Frasconi P, Gori M (2003) Hidden tree Markov models for document image classification. IEEE Trans Pattern Anal Mach Intell 25:519–523
Article Google Scholar
Fataicha Y, Cheriet M, Nie J, et al (2002) Content analysis in document images: a scale Space approach. In: International conference on pattern recognition, vol 3. IEEE Computer Society, pp 335–338
Gill G (1981) Evaluation and inversion of the ratios of modified Bessel functions, $I_0\left(x\right)/I_1\left(x\right)$ and $I_{1.5}\left(x\right)/I_{0.5}\left(x\right)$. ACM Trans Math Softw 7:199–208
Article Google Scholar
Grana C, Vezzani R, Cucchiara R (2007) Enhancing HSV histograms with achromatic points detection for video retrieval. In: International conference on image and video retrieval, pp 302–308
Grana C, Borghesani D, Cucchiara R (2008) Describing texture directions with Von Mises distributions. In: International conference on pattern recognition
Grana C, Borghesani D, Cucchiara R (2009) Fast block based connected components labeling. In: Proceedings of the IEEE international conference on image processing. Cairo, Egypt
Haralick RM, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Trans Syst Man Cybern 3:610–621
Article Google Scholar
Hjaltason G, Samet H (2003) Properties of Embedding Methods for Similarity Searching in Metric Spaces. IEEE Transactions on Pattern Analysis and Machine Intelligence 25:530–549.
Article Google Scholar
Hu J, Kashi R, Wilfong R (1999) Document classification using layout analysis. In: International workshop on database and expert systems applications. IEEE Computer Society, pp 556–560
Jain A, Dubes R (1988) Algorithms for clustering data. Prentice-Hall, Inc
Joachims T (1998) Text categorization with support vector machines: learning with many relevant features. In: European conference on machine learning. Springer, pp 137–142
Joachims T (2002) Learning to classify text using support vector machines: methods, theory, and algorithms. Kluwer Academic Publishers/Springer
Journet N, Ramel J, Mullot R et al (2008) Document image characterization using a multiresolution analysis of the texture: application to old documents. Int J Doc Anal Recog 11:9–18
Article Google Scholar
Kavallieratou E (2005) A binarization algorithm specialized on document images and photos. In: International conference on document analysis and recognition. IEEE Computer Society, pp 463–467
Kitamoto A, Onishi M, Ikezaki T, et al (2006) Digital bleaching and content extraction for the digital archive of rare books. In: International conference on document image analysis for libraries. IEEE Computer Society, pp 133–144
Kittler J, Illingworth J (1985) Relaxation labelling algorithms—a review. Image Vis Comput 3:206–216
Article Google Scholar
Konidaris T, Gatos B, Ntzios K, et al (2007) Keyword-guided word spotting in historical printed documents using synthetic data and user feedback. Int J Doc Anal Recog 9:167–177
Article Google Scholar
Le Bourgeois F, Emptoz H (2007) DEBORA: Digital accEss to BOoks of the RenAissance. Int J Doc Anal Recog 9:193–221
Article Google Scholar
Le Bourgeois F, Trinh E, Allier B, et al (2004) Document images analysis solutions for digital libraries. In: International conference on document image analysis for libraries. IEEE Computer Society, pp 2–24
Meng G, Zheng N, Song Y, et al (2007) Document images retrieval based on multiple features combination. In: International conference on document analysis and recognition, vol 1. IEEE Computer Society, pp 143–147
Nagy G (2000) Twenty years of document image analysis in PAMI. IEEE Trans Pattern Anal Mach Intell 22:38–62
Article Google Scholar
Nicolas S, Dardenne J, Paquet T, et al (2007) Document image segmentation using a 2D conditional random field model. In: International conference on document analysis and recognition, vol 1, pp 407–411
Ogier J, Tombre K (2006) Madonne: document image analysis techniques for cultural heritage documents. In: Digital cultural heritage. Proceedings of 1st EVA conference, Oesterreichische Computer Gesellschaft, pp 107–114
Pekalska E, Duin RPW (2002) Dissimilarity representations allow for building good classifiers. Pattern Recogn Lett 23:943–956
Article MATH Google Scholar
Prati A, Calderara S, Cucchiara R (2008) Using circular statistics for trajectory analysis. In: International conference on image and video retrieval, pp 1–8
Ramel J, Busson S, Demonet M (2006) AGORA: the interactive document image analysis tool of the BVH project. In: International conference on document image analysis for libraries, pp 145–155
Sezgin M, Sankur B (2004) Survey over image thresholding techniques and quantitative performance evaluation. J Electron Imaging 13:146–168
Article Google Scholar
Shih FY, Cheng S (2005) Automatic seeded region growing for color image segmentation. Image Vis Comput 23:877–886
Article Google Scholar

Download references

Author information

Authors and Affiliations

Università degli Studi di Modena e Reggio Emilia, Via Vignolese 905/b, 41100, Modena, Italy
Costantino Grana, Daniele Borghesani & Rita Cucchiara

Authors

Costantino Grana
View author publications
You can also search for this author inPubMed Google Scholar
Daniele Borghesani
View author publications
You can also search for this author inPubMed Google Scholar
Rita Cucchiara
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Costantino Grana.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Grana, C., Borghesani, D. & Cucchiara, R. Automatic segmentation of digitalized historical manuscripts. Multimed Tools Appl 55, 483–506 (2011). https://doi.org/10.1007/s11042-010-0561-8

Download citation

Published: 09 July 2010
Issue Date: December 2011
DOI: https://doi.org/10.1007/s11042-010-0561-8

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic segmentation of digitalized historical manuscripts

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Historical document layout analysis using anisotropic diffusion and geometric features

Digitization of Ancient Manuscripts and Inscriptions - A Review

Texture feature benchmarking and evaluation for historical document image analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Automatic segmentation of digitalized historical manuscripts

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Historical document layout analysis using anisotropic diffusion and geometric features

Digitization of Ancient Manuscripts and Inscriptions - A Review

Texture feature benchmarking and evaluation for historical document image analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now