Resolution enhancement of textual images via multiple coupled dictionaries and adaptive sparse representation selection

Walha, Rim; Drira, Fadoua; Lebourgeois, Frank; Garcia, Christophe; Alimi, Adel M.

doi:10.1007/s10032-014-0235-6

Original Paper
Published: 08 January 2015

Volume 18, pages 87–107 (2015)

International Journal on Document Analysis and Recognition (IJDAR)

Rim Walha¹,
Fadoua Drira¹,
Frank Lebourgeois²,
Christophe Garcia² &
Adel M. Alimi¹

605 Accesses
20 Citations
Abstract

Resolution enhancement has become a valuable research topic due to the rapidly growing need for high-quality images in various applications. Various resolution enhancement approaches have been successfully applied to natural images. Nevertheless, their direct application to textual images is not efficient enough, because textual images have specific characteristics that distinguish them from natural images. Insufficient resolution causes a substantial loss of detail, which can make text unreadable by humans and unrecognizable by OCR systems. To address these issues, a sparse coding-based approach is proposed to enhance the resolution of a textual image. Three major contributions are presented in this paper: (1) Multiple coupled dictionaries are learned from a clustered database and selected adaptively for a better reconstruction. (2) An automatic process is developed to collect the training database, which contains writing patterns extracted from high-quality character images. (3) A new local feature descriptor well suited to the specificities of writing is proposed for the clustering of the training database. The performance of these contributions is evaluated qualitatively and quantitatively on various types of low-resolution textual images. The proposed approach achieves significant improvements in visual quality and character recognition rates, as confirmed by a detailed comparative study with state-of-the-art upscaling approaches.
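The idea behind contribution (1) can be illustrated with a small sketch. The abstract does not state how a dictionary is selected or which sparse solver is used, so everything below is an assumption made for illustration: the low-resolution patch is coded on each low-resolution dictionary with a simple orthogonal matching pursuit, the pair with the smallest reconstruction residual is kept, and the same sparse code is applied to the paired high-resolution dictionary. The names `omp`, `upscale_patch`, and `dict_pairs` are hypothetical, and the dictionaries are synthetic (orthonormal low-resolution atoms, random high-resolution atoms) rather than learned from character images.

```python
import numpy as np


def omp(D, y, k):
    """Greedy orthogonal matching pursuit: approximate y with at most k atoms of D."""
    residual = y.copy()
    support = []
    sol = np.zeros(0)
    coef = np.zeros(D.shape[1])
    for _ in range(k):
        # Pick the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:
            break
        support.append(j)
        # Least-squares refit of y on all atoms selected so far.
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    if support:
        coef[support] = sol
    return coef


def upscale_patch(lr_patch, dict_pairs, k=2):
    """Code lr_patch on every LR dictionary and keep the pair with the smallest residual.

    Assumed selection rule (not taken from the paper): minimum LR reconstruction error.
    Returns the HR estimate and the index of the selected dictionary pair.
    """
    best = None
    for idx, (D_lr, D_hr) in enumerate(dict_pairs):
        alpha = omp(D_lr, lr_patch, k)
        err = np.linalg.norm(lr_patch - D_lr @ alpha)
        if best is None or err < best[0]:
            # The shared sparse code alpha is reused with the coupled HR dictionary.
            best = (err, idx, D_hr @ alpha)
    return best[2], best[1]


# Synthetic stand-in for two coupled dictionary pairs (one per cluster).
rng = np.random.default_rng(0)
Q, _ = np.linalg.qr(rng.standard_normal((8, 8)))
dict_pairs = [
    (Q[:, :4], rng.standard_normal((32, 4))),  # cluster 0: LR atoms, HR atoms
    (Q[:, 4:], rng.standard_normal((32, 4))),  # cluster 1: LR atoms, HR atoms
]

# Build a test patch from cluster 1 with a 2-sparse code shared by LR and HR.
alpha_true = np.array([0.0, 1.5, 0.0, -2.0])
lr_patch = dict_pairs[1][0] @ alpha_true
hr_true = dict_pairs[1][1] @ alpha_true

hr_est, chosen = upscale_patch(lr_patch, dict_pairs, k=2)
print("selected dictionary pair:", chosen)
print("max HR error:", float(np.max(np.abs(hr_est - hr_true))))
```

In this toy setting the two low-resolution dictionaries span orthogonal subspaces, so the residual test separates them cleanly. With learned, overcomplete dictionaries the residuals would be closer together, and a different selection criterion might be preferable.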

Author information

Authors and Affiliations

REGIM-Lab, ENIS, University of Sfax, BP 1173, 3038, Sfax, Tunisia
Rim Walha, Fadoua Drira & Adel M. Alimi
LIRIS, INSA-Lyon, CNRS, UMR5205, University of Lyon, Lyon, 69621, France
Frank Lebourgeois & Christophe Garcia

Corresponding author

Correspondence to Rim Walha.

About this article

Cite this article

Walha, R., Drira, F., Lebourgeois, F. et al. Resolution enhancement of textual images via multiple coupled dictionaries and adaptive sparse representation selection. IJDAR 18, 87–107 (2015). https://doi.org/10.1007/s10032-014-0235-6

Received: 20 May 2014
Revised: 15 December 2014
Accepted: 17 December 2014
Published: 08 January 2015
Issue Date: March 2015
DOI: https://doi.org/10.1007/s10032-014-0235-6