Skip to main content

Estimating Human Legibility in Historic Manuscript Images - A Baseline

  • Conference paper
  • First Online:
Book cover Document Analysis and Recognition – ICDAR 2021 (ICDAR 2021)

Abstract

For accessing degraded historic text sources, humanist research increasingly relies on image processing for the digital restoration of written artifacts. A problem of these restoration approaches is the lack of a generally applicable objective method to assess the results. In this work we motivate the need for a quality metric for historic manuscript images, that explicitly targets human legibility. Reviewing previous attempts to evaluate the quality of manuscript images or the success of text restoration methods, we can not find a satisfying solution: either the approaches have a limited applicability, or they are insufficiently validated with respect to human perception. In order to establish a baseline for further research in this area, we test several candidates for human legibility estimators, while proposing an evaluation framework based on a recently published dataset of expert-rated historic manuscript images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arsene, C.T.C., Church, S., Dickinson, M.: High performance software in multidimensional reduction methods for image processing with application to ancient manuscripts. Manuscript Cult. 11, 73–96 (2018)

    Google Scholar 

  2. Bosse, S., Maniry, D., Muller, K.R., Wiegand, T., Samek, W.: Deep neural networks for no-reference and full-reference image quality assessment. IEEE Trans. Image Process. 27(1), 206–219 (2018)

    Article  MathSciNet  Google Scholar 

  3. Brenner, S., Sablatnig, R.: Subjective assessments of legibility in ancient manuscript images - the SALAMI dataset. In: Del Bimbo, A., et al. (eds.) ICPR 2021. LNCS, vol. 12667, pp. 68–82. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68787-8_5

    Chapter  Google Scholar 

  4. Bukhari, S.S., Kadi, A., Jouneh, M.A., Mir, F.M., Dengel, A.: anyOCR: an open-source OCR system for historical archives. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 01, pp. 305–310, November 2017

    Google Scholar 

  5. Chattopadhay, A., Sarkar, A., Howlader, P., Balasubramanian, V.N.: Grad-CAM++: generalized gradient-based visual explanations for deep convolutional networks. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 839–847, March 2018

    Google Scholar 

  6. Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 1(2), 224–227 (1979)

    Article  Google Scholar 

  7. Dey, S., et al.: Script independent approach for multi-oriented text detection in scene image. Neurocomputing 242, 96–112 (2017)

    Article  Google Scholar 

  8. Diem, M., Kleber, F., Sablatnig, R.: Text line detection for heterogeneous documents. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 743–747, August 2013

    Google Scholar 

  9. Diem, M., Sablatnig, R.: Registration of ancient manuscript images using local descriptors. In: Digital Heritage, Proceedings of the 14th International Conference on Virtual Systems and Multimedia, pp. 188–192 (2008)

    Google Scholar 

  10. Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3(3), 32–57 (1973)

    Article  MathSciNet  Google Scholar 

  11. Easton, R.L., Christens-Barry, W.A., Knox, K.T.: Spectral image processing and analysis of the Archimedes Palimpsest. In: European Signal Processing Conference (Eusipco), pp. 1440–1444 (2011)

    Google Scholar 

  12. Faigenbaum-Golovin, S., et al.: Multispectral images of ostraca: acquisition and analysis. J. Archaeol. Sci. 39 (2012)

    Google Scholar 

  13. Garg, R., Chaudhury, S.: Automatic selection of parameters for document image enhancement using image quality assessment. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp. 422–427, April 2016

    Google Scholar 

  14. Gatos, B., Ntirogiannis, K., Pratikakis, I.: ICDAR 2009 document image binarization contest (DIBCO 2009). In: 2009 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 1375–1382, July 2009

    Google Scholar 

  15. Giacometti, A., et al.: The value of critical destruction: evaluating multispectral image processing methods for the analysis of primary historical texts. Digit. Scholarsh. Hum. 32(1), 101–122 (2017)

    Google Scholar 

  16. Grüning, T., Labahn, R., Diem, M., Kleber, F., Fiel, S.: READ-BAD: a new dataset and evaluation scheme for baseline detection in archival documents. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp. 351–356, April 2018

    Google Scholar 

  17. Grüning, T., Leifert, G., Strauß, T., Michael, J., Labahn, R.: A two-stage method for text line detection in historical documents. Int. J. Doc. Anal. Recogn. (IJDAR) 22(3), 285–302 (2019)

    Article  Google Scholar 

  18. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778. IEEE, Las Vegas, June 2016

    Google Scholar 

  19. Hedjam, R., Nafchi, H.Z., Moghaddam, R.F., Kalacska, M., Cheriet, M.: ICDAR 2015 contest on MultiSpectral text extraction (MS-TEx 2015). In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 1181–1185 (2015)

    Google Scholar 

  20. Hollaus, F., Brenner, S., Sablatnig, R.: CNN based binarization of MultiSpectral document images. In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 533–538 (2019)

    Google Scholar 

  21. Hollaus, F., Diem, M., Sablatnig, R.: Improving OCR accuracy by applying enhancement techniques on MultiSpectral images. In: Proceedings - International Conference on Pattern Recognition, pp. 3080–3085 (2014)

    Google Scholar 

  22. Hollaus, F., Gau, M., Sablatnig, R.: Multispectral image acquisition of ancient manuscripts. In: Ioannides, M., Fritsch, D., Leissner, J., Davies, R., Remondino, F., Caffo, R. (eds.) EuroMed 2012. LNCS, vol. 7616, pp. 30–39. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-34234-9_4

    Chapter  Google Scholar 

  23. Kahle, P., Colutto, S., Hackl, G., Mühlberger, G.: Transkribus - a service platform for transcription, recognition and retrieval of historical documents. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 04, pp. 19–24, November 2017

    Google Scholar 

  24. Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1156–1160, August 2015

    Google Scholar 

  25. Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1484–1493, August 2013

    Google Scholar 

  26. Leifert, G., Labahn, R., Sánchez, J.A.: Two semi-supervised training approaches for automated text recognition. In: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 145–150 (Sep 2020)

    Google Scholar 

  27. Li, H., Zhu, F., Qiu, J.: CG-DIQA: no-reference document image quality assessment based on character gradient. In: 2018 24th International Conference on Pattern Recognition (ICPR), pp. 3622–3626, August 2018

    Google Scholar 

  28. Li, H., Zhu, F., Qiu, J.: Towards document image quality assessment: a text line based framework and a synthetic text line image dataset. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 551–558, September 2019

    Google Scholar 

  29. Liao, M., Shi, B., Bai, X., Wang, X., Liu, W.: TextBoxes: a fast text detector with a single deep neural network. In: Thirty-First AAAI Conference on Artificial Intelligence, February 2017

    Google Scholar 

  30. Likforman-Sulem, L., Darbon, J., Smith, E.H.: Enhancement of historical printed document images by combining total variation regularization and non-local means filtering. Image Vis. Comput. 29(5), 351–363 (2011)

    Article  Google Scholar 

  31. Liu, X., Van De Weijer, J., Bagdanov, A.D.: RankIQA: learning from rankings for no-reference image quality assessment. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 1040–1049. IEEE, October 2017

    Google Scholar 

  32. Lu, T., Dooms, A.: A deep transfer learning approach to document image quality assessment. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1372–1377, September 2019

    Google Scholar 

  33. Manap, R.A., Shao, L.: Non-distortion-specific no-reference image quality assessment: a survey. Inf. Sci. 301, 141–160 (2015)

    Article  Google Scholar 

  34. Mindermann, S.: Hyperspectral Imaging for Readability Enhancement of Historic Manuscripts. Master’s thesis, TU München (2018)

    Google Scholar 

  35. Mittal, A., Moorthy, A.K., Bovik, A.C.: No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process. 21(12), 4695–4708 (2012)

    Article  MathSciNet  Google Scholar 

  36. Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (ICPR), pp. 3538–3545. IEEE, Providence, June 2012

    Google Scholar 

  37. Obafemi-Ajayi, T., Agam, G.: Character-based automated human perception quality assessment in document images. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(3), 584–595 (2012)

    Article  Google Scholar 

  38. Ponomarenko, N., et al.: Image database TID2013: peculiarities, results and perspectives. Sig. Process. Image Commun. 30, 57–77 (2015)

    Article  Google Scholar 

  39. Pouyet, E., et al.: Revealing the biography of a hidden medieval manuscript using synchrotron and conventional imaging techniques. Anal. Chim. Acta 982, 20–30 (2017)

    Article  Google Scholar 

  40. Pratikakis, I., Zagoris, K., Karagiannis, X., Tsochatzidis, L., Mondal, T., Marthot-Santaniello, I.: ICDAR 2019 competition on document image binarization (DIBCO 2019). In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1547–1556 (Sep 2019)

    Google Scholar 

  41. Puth, M.T., Neuhäuser, M., Ruxton, G.D.: Effective use of Spearman’s and Kendall’s correlation coefficients for association between two measured traits. Anim. Behav. 102, 77–84 (2015)

    Article  Google Scholar 

  42. Ryu, J., Koo, H.I., Cho, N.I.: Language-independent text-line extraction algorithm for handwritten documents. IEEE Signal Process. Lett. 21(9), 1115–1119 (2014)

    Article  Google Scholar 

  43. Shahkolaei, A., Nafchi, H.Z., Al-Maadeed, S., Cheriet, M.: Subjective and objective quality assessment of degraded document images. J. Cult. Herit. 30, 199–209 (2018)

    Article  Google Scholar 

  44. Shaus, A., Faigenbaum-Golovin, S., Sober, B., Turkel, E.: Potential contrast - a new image quality measure. Electron. Imaging 2017(12), 52–58 (2017)

    Article  Google Scholar 

  45. Sheikh, H.R., Sabir, M.F., Bovik, A.C.: A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 15(11), 3441–3452 (2006)

    Article  Google Scholar 

  46. Stommel, M., Frieder, G.: Automatic estimation of the legibility of binarised historic documents for unsupervised parameter tuning. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 104–108, September 2011

    Google Scholar 

  47. Sulaiman, O.: Nasrudin: degraded historical document binarization: a review on issues, challenges, techniques, and future directions. J. Imaging 5(4), 48 (2019)

    Article  Google Scholar 

  48. Virtanen, T., Nuutinen, M., Vaahteranoksa, M., Oittinen, P., Häkkinen, J.: CID2013: a database for evaluating no-reference image quality assessment algorithms. IEEE Trans. Image Process. 24(1), 390–402 (2015)

    Article  MathSciNet  Google Scholar 

  49. Xu, X., Liu, L., Li, B.: A survey of CAPTCHA technologies to distinguish between human and computer. Neurocomputing 408, 292–307 (2020)

    Article  Google Scholar 

  50. Ye, P., Doermann, D.: Document image quality assessment: a brief survey. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 723–727, August 2013

    Google Scholar 

  51. Ye, P., Kumar, J., Kang, L., Doermann, D.: Real-time no-reference image quality assessment based on filter learning. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 987–994, June 2013

    Google Scholar 

  52. Ye, P., Doermann, D.: Combining preference and absolute judgements in a crowd-sourced setting. In: Proceedings of International Conference on Machine Learning, pp. 1–7 (2013)

    Google Scholar 

  53. Yousefi, M.R., Soheili, M.R., Breuel, T.M., Kabir, E., Stricker, D.: Binarization-free OCR for historical documents using LSTM networks. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1121–1125, August 2015

    Google Scholar 

  54. Zhou, X., et al.: EAST: an efficient and accurate scene text detector. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2642–2651. IEEE, Honolulu, HI, July 2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Simon Brenner .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Brenner, S., Schügerl, L., Sablatnig, R. (2021). Estimating Human Legibility in Historic Manuscript Images - A Baseline. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12823. Springer, Cham. https://doi.org/10.1007/978-3-030-86334-0_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-86334-0_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-86333-3

  • Online ISBN: 978-3-030-86334-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics