Estimating Human Legibility in Historic Manuscript Images - A Baseline

Brenner, Simon; Schügerl, Lukas; Sablatnig, Robert

doi:10.1007/978-3-030-86334-0_32

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12823))

Included in the following conference series:

International Conference on Document Analysis and Recognition

3343 Accesses

Abstract

For accessing degraded historic text sources, humanist research increasingly relies on image processing for the digital restoration of written artifacts. A problem of these restoration approaches is the lack of a generally applicable objective method to assess the results. In this work we motivate the need for a quality metric for historic manuscript images, that explicitly targets human legibility. Reviewing previous attempts to evaluate the quality of manuscript images or the success of text restoration methods, we can not find a satisfying solution: either the approaches have a limited applicability, or they are insufficiently validated with respect to human perception. In order to establish a baseline for further research in this area, we test several candidates for human legibility estimators, while proposing an evaluation framework based on a recently published dataset of expert-rated historic manuscript images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Arsene, C.T.C., Church, S., Dickinson, M.: High performance software in multidimensional reduction methods for image processing with application to ancient manuscripts. Manuscript Cult. 11, 73–96 (2018)
Google Scholar
Bosse, S., Maniry, D., Muller, K.R., Wiegand, T., Samek, W.: Deep neural networks for no-reference and full-reference image quality assessment. IEEE Trans. Image Process. 27(1), 206–219 (2018)
Article MathSciNet Google Scholar
Brenner, S., Sablatnig, R.: Subjective assessments of legibility in ancient manuscript images - the SALAMI dataset. In: Del Bimbo, A., et al. (eds.) ICPR 2021. LNCS, vol. 12667, pp. 68–82. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68787-8_5
Chapter Google Scholar
Bukhari, S.S., Kadi, A., Jouneh, M.A., Mir, F.M., Dengel, A.: anyOCR: an open-source OCR system for historical archives. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 01, pp. 305–310, November 2017
Google Scholar
Chattopadhay, A., Sarkar, A., Howlader, P., Balasubramanian, V.N.: Grad-CAM++: generalized gradient-based visual explanations for deep convolutional networks. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 839–847, March 2018
Google Scholar
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 1(2), 224–227 (1979)
Article Google Scholar
Dey, S., et al.: Script independent approach for multi-oriented text detection in scene image. Neurocomputing 242, 96–112 (2017)
Article Google Scholar
Diem, M., Kleber, F., Sablatnig, R.: Text line detection for heterogeneous documents. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 743–747, August 2013
Google Scholar
Diem, M., Sablatnig, R.: Registration of ancient manuscript images using local descriptors. In: Digital Heritage, Proceedings of the 14th International Conference on Virtual Systems and Multimedia, pp. 188–192 (2008)
Google Scholar
Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3(3), 32–57 (1973)
Article MathSciNet Google Scholar
Easton, R.L., Christens-Barry, W.A., Knox, K.T.: Spectral image processing and analysis of the Archimedes Palimpsest. In: European Signal Processing Conference (Eusipco), pp. 1440–1444 (2011)
Google Scholar
Faigenbaum-Golovin, S., et al.: Multispectral images of ostraca: acquisition and analysis. J. Archaeol. Sci. 39 (2012)
Google Scholar
Garg, R., Chaudhury, S.: Automatic selection of parameters for document image enhancement using image quality assessment. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp. 422–427, April 2016
Google Scholar
Gatos, B., Ntirogiannis, K., Pratikakis, I.: ICDAR 2009 document image binarization contest (DIBCO 2009). In: 2009 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 1375–1382, July 2009
Google Scholar
Giacometti, A., et al.: The value of critical destruction: evaluating multispectral image processing methods for the analysis of primary historical texts. Digit. Scholarsh. Hum. 32(1), 101–122 (2017)
Google Scholar
Grüning, T., Labahn, R., Diem, M., Kleber, F., Fiel, S.: READ-BAD: a new dataset and evaluation scheme for baseline detection in archival documents. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp. 351–356, April 2018
Google Scholar
Grüning, T., Leifert, G., Strauß, T., Michael, J., Labahn, R.: A two-stage method for text line detection in historical documents. Int. J. Doc. Anal. Recogn. (IJDAR) 22(3), 285–302 (2019)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778. IEEE, Las Vegas, June 2016
Google Scholar
Hedjam, R., Nafchi, H.Z., Moghaddam, R.F., Kalacska, M., Cheriet, M.: ICDAR 2015 contest on MultiSpectral text extraction (MS-TEx 2015). In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 1181–1185 (2015)
Google Scholar
Hollaus, F., Brenner, S., Sablatnig, R.: CNN based binarization of MultiSpectral document images. In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 533–538 (2019)
Google Scholar
Hollaus, F., Diem, M., Sablatnig, R.: Improving OCR accuracy by applying enhancement techniques on MultiSpectral images. In: Proceedings - International Conference on Pattern Recognition, pp. 3080–3085 (2014)
Google Scholar
Hollaus, F., Gau, M., Sablatnig, R.: Multispectral image acquisition of ancient manuscripts. In: Ioannides, M., Fritsch, D., Leissner, J., Davies, R., Remondino, F., Caffo, R. (eds.) EuroMed 2012. LNCS, vol. 7616, pp. 30–39. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-34234-9_4
Chapter Google Scholar
Kahle, P., Colutto, S., Hackl, G., Mühlberger, G.: Transkribus - a service platform for transcription, recognition and retrieval of historical documents. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 04, pp. 19–24, November 2017
Google Scholar
Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1156–1160, August 2015
Google Scholar
Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1484–1493, August 2013
Google Scholar
Leifert, G., Labahn, R., Sánchez, J.A.: Two semi-supervised training approaches for automated text recognition. In: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 145–150 (Sep 2020)
Google Scholar
Li, H., Zhu, F., Qiu, J.: CG-DIQA: no-reference document image quality assessment based on character gradient. In: 2018 24th International Conference on Pattern Recognition (ICPR), pp. 3622–3626, August 2018
Google Scholar
Li, H., Zhu, F., Qiu, J.: Towards document image quality assessment: a text line based framework and a synthetic text line image dataset. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 551–558, September 2019
Google Scholar
Liao, M., Shi, B., Bai, X., Wang, X., Liu, W.: TextBoxes: a fast text detector with a single deep neural network. In: Thirty-First AAAI Conference on Artificial Intelligence, February 2017
Google Scholar
Likforman-Sulem, L., Darbon, J., Smith, E.H.: Enhancement of historical printed document images by combining total variation regularization and non-local means filtering. Image Vis. Comput. 29(5), 351–363 (2011)
Article Google Scholar
Liu, X., Van De Weijer, J., Bagdanov, A.D.: RankIQA: learning from rankings for no-reference image quality assessment. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 1040–1049. IEEE, October 2017
Google Scholar
Lu, T., Dooms, A.: A deep transfer learning approach to document image quality assessment. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1372–1377, September 2019
Google Scholar
Manap, R.A., Shao, L.: Non-distortion-specific no-reference image quality assessment: a survey. Inf. Sci. 301, 141–160 (2015)
Article Google Scholar
Mindermann, S.: Hyperspectral Imaging for Readability Enhancement of Historic Manuscripts. Master’s thesis, TU München (2018)
Google Scholar
Mittal, A., Moorthy, A.K., Bovik, A.C.: No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process. 21(12), 4695–4708 (2012)
Article MathSciNet Google Scholar
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (ICPR), pp. 3538–3545. IEEE, Providence, June 2012
Google Scholar
Obafemi-Ajayi, T., Agam, G.: Character-based automated human perception quality assessment in document images. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(3), 584–595 (2012)
Article Google Scholar
Ponomarenko, N., et al.: Image database TID2013: peculiarities, results and perspectives. Sig. Process. Image Commun. 30, 57–77 (2015)
Article Google Scholar
Pouyet, E., et al.: Revealing the biography of a hidden medieval manuscript using synchrotron and conventional imaging techniques. Anal. Chim. Acta 982, 20–30 (2017)
Article Google Scholar
Pratikakis, I., Zagoris, K., Karagiannis, X., Tsochatzidis, L., Mondal, T., Marthot-Santaniello, I.: ICDAR 2019 competition on document image binarization (DIBCO 2019). In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1547–1556 (Sep 2019)
Google Scholar
Puth, M.T., Neuhäuser, M., Ruxton, G.D.: Effective use of Spearman’s and Kendall’s correlation coefficients for association between two measured traits. Anim. Behav. 102, 77–84 (2015)
Article Google Scholar
Ryu, J., Koo, H.I., Cho, N.I.: Language-independent text-line extraction algorithm for handwritten documents. IEEE Signal Process. Lett. 21(9), 1115–1119 (2014)
Article Google Scholar
Shahkolaei, A., Nafchi, H.Z., Al-Maadeed, S., Cheriet, M.: Subjective and objective quality assessment of degraded document images. J. Cult. Herit. 30, 199–209 (2018)
Article Google Scholar
Shaus, A., Faigenbaum-Golovin, S., Sober, B., Turkel, E.: Potential contrast - a new image quality measure. Electron. Imaging 2017(12), 52–58 (2017)
Article Google Scholar
Sheikh, H.R., Sabir, M.F., Bovik, A.C.: A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 15(11), 3441–3452 (2006)
Article Google Scholar
Stommel, M., Frieder, G.: Automatic estimation of the legibility of binarised historic documents for unsupervised parameter tuning. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 104–108, September 2011
Google Scholar
Sulaiman, O.: Nasrudin: degraded historical document binarization: a review on issues, challenges, techniques, and future directions. J. Imaging 5(4), 48 (2019)
Article Google Scholar
Virtanen, T., Nuutinen, M., Vaahteranoksa, M., Oittinen, P., Häkkinen, J.: CID2013: a database for evaluating no-reference image quality assessment algorithms. IEEE Trans. Image Process. 24(1), 390–402 (2015)
Article MathSciNet Google Scholar
Xu, X., Liu, L., Li, B.: A survey of CAPTCHA technologies to distinguish between human and computer. Neurocomputing 408, 292–307 (2020)
Article Google Scholar
Ye, P., Doermann, D.: Document image quality assessment: a brief survey. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 723–727, August 2013
Google Scholar
Ye, P., Kumar, J., Kang, L., Doermann, D.: Real-time no-reference image quality assessment based on filter learning. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 987–994, June 2013
Google Scholar
Ye, P., Doermann, D.: Combining preference and absolute judgements in a crowd-sourced setting. In: Proceedings of International Conference on Machine Learning, pp. 1–7 (2013)
Google Scholar
Yousefi, M.R., Soheili, M.R., Breuel, T.M., Kabir, E., Stricker, D.: Binarization-free OCR for historical documents using LSTM networks. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1121–1125, August 2015
Google Scholar
Zhou, X., et al.: EAST: an efficient and accurate scene text detector. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2642–2651. IEEE, Honolulu, HI, July 2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Visual Computing and Human-Centered Technology, TU Wien, 1040, Vienna, Austria
Simon Brenner, Lukas Schügerl & Robert Sablatnig

Authors

Simon Brenner
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Schügerl
View author publications
You can also search for this author in PubMed Google Scholar
Robert Sablatnig
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Simon Brenner .

Editor information

Editors and Affiliations

Universitat Autònoma de Barcelona, Barcelona, Spain
Josep Lladós
Lehigh University, Bethlehem, PA, USA
Daniel Lopresti
Kyushu University, Fukuoka-shi, Japan
Seiichi Uchida

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Brenner, S., Schügerl, L., Sablatnig, R. (2021). Estimating Human Legibility in Historic Manuscript Images - A Baseline. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12823. Springer, Cham. https://doi.org/10.1007/978-3-030-86334-0_32

Download citation

DOI: https://doi.org/10.1007/978-3-030-86334-0_32
Published: 02 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86333-3
Online ISBN: 978-3-030-86334-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)