Abstract
For accessing degraded historic text sources, humanist research increasingly relies on image processing for the digital restoration of written artifacts. A problem of these restoration approaches is the lack of a generally applicable objective method to assess the results. In this work we motivate the need for a quality metric for historic manuscript images, that explicitly targets human legibility. Reviewing previous attempts to evaluate the quality of manuscript images or the success of text restoration methods, we can not find a satisfying solution: either the approaches have a limited applicability, or they are insufficiently validated with respect to human perception. In order to establish a baseline for further research in this area, we test several candidates for human legibility estimators, while proposing an evaluation framework based on a recently published dataset of expert-rated historic manuscript images.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arsene, C.T.C., Church, S., Dickinson, M.: High performance software in multidimensional reduction methods for image processing with application to ancient manuscripts. Manuscript Cult. 11, 73–96 (2018)
Bosse, S., Maniry, D., Muller, K.R., Wiegand, T., Samek, W.: Deep neural networks for no-reference and full-reference image quality assessment. IEEE Trans. Image Process. 27(1), 206–219 (2018)
Brenner, S., Sablatnig, R.: Subjective assessments of legibility in ancient manuscript images - the SALAMI dataset. In: Del Bimbo, A., et al. (eds.) ICPR 2021. LNCS, vol. 12667, pp. 68–82. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-68787-8_5
Bukhari, S.S., Kadi, A., Jouneh, M.A., Mir, F.M., Dengel, A.: anyOCR: an open-source OCR system for historical archives. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 01, pp. 305–310, November 2017
Chattopadhay, A., Sarkar, A., Howlader, P., Balasubramanian, V.N.: Grad-CAM++: generalized gradient-based visual explanations for deep convolutional networks. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 839–847, March 2018
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 1(2), 224–227 (1979)
Dey, S., et al.: Script independent approach for multi-oriented text detection in scene image. Neurocomputing 242, 96–112 (2017)
Diem, M., Kleber, F., Sablatnig, R.: Text line detection for heterogeneous documents. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 743–747, August 2013
Diem, M., Sablatnig, R.: Registration of ancient manuscript images using local descriptors. In: Digital Heritage, Proceedings of the 14th International Conference on Virtual Systems and Multimedia, pp. 188–192 (2008)
Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3(3), 32–57 (1973)
Easton, R.L., Christens-Barry, W.A., Knox, K.T.: Spectral image processing and analysis of the Archimedes Palimpsest. In: European Signal Processing Conference (Eusipco), pp. 1440–1444 (2011)
Faigenbaum-Golovin, S., et al.: Multispectral images of ostraca: acquisition and analysis. J. Archaeol. Sci. 39 (2012)
Garg, R., Chaudhury, S.: Automatic selection of parameters for document image enhancement using image quality assessment. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp. 422–427, April 2016
Gatos, B., Ntirogiannis, K., Pratikakis, I.: ICDAR 2009 document image binarization contest (DIBCO 2009). In: 2009 10th International Conference on Document Analysis and Recognition (ICDAR), pp. 1375–1382, July 2009
Giacometti, A., et al.: The value of critical destruction: evaluating multispectral image processing methods for the analysis of primary historical texts. Digit. Scholarsh. Hum. 32(1), 101–122 (2017)
Grüning, T., Labahn, R., Diem, M., Kleber, F., Fiel, S.: READ-BAD: a new dataset and evaluation scheme for baseline detection in archival documents. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp. 351–356, April 2018
Grüning, T., Leifert, G., Strauß, T., Michael, J., Labahn, R.: A two-stage method for text line detection in historical documents. Int. J. Doc. Anal. Recogn. (IJDAR) 22(3), 285–302 (2019)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778. IEEE, Las Vegas, June 2016
Hedjam, R., Nafchi, H.Z., Moghaddam, R.F., Kalacska, M., Cheriet, M.: ICDAR 2015 contest on MultiSpectral text extraction (MS-TEx 2015). In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 1181–1185 (2015)
Hollaus, F., Brenner, S., Sablatnig, R.: CNN based binarization of MultiSpectral document images. In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 533–538 (2019)
Hollaus, F., Diem, M., Sablatnig, R.: Improving OCR accuracy by applying enhancement techniques on MultiSpectral images. In: Proceedings - International Conference on Pattern Recognition, pp. 3080–3085 (2014)
Hollaus, F., Gau, M., Sablatnig, R.: Multispectral image acquisition of ancient manuscripts. In: Ioannides, M., Fritsch, D., Leissner, J., Davies, R., Remondino, F., Caffo, R. (eds.) EuroMed 2012. LNCS, vol. 7616, pp. 30–39. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-34234-9_4
Kahle, P., Colutto, S., Hackl, G., Mühlberger, G.: Transkribus - a service platform for transcription, recognition and retrieval of historical documents. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 04, pp. 19–24, November 2017
Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1156–1160, August 2015
Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1484–1493, August 2013
Leifert, G., Labahn, R., Sánchez, J.A.: Two semi-supervised training approaches for automated text recognition. In: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 145–150 (Sep 2020)
Li, H., Zhu, F., Qiu, J.: CG-DIQA: no-reference document image quality assessment based on character gradient. In: 2018 24th International Conference on Pattern Recognition (ICPR), pp. 3622–3626, August 2018
Li, H., Zhu, F., Qiu, J.: Towards document image quality assessment: a text line based framework and a synthetic text line image dataset. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 551–558, September 2019
Liao, M., Shi, B., Bai, X., Wang, X., Liu, W.: TextBoxes: a fast text detector with a single deep neural network. In: Thirty-First AAAI Conference on Artificial Intelligence, February 2017
Likforman-Sulem, L., Darbon, J., Smith, E.H.: Enhancement of historical printed document images by combining total variation regularization and non-local means filtering. Image Vis. Comput. 29(5), 351–363 (2011)
Liu, X., Van De Weijer, J., Bagdanov, A.D.: RankIQA: learning from rankings for no-reference image quality assessment. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 1040–1049. IEEE, October 2017
Lu, T., Dooms, A.: A deep transfer learning approach to document image quality assessment. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1372–1377, September 2019
Manap, R.A., Shao, L.: Non-distortion-specific no-reference image quality assessment: a survey. Inf. Sci. 301, 141–160 (2015)
Mindermann, S.: Hyperspectral Imaging for Readability Enhancement of Historic Manuscripts. Master’s thesis, TU München (2018)
Mittal, A., Moorthy, A.K., Bovik, A.C.: No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process. 21(12), 4695–4708 (2012)
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (ICPR), pp. 3538–3545. IEEE, Providence, June 2012
Obafemi-Ajayi, T., Agam, G.: Character-based automated human perception quality assessment in document images. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 42(3), 584–595 (2012)
Ponomarenko, N., et al.: Image database TID2013: peculiarities, results and perspectives. Sig. Process. Image Commun. 30, 57–77 (2015)
Pouyet, E., et al.: Revealing the biography of a hidden medieval manuscript using synchrotron and conventional imaging techniques. Anal. Chim. Acta 982, 20–30 (2017)
Pratikakis, I., Zagoris, K., Karagiannis, X., Tsochatzidis, L., Mondal, T., Marthot-Santaniello, I.: ICDAR 2019 competition on document image binarization (DIBCO 2019). In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 1547–1556 (Sep 2019)
Puth, M.T., Neuhäuser, M., Ruxton, G.D.: Effective use of Spearman’s and Kendall’s correlation coefficients for association between two measured traits. Anim. Behav. 102, 77–84 (2015)
Ryu, J., Koo, H.I., Cho, N.I.: Language-independent text-line extraction algorithm for handwritten documents. IEEE Signal Process. Lett. 21(9), 1115–1119 (2014)
Shahkolaei, A., Nafchi, H.Z., Al-Maadeed, S., Cheriet, M.: Subjective and objective quality assessment of degraded document images. J. Cult. Herit. 30, 199–209 (2018)
Shaus, A., Faigenbaum-Golovin, S., Sober, B., Turkel, E.: Potential contrast - a new image quality measure. Electron. Imaging 2017(12), 52–58 (2017)
Sheikh, H.R., Sabir, M.F., Bovik, A.C.: A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 15(11), 3441–3452 (2006)
Stommel, M., Frieder, G.: Automatic estimation of the legibility of binarised historic documents for unsupervised parameter tuning. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 104–108, September 2011
Sulaiman, O.: Nasrudin: degraded historical document binarization: a review on issues, challenges, techniques, and future directions. J. Imaging 5(4), 48 (2019)
Virtanen, T., Nuutinen, M., Vaahteranoksa, M., Oittinen, P., Häkkinen, J.: CID2013: a database for evaluating no-reference image quality assessment algorithms. IEEE Trans. Image Process. 24(1), 390–402 (2015)
Xu, X., Liu, L., Li, B.: A survey of CAPTCHA technologies to distinguish between human and computer. Neurocomputing 408, 292–307 (2020)
Ye, P., Doermann, D.: Document image quality assessment: a brief survey. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 723–727, August 2013
Ye, P., Kumar, J., Kang, L., Doermann, D.: Real-time no-reference image quality assessment based on filter learning. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 987–994, June 2013
Ye, P., Doermann, D.: Combining preference and absolute judgements in a crowd-sourced setting. In: Proceedings of International Conference on Machine Learning, pp. 1–7 (2013)
Yousefi, M.R., Soheili, M.R., Breuel, T.M., Kabir, E., Stricker, D.: Binarization-free OCR for historical documents using LSTM networks. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1121–1125, August 2015
Zhou, X., et al.: EAST: an efficient and accurate scene text detector. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2642–2651. IEEE, Honolulu, HI, July 2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Brenner, S., Schügerl, L., Sablatnig, R. (2021). Estimating Human Legibility in Historic Manuscript Images - A Baseline. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12823. Springer, Cham. https://doi.org/10.1007/978-3-030-86334-0_32
Download citation
DOI: https://doi.org/10.1007/978-3-030-86334-0_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86333-3
Online ISBN: 978-3-030-86334-0
eBook Packages: Computer ScienceComputer Science (R0)