DocNER: A Deep Learning System for Named Entity Recognition in Handwritten Document Images

Dhiaf, Marwa; Jemni, Sana Khamekhem; Kessentini, Yousri

doi:10.1007/978-3-030-92310-5_28

Marwa Dhiaf¹⁰,
Sana Khamekhem Jemni¹¹ &
Yousri Kessentini¹⁰

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1517))

Included in the following conference series:

International Conference on Neural Information Processing

1841 Accesses
2 Citations

Abstract

The extraction of relevant information from handwritten document images is still a challenging task. In this paper, we introduce a framework for named entities recognition from a collection of real handwritten marriage records. For this purpose, we perform an extensive evaluation of two different information extraction approaches to locate and recognize named entities from unstructured handwritten documents. The first one is based on an end-to-end neural network model that jointly performs transcription and semantic annotation of handwritten document images. While the second relies on two stages, the first one focuses on Handwritten Text Recognition (HTR) to transcribe documents into electronic texts, while the second seeks to identify semantic named entities using a state-of-the-art Natural Language Processing (NLP) model. This work is evaluated on a collection of real French handwritten marriage records, discussing the advantages and drawbacks of the explored approaches. The achieved results show the efficiency of the proposed framework even though it does not use any post-processing stage. Additional experiments are conducted using the Esposalles database to compare our methods to the participated systems in the ICDAR 2017 Information Extraction competition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://opennlp.apache.org/.

References

Romero, V., Fornés, A., Vidal, E., Sánchez, J.: Using the MGGI methodology for category-based language modeling in handwritten marriage licenses books. In: 201615th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 331–336. IEEE (2016)
Google Scholar
Grover, C., Givon, S., Tobin, R., Ball, J.: Named entity recognition for digitised historical texts. In: LREC (2008)
Google Scholar
Grover, C., Matheson, C., Mikheev, A., Moens, M.: LT TTT-a flexible tokenisation tool. In: LREC, vol. 2000, p. 2nd (2000)
Google Scholar
Rodriquez, K., Bryant, M., Blanke, T., Luszczynska, M.: Comparison of named entity recognition tools for raw OCR text. In: Konvens, pp. 410–414 (2012)
Google Scholar
Liu, H., Feng, J., Qi, M., Jiang, J., Yan, S.: End-to-end comparative attention networks for person re-identification. IEEE Trans. Image Process. 26, 3492–3506 (2017)
Article MathSciNet Google Scholar
Carbonell, M., Villegas, M., Fornés, A., Lladós, J.: Joint recognition of handwritten text and named entities with a neural end-to-end model. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp. 399–404. IEEE (2018)
Google Scholar
Carbonell, M., Fornés, A., Villegas, M., Lladós, J.: A neural model for text localization, transcription and named entity recognition in full pages. Pattern Recogn. Lett. 136, 219–227 (2020)
Article Google Scholar
Kessentini, Y., Paquet, T., Hamadou, A.: Off-line handwritten word recognition using multi-stream hidden Markov models. Pattern Recogn. Lett. 31, 60–70 (2010)
Article Google Scholar
Surinta, O., Holtkamp, M., Karabaa, F., Vanoosten, J., Schomaker, L., Wiering, M.: A path planning for line segmentation of handwritten documents. In: 2014 14th International Conference on Frontiers in Handwriting Recognition, pp. 175–180. IEEE (2014)
Google Scholar
Augustin, E., Carré, M., Grosicki, E., Brodin, J., Geoffrois, E., Prêteux, F.: RIMES evaluation campaign for handwritten mail processing. In: International Workshop on Frontiers in Handwriting Recognition (IWFHR 2006), pp. 231–235 (2006)
Google Scholar
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv Preprint arXiv:1810.04805 (2018)
Martin, L., et al.: CamemBERT: a tasty French language model. arXiv Preprint arXiv:1911.03894 (2019)
Fornés, A., et al.: Competition on information extraction in historical handwritten records. In: 2013 12th International Conference on Document Analysis and Recognition. IEEE (2017)
Google Scholar
Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, andreversals. In: Soviet physics doklady, vol. 10, pp. 707–710. Soviet Union (1966)
Google Scholar
Graves, A.: Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850 (2013)

Download references

Author information

Authors and Affiliations

Digital Research Center of Sfax, SM@RTS: Laboratory of Signals, systeMs, aRtificial Intelligence and neTworkS, Sfax, Tunisia
Marwa Dhiaf & Yousri Kessentini
Digital Research Center of Sfax, MIR@CL: Multimedia, InfoRmation systems and Advanced Computing Laboratory, Sfax, Tunisia
Sana Khamekhem Jemni

Authors

Marwa Dhiaf
View author publications
You can also search for this author in PubMed Google Scholar
Sana Khamekhem Jemni
View author publications
You can also search for this author in PubMed Google Scholar
Yousri Kessentini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marwa Dhiaf .

Editor information

Editors and Affiliations

Sampoerna University, Jakarta, Indonesia
Teddy Mantoro
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee
Sampoerna University, Jakarta, Indonesia
Media Anugerah Ayu
Murdoch University, Murdoch, WA, Australia
Kok Wai Wong
Universitas Indonesia, Depok, Indonesia
Achmad Nizar Hidayanto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dhiaf, M., Jemni, S.K., Kessentini, Y. (2021). DocNER: A Deep Learning System for Named Entity Recognition in Handwritten Document Images. In: Mantoro, T., Lee, M., Ayu, M.A., Wong, K.W., Hidayanto, A.N. (eds) Neural Information Processing. ICONIP 2021. Communications in Computer and Information Science, vol 1517. Springer, Cham. https://doi.org/10.1007/978-3-030-92310-5_28

Download citation

DOI: https://doi.org/10.1007/978-3-030-92310-5_28
Published: 02 December 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92309-9
Online ISBN: 978-3-030-92310-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics