Published June 30, 2023 | Version v1
Conference paper Open

Mapping spatial named entities from noisy OCR output: Epimetheus from OCR to map.

  • 1. ObTIC, Observatoire des textes des idées et des corpus, Sorbonne Université, France; STIH, Sens Texte Informatique Histoire, Sorbonne Université, France; SCAI, Sorbonne Center for Artificial Intelligence, Sorbonne Université, France
  • 2. ObTIC, Observatoire des textes des idées et des corpus, Sorbonne Université, France
  • 3. Lattice, Langues, Textes, Traitements informatiques, Cognition, France
  • 4. STIH, Sens Texte Informatique Histoire, Sorbonne Université, France
  • 1. University of Graz
  • 2. Belgrade Center for Digital Humanities
  • 3. Le Mans Université
  • 4. Digital Humanities im deutschsprachigen Raum

Description

This contribution presents the difficulties encountered and methods to overcome them when using ready-to-use tools for the elaboration of a processing chain going from OCR to NER and then to the cartographic representation of spaces mentioned in literary texts.

Files

KOUDORO_PARFAIT_Caroline_Mapping_spatial_named_entities_from.pdf

Additional details

Related works

Is part of
Book: 10.5281/zenodo.7961822 (DOI)