Resolving Ambiguities in Toponym Recognition in Cartographic Maps

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3088))

Abstract

Many methods and programs for automatic text recognition exist to date. However, there are no effective text recognition systems for graphic documents. Graphic documents usually contain a great variety of textual information. As a rule, the text appears in arbitrary spatial positions and in different fonts, sizes, and colors. The text can touch and overlap graphic symbols, and its meaning is semantically much more ambiguous than that of standard text. To recognize the text of a graphic document, it is first necessary to separate it from linear objects, solids, and symbols and to determine its orientation. Even then, recognition programs nearly always produce errors. In the context of raster-to-vector conversion of graphic documents, text recognition is of special interest because the textual information can be used to verify the vectorization results (post-processing). In this work, we propose a method that combines OCR-based text recognition in raster-scanned maps with heuristics specially adapted to cartographic data, in order to resolve recognition ambiguities using, among other information sources, the spatial relationships between objects. Our goal is to form, in the vector thematic layers, geographically meaningful words correctly attached to the cartographic objects.
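The idea of combining OCR output with spatial evidence can be illustrated with a minimal sketch. It is not the method of the paper, which the abstract does not specify in detail. It assumes a hypothetical gazetteer of known toponyms with map coordinates, uses Python's standard `difflib.SequenceMatcher` as a stand-in string similarity, and mixes the two scores with arbitrary illustrative weights. All names (`GazetteerEntry`, `best_toponym`, `w_string`, `w_spatial`, `scale`) are assumptions made for this example.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from math import hypot


@dataclass(frozen=True)
class GazetteerEntry:
    """Hypothetical record: a known toponym and the map position of its object."""
    name: str
    x: float
    y: float


def string_score(ocr_text: str, name: str) -> float:
    """Similarity in [0, 1] between the raw OCR string and a known name."""
    return SequenceMatcher(None, ocr_text.lower(), name.lower()).ratio()


def spatial_score(label_xy: tuple[float, float], entry: GazetteerEntry,
                  scale: float = 100.0) -> float:
    """Score in (0, 1]: equals 1 when the label lies on the object and
    decays with distance. `scale` is an assumed tuning constant."""
    distance = hypot(label_xy[0] - entry.x, label_xy[1] - entry.y)
    return 1.0 / (1.0 + distance / scale)


def best_toponym(ocr_text: str, label_xy: tuple[float, float],
                 gazetteer: list[GazetteerEntry],
                 w_string: float = 0.6, w_spatial: float = 0.4) -> GazetteerEntry:
    """Return the gazetteer entry with the highest weighted combination of
    string similarity and spatial proximity (weights are illustrative)."""
    def combined(entry: GazetteerEntry) -> float:
        return (w_string * string_score(ocr_text, entry.name)
                + w_spatial * spatial_score(label_xy, entry))
    return max(gazetteer, key=combined)


# Usage: the OCR string "Tu1a" is equally similar to both candidates,
# so the position of the label on the map decides between them.
gazetteer = [GazetteerEntry("Tula", 500.0, 500.0),
             GazetteerEntry("Tuba", 0.0, 0.0)]
near_origin = best_toponym("Tu1a", (5.0, 5.0), gazetteer)
near_tula = best_toponym("Tu1a", (495.0, 495.0), gazetteer)
```

In this sketch the string evidence alone cannot separate the two candidates, and the spatial term breaks the tie in favor of the object nearest to the label. A real system would need a similarity measure and weights tuned to the OCR engine and the map series.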

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gelbukh, A., Levachkine, S., Han, SY. (2004). Resolving Ambiguities in Toponym Recognition in Cartographic Maps. In: Lladós, J., Kwon, YB. (eds) Graphics Recognition. Recent Advances and Perspectives. GREC 2003. Lecture Notes in Computer Science, vol 3088. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25977-0_7

  • DOI: https://doi.org/10.1007/978-3-540-25977-0_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22478-5

  • Online ISBN: 978-3-540-25977-0

  • eBook Packages: Springer Book Archive
