Abstract
Digital map processing has been an interest in the image processing and pattern recognition community since the early 80s. With the exponential growth of available map scans in the archives and on the internet, a variety of disciplines in the natural and social sciences grow interests in using historical maps as a primary source of geographical and political information in their studies. Today, many organizations such as the United States Geological Survey, David Rumsey Map Collection, OldMapsOnline.org, and National Library of Scotland, store numerous historical maps in either paper or scanned format. Only a small portion of these historical maps is georeferenced, and even fewer of them have machine-readable content or comprehensive metadata. The lack of a searchable textual content including the spatial and temporal information prevents researchers from efficiently finding relevant maps for their research and using the map content in their studies. These challenges present a tremendous collaboration opportunity for the image processing and pattern recognition community to build advance map processing technologies for transforming the natural and social science studies that use historical maps. This paper presents the potentials of using historical maps in scientific research, describes the current trends and challenges in extracting and recognizing text content from historical maps, and discusses the future outlook.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
USGS NGMDB (2016) [Website]. Retrieved from http://ngmdb.usgs.gov/ngmdb/ngmdb_home.html.
- 2.
USGS topoView (2016) [Website]. Retrieved from http://ngmdb.usgs.gov/maps/TopoView/.
- 3.
David Rumsey. (2016). [Website]. Retrieved from http://www.davidrumsey.com/.
- 4.
OldMapsOnline (2016) [Website]. Retrieved from http://www.oldmapsonline.org/.
- 5.
NLS (2016) [Website]. Retrieved from http://maps.nls.uk/.
- 6.
USGS topoView (2016) [Website]. Retrieved from http://ngmdb.usgs.gov/maps/TopoView/.
- 7.
CalFlora (2016) [Data set]. Retrieved from http://www.calflora.org/.
- 8.
CLAVIN (2016) [Computer software]. Retrieved from https://clavin.bericotechnologies.com/.
- 9.
U.S. Census Gazetteer (2016) [Data set]. Retrieved from https://www.census.gov/geo/maps-data/data/gazetteer.html.
- 10.
USGS GNIS (2016) [Data set]. Retrieved from http://geonames.usgs.gov/.
- 11.
GeoNames (2016) [Data set]. Retrieved from http://www.geonames.org/.
- 12.
OpenStreetMap (2016) [Website]. Retrieved from https://www.openstreetmap.org/.
- 13.
Los Angeles Public Library Map Collection (2016) [Website]. Retrieved from https://www.lapl.org/collections-resources/visual-collections/map-collection.
- 14.
NHGIS (2016) [Website]. Retrieved from https://www.nhgis.org/.
- 15.
A Vision of Britain through Time (2016) [Website]. Retrieved from http://www.visionofbritain.org.uk/.
- 16.
Dr. Kurashige’s article published in the Southern California Quarterly won the 2015 Carl I. Wheat Award for the best demonstration of scholarship in that journal from 2012–2014 by a senior historian.
- 17.
- 18.
Spatial technology opens a window into history (2016) [News article]. Retrieved from https://news.usc.edu/91625/spatial-technology-opens-a-window-into-history/.
- 19.
Peter Feigl's Journey Through Historical Maps (2016) [Website]. Retrieved from http://www.arcgis.com/apps/MapJournal/index.html?appid=6c3b4136b9304df09c9adcf86dd30dd5.
- 20.
Tesseract-OCR (2016) [Computer software]. Retrieved from https://github.com/tesseract-ocr.
- 21.
NYPL map-vectorizer (2016) [Computer software]. https://github.com/NYPL/map-vectorizer.
- 22.
Plageois Commons (2016) [Website]. Retrieved from http://commons.pelagios.org/.
References
Adams, O.G.: Place Names in the North Central Counties of Missouri (Ph. D.). University of Missouri-Columbia (1928)
Alex, B., Byrne, K., Grover, C., Tobin, R.: Adapting the Edinburgh geoparser for historical georeferencing. Int. J. Humanit. Comput. 9(1), 15–35 (2015)
Arteaga, M.G.: Historical map polygon and feature extractor. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on MapInteraction, pp. 66–71. ACM (2013)
Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Seman. Web Inf. Syst. 5(3), 1–22 (2009)
Chiang, Y.-Y., Knoblock, C.A.: Recognizing text in raster maps. GeoInformatica 19(1), 1–27 (2014)
Chiang, Y.-Y., Leyk, S., Knoblock, C.A.: A survey of digital map processing techniques. ACM Comput. Surv. (CSUR) 47(1), 1 (2014)
Chiang, Y.-Y., Leyk, S., Nazari, N.H., Moghaddam, S., Tan, T.X.: Assessing the impact of graphical quality on automatic text recognition in digital maps. Comput. Geosci. 93, 21–35 (2016)
Davis, C.C., Willis, C.G., Connolly, B., Kelly, C., Ellison, A.M.: Herbarium records are reliable sources of phenological change driven by climate and provide novel insights into species’ phenological cueing mechanisms. Am. J. Bot. 102(10), 1599–1609 (2015)
D’Ignazio, C., Bhargava, R., Zuckerman, E.: Cliff-clavin: determining geographic focus for news. In: NewsKDD: Data Science for News Publishing (2014)
Garijo, D., Gil, Y., Harth, A.: Challenges for provenance analytics over geospatial data. In: Ludäscher, B., Plale, B. (eds.) IPAW 2014. LNCS, vol. 8628, pp. 261–263. Springer, Cham (2015). doi:10.1007/978-3-319-16462-5_28
Godfrey, B., Eveleth, H.: An adaptable approach for generating vector features from scanned historical thematic maps using image enhancement and remote sensing techniques in a in a geographic information system. J. Map Geogr. Librar. 11(1), 18–36 (2015)
Gregory, I., Donaldson, C., Murrieta-Flores, P., Rayson, P.: Geoparsing, GIS, and textual analysis: current developments in spatial humanities research. Int. J. Humanit. Comput. 9(1), 1–14 (2015)
Gregory, I.N., Ell, P.S.: Historical GIS: Technologies, Methodologies, and Scholarship, vol. 39. Cambridge University Press, Cambridge (2007)
Guralnick, R.P., Wieczorek, J., Beaman, R., Hijmans, R.J., Group, B.W., et al.: BioGeomancer: automated georeferencing to map the world’s biodiversity data. PLoS Biol. 4(11), e381 (2006)
Hill, A.W., Guralnick, R., Flemons, P., Beaman, R., Wieczorek, J., Ranipeta, A., Chavan, V., Remsen, D.: Location, location, location: utilizing pipelines and services to more effectively georeference the world’s biodiversity data. BMC Bioinf. 10(Suppl 14), S3 (2009)
Honarvar Nazari, N., Tan, T.X., Chiang, Y.-Y.: Integrating text recognition for overlapping text detection in maps. Electron. Imaging Doc. Recogn. Retrieval XXIII 17, 1–8 (2016)
Khotanzad, A., Zink, E.: Contour line and geographic feature extraction from USGS color topographical paper maps. IEEE Trans. Pattern Anal. Mach. Intell. 25(1), 18–31 (2003)
Kurashige, L.: Rethinking anti-immigrant racism: lessons from the Los Angeles vote on the 1920 Alien Land Law. Southern Calif. Q. 95(3), 265–283 (2013)
Lavoie, C.: Biological collections in an ever changing world: herbaria as tools for biogeographical and environmental studies. Perspect. Plant Ecol. Evol. Syst. 15(1), 68–76 (2013)
Leidner, J.L., Lieberman, M.D.: Detecting geographical references in the form of place names and associated spatial natural language. Sigspatial Spec. 3(2), 5–11 (2011)
Leyk, S., Boesch, R.: Colors of the past: color image segmentation in historical topographic maps based on homogeneity. GeoInformatica 14(1), 1–21 (2009)
Leyk, S., Boesch, R., Weibel, R.: Saliency and semantic processing: extracting forest cover from historical topographic maps. Pattern Recogn. 39(5), 953–968 (2006)
Li, L., Nagy, G., Samal, A., Seth, S., Xu, Y.: Integrated text and line-art extraction from a topographic map. Int. J. Doc. Anal. Recogn. 2(4), 177–185 (2000)
Murphey, P.C., Guralnick, R.P., Glaubitz, R., Neufeld, D., Ryan, J.A.: Georeferencing of museum collections: a review of problems and automated tools, and the methodology developed by the mountain and plains spatio-temporal database-informatics initiative (Mapstedi). Phyloinformatics 1(3), 1–29 (2004)
Nagy, G., Samal, A., Seth, S., Fisher, T.: Reading street names from maps-technical challenges. In: Proceedings of GIS/LIS (1997)
Nanetti, A., Cattaneo, A., Cheong, S.A., Lin, C.-Y.: Maps as knowledge aggregators: from Renaissance Italy Fra mauro to web search engines. Cartographic J. 52(2), 159–167 (2015)
Newbold, T.: Applications and limitations of museum data for conservation and ecology, with particular attention to species distribution models. Prog. Phys. Geogr. 34(1), 3–22 (2010)
Ngo, V., Swift, J., Chiang, Y.-Y.: Visualizing land reclamation in Hong Kong: a web application. In: International Cartographic Conference (2015)
Pezeshk, A., Tutwiler, R.L.: Improved multi angled parallelism for separation of text from intersecting linear features in scanned topographic maps. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1078–1081. IEEE (2010)
Pezeshk, A., Tutwiler, R.L.: Automatic feature extraction and text recognition from scanned topographic maps. IEEE Trans. Geosci. Remote Sens. 49(12), 5047–5063 (2011). A Publication of the IEEE Geoscience and Remote Sensing Society
Pyke, G.H., Ehrlich, P.R.: Biological collections and ecological/environmental research: a review, some observations and a look to the future. Biol. Rev. Camb. Philos. Soc. 85(2), 247–266 (2010)
Raveaux, R., Burie, J.C., Ogier, J.M.: A colour document interpretation: application to ancient cadastral maps. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 1128–1132. IEEE (2007)
Raveaux, R., Burie, J.C., Ogier, J.M.: Object extraction from colour cadastral maps. In: The Eighth IAPR International Workshop on Document Analysis Systems, DAS 2008, pp. 506–514. IEEE (2008)
Rios, N.E., Bart, H.L.: GEOLocate (Version 3.22) [Computer software] (2010)
Samy, G., Chavan, V., Ariño, A.H., Otegui, J., Hobern, D., Sood, R., Robles, E.: Content assessment of the primary biodiversity data published through GBIF network: status, challenges and potentials. Biodivers. Inform. 8(2) (2013). http://doi.org/10.17161/bi.v8i2.4124
Simon, R., Barker, E., Isaksen, L.: Linking early geospatial documents, one place at a time: annotation of geographic documents with Recogito. E-Perimetron 10(2), 49–59 (2015)
Simon, R., Pilgerstorfer, P., Isaksen, L., Barker, E.: Towards semi-automatic annotation of toponyms on old maps. E - Perimetron 9(3), 105–128 (2014)
Simon, R., Sadilek, C., Korb, J., Baldauf, M., Haslhofer, B.: Tag clouds and old maps: annotations as linked spatiotemporal data in the cultural heritage domain. In: Workshop on Linked Spatiotemporal Data, Zurich, Switzerland (2010)
Torr, P.H.S., Zisserman, A.: MLESAC: a new robust estimator with application to estimating image geometry. Comput. Vis. Image Underst. CVIU 78(1), 138–156 (2000)
Vellend, M., Brown, C.D., Kharouba, H.M., McCune, J.L., Myers-Smith, I.H.: Historical ecology: using unconventional data sources to test for effects of global environmental change. Am. J. Bot. 100(7), 1294–1305 (2013)
Weinman, J.: Toponym recognition in historical maps by Gazetteer alignment. In: Proceedings of the 12th International Conference on Document Analysis and Recognition, pp. 1044–1048 (2013)
Yoshida, K., Burbano, H.A., Krause, J., Thines, M., Weigel, D., Kamoun, S.: Mining herbaria for plant pathogen genomes: back to the future. PLoS Pathog. 10(4), e1004028 (2014)
Yu, R., Luo, Z., Chiang, Y.-Y.: Recognizing text on historical maps using maps from multiple time periods. In: Proceedings of the 23rd International Conference on Pattern Recognition (2016)
Acknowledgements
This research is based upon work supported in part by the National Science Foundation under award number IIS-1564164 and in part by the University of Southern California under the Undergraduate Research Associates Program (URAP). The author thanks Travis Longcore for his input on the biology studies and the U.S. National Committee (USNC) to the International Cartographic Association (ICA) for providing travel funding to attend the 27th International Cartographic Conference (ICC).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chiang, YY. (2017). Unlocking Textual Content from Historical Maps - Potentials and Applications, Trends, and Outlooks. In: Santosh, K., Hangarge, M., Bevilacqua, V., Negi, A. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2016. Communications in Computer and Information Science, vol 709. Springer, Singapore. https://doi.org/10.1007/978-981-10-4859-3_11
Download citation
DOI: https://doi.org/10.1007/978-981-10-4859-3_11
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-4858-6
Online ISBN: 978-981-10-4859-3
eBook Packages: Computer ScienceComputer Science (R0)