Abstract
In this article, a novel observation-to-generalization place model is proposed. It is shown how this model can be used to formally define the problem of finding geographically similar places. The observation-to-generalization model differentiates between observations of phenomena in the environment at a specific location and time, and generalizations about places that are inferred from these observations. A suite of operations is defined to find similar places based on the invariance of generalized place properties, and it is demonstrated how these functions can be applied to the problem of finding similar places based on the topics that people write about in place descriptions. One use for similar-place search is for exploratory research that will enable investigators to perform case–control studies on place data.



Similar content being viewed by others
Notes
The meaning of the term ‘generalization’ here should not be confused with the meanings used in either cartography or logic.
The search functions are based on place entities in a knowledge base. These examples are illustrative shorthand. To perform natural language searches like these, one would also need to disambiguate places with the same toponym and integrate exonyms.
References
Adams B, Janowicz K (2011) Constructing geo-ontologies by reification of observation data. In: Cruz IF, Agrawal D, Jensen CS, Ofek E, Tanin E (eds) GIS. ACM, Chicago, pp 309–318
Adams B, Janowicz K (2012) On the geo-indicativeness of non-georeferenced text. In: Breslin JG, Ellison NB, Shanahan JG, Tufekci Z (eds) ICWSM. The AAAI Press, Dublin, pp 375–378
Adams B, McKenzie G (2013) Inferring thematic places from spatially referenced natural language descriptions. In: Sui D, Elwood S, Goodchild M (eds) Crowdsourcing geographic knowledge. Springer, Berlin, pp 201–221
Adams B, Raubal M (2009) A metric conceptual space algebra. In: Hornsby K, Claramunt C, Denis M, Ligozat G (eds) Spatial information theory, lecture notes in computer science, vol 5756. Springer, Berlin, pp 51–68
Alves AO, Pereira FC, Biderman A, Ratti C (2009) Place enrichment by mining the web. In: Proceedings of the European conference on ambient intelligence (Am I ’09). Springer, Berlin, pp 66–77
Banchuen T (2008) The geographical analog engine: hybrid numeric and semantic similarity measures for U.S. cities. Ph.D. thesis, The Pennsylvania State University
Bivand R, Gebhardt A (2000) Implementing functions for spatial statistical analysis using the language. J Geogr Syst 2(3):307–317
Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(4/5):993–1022
Couclelis H (1992) People manipulate objects (but cultivate fields): beyond the raster-vector debate in GIS. In: Frank AU, Campari I, Formentini U (eds) Spatio-temporal reasoning, lecture notes in computer science, vol 639. Springer, Berlin, pp 65–77
Couclelis H (2010) Ontologies of geographic information. Int J Geogr Inf Sci 24(12):1785–1809
Cresswell T (2004) Place: a short introduction. Blackwell Publishing Ltd, Oxford
Cronon W (2003) Changes in the land: Indians, colonists, and the ecology of New England. 20th, anniversary edn. Hill and Wang, New York
Eisenstein J, O’Connor B, Smith NA, Xing EP (2010) A latent variable model for geographic lexical variation. In: Li H, Màrquez L (eds) EMNLP. ACL, Cambridge, pp 1277–1287
Fotheringham AS, Wong DW (1991) The modifiable areal unit problem in multivariate statistical analysis. Environ Plan A 23(7):1025–1044
Frank AU (2001) Tiers of ontology and consistency constraints in geographical information systems. Int J Geogr Inf Sci 15(7):667–678
Gangemi A, Guarino N, Masolo C, Oltramari A, Schneider L (2002) Sweetening ontologies with DOLCE. In: Gómez-Pérez A, Benjamins VR (eds) EKAW, Lecture notes in computer science, vol 2473. Springer, Berlin, pp 166–181
Gärdenfors P (2000) Conceptual spaces: the geometry of thought. A Bradford book. MIT Press, Cambridge
Gatrell AC, Bailey TC, Diggle PJ, Rowlingson BS (1996) Spatial point pattern analysis and its application in geographical epidemiology. Trans Inst Br Geogr 21(1):256–274
Goodchild M (2007) Citizens as sensors: the world of volunteered geography. GeoJournal 69(4):211–221
Goodchild M, Montello D, Fohl P, Gottsegen J (1998) Fuzzy spatial queries in digital spatial data libraries. In: Simpson PK (ed) Proceedings of the IEEE international conference on fuzzy sysem, vol 1. IEEE, Anchorage, Alaska, pp 205–210
Graham LT, Gosling SD (2011) Can the ambiance of a place be determined by the user profiles of the people who visit it? In: Adamic LA, Baeza-Yates RA, Counts S (eds) ICWSM. The AAAI Press, Barcelona, pp 145–152
Griffiths TL, Steyvers M (2004) Finding scientific topics. Proc Natl Acad Sci 101(Suppl. 1):5228–5235
Halpern B, Longo C, Hardy D, McLeod K, Samhouri J, Katona S, Kleisner K, Lester S, O’Leary J, Ranelletti M, Rosenberg A, Scarborough C, Selig E, Best B, Brumbaugh D, Chapin F, Crowder L, Daly K, Doney S, Elfes C, Fogarty M, Gaines S, Jacobsen K, Karrer L, Leslie H, Neeley E, Pauly D, Polasky S, Ris B, St Martin K, Stone G, Sumaila U, Zeller D (2012) An index to assess the health and benefits of the global ocean. Nature 488(7413):615–620
Hao Q, Cai R, Wang C, Xiao R, Yang JM, Pang Y, Zhang L (2010) Equip tourists with knowledge mined from travelogues. In: Rappa M, Jones P, Freire J, Chakrabarti S (eds) WWW. ACM, Raleigh, pp 401–410
Heuvelink GBM (1998) Error propagation in environmental modelling With GIS. Taylor & Francis, New York
Hey T, Tansley S, Tolle KM (eds) (2009) The fourth paradigm: data-intensive scientific discovery. Microsoft Research, Redmond
Hill LL (2006) Georeferencing: the geographic associations of information. MIT Press, Cambridge
Hong L, Ahmed A, Gurumurthy S, Smola AJ, Tsioutsiouliklis K (2012) Discovering geographical topics in the twitter stream. In: Mille A, Gandon FL, Misselis J, Rabinovich M, Staab S (eds) WWW. ACM, Lyon, pp 769–778
Jacquez GM (2000) Spatial analysis in epidemiology: nascent science or a failure of GIS? J Geogr Syst 2(1):91–97
Janowicz K, Adams B, Raubal M (2010) Semantic referencing—determining context weights for similarity measurement. In: Fabrikant SI, Reichenbacher T, van Kreveld MJ, Schlieder C (eds) GIScience, Lecture notes in computer science, vol 6292. Springer, Berlin, pp 70–84
Janowicz K, Raubal M, Kuhn W (2011) The semantics of similarity in geographic information retrieval. J Spat Inf Sci 2(1):29–57
Janowicz K, Wilkes M (2009) SIM-DL\_A: a novel semantic similarity measure for description logics reducing inter-concept to inter-instance similarity. ESWC, Heraklion
Kell DB, Oliver SG (2004) Here is the evidence, now what is the hypothesis? The complementary roles of inductive and hypothesis-driven science in the post-genomic era. BioEssays 26(1):99–105
Keßler C, Janowicz K, Bishr M (2009) An agenda for the next generation gazetteer: geographic information contribution and retrieval. In: Wolfson O, Agrawal D, Lu CT (eds) GIS. ACM, Seattle, pp 91–100
Kuhn W (2003) Semantic reference systems. Int J Geogr Inf Sci 17(5):405–409
Lehmann J, Bizer C, Kobilarov G, Auer S, Becker C, Cyganiak R, Hellmann S (2009) DBpedia—a crystallization point for the web of data. J Web Semant 7(3):154–165
Leung D, Newsam S (2010) Proximate sensing: inferring what-is-where from georeferenced photo collections. In: Davis L, Malik J (eds) CVPR. IEEE, San Francisco, pp 2955–2962
Madin J, Bowers S, Schildhauer M, Krivov S, Pennington D, Villa F (2007) An ontology for describing and synthesizing ecological observation data. Ecol Inform 2(3):279–296. doi:10.1016/j.ecoinf.2007.05.004
Montello DR, Goodchild MF, Gottsegen J, Fohl P (2003) Where’s downtown? Behavioral methods for determining referents of vague spatial queries. Spat Cogn Comput 3(2):185–204
Montero JM, Chasco C, Larraz B (2010) Building an environmental quality index for a big city: a spatial interpolation approach combined with a distance indicator. J Geogr Syst 12(4):435–459
Peel MC, Finlayson BL, McMahon TA (2007) Updated world map of the Köppen-Geiger climate classification. Hydrol Earth Syst Sci Discuss 4(2):439–473
Petersen J, Gibin M, Longley P, Mateos P, Atkinson P, Ashby D (2011) Geodemographics as a tool for targeting neighbourhoods in public health campaigns. J Geogr Syst 13(2):173–192. doi:10.1007/s10109-010-0113-9
Probst F (2006) Ontological analysis of observations and measurements. In: Raubal M, Miller HJ, Frank AU, Goodchild MF (eds) GIScience, Lecture notes in computer science, vol 4197. Springer, Berlin, pp 304–320
Probst F (2008) Observations, measurements and semantic reference spaces. Appl Ontol 3(1–2):63–89
Ramage D, Hall D, Nallapati R, Manning CD (2009) Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: Koehn P, Mihalcea R (eds) EMNLP. ACL, Cambridge, pp 248–256
Relph E (1976) Place and placelessness. Pion, London
Rosenbaum PR, Rubin DB (1983) The central role of the propensity score in observational studies for causal effects. Biometrika 70(1):41–55
Schade S, Ostermann F, Spinsanti L, Kuhn W (2012) Semantic observation integration. Future Internet 4(3):807–829
Schulz KF, Grimes DA (2002) Case–control studies: research in reverse. Lancet 359(9304):431–434
Selvin HC (1958) Durkheim’s suicide and problems of empirical research. Am J Sociol 63(6):607–619
Sizov S (2010) GeoFolk: latent spatial semantics in web 2.0 social media. In: Suel T, Davison BD (eds) WSDM. ACM, New York, pp 281–290
Stasch C, Janowicz K, Bröring A, Reis I, Kuhn W (2009) A stimulus-centric algebraic approach to sensors and observations. In: Trigoni N, Markham A, Nawaz S (eds) GSN, Lecture notes in computer science, vol 5659. Springer, Berlin, pp 169–179
Tuan YF (1977) Space and Place: the Perspective of Experience. The Regents of the University of Minnesota, Saint Paul
United Nations Development Programme (1990) Human development report 1990. Oxford University Press, New York
Wang C, Wang J, Xie X, Ma WY (2007) Mining geographic knowledge using location aware topic model. In: Purves R, Jones C (eds) GIR. ACM, Lisbon, pp 65–70
Winter S, Freksa C (2012) Approaching the notion of place by contrast. J Spat Inf Sci 5:31–50
Winter S, Kuhn W, Krüger A (2009) Guest editorial: does place have a place in geographic information science? Spat Cogn Comput 9(3):171–173
Yin Z, Cao L, Han J, Zhai C, Huang TS (2011) Geographical topic discovery and comparison. In: Srinivasan S, Ramamritham K, Kumar A, Ravindra MP, Bertino E, Kumar R (eds) WWW. ACM, Hyderabad, pp 247–256
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Adams, B. Finding similar places using the observation-to-generalization place model. J Geogr Syst 17, 137–156 (2015). https://doi.org/10.1007/s10109-015-0209-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10109-015-0209-3