Contextual mapping: Visualization of high-dimensional spatial patterns in a single geo-map

doi:10.1016/j.compenvurbsys.2016.08.005

Computers, Environment and Urban Systems

Volume 61, Part A, January 2017, Pages 1-12

https://doi.org/10.1016/j.compenvurbsys.2016.08.005

Highlights

• A generic method based on self-organizing maps (SOMs) that encodes high-dimensional spatial patterns into a single numerical vector.
• The numerical vector makes it possible to visualize high-dimensional patterns in a single geo-map, called a contextual map.
• Instead of rigid spatial clusters, it produces a color-coded spectrum of gradually changing, emergent high-dimensional patterns.
• It can be used hierarchically to combine several contextual maps, each representing a unique high-dimensional context.

Abstract

In this study, we proposed a generic methodology for combining high-dimensional spatial data to identify and visualize the hidden spatial patterns in a single-layer geo-map. By using the less explored one-dimensional self-organizing maps, we showed how the high-dimensional data can be transformed into a spectrum of one-dimensional ordered numbers. These numbers (codes) can index a high-dimensional space with the important property that similar indices refer to similar high-dimensional contexts. Thus, the high-dimensional vectors will be attributed to single numbers, and this one-dimensional output can be easily rendered as a new single data layer in the original geographic map. As a result, it simultaneously identifies the main spatial clusters and visualizes the high-dimensional correlations (if any) in a single geographic map. Further, because the output of the proposed method is a set of ordered indices, there is no need to define a fixed number of clusters in advance.

Because these composite spatial layers are identified on the basis of the selected context (i.e., the selected features or aspects of the spatial phenomena), they are called contextual maps.

Finally, we showed the results of applying the proposed methodology to several synthetic and real-world data sets.

Introduction

With the current rapid growth in the amount of digital data, we must address the challenge of finding appropriate techniques to harness the power of these data streams. For example, in many cities across the world, access to digital spatial maps is no longer the issue; instead, given the amount and diversity of digital data on different aspects of cities, the current challenge is how people can picture their own map of the space as a combination of several factors of interest.

In this direction, there have been several interesting cases, such as peoplemaps¹ or the Livehoods project (Cranshaw, Schwartz, Hong, & Sadeh, 2012), which explore and map activities within cities based on data available from online social networks. One of the cases most similar to our work is a project called Whereabout,² in which the K-means data-clustering algorithm was applied to a collection of spatial data covering more than 200 different aspects of each ward in the city of London. The wards were grouped into a fixed number of clusters according to informational similarity (not physical location). Then, on top of the classical map of London, people get an impression of different regions on the basis of their similarities in all of these categories. In a similar manner, but based only on demographic information, a new coding system of London called LOAC was developed (Longley & Singleton, 2014).

Classical clustering algorithms divide the high-dimensional data space into a predetermined number of groups, each of which is given a label (usually an arbitrary number). The cluster label attributed to each spatial data point can then be visualized on the geographic map with a specified color code. However, although standard clustering methods such as K-means are easy to use, they have some limitations in the domain of spatial pattern recognition. One of the main problems is that they divide the space into a small number of categories, whereas a continuous, smoothly changing pattern over the high-dimensional data would be preferable. Further, one needs to select the number of clusters in advance, which is a critical decision (Tibshirani, Walther, & Hastie, 2001). In addition, in the context of spatial clustering, because the cluster labels are not ordered according to their high-dimensional similarities, the colored visualization of clusters in the geographic map is not directly helpful: similar colors in a clustered geo-map do not necessarily refer to similar high-dimensional patterns. As a result, increasing the number of clusters, each with a different color, may produce spatial visualizations that are hard to read, whereas too few clusters produce results that are too aggregated. One current solution to this problem is to create an RGB (red, green, blue) pattern after data clustering by reducing the high-dimensional vectors of the cluster centers to their first three principal components (Mahinthakumar, Hoffman, Hargrove, & Karonis, 1999). However, in addition to losing some information (by selecting only three principal components), this approach requires an additional step to interpret the colors.

The main hypothesis of this study is that if we find a method to sort the clusters in a way such that similar cluster indices refer to similar contexts (i.e., similar high-dimensional patterns), we can make a direct projection from high-dimensional spatial data to a one-dimensional vector and visualize the high-dimensional patterns in the geographical maps using a simple color spectrum. In this manner, by having many indices instead of dividing the high-dimensional data into a few distinct groups, one can create a spectrum of high-dimensional patterns that are visualized with a colored spectrum on spatial maps. Because the high-dimensional patterns would change gradually, this would also solve the problem of distinct cluster borders and the fixed number of clusters. As we show in Section 2, our proposed approach can be discussed from the viewpoint of dimensionality reduction and manifold learning (Bengio, Courville, & Vincent, 2013), where one of the best methods that satisfies these requirements is self-organizing maps (SOMs) (Kohonen, 2013).
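To illustrate the idea of rendering ordered indices with a simple color spectrum, the following is a minimal sketch that assumes a linear hue ramp from blue to red. The function name `index_to_rgb`, the choice of hue range, and the five-node example are illustrative assumptions and are not taken from the paper.

```python
import colorsys


def index_to_rgb(index, n_nodes):
    """Map an ordered index in [0, n_nodes - 1] to an RGB triple in [0, 1]."""
    if n_nodes < 2:
        raise ValueError("n_nodes must be at least 2")
    if not 0 <= index < n_nodes:
        raise ValueError("index out of range")
    # Relative position of the index along the ordered chain of indices.
    position = index / (n_nodes - 1)
    # Assumed color ramp: hue 2/3 (blue) for index 0 down to hue 0 (red)
    # for the last index, so nearby indices receive nearby hues.
    hue = (2.0 / 3.0) * (1.0 - position)
    return colorsys.hsv_to_rgb(hue, 1.0, 1.0)


# Hypothetical example: five ordered indices mapped onto the spectrum.
palette = [index_to_rgb(i, 5) for i in range(5)]
```

Because the hue changes monotonically with the index, two regions with similar colors have similar indices. Whether similar indices also mean similar high-dimensional contexts depends on how the indices were produced, which is the property the hypothesis above asks for.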

Section snippets

SOMs in the domain of spatial analysis

SOM is a general-purpose machine-learning method that offers interesting solutions to different data-driven modeling tasks (Kohonen, 2013).

SOM is a nonlinear space transformation method that tries to preserve the topology of high-dimensional data, while transforming them into a low-dimensional space. This means that SOM projects the high-dimensional data points to a lower-dimensional space (normally a two-dimensional grid) in a manner such that neighboring objects in high-dimensional space

One-dimensional SOMs and spatial clustering

In this section, we assume that the reader is familiar with the original SOM algorithm. Therefore, we skip its re-explanation here and refer the reader to Kohonen (2001) for details regarding the training process.

We instead present how one can project high-dimensional spatial data onto geographical maps while preserving the high-dimensional correlations by using the less explored one-dimensional SOM.

We consider the training data set X = {x_1, … , x_M} as a set of M points in an n-dimensional space, x_i ∈ R^n
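As a rough illustration of how a one-dimensional SOM turns each high-dimensional point into a single ordered index, the sketch below trains a chain of nodes with online updates. It assumes a Gaussian neighborhood, linearly decaying learning rate and neighborhood width, and synthetic three-dimensional data; the function names, hyperparameters, and data are hypothetical, and this is not the author's implementation.

```python
import math
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def best_matching_unit(weights, x):
    """Index of the node whose weight vector is closest to x."""
    return min(range(len(weights)), key=lambda j: squared_distance(weights[j], x))


def train_som_1d(data, n_nodes, n_epochs=30, seed=0):
    """Train a chain of n_nodes weight vectors on data (a list of vectors)."""
    rng = random.Random(seed)
    # Assumption: initialise each node with a randomly chosen data point.
    weights = [list(rng.choice(data)) for _ in range(n_nodes)]
    total_steps = n_epochs * len(data)
    step = 0
    for _ in range(n_epochs):
        order = list(range(len(data)))
        rng.shuffle(order)
        for i in order:
            x = data[i]
            progress = step / total_steps
            # Assumed schedules: both quantities decay linearly over training.
            learning_rate = 0.01 + 0.49 * (1.0 - progress)
            sigma = max((n_nodes / 2.0) * (1.0 - progress), 0.5)
            bmu = best_matching_unit(weights, x)
            for j, w in enumerate(weights):
                # Gaussian neighborhood measured along the one-dimensional chain.
                h = math.exp(-((j - bmu) ** 2) / (2.0 * sigma ** 2))
                for d in range(len(w)):
                    w[d] += learning_rate * h * (x[d] - w[d])
            step += 1
    return weights


# Synthetic example data: noisy points along a curve in three dimensions.
data_rng = random.Random(1)
points = []
for _ in range(200):
    t = data_rng.random()
    points.append([t + data_rng.gauss(0.0, 0.01), t * t, math.sin(3.0 * t)])

weights = train_som_1d(points, n_nodes=20)
# Each point is encoded by the index of its best-matching node.
codes = [best_matching_unit(weights, p) for p in points]
```

The list `codes` holds one integer per data point. In a spatial setting, each integer would be attached to the geographic unit the point came from and rendered as a single map layer.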

Experiments with real-world spatial data

In this section, we show the results of the proposed method using two real-world spatial data sets. One is a collection of 235 attributes of the so-called wards in London (Fig. 2). The data set is provided by Future Cities Catapult from the abovementioned project Whereabout. The second data set is obtained from US census 2000 and 2010, including the distribution of different race groups at the census block level, corresponding to five boroughs of New York City.

In the data set from London, there

Discussions and future research

In this section, we will discuss two main technical issues related to the proposed methodology plus one potential application in the field of urban planning and zoning.

The first point is about the chosen one-dimensional topology of SOM. As we briefly mentioned before, it is known that having higher grid dimensions or a more-connected neighborhood topology in the SOM network can improve the performance and quality of the trained SOM in terms of quantization error and topology preservation.
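The two quality criteria mentioned here can be made concrete with a small sketch. It assumes the common definitions: quantization error as the mean distance from each sample to its best-matching node, and topographic error as the share of samples whose two closest nodes are not neighbors on the chain. These definitions, the function names, and the toy data are assumptions for illustration; the paper may measure topology preservation differently.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def quantization_error(data, weights):
    """Mean Euclidean distance between each sample and its closest node."""
    total = 0.0
    for x in data:
        total += math.sqrt(min(squared_distance(x, w) for w in weights))
    return total / len(data)


def topographic_error_1d(data, weights):
    """Share of samples whose best and second-best nodes are not adjacent
    on the one-dimensional chain (assumed definition)."""
    errors = 0
    for x in data:
        ranked = sorted(range(len(weights)),
                        key=lambda j: squared_distance(x, weights[j]))
        if abs(ranked[0] - ranked[1]) != 1:
            errors += 1
    return errors / len(data)


# Toy example: the same four node vectors, once ordered and once scrambled.
samples = [[0.1], [1.2], [2.9]]
ordered_chain = [[0.0], [1.0], [2.0], [3.0]]
scrambled_chain = [[0.0], [2.0], [1.0], [3.0]]

qe_ordered = quantization_error(samples, ordered_chain)
qe_scrambled = quantization_error(samples, scrambled_chain)
te_ordered = topographic_error_1d(samples, ordered_chain)
te_scrambled = topographic_error_1d(samples, scrambled_chain)
```

In this toy case both chains give the same quantization error, because they contain the same node vectors, but only the ordered chain has zero topographic error. This is why the two criteria are usually reported together.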

Conclusions

With the ever-growing availability of digital data in many spatial domains, we need to develop appropriate methods to explore high-dimensional and complex spatial patterns. Compared to classical data clustering problems, one of the main issues of spatial pattern recognition and spatial clustering is that in spatial clustering, in addition to finding high-dimensional patterns, one needs to keep the spatial coordinates in parallel to other features. Finally, it is always desired to project the

Acknowledgments

This research was supported by the National Research Foundation Singapore (NRFS) through the Singapore-ETH Centre for Global Environmental Sustainability (SEC) and the Chair for Computer Aided Architectural Design (CAAD) at ETH Zurich. Further, the author would like to thank the reviewers of the paper as their comments on the initial submission significantly improved the quality of the final paper.

References (32)

D. Arribas-Bel et al. Multidimensional urban sprawl in Europe: A self-organizing map approach. Computers, Environment and Urban Systems (2011)
F. Bação et al. The self-organizing map, the geo-SOM, and relevant variants for geosciences. Computers & Geosciences (2005)
J.A. Flanagan. Self-organization in the one-dimensional SOM with a decreasing neighborhood. Neural Networks (2001)
A. Frenkel et al. The linkage between the lifestyle of knowledge-workers and their intra-metropolitan residential choice: A clustering approach based on self-organizing maps. Computers, Environment and Urban Systems (2013)
R. Henriques et al. Exploratory geospatial data analysis using the GeoSOM suite. Computers, Environment and Urban Systems (2012)
T. Kohonen. Essentials of the self-organizing map. Neural Networks (2013)
A. Skupin et al. An alternative map of the United States based on an n-dimensional model of geographic space. Journal of Visual Languages and Computing (2011)
S.E. Spielman et al. Social area analysis, data mining, and GIS. Computers, Environment and Urban Systems (2008)
J. Vesanto. SOM-based data visualization methods. Intelligent Data Analysis (1999)
N. Wang et al. Visualizing gridded time series data with self organizing maps: An application to multi-year snow dynamics in the Northern Hemisphere. Computers, Environment and Urban Systems (2013)
Y. Bengio et al. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence (2013)
Y. Cheng. Convergence and ordering of Kohonen's batch map. Neural Computation (1997)
J. Cranshaw et al. The livehoods project: Utilizing social media to understand the dynamics of a city
E. Delmelle et al. Trajectories of multidimensional neighbourhood quality of life change. Urban Studies (2013)
E. Erwin et al. Self-organizing maps: Ordering, convergence properties and energy functions. Biological Cybernetics (1992)

