International Journal of Applied Earth Observation and Geoinformation
A Gestalt rules and graph-cut-based simplification framework for urban building models
Introduction
Real 3D city models have wide applications in urban planning, mapping and virtual tourism. With the rapid development of photogrammetry, computer vision, scanners and 3D modeling technologies, it is currently possible to construct detailed 3D city models in a practical and cost-efficient manner. This is possibly the main reason that 3D digital urban models have become so increasingly popular. These developments are changing the way the user conceives 3D data. Typically, a large city often contains hundreds of millions of buildings, and it is difficult to view digitized city models in real time. Therefore, the visualization of large-scale 3D city models for a variety of professional and mass-market services has received significant attention in the photogrammetry and computer graphics communities. To make such services appealing to a large audience, these 3D models should reach a sufficient level of realism and accuracy. Many solutions have been proposed to generate 3D models of huge urban environments (Royan et al., 2007). For example, Sheppard and Cizek (2009) proposed criteria for evaluating landscape visualizations under the categories of (i) accuracy, (ii) representativeness, (iii) visual clarity, (iv) interest, (v) legitimacy, (vi) access to visual information, and (vii) framing and presentation. To visualize a large-scale city efficiently, some generalization algorithms (such as Forberg, 2007, Zhang et al., 2012) are used to simplify urban building models.
To help people recognize a city more effectively and maintain spatial coherence, Gestalt principles are introduced to identify the distribution pattern of urban buildings. In this paper, we apply Gestalt rules and graph-cuts to cluster similar buildings into the same group. Then, the building footprints and façade textures are aggregated, and the whole urban models can be generalized into multiple levels. To obtain a good generalized representation of a city that abides by the Gestalt principles, the spatial relations, including distance, orientation, similarity and continuity, should be taken into account when building models are clustered. To reduce the load time, a multiple representation data structure termed SceneTree is proposed to store hierarchical models of the aggregated building footprints and facade textures. When these buildings are viewed, different levels of the building models are retrieved from SceneTree and rendered by the parallelization scheme. Compared with other methods, the novelty of our approach is that we implement an optimization framework for generalizing building footprints and facade textures simultaneously. In this framework, a graph-cut-based optimization method is employed to solve the conflicts that a footprint or facade texture may satisfy with several Gestalt principles, and SceneTree is created to store the generalized footprints and textures at deferent levels.
Section snippets
Related work
Numerous studies have developed a range of tools and techniques for visualizing city buildings. Additionally, an extensive amount of literature can be found in the related fields of computer vision, object recognition and geometric modeling. However, there remain many challenges in applying landscape visualization techniques for effectively communicating processes and changes in building landscapes. In this section, previous generalization results of 3D building models are summarized.
Methodology
This section presents our method for urban building generalization and visualization.
Experiments
To validate the performance of our method, we use VC++ 2010 and OpenGL to develop a 3D urban model visualization system. Experiments are performed on a Windows 7 Professional operating system with a 3.2 GHz Intel (R) Core (TM) i5-3470 CPU, 4 GB memory, and an Nvidia Geforce GT 620 graphics card. The study data comprise Beijing urban models with 122,056 buildings and 916,427 vertices.
Conclusion
According to the theory of Gestalt psychology, including proximity, similarity, continuity and common orientation (Wertheimer, 1923), separate elements in an image that have certain relations tend to be merged into a whole in people's mind. Accordingly, to obtain a good generalized representation of a city that abides by these criteria, the spatial relations, including distance, orientation, similarity and continuity, should be taken into account when building models are clustered. To speed up
References (29)
Generalization of 3D building data based on a scale-space approach
ISPRS J. Photogramm. Remote Sens.
(2007)- et al.
Generalization of 3D building texture using image compression and multiple representation data structure
ISPRS J. Photogramm. Remote Sens.
(2013) - et al.
A multiple representation data structure for dynamic visualisation of generalised 3D city models
ISPRS J. Photogramm. Remote Sens.
(2011) - et al.
The ethics of Google Earth: crossing thresholds from spatial data to landscape visualisation
J. Environ. Manag.
(2009) - et al.
Automatic simplification and visualization of 3D urban building models
Int. J. Appl. Earth Observ. Geoinf.
(2012) - et al.
A geometry and texture coupled flexible generalization of urban building models
ISPRS J. Photogramm. Remote Sens.
(2012) - et al.
Compressed facade displacement maps
IEEE Trans. Vis. Comput. Graph.
(2009) - et al.
Omni-directional Relief Impostors, Computer Graphics Forum
(2007) - et al.
Visualization of large-scale urban models through multi-level relief impostors
Comput. Graph. Forum
(2010) - et al.
Relief impostor selection for large scale urban rendering
An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision
IEEE Trans. Pattern Anal. Mach. Intell.
Fast approximate energy minimization via graph cuts
IEEE Trans. Pattern Anal. Mach. Intell.
Legible simplification of textured urban models
IEEE Comput. Graph. Appl.
Ray-casted blockmaps for large urban models visualization
Comput. Graph. Forum
Cited by (22)
Second-order texton feature extraction and pattern recognition of building polygon cluster using CNN network
2024, International Journal of Applied Earth Observation and GeoinformationAn adaptive-size vector tile pyramid construction method considering spatial data distribution density characteristics
2024, Computers and GeosciencesShape-preserving mesh decimation for 3D building modeling
2024, International Journal of Applied Earth Observation and GeoinformationLong-term spatio-temporal changes of wetlands in Tibetan Plateau and their response to climate change
2023, International Journal of Applied Earth Observation and GeoinformationPredicting residential building age from map data
2019, Computers, Environment and Urban SystemsRecognition of building group patterns in topographic maps based on graph partitioning and random forest
2018, ISPRS Journal of Photogrammetry and Remote SensingCitation Excerpt :Graph partition is to eliminate weak relationship among buildings within the connected subgraph and then forms a new disconnected graph that contains multiple connected subgraphs. To quantify the degree of the relationship among topographic objects such as buildings, metrics such as the centric point distance, minimum distance of building boundaries, mean distance, adjacent distance, and synthesized index, have been used (Regnauld, 2001; Ai and Zhang, 2007; Yang et al., 2011; W. Wang et al., 2015; Y. Wang et al., 2015). However, the relationship between each two objects within a group is also influenced by adjacent objects within the same group.