A new hierarchical triangle-based point-in-polygon data structure

doi:10.1016/j.cageo.2008.09.013

Computers & Geosciences

Volume 35, Issue 9, September 2009, Pages 1843-1853

https://doi.org/10.1016/j.cageo.2008.09.013 Get rights and content

Abstract

The point-in-polygon or containment test is fundamental in computational geometry and is applied intensively in geographical information systems. When this test is repeated several times with the same polygon a data structure is necessary in order to reduce the linear time needed to obtain an inclusion result. In the literature different approaches, like grids or quadtrees, have been proposed for reducing the complexity of these algorithms. We propose a new data structure based on hierarchical subdivisions by means of tri-cones, which reduces the time necessary for this inclusion test. This data structure, named tri-tree, decomposes the whole space by means of tri-cones and classifies the edges of the polygon. For the inclusion test only the edges classified in one tri-cone are considered. The tri-tree has a set of properties which makes it specially suited to this aim, with a similar complexity to other data structures, but well suited to polygons which represent regions. We include a theoretical and practical study in which the new data structure is compared with classical ones, showing a significant improvement.

Introduction

The containment test is widely used in areas like computational geometry, computer graphics and geographical information systems (GIS). In some situations we must consider the inclusion not only of a point in a polygon, but also of many points and polygons, and a common task is to assign points to polygons. The point-in-polygon operation is popular in GIS analysis and makes sense from both the discrete object and the field perspective (Longley et al., 2001).

The point-in-polygon test could be used in a very large set of applications, such as the superposition of polygons, the classification of entities or queries about properties of regions. Imagine an area in which it is necessary to determine spatial coincidence between hydrologic unit codes and regions. A point-in-polygon routine is needed for assigning regions to hydrologic units. These hydrologic units are useful for the prediction of zones of erosion and deposition in a catchment, and can be expanded when torrential rains occur. This special situation is considered in the “Andalucía” region of Spain.

The inclusion test consists of obtaining a boolean result regarding the containment of a point in the interior or on the boundary of a polygon represented by a set of ordered vertices.

The definition of polygon changes for different applications, and special algorithms have been developed for different types of polygons (like convex or non-convex) and for different definitions of polygons (like simple or non-simple). We consider a polygon as an ordered set of vertices (in counter-clockwise order) which form edges making a closed loop without edge intersection (simple polygon). We allow convex or non-convex polygons, manifold or non-manifold, and polygons with or without holes (Fig. 1). We employ this type of polygons which represent spatial areas in a GIS perspective.

When an inclusion test is needed repetitively for the same polygon and there is enough memory in the system for extra information, some data structures are introduced as pre-processing in order to reduce the time complexity of the algorithms. In these cases, some features of the polygon are classified or arranged and the inclusion time is reduced by means of a determined search in these structures.

In this paper we present a new data structure specially suited to the point-in-polygon test. This triangle-based structure subdivides recursively the space occupied by the polygon into tri-cones¹ and classifies the edges of the polygon in pre-processing time. Two bounding triangles delimit the area occupied by the classified edges. In query time the points are classified in this data structure, and a point-in-polygon test is performed only with the edges classified in the corresponding tri-cone (Fig. 2). For the containment test several algorithms could be used, but the ray-crossing method (Haines, 1994) and the triangle-based algorithm (Feito et al., 1995) are well suited.

The new data structure is suitable for polygons which represent regions because it fits properly to the contour, which means a quick inclusion test. The expected complexity is $O (\log n)$ for query, being n the number of vertices of the polygon, with $O (n)$ space and $O (n \log n)$ construction time. This new data structure is constructed in less time than other data structures like quadtrees, obtaining an improvement in the inclusion test and being suitable for some inclusion methods like the ray-crossings or the triangle-based ones.

In the next section we will revise data structures traditionally used for the point-in-polygon test, as well as their relevant properties. A review of point-in-polygon methods is also shown. Then we will propose a new data structure and its fundamental features. We will follow with the point-in-polygon algorithms adapted to the use of this data structure. Finally we will carry out a time study in which we measure the times obtained for diverse inclusion methods and data structures.

Section snippets

Background

The main data structures used for the point-in-polygon test are grids, quadtrees, k–d trees, bsp-trees, layer of edges, slices or trapezoids (Huang and Shih, 1997, Samet, 2006). Some of these data structures are suitable for some containment methods (i.e. the ray-crossing method could be used with grids, quadtrees, k–d trees, layer of edges or trapezoids). Let us examine some of these data structures. First we review the principal point-in-polygon methods.

Point-in-polygon methods have been

The new data structure

The previous strategies attempt to reduce the number of edges to examine in the point-in-polygon test. These strategies benefit from reduced query time complexity due to additional storage cost and pre-processing time. We propose a new data structure specially suited to the inclusion test for GIS applications.

In the new data structure we will try to use a spatial subdivision similar to the wedge method (Preparata and Shamos, 1985), but applied to non-convex polygons.

The proposed method

Algorithms adaptation to the new structure

We have mentioned four different algorithms for the point-in-polygon test with non-convex polygons: the ray-crossings, the triangle-based, the winding-number and the pixel-based method. The last method is suitable for raster representations. The ray-crossings and winding-number concepts are very closely related in a mathematical perspective (Hormann and Agathos, 2001), the winding-number being slower than the ray-crossings method. Therefore we will study the adaptation of the ray-crossings and

Experimental results and discussion

Now let us analyse the storage, construction and inclusion time for the new data structure.

For a tri-cone we must store information about the bounding triangle as well as the inner bounding triangle, and a list of edges classified in the tri-cone (pointers to the vertices). All bounding triangles have a common point as origin, the reason why for each triangle it is necessary to store only two vertices. The number of edges classified in a tri-cone depends on the geometry of the polygon.

The

Conclusions

We have obtained a new data structure specially suited for the point-in-polygon problem using the crossings-count and the triangle-based methods. This data structure fits better to the geometry of a polygon, obtaining better construction and query times than other data structures like grids and quadtrees. The method proposed is suitable for complex polygons with a large quantity of edges with or without holes, being translation, rotation and scale invariant.

This method could be combined with

Acknowledgements

This work has been partially supported by the Spanish Ministry of Education and Science and the European Union (via ERDF funds) through the research Project TIN2007-67474-C03-03, by the Consejería de Innovación, Ciencia y Empresa of the Junta de Andalucía through the research Projects P06-TIC-01403 and P07-TIC-02773, and by the University of Jaén through the research Project UJA-08-16-02.

References (16)

F.R. Feito et al.
Orientation, simplicity and inclusion test for planar polygons
Computers & Graphics
(1995)
E. Haines
Point in polygon strategies
K. Hormann et al.
The point in polygon problem for arbitrary polygons
Computational Geometry: Theory and Applications
(2001)
C. Huang et al.
On the complexity of point-in-polygon algorithms
Computers & Geosciences
(1997)
J. Li et al.
Point-in-polygon tests by convex decomposition
Computers & Graphics
(2007)
F. Martinez et al.
The multi-L-REP decomposition and its applications to a point-in-polygon inclusion test
Computers & Graphics
(2006)
W. Wang et al.
2D point-in-polygon test by classifying edges into layers
Computers & Graphics
(2005)
B. Žalik et al.
Polygon trapezoidation by sets of open trapezoids
Computers & Graphics
(2003)

There are more references available in the full text version of this article.

Cited by (17)

A robust, fast, and accurate algorithm for point in spherical polygon classification with applications in geoscience and remote sensing
2022, Computers and Geosciences
Citation Excerpt :
We utilize a method of recursive spatial subdivision that is particularly well suited to the sphere. It shares many similarities with the wedge method for convex planar polygons (Preparata and Shamos, 1985) and the tri-tree algorithm (Jiménez et al., 2009) used for arbitrary simple planar polygons. The algorithm works by classifying polygon edges into “spherical-lunes”.
The point in spherical polygon problem addresses the task of classifying an arbitrary query point as inside, outside, or on the boundary of a polygon drawn using great circle segments on the spherical surface. Point in spherical polygon has numerous applications in the geosciences and remote-sensing, where the problem of sorting point position data onto geographic regions on the surface of the earth or sensor field-of-view often arises. By resorting to a direct solution on the spherical surface, distortions arising from projection can be avoided. We examine the problem, with and without preprocessing, where preprocessing is a step applied when a large number of classifications are evaluated against the same spherical polygon. Several improvements are introduced to the existing point in spherical polygon algorithm (which does not include preprocessing), resulting in significantly faster inclusion queries and better handling of edge cases. Furthermore, we introduce a preprocessing algorithm by means of recursive, nonuniform spatial subdivision of longitudinal regions (“lunes”) on the sphere. With an $O (n log n)$ preprocessing step and $O (n)$ space requirement, the algorithm decreases query time from $O (n)$ to $O (log n)$ for most “realistic” polygons found in geoscience and remote sensing applications. Several empirical tests demonstrate that the algorithm performs to theoretical expectations.
Parallel scanline algorithm for rapid rasterization of vector geographic data
2013, Computers and Geosciences
Citation Excerpt :
Later rasterization algorithms were mostly derived from or improvements of these methods. Some researchers focused on the fast implementation of rasterization, such as the winding number algorithm (Hormann and Agathos, 2001), the hierarchical triangle-based method (Jiménez et al., 2009), and the point-in-polygon method based on the quasi-closest point (Yang et al., 2010). Other researchers considered the lossy conversion process and focused on improving the accuracy of the rasterization, such as the optimization algorithm based on errors in minimized areas (Wang et al., 2006) and the conversion of equal areas (Zhou et al., 2007).
With the expansion of complex geographic calculations and the increase of spatial data types involved in the spatial analysis of large areas, the need becomes urgent for fast rasterization of massive multi-source geographic vector data. A parallel scanline algorithm is proposed for rapid rasterization. It provides a systematic solution to solve the complicated situation in parallel processing (cross-processor boundaries, common boundaries, and tiny polygons), thus ensuring the accuracy of the parallel scanline algorithm. The relationship of parallel speedup with the number of processors, the data partition pattern, and the raster grid size is discussed. Massive vector geographic data (approximately 0.7 million polygons) used in the experiment were effectively processed, thereby dramatically reducing the processing time and getting good speedup.
A point-in-polygon method based on a quasi-closest point
2010, Computers and Geosciences
Citation Excerpt :
For example, every time a mouse is clicked inside a map on a screen, an instance of the point-in-polygon problem occurs (O'Rourke, 2000). Another example in geosciences is to identify if a water well is in a certain watershed, and even to assign regions to hydrologic units for the prediction of erosion and deposition zones (Jiménez et al., 2009). The point-in-polygon test can be also used to determine the number of soil boreholes located in a particular cadastral parcel (Ellis, 2001).
This paper presents a numerically stable solution to a point-in-polygon problem by combining the orientation method and the uniform subdivision technique. We define first a quasi-closest point that can be locally found through the uniform subdivision cells, and then we provide the criteria for determining whether a point lies inside a polygon according to the quasi-closest point. For a large number of points to be tested against the same polygon, the criteria are employed to determine the inclusion property of an empty cell as well as a test point. The experimental tests show that the new method resolves the singularity of a test point on an edge without loss of efficiency. The GIS case study also demonstrates the capability of the method to identify which region contains a test point in a map.
Fine Linear Equation Algorithm for Geo-Fence
2023, Lecture Notes in Electrical Engineering
Efficient Point-in-Polygon Tests by Grids Without the Trouble of Tuning the Grid Resolutions
2022, IEEE Transactions on Visualization and Computer Graphics
Polar, Spherical and Orthogonal Space Subdivisions for an Algorithm Acceleration: O(1) Point-in-Polygon/Polyhedron Test
2022, arXiv

View all citing articles on Scopus

View full text

A new hierarchical triangle-based point-in-polygon data structure

Abstract

Introduction

Section snippets

Background

The new data structure

Algorithms adaptation to the new structure

Experimental results and discussion

Conclusions

Acknowledgements

Computers & Graphics

Computational Geometry: Theory and Applications

Computers & Geosciences

Computers & Graphics

Computers & Graphics

Computers & Graphics

Computers & Graphics