Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation Search

Liu, Yinlong; Wang, Chen; Song, Zhijian; Wang, Manning

doi:10.1007/978-3-030-01258-8_28

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11216))

Included in the following conference series:

European Conference on Computer Vision

2379 Accesses
42 Citations

Abstract

Three-dimensional rigid point cloud registration has many applications in computer vision and robotics. Local methods tend to fail, causing global methods to be needed, when the relative transformation is large or the overlap ratio is small. Most existing global methods utilize BnB optimization over the 6D parameter space of SE(3) . Such methods are usually very slow because the time complexity of BnB optimization is exponential in the dimensionality of the parameter space. In this paper, we decouple the optimization of translation and rotation, and we propose a fast BnB algorithm to globally optimize the 3D translation parameter first. The optimal rotation is then calculated by utilizing the global optimal translation found by the BnB algorithm. The separate optimization of translation and rotation is realized by using a newly proposed rotation invariant feature. Experiments on challenging data sets demonstrate that the proposed method outperforms state-of-the-art global methods in terms of both speed and accuracy.

Y. Liu and C. Wang—Contributed equally to this work.

You have full access to this open access chapter, Download conference paper PDF

Improved Point Cloud Registration with Scale Invariant Feature Extracted

Article 01 March 2021

A novel point cloud registration using 2D image features

Article Open access 07 January 2017

Three-Dimensional Point Cloud Registration Based on Normal Vector Angle

Article 29 November 2018

Keywords

1 Introduction

Three-dimensional rigid point cloud registration is a common problem in fields such as computer vision, robotics, and computer-assisted intervention [1,2,3,4]. Traditional local methods suffice only when the relative transformation is small and there is a large proportion of true overlap [1,2,3]. In contrast, a global method is needed when there is a large relative transformation or when the overlap proportion is small. Example applications include loop closing in simultaneous localization and mapping (SLAM) [1] and spatial registration in computer-assisted intervention [5]. In recent years, there has been a surge in the use of branch-and-bound (BnB) optimization to globally register 3D point clouds. However, the time complexity of BnB optimization is exponential in the dimensionality of the problem. Most existing global methods optimize an objective function in the parameter space of SE(3) , which has a dimensionality of six; thus, they are usually very slow.

Motivated by [6], in which SE(3) is decoupled into SO(3) and $ R^3 $ and the rotation and translation are optimized separately in SO(3) and $ R^3 $, respectively, we also optimize the rotation and translation separately in lower-dimensional spaces to achieve high efficiency via a newly proposed rotation invariant feature. In the method presented in [6], BnB optimization is first used to globally optimize the rotation to align translation invariant features, namely, the surface normal distributions, constructed from the original point clouds. Once the globally optimal rotation in SO(3) has been obtained, BnB optimization in $ R^3 $ is performed to calculate the translation. In contrast to [6], we propose a new rotation invariant feature (RIF) that allows us to first globally optimize the translation in $ R^3 $ to align the features computed for the original point clouds. The RIF we propose is a triple constructed from a pair of points. The first two elements of the RIF are the distances from the two points to the origin, and the third element is the distance between the two points. Conceptually, the elements of this RIF are the edge lengths of the triangle formed by the two points and the origin, which are obviously invariant with respect to the rotation of the two points around the origin. We maximize an objective function defined in terms of the consensus set between the two RIF sets constructed from the two point clouds to be registered and derive an upper bound on this objective function over the parameter space of translation in $ R^3 $. An efficient BnB optimization algorithm is developed based on this upper bound to calculate the translation with guaranteed global optimality. Experiments using real data demonstrate the efficiency and accuracy of the proposed 6D rigid point cloud registration method under challenging real-world conditions, such as large relative transformation and partial overlap.

2 Related Work

Given two 3D point clouds $ \mathcal {X} $ and $ \mathcal {Y} $, performing 3D rigid registration between them is the process of finding an optimal transformation $ \mathbf {T}\in SE(3) $ to align them, as follows:

$$\begin{aligned} \mathbf {T}^*=\mathop {\arg \max }_{\mathbf {T}\in SE(3)}\quad O(\mathcal {Y},\mathbf {T}(\mathcal {X})) \end{aligned}$$

(1)

where $ \mathcal {X} $ is the moving point cloud, $ \mathcal {Y} $ is the reference point cloud, and O is a function that measures the alignment. If some correspondences between these two point clouds are known, then the transformation can be robustly calculated analytically. However, this registration becomes a difficult optimization problem if no correspondence is known because the objective function O is highly non-convex.

2.1 Local Methods

As a non-convex optimization problem, point cloud registration is first locally solved with iterative approaches, the most popular of which is the iterative closest point (ICP) method [7]. The ICP method starts from an initial transformation and iterates between finding correspondences with the current transformation and updating the transformation with the newly established correspondences. The optimization scheme of the ICP algorithm is of the expectation maximization (EM) type and can converge only to a local optimum. In addition, the original ICP algorithm finds point-to-point correspondences in accordance with the L2 distance by minimizing the average distance between the corresponding point pairs found during iteration. The objective function and optimization scheme of the ICP algorithm make it very sensitive to outliers, and its convergence basin is very small.

A great number of variants have been proposed to improve the convergence and robustness of the ICP algorithm [8]. Another line of research for improving the convergence is to replace the objective function of the ICP algorithm with a new objective function defined in terms of the probability density [9]. In these methods, the probability densities are constructed from the original point clouds by means of kernels, such as the Gaussian kernel, and the difference between the two probability densities constructed from the two point clouds to be registered is minimized. This kind of objective function can be made much smoother than the ICP objective function can, thereby enlarging the basin of convergence. However, all these methods can still converge only to a local optimum.

2.2 Global Methods

The first attempts at global point cloud registration used heuristic methods, such as simulated annealing [10] or particle swarm optimization [11]. Methods of this kind have an increased probability of reaching the global optimum regardless of the initialization conditions. However, global optimality cannot be guaranteed.

The current trend is to solve the global point cloud registration problem using guaranteed global optimization methods. Most of these methods use BnB optimization, and almost every newly developed method involves deriving a new bound on the objective function. [12] is a pioneer work on global point cloud registration in which the point cloud registration problem is parameterized in terms of both transformation and correspondence and a simplified problem under rotation alone is solved by combining BnB optimization with Lipschitz optimization. Go-ICP [13] is the first practical global point cloud registration method. Based on a lemma presented in [14], which states that when a vector is rotated through two rotations, the angular difference between the two new vectors is no greater than the angular difference between the two rotations, Go-ICP establishes a geometric bound on a point rotated through an arbitrary rotation in a cubic branch of SO(3) . Go-ICP optimizes the same objective function as that of the traditional ICP algorithm via BnB optimization, and a trimming technique is used to address outliers. [15] proposes a tighter bound and optimizes a more robust objective function based on a consensus set, and this bound is further tightened in [16]. Another way to address outliers is to define the objective function in terms of the probability density, as is done in GOGMA [17], which minimizes the L2 distance between the two probability densities constructed from the original point clouds with Gaussian kernels.

The above global point cloud registration methods optimize an objective function in the six-dimensional space of SE(3) . Asymmetric point matching (APM) optimizes an objective function defined in terms of both transformation and point correspondence matrix [18]. Although the dimensionality of the original parameter space is very high, a bound is developed in a lower-dimensional space, which has the same dimensionality as that of the spatial transformation, by assuming a point-to-point correspondence. This assumption makes it difficult for APM to register partially overlapping point clouds or point clouds with gross outliers. The advantage of this method is that it can perform affine registration.

The time complexity of BnB optimization is exponential in the dimensionality of the problem. Therefore, existing global point cloud registration methods are usually very slow, primarily because the parameter space of SE(3) has six dimensions. One way to improve the time efficiency is to decouple SE(3) into SO(3) and $ R^3 $ and then optimize the 3D rotation and the 3D translation separately. [6] utilizes translation invariant features to first optimize the 3D rotation and then optimizes the 3D translation using the calculated globally optimal rotation; however, the process of constructing the translation invariant features is very time consuming.

2.3 Contribution

This paper also optimizes rotation and translation separately to achieve high efficiency. Our first contribution is that we propose a simple RIF that allows us to first globally search for the translation between two point clouds to align the features constructed from the original point clouds. We define a robust objective function based on a consensus set and derive a tight bound to achieve fast BnB optimization for the translation search. Second, we develop a 6D point cloud registration algorithm on the basis of the global translation search. Experiments on challenging real data demonstrate the superiority of the proposed method over state-of-the-art global methods in terms of both run time and accuracy.

3 Method

3.1 Rotation Invariant Feature

Let $ \mathcal {Y}=\{\mathbf {y}_j\}_{j=1}^Y $ and $ \mathcal {X}=\{\mathbf {x}_i\}_{i=1}^X $ be two 3D point clouds, which are related by a 3D rigid transformation from $ \mathcal {X} $ to $ \mathcal {Y} $. For a pair of corresponding points $ \mathbf {y}_j $ and $ \mathbf {x}_i $, we have

$$\begin{aligned} \mathbf {y}_j=\mathbf {R}^*(\mathbf {x}_i+\mathbf {t}^*) \end{aligned}$$

(2)

where $ \mathbf {t}^*\in R^3$ and $ \mathbf {R}^*\in SO(3)$ are the translation and rotation, respectively, from $ \mathcal {X} $ to $ \mathcal {Y} $.

For a pair of points $ \{\mathbf {x}_{i1}, \mathbf {x}_{i2}\} $ from the moving point cloud, we propose the construction of a triple $ \{\Vert \mathbf {x}_{i1} \Vert , \Vert \mathbf {x}_{i2} \Vert , \Vert \mathbf {x}_{i1}-\mathbf {x}_{i2} \Vert \} $, where $ \Vert \cdot \Vert $ denotes the Euclidean norm in $ R^3 $. As shown in Fig. 1, the three elements of this triple are the edge lengths of the triangle formed by the two points $ \mathbf {x}_{i1}$ and $ \mathbf {x}_{i2} $ and the origin. Obviously, this triple is invariant with respect to rotation around the origin, and thus, we call it a RIF. This means that for any $ \mathbf {R}\in SO(3) $, we have

$$\begin{aligned} \begin{Bmatrix} \Vert \mathbf {Rx}_{i1} \Vert \\ \Vert \mathbf {Rx}_{i2} \Vert \\ \Vert \mathbf {R}(\mathbf {x}_{i1}-\mathbf {x}_{i2}) \Vert \end{Bmatrix} = \begin{Bmatrix} \Vert \mathbf {x}_{i1} \Vert \\ \Vert \mathbf {x}_{i2} \Vert \\ \Vert \mathbf {x}_{i1}-\mathbf {x}_{i2} \Vert \end{Bmatrix} \end{aligned}$$

(3)

Figure 1(b–d) show the changes in RIFs with the translation of points.

3.2 Objective Function for the Translation Search

We use $ \mathbf {p}_{i1,i2}=\{\Vert \mathbf {x}_{i1} \Vert , \Vert \mathbf {x}_{i2} \Vert , \Vert \mathbf {x}_{i1}-\mathbf {x}_{i2} \Vert \} $ to denote the RIF constructed from a pair of moving points $ \{\mathbf {x}_{i1}, \mathbf {x}_{i2}\} $ and use $ \mathbf {q}_{j1,j2}=\{\Vert \mathbf {y}_{j1} \Vert , \Vert \mathbf {y}_{j2} \Vert , \Vert \mathbf {y}_{j1}-\mathbf {y}_{j2} \Vert \} $ to denote the RIF constructed from a pair of reference points $ \{\mathbf {y}_{j1}, \mathbf {y}_{j2}\} $. Following equation (3), if the two point pairs are related by a translation $ \mathbf {t} $ and a rotation $ \mathbf {R} $, we have

$$\begin{aligned} \mathbf {q}_{j1,j2}= \begin{Bmatrix} \Vert \mathbf {y}_{j1} \Vert \\ \Vert \mathbf {y}_{j2} \Vert \\ \Vert \mathbf {y}_{j1}-\mathbf {y}_{j2} \Vert \end{Bmatrix} =\begin{Bmatrix} \Vert \mathbf {R}(\mathbf {x}_{i1}+\mathbf {t}) \Vert \\ \Vert \mathbf {R}(\mathbf {x}_{i2}+\mathbf {t}) \Vert \\ \Vert \mathbf {R}(\mathbf {x}_{i1}-\mathbf {x}_{i2}) \Vert \end{Bmatrix} =\begin{Bmatrix} \Vert \mathbf {x}_{i1}+\mathbf {t} \Vert \\ \Vert \mathbf {x}_{i2}+\mathbf {t} \Vert \\ \Vert \mathbf {x}_{i1}-\mathbf {x}_{i2} \Vert \end{Bmatrix} :=F(\mathbf {p}_{i1,i2},\mathbf {t}) \end{aligned}$$

(4)

We find that the two RIFs are related by the translation $ \mathbf {t} $, and we define the function $ F(\mathbf {t}) $ to express this relationship. The parameter space of the function $ F(\mathbf {t}) $ is $ R^3 $. This means that if two point clouds are related by a translation $ \mathbf {t} $ and a rotation $ \mathbf {R} $, then the RIFs constructed from corresponding point pairs are related only by the translation $ \mathbf {t} $ through the function $ F(\mathbf {t}) $.

Let $ \mathcal {Q}=\{\mathbf {q}_n\}_{n=1}^Q $ be the set of RIFs constructed from point pairs $ \{\mathbf {y}_{j1}, \mathbf {y}_{j2}\} $, where $ \mathbf {y}_{j1},\mathbf {y}_{j2}\in \mathcal {Y} $ and $ j1 \ne j2 $. Let $ \mathcal {P}=\{\mathbf {p}_m\}_{m=1}^P $ be the set of RIFs constructed from point pairs $ \{\mathbf {x}_{i1}, \mathbf {x}_{i2}\} $, where $ \mathbf {x}_{i1},\mathbf {x}_{i2}\in \mathcal {X} $ and $ i1 \ne i2 $. The problem of finding the optimal translation from $ \mathcal {X} $ to $ \mathcal {Y} $ becomes the following optimization problem:

$$\begin{aligned} \mathbf {t}^*= \mathop {\arg \max }_{\mathbf {t}\in R^3} \quad E(\mathcal {Q},F(\mathcal {P},\mathbf {t})) \end{aligned}$$

(5)

where $ E(\mathbf {t}) $ is a function that measures the alignment between two RIF sets. By solving the optimization problem in (5) over $ R^3 $, we can find the optimal $ \mathbf {t}^* $, which forms part of the solution to the optimization problem in (1) over SE(3) .

To make the objective function robust to outliers, we define $ E(\mathbf {t}) $ on the basis of a consensus set as follows:

$$\begin{aligned} E(\mathcal {Q},F(\mathcal {P},\mathbf {t}))=\sum _m \max _n \lfloor {\Vert F(\mathbf {p}_m,\mathbf {t})-\mathbf {q}_n \Vert }_\infty \le \varepsilon \rfloor \end{aligned}$$

(6)

where $ \lfloor \cdot \rfloor $ is an indicator function that returns 1 if the condition $ \cdot $ is true and 0 otherwise. $ {\Vert \cdot \Vert }_\infty $ is the L-infinity norm in $ R^3 $ and $ \varepsilon $ is the inlier threshold. The size of both $ \mathcal {Q} $ and $ \mathcal {P} $ is very large. From the definition of a RIF, we can see that the third element of a RIF, which is the distance between the two points in the pair, is invariant with respect to translation. This means that the third elements of the corresponding RIFs in $ \mathcal {Q} $ and $ \mathcal {P} $ are equal to each other. In practice, we do not need to match the entirety of $ \mathcal {Q} $ and $ \mathcal {P} $. Instead, we match a subset of the RIFs in $ \mathcal {Q} $ and $ \mathcal {P} $, the third elements of which fall within a specific range. We denote these two subsets by $ \mathcal {Q'}=\{\mathbf {q}_n\}_{n=1}^{Q'} $ and $ \mathcal {\mathbf {P}'}=\{\mathbf {p}_m\}_{m=1}^{P'} $, and the actual objective function used in this paper is

$$\begin{aligned} E(\mathcal {Q'},F(\mathcal {P'},\mathbf {t}))=\sum _m \max _n \lfloor \Vert F(\mathbf {p}_m,\mathbf {t})-\mathbf {q}_n \Vert _\infty \le \varepsilon \rfloor \end{aligned}$$

(7)

3.3 Bounds and Branch-and-Bound-Based Algorithm

To maximize (7) via BnB optimization, we need an upper bound on this objective function in a branch of the parameter space. In our BnB algorithm, we search for the optimal translation in a cube in $ R^3 $ and iteratively divide the parameter space into sub-cubes. As shown in Fig. 2, for a translation cube $ \mathbb {T} $ with a diagonal length of 2r and centred at $ \mathbf {t}_c $, we have

$$\begin{aligned} \Vert \mathbf {x}+\mathbf {t}_c\Vert -r \le \Vert \mathbf {x}+\mathbf {t} \Vert \le \Vert \mathbf {x} + \mathbf {t}_c\Vert +r \end{aligned}$$

(8)

Let $ \mathbf {p}_m $ be the RIF constructed from a point pair $ \{\mathbf {x}_{m1},\mathbf {x}_{m2}\} $ from the moving point cloud and let $ \mathbf {q}_n $ be the RIF constructed from a point pair $ \{\mathbf {y}_{n1},\mathbf {y}_{n2}\} $ from the reference point cloud. Then, for any $ \mathbf {t} \in \mathbb {T} $, using formula (8), we have

$$\begin{aligned} {\Vert F(\mathbf {p}_m,\mathbf {t})-\mathbf {q}_n \Vert }_\infty = {\Vert \begin{Bmatrix} { \Vert \mathbf {x}_{m1} + \mathbf {t} \Vert -\Vert \mathbf {y}_{n1} \Vert } \\ {\Vert \mathbf {x}_{m2} + \mathbf {t} \Vert -\Vert \mathbf {y}_{n2} \Vert } \\ { \Vert \mathbf {x}_{m1} - \mathbf {x}_{m2} \Vert -\Vert \mathbf {y}_{n1} - \mathbf {y}_{n2} \Vert } \end{Bmatrix} \Vert }_\infty :=C(\mathbf {t}) \end{aligned}$$

(9)

where $ {\Vert \mathbf {x}_{m1} -\mathbf {x}_{m2} \Vert }-{\Vert \mathbf {y}_{n1} -\mathbf {y}_{n2} \Vert } $ is constant with respect to $ \mathbf {t} $. Because we can compute the bounds on $ \Vert \mathbf {x}_{m1} + \mathbf {t} \Vert $ and $ \Vert \mathbf {x}_{m2} + \mathbf {t} \Vert $ from equation (8), we can easily compute the bounds on the absolute values of $ \Vert \mathbf {x}_{m1}+\mathbf {t} \Vert - \Vert \mathbf {y}_{n1} \Vert $ and $ \Vert \mathbf {x}_{m2}+\mathbf {t} \Vert - \Vert \mathbf {y}_{n2} \Vert $. Then we can compute the upper and lower bounds on $ {\Vert F(\mathbf {p}_m,\mathbf {t})-\mathbf {q}_n \Vert }_\infty $, and we denote this lower bound by $ \underline{C}(\mathbf {t}) $. Thus, we have

$$\begin{aligned} {\Vert F(\mathbf {p}_m,\mathbf {t}) - \mathbf {q}_n \Vert }_\infty \ge \underline{C}(\mathbf {t}) \end{aligned}$$

(10)

It follows that

$$\begin{aligned} \lfloor \Vert F(\mathbf {p}_m,\mathbf {t})-\mathbf {q}_n\Vert _\infty \le \varepsilon \rfloor \le \lfloor \underline{C}(\mathbf {t}) \le \varepsilon \rfloor \end{aligned}$$

(11)

Then, we can define the upper-bound function as

$$\begin{aligned} \overline{E}(\mathbb {T})=\sum _m \max _n \lfloor \underline{C}(\mathbf {t}) \le \varepsilon \rfloor \end{aligned}$$

(12)

For any $ \mathbf {t} \in \mathbb {T} $, we have

$$\begin{aligned} E(\mathcal {Q'},F(\mathcal {P'},\mathbf {t}))\le \overline{E}(\mathbb {T}) \end{aligned}$$

(13)

Utilizing the upper bound given by formula (12), we can search the translation space to find the globally optimal translation that maximizes the objective function (7). The algorithm is outlined in Algorithm 1.

3.4 Six-Dimensional Registration

Once the globally optimal translation $ \mathbf {t} ^*$ between $ \mathcal {X} $ and $ \mathcal {Y} $ has been found, it is easy to calculate the optimal rotation $ \mathbf {R} ^* $ between the two point clouds. For example, we could use the global rotation search methods presented in [15] or [16]; however, the optimal rotation can actually be obtained in a much easier way. When we obtain the optimal $ \mathbf {t}^* $ via Algorithm 1, we also obtain a set of consensus RIFs between $ \mathcal {P'} $ and $ \mathcal {Q'} $, which provide us with potential correspondences between points in the original point clouds $ \mathcal {X} $ and $ \mathcal {Y} $. Of course, there will be some outliers among these correspondences, and the problem of estimating $ \mathbf {R} ^* $ becomes a robust estimation problem, which can be solved quickly and reliably. In this paper, we estimate $ \mathbf {R} ^* $ by means of the RANdom SAmple Consensus (RANSAC) algorithm [19]. We refer to our 6D rigid registration method as GoTS. Many global optimization methods sacrifice accuracy for speed and global optimality, and a subsequent local refinement is performed to improve accuracy. Similarly, a local refinement procedure can be added to GoTS; we refer to the algorithm with local refinement as GoTS$ ^+ $.

4 Experiments and Results

In this section, we report the results of evaluating the performance of GoTS and GoTS$ ^+ $ using many different point clouds and compare the proposed algorithms against the following three global methods: Go-ICP [13], Glob-GM [15] and APM [18]. We did not perform comparisons with the methods of [6] and [17], which use GPUs. GoTS and GoTS$ ^+ $ were implemented in MATLAB. The code for the other methods was obtained from the authors. In GoTS, a range is set for the third elements of the RIFs that are used to construct $ \mathcal {P'} $ and $ \mathcal {Q'} $ for the translation search. We set the lower bound of this range to 65% of the shortest extension of the original point cloud in the X, Y and Z directions, and set the range width to 0.01 times inlier threshold. In the supplementary material, we studied the registration time with respect to the range width used in choosing the RIFs for translation search. The original point clouds used in these experiments are very dense, and we down-sampled them using the pcdownsample function of MATLAB with a specified gridsize. In the experiments with synthetic data in Sects. 4.1 and 4.2, we scaled the points to make them fall within a cube of $[-1,1]^3 $ and set 0.01 as the inlier threshold for GoTS; in the experiment with real data in Sects. 4.3 and 4.4, we set gridsize as the inlier threshold for GoTS. In Glob-GM, the matchlist and local-refinement options were turned on. In Go-ICP, a distance transformation was used to speed up the nearest-neighbour distance calculation. All experiments were performed on a computer with a 2.80 GHz Intel(R) Core(TM) i7-7700HQ CPU and 16 GB of RAM. The run time for each method includes the method-specific pre-processing time.

4.1 Run-Time Comparison with Other Global Methods

We first compared the run times of GoTS for registering point clouds with different numbers of points to the run times of the other three methods. In this experiment, we used the bunny point cloud from the Stanford 3D Scanning Repository and down-sampled the original data to point clouds with different numbers of points. The down-sampled point cloud was scaled to fall within a cube of $[-1,1]^3 $ and was regarded as the moving point cloud, and it was transformed to a new position via a random transformation to form the reference point cloud. Therefore, we knew the ground-truth transformation from the moving point cloud to the reference point cloud and the point correspondence between the two point clouds. The moving point cloud was then registered to the reference point cloud with each of the four methods. For each number of points, we performed 20 registrations with different random ground-truth transformations, and the average run times with respect to the number of points are shown in Figs. 3(a) and (b). The run time of Glob-GM and APM exceeded 1000 s when registering 400 points. To better illustrate the difference between GoTS and Go-ICP, we show the run times of only these two methods in Fig. 3(b). We can see that GoTS and Go-ICP are much faster than the other two methods and that GoTS is approximately four times as fast as Go-ICP. In this experiment, all methods could successfully register the moving and reference point clouds.

Figure 3(c) shows the evolution of the upper and lower bounds of Algorithm 1 during one registration between 1000 moving points and 1000 reference points, in which it took only 5.2 s for GoTS to converge.

4.2 Robustness to Outliers

In this experiment, we evaluated the robustness of GoTS to outliers and compared it to the robustness of the other three methods. Outliers are commonly encountered in point cloud registration; in this section, we study only the influence of gross outliers, which do not belong to the true object represented by the point cloud. Gross outliers may originate from imperfections of the device used to obtain the point cloud. In the registration of partially overlapping surfaces, the points in the region that is not overlapped by the other point cloud can also be regarded as outliers; experiments focusing on this scenario are reported in the next section.

We down-sampled the bunny data from the Stanford 3D Scanning Repository (http://graphics.stanford.edu/data/3Dscanrep/) with a gridsize of 0.0121 to obtain 500 points and then scaled these points to fall within a cube of $[-1,1]^3 $. The scaled data points were used as the clean moving point cloud. Uniformly distributed random outlier points were added to this clean point cloud, and the resulting point cloud was then transformed via a random transformation to form a reference point cloud. Different numbers of outliers were added to generate reference point clouds with five different outlier percentages: 10%, 20%, 30%, 40% and 50%. Figure 4(c) and (d) show examples of reference point clouds with 10% and 50% outliers, respectively. We ran each algorithm 20 times for each outlier ratio.

Table 1 reports the root mean square errors on the inlier points for each method. Both GoTS and Go-ICP achieved high accuracy, indicating that they are both very robust to outliers. Although the errors of Go-ICP were smaller than those of GoTS, the accuracy of GoTS is sufficiently high in practice. The errors of GoTS$ ^+ $ show that two point clouds can be perfectly aligned by means of a subsequent local refinement, which requires less than one second of additional time. The registration error of Glob-GM is much higher than that of GoTS, and it increases as the outlier ratio increases. Considering that all the points lie in a cube with an edge length of 2, the registration errors of Glob-GM indicate that it fails in aligning the two point clouds in many cases. The errors of APM are even higher; the reason for these large errors may be that the assumption of a one-to-one correspondence that is adopted in APM is not valid in these experiments with outliers.

Table 1. Root mean square error between corresponding inlier points after registration.

Full size table

The average run times of the four algorithms with respect to the outlier ratio are shown in Fig. 4(a). The run time of APM increased rapidly as we added more outliers to the 500 inlier points and exceeded 1000 s when the outlier ratio was 20%. The run time of Glob-GM decreased when the outlier ratio was above 20%; this occurred because the algorithm terminated earlier at an incorrect solution in which the two point clouds were not aligned. Again, we plot the average run times of GoTS and Go-ICP separately in Fig. 4(b) to better illustrate the difference between them.

4.3 Partially Overlapping Registration of Real 3D Scans

In this section, we report the results of testing the performance of the proposed method on partially overlapping point clouds using real data. We compared GoTS and GoTS$ ^+ $ to Go-ICP and Glob-GM in this experiment. APM was not used here, but we can expect that its registration error would be large in this experiment because of its difficulty in dealing with outliers.

We used bun000, bun045, bun090, ArmadilloStand60, ArmadilloStand30 and ArmadilloStand0 from the Stanford 3D Scanning Repository in this experiment. The first three point clouds are scans of bunny from three different directions, and the last three are scans of Armadillo from three different directions. The original point clouds were down-sampled to clouds consisting of 1000 to 2000 points, all of which lay in a cube with an edge length of 0.3. The ground-truth transformations between each pair of scans of the same object are provided in the data set. Using the four methods, we performed registrations between four pairs of point clouds: ArmadilloStand60/ArmadilloStand30, ArmadilloStand30/ArmadilloStand0, bun045/bun000 and bun090/bun000. The point cloud pairs before and after registration are shown in the supplementary material.

Table 2. Results of registering Armadillo scans using four different methods.

Full size table

Table 3. Results of registering bunny scans using four different methods.

Full size table

The translation and rotation errors together with the run time of each method are listed in Tables 2 and 3. In these experiments, the inlier threshold of Glob-GM was set to 0.05 for the Armadillo data and to 0.029 for the bunny data because Glob-GM could not terminate within 1000 s when we used an inlier threshold equal to the gridsize of the down-sampling function. We can see that GoTS achieved high accuracy in these challenging registrations. In particular, the translation errors achieved by the global optimal translation search method in Algorithm 1 are all very small. Go-ICP failed on bun090/000, which is a difficult case because the relative transformation between the two point clouds is very large and the overlap ratio is small. The run time of Go-ICP was extremely long in this case, while in the other three cases, the run time of Go-ICP was slightly longer than or twice as long as that of GoTS. Carefully tuning the trimming ratio may improve the accuracy of Go-ICP, but in practice, the best trimming ratio cannot be known before registration. Glob-GM failed in all four cases, and its run times were very long. The number of RIFs generated for the global translation search was between 150 and 160 in these experiments.

4.4 Registration of Scans of LivingRoom

We tested the performance of the proposed method on large-scale field data using “livingRoom.mat” from the MATLAB example “3-D Point Cloud Registration and Stitching”, which consists of a series of 3D point sets obtained by continuously scanning a living room. We used GoTS, Go-ICP and Glob-GM to register the point clouds of the first and sixth frames of these data. The original point clouds consist of more than 300,000 points, and all these points lie in a cube with an edge length of 4 m. They were down-sampled with a gridsize of 0.1 m.

The results are listed in Table 4. GoTS successfully registered the two point clouds in 25.90 s. In contrast, Go-ICP failed to register the point clouds when the mean square error threshold was set to 0.05. When we used a mean square error threshold of 0.01, the algorithm could not terminate within half an hour. For Glob-GM, when we set the inlier threshold to 0.1, the algorithm could not terminate within half an hour. Then, we set the inlier threshold to 0.8; the algorithm terminated after 633.8 s, but the result was incorrect. The numbers of RIFs generated from the reference and moving point clouds were 1434 and 1866, respectively. More results can be found in the supplementary material.

Table 4. Results of registering livingRoom point clouds.

Full size table

5 Conclusion

We introduce a fast global 3D rigid point cloud registration method based on the decoupling of translation and rotation optimization via a newly proposed rotation invariant feature. We first globally optimize the translation by using a BnB algorithm to match sets of rotation invariant features constructed from the two point clouds, and we then calculate the optimal rotation using the optimized translation. Decoupling the optimization of the translation and rotation makes the proposed algorithm more efficient than existing methods. Experiments on real data demonstrate the superiority of the proposed algorithm in cases of partial overlap, large angular differences and high outlier ratios. All code will be available on the web (http://www.fudanmiccai.org).

References

Whelan, T., Kaess, M., Johannsson, H., Fallon, M., Leonard, J.J., McDonald, J.: Real-time large-scale dense RGB-D SLAM with volumetric fusion. Int. J. Robot. Res. 34, 598–626 (2015)
Article Google Scholar
Henry, P., Krainin, M., Herbst, E., Ren, X., Fox, D.: RGB-D mapping: using Kinect-style depth cameras for dense 3D modeling of indoor environments. Int. J. Robot. Res. 31, 647–663 (2012)
Article Google Scholar
Newcombe, R.A., et al.: KinectFusion: real-time dense surface mapping and tracking. In: International Symposium on Mixed and Augmented Reality, pp. 127–136 (2011)
Google Scholar
Salvi, J., Matabosch, C., Fofi, D., Forest, J.: A review of recent range image registration methods with accuracy evaluation. Image Vis. Comput. 25, 578–596 (2007)
Article Google Scholar
Fan, Y., Xu, X., Wang, M.: A surface-based spatial registration method based on sense three-dimensional scanner. J. Craniofac. Surg. 28, 157–160 (2017)
Article Google Scholar
Straub, J., Campbell, T., How, J.P., Fisher, J.W.I.: Efficient global point cloud alignment using Bayesian nonparametric mixtures. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2403–2412 (2017)
Google Scholar
Besl, P.J., McKay, N.D.: A method for registration of 3-D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14, 239–256 (1992)
Article Google Scholar
Chetverikov, D., Stepanov, D., Krsek, P.: Robust euclidean alignment of 3D point sets: the trimmed iterative closest point algorithm. Image Vis. Comput. 23, 299–309 (2005)
Article Google Scholar
Jian, B., Vemuri, B.C.: Robust point set registration using Gaussian mixture models. IEEE Trans. Pattern Anal. Mach. Intell. 33, 1633–1645 (2011)
Article Google Scholar
Kirkpatrick, S., Gelatt, C.D., Vecchi, M.P.: Optimization by simulated annealing. Science 220, 671–680 (1983)
Article MathSciNet Google Scholar
Wachowiak, M.P., Smolikova, R., Zheng, Y.F., Zurada, J.M., Elmaghraby, A.S.: An approach to multimodal biomedical image registration utilizing particle swarm optimization. IEEE Trans. Evol. Comput. 8, 289–301 (2004)
Article Google Scholar
Li, H., Hartley, R.: The 3D–3D registration problem revisited. In: IEEE International Conference on Computer Vision, pp. 1947–1954 (2007)
Google Scholar
Yang, J., Li, H., Campbell, D., Jia, Y.: Go-ICP: a globally optimal solution to 3D ICP point-set registration. IEEE Trans. Pattern Anal. Mach. Intell. 38, 2241–2254 (2016)
Article Google Scholar
Hartley, R.I., Kahl, F.: Global optimization through rotation space search. Int. J. Comput. Vis. 82, 64–79 (2009)
Article Google Scholar
Bustos, A.P., Chin, T., Eriksson, A., Li, H., Suter, D.: Fast rotation search with stereographic projections for 3D registration. IEEE Trans. Pattern Anal. Mach. Intell. 38, 2227–2240 (2016)
Article Google Scholar
Campbell, D., Petersson, L., Kneip, L., Li, H.: Globally-optimal inlier set maximisation for simultaneous camera pose and feature correspondence. In: IEEE International Conference on Computer Vision, pp. 1–10 (2017)
Google Scholar
Campbell, D., Petersson, L.: GOGMA: globally-optimal Gaussian mixture alignment. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5685–5694 (2016)
Google Scholar
Lian, W., Zhang, L., Yang, M.: An efficient globally optimal algorithm for asymmetric point matching. IEEE Trans. Pattern Anal. Mach. Intell. 39, 1281–1293 (2016)
Article Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus - a paradigm for model-fitting with applications to image-analysis and automated cartography. Commun. ACM 24, 381–395 (1981)
Article MathSciNet Google Scholar

Download references

Acknowledgements

This research was supported by the National Natural Science Foundation of China (grants 81471758, 81701795 and 60972102). This research was also partially supported by the National Key Research and Development Program of China (2017YFC0110700) and the Program of Shanghai Academic/Technology Research Leaders (16XD1424900).

Author information

Authors and Affiliations

Digital Medical Research Center, School of Basic Medical Science, Fudan University, Shanghai, 200032, China
Yinlong Liu, Chen Wang, Zhijian Song & Manning Wang
Shanghai Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention, Shanghai, 200032, China
Yinlong Liu, Chen Wang, Zhijian Song & Manning Wang

Authors

Yinlong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhijian Song
View author publications
You can also search for this author in PubMed Google Scholar
Manning Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Zhijian Song or Manning Wang .

Editor information

Editors and Affiliations

Google Research, Zurich, Switzerland
Vittorio Ferrari
Carnegie Mellon University, Pittsburgh, PA, USA
Martial Hebert
Google Research, Zurich, Switzerland
Cristian Sminchisescu
Hebrew University of Jerusalem, Jerusalem, Israel
Yair Weiss

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 3672 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, Y., Wang, C., Song, Z., Wang, M. (2018). Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation Search. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds) Computer Vision – ECCV 2018. ECCV 2018. Lecture Notes in Computer Science(), vol 11216. Springer, Cham. https://doi.org/10.1007/978-3-030-01258-8_28

Download citation

DOI: https://doi.org/10.1007/978-3-030-01258-8_28
Published: 06 October 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01257-1
Online ISBN: 978-3-030-01258-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation Search

Abstract

Similar content being viewed by others

Improved Point Cloud Registration with Scale Invariant Feature Extracted

A novel point cloud registration using 2D image features

Three-Dimensional Point Cloud Registration Based on Normal Vector Angle

Keywords

1 Introduction