A coarse-to-fine IP-driven registration for pose estimation from single ultrasound image

doi:10.1016/j.cviu.2013.01.015

Computer Vision and Image Understanding

Volume 117, Issue 12, December 2013, Pages 1647-1658

Highlights

• We propose a method for pose estimation from a single ultrasound image.
• We propose a 3D registration method using an implicit polynomial (IP) model.
• We improve the robustness, accuracy and computational efficiency of registration through a coarse-to-fine process with multiple IPs from low degree to high degree.

Abstract

Fast registration using implicit polynomial (IP) models is helpful for real-time pose estimation from a single clinical free-hand ultrasound (US) image, because it offers robustness against image noise, fast registration without requiring correspondences, and fast transformation of IP coefficients. However, it may suffer from a lack of accuracy or from registration failure.

In this paper, we present a novel registration method based on a coarse-to-fine IP representation. The approach starts with a fast and reliable registration using a coarse (low-degree) IP model and stops when the desired accuracy is achieved with a fine (high-degree) IP model. Compared with previous IP-to-point methods, our contributions are: (i) keeping the efficiency of not requiring pairwise correspondences, (ii) enhancing robustness, and (iii) improving accuracy. The experimental results demonstrate the good performance of our registration method and its capability to overcome the limitations of unconstrained freehand ultrasound data, resulting in fast, robust and accurate registration.

Introduction

To support medical diagnosis, various imaging modalities, such as computed tomography (CT), MRI, PET, and ultrasound (US), are widely used in clinics. Among these modalities, US has beneficial characteristics such as free-hand operation, non-invasiveness, compactness, low cost, and imaging that is synchronized with the operation. US is therefore attractive for assisting surgical operations and for real-time diagnosis of problems with the circulatory system, abdomen, breast, prostate gland, etc.

However, US images are notorious for poor image quality, due to speckle noise, low signal-to-noise ratio, occlusions, and uniform brightness. The field of view (FOV) in US imaging is also very limited; in severe cases, only 2D cross-sectional images are obtained. These factors make it difficult for doctors to reach the right diagnostic decisions.

To address these issues, some recent literature advocates modality-fusion techniques. For example, before a surgical operation, 3D models of the target parts are obtained with information-rich but time-consuming modalities such as CT, MRI, and PET. Superimposing US images obtained during the operation on these 3D models provides rich information to support a doctor's diagnosis. The key to this superimposition is to estimate the pose of the US images relative to the images derived from the other modalities.

Pose estimation can be viewed as a registration problem between two models: a source model (preoperative 3D model) and a target model (2D/3D US image). To this end, one class of methods, such as [1], [2], attaches optical position sensors to a US probe and measures the position of the US image relative to the 3D models; to enhance robustness, the methods in [3], [4], [5] combine the information from position sensors with image features.

Without position sensors, Penney et al. [6] propose registering surface points manually selected from US images to a preoperative 3D shape model obtained by MRI segmentation; similarly, Amin et al. [7] register the bone boundaries in US images to a shape model segmented from a CT image by a modified ICP method; Lange et al. [8] take advantage of 3D power Doppler to extract vessel shapes from intraoperative 3D US and register them with preoperative models in liver surgery; to enhance robustness, mutual information is advocated as a measure of image similarity, as in [9], [10]; Wein et al. [11] achieve CT-ultrasound registration by simulating the US image sequence from the CT image and using a new similarity metric, linear correlation of linear combination; other methods such as [12], [13], [10] estimate the relative positions according to image features or the intensity and gradient information of US images and preoperative 3D models. Although each of these methods is effective, they suffer from expensive computation caused either by intensity-based similarity calculation or by point-to-point ICP-based registration, and thus they are difficult to run in real time.

Regardless of the data type, the registration problem is basically solved by three families of methods: (i) ICP-based methods: the iterative closest point (ICP) method first proposed by Besl and McKay [14], or its accelerated variations such as [15] for 3D range data; (ii) point-model methods: e.g., Fitzgibbon [16] encodes the Euclidean distance field by a fast distance transformation and employs robust estimation to remove outliers, and Huang et al. [17] proposed a new similarity measure using information theory to achieve robust non-rigid registration; and (iii) approaches relying on algebraic/geometric invariant features: e.g., moment features are described in [18], and IP global features are proposed by Taubin et al. [19], [20]. The first family of methods can achieve fine registration, but requires time-consuming computation of point-to-point/surface correspondences; the second family can achieve registration efficiently, but needs a large amount of memory, especially in dense 3D cases, to preserve the distance field; and the third family can achieve fast registration, but cannot deal with cases in which the target objects only partially overlap [18].

In our previous work [21], [22], we proposed approximating the Euclidean distance with an algebraic formulation using implicit polynomials (IPs) to speed up the registration. The advantages of this method over the prior methods are that: (i) unlike the ICP-based methods, it avoids the extra computation for point-wise correspondences; (ii) unlike the point-model methods, which preserve a discrete distance field, it needs very little memory to store a few IP coefficients, and the algebraic model can generate an infinite distance field to support registration in a wider space; (iii) unlike the coarse registration methods, it supports partially overlapped registration. A recent work by Rouhani and Sappa [23] improves the optimization with the Levenberg–Marquardt algorithm, which leads to faster convergence. These methods all adopt registration based on a single IP, which leaves an essential issue unresolved: an appropriate IP model is difficult to generate, and this is an obstacle to accurate and robust registration.
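As an illustration of what approximating the Euclidean distance algebraically can look like, the sketch below uses the common first-order approximation d(p) ≈ |f(p)| / ‖∇f(p)‖. This is an assumption made for illustration, not necessarily the exact formulation of [21], [22]. The ellipsoid model, its semi-axes, and the function names are hypothetical.

```python
import math

def ellipsoid_ip(p, axes=(2.0, 1.0, 1.0)):
    """Degree-2 IP f(x, y, z) = x^2/a^2 + y^2/b^2 + z^2/c^2 - 1 (hypothetical model)."""
    return sum((pi / ai) ** 2 for pi, ai in zip(p, axes)) - 1.0

def ellipsoid_ip_grad(p, axes=(2.0, 1.0, 1.0)):
    """Analytic gradient of ellipsoid_ip."""
    return tuple(2.0 * pi / (ai * ai) for pi, ai in zip(p, axes))

def approx_distance(p, axes=(2.0, 1.0, 1.0)):
    """First-order distance from p to the zero set: |f| / ||grad f||.

    No closest-point (correspondence) search is needed. The gradient
    vanishes at the origin, so this sketch is not defined there.
    """
    f = ellipsoid_ip(p, axes)
    g = ellipsoid_ip_grad(p, axes)
    return abs(f) / math.sqrt(sum(gi * gi for gi in g))

if __name__ == "__main__":
    # The true distance of (2.2, 0, 0) from the ellipsoid is 0.2;
    # the algebraic estimate is close to, but not exactly, that value.
    print(approx_distance((2.2, 0.0, 0.0)))
```

The estimate needs only one polynomial evaluation and one gradient evaluation per point, which is why no point-wise correspondence search is involved.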

The previous studies in [24], [25] pointed out two issues that hamper IP fitting: (i) an IP of low degree loses local accuracy in object representation, whereas (ii) an IP of high degree might be globally unstable (undesired surfaces appear in the fitting result). The former may lead to a lack of registration accuracy, and the latter may lead to registration failure. Fig. 1 shows an example in which an IP (gray surface) is registered to scattered points (blue points). Fig. 1(a) shows a registration result that loses much accuracy because of a coarse, low-degree IP, whereas Fig. 1(b) shows a registration failure caused by the global instability of high-degree IP fitting.

Our method inherits the merits of IPs: it requires neither a time-consuming correspondence search nor a large amount of memory for storing a discrete distance field. In addition, unlike the previous methods in [21], [22], [23], which use a single IP for registration, we propose a coarse-to-fine IP registration. As illustrated in Fig. 2 (leftmost), it first starts from a low-degree IP (an ellipsoid) to obtain a robust initial guess. Second, after the rough registration achieved with the low-degree IPs, the higher-degree IPs can drive the registration to a more accurate position, even if the IP is not stably modeled (extra zero sets appear around the desired zero set), as shown in Fig. 2 (rightmost). Our method thus improves both robustness and accuracy: robustness is provided by the coarse estimation with a low-degree IP, whereas high accuracy can be achieved by a high-degree IP given an appropriate initial guess.
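The coarse-to-fine idea can be sketched on a 2D toy problem: register the data with a low-degree IP first, then reuse the resulting pose as the starting point for a higher-degree IP. Everything here is an assumption made for illustration, including the two models, the first-order distance energy, and the derivative-free pattern search (swapped in for whatever optimizer the paper uses).

```python
import math

# Hypothetical 2D models, not taken from the paper; each returns (f(p), grad f(p)).
def coarse_ip(p):
    """Degree-2 IP: an ellipse roughly following the target shape."""
    x, y = p
    return x * x / 4.84 + y * y / 1.21 - 1.0, (x / 2.42, 2.0 * y / 1.21)

def fine_ip(p):
    """Degree-4 IP: the rounded rectangle (x/2)^4 + y^4 = 1."""
    x, y = p
    return (x / 2.0) ** 4 + y ** 4 - 1.0, (x ** 3 / 4.0, 4.0 * y ** 3)

def energy(ip, pose, pts):
    """Mean squared first-order distance of the posed points to the IP zero set."""
    th, tx, ty = pose
    c, s = math.cos(th), math.sin(th)
    total = 0.0
    for x, y in pts:
        f, g = ip((c * x - s * y + tx, s * x + c * y + ty))
        total += f * f / (g[0] ** 2 + g[1] ** 2 + 1e-12)
    return total / len(pts)

def pattern_search(ip, pose, pts, step=0.25, tol=1e-6, max_sweeps=2000):
    """Derivative-free local search over (theta, tx, ty); accepts only improvements."""
    pose = list(pose)
    best = energy(ip, pose, pts)
    for _ in range(max_sweeps):
        if step <= tol:
            break
        improved = False
        for i in range(3):
            for delta in (step, -step):
                trial = list(pose)
                trial[i] += delta
                e = energy(ip, trial, pts)
                if e < best:
                    pose, best, improved = trial, e, True
        if not improved:
            step *= 0.5
    return tuple(pose), best

def coarse_to_fine(ips, pose, pts):
    """Run one registration per IP, each initialised with the previous result."""
    history = []
    for ip in ips:
        pose, e = pattern_search(ip, pose, pts)
        history.append((pose, e))
    return history

def make_data(true_pose, n=60):
    """Sample the fine model's zero set and move it by the inverse of true_pose."""
    th, tx, ty = true_pose
    c, s = math.cos(th), math.sin(th)
    pts = []
    for k in range(n):
        a = 2.0 * math.pi * k / n
        ca, sa = math.cos(a), math.sin(a)
        r = ((ca / 2.0) ** 4 + sa ** 4) ** -0.25  # puts the point on the zero set
        mx, my = r * ca - tx, r * sa - ty
        pts.append((c * mx + s * my, -s * mx + c * my))
    return pts

if __name__ == "__main__":
    data = make_data((0.2, 0.3, -0.2))
    for level, (pose, e) in enumerate(coarse_to_fine([coarse_ip, fine_ip], (0.0, 0.0, 0.0), data)):
        print(level, pose, e)
```

Because the search only accepts steps that lower the energy, each stage can only improve on the pose it inherits. The sketch makes no claim about reaching the global optimum, and the toy models are symmetric under a 180° rotation, so the recovered rotation is only determined up to that symmetry.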

Compared with global IP matching methods such as [19], [20], our method overcomes the partial-overlap problem. This merit makes it applicable to registration between a 3D shape and a 2D US image plane. We adopt boundary information, which is independent of the type of modality. As illustrated in Fig. 3(a), our method assumes that the 3D model, which is to be registered with the online ultrasound image shown in Fig. 3(b), has been obtained in advance. In an online process, e.g., during a surgical operation, it quickly aligns a 3D IP model to a 2D US image using the coarse-to-fine approach shown in Fig. 4. The desired relative pose between the 3D model and the 2D US image (associated with the probe position information of the ultrasound device) is then obtained.
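To make the 3D-to-2D setting concrete, the sketch below shows one way a pixel of the 2D US image plane could be placed in the 3D model frame under a candidate pose: the pixel is lifted to the plane z = 0, scaled by the pixel spacing, then rotated and translated. The function names, the axis-angle parameterisation, and the `spacing` parameter are assumptions made for illustration; they are not taken from the paper.

```python
import math

def rotation_matrix(axis, angle):
    """Rodrigues' formula: rotation by `angle` radians about `axis` (normalised here)."""
    x, y, z = axis
    n = math.sqrt(x * x + y * y + z * z)
    x, y, z = x / n, y / n, z / n
    c, s = math.cos(angle), math.sin(angle)
    k = 1.0 - c
    return [
        [c + x * x * k, x * y * k - z * s, x * z * k + y * s],
        [y * x * k + z * s, c + y * y * k, y * z * k - x * s],
        [z * x * k - y * s, z * y * k + x * s, c + z * z * k],
    ]

def pixel_to_model(u, v, R, t, spacing=(1.0, 1.0)):
    """Map image pixel (u, v) to 3D model coordinates: R @ (u*sx, v*sy, 0) + t."""
    q = (u * spacing[0], v * spacing[1], 0.0)
    return tuple(sum(R[i][j] * q[j] for j in range(3)) + t[i] for i in range(3))

if __name__ == "__main__":
    # A plane rotated 90 degrees about the x-axis and shifted along x.
    R = rotation_matrix((1.0, 0.0, 0.0), math.pi / 2.0)
    print(pixel_to_model(4, 2, R, (10.0, 0.0, 0.0), spacing=(0.5, 0.5)))
```

Boundary pixels mapped this way become 3D points that can be evaluated against the 3D IP, so the pose is the only unknown.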

This paper is organized as follows: Section 2 introduces the mathematics of IP modeling and its properties; Section 3 presents the registration technique using IPs, formulated for both a general case and a US image case, and based on the coarse-to-fine approach; Section 4 reports experimental results, followed by the conclusion in Section 5. In addition, we present our symbolic computation of the IP transformation in Appendix A.

Section snippets

Implicit polynomial

We adopt implicit polynomials for modeling the preoperative 3D images captured by a modality such as MRI, CT, or 3D US, assuming that the 3D boundary has been obtained from the segmentation result of the captured volume images.
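For orientation, the standard definition of a 3D implicit polynomial of degree n and its zero set is given below. The notation (a_{ijk}, m(x), Z) is an assumption made here, since this snippet does not show the paper's own symbols.

```latex
f_n(\mathbf{x}) \;=\; \sum_{\substack{i,j,k \ge 0 \\ i+j+k \le n}} a_{ijk}\, x^{i} y^{j} z^{k}
\;=\; \mathbf{m}(\mathbf{x})^{\top} \mathbf{a},
\qquad
Z(f_n) \;=\; \{\, \mathbf{x} = (x, y, z) \in \mathbb{R}^{3} : f_n(\mathbf{x}) = 0 \,\}

% Number of coefficients of a degree-n IP in three variables:
\dim \mathbf{a} \;=\; \binom{n+3}{3} \;=\; \frac{(n+1)(n+2)(n+3)}{6}
\qquad (n = 2:\ 10, \quad n = 4:\ 35, \quad n = 6:\ 84)
```

The object boundary is modeled by the zero set Z(f_n). The small coefficient counts show why storing an IP takes far less memory than storing a discrete distance field.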

Registration

In this section, before presenting our coarse-to-fine registration method, we first consider the case driven by a single IP model. The objective of IP-driven registration is to find a transformation that makes the IP zero set match the given data set as well as possible. It can be formulated as an energy minimization problem, in a general case or in the US image case, as introduced in Section 3.2. To this end, a suitable measure of the distance between the data set and the IP is required first.
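A generic form of such an energy is sketched below, assuming a rigid transformation (R, t), data points x_i, and a first-order approximation of the point-to-IP distance. The exact energy of Section 3.2 is not part of this snippet, so this should be read as an illustrative formulation rather than the paper's definition.

```latex
E(R, \mathbf{t}) \;=\; \sum_{i=1}^{N} d\bigl(R\,\mathbf{x}_i + \mathbf{t}\bigr)^{2},
\qquad
d(\mathbf{y}) \;\approx\; \frac{\lvert f_n(\mathbf{y}) \rvert}{\lVert \nabla f_n(\mathbf{y}) \rVert},
\qquad
(\hat{R}, \hat{\mathbf{t}}) \;=\; \operatorname*{arg\,min}_{R \in SO(3),\ \mathbf{t} \in \mathbb{R}^{3}} E(R, \mathbf{t})
```

Each term depends only on the value and gradient of f_n at the transformed point, so evaluating E requires no correspondence between data points and model points.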

Experimental results

In this section, we report the results of experiments on some synthetic data sets to evaluate the computational performance of the method. All the experiments were implemented in Matlab 8 combined with C++ code, on a PC with an Intel Core 2 CPU (2.4 GHz) and 2 GB of memory.

Conclusions

In this paper, we extend our previous method, correspondence-free registration with a single IP model, to a new coarse-to-fine registration driven by multiple IPs of increasing degree. The better performance is achieved in two respects: (i) registration robustness and computational efficiency are improved by the initial coarse registration with a low-degree IP, since no extra IP surface appears and the cost of transforming the IP is low, and (ii) the registration accuracy is improved by IP

Acknowledgments

This work was partially supported by Canon Inc., under the project: Physics-based vision theories for next-generation medical image processing.

References (32)

J.W. Trobaugh et al., Frameless stereotactic ultrasonography: method and applications, Comput. Med. Imag. Graph. (1994)
G. Penney et al., Cadaver validation of intensity-based ultrasound to CT registration, Med. Image Anal. (2006)
W. Wein et al., Automatic CT-ultrasound registration for diagnostic imaging and image-guided intervention, Med. Image Anal. (2008)
A.W. Fitzgibbon, Robust registration of 2d and 3d point sets, Image Vis. Comput. (2003)
J. Salvi et al., A review of recent range image registration methods with accuracy evaluation, Image Vis. Comput. (2007)
N. Pagoulatos et al., Interactive 3-D registration of ultrasound and magnetic resonance images based on a magnetic position sensor, IEEE Trans. Inform. Technol. Biomed. (1999)
J. Blackall et al., Alignment of sparse freehand 3-D ultrasound with preoperative images of the liver using models of respiratory motion and deformation, Trans. Med. Imag. (2005)
X. Huang, N.A. Hill, J. Ren, G. Guiraudon, T.M. Peters, Intra-cardiac 2d us to 3d ct image registration, in:...
G.P. Penney, J.M. Blackall, D. Hayashi, T. Sabharwal, A. Adam, D.J. Hawkes, Overview of an ultrasound to ct or mr...
D.V. Amin et al., Ultrasound registration of the bone surface for surgical navigation, Comput. Aid. Surg. (2003)
T. Lange, S. Eulenstein, M. Hunerbein, H. Lamecker, P. Michael Schlag, Augmenting intraoperative 3d ultrasound with...
J.P.W. Pluim et al., Mutual-information-based registration of medical images: a survey, IEEE Trans. Med. Imag. (2003)
W. Wein et al., Automatic registration and fusion of ultrasound with CT for radiotherapy, Proc. MICCAI2005 (2005)
A. Leroy et al., Rigid registration of free-hand 3D ultrasound and CT-scan kidney images, Proc. MICCAI2004 (2004)
A. Roche et al., Rigid registration of 3-D ultrasound with MR images: a new approach combining intensity and gradient information, IEEE Trans. Med. Imag. (2001)
P. Besl et al., A method for registration of 3-D shapes, IEEE Trans. Pattern. Anal. Mach. Intell. (TPAMI) (1992)

