A coarse-to-fine IP-driven registration for pose estimation from single ultrasound image

https://doi.org/10.1016/j.cviu.2013.01.015Get rights and content

Highlights

  • We propose a method for pose estimation from single ultrasound image.

  • We propose a 3D registration method using implicit polynomial (IP) model.

  • We improve the robustness, accuracy and computational efficiency for registration by a coarse-to-fine process with multiple IPs from low degree to high degree.

Abstract

A fast registration making use of implicit polynomial (IP) models is helpful for the real-time pose estimation from single clinical free-hand Ultrasound (US) image, because it is superior in the areas such as robustness against image noise, fast registration without enquiring correspondences, and fast IP coefficient transformation. However it might lead to the lack of accuracy or failure registration.

In this paper, we present a novel registration method based on a coarse-to-fine IP representation. The approach starts from a high-speed and reliable registration with a coarse (of low degree) IP model and stops when the desired accuracy is achieved by a fine (of high degree) IP model. Over the previous IP-to-point based methods our contributions are: (i) keeping the efficiency without requiring pair-wised correspondences, (ii) enhancing the robustness, and (iii) improving the accuracy. The experimental result demonstrates the good performance of our registration method and its capabilities of overcoming the limitations of unconstrained freehand ultrasound data, resulting in fast, robust and accurate registration.

Introduction

To support medical diagnosis, various imaging modalities, such as computed tomography (CT) scan, MRI, PET, and ultrasound (US), are widely used in clinics. Among these modalities, US has beneficial characteristics such as free-hand manner, non-invasiveness, compactness, low cost, and synchronization of operations and imaging. Thus US is attractive for assistance with surgical operations and real-time diagnosis of problems with the circulatory system, abdomen, breast, prostate gland, etc.

However, US images are notorious for the poor image quality, due to speckle noises, low signal-to-noise ratio, occlusions, and uniform brightness. And field of view (FOV) in US imaging is very limited; in severe cases, only 2D cross-sectional images are obtained. These aspects confuse the doctors in making right decisions for diagnosis.

In order to solve these issues, some recent literature advocates the fusion-of-modality techniques. For example, before the surgical operation, 3D models of target parts are obtained by rich but time-consuming modalities such as CT, MRI, and PET. By superimposing US images obtained during the operation on these 3D models, the result will provide rich information to help a doctor’s diagnosis. To achieve this, the key for superimposing is to estimate the pose of US images related to the images derived from other modalities.

The pose estimation can be viewed as a registration problem for two models: a source model (preoperative 3D model) and a target model (2D/3D US image). To do this, a class of methods such as [1], [2] bind the optical position sensors to a US probe, and measure the relative US position to 3D models; For enhancing robustness, the methods in [3], [4], [5] combine the information from position sensors and image features.

Without position sensors, Penney et al. [6] propose to register the surface points manually selected from US images to a preoperative 3D shape model by MRI segmentation; similarly, Amin et al. [7] register the bone boundaries in US images to a shape model segmented from CT image by a modified ICP method; Lange et al. [8] take advantage of 3D-Power Doppler to extract vessel shapes from intraoperative 3D US and register with preoperative models in liver surgery; to enhance the robustness, mutual information is advocated as measuring image similarities, such as [9], [10]; Wein et al. [11] achieve the CT-ultrasound registration by simulating the US image sequence from CT image, and using a new similarity metric: linear correlation of linear combination; other methods such as [12], [13], [10] estimate the relative positions according to the image features or intensity and gradient information of US images and preoperative 3D models. Although each of the methods has its effectiveness, they suffer from expensive computation caused either by the intensity-based similarity calculation or point-to-point ICP-based registration and thus they are difficult to work in real time.

Regardless of the data type, the registration problem is solved basically by three families of methods: (i) ICP-based methods: the iterative closest point method (ICP) first proposed by Besl and McKay [14] or its accelerated variations such as [15] for 3D range data, (ii) point-model methods: e.g., Fitzgibbon [16] encodes the Euclidean distance field by fast distance transformation and employs the robust estimation to remove the outliers; Huang et al. [17] proposed new similarity measurement using information theory to achieve the robust non-rigid registration; and iii) the approach relying on algebraic/geometric invariant features, e.g., moment features is described in [18], and IP global features are proposed by Taubin et al. [19], [20]. The first family of methods can achieve fine registration, but requires time-consuming computation of point-to-point/surface correspondences; The second family of methods can achieve the registration efficiently but needs huge memory, especially in dense 3D cases, to preserve the distance field; and the third family of methods can achieve fast registration, but cannot deal with registration in the case of partially overlapping the target objects [18].

In our previous work [21], [22], we propose to approximate the Euclidean distance with the algebraic formulation using implicit polynomials (IPs) and speed up the registration. The advantages of this method over the prior methods are that: i) unlike the ICP-based methods, it avoids the extra computation for point-wise correspondences; ii) unlike the point-model method of preserving a discrete distance field, it needs very little memory space for preserving a few IP coefficients, and the algebraic model can generate an infinite distance field to support registration in a wider space; iii) unlike the coarse registration methods, it supports partial-overlapped registration. A recent work proposed by Rouhani and Sappa [23] improves the optimization by Levenberg–Marquardt algorithm which leads to a faster convergence. These methods adopt a single IP based registration which remains an essential issue: a moderate IP model is really difficult to generate and thus to be obstacle to an accurate and robust registration.

The previous studies in [24], [25] pointed out two issues frustrating the IP fitting: i) An IP of low degree loses local accuracy for object representation, whereas ii) an IP of high degree might be globally unstable (the undesired surfaces appear in the fitting result). However the former may lead to the lack of accuracy for registration, but the latter may lead to a failure registration. Fig. 1 shows an example when an IP (gray surface) is registered to scattered points (blue points). While Fig. 1 (a) shows the registration result losing much accuracy due to coarse IP of low degree, Fig. 1 (b) shows failure registration due to the global instability problem of IP fitting with high degree.

Our method inherits IP’s merits: neither time-consuming process of correspondence searching nor huge memory for storing the discrete distance field is required. In addition, over the previous methods in [21], [22], [23] that use single IP for registration, we propose a coarse-to-fine IP registration. As illustrated in Fig. 2 leftmost, it starts from a low degree IP (ellipsoid) to achieve a robust initial guess. Second, after a rough registration covered by the IPs of low degree, the higher degree IPs can drive to a more accurate position, even if the IP is not stably modeled (extra zero sets appear around the desired zero set), as shown in Fig. 2 rightmost. Our method improves the robustness and accuracy. The robustness is guaranteed by the coarse estimation with an IP of low degree, whereas the high accuracy can be achieved by an IP of high degree given the appropriate initial guess.

Compared to the global IP matching methods, such as [19], [20], our method overcomes the partially overlapped problem. Such merits make it possible to be applied for the registration between a 3D shape and a 2D US image plane. We adopt boundary information which is independent of the types of modalities. As illustrated in Fig. 3 (a), our method supposes the 3D model has been obtained in an advance which is desired to registered with the online ultrasound image shown in Fig. 3 (b). In an online process, e.g., during a surgical operation, it fast aligns a 3D IP models to a 2D US image, as the coarse-to-fine approach shown in Fig. 4. Then the desired relative pose information between 3D model and the 2D US image (associated with the probe position information of ultrasound device) is obtained.

This paper is organized as follows: Section 2 introduces the mathematics of IP modeling and its properties; In Section 3, we present the registration technique using IP, formulated in both a general case and a US image case, and based on coarse-to-fine approach; Section 4 reports experimental results followed by conclusion in Sections 5. In addition, we present our symbolic computational transformation of IP in Appendix A.

Section snippets

Implicit polynomial

We adopt implicit polynomials for modeling the preoperative 3D images captured by a modality such as MRI, CT-SCAN, or 3D US, supposing that the 3D boundary has been obtained from the segmentation result of the captured volume images.

Registration

In this section, before we present our coarse-to-fine registration method, let us first consider the case driven by single IP model. The objective of IP-driven registration is to find a transformation that makes the IP zero set to be “best” matched with the given data set. It can be formulated as an energy minimization problem in a general case or US image case as introduced in Section 3.2. To this end, the moderate measurement for the distance between data set and IP is required first.

Experimental results

In this section, we report results of experiments dealing with some synthetic data sets to evaluate the method on computational performance. All the experiments were implemented in Matlab 8 combined by C++ code with a PC having an Intel core 2 CPU, 2.4 GHz, and 2 GB memory.

Conclusions

In this paper, we extend our previous method, correspondences free registration with single IP model, to a new coarse-to-fine registration driven by the multiple IPs of incremental degrees. The better performance is achieved by two aspects: (i) The registration robustness and computational efficiency are improved by initial coarse registration with IP of low degree, since no extra IP surface appears and low cost of transformation for the IP and (ii) the registration accuracy is improved by IP

Acknowledgments

This work was partially supported by Canon Inc., under the project: Physics-based vision theories for next-generation medical image processing.

References (32)

  • T. Lange, S. Eulenstein, M. Hunerbein, H. Lamecker, P. Michael Schlag, Augmenting intraoperative 3d ultrasound with...
  • J.P.W. Pluim et al.

    Mutual-information-based registration of medical images: a survey

    IEEE Trans. Med. Imag.

    (2003)
  • W. Wein et al.

    Automatic registration and fusion of ultrasound with CT for radiotherapy

    Proc. MICCAI2005

    (2005)
  • A. Leroy et al.

    Rigid registration of free-hand 3D ultrasound and CT-scan kidney images

    Proc. MICCAI2004

    (2004)
  • A. Roche et al.

    Rigid registration of 3-D ultrasound with MR images: a new approach combining intensity and gradient information

    IEEE Trans. Med. Imag.

    (2001)
  • P. Besl et al.

    A method for registration of 3-D shapes

    IEEE Trans. Pattern. Anal. Mach. Intell. (TPAMI)

    (1992)
  • Cited by (12)

    • A learning-based variable size part extraction architecture for 6D object pose recovery in depth images

      2017, Image and Vision Computing
      Citation Excerpt :

      In the off-line phase, object model is represented with 3D IPs, and by utilizing its gradient flow, 2D ultrasound image is registered in the on-line process. In Ref. [23], a coarse-to-fine fast IP-driven registration method is presented. A rough pose estimation is quickly acquired with a coarse IP model (low degree curve fitting), and finer models refine the parameters of this rough estimation (high degree curve fitting).

    • An analytical representation of conformal mapping for genus-zero implicit surfaces and its application to surface shape similarity assessment

      2015, CAD Computer Aided Design
      Citation Excerpt :

      proposed an IP surface registration algorithm based on minimization of energy function of approximate distances by using IP gradient field. Furthermore, they [20] recently presented a coarse-to-fine method with multiple IPs from low degree to high degree in order to improve the accuracy and the robustness for IP surface registration. Another similar registration method was proposed by means of the Levenberg–Marquardt algorithm [21].

    • Point set registration for pose estimation using continuous distance field

      2019, 2019 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2019
    • Aligning 3D local data of leapfrog locations along elongated structures

      2016, Proceedings - 2016 13th Conference on Computer and Robot Vision, CRV 2016
    View all citing articles on Scopus
    View full text