Generalizing Inverse Compositional and ESM Image Alignment

International Journal of Computer Vision

Abstract

Inverse compositional (IC) image alignment (Baker and Matthews in Int. J. Comput. Vis. 56(3):221–255, 2004) exploits the symmetry between the roles of the fixed and moving images to speed up processing. However, it requires the optimizer to implement compositional update steps. The IC approach can be viewed as an efficient way of computing the derivative of the similarity measure with respect to the warp parameters of the fixed image. Since the mapping between the fixed and moving warp parameters is continuous and differentiable, this derivative can be converted into the moving warp space using the chain rule, which avoids the need for compositional update steps. Our generalization also allows the efficient second order method (ESM) (Malis in Proceedings of the 2004 IEEE International Conference on Robotics and Automation (ICRA04), pp. 1843–1848, 2004; Benhimane and Malis in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2004; Malis and Benhimane in Robot. Auton. Syst. 52(1):39–52, 2005) to be applied to general parameterizations of the transformation.

Experiments using multiple similarity measures and optimizers show that our generalized IC method equals or exceeds the performance of the original IC approach. The generalized ESM approach is more reliable than the classic approach as it increases the capture radius of the optimization.
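The chain-rule conversion described in the abstract can be illustrated with a small numerical sketch. It is not the authors' implementation: the 1D affine warp W(x; p) = (1 + p[0]) x + p[1], the helper names, and the toy cost `moving_cost` are all assumptions chosen for illustration. The sketch takes a derivative with respect to the fixed-image warp parameters q (evaluated at the identity, q = 0) and maps it into the moving-image parameter space through the Jacobian of the parameter mapping, so that an ordinary additive update on p could be used.

```python
import numpy as np

def compose_with_inverse(p, q):
    """Parameters of W(.; p) o W(.; q)^{-1} for the assumed 1D affine warp
    W(x; p) = (1 + p[0]) * x + p[1]."""
    a, b = p
    c, d = q
    scale = (1.0 + a) / (1.0 + c)
    return np.array([scale - 1.0, b - scale * d])

def jacobian_at_identity(p):
    """Analytic Jacobian d p' / d q of compose_with_inverse at q = 0."""
    a, _ = p
    return -(1.0 + a) * np.eye(2)

def numerical_gradient(f, x, h=1e-6):
    """Central-difference gradient of a scalar function f at x."""
    x = np.asarray(x, dtype=float)
    grad = np.zeros_like(x)
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = h
        grad[i] = (f(x + step) - f(x - step)) / (2.0 * h)
    return grad

def moving_cost(p):
    """Hypothetical smooth similarity measure in the moving warp parameters."""
    return np.sin(p[0]) + p[0] * p[1] + 0.5 * p[1] ** 2

p = np.array([0.2, -0.3])

# Cost seen as a function of the fixed-image warp parameters q, with p held fixed.
fixed_cost = lambda q: moving_cost(compose_with_inverse(p, q))

# Derivative in the fixed warp space, evaluated at the identity warp q = 0.
grad_q = numerical_gradient(fixed_cost, np.zeros(2))

# Chain rule: grad_q = J^T grad_p, so grad_p = J^{-T} grad_q.
J = jacobian_at_identity(p)
grad_p = np.linalg.solve(J.T, grad_q)

print("gradient in fixed warp space: ", grad_q)
print("converted to moving warp space:", grad_p)
```

In this toy setting the fixed-space cost is defined by composition, so the converted gradient agrees with the direct gradient of `moving_cost` up to finite-difference error. With real images the fixed-space derivative would instead come from fixed-image gradients that can be precomputed, which is where the efficiency of the IC formulation arises.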

References

  • Andreopoulos, A., & Tsotsos, J. K. (2005). A novel algorithm for fitting 3d active appearance models: applications to cardiac mri segmentation. In Proceedings of the 14th Scandinavian conference on image analysis, Joensuu, Finland.

  • Anuta, P. E. (1969). Digital registration of multispectral imagery. SPIE Journal, 7(6), 168–175.

  • Baker, S., & Matthews, I. (2004). Lucas-Kanade 20 years on: a unified framework. International Journal of Computer Vision, 56(3), 221–255.

  • Baker, S., Patil, R., Cheung, K. M., & Matthews, I. (2004). Lucas-Kanade 20 years on: part 5 (Technical Report CMU-RI-TR-04-64). Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.

  • Bartoli, A. (2006). Groupwise geometric and photometric direct image registration. In Proceedings of the seventeenth British machine vision conference (Vol. 1, pp. 157–66), Edinburgh, UK.

  • Benhimane, S., & Malis, E. (2004). Real-time image-based tracking of planes using efficient second-order minimization. In IEEE/RSJ international conference on intelligent robots and systems, Sendai, Japan.

  • Brooks, R., & Arbel, T. (2006). Generalizing inverse compositional image alignment. In Proceedings of the 18th international conference on pattern recognition (ICPR2006) (Vol. 2, pp. 1200–1203), Hong Kong.

  • Brown, L. G. (1992). A survey of image registration techniques. ACM Computing Surveys, 24(4), 325–376.

  • Brown, M., & Lowe, D. (2007). Automatic panoramic image stitching using invariant features. International Journal of Computer Vision, 74(1), 59–73.

  • Byrd, R. H., Lu, P., & Nocedal, J. (1995). A limited memory algorithm for bound constrained optimization. SIAM Journal on Scientific and Statistical Computing, 16(5), 1190–1208.

  • Conn, A. R., Gould, N. I. M., & Toint, P. L. (2000). Trust-region methods. Society for Industrial and Applied Mathematics and Mathematical Programming Society.

  • Dennis, J. E., Jr., & Wolkowicz, H. (1993). Sizing and least-change secant methods. SIAM Journal on Numerical Analysis, 30(5), 1291–1314.

  • Dowson, N., & Bowden, R. (2008). Mutual information for Lucas-Kanade tracking (MILK): an inverse compositional formulation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(1), 180–185.

  • Fletcher, R. (1987). Practical methods of optimization. New York: Wiley.

  • Gill, P. E., Murray, W., & Wright, M. H. (1981). Practical optimization. New York: Academic Press.

  • Harshman, R. A. (2001). An index formalism that generalizes the capabilities of matrix notation and algebra to n-way arrays. Journal of Chemometrics, 15(9), 689–714.

  • Ibanez, L., Schroeder, W., Ng, L., & Cates, J. (2005). The ITK software guide: ITK V2.0. Kitware Inc., Clifton Park. http://www.itk.org.

  • Keller, Y., & Averbuch, A. (2004). Fast motion estimation using bi-directional gradient methods. IEEE Transactions on Image Processing, 13(8), 1042–1054.

  • Kwan, R.-S., Evans, A., & Pike, G. (1999). MRI simulation-based evaluation of image-processing and classification methods. IEEE Transactions on Medical Imaging, 18(11), 1085–1097.

  • Liu, T.-L., & Chen, H.-T. (2004). Real-time tracking using trust-region methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(3), 397–402.

  • Lucas, B. D., & Kanade, T. (1981). An iterative image registration technique with an application to stereo vision. In Proceedings of the international joint conference on artificial intelligence (pp. 674–679).

  • Malis, E. (2004). Improving vision-based control using efficient second-order minimization techniques. In Proceedings of the 2004 IEEE international conference on robotics and automation (ICRA04) (pp. 1843–1848), New Orleans, LA, USA.

  • Malis, E., & Benhimane, S. (2005). A unified approach to visual tracking and servoing. Robotics and Autonomous Systems, 52(1), 39–52.

  • Matthews, I., & Baker, S. (2004). Active appearance models revisited. International Journal of Computer Vision, 60(2), 135–164.

  • Moler, C., & Van Loan, C. (2003). Nineteen dubious ways to compute the exponential of a matrix, twenty five years later. SIAM Review, 45(1), 3–49.

  • Pluim, J. P. W., Maintz, J. B. A., & Viergever, M. A. (2003). Mutual-information based registration of medical images: a survey. IEEE Transactions on Medical Imaging, 22(8), 986–1004.

  • Sheskin, D. J. (2000). Handbook of parametric and nonparametric statistical procedures (2nd ed.). New York: Chapman and Hall/CRC.

  • Simard, P. Y., LeCun, Y. A., Denker, J. S., & Victorri, B. (2000). Transformation invariance in pattern recognition: tangent distance and propagation. International Journal of Imaging Systems and Technology, 11(3), 181–197.

  • Szeliski, R. (2004). Image alignment and stitching: a tutorial (Technical Report MSR-TR-2004-92). Microsoft Research.

  • Thévenaz, P., & Unser, M. (2000). Optimization of mutual information for multiresolution image registration. IEEE Transactions on Image Processing, 9(12), 2083–2099.

  • Toews, M., Collins, D. L., & Arbel, T. (2005). Maximum a posteriori local histogram estimation for image registration. In Proceedings of medical image computing and computer aided intervention (MICCAI 2005) (Vol. 2, pp. 163–170), Palm Springs, CA, USA.

  • van de Kraats, E., Penney, G., Tomazevic, D., van Walsum, T., & Niessen, W. (2005). Standardized evaluation methodology for 2D-3D registration. IEEE Transactions on Medical Imaging, 24(9), 1177–1190.

  • Venkataraman, P. (2001). Applied optimization with MATLAB programming. New York: Wiley.

  • Viola, P., & Wells, W. M., III (1995). Alignment by maximization of mutual information. In Proceedings of the 5th IEEE international conference on computer vision (ICCV1995) (pp. 16–23). Cambridge: IEEE Computer Society.

  • Weidendorfer, J., Kowarschik, M., & Trinitis, C. (2004). A tool suite for simulation based analysis of memory access behavior. In Proceedings of the 4th international conference on computational science (ICCS 2004), Krakow, Poland.

  • Zitova, B., & Flusser, J. (2003). Image registration methods: a survey. Image and Vision Computing, 21(11), 977–1000.

Author information

Corresponding author

Correspondence to Rupert Brooks.

About this article

Cite this article

Brooks, R., Arbel, T. Generalizing Inverse Compositional and ESM Image Alignment. Int J Comput Vis 87, 191–212 (2010). https://doi.org/10.1007/s11263-009-0263-8

  • DOI: https://doi.org/10.1007/s11263-009-0263-8
