Abstract
Inverse compositional (IC) image alignment (Baker and Matthews in Int. J. Comput. Vis. 56(3):221–255, 2004) uses the symmetry between the roles of the fixed and moving images for faster processing. However, it requires implementation of compositional optimizer update steps. The IC approach can be viewed as an efficient way of computing the similarity measure derivative relative to the fixed image warp parameters. Since the mapping between the fixed and moving warp parameters is continuous and differentiable, this derivative can be converted into the moving warp space using the chain rule. This avoids the need for compositional update steps. Our generalization also allows the efficient second order method (ESM) (Malis in Proceedings of the 2004 IEEE International Conference on Robotics and Automation (ICRA04), pp. 1843–1848, 2004; Benhimane and Malis in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2004; Malis and Benhimane in Robot. Auton. Syst. 52(1):39–52, 2005) to be applied to general parameterizations of the transformation.
Experiments using multiple similarity measures and optimizers show that our generalized IC method equals or exceeds the performance of the original IC approach. The generalized ESM approach is more reliable than the classic approach as it increases the capture radius of the optimization.
Similar content being viewed by others
References
Andreopoulos, A., & Tsotsos, J. K. (2005). A novel algorithm for fitting 3d active appearance models: applications to cardiac mri segmentation. In Proceedings of the 14th Scandinavian conference on image analysis, Joensuu, Finland.
Anuta, P. E. (1969). Digital registration of multispectral imagery. SPIE Journal, 7(6), 168–175.
Baker, S., & Matthews, I. (2004). Lucas-Kanade 20 years on: a unified framework. International Journal of Computer Vision, 56(3), 221–255.
Baker, S., Patil, R., Cheung, K. M., & Matthews, I. (2004). Lucas-Kanade 20 years on: part 5 (Technical Report CMU-RI-TR-04-64). Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
Bartoli, A. (2006). Groupwise geometric and photometric direct image registration. In Proceedings of the seventeenth British machine vision conference (Vol. 1, pp. 157–66), Edinburgh, UK.
Benhimane, S., & Malis, E. (2004). Real-time image-based tracking of planes using efficient second-order minimization. In IEEE/RSJ international conference on intelligent robots and systems, Sendai, Japan.
Brooks, R., & Arbel, T. (2006). Generalizing inverse compositional image alignment. In Proceedings of the 18th international conference on pattern recognition (ICPR2006) (Vol. 2, pp. 1200–1203), Hong Kong.
Brown, L. G. (1992). A survey of image registration techniques. ACM Computing Surveys, 24(4), 325–376.
Brown, M., & Lowe, D. (2007). Automatic panoramic image stitching using invariant features. International Journal of Computer Vision, 74(1), 59–73.
Byrd, R. H., Lu, P., & Nocedal, J. (1995). A limited memory algorithm for bound constrained optimization. SIAM Journal on Scientific and Statistical Computing, 16(5), 1190–1208.
Conn, A. R., Gould, N. I. M., & Toint, P. L. (2000). Trust-region methods. Society for Industrial and Applied Mathematics and Mathematical Programming Society.
Dennis, J. E., Jr., & Wolkowicz, H. (1993). Sizing and least-change secant methods. SIAM Journal of Numerical Analysis, 30(5), 1291–1314.
Dowson, N., & Bowden, R. (2008). Mutual information for Lucas-Kanade tracking (MILK): an inverse compositional formulation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(1), 180–185.
Fletcher, R. (1987). Practical methods of optimization. New York: Wiley.
Gill, P. E., Murray, W., & Wright, M. H. (1981). Practical optimization. New York: Academic Press.
Harshman, R. A. (2001). An index formalism that generalizes the capabilities of matrix notation and algebra to n-way arrays. Journal of Chemometrics, 15(9), 689–714.
Ibanez, L., Schroeder, W., Ng, L., & Cates, J. (2005). The ITK software guide: ITK V2.0. Kitware Inc., Clifton Park. http://www.itk.org.
Keller, Y., & Averbuch, A. (2004). Fast motion estimation using bi-directional gradient methods. IEEE Transactions on Image Processing, 13(8), 1042–1054.
Kwan, R.-S., Evans, A., & Pike, G. (1999). MRI simulation-based evaluation of image-processing and classification methods. IEEE Transactions on Medical Imaging, 18(11), 1085–1097.
Liu, T.-L., & Chen, H.-T. (2004). Real-time tracking using trust-region methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(3), 397–402.
Lucas, B. D., & Kanade, T. (1981). An iterative image registration technique with an application to stereo vision. In Proceedings of the international joint conference on artificial intelligence (pp. 674–679).
Malis, E. (2004). Improving vision-based control using efficient second-order minimization techniques. In Proceedings of the 2004 IEEE international conference on robotics and automation (ICRA04) (pp. 1843–1848), New Orleans, LA, USA.
Malis, E., & Benhimane, S. (2005). A unified approach to visual tracking and servoing. Robotics and Autonomous Systems, 52(1), 39–52.
Matthews, I., & Baker, S. (2004). Active appearance models revisited. International Journal of Computer Vision, 60(2), 135–164.
Moler, C., & Van Loan, C. (2003). Nineteen dubious ways to compute the exponential of a matrix, twenty five years later. SIAM Review, 45(1), 3–49.
Pluim, J. P. W., Maintz, J. B. A., & Viergever, M. A. (2003). Mutual-information based registration of medical images: a survey. IEEE Transactions on Medical Imaging, 22(8), 986–1004.
Sheskin, D. J. (2000). Handbook of parametric and nonparametric statistical procedures (2nd ed.). New York: Chapman and Hall/CRC.
Simard, P. Y., Cun, Y. A. L., Denker, J. S., & Victorri, B. (2000). Transformation invariance in pattern recognition: tangent distance and propagation. International Journal of Imaging Systems and Technology, 11(3), 181–197.
Szeliski, R. (2004). Image alignment and stitching: a tutorial (Technical Report MSR-TR-2004-92). Microsoft Research).
Thévenaz, P., & Unser, M. (2000). Optimization of mutual information for multiresolution image registration. IEEE Transactions on Image Processing, 9(12), 2083–2099.
Toews, M., Collins, D. L., & Arbel, T. (2005). Maximum a posteriori local histogram estimation for image registration. In Proceedings of medical image computing and computer aided intervention (MICCAI 2005) (Vol. 2, pp. 163–170), Palm Springs, CA, USA.
van de Kraats, E., Penney, G., Tomazevic, D., van Walsum, T., & Niessen, W. (2005). Standardized evaluation methodology for 2D-3D registration. IEEE Transactions on Medical Imaging, 24(9), 1177–1190.
Venkataraman, P. (2001). Applied optimization with MATLAB programming. New York: Canada.
Viola, P., & Wells, W. M., III (1995). Alignment by maximization of mutual information. In Proceedings of the 5th IEEE international conference on computer vision (ICCV1995) (pp. 16–23). Cambridge: IEEE Computer Society.
Weidendorfer, J., Kowarschik, M., & Trinitis, C. (2004). A tool suite for simulation based analysis of memory access behavior. In Proceedings of the 4th international conference on computational science (ICCS 2004), Krakow, Poland.
Zitova, B., & Flusser, J. (2003). Image registration methods: a survey. Image and Vision Computing, 21(11), 977–1000.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Brooks, R., Arbel, T. Generalizing Inverse Compositional and ESM Image Alignment. Int J Comput Vis 87, 191–212 (2010). https://doi.org/10.1007/s11263-009-0263-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11263-009-0263-8