Abstract
Mutual information has been used for matching and registering 3D models to 2D images. However, in Viola’s original framework [1], surface albedo variance is assumed to be minimal when measuring similarity between 3D models and 2D image data using mutual information. In reality, most objects have textured surfaces with different albedo values across their surfaces, and direct application of this method in such circumstances will fail. To solve this problem, we propose to include spatial information into the original formulation by using histogram-based features of local regions that are robust to local but significant albedo variation. Neighborhood Extended Gaussian Images (NEGI) are used as descriptors to represent local surface regions on the 3D model, while pixel intensity data are considered within corresponding region windows on the image. Experiments on aligning 3D car models in cluttered scenes using this new framework demonstrate substantial improvement as compared to the original pixel-wise mutual information approach.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Viola, P.: Alignment by Maximization of Mutual Information. PhD thesis, Massachusetts Institute of Technology (1995)
Pluim, J., Maintz, J., Viergiver, M.: Mutual information based registration of medical images: A survey. IEEE Transcations on Medical Imaging 22, 986–1004 (2003)
Campbell, R., Flynn, P.: A survey of free-form object representation and recognition techniques. Computer Vision and Image Understanding 81, 166–210 (2001)
Suveg, I., Gosselman, G.: Mutual information based evaluation of 3D building models. In: Proceedings of the International Conference on Pattern Recognition, Quebec City, Canada, vol. 3, pp. 188–197 (2002)
Kollnig, H., Nagel, N.: 3D pose estimation by directly matching polyhedral models to gray value gradients. International Journal of Computer Vision 23, 283–302 (1997)
Tan, T., Sullivan, G., Baker, K.: Model-based localization and recognition of road vehicles. International Journal of Computer Vision 27, 5–25 (1998)
Maes, F., Collignon, A., Vandermeulen, D., Marchal, G., Suetens, P.: Multi-modality image registration by maximization of mutual information. IEEE Transactions on Medical Imaging 16, 187–198 (1997)
Cover, T., Thomas, J.: Elements of Information Theory. John Wiley, Chichester (1991)
Russakoff, D., Tomasi, C., Rohlfing, T., Maurer, C.: Image similarity using mutual information of regions. In: Proceedings of the European Conference on Computer Vision, Prague, Czech Republic, pp. 596–607 (2004)
Leventon, M., Wells III, W., Grimson, W.: Multiple view 2D-3D mutual information registration. In: DARPA Image Understanding Workshop, pp. 625–630 (1997)
Shannon, C.: A mathematical theory of communication. The Bell System Technical Journal 27, 379–423 (1948)
Horn, B.: Extended gaussian images. Proceedings of the IEEE 72, 1656–1678 (1984)
Pong, H., Cham, T.: Object detection using a cascade of 3D models. In: Proceedings of the Asian Conference on Computer Vision, Hyderabad, India (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pong, HK., Cham, TJ. (2006). Alignment of 3D Models to Images Using Region-Based Mutual Information and Neighborhood Extended Gaussian Images. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3851. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612032_7
Download citation
DOI: https://doi.org/10.1007/11612032_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31219-2
Online ISBN: 978-3-540-32433-1
eBook Packages: Computer ScienceComputer Science (R0)