Abstract
This article introduces a novel method for 3D object recognition, which utilizes well-known local features in a more efficient way, without any reliance on partial or global planarity. Geometrically consistent local features, which form the crucial basis for object recognition, are identified using affine 3D geometric invariants. The utilization of 3D geometric invariants replaces the classical 2D affine transform estimation/verification step, and provides the ability to directly verify 3D geometric consistency. The main contribution of the proposed approach lies in this ability of incorporating highly discriminative affine invariant 3D information much earlier in the process of matching in comparison with its counterparts. The accuracy and robustness of the method in highly cluttered scenes, without any prior segmentation or post 3D reconstruction requirements, are presented in the experiments.
Similar content being viewed by others
Notes
This dataset is publicly available at http://www-cvr.ai.uiuc.edu/ponce_grp/data.
References
Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-Up Robust Features (SURF). Comp Vision Image Underst 110(3):346–359
Burns JB, Weiss RS, Riseman E (1992) The non-existence of general-case view-invariants. In: Geometric invariance in computer vision, pp 120–131
Chen DY, Tian XP, Shen YT, Ouhyoung M (2003) On visual similarity based 3D model retrieval. In: Computer graphics forum, vol 22, pp 223–232
Chen H, Bhanu B (2007) Human ear recognition in 3D. IEEE Trans Pattern Anal Mach Intell 29(4):718–737
Chen H, Bhanu B (2009) Efficient recognition of highly similar 3D objects in range images. IEEE Trans Pattern Anal Mach Intell 31(1):172–9
Duda RO, Hart PE, Stork DG (2001) Pattern classification. Wiley
Ferrari V, Tuytelaars T, Gool L (2006) Simultaneous object recognition and segmentation from single or multiple model views. Int J Comput Vis 67(2):159–188
Fischler MA, Bolles RC (1981) Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun ACM 24(6):381–395
Forsyth DA, Ponce J (2003) Computer vision: a modern approach. Prentice Hall Series in Artificial Intelligence, Pearson/Prentice Hall
Gao Y, Dai Q, Wang M, Zhang N (2011) 3D model retrieval using weighted bipartite graph matching. Signal Process Image Commun 26(1):39–47
Gao Y, Tang J, Hong R, Yan S, Dai Q, Zhang N, Chua TS (2012) Camera constraint-free view-based 3-D object retrieval. IEEE Trans Image Process 21(4):2269–2281
Gao Y, Wang M, Tao D, Ji R, Dai Q (2012) 3-D object retrieval and recognition with hypergraph analysis. IEEE Trans Image Process 21(9):4290–4303
Gao Y, Wang M, Zha ZJ, Tian Q, Dai Q, Zhang N (2011) Less is more: efficient 3-D object retrieval with query view selection. IEEE Trans Multimedia 13(5):1007–1018
Hartley R, Zisserman A (2003) Multiple view geometry in computer vision, 2nd edn. Cambridge University Press, ISBN: 0521540518
Hinterstoisser S, Benhimane S, Lepetit V, Navab N, Lepetit V (2008) Simultaneous recognition and homography extraction of local patches with a simple linear classifier. BMVC British Machine Vision Conference 2008
Hinterstoisser S, Cagniart C, Ilic S, Konolige K, Navab N, Lepetit V, Hinterstoisser S Holzer S (2011) Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In: IEEE International Conference on Computer Vision (ICCV)
Johnson A (1999) Using spin images for efficient object recognition in cluttered 3D scenes. IEEE Trans Pattern Anal Mach Intell 21(5):433–449
Joly A, Buisson O (2009) Logo retrieval with a contrario visual query expansion. In: Proceedings of the 17th ACM international conference on multimedia, MM ’09. ACM, New York, pp 581–584
Khotanzad A, Hong Y (1990) Invariant image recognition by Zernike moments. IEEE Trans Pattern Anal Mach Intell 12(5):489–497
Li W, Bebis G, Bourbakis NG (2008) 3-D object recognition using 2-D views. IEEE Trans Image Process 17(11):2236–2255
Lian Z, Godil A, Fabry T, Furuya T (2010) SHREC’10 track: non-rigid 3D shape retrieval. In: Eurographics workshop on 3D objet retrieval, pp 1–8
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110
Maybank S (1998) Relation between 3d invariants and 2D invariants. Image Vis Comput 16(1):13–20
Mikolajczyk K, Schmid C (2005) A Performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Mikolajczyk K, Tuytelaars T, Schmid C, Zisserman A, Matas J, Schaffalitzky F, Kadir T, Gool L (2005) A comparison of Affine region detectors. Int J Comput Vis 65(1–2):43–72
Moreels P, Perona P (2006) Evaluation of features detectors and descriptors based on 3D objects. Int J Comput Vis 73(3):263–284
Mundy J (2006) Object recognition in the geometric era: a retrospective. In: Toward category-level object recognition, pp 3–28
Rothganger F, Lazebnik S, Schmid C, Ponce J (2006) 3D object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints. Int J Comput Vis 66(3):231–259
Rui Y, Huang T (2000) Optimizing learning in image retrieval. In: Computer vision and pattern recognition
Song BS, Lee KM, Lee SU (2001) Model-based object recognition using geometric invariants of points and lines. Comp Vision Image Underst 84(3):361–383
Soysal M, Alatan A, Karadeniz T (2009) Joint utilization of appearance and geometry for determining correspondences. In: International Symposium on Computer and Information Sciences (ISCIS). IEEE, pp 60–65
Soysal M, Alatan AA (2010) Joint utilization of appearance and geometry for scene logo retrieval. In: International Symposium on Computer and Information Sciences (ISCIS), vol 62. Springer-Verlag, New York, p 305
Soysal M, Alatan AA (2011) Joint utilization of appearance, geometry and chance for scene logo retrieval. Comput J 54(7):1221–1231
Tola E, Lepetit V, Fua P (2008) A fast local descriptor for dense matching. In: Conference on computer vision and pattern recognition. Alaska, USA
Torr PHS, Zisserman A, Maybank SJ (1998) Robust detection of degenerate configurations while estimating the fundamental matrix. Comput Vis Image Underst 71(3):312–333
Varma M, Ray D (2007) Learning the discriminative power-invariance trade-off. In: IEEE 11th International Conference on Computer Vision, ICCV 2007, pp 1–8
Weiss I, Ray M (1998) Model-based recognition of 3D objects from one view. IEEE Trans Pattern Anal Mach Intell 23(2):116–128
Zisserman A, Forsyth D, Mundy J, Rothwell C (1995) 3D object recognition using invariance. Artif Intell 78(1–2):239–288
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Soysal, M., Alatan, A.A. Joint utilization of local appearance and geometric invariants for 3D object recognition. Multimed Tools Appl 74, 2611–2637 (2015). https://doi.org/10.1007/s11042-013-1622-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-013-1622-6