Joint utilization of local appearance and geometric invariants for 3D object recognition

Soysal, Medeni; Alatan, A. Aydın

doi:10.1007/s11042-013-1622-6

Published: 08 August 2013

Volume 74, pages 2611–2637, (2015)
Multimedia Tools and Applications

Medeni Soysal¹ &
A. Aydın Alatan²

Abstract

This article introduces a novel method for 3D object recognition that uses well-known local features more efficiently and does not rely on partial or global planarity. Geometrically consistent local features, which form the crucial basis for object recognition, are identified using affine 3D geometric invariants. Using 3D geometric invariants replaces the classical 2D affine transform estimation/verification step and makes it possible to verify 3D geometric consistency directly. The main contribution of the proposed approach is that it incorporates highly discriminative, affine-invariant 3D information much earlier in the matching process than its counterparts do. Experiments demonstrate the accuracy and robustness of the method in highly cluttered scenes, without requiring prior segmentation or subsequent 3D reconstruction.
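
To make the notion of an affine 3D geometric invariant concrete, the following minimal sketch computes one textbook example: the ratio of two tetrahedron volumes built from five 3D points, which is unchanged by any invertible 3D affine transformation. This is an illustration only. The abstract does not say which invariants the article uses or how they are obtained from image features, so the function names and the choice of invariant here are assumptions.

```python
def det3(a, b, c):
    """Determinant of the 3x3 matrix whose rows are the vectors a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def sub(p, q):
    """Component-wise difference p - q of two 3D points."""
    return tuple(pi - qi for pi, qi in zip(p, q))


def signed_volume(p0, p1, p2, p3):
    """Six times the signed volume of the tetrahedron (p0, p1, p2, p3)."""
    return det3(sub(p1, p0), sub(p2, p0), sub(p3, p0))


def volume_ratio_invariant(points):
    """Ratio of two tetrahedron volumes sharing the face (p0, p1, p2).

    Under x -> A x + t both volumes are scaled by det(A), so the
    ratio is unchanged. Assumes p0..p3 are not coplanar.
    """
    p0, p1, p2, p3, p4 = points
    return signed_volume(p0, p1, p2, p4) / signed_volume(p0, p1, p2, p3)


def apply_affine(A, t, p):
    """Apply the affine map x -> A x + t to a 3D point p."""
    return tuple(sum(A[i][j] * p[j] for j in range(3)) + t[i]
                 for i in range(3))


# Five hypothetical model points (illustrative values only).
points = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 2)]

# An arbitrary invertible affine transformation (det(A) = 5).
A = [[2, 1, 0], [0, 1, 3], [1, 0, 1]]
t = (4, -2, 7)
transformed = [apply_affine(A, t, p) for p in points]

before = volume_ratio_invariant(points)
after = volume_ratio_invariant(transformed)
print(before, after)  # both are 2.0
```

Because the value does not depend on the transformation, a candidate set of correspondences whose invariant differs between model and scene can be rejected without first estimating a transform; that is the general idea of invariant-based consistency checking, stated here independently of the article's specific procedure.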

Notes

This dataset is publicly available at http://www-cvr.ai.uiuc.edu/ponce_grp/data.

Author information

Authors and Affiliations

TÜBİTAK Space Technologies Research Institute, Middle East Technical University Campus, Ankara, Turkey
Medeni Soysal
Electrical and Electronics Engineering Department, Middle East Technical University, Ankara, Turkey
A. Aydın Alatan

Corresponding author

Correspondence to Medeni Soysal.

About this article

Cite this article

Soysal, M., Alatan, A.A. Joint utilization of local appearance and geometric invariants for 3D object recognition. Multimed Tools Appl 74, 2611–2637 (2015). https://doi.org/10.1007/s11042-013-1622-6

Published: 08 August 2013
Issue Date: April 2015
DOI: https://doi.org/10.1007/s11042-013-1622-6
