Object Detection from Large-Scale 3D Datasets Using Bottom-Up and Top-Down Descriptors

Patterson, Alexander; Mordohai, Philippos; Daniilidis, Kostas

doi:10.1007/978-3-540-88693-8_41

Alexander Patterson IV⁴,
Philippos Mordohai⁴ &
Kostas Daniilidis⁴

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5305))

Included in the following conference series:

European Conference on Computer Vision

9936 Accesses
18 Citations

Abstract

We propose an approach for detecting objects in large-scale range datasets that combines bottom-up and top-down processes. In the bottom-up stage, fast-to-compute local descriptors are used to detect potential target objects. The object hypotheses are verified after alignment in a top-down stage using global descriptors that capture larger scale structure information. We have found that the combination of spin images and Extended Gaussian Images, as local and global descriptors respectively, provides a good trade-off between efficiency and accuracy. We present results on real outdoors scenes containing millions of scanned points and hundreds of targets. Our results compare favorably to the state of the art by being applicable to much larger scenes captured under less controlled conditions, by being able to detect object classes and not specific instances, and by being able to align the query with the best matching model accurately, thus obtaining precise segmentation.

Download to read the full chapter text

Chapter PDF

Dense Segmentation-Aware Descriptors

A Category-Level 3D Object Dataset: Putting the Kinect to Work

Contour Detection at Range Images Using Sparse Normal Detector

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Johnson, A., Carmichael, O., Huber, D., Hebert, M.: Toward a general 3-d matching engine: Multiple models, complex scenes, and efficient data filtering. In: Image Understanding Workshop, pp. 1097–1108 (1998)
Google Scholar
Carmichael, O., Huber, D., Hebert, M.: Large data sets and confusing scenes in 3-d surface matching and recognition. In: 3DIM, pp. 358–367 (1999)
Google Scholar
Matei, B., Shan, Y., Sawhney, H.S., Tan, Y., Kumar, R., Huber, D., Hebert, M.: Rapid object indexing using locality sensitive hashing and joint 3d-signature space estimation. IEEE Trans. on Pattern Analysis and Machine Intelligence 28(7), 1111–1126 (2006)
Article Google Scholar
Mian, A., Bennamoun, M., Owens, R.: Three-dimensional model-based object recognition and segmentation in cluttered scenes. IEEE Trans. Pattern Analysis and Machine Intelligence 28(10), 1584–1601 (2006)
Article Google Scholar
Correa, S.R., Shapiro, L.G., Meila, M., Berson, G., Cunningham, M.L., Sze, R.W.: Symbolic signatures for deformable shapes. IEEE Trans. on Pattern Analysis and Machine Intelligence 28(1), 75–90 (2006)
Article Google Scholar
Frueh, C., Jain, S., Zakhor, A.: Data processing algorithms for generating textured 3d building facade meshes from laser scans and camera images. IJCV 61(2), 159–184 (2005)
Article Google Scholar
Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3d scenes. IEEE Trans. on Pattern Analysis and Machine Intelligence 21(5), 433–449 (1999)
Article Google Scholar
Horn, B.: Extended gaussian images. Proceedings of the IEEE 72(12), 1656–1678 (1984)
Article Google Scholar
Solina, F., Bajcsy, R.: Recovery of parametric models from range images: The case for superquadrics with global deformations. IEEE Transactions on Pattern Analysis and Machine Intelligence 12(2), 131–147 (1990)
Article Google Scholar
Kang, S., Ikeuchi, K.: The complex egi: A new representation for 3-d pose determination. IEEE Transactions on Pattern Analysis and Machine Intelligence 15(7), 707–721 (1993)
Article Google Scholar
Hebert, M., Ikeuchi, K., Delingette, H.: A spherical representation for recognition of free-form surfaces. IEEE Transactions on Pattern Analysis and Machine Intelligence 17(7), 681–690 (1995)
Article Google Scholar
Dorai, C., Jain, A.K.: Cosmos: A representation scheme for 3d free-form objects. IEEE Trans. on Pattern Analysis and Machine Intelligence 19(10), 1115–1130 (1997)
Article Google Scholar
Osada, R., Funkhouser, T., Chazelle, B., Dobkin, D.: Shape distributions. ACM Transactions on Graphics 21(4) (2002)
Google Scholar
Liu, X., Sun, R., Kang, S.B., Shum, H.Y.: Directional histogram model for three-dimensional shape similarity. In: Int. Conf. on Computer Vision and Pattern Recognition (2003)
Google Scholar
Kazhdan, M., Funkhouser, T., Rusinkiewicz, S.: Rotation invariant spherical harmonic representation of 3D shape descriptors. In: Symposium on Geometry Processing (2003)
Google Scholar
Makadia, A., Patterson, A.I., Daniilidis, K.: Fully automatic registration of 3d point clouds. In: Int. Conf. on Computer Vision and Pattern Recognition, vol. I, pp. 1297–1304 (2006)
Google Scholar
Driscoll, J., Healy, D.: Computing fourier transforms and convolutions on the 2-sphere. Advances in Applied Mathematics 15, 202–250 (1994)
Article MATH MathSciNet Google Scholar
Stein, F., Medioni, G.: Structural hashing: Efficient three dimensional object recognition. IEEE Trans. on Pattern Analysis and Machine Intelligence 14(2), 125–145 (1992)
Article Google Scholar
Ashbrook, A., Fisher, R., Robertson, C., Werghi, N.: Finding surface correspondence for object recognition and registration using pairwise geometric histograms. In: Burkhardt, H., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1407, pp. 674–686. Springer, Heidelberg (1998)
Chapter Google Scholar
Frome, A., Huber, D., Kolluri, R., Bulow, T., Malik, J.: Recognizing objects in range data using regional point descriptors. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 224–237. Springer, Heidelberg (2004)
Google Scholar
Huber, D., Kapuria, A., Donamukkala, R., Hebert, M.: Parts-based 3d object classification. In: Int. Conf on Computer Vision and Pattern Recognition, vol. II, pp. 82–89 (2004)
Google Scholar
Besl, P.J., McKay, N.D.: A method for registration of 3-d shapes. IEEE Trans. on Pattern Analysis and Machine Intelligence 14(2), 239–256 (1992)
Article Google Scholar
Shan, Y., Sawhney, H.S., Matei, B., Kumar, R.: Shapeme histogram projection and matching for partial object recognition. IEEE Trans. on Pattern Analysis and Machine Intelligence 28(4), 568–577 (2006)
Article Google Scholar
Funkhouser, T., Shilane, P.: Partial matching of 3d shapes with priority-driven search. In: Symposium on Geometry Processing (2006)
Google Scholar
Medioni, G., Lee, M., Tang, C.: A Computational Framework for Segmentation and Grouping. Elsevier, New York (2000)
MATH Google Scholar
Arya, S., Mount, D.M., Netanyahu, N.S., Silverman, R., Wu, A.Y.: An optimal algorithm for approximate nearest neighbor searching. Journ. of the ACM 45, 891–923 (1998)
Article MATH MathSciNet Google Scholar
Smith, D.A.: Using enhanced spherical images. Technical Report AIM-530. MIT (1979)
Google Scholar
Carr, J.C., Beatson, R.K., Cherrie, J.B., Mitchell, T.J., Fright, W.R., McCallum, B.C., Evans, T.R.: Reconstruction and representation of 3d objects with radial basis functions. In: SIGGRAPH, pp. 67–76. ACM, New York (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Pennsylvania,
Alexander Patterson IV, Philippos Mordohai & Kostas Daniilidis

Authors

Alexander Patterson IV
View author publications
You can also search for this author in PubMed Google Scholar
Philippos Mordohai
View author publications
You can also search for this author in PubMed Google Scholar
Kostas Daniilidis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA
David Forsyth
Department of Computing, Wheatley, Oxford Brookes University, OX33 1HX, Oxford, UK
Philip Torr
Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK
Andrew Zisserman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Patterson, A., Mordohai, P., Daniilidis, K. (2008). Object Detection from Large-Scale 3D Datasets Using Bottom-Up and Top-Down Descriptors. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5305. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88693-8_41

Download citation

DOI: https://doi.org/10.1007/978-3-540-88693-8_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88692-1
Online ISBN: 978-3-540-88693-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Object Detection from Large-Scale 3D Datasets Using Bottom-Up and Top-Down Descriptors

Abstract

Chapter PDF

Similar content being viewed by others

Dense Segmentation-Aware Descriptors

A Category-Level 3D Object Dataset: Putting the Kinect to Work

Contour Detection at Range Images Using Sparse Normal Detector

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Object Detection from Large-Scale 3D Datasets Using Bottom-Up and Top-Down Descriptors

Abstract

Chapter PDF

Similar content being viewed by others

Dense Segmentation-Aware Descriptors

A Category-Level 3D Object Dataset: Putting the Kinect to Work

Contour Detection at Range Images Using Sparse Normal Detector

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation