Abstract
We propose an approach for detecting objects in large-scale range datasets that combines bottom-up and top-down processes. In the bottom-up stage, fast-to-compute local descriptors are used to detect potential target objects. In the top-down stage, the object hypotheses are aligned and then verified using global descriptors that capture larger-scale structure information. We have found that the combination of spin images and Extended Gaussian Images, as local and global descriptors respectively, provides a good trade-off between efficiency and accuracy. We present results on real outdoor scenes containing millions of scanned points and hundreds of targets. Our results compare favorably to the state of the art in three respects: the approach is applicable to much larger scenes captured under less controlled conditions, it detects object classes rather than specific instances, and it accurately aligns the query with the best matching model, thus obtaining precise segmentation.
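To make the two descriptor types concrete, the following is a minimal sketch of a spin image (local) and a coarse Extended Gaussian Image (global) computed with NumPy. It is not the authors' implementation: the function names, bin counts, bin size, and the equal-angle sphere binning are all assumptions chosen for illustration, and the alignment and verification steps named in the abstract are not shown.

```python
import numpy as np


def spin_image(points, p, n, bin_size=0.1, n_bins=8):
    """Spin image of `points` around the oriented point (p, n).

    Each neighbor x is mapped to cylindrical coordinates relative to the
    normal axis: beta is the signed height along n, alpha is the radial
    distance from that axis. The descriptor is a 2D histogram of
    (alpha, beta). bin_size and n_bins are illustrative assumptions.
    """
    n = n / np.linalg.norm(n)
    d = points - p
    beta = d @ n
    # Clamp at zero to guard against tiny negative values from round-off.
    alpha = np.sqrt(np.maximum(np.sum(d * d, axis=1) - beta ** 2, 0.0))
    extent = bin_size * n_bins
    hist, _, _ = np.histogram2d(
        alpha, beta, bins=n_bins,
        range=[[0.0, extent], [-extent / 2, extent / 2]],
    )
    return hist  # points outside the support are ignored


def extended_gaussian_image(normals, n_polar=4, n_azimuth=8):
    """Coarse EGI: normalized histogram of unit normals over the sphere.

    Simplification (assumption): bins are equal in angle, not in area,
    and every normal has unit weight instead of a surface-area weight.
    """
    normals = normals / np.linalg.norm(normals, axis=1, keepdims=True)
    polar = np.arccos(np.clip(normals[:, 2], -1.0, 1.0))   # in [0, pi]
    azimuth = np.arctan2(normals[:, 1], normals[:, 0])     # in [-pi, pi]
    hist, _, _ = np.histogram2d(
        polar, azimuth, bins=[n_polar, n_azimuth],
        range=[[0.0, np.pi], [-np.pi, np.pi]],
    )
    return hist / hist.sum()


# Tiny synthetic example (made-up coordinates, not data from the paper).
pts = np.array([[0.05, 0.0, 0.05], [0.15, 0.0, 0.05], [0.0, 0.25, -0.05]])
si = spin_image(pts, p=np.zeros(3), n=np.array([0.0, 0.0, 1.0]))
nrm = np.array([[1.0, 1.0, 1.0], [1.0, -1.0, 0.5], [-1.0, 0.3, -0.7]])
egi = extended_gaussian_image(nrm)
```

The spin image depends only on a point and its normal, so it is cheap to compute at many locations, which suits the bottom-up stage. The EGI summarizes the normal distribution of a whole segment, which suits verification of an aligned hypothesis.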
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Patterson, A., Mordohai, P., Daniilidis, K. (2008). Object Detection from Large-Scale 3D Datasets Using Bottom-Up and Top-Down Descriptors. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5305. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88693-8_41
DOI: https://doi.org/10.1007/978-3-540-88693-8_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88692-1
Online ISBN: 978-3-540-88693-8
eBook Packages: Computer Science, Computer Science (R0)