Abstract
In this paper we propose a framework for instance recognition and object localization in cluttered and occluded household environment for robot grasping task. The whole system bases on a coarse to fine pipeline in combination with the state-of-the-art methods of RGBD-based object detection. We build a sparse feature model by extracting structure key points incorporating texture cues in the train procedure. After that, the paper demonstrates how the algorithm decreases the time complexity and simultaneously guarantees the accuracy of the recognition and pose estimation. Quantitative experimental evaluations are presented using both acknowledged ground truth dataset and real-world robot perception system.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Collet, A., Berenson, D., Srinivasa, S.S., Ferguson, D.: Object recognition and full pose registration from a single image for robotic manipulation. In: 2009 IEEE International Conference on Robotics and Automation, ICRA 2009, pp. 48–55. IEEE (2009)
Martinez, M., Collet, A., Srinivasa, S.S.: Moped: a scalable and low latency object recognition and pose estimation system. In: 2010 IEEE International Conference on Robotics and Automation (ICRA), pp. 2043–2049. IEEE (2010)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571. IEEE (2011)
Tang, J., Miller, S., Singh, A., Abbeel, P.: A textured object recognition pipeline for color and depth image data. In: 2012 IEEE International Conference on Robotics and Automation (ICRA), pp. 3467–3474. IEEE (2012)
Janoch, A., Karayev, S., Jia, Y., Barron, J.T., Fritz, M., Saenko, K., Darrell, T.: A category-level 3D object dataset: putting the kinect to work. In: Fossati, A., Gall, J., Grabner, H., Ren, X., Konolige, K. (eds.) Consumer Depth Cameras for Computer Vision. Advances in Computer Vision and Pattern Recognition, pp. 141–165. Springer, London (2013). doi:10.1007/978-1-4471-4640-7_8
Lai, K., Bo, L., Ren, X., Fox, D.: A large-scale hierarchical multi-view RGB-D object dataset. In: 2011 IEEE International Conference on Robotics and Automation (ICRA), pp. 1817–1824. IEEE (2011)
Mian, A., Bennamoun, M., Owens, R.: On the repeatability and quality of keypoints for local feature-based 3D object retrieval from cluttered scenes. Int. J. Comput. Vis. 89(2–3), 348–361 (2010)
Papazov, C., Burschka, D.: An efficient RANSAC for 3D object recognition in noisy and occluded scenes. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010. LNCS, vol. 6492, pp. 135–148. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19315-6_11
Petrelli, A., Di Stefano, L.: On the repeatability of the local reference frame for partial shape matching. In: 2011 International Conference on Computer Vision, pp. 2244–2251. IEEE (2011)
Aldoma, A., Tombari, F., Rusu, R.B., Vincze, M.: OUR-CVFH – oriented, unique and repeatable clustered viewpoint feature histogram for object recognition and 6DOF pose estimation. In: Pinz, A., Pock, T., Bischof, H., Leberl, F. (eds.) DAGM/OAGM 2012. LNCS, vol. 7476, pp. 113–122. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32717-9_12
Wohlkinger, W., Vincze, M.: Ensemble of shape functions for 3D object classification. In: 2011 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp. 2987–2992. IEEE (2011)
Jiang, D., Wang, H., Chen, W., Wu, R.: A novel occlusion-free active recognition algorithm for objects in clutter. In: 2016 IEEE International Conference on Robotics and Biomimetics (ROBIO), pp. 1389–1394. IEEE (2016)
Aldoma, A., Tombari, F., Stefano, L., Vincze, M.: A global hypotheses verification method for 3D object recognition. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 511–524. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33712-3_37
Aldoma, A., Tombari, F., Prankl, J., Richtsfeld, A., Di Stefano, L., Vincze, M.: Multimodal cue integration through hypotheses verification for RGB-D object recognition and 6DoF pose estimation. In: 2013 IEEE International Conference on Robotics and Automation (ICRA), pp. 2104–2111. IEEE (2013)
Lutz, M., Stampfer, D., Schlegel, C.: Probabilistic object recognition and pose estimation by fusing multiple algorithms. In: 2013 IEEE International Conference on Robotics and Automation (ICRA), pp. 4244–4249. IEEE (2013)
Tombari, F., Salti, S., Stefano, L.: Unique signatures of histograms for local surface description. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6313, pp. 356–369. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15558-1_26
Henry, P., Krainin, M., Herbst, E., Ren, X., Fox, D.: RGB-D mapping: using kinect-style depth cameras for dense 3D modeling of indoor environments. Int. J. Robot. Res. 31(5), 647–663 (2012)
Herbst, E., Henry, P., Fox, D.: Toward online 3-D object segmentation and mapping. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 3193–3200. IEEE (2014)
Zhao, L., Huang, S., Sun, Y., Yan, L., Dissanayake, G.: Parallaxba: bundle adjustment using parallax angle feature parametrization. Int. J. Robot. Res. 34(4–5), 493–516 (2015)
Richtsfeld, A., Mörwald, T., Prankl, J., Zillich, M., Vincze, M.: Segmentation of unknown objects in indoor environments. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4791–4796. IEEE (2012)
Papon, J., Abramov, A., Schoeler, M., Worgotter, F.: Voxel cloud connectivity segmentation-supervoxels for point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2027–2034. IEEE (2013)
Tuytelaars, T., Fritz, M., Saenko, K., Darrell, T.: The NBNN kernel. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 1824–1831. IEEE (2011)
Dogar, M., Srinivasa, S.: A framework for push-grasping in clutter. Robot.: Sci. Syst. VII 1 (2011)
Acknowledgement
This work was supported in part by the Natural Science Foundation of China under Grant U1613218, 61473191 and 61503245, in part by the Science and Technology Commission of Shanghai Municipality under Grant 15111104802, in part by Shanghai Sailing Program under Grant 15YF1406300, in part by State Key Laboratory of Robotics and System (HIT).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zheng, L., Wang, H., Chen, W. (2017). A Fast 3D Object Recognition Pipeline in Cluttered and Occluded Scenes. In: Huang, Y., Wu, H., Liu, H., Yin, Z. (eds) Intelligent Robotics and Applications. ICIRA 2017. Lecture Notes in Computer Science(), vol 10463. Springer, Cham. https://doi.org/10.1007/978-3-319-65292-4_51
Download citation
DOI: https://doi.org/10.1007/978-3-319-65292-4_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65291-7
Online ISBN: 978-3-319-65292-4
eBook Packages: Computer ScienceComputer Science (R0)