skip to main content
10.1145/3415048.3416116acmotherconferencesArticle/Chapter ViewAbstractPublication PagesprisConference Proceedingsconference-collections
research-article

Robotic Manipulation Based on 3D Vision: A Survey

Authors Info & Claims
Published:04 September 2020Publication History

ABSTRACT

Grasping has long been studied in the field of robotics. In this paper, we divide the process of robotic grasp into sensing and control. In terms of sensing, 2D vision based sensing relies on accurate feature matching and object surface texture features, resulting in poor performance in the complex environment with occlusion. By contrast, some sensors based on 3D vision are more robust to noise. Processing point clouds in a deep learning method can achieve high accuracy as well as reducing the computation time compared with those using cost volume regularization. For the control part, the traditional trajectory motion methods are limited to generalization and grasping with high degrees of freedom. On the contrary, the methods of reinforcement learning can improve the grasping strategy in the continuous interaction with the environment. We propose some commonly used benchmarks and simulation platforms for simulation experiment using reinforcement learning.

References

  1. Ashutosh Saxena, Justin Driemeyer, and Andrew Y Ng. Robotic grasping of novel objects using vision. The International Journal of Robotics Research, 27(2):157--173, 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Gary M Bone and Yonghui Du. Multi-metric comparison of optimal 2d grasp planning algorithms. In Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No. 01CH37164), volume 3, pages 3061--3066. IEEE, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  3. Saurabh Gupta, Pablo Arbeláez, Ross Girshick, and Jitendra Malik. Aligning 3d models to rgb-d images of cluttered scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4731--4740, 2015.Google ScholarGoogle Scholar
  4. Andy Zeng, Kuan-Ting Yu, Shuran Song, Daniel Suo, Ed Walker, Alberto Rodriguez, and Jianxiong Xiao. Multi-view self-supervised deep learning for 6d pose estimation in the amazon picking challenge. In 2017 IEEE international conference on robotics and automation (ICRA), pages 1386--1383. IEEE, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Matei Ciocarlie, Kaijen Hsiao, Edward Gil Jones, Sachin Chitta, Radu Bogdan Rusu, and Ioan A S, ucan. Towards reliable grasping and manipulation in household environments. In Experimental Robotics, pages 241--252. Springer, 2014.Google ScholarGoogle Scholar
  6. Stefan Hinterstoisser, Stefan Holzer, Cedric Cagniart, Slobodan Ilic, Kurt Konolige, Nassir Navab, and Vincent Lepetit. Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In 2011 international conference on computer vision, pages 858--865. IEEE, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. David Watkins-Valls, Jacob Varley, and Peter Allen. Multi-modal geometric learning for grasping and manipulation. In 2019 International Conference on Robotics and Automation (ICRA), pages 7339--7345. IEEE, 2019.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J Kenneth Salisbury and B Roth. Kinematic and force analysis of articulated mechanical hands. 1983.Google ScholarGoogle Scholar
  9. Van-Duc Nguyen. Constructing force-closure grasps. The International Journal of Robotics Research, 7(3):3--16, 1988.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Jean Ponce, Darrell Stam, and Bernard Faverjon. On computing two-finger force-closure grasps of curved 2d objects. The International Journal of Robotics Research, 12(3):263--273, 1993.Google ScholarGoogle ScholarCross RefCross Ref
  11. Jean Ponce and Bernard Faverjon. On computing three-finger force-closure grasps of polygonal objects. IEEE Transactions on robotics and automation, 11(6):868--881, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  12. Jean Ponce, Steve Sullivan, Attawith Sudsang, Jean-Daniel Boissonnat, and Jean-Pierre Merlet. On computing four-finger equilibrium and force-closure grasps of polyhedral objects. The International Journal of Robotics Research, 16(1):11--35, 1997.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Máximo A Roa and Raúl Suárez. Computation of independent contact regions for grasping 3-d objects. IEEE Transactions on Robotics, 25(4):839--850, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Alberto Rodriguez, Matthew T Mason, and Steve Ferry. From caging to grasping. The International Journal of Robotics Research, 31(7):886--900, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jing Xu, Zhimin Hou, Zhi Liu, and Hong Qiao. Compare contact model-based control and contact model-free learning: A survey of robotic peg-in-hole assembly strategies. arXiv preprint arXiv: 1904.05240, 2019.Google ScholarGoogle Scholar
  16. Jing Xu, Zhimin Hou, Wei Wang, Bohao Xu, Kuangen Zhang, and Ken Chen. Feedback deep deterministic policy gradient with fuzzy reward for robotic multiple peg-in-hole assembly tasks. IEEE Transactions on Industrial Informatics, 15(3):1658--1667, 2018.Google ScholarGoogle ScholarCross RefCross Ref
  17. Jens Kober and Jan R Peters. Policy search for motor primitives in robotics. In Advances in neural information processing systems, pages 849--856, 2009.Google ScholarGoogle Scholar
  18. Petar Kormushev, Sylvain Calinon, and Darwin G Caldwell. Robot motor skill coordination with em-based reinforcement learning. In 2010 IEEE/RSJ international conference on intelligent robots and systems, pages 3232--3237. IEEE, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  19. Freek Stulp, Evangelos Theodorou, Jonas Buchli, and Stefan Schaal. Learning to grasp under uncertainty. In 2011 IEEE International Conference on Robotics and Automation, pages 5703--5708. IEEE, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  20. Ian Lenz, Honglak Lee, and Ashutosh Saxena. Deep learning for detecting robotic grasps. The International Journal of Robotics Research, 34(4--5):705--724, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Mrinal Kalakrishnan, Ludovic Righetti, Peter Pastor, and Stefan Schaal. Learning force control policies for compliant manipulation. In 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 4639--4644. IEEE, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  22. Sergey Levine, Peter Pastor, Alex Krizhevsky, and Deirdre Quillen. Learning hand-eye coordination for robotic grasping with large-scale data collection. In International symposium on experimental robotics, pages 173--184. Springer, 2016.Google ScholarGoogle Scholar
  23. Deirdre Quillen, Eric Jang, Ofir Nachum, Chelsea Finn, Julian Ibarz, and Sergey Levine. Deep reinforcement learning for vision-based robotic grasping: A simulated comparative evaluation of off-policy methods. In 2018 IEEE International Conference on Robotics and Automation (ICRA), pages 6284--6291. IEEE, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Vincent Lepetit, Francesc Moreno-Noguer, and Pascal Fua. Epnp: An accurate o (n) solution to the pnp problem. International journal of computer vision, 81(2):155, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Daniel F Dementhon and Larry S Davis. Model-based object pose in 25 lines of code. International journal of computer vision, 15(1--2):123--141, 1995.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Donald W Marquardt. An algorithm for least-squares estimation of nonlinear parameters. Journal of the society for Industrial and Applied Mathematics, 11(2):431--441, 1963.Google ScholarGoogle Scholar
  27. Eric Brachmann, Alexander Krull, Frank Michel, Stefan Gumhold, Jamie Shotton, and Carsten Rother. Learning 6d object pose estimation using 3d object coordinates. In European conference on computer vision, pages 536--551. Springer, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  28. Radu Bogdan Rusu, Gary Bradski, Romain Thibaux, and John Hsu. Fast 3d recognition and pose using the viewpoint feature histogram. In 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 2155--2162. IEEE, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  29. Paul J Besl and Neil D McKay. Method for registration of 3-d shapes. In Sensor fusion IV: control paradigms and data structures, volume 1611, pages 586--606. International Society for Optics and Photonics, 1992.Google ScholarGoogle ScholarCross RefCross Ref
  30. Martin A Fischler and Robert C Bolles. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381--395, 1981.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Rui Chen, Songfang Han, Jing Xu, and Hao Su. Point-based multi-view stereo network. In Proceedings of the IEEE International Conference on Computer Vision, pages 1538--1547, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  32. Anis Sahbani, Sahar El-Khoury, and Philippe Bidaud. An overview of 3d object grasp synthesis algorithms. Robotics and Autonomous Systems, 60(3):326--336, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Andreas ten Pas, Marcus Gualtieri, Kate Saenko, and Robert Platt. Grasp pose detection in point clouds. The International Journal of Robotics Research, 36(13--14):1455--1473, 2017.Google ScholarGoogle Scholar
  34. Hongzhuo Liang, Xiaojian Ma, Shuang Li, Michael Görner, Song Tang, Bin Fang, Fuchun Sun, and Jianwei Zhang. Pointnetgpd: Detecting grasp configurations from point sets. In 2019 International Conference on Robotics and Automation (ICRA), pages 3629--3635. IEEE, 2019.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Yuzhe Qin, Rui Chen, Hao Zhu, Meng Song, Jing Xu, and Hao Su. S4g: Amodal single-view single-shot se (3) grasp detection in cluttered scenes. arXiv preprint arXiv:1910.14218, 2019.Google ScholarGoogle Scholar
  36. Stefan Schaal. Dynamic movement primitives-a framework for motor control in humans and humanoid robotics. In Adaptive motion of animals and machines, pages 261--280. Springer, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  37. Dmitry Kalashnikov, Alex Irpan, Peter Pastor, Julian Ibarz, Alexander Herzog, Eric Jang, Deirdre Quillen, Ethan Holly, Mrinal Kalakrishnan, Vincent Vanhoucke, et al. Qt-opt: Scalable deep reinforcement learning for vision-based robotic manipulation. arXiv preprint arXiv:1806.10293, 2018.Google ScholarGoogle Scholar
  38. Andy Zeng, Shuran Song, Johnny Lee, Alberto Rodriguez, and Thomas Funkhouser. Tossingbot: Learning to throw arbitrary objects with residual physics. arXiv preprint arXiv:1903.11239, 2019.Google ScholarGoogle Scholar
  39. Jan Matas, Stephen James, and Andrew J Davison. Sim-to-real reinforcement learning for deformable object manipulation. arXiv preprint arXiv:1806.07851, 2018.Google ScholarGoogle Scholar

Index Terms

  1. Robotic Manipulation Based on 3D Vision: A Survey

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      PRIS '20: Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems
      July 2020
      136 pages
      ISBN:9781450387699
      DOI:10.1145/3415048

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 September 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)42
      • Downloads (Last 6 weeks)2

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader