research-article

Robotic Manipulation Based on 3D Vision: A Survey

Author:
Huahua Lin

College of Information Engineering, China Jiliang University, Hangzhou, China

College of Information Engineering, China Jiliang University, Hangzhou, China
View Profile

PRIS '20: Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent SystemsJuly 2020Article No.: 12Pages 1–5https://doi.org/10.1145/3415048.3416116

Published:04 September 2020Publication History

PRIS '20: Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems

Pages 1–5

ABSTRACT

Grasping has long been studied in the field of robotics. In this paper, we divide the process of robotic grasp into sensing and control. In terms of sensing, 2D vision based sensing relies on accurate feature matching and object surface texture features, resulting in poor performance in the complex environment with occlusion. By contrast, some sensors based on 3D vision are more robust to noise. Processing point clouds in a deep learning method can achieve high accuracy as well as reducing the computation time compared with those using cost volume regularization. For the control part, the traditional trajectory motion methods are limited to generalization and grasping with high degrees of freedom. On the contrary, the methods of reinforcement learning can improve the grasping strategy in the continuous interaction with the environment. We propose some commonly used benchmarks and simulation platforms for simulation experiment using reinforcement learning.

References

Ashutosh Saxena, Justin Driemeyer, and Andrew Y Ng. Robotic grasping of novel objects using vision. The International Journal of Robotics Research, 27(2):157--173, 2008.Google ScholarDigital Library
Gary M Bone and Yonghui Du. Multi-metric comparison of optimal 2d grasp planning algorithms. In Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No. 01CH37164), volume 3, pages 3061--3066. IEEE, 2001.Google ScholarCross Ref
Saurabh Gupta, Pablo Arbeláez, Ross Girshick, and Jitendra Malik. Aligning 3d models to rgb-d images of cluttered scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4731--4740, 2015.Google Scholar
Andy Zeng, Kuan-Ting Yu, Shuran Song, Daniel Suo, Ed Walker, Alberto Rodriguez, and Jianxiong Xiao. Multi-view self-supervised deep learning for 6d pose estimation in the amazon picking challenge. In 2017 IEEE international conference on robotics and automation (ICRA), pages 1386--1383. IEEE, 2017.Google ScholarDigital Library
Matei Ciocarlie, Kaijen Hsiao, Edward Gil Jones, Sachin Chitta, Radu Bogdan Rusu, and Ioan A S, ucan. Towards reliable grasping and manipulation in household environments. In Experimental Robotics, pages 241--252. Springer, 2014.Google Scholar
Stefan Hinterstoisser, Stefan Holzer, Cedric Cagniart, Slobodan Ilic, Kurt Konolige, Nassir Navab, and Vincent Lepetit. Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In 2011 international conference on computer vision, pages 858--865. IEEE, 2011.Google ScholarDigital Library
David Watkins-Valls, Jacob Varley, and Peter Allen. Multi-modal geometric learning for grasping and manipulation. In 2019 International Conference on Robotics and Automation (ICRA), pages 7339--7345. IEEE, 2019.Google ScholarDigital Library
J Kenneth Salisbury and B Roth. Kinematic and force analysis of articulated mechanical hands. 1983.Google Scholar
Van-Duc Nguyen. Constructing force-closure grasps. The International Journal of Robotics Research, 7(3):3--16, 1988.Google ScholarDigital Library
Jean Ponce, Darrell Stam, and Bernard Faverjon. On computing two-finger force-closure grasps of curved 2d objects. The International Journal of Robotics Research, 12(3):263--273, 1993.Google ScholarCross Ref
Jean Ponce and Bernard Faverjon. On computing three-finger force-closure grasps of polygonal objects. IEEE Transactions on robotics and automation, 11(6):868--881, 1995.Google ScholarCross Ref
Jean Ponce, Steve Sullivan, Attawith Sudsang, Jean-Daniel Boissonnat, and Jean-Pierre Merlet. On computing four-finger equilibrium and force-closure grasps of polyhedral objects. The International Journal of Robotics Research, 16(1):11--35, 1997.Google ScholarDigital Library
Máximo A Roa and Raúl Suárez. Computation of independent contact regions for grasping 3-d objects. IEEE Transactions on Robotics, 25(4):839--850, 2009.Google ScholarDigital Library
Alberto Rodriguez, Matthew T Mason, and Steve Ferry. From caging to grasping. The International Journal of Robotics Research, 31(7):886--900, 2012.Google ScholarDigital Library
Jing Xu, Zhimin Hou, Zhi Liu, and Hong Qiao. Compare contact model-based control and contact model-free learning: A survey of robotic peg-in-hole assembly strategies. arXiv preprint arXiv: 1904.05240, 2019.Google Scholar
Jing Xu, Zhimin Hou, Wei Wang, Bohao Xu, Kuangen Zhang, and Ken Chen. Feedback deep deterministic policy gradient with fuzzy reward for robotic multiple peg-in-hole assembly tasks. IEEE Transactions on Industrial Informatics, 15(3):1658--1667, 2018.Google ScholarCross Ref
Jens Kober and Jan R Peters. Policy search for motor primitives in robotics. In Advances in neural information processing systems, pages 849--856, 2009.Google Scholar
Petar Kormushev, Sylvain Calinon, and Darwin G Caldwell. Robot motor skill coordination with em-based reinforcement learning. In 2010 IEEE/RSJ international conference on intelligent robots and systems, pages 3232--3237. IEEE, 2010.Google ScholarCross Ref
Freek Stulp, Evangelos Theodorou, Jonas Buchli, and Stefan Schaal. Learning to grasp under uncertainty. In 2011 IEEE International Conference on Robotics and Automation, pages 5703--5708. IEEE, 2011.Google ScholarCross Ref
Ian Lenz, Honglak Lee, and Ashutosh Saxena. Deep learning for detecting robotic grasps. The International Journal of Robotics Research, 34(4--5):705--724, 2015.Google ScholarDigital Library
Mrinal Kalakrishnan, Ludovic Righetti, Peter Pastor, and Stefan Schaal. Learning force control policies for compliant manipulation. In 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 4639--4644. IEEE, 2011.Google ScholarCross Ref
Sergey Levine, Peter Pastor, Alex Krizhevsky, and Deirdre Quillen. Learning hand-eye coordination for robotic grasping with large-scale data collection. In International symposium on experimental robotics, pages 173--184. Springer, 2016.Google Scholar
Deirdre Quillen, Eric Jang, Ofir Nachum, Chelsea Finn, Julian Ibarz, and Sergey Levine. Deep reinforcement learning for vision-based robotic grasping: A simulated comparative evaluation of off-policy methods. In 2018 IEEE International Conference on Robotics and Automation (ICRA), pages 6284--6291. IEEE, 2018.Google ScholarDigital Library
Vincent Lepetit, Francesc Moreno-Noguer, and Pascal Fua. Epnp: An accurate o (n) solution to the pnp problem. International journal of computer vision, 81(2):155, 2009.Google ScholarDigital Library
Daniel F Dementhon and Larry S Davis. Model-based object pose in 25 lines of code. International journal of computer vision, 15(1--2):123--141, 1995.Google ScholarDigital Library
Donald W Marquardt. An algorithm for least-squares estimation of nonlinear parameters. Journal of the society for Industrial and Applied Mathematics, 11(2):431--441, 1963.Google Scholar
Eric Brachmann, Alexander Krull, Frank Michel, Stefan Gumhold, Jamie Shotton, and Carsten Rother. Learning 6d object pose estimation using 3d object coordinates. In European conference on computer vision, pages 536--551. Springer, 2014.Google ScholarCross Ref
Radu Bogdan Rusu, Gary Bradski, Romain Thibaux, and John Hsu. Fast 3d recognition and pose using the viewpoint feature histogram. In 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 2155--2162. IEEE, 2010.Google ScholarCross Ref
Paul J Besl and Neil D McKay. Method for registration of 3-d shapes. In Sensor fusion IV: control paradigms and data structures, volume 1611, pages 586--606. International Society for Optics and Photonics, 1992.Google ScholarCross Ref
Martin A Fischler and Robert C Bolles. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381--395, 1981.Google ScholarDigital Library
Rui Chen, Songfang Han, Jing Xu, and Hao Su. Point-based multi-view stereo network. In Proceedings of the IEEE International Conference on Computer Vision, pages 1538--1547, 2019.Google ScholarCross Ref
Anis Sahbani, Sahar El-Khoury, and Philippe Bidaud. An overview of 3d object grasp synthesis algorithms. Robotics and Autonomous Systems, 60(3):326--336, 2012.Google ScholarDigital Library
Andreas ten Pas, Marcus Gualtieri, Kate Saenko, and Robert Platt. Grasp pose detection in point clouds. The International Journal of Robotics Research, 36(13--14):1455--1473, 2017.Google Scholar
Hongzhuo Liang, Xiaojian Ma, Shuang Li, Michael Görner, Song Tang, Bin Fang, Fuchun Sun, and Jianwei Zhang. Pointnetgpd: Detecting grasp configurations from point sets. In 2019 International Conference on Robotics and Automation (ICRA), pages 3629--3635. IEEE, 2019.Google ScholarDigital Library
Yuzhe Qin, Rui Chen, Hao Zhu, Meng Song, Jing Xu, and Hao Su. S4g: Amodal single-view single-shot se (3) grasp detection in cluttered scenes. arXiv preprint arXiv:1910.14218, 2019.Google Scholar
Stefan Schaal. Dynamic movement primitives-a framework for motor control in humans and humanoid robotics. In Adaptive motion of animals and machines, pages 261--280. Springer, 2006.Google ScholarCross Ref
Dmitry Kalashnikov, Alex Irpan, Peter Pastor, Julian Ibarz, Alexander Herzog, Eric Jang, Deirdre Quillen, Ethan Holly, Mrinal Kalakrishnan, Vincent Vanhoucke, et al. Qt-opt: Scalable deep reinforcement learning for vision-based robotic manipulation. arXiv preprint arXiv:1806.10293, 2018.Google Scholar
Andy Zeng, Shuran Song, Johnny Lee, Alberto Rodriguez, and Thomas Funkhouser. Tossingbot: Learning to throw arbitrary objects with residual physics. arXiv preprint arXiv:1903.11239, 2019.Google Scholar
Jan Matas, Stephen James, and Andrew J Davison. Sim-to-real reinforcement learning for deformable object manipulation. arXiv preprint arXiv:1806.07851, 2018.Google Scholar

Index Terms

Robotic Manipulation Based on 3D Vision: A Survey
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Vision for robotics

Recommendations

Offline reinforcement learning application in robotic manipulation with a COG method case
CCEAI '22: Proceedings of the 6th International Conference on Control Engineering and Artificial Intelligence

Artificial intelligence now has different applications in various industrial fields. Reinforcement learning (RL) is one of the hot topics in the artificial intelligence, also in robotics. It is an important learning method in the field of robotic ...
Read More
A Modified Convergence DDPG Algorithm for Robotic Manipulation
Abstract
Today, robotic arms are widely used in industry. Reinforcement learning algorithms are used frequently for controlling robotic arms in complex environments. One of the customs off-policy model-free actor-critic deep reinforcement learning for ...
Read More
Reinforcement learning for appearance based visual servoing in robotic manipulation
ROCOM'08: Proceedings of the 8th WSEAS International Conference on Robotics, Control and Manufacturing Technology

The objective of this paper is to develop a new appearance based visual servoing method that needs no prior structuring of the environment and also eliminates the correspondence problem associated with conventional visual servoing methods. Detailed ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
PRIS '20: Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems
July 2020
136 pages
ISBN:9781450387699
DOI:10.1145/3415048
Editor:
Wenbing Zhao
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 September 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
deep learning
reinforcement learning
robotic manipulation
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 213
  Total Downloads
- Downloads (Last 12 months)42
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Robotic Manipulation Based on 3D Vision: A Survey

PRIS '20: Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Offline reinforcement learning application in robotic manipulation with a COG method case

A Modified Convergence DDPG Algorithm for Robotic Manipulation

Reinforcement learning for appearance based visual servoing in robotic manipulation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Robotic Manipulation Based on 3D Vision: A Survey

PRIS '20: Proceedings of the 2020 International Conference on Pattern Recognition and Intelligent Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Offline reinforcement learning application in robotic manipulation with a COG method case

A Modified Convergence DDPG Algorithm for Robotic Manipulation

Reinforcement learning for appearance based visual servoing in robotic manipulation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media