skip to main content
10.1145/3366622.3368146acmconferencesArticle/Chapter ViewAbstractPublication PagesmiddlewareConference Proceedingsconference-collections
research-article

A sim2real framework enabling decentralized agents to execute MADDPG tasks

Authors Info & Claims
Published:09 December 2019Publication History

ABSTRACT

Multi-agent RL is a process of training the agents to collaborate with others. We argue that an additional 'reality gap' in the system aspects occurs when applying sim2real to the multi-agent RL, especially when performing the 'transferred' collaborative task in the real-world environment. In this paper, we propose an ADO framework enabling decentralized agents to participate in performing collaborative tasks without suffering from the reality gap. Our contribution is threefold. First, we clearly identify and summarize the reality gaps in the context of the sim2real of multi-agent RL. Second, we propose a new system model to deal with system issues derived from when executing collaborative tasks. Third, we design and implement a software framework to support system issues required in developing and executing collaborative tasks in the real world.

References

  1. Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, and Wojciech Zaremba. 2016. OpenAI Gym. (2016), 1--4. arXiv:1606.01540Google ScholarGoogle Scholar
  2. Yevgen Chebotar, Ankur Handa, Viktor Makoviychuk, Miles Macklin, Jan Issac, Nathan Ratliff, and Dieter Fox. 2018. Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience. (2018). arXiv:1810.05687Google ScholarGoogle Scholar
  3. Carl Hewitt. 2011. Actor Model of Computation. Inconsistency Robustness 2011 (2011), 1--25. arXiv:1008.1459Google ScholarGoogle Scholar
  4. Zhang Wei Hong, Yu Ming Chen, Hsuan Kung Yang, Shih Yang Su, Tzu Yun Shann, Yi Hsiang Chang, Brian Hsi Lin Ho, Chih Chieh Tu, Tsu Ching Hsiao, Hsin Wei Hsiao, Sih Pin Lai, Yueh Chuan Chang, and Chun Yi Lee. 2018. Virtual-to-real: Learning to control in visual semantic segmentation. IJCAI International Joint Conference on Artificial Intelligence 2018-July (2018), 4912--4920. arXiv:arXiv:1802.00285v4Google ScholarGoogle ScholarCross RefCross Ref
  5. Stephen James, Paul Wohlhart, Mrinal Kalakrishnan, Dmitry Kalashnikov, Alex Irpan, Julian Ibarz, Sergey Levine, Raia Hadsell, and Konstantinos Bousmalis. 2018. Sim-to-Real via Sim-to-Sim: Data-efficient Robotic Grasping via Randomized-to-Canonical Adaptation Networks. (2018). arXiv:1812.07252Google ScholarGoogle Scholar
  6. Svetoslav Kolev and Emanuel Todorov. 2015. Physically consistent state estimation and system identification for contacts. IEEE-RAS International Conference on Humanoid Robots 2015-December (2015), 1036--1043.Google ScholarGoogle ScholarCross RefCross Ref
  7. Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. (2015). arXiv:1509.02971Google ScholarGoogle Scholar
  8. Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, and Igor Mordatch. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. Advances in Neural Information Processing Systems 2017-December (2017), 6380--6391. arXiv:arXiv:1706.02275v3Google ScholarGoogle Scholar
  9. OpenAI. 2017. Multi Agent Particle Environment. https://github.com/openai/multiagent-particle-envsGoogle ScholarGoogle Scholar
  10. Xue Bin Peng, Marcin Andrychowicz, Wojciech Zaremba, and Pieter Abbeel. 2018. Sim-to-Real Transfer of Robotic Control with Dynamics Randomization. Proceedings - IEEE International Conference on Robotics and Automation (2018), 3803--3810.Google ScholarGoogle ScholarCross RefCross Ref
  11. Marc Shapiro, Nuno Pregui, Carlos Baquero, Marek Zawirski, Carlos Baquero, Marek Zawirski, and Conflict-free Replicated Data. 2011. Conflict-free Replicated Data Types To cite this version:. (2011).Google ScholarGoogle Scholar
  12. Josh Tobin, Rachel Fong, Alex Ray, Jonas Schneider, Wojciech Zaremba, and Pieter Abbeel. 2017. Domain randomization for transferring deep neural networks from simulation to the real world. IEEE International Conference on Intelligent Robots and Systems 2017-September (2017), 23--30. arXiv:arXiv:1703.06907v1Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A sim2real framework enabling decentralized agents to execute MADDPG tasks

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        DIDL '19: Proceedings of the Workshop on Distributed Infrastructures for Deep Learning
        December 2019
        17 pages
        ISBN:9781450370370
        DOI:10.1145/3366622

        Copyright © 2019 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 9 December 2019

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader