An Open-World Novelty Generator for Authoring Reinforcement Learning Environment of Standardized Toolkits

Lee, Sangho; Park, Junbeom; Suk, Ho; Kim, Taewoo; Yadav, Pamul; Kim, Shiho

doi:10.1007/978-3-030-80253-0_3

Sangho Lee^11,12,
Junbeom Park¹²,
Ho Suk¹¹,
Taewoo Kim¹¹,
Pamul Yadav¹² &
…
Shiho Kim¹¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12832))

Included in the following conference series:

International Conference on Multi-disciplinary Trends in Artificial Intelligence

539 Accesses
2 Citations

Abstract

Current research in Reinforcement Learning (RL) is based on closed-world learning environment where the environment remains fixed and unchanged throughout the agent’s training and application session. The fixed environment may be prone to failure when the agents incorporate under novel unseen situations. To overcome the drawback of the existing closed-world model, an Open-world learning model is required which can classify the novelty occurring in an environment in a hierarchical manner. The proposed control suite with open world novelty generator is an attempt to augment the machine learning environment for authoring the novelty in actors, interactions, and environment of standardized Reinforcement learning toolkits such as UnityML, OpenAI Gym, and DeepMind Control Suite in real-time. Such a tool will provide an opportunity to the RL researchers to simulate the Open-world learning model and test their algorithms within the standardized closed-world learning environments of the standardized RL toolkits.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Silver, D., Huang, A., Maddison, C., et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529, 484–489 (2016)
Article Google Scholar
AlphaGo Homepage. https://deepmind.com/research/case-studies/alphago-the-story-so-far
Science of Artificial Intelligence and Learning for Open-world Novelty (SAIL-ON), HR001119S0038
Google Scholar
Chen, Z., Liu, B.: Open-World Learning, 2nd edn. Morgan & Claypool Publishers. Lifelong Machine Learning (2018)
Google Scholar
Fei, G., Wang, S., Liu, B.: Learning cumulatively to become more knowledgeable. In: KDD (2016)
Google Scholar
Brockman, G., et al.: OpenAI Gym. arXiv preprint arXiv:1606.01540 (2016)
Juliani, A., et al.: Unity: a general platform for intelligent agents. arXiv preprint arXiv:1809.02627 (2018)
Tassa, Y., et al.: DeepMind Control Suite. ArXiv e-prints (2018)
Google Scholar
Koenig, N., Howard, A.: Design and use paradigms for gazebo, an open-source multi-robot simulator. In: Intelligent Robots and Systems, (IROS 2004) (2004)
Google Scholar
Webots. Mobile Robot Simulation Software. http://www.cyberbotics.com.Open-source
Alkilabi, M.H.M., Narayan, A., Tuci, E.: Cooperative object transport with a swarm of e-puck robots: robustness and scalability of evolved collective strategies. Swarm Intell. 11(3–4), 185–209 (2017). https://doi.org/10.1007/s11721-017-0135-8
Article Google Scholar
Gan, C., et al.: The ThreeDWorld transport challenge: a visually guided task-and-motion planning benchmark for physically realistic embodied AI. arXiv preprint arXiv:2103.14025 (2021)
Rizzardo, C., Katyara, S., Fernandes, M., Chen, F.: The importance and the limitations of Sim2Real for robotic manipulation in precision agriculture. In: 2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics, RSS (2020)
Google Scholar
Todorov, E., Erez, T., Tassa, Y.: MuJoCo: a physics engine for model-based control. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, Vilamoura 2012, pp. 5026–5033 (2012)
Google Scholar

Download references

Acknowledgments

This work was support by the Ministry of Science and ICT (MSIT), Korea, by (No. 2020–0-00056, to create AI systems that act appropriately and effectively in novel situations that occur in open worlds) supervised by IITP.

Author information

Authors and Affiliations

School of Integrated Technology, Yonsei University, Incheon, 21983, South Korea
Sangho Lee, Ho Suk, Taewoo Kim & Shiho Kim
Department of Research, GREW Creative Lab. Inc., Incheon, 21983, South Korea
Sangho Lee, Junbeom Park & Pamul Yadav

Authors

Sangho Lee
View author publications
You can also search for this author in PubMed Google Scholar
Junbeom Park
View author publications
You can also search for this author in PubMed Google Scholar
Ho Suk
View author publications
You can also search for this author in PubMed Google Scholar
Taewoo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Pamul Yadav
View author publications
You can also search for this author in PubMed Google Scholar
Shiho Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shiho Kim .

Editor information

Editors and Affiliations

Mahasarakham University, Maha Sarakham, Thailand
Phatthanaphong Chomphuwiset
Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Junmo Kim
Mahasarakham University, Maha Sarakham, Thailand
Pornntiwa Pawara

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lee, S., Park, J., Suk, H., Kim, T., Yadav, P., Kim, S. (2021). An Open-World Novelty Generator for Authoring Reinforcement Learning Environment of Standardized Toolkits. In: Chomphuwiset, P., Kim, J., Pawara, P. (eds) Multi-disciplinary Trends in Artificial Intelligence. MIWAI 2021. Lecture Notes in Computer Science(), vol 12832. Springer, Cham. https://doi.org/10.1007/978-3-030-80253-0_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-80253-0_3
Published: 27 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-80252-3
Online ISBN: 978-3-030-80253-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics