Procedural Memory Augmented Deep Reinforcement Learning

Impact Statement:

Reinforcement learning is critical for the design of next-generation machine learning algorithms because it reduces labeling requirements. However, current methods still require a considerable number of interactions with the environment, and a learned network does not generalize to other environments because of catastrophic forgetting, which limits their practicality. This paper proposes a different approach to stochastic search, inspired by cognitive science. The goal is to deconstruct the world into objects, store them in external memory, and learn object properties when they interact with the agent. Preliminary results show that this approach decreases the number of interactions with the environment while maintaining performance and improving generalizability to other environments.

Abstract:

Inspired by the human brain, we propose an external memory-augmented decision-making architecture for video processing. A self-organizing object detector is employed as a frontend to deconstruct the environment by extracting events from the flow of time and detecting objects within the frames. An extra working memory, where objects are temporarily stored, lets the system extract the task-related properties of the stored objects. We propose a deep reinforcement learning (RL) neural network to learn affordances, i.e., sequences of actions to manipulate these objects. The RL network and the object detector are trained alternately. After both are trained, the objects and their affordances are transferred to an external memory, where they are reused whenever the same objects are detected in input frames. Here, we use a combination of a dictionary and a linked list for the external memory that can be accessed by either content or temporal orde...
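
The abstract describes the external memory only as a combination of a dictionary and a linked list. The Python sketch below is one way such a structure could be arranged, not the authors' implementation: a dictionary gives lookup by content, and a doubly linked list preserves the order in which objects were stored. The names `ExternalMemory`, `MemoryNode`, `store`, `recall`, and `in_temporal_order`, the string action labels, and the choice to move a re-stored object to the end of the list are all assumptions made for illustration.

```python
from typing import Dict, Hashable, Iterator, List, Optional, Tuple


class MemoryNode:
    """One stored object and its affordance (hypothetical layout)."""

    def __init__(self, key: Hashable, affordance: List[str]) -> None:
        self.key = key                      # content key, e.g. an object label
        self.affordance = affordance        # learned sequence of actions
        self.prev: Optional["MemoryNode"] = None
        self.next: Optional["MemoryNode"] = None


class ExternalMemory:
    """Dictionary for content access, linked list for temporal access."""

    def __init__(self) -> None:
        self._index: Dict[Hashable, MemoryNode] = {}
        self._head: Optional[MemoryNode] = None   # oldest entry
        self._tail: Optional[MemoryNode] = None   # newest entry

    def store(self, key: Hashable, affordance: List[str]) -> None:
        # Assumption: re-storing a known object replaces its entry and
        # moves it to the most recent position.
        if key in self._index:
            self._unlink(self._index[key])
        node = MemoryNode(key, list(affordance))
        node.prev = self._tail
        if self._tail is None:
            self._head = node
        else:
            self._tail.next = node
        self._tail = node
        self._index[key] = node

    def recall(self, key: Hashable) -> Optional[List[str]]:
        """Content-based access: O(1) dictionary lookup."""
        node = self._index.get(key)
        return None if node is None else node.affordance

    def in_temporal_order(self) -> Iterator[Tuple[Hashable, List[str]]]:
        """Temporal access: walk the list from oldest to newest."""
        node = self._head
        while node is not None:
            yield node.key, node.affordance
            node = node.next

    def _unlink(self, node: MemoryNode) -> None:
        # Detach a node from the list, repairing head and tail as needed.
        if node.prev is None:
            self._head = node.next
        else:
            node.prev.next = node.next
        if node.next is None:
            self._tail = node.prev
        else:
            node.next.prev = node.prev
        node.prev = node.next = None
```

In this sketch, an agent that detects a known object would call `recall` with the object's key to retrieve its stored action sequence, while `in_temporal_order` replays the stored objects in the order they were last written.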

Published in: IEEE Transactions on Artificial Intelligence (Volume: 1, Issue: 2, October 2020)

Page(s): 105 - 120

Date of Publication: 02 February 2021

Electronic ISSN: 2691-4581

DOI: 10.1109/TAI.2021.3054722
