loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Mohamed Jabri 1 ; 2 ; 3 ; 4 ; Panagiotis Papadakis 1 ; 4 ; Ehsan Abbasnejad 2 ; 3 ; 4 ; Gilles Coppin 1 ; 4 and Javen Shi 2 ; 3 ; 4

Affiliations: 1 IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238 Brest, France ; 2 The University of Adelaide, Adelaide, Australia ; 3 Australian Institute for Machine Learning, Adelaide, Australia ; 4 IRL CROSSING, CNRS, Adelaide, Australia

Keyword(s): Imitation Learning, Causality, Structural Causal Models (SCMs), Counterfactual Reasoning.

Abstract: Imitation learning has emerged as a pragmatic alternative to reinforcement learning for teaching agents to execute specific tasks, mitigating the complexity associated with reward engineering. However, the deployment of imitation learning in real-world scenarios is hampered by numerous challenges. Often, the scarcity and expense of demonstration data hinder the effectiveness of imitation learning algorithms. In this paper, we present a novel approach to enhance the sample efficiency of goal-conditioned imitation learning. Leveraging the principles of causality, we harness structural causal models as a formalism to generate counterfactual data. These counterfactual instances are used as additional training data, effectively improving the learning process. By incorporating causal insights, our method demonstrates its ability to improve imitation learning efficiency by capitalizing on generated counterfactual data. Through experiments on simulated robotic manipulation tasks, such as pus hing, moving, and sliding objects, we showcase how our approach allows for the learning of better reward functions resulting in improved performance with a limited number of demonstrations, paving the way for a more practical and effective implementation of imitation learning in real-world scenarios. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.142.198.129

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Jabri, M.; Papadakis, P.; Abbasnejad, E.; Coppin, G. and Shi, J. (2023). Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models. In Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics - Volume 2: LICARSA; ISBN 978-989-758-670-5; ISSN 2184-2809, SciTePress, pages 329-337. DOI: 10.5220/0012268200003543

@conference{licarsa23,
author={Mohamed Jabri. and Panagiotis Papadakis. and Ehsan Abbasnejad. and Gilles Coppin. and Javen Shi.},
title={Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models},
booktitle={Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics - Volume 2: LICARSA},
year={2023},
pages={329-337},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012268200003543},
isbn={978-989-758-670-5},
issn={2184-2809},
}

TY - CONF

JO - Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics - Volume 2: LICARSA
TI - Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models
SN - 978-989-758-670-5
IS - 2184-2809
AU - Jabri, M.
AU - Papadakis, P.
AU - Abbasnejad, E.
AU - Coppin, G.
AU - Shi, J.
PY - 2023
SP - 329
EP - 337
DO - 10.5220/0012268200003543
PB - SciTePress