Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models

Mohamed Jabri; Mohamed Jabri; Mohamed Jabri; Mohamed Jabri; Panagiotis Papadakis; Panagiotis Papadakis; Ehsan Abbasnejad; Ehsan Abbasnejad; Ehsan Abbasnejad; Gilles Coppin; Gilles Coppin; Javen Shi; Javen Shi; Javen Shi

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models

Topics: Adaptive Control Systems using Machine Learning; Data-driven Control ; Deep Learning for Autonomous Control; Learning from Demonstration; Reinforcement Learning in Control and Robotics

In Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics - Volume 2: LICARSA, 329-337, 2023 , Rome, Italy

Authors: Mohamed Jabri ^{1

;

2

;

3

;

4} ; Panagiotis Papadakis ^{1

;

4} ; Ehsan Abbasnejad ^{2

;

3

;

4} ; Gilles Coppin ^{1

;

4} and Javen Shi ^{2

;

3

;

4}

Affiliations: ¹ IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238 Brest, France ; ² The University of Adelaide, Adelaide, Australia ; ³ Australian Institute for Machine Learning, Adelaide, Australia ; ⁴ IRL CROSSING, CNRS, Adelaide, Australia

Keyword(s): Imitation Learning, Causality, Structural Causal Models (SCMs), Counterfactual Reasoning.

Abstract: Imitation learning has emerged as a pragmatic alternative to reinforcement learning for teaching agents to execute specific tasks, mitigating the complexity associated with reward engineering. However, the deployment of imitation learning in real-world scenarios is hampered by numerous challenges. Often, the scarcity and expense of demonstration data hinder the effectiveness of imitation learning algorithms. In this paper, we present a novel approach to enhance the sample efficiency of goal-conditioned imitation learning. Leveraging the principles of causality, we harness structural causal models as a formalism to generate counterfactual data. These counterfactual instances are used as additional training data, effectively improving the learning process. By incorporating causal insights, our method demonstrates its ability to improve imitation learning efficiency by capitalizing on generated counterfactual data. Through experiments on simulated robotic manipulation tasks, such as pus hing, moving, and sliding objects, we showcase how our approach allows for the learning of better reward functions resulting in improved performance with a limited number of demonstrations, paving the way for a more practical and effective implementation of imitation learning in real-world scenarios. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.142.198.129

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Jabri, M.; Papadakis, P.; Abbasnejad, E.; Coppin, G. and Shi, J. (2023). Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models. In Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics - Volume 2: LICARSA; ISBN 978-989-758-670-5; ISSN 2184-2809, SciTePress, pages 329-337. DOI: 10.5220/0012268200003543

@conference{licarsa23,
author={Mohamed Jabri. and Panagiotis Papadakis. and Ehsan Abbasnejad. and Gilles Coppin. and Javen Shi.},
title={Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models},
booktitle={Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics - Volume 2: LICARSA},
year={2023},
pages={329-337},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012268200003543},
isbn={978-989-758-670-5},
issn={2184-2809},
}

TY - CONF

JO - Proceedings of the 20th International Conference on Informatics in Control, Automation and Robotics - Volume 2: LICARSA
TI - Improving Reward Estimation in Goal-Conditioned Imitation Learning with Counterfactual Data and Structural Causal Models
SN - 978-989-758-670-5
IS - 2184-2809
AU - Jabri, M.
AU - Papadakis, P.
AU - Abbasnejad, E.
AU - Coppin, G.
AU - Shi, J.
PY - 2023
SP - 329
EP - 337
DO - 10.5220/0012268200003543
PB - SciTePress