DOI: 10.1145/3568294.3579971
Extended Abstract

Aligning Robot Behaviors with Human Intents by Exposing Learned Behaviors and Resolving Misspecifications

Published: 13 March 2023

Abstract

Human-robot interaction is limited in large part by the challenge of writing correct specifications for robots. The research community seeks alignment between humans' goals and robot behaviors, but this alignment is difficult to achieve. My research tackles this problem. I view alignment as the consequence of iterative design and ample testing, and I design methods in service of these processes. I first study how humans currently write reward functions, profiling the typical errors they make when doing so. I then study how humans can inspect the behaviors robots learn from a given specification. A typical approach requires unstructured or hand-designed test cases; I instead introduce a Bayesian inference method for finding behavior examples that cover information-rich test cases. Alongside finding these behavior examples, I study how they should be presented to the human by applying cognitive theories of human concept learning. For the remainder of my thesis, I am pursuing two open questions. The first concerns how these components can be combined so that humans can iteratively design better behavioral specifications. The second concerns how robots can better interpret humans' erroneous specifications and infer their true intent in spite of the errors.
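As a rough illustration of the kind of Bayesian search for behavior examples described above (a minimal sketch only, not the author's implementation), the snippet below runs a Metropolis-Hastings sampler over a toy input space to find examples on which a small classifier is maximally ambiguous. The toy model, proposal scale, and kernel bandwidth are all illustrative assumptions, not details from the paper.

import numpy as np

rng = np.random.default_rng(0)

def model_confidence(x):
    """Toy binary classifier: probability of class 1 for 2-D input(s) x."""
    w, b = np.array([1.5, -2.0]), 0.3
    return 1.0 / (1.0 + np.exp(-(x @ w + b)))

def behavior_likelihood(x, target=0.5, bandwidth=0.05):
    """Score how closely the model's confidence on x matches the target behavior
    (target=0.5 means 'maximally ambiguous', i.e. near the decision boundary)."""
    return np.exp(-0.5 * ((model_confidence(x) - target) / bandwidth) ** 2)

def sample_behavior_examples(n_steps=5000, proposal_scale=0.3):
    """Metropolis-Hastings over the input space; accepted samples concentrate on
    inputs that elicit the target behavior from the model."""
    x = rng.normal(size=2)
    samples = []
    for _ in range(n_steps):
        x_proposed = x + rng.normal(scale=proposal_scale, size=2)
        accept_prob = behavior_likelihood(x_proposed) / max(behavior_likelihood(x), 1e-12)
        if rng.random() < accept_prob:
            x = x_proposed
        samples.append(x.copy())
    return np.array(samples)

examples = sample_behavior_examples()
# The tail of the chain should hover near confidence 0.5 (the ambiguous region),
# yielding a pool of information-rich test cases for a human to inspect.
print("mean confidence over last 1000 samples:",
      model_confidence(examples[-1000:]).mean())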


Published In

HRI '23: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction
March 2023, 612 pages
ISBN: 9781450399708
DOI: 10.1145/3568294

Publisher

Association for Computing Machinery, New York, NY, United States

      Author Tags

      1. explainable ai
      2. human-robot interaction
      3. reward design

      Qualifiers

      • Extended-abstract

      Funding Sources

      • NSF

      Conference

      HRI '23

      Acceptance Rates

      Overall Acceptance Rate 268 of 1,124 submissions, 24%
