poster

Towards Integrating Real-Time Crowd Advice with Reinforcement Learning

Authors: Gabriel V. de la Cruz, Bei Peng, Walter S. Lasecki, Matthew E. Taylor

IUI '15 Companion: Companion Proceedings of the 20th International Conference on Intelligent User Interfaces

Pages 17 - 20

https://doi.org/10.1145/2732158.2732180

Published: 29 March 2015


Abstract

Reinforcement learning is a powerful machine learning paradigm that allows agents to autonomously learn to maximize a scalar reward. However, it often suffers from poor initial performance and long learning times. This paper discusses how collecting on-line human feedback, both in real time and post hoc, can potentially improve the performance of such learning systems. We use the game Pac-Man to simulate a navigation setting and show that workers are able to accurately identify both when a sub-optimal action is executed, and what action should have been performed instead. Demonstrating that the crowd is capable of generating this input, and discussing the types of errors that occur, serves as a critical first step in designing systems that use this real-time feedback to improve systems' learning performance on-the-fly.
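As a rough illustration of how per-step crowd feedback of this kind might be combined, the sketch below aggregates worker votes by simple majority. This is not the authors' method: the function name `aggregate_advice`, the `min_agreement` threshold, and the encoding of a vote as either `None` (action was fine) or a suggested replacement action are all assumptions made for the example.

```python
from collections import Counter
from typing import Optional, Sequence


def aggregate_advice(votes: Sequence[Optional[str]],
                     min_agreement: float = 0.5) -> Optional[str]:
    """Return the crowd's suggested action, or None if there is no clear objection.

    Hypothetical encoding: each vote is None when the worker judged the
    agent's action acceptable, or the name of the action the worker thinks
    should have been taken instead.
    """
    if not votes:
        return None
    # Count only the votes that propose a replacement action.
    suggestions = Counter(v for v in votes if v is not None)
    if not suggestions:
        return None
    action, count = suggestions.most_common(1)[0]
    # Accept the suggestion only if more than min_agreement of ALL workers
    # (including those who did not object) proposed the same action.
    return action if count / len(votes) > min_agreement else None


if __name__ == "__main__":
    print(aggregate_advice(["left", "left", None, "up", "left"]))
    print(aggregate_advice([None, None, "up"]))
```

Requiring agreement among all workers, rather than only among those who objected, is one conservative choice; it keeps a single dissenting worker from overriding the agent.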



Index Terms

Human-centered computing


Published In

March 2015, 164 pages

ISBN: 9781450333085

DOI: 10.1145/2732158

General Chairs: Oliver Brdiczka (Vectra Networks, Inc.), Polo Chau (Georgia Tech)

Program Chairs: Giuseppe Carenini (University of British Columbia), Shimei Pan (University of Maryland), Per Ola Kristensson (University of Cambridge)

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States


Qualifiers

Poster

Funding Sources

National Science Foundation

Conference

IUI'15: 20th International Conference on Intelligent User Interfaces

March 29 - April 1, 2015

Atlanta, Georgia, USA

Acceptance Rates

IUI '15 Companion Paper Acceptance Rate 47 of 205 submissions, 23%

Overall Acceptance Rate 746 of 2,811 submissions, 27%
