Reward-based online learning in non-stationary environments: Adapting a P300-speller with a “backspace” key

Abstract:

We adapt a policy gradient approach to the problem of reward-based online learning of a non-invasive EEG-based “P300”-speller. We first clarify the nature of the P300-speller classification problem and present a general regularized gradient ascent formula. We then show that when the reward is immediate and binary (namely “bad response” or “good response”), each update is expected to improve the classifier accuracy, whether the actual response is correct or not. We also estimate the robustness of the method to occasional mistaken rewards, i.e., we show that the learning efficacy may decrease only linearly with the rate of invalid rewards. The effectiveness of our approach is tested in a series of simulations reproducing the conditions of real experiments. In a first experiment, we show that a systematic improvement of the spelling rate is obtained for all subjects in the absence of initial calibration. In a second experiment, we consider the online recovery that is expected to follow electrode failures. Combined with a specific failure detection algorithm, the spelling error information (typically contained in a “backspace” hit) is shown to be useful for the policy gradient to adapt the P300 classifier to the new situation, provided the feedback is reliable enough (namely, having a reliability greater than 70%).
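
The abstract does not give the update formula itself, so the following is only a minimal sketch of what a regularized policy-gradient step with an immediate binary reward can look like. It assumes a linear softmax policy over candidate symbols, a reward coded as +1 (“good response”) or -1 (“bad response”), and an L2 penalty. The names `policy`, `update`, `eta`, and `lam` are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def policy(w, features):
    """Selection probability of each candidate symbol (assumed linear softmax policy)."""
    scores = [sum(wi * xi for wi, xi in zip(w, x)) for x in features]
    return softmax(scores)


def update(w, features, action, reward, eta=0.1, lam=0.01):
    """One regularized gradient-ascent step on the expected reward.

    reward: +1 for a "good response", -1 for a "bad response" (assumed coding).
    eta: learning rate; lam: L2 regularization strength (both hypothetical values).
    """
    probs = policy(w, features)
    dim = len(w)
    # Expected feature vector under the current policy.
    mean_x = [sum(p * x[i] for p, x in zip(probs, features)) for i in range(dim)]
    # Gradient of log pi(action) with respect to w for a softmax policy.
    grad = [features[action][i] - mean_x[i] for i in range(dim)]
    # Reward-weighted ascent step with weight decay.
    return [wi + eta * (reward * g - lam * wi) for wi, g in zip(w, grad)]
```

In this sketch a positive reward raises the probability of the symbol that was selected, while a negative reward (for instance, one inferred from a “backspace” hit) lowers it, so both kinds of feedback move the weights.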

Published in: 2015 International Joint Conference on Neural Networks (IJCNN)

Date of Conference: 12-17 July 2015

Date Added to IEEE Xplore: 01 October 2015

DOI: 10.1109/IJCNN.2015.7280686

Conference Location: Killarney, Ireland
