Paper: | PS-2B.3 |
Session: | Poster Session 2B |
Location: | Symphony/Overture |
Session Time: | Friday, September 7, 19:30 - 21:30 |
Presentation Time: | Friday, September 7, 19:30 - 21:30 |
Presentation: |
Poster
|
Publication: |
2018 Conference on Cognitive Computational Neuroscience, 5-8 September 2018, Philadelphia, Pennsylvania |
Paper Title: |
Combining heuristics with counterfactual play in reinforcement learning. |
Manuscript: |
Click here to view manuscript |
DOI: |
https://doi.org/10.32470/CCN.2018.1151-0 |
Authors: |
Erik Peterson, Necati Müyesser, Kyle Dunovan, Tim Verstynen, Carnegie Mellon University, United States |
Abstract: |
Deep reinforcement learning can sometimes match and exceed human performance, but if even minor changes are introduced artificial networks can't adapt what they've learned to new situations. Two reasons why people are so eminently adaptable is their use of heuristics, and their ability to imagine new environments and learn from them, a kind of counterfactual reasoning. We've developed a model of hierarchical reinforcement learning which includes both these elements. Using a board game with a known optimal strategy--Wythoff’s game--we show that this ``stumbler-strategist'' network promotes generalizability and robustness to new environments and rule changes, while also improving post-training interpretability of learning outcomes. |