Rxivist logo

Prefrontal cortex as a meta-reinforcement learning system

By Jane X Wang, Zeb Kurth-Nelson, Dharshan Kumaran, Dhruva Tirumala, Hubert Soyer, Joel Z. Leibo, Demis Hassabis, Matthew Botvinick

Posted 06 Apr 2018
bioRxiv DOI: 10.1101/295964 (published DOI: 10.1038/s41593-018-0147-8)

Over the past twenty years, neuroscience research on reward-based learning has converged on a canonical model, under which the neurotransmitter dopamine 'stamps in' associations between situations, actions and rewards by modulating the strength of synaptic connections between neurons. However, a growing number of recent findings have placed this standard model under strain. In the present work, we draw on recent advances in artificial intelligence to introduce a new theory of reward-based learning. Here, the dopamine system trains another part of the brain, the prefrontal cortex, to operate as its own free-standing learning system. This new perspective accommodates the findings that motivated the standard model, but also deals gracefully with a wider range of observations, providing a fresh foundation for future research.

Download data

  • Downloaded 32,300 times
  • Download rankings, all-time:
    • Site-wide: 43 out of 111,157
    • In neuroscience: 4 out of 18,509
  • Year to date:
    • Site-wide: 459 out of 111,157
  • Since beginning of last month:
    • Site-wide: None out of 111,157

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)