Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 62,736 bioRxiv papers from 278,376 authors.

Prefrontal cortex as a meta-reinforcement learning system

By Jane X Wang, Zeb Kurth-Nelson, Dharshan Kumaran, Dhruva Tirumala, Hubert Soyer, Joel Z Leibo, Demis Hassabis, Matthew Botvinick

Posted 06 Apr 2018
bioRxiv DOI: 10.1101/295964 (published DOI: 10.1038/s41593-018-0147-8)

Over the past twenty years, neuroscience research on reward-based learning has converged on a canonical model, under which the neurotransmitter dopamine 'stamps in' associations between situations, actions and rewards by modulating the strength of synaptic connections between neurons. However, a growing number of recent findings have placed this standard model under strain. In the present work, we draw on recent advances in artificial intelligence to introduce a new theory of reward-based learning. Here, the dopamine system trains another part of the brain, the prefrontal cortex, to operate as its own free-standing learning system. This new perspective accommodates the findings that motivated the standard model, but also deals gracefully with a wider range of observations, providing a fresh foundation for future research.

Download data

  • Downloaded 27,701 times
  • Download rankings, all-time:
    • Site-wide: 16 out of 62,736
    • In neuroscience: 4 out of 11,207
  • Year to date:
    • Site-wide: 29 out of 62,736
  • Since the beginning of last month:
    • Site-wide: 6 out of 62,736
