Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 70,836 bioRxiv papers from 309,131 authors.

Prefrontal cortex as a meta-reinforcement learning system

By Jane X Wang, Zeb Kurth-Nelson, Dharshan Kumaran, Dhruva Tirumala, Hubert Soyer, Joel Z. Leibo, Demis Hassabis, Matthew Botvinick

Posted 06 Apr 2018
bioRxiv DOI: 10.1101/295964 (published DOI: 10.1038/s41593-018-0147-8)

Over the past twenty years, neuroscience research on reward-based learning has converged on a canonical model, under which the neurotransmitter dopamine 'stamps in' associations between situations, actions and rewards by modulating the strength of synaptic connections between neurons. However, a growing number of recent findings have placed this standard model under strain. In the present work, we draw on recent advances in artificial intelligence to introduce a new theory of reward-based learning. Here, the dopamine system trains another part of the brain, the prefrontal cortex, to operate as its own free-standing learning system. This new perspective accommodates the findings that motivated the standard model, but also deals gracefully with a wider range of observations, providing a fresh foundation for future research.

Download data

  • Downloaded 29,054 times
  • Download rankings, all-time:
    • Site-wide: 16 out of 70,836
    • In neuroscience: 4 out of 12,721
  • Year to date:
    • Site-wide: 177 out of 70,836
  • Since beginning of last month:
    • Site-wide: 217 out of 70,836

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)