
100 words on reward prediction error – 100 words

Published online by Cambridge University Press:  23 December 2019

Copyright © The Author 2019 

Reward prediction error is like Marmite – you either love it or hate it. I hate it because it commits to a view of the brain that inherits from 20th-century behaviourism and reinforcement learning. When people say dopamine encodes reward prediction error, they are assuming that the brain is in the game of maximising reward. But it is not – the brain updates its beliefs and selects a preferred course of action. On this (planning as active inference) view, the available evidence suggests that dopamine encodes the precision of beliefs about policies – or, more simply, the confidence afforded (subpersonal) plans of action.
