Hostname: page-component-745bb68f8f-lrblm Total loading time: 0 Render date: 2025-01-25T19:51:40.612Z Has data issue: false hasContentIssue false

The selective serotonin reuptake inhibitor sertraline alters learning from aversive reinforcements in patients with depression: evidence from a randomized controlled trial

Published online by Cambridge University Press:  17 April 2024

Jolanda Malamud*
Affiliation:
Applied Computational Psychiatry Lab, Mental Health Neuroscience Department, Division of Psychiatry and Max Planck Centre for Computational Psychiatry and Ageing Research, Queen Square Institute of Neurology, University College London, London, UK
Gemma Lewis
Affiliation:
Division of Psychiatry, University College London, London, UK
Michael Moutoussis
Affiliation:
Max Planck UCL Centre for Computational Psychiatry & Ageing Research, University College London, London, UK Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, University College London, London, UK
Larisa Duffy
Affiliation:
Division of Psychiatry, University College London, London, UK
Jessica Bone
Affiliation:
Division of Psychiatry, University College London, London, UK Research Department of Behavioural Science and Health, Institute of Epidemiology, University College London, London, UK
Ramya Srinivasan
Affiliation:
Division of Psychiatry, University College London, London, UK
Glyn Lewis
Affiliation:
Division of Psychiatry, University College London, London, UK
Quentin J. M. Huys
Affiliation:
Applied Computational Psychiatry Lab, Mental Health Neuroscience Department, Division of Psychiatry and Max Planck Centre for Computational Psychiatry and Ageing Research, Queen Square Institute of Neurology, University College London, London, UK
*
Corresponding author: Jolanda Malamud; Email: j.malamud@ucl.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Background

Selective serotonin reuptake inhibitors (SSRIs) are first-line pharmacological treatments for depression and anxiety. However, little is known about how pharmacological action is related to cognitive and affective processes. Here, we examine whether specific reinforcement learning processes mediate the treatment effects of SSRIs.

Methods

The PANDA trial was a multicentre, double-blind, randomized clinical trial in UK primary care comparing the SSRI sertraline with placebo for depression and anxiety. Participants (N = 655) performed an affective Go/NoGo task three times during the trial and computational models were used to infer reinforcement learning processes.

Results

There was poor task performance: only 54% of the task runs were informative, with more informative task runs in the placebo than in the active group. There was no evidence for the preregistered hypothesis that Pavlovian inhibition was affected by sertraline. Exploratory analyses revealed that in the sertraline group, early increases in Pavlovian inhibition were associated with improvements in depression after 12 weeks. Furthermore, sertraline increased how fast participants learned from losses and faster learning from losses was associated with more severe generalized anxiety symptoms.

Conclusions

The study findings indicate a relationship between aversive reinforcement learning mechanisms and aspects of depression, anxiety, and SSRI treatment, but these relationships did not align with the initial hypotheses. Poor task performance limits the interpretability and likely generalizability of the findings, and highlights the critical importance of developing acceptable and reliable tasks for use in clinical studies.

Funding

This article presents research supported by NIHR Program Grants for Applied Research (RP-PG-0610-10048), the NIHR BRC, and UCL, with additional support from IMPRS COMP2PSYCH (JM, QH) and a Wellcome Trust grant (QH).

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press

Introduction

Anxiety and depression are the most common mental health problems, often occurring together and constituting a significant fraction of the global disease burden (Bandelow et al., Reference Bandelow, Reitt, Röver, Michaelis, Görlich and Wedekind2015; Evans-Lacko et al., Reference Evans-Lacko, Aguilar-Gaxiola, Al-Hamzawi, Alonso, Benjet, Bruffaerts and Thornicroft2018; Hirschfeld, Reference Hirschfeld2001). Selective serotonin reuptake inhibitors (SSRIs) are commonly prescribed as first-line pharmacological treatments for both depression (Bogowicz et al., Reference Bogowicz, Curtis, Walker, Cowen, Geddes and Goldacre2021; Kendrick, Stuart, Newell, Geraghty, & Moore, Reference Kendrick, Stuart, Newell, Geraghty and Moore2015) and anxiety disorders (Garakani et al., Reference Garakani, Murrough, Freire, Thom, Larkin, Buono and Iosifescu2020). However, how SSRIs work beyond their initial pharmacological action on the serotonin transporter remains unclear (Harmer, Duman, & Cowen, Reference Harmer, Duman and Cowen2017).

Neuropsychological models propose that antidepressants, including SSRIs, may alter cognitive processing, leading to improvements in depressive and anxiety symptoms (Harmer, Goodwin, & Cowen, Reference Harmer, Goodwin and Cowen2009a). Reinforcement learning provides a framework for investigating links between cognitive and biological processes and hence the effect of SSRIs on cognition (Huys, Browning, Paulus, & Frank, Reference Huys, Browning, Paulus and Frank2021; Lan & Browning, Reference Lan and Browning2022; Maia & Frank, Reference Maia and Frank2011). Preclinical and experimental research has established that several cognitive functions relevant to the etiology of anxiety and depression are sensitive to SSRIs (e.g. Geurts, Huys, den Ouden, & Cools, Reference Geurts, Huys, den Ouden and Cools2013a; Guitart-Masip, Duzel, Dolan, & Dayan, Reference Guitart-Masip, Duzel, Dolan and Dayan2014; Harmer, Reference Harmer2013; Michely, Eldar, Erdman, Martin, & Dolan, Reference Michely, Eldar, Erdman, Martin and Dolan2022; Michely, Eldar, Martin, & Dolan, Reference Michely, Eldar, Martin and Dolan2020; Roiser, Elliott, & Sahakian, Reference Roiser, Elliott and Sahakian2012a; Roiser et al., Reference Roiser, Levy, Fromm, Goldman, Hodgkinson, Hasler and Drevets2012b). However, there is little evidence tying these experimental effects of SSRIs on cognition to improvement in symptoms in clinical settings as only a few clinical randomized controlled trials (RCTs) have evaluated candidate mechanisms to explain treatment effects (Ahmed et al., Reference Ahmed, Bone, Lewis, Freemantle, Harmer, Duffy and Lewis2022; Cuthbert & Insel, Reference Cuthbert and Insel2013; Morris et al., Reference Morris, Sanislow, Pacheco, Vaidyanathan, Gordon and Cuthbert2022; Pizzagalli et al., Reference Pizzagalli, Smoski, Ang, Whitton, Sanacora, Mathew and Krystal2020). Evaluations in the context of RCTs comparing SSRIs and placebo provide a strong test of whether specific cognitive or learning processes are the mechanisms through which SSRIs alleviate symptoms of anxiety and depression.

The present study investigates whether SSRIs improve symptoms by modulating reinforcement learning processes, specifically aversive Pavlovian control. Aversive Pavlovian control refers to the automatic, stereotyped inhibition of actions in the face of negative expectations (Bolles, Reference Bolles1970; Dayan, Niv, Seymour, & Daw, Reference Dayan, Niv, Seymour and Daw2006), an effect that can be robustly observed in humans using neurocognitive probes (Boureau & Dayan, Reference Boureau and Dayan2011; Guitart-Masip et al., Reference Guitart-Masip, Huys, Fuentemilla, Dayan, Duzel and Dolan2012; Huys, Moutoussis, & Williams, Reference Huys, Moutoussis and Williams2011b). Aversive Pavlovian control is a promising candidate mechanism for the treatment of SSRIs. It is sensitive to serotonergic functioning in animal (Abela et al., Reference Abela, Browne, Sargin, Prevot, Ji, Li and Fletcher2020; Amo et al., Reference Amo, Fredes, Kinoshita, Aoki, Aizawa, Agetsuma and Okamoto2014; Doya, Miyazaki, & Miyazaki, Reference Doya, Miyazaki and Miyazaki2021; Ohmura, Tanaka, Tsunematsu, Yamanaka, & Yoshioka, Reference Ohmura, Tanaka, Tsunematsu, Yamanaka and Yoshioka2014) and preclinical studies (Crockett, Clark, Apergis-Schoute, Morein-Zamir, & Robbins, Reference Crockett, Clark, Apergis-Schoute, Morein-Zamir and Robbins2012; Crockett, Clark & Robbins, Reference Crockett, Clark and Robbins2009; Geurts, Huys, den Ouden, & Cools, Reference Geurts, Huys, den Ouden and Cools2013b; Hebart & Gläscher, Reference Hebart and Gläscher2015). Moreover, aversive Pavlovian control is associated with symptoms of depression and anxiety in clinical samples (Huys et al., Reference Huys, Gölzer, Friedel, Heinz, Cools, Dayan and Dolan2016; Nord, Lawson, Huys, Pilling, & Roiser, Reference Nord, Lawson, Huys, Pilling and Roiser2018) and in general population samples with anxiety traits (Mkrtchian, Aylward, Dayan, Roiser, & Robinson, Reference Mkrtchian, Aylward, Dayan, Roiser and Robinson2017). Influential reviews have highlighted the prominence of inhibition in response to negative expectations in depression (Roiser et al., Reference Roiser, Elliott and Sahakian2012a) and of avoidance driven by negative expectations in anxiety (LeDoux, Moscarello, Sears, & Campese, Reference LeDoux, Moscarello, Sears and Campese2017). Modifying aversive Pavlovian control is hence clinically promising (Huys, Russek, Abitante, Kahnt, & Gollan, Reference Huys, Russek, Abitante, Kahnt and Gollan2022; Martell, Dimidjian, & Herman-Dunn, Reference Martell, Dimidjian and Herman-Dunn2010).

In terms of underlying mechanisms, computational models have proposed formal relationships between rumination, acute reductions in central serotonin levels, and the attenuation of aversive Pavlovian control (Dayan & Huys, Reference Dayan and Huys2008, Reference Dayan and Huys2009; Huys et al., Reference Huys, Eshel, O'Nions, Sheridan, Dayan and Roiser2012; Robinson et al., Reference Robinson, Overstreet, Allen, Letkiewicz, Vytal, Pine and Grillon2013). At the neural level, the subgenual anterior cortex has been implicated in aversive Pavlovian control in research involving primates (Amemori & Graybiel, Reference Amemori and Graybiel2012) and healthy volunteers (Lally et al., Reference Lally, Huys, Eshel, Faulkner, Dayan and Roiser2017). This brain region is also recognized for its involvement in anxiety, as demonstrated in a study of healthy volunteers with contextual fear (Alvarez, Chen, Bodurka, Kaplan, & Grillon, Reference Alvarez, Chen, Bodurka, Kaplan and Grillon2011; Hasler et al., Reference Hasler, Fromm, Alvarez, Luckenbaugh, Drevets and Grillon2007) and it has been linked to depression in both preclinical (Drevets, Savitz, & Trimble, Reference Drevets, Savitz and Trimble2008; Ramirez-Mahaluf, Perramon, Otal, Villoslada, & Compte, Reference Ramirez-Mahaluf, Perramon, Otal, Villoslada and Compte2018) and depression treatment studies (Mayberg et al., Reference Mayberg, Lozano, Voon, McNeely, Seminowicz, Hamani and Kennedy2005).

Appetitive Pavlovian control may also be affected. In both clinical and subclinical depression samples, there have been reports of blunted reward responses (Bylsma, Morris, & Rottenberg, Reference Bylsma, Morris and Rottenberg2008; Eshel & Roiser, Reference Eshel and Roiser2010; Halahakoon et al., Reference Halahakoon, Kieslich, O'Driscoll, Nair, Lewis and Roiser2020; Pizzagalli, Jahn, & O'Shea, Reference Pizzagalli, Jahn and O'Shea2005; Steele, Kumar, & Ebmeier, Reference Steele, Kumar and Ebmeier2007), possibly due to reduced specificity (Huys et al., Reference Huys, Gölzer, Friedel, Heinz, Cools, Dayan and Dolan2016; Nord et al., Reference Nord, Lawson, Huys, Pilling and Roiser2018). Serotonergic manipulations have also shown effects on appetitive Pavlovian processes in animals (Cohen, Amoroso, & Uchida, Reference Cohen, Amoroso and Uchida2015) and healthy volunteers (Michely et al., Reference Michely, Eldar, Martin and Dolan2020).

As such, Pavlovian control may be a candidate mediator of the effect of SSRIs on anxiety and depression. Here, we report a test of this hypothesis in the context of the PANDA RCT (Lewis et al., Reference Lewis, Duffy, Ades, Amos, Araya, Brabyn and Lewis2019). This trial compared sertraline to placebo for the treatment of depression in primary care in the UK (Duffy et al., Reference Duffy, Bacon, Clarke, Donkor, Freemantle, Gilbody and Lewis2019; Lewis et al., Reference Lewis, Duffy, Ades, Amos, Araya, Brabyn and Lewis2019; Salaminios et al., Reference Salaminios, Duffy, Ades, Araya, Button, Churchill and Lewis2017). PANDA found no evidence that sertraline reduced depressive symptoms to a clinically meaningful extent at 6 weeks, with only a weak effect at 12 weeks. However, they found evidence that sertraline reduced anxiety at 6 and 12 weeks. We measured Pavlovian inhibition and a number of other reinforcement learning processes during this trial using computational modeling of the affective Go/NoGo task (Guitart-Masip et al., Reference Guitart-Masip, Huys, Fuentemilla, Dayan, Duzel and Dolan2012). This is a well-established learning paradigm in which computational analyses allow appetitive and aversive Pavlovian processes to be measured (Guitart-Masip et al., Reference Guitart-Masip, Huys, Fuentemilla, Dayan, Duzel and Dolan2012).

We pre-registered an analysis plan investigating five main hypotheses (osf.io/7q8v2). The primary analyses aimed to test whether treatment with the SSRI sertraline alters aversive Pavlovian control and whether aversive Pavlovian control is related to anxiety, i.e. whether Pavlovian inhibition might mediate the effect of sertraline on anxiety. We also examined the relationship between appetitive Pavlovian biases and depressive symptomatology. Overall, task compliance was poor, and the primary hypotheses were not supported. However, exploratory analyses did reveal that higher changes in aversive Pavlovian bias early on were linked to more severe depression after 12 weeks. Additionally, there was an effect of SSRI treatment on the aversive learning rate at week two and an association between learning from losses and anxiety.

Methods

Ethics

The National Research Ethics Service Committee, East of England – Cambridge South approved the study (ref: 13/EE/0418). The Medicines and Healthcare Products Regulatory Agency gave clinical trial authorization. Written informed consent was obtained from each participant before the study.

Participants

We present secondary analyses of data acquired in the context of the PANDA trial. PANDA was a randomized, double-blind, placebo-controlled pragmatic study investigating the clinical effectiveness of sertraline on depressive symptoms as the primary outcome.

Patients (aged 18–74 years) were recruited from 179 primary care surgeries in four UK sites (Bristol, Liverpool, London, York). The critical entry criterion was that general practitioners (GPs) and/or patients were uncertain about the potential benefits of an antidepressant. No lower or higher thresholds were set on depression severity or duration. The study aimed for a diverse participant pool by including doctors with varied decision-making approaches, promoting clinical equipoise, and capturing the spectrum of depressive symptom severity. The exclusion criteria were: unable to understand or complete study questionnaires in English; antidepressant treatment in the past eight weeks; comorbid psychosis, schizophrenia, mania, hypomania, bipolar disorder, dementia, eating disorder, or major alcohol or substance abuse; and medical contraindications for sertraline.

Patients were randomized to sertraline or placebo, stratified by severity, duration, and site, and followed up after 2, 6, and 12 weeks. For the first week, patients received one capsule (50 mg sertraline or placebo) a day. From week two onwards, they took two capsules per day, either containing 100 mg of sertraline or placebo, for up to 11 weeks. Medication could be increased to 150 mg in consultation with the local principal investigator in cases of non-response after six weeks. The study was double-blind: study patients, care providers, and all members of the research team were blinded to the study treatment allocation (Salaminios et al., Reference Salaminios, Duffy, Ades, Araya, Button, Churchill and Lewis2017).

Measurements

The Go/NoGo task (Fig. 1a) was designed to study Pavlovian appetitive and aversive influence on choice by crossing action (go v. nogo) and valence (rewards v. losses; Guitart-Masip et al., Reference Guitart-Masip, Huys, Fuentemilla, Dayan, Duzel and Dolan2012). Participants were verbally instructed that each fractal would lead to a more favorable outcome with either go or nogo, but that outcomes were probabilistic (cf. Fig. 1 for detailed task description). Each task administration employed a different fractal set. Fractal sets were randomized across participants and assessment timepoints. The Go/NoGo task was assessed at baseline, at two weeks (follow-up 1), and at six weeks (follow-up 2), but it was not part of the 12 weeks assessment (follow-up 3). The Generalized Anxiety Disorder Assessment (GAD-7; Spitzer, Kroenke, Williams, & Löwe, Reference Spitzer, Kroenke, Williams and Löwe2006), the Patient Health Questionnaire-9 (PHQ-9; Kroenke, Spitzer, & Williams, Reference Kroenke, Spitzer and Williams2001), and the Beck Depression Inventory (BDI; Beck, Steer, & Brown, Reference Beck, Steer and Brown1996) were completed at baseline and every follow-up. Several baseline variables were acquired (cf. Table 1).

Figure 1. Task and performance. (a) The Go/NoGo task consisted of four different conditions. On each trial one of four possible fractal images was shown. Actions were required in response to a circle that followed the fractal image after a variable delay. After a brief delay, the outcome was presented: a green upward arrow for a win, a red downward arrow for a loss, or a horizontal bar for a neutral outcome. In the go-to-win condition, pressing the key (‘go’) led to a reward with 80% and a neutral outcome with 20% probability, vice versa if they did not press the key (‘nogo’). In the go-to-avoid condition, pressing the key (‘go’) led to a neutral outcome with 80% and a loss with 20% probability. In the nogo-to-win, not pressing the key (‘nogo’) led to a reward with 80% and a neutral outcome with 20% probability. In the nogo-to-avoid condition, not pressing the key (‘nogo’ response) led to a neutral outcome with 80% and a loss with 20% probability. Each task administration consisted of 96 trials, with 24 trials per condition. (b) Mean percentage of correct responses in each of the four conditions. Black dots depict participants and black error bars depict standard deviation of the mean (s.d.). Dashed lines depict chance level. Post hoc comparisons were implemented by means of repeated measures t tests showing a significant difference in accuracy between Pavlovian congruent (got to win and nogo to avoid) and incongruent conditions (go to avoid and nogo to win). Significance ∗ ≤ 0.05, ∗∗ ≤ 0.01, ∗∗∗ ≤ 0.001, ∗∗∗∗ ≤ 0.0001.

Table 1. Baseline characteristics for participants providing informative Go/NoGo task data (N = 435)

Data are reported in N(%) or mean(s.d.). There was no evidence for differences in baseline characteristics between the treatment groups shown by the p values (≤0.05). PHQ-9 = Patient Health Questionnaire, 9-item version total score (possible range 0–27). GAD-7 = Generalized Anxiety Disorder Assessment, 7-item version total score (possible range 0–21). BDI = Beck Depression Inventory, 21-item version total score (possible range 0–63). CIS-R = Clinical Interview Schedule-Revised measuring depression severity score (possible range 0–21).

Computational models

Previously published computational models for this task (Guitart-Masip et al., Reference Guitart-Masip, Huys, Fuentemilla, Dayan, Duzel and Dolan2012; Mkrtchian et al., Reference Mkrtchian, Aylward, Dayan, Roiser and Robinson2017; Moutoussis et al., Reference Moutoussis, Bullmore, Goodyer, Fonagy, Jones, Dolan and Dayan2018; Scholz et al., Reference Scholz, Hook, Kandroodi, Algermissen, Ioannidis, Christmas and den Ouden2022) provide formal, quantitative descriptions of the evolution of decisions over the course of learning during the task. The core parameters of interest in the models are the Pavlovian parameters. These capture appetitive Pavlovian influences through the extent to which participants automatically emit ‘go’ responses when faced with reward stimuli, and aversive Pavlovian inhibition through the extent to which they automatically emit ‘nogo’ responses when faced with loss stimuli. The Pavlovian processes are separate from instrumental learning processes, which emit ‘go’ and ‘nogo’ according to which of the two actions is more likely to lead to the better outcome. Other parameters include reward and loss sensitivity, learning rates, irreducible noise, and an overall ‘go’ bias.

Data validation

To evaluate whether the existing data was in principle sufficient to assess the key hypotheses, and to provide an informative a-priori estimate of power, two authors (J. M. and Q. J. M. H) were provided with blinded access to the behavioral task data only, but without access to group allocation, demographics, or measures of symptoms. These authors fitted different reinforcement learning (RL) models (for a list of the models, see Supplementary Materials B.1 RL Models) as described previously in the literature (cf. Huys et al. (Reference Huys, Cools, Gölzer, Friedel, Heinz, Dolan and Dayan2011a) and Supplementary Materials B.2 Model Fitting Procedure & B.3 Model Comparison). All datasets of the study were combined, disregarding within-subject information (i.e. treating repeated sessions as independent task assessments). In the supplements, we report the recoverability and reliability of the parameters (cf. Supplementary Materials Fig. B.3 and B.4).

Models were fitted separately to the data and compared using the integrated Bayesian Information Criterion (iBIC; Figure 2a) at the group level, where the individual likelihoods were first integrated over the individual parameters using a sampling procedure and then summed over all individuals. The most parsimonious model included learning rates, outcome sensitivities, and Pavlovian biases, all separated into rewarding and punishing contexts. Figure 2c shows that simulated data captured the empirical data qualitatively. Hence, standard models of the task are able to parametrically capture the variability of behavioral performance in the task across individuals and sessions on a trial-by-trial level.

Figure 2. Computational modeling of the Go/NoGo task. (a) shows the differences in integrated Bayesian Information Criterion (iBIC) scores for all models tested compared to the most parsimonious model (red star), where a smaller iBIC score indicates a more parsimonious model. All models are modified Q-learning models (Rescorla Wagner – RW) with two pairs of action-values (‘go’ and ‘nogo’) for each stimulus. The y-axis shows the number of free parameters for each model. The most parsimonious model includes separate learning rates for rewards and punishments, win and loss sensitivities, appetitive and aversive Pavlovian biases, irreducible noise, and a constant bias factor added to the action-value for ‘go’. (b) shows the histogram of the difference between the integrated loglikelihood (iLL) of the most parsimonious model and the iLL of the random baseline model. Datasets were declared as informative if the data was more than three times more likely to have occurred under the most parsimonious model (vertical red dashed line). (c) The four subplots show the average learning curves in blue (averaged over participants; solid line) for each condition separately. Each row of the raster images shows the choices of each participant. ‘Go’ responses are depicted in white, and ‘nogo’ responses are depicted in grey. Additionally, the average ‘go’ probability was separated into included datasets (orange) and excluded datasets (green). The solid line refers to empirical data and the dashed line to simulated data from the most parsimonious model. Informative datasets (orange) show that participants, on average, seem to learn over trials, which can be captured qualitatively well by the most parsimonious model. In contrast, the average ‘go’ probability of non-informative/excluded datasets (green) appears to have no temporal relation, hence showing no learning over trials. Further, it is well captured by the random baseline model.

In the Go/NoGo task, non-informative responses (e.g. always emitting the same response) cannot provide information about Pavlovian or other cognitive processes and therefore do not inform parameter estimates. Whether the data of a particular task run are meaningful can be evaluated formally by examining whether a model encompassing the core processes provides a more parsimonious account of the behavioral data than a random baseline model. In other words, to examine whether the observed behavioral data meaningfully constrained the model parameter estimates, we compared the integrated likelihood of the most parsimonious model to the integrated likelihood of a random baseline model for each dataset from each individual at each session. The integrated likelihood integrated over an individual parameters refers to the likelihood of the data given the group-level hyperparameters. A task run was deemed as missing if the integrated likelihood of the random baseline model was more than three times higher than that of the most parsimonious model at the group level (Fig. 2b). Note, the model selection process conducted only on the informative task runs yielded consistent results with those obtained on the complete dataset (cf. Supplementary Materials Fig. B.2).

The parameters for each informative task run were extracted from the most parsimonious model to test the hypotheses.

Preregistration

The key hypotheses and analyses were pre-registered on OSF (osf.io/7q8v2; cf. Supplementary Materials Table D.4).

Statistical analyses

Predictors of missing and non-informative data at baseline were identified using a univariate logistic regression. Significantly related baseline variables were used as covariates in all further analyses.

To investigate drug effects, we employed a mixed-effects linear regression (1) using group allocation as the independent variable and the parameter estimate (e.g. aversive Pavlovian bias as the dependent variable) controlling for stratification variables (baseline CIS-R total score in three categories, duration of depressive episode in two categories, and site) and including random intercepts. We reported mean differences (MD), 95% confidence intervals (CI), and the corresponding p values (p).

Next, we examined whether parameter estimates relate to depressive or anxiety symptoms using a mixed-effects multiple linear regression (2) with the parameter estimate as independent variable and log-transformed symptom scores (e.g. GAD-7 total score) as dependent variable. Random slopes and intercepts per individual were included. We controlled for group allocation and stratification variables. We reported regression coefficients (β), 95% confidence intervals (CI), and the corresponding p values (p).

For both analyses, we performed separate mixed-effects models for baseline and week two, baseline and week six and over all three time-points. To investigate a potential drug time interaction, we additionally performed a regression including a group-time interaction. The group variable in the mixed-effects models was coded [0,1,1] for a patients allocated to sertraline and [0, 0, 0] for a patients allocated to placebo. Both groups have a 0 at baseline because they were unmedicated at that time.

To investigate whether a baseline parameter estimate predict treatment outcome, we performed a simple linear regression predicting symptoms core at the at week 12 controlling for symptoms at baseline, group allocation and stratification variables.

As an exploratory analysis we examined whether early change in aversive Pavlovian bias (week 2 – baseline) relates to log-transformed BDI total score at week 12 using a simple linear regression including an interaction effect between group-allocation and Pavlovian bias.

Exploratory analyses repeated the analysis type 1 above for each individual parameter and used Bonferroni-correction to correct for testing multiple parameters (p ≤ (0.05/8) ≤ 0.00625).

Additionally, we conducted simple linear regression examining group differences in parameter slopes (early change = week two – baseline; late change = week six – week two). We also repeated analysis type 2 for each of the parameter estimates and the three psychological measures (GAD-7, PHQ-9, BDI) and used Bonferroni-correction to correct for testing multiple parameters (p ≤ (0.05/8 × 3) ≤ 0.002).

Finally, to assess test–retest reliability we calculated Pearson correlation of individuals' parameters between the different time points and employed intra-class correlation coefficients (ICCs; McGraw & Wong, Reference McGraw and Wong1996) using the informative data (cf. Supplementary Materials B.6 Test-Retest Reliability).

Results

A total of 655 patients were recruited and randomly assigned to sertraline (326, 50%) and placebo (329, 50%). Two patients in the sertraline group did not complete a substantial proportion of the baseline assessment and were excluded. Additionally, 25 patients (9 from the sertraline group and 16 from placebo) did not complete the Go/NoGo task at any time-point. This left 628 participants (315 sertraline and 313 placebo) for analyses (cf. Fig. A.1 in Supplementary Materials). Task data for seven patients at baseline, 99 patients at 2 weeks, and 145 patients at 6 weeks were missing. Missing follow-up data were more common in participants who had higher baseline depressive and anxiety symptoms, financial difficulties, were from ethnic minorities and were recruited from London (cf. Supplementary Materials Table E.5). Missing data did not differ statistically by treatment allocation.

Basic task characteristics

Examination of the average percent correct response per condition showed the typical interaction pattern characteristic of Pavlovian inference found in previous studies (Guitart-Masip et al., Reference Guitart-Masip, Huys, Fuentemilla, Dayan, Duzel and Dolan2012; Mkrtchian et al., Reference Mkrtchian, Aylward, Dayan, Roiser and Robinson2017; Moutoussis et al., Reference Moutoussis, Bullmore, Goodyer, Fonagy, Jones, Dolan and Dayan2018; Scholz et al., Reference Scholz, Hook, Kandroodi, Algermissen, Ioannidis, Christmas and den Ouden2022) at all measurement points (Fig. 1b). Performance was better in Pavlovian congruent (go to win and nogo to avoid) than incongruent (go to avoid and nogo to win) conditions (|t| ∈ [4.65, 16.70], p < 0.001). There were no differences in average performance between patients allocated to sertraline and patients allocated to placebo (|MD| ∈ [0.00, 0.03], p > 0.05).

Computational modeling results

Overall, 747 (46%) task runs did not contain interpretable and informative behavioral data. Variables associated with non-informative behavior were higher age, lower education, and past antidepressant use. At week 2 non-informative task runs (N = 230, 43%) were more likely in patients who were allocated to the sertraline group (57%, X2 = 7.06, p = 0.008). In addition, baseline anxiety score, depression severity, and employment status were predictive of non-informative behavior at week 6 (cf. Supplementary Materials Table E.6). For all further analyses we focused on the 886 informative task runs from 435 patients (66% of those originally randomized) and adjusted for significant predictors of non-informative data as covariates. Characteristics of the remaining sample according to the study arm are shown in Table 1. Baseline characteristics of the sample were not statistically distinguishable between treatment groups.

The effect of sertraline on anxiety remained significant in the smaller included sample (week 6: MD = −0.1, CI[ − 0.17 to  − 0.03], p = 0.005; week 12: MD = −0.12, CI[ − 0.17 to  − 0.06], p ≤ 0.001; over time: MD = −0.08, CI[ − 0.12 to  − 0.04], p ≤ 0.001). This is important since our preregistered hypotheses were developed under consideration of this effect.

Preregistered hypotheses

The preregistered hypotheses were not supported (Table 2): there was no evidence that the aversive Pavlovian inhibition was affected by sertraline (Fig. 3a and b); that aversive Pavlovian inhibition was related to anxiety symptoms; that the baseline aversive Pavlovian bias was predictive of treatment response; that the appetitive Pavlovian bias was associated with depression or that the reward sensitivity was related to anhedonia.

Table 2. Mixed-effect linear models testing our pre-registered hypotheses (only informative data N task runs = 886; N patients = 435, 66% of those randomized)

We tested whether sertraline alters aversive Pavlovian control (Hypothesis 1; H1) and whether aversive Pavlovian control is related to anxiety (Hypothesis 2; H2). Hypothesis 3 regarding the aversive Pavlovian bias as a mediator for the effect of sertraline on anxiety was not investigated as there was no evidence for H1 and H2. Hypothesis 4 (H4) tested whether aversive Pavlovian bias at baseline before starting SSRI treatment predicted treatment outcome. Hypothesis 5 (H5) examined the relationship between the appetitive Pavlovian bias and depressive symptoms. Hypothesis 6 (H6), tested for a relationship between reward sensitivity and anhedonia. We controlled for stratification variables and variables associated with missing data in all analyses.

Figure 3. Effects of sertraline on RL parameters. (a) Shows the aversive Pavlovian bias at baseline and at the follow-ups separated into drug groups (blue, left = placebo; red, right = sertraline). (b) Shows the change in aversive Pavlovian bias between sessions separately for the drug groups. (c) Early changes in the aversive Pavlovian bias predict treatment outcome. This figure shows the relation between the change from baseline to week two in the aversive Pavlovian bias and log-transformed BDI total score (only of participants who had an informative task run at baseline and week 2). In blue the placebo group and in red the sertraline group. An interaction effect was observed between group and early change in the aversive Pavlovian predicting depression at 12 weeks driven by a significant association between the early change and log-transformed BDI total score at 12 weeks. (blue, left = placebo; red, right = sertraline). (d) Shows the loss learning rate at baseline and at the follow-ups separated into drug groups (blue, left = placebo; red, right = sertraline). (e) Shows the change in loss learning rate between sessions separately for the drug groups. Significance *  ≤  0.05, **  ≤  0.01, ***  ≤  0.001, ****  ≤  0.0001.

Exploratory analyses

Exploratory analyses of a subsample with better test–retest correlation, and of a subsample with low symptoms did not support the pre-registered hypotheses (cf. Supplementary Materials F Subsample Analyses).

Two sets of results in the exploratory analyses are noteworthy. The first relates to early change in the aversive Pavlovian bias. The slope of the aversive Pavlovian bias between baseline and week two was positively related to depressive symptoms at week 12 (log-transformed PHQ9 total score: β = 0.06, CI[0.0 − 0.11], p = 0.044; log-transformed BDI total score: β  =  0.07, CI[0.01 − 0.13], p = 0.016). A larger increase in aversive Pavlovian bias was associated with more severe subsequent depressive symptoms. Furthermore, the BDI model revealed an interaction between group allocation and early change in the aversive Pavlovian bias (β = 0.14, CI [0.02–0.26], p = 0.024; Figure 3c). That is, early change in aversive Pavlovian bias was more strongly related to BDI scores at week 12 in the sertraline group (β = 0.14, CI[0.05 − 0.23]), than in the placebo group (β = 0.02, CI[ − 0.03 to 0.07]). However, note that sertraline had no effect on the early change in aversive Pavlovian bias (MD = −0.08, CI[ − 0.32 to 0.15], p = 0.49). The second set of findings relates to the speed at which participants adapted behavior following losses (the loss learning rate). There was an effect of sertraline on the loss learning rate at week 2 (MD = 0.6, CI[0.22 − 0.97], p = 0.002; Figure 3d). The sertraline group learned faster from losses at week 2 than the placebo group. Early change in loss learning rate (week two – baseline) was higher in the sertraline group (MD = 0.75, CI[0.18 − 1.3], p = 0.009; Figure 3e), whereas later change (week six minus week two) was lower in the sertraline group (MD = −0.72, CI[ − 1.27 to  − 0.17], p = 0.011; Figure 3e). In the sertraline group, the early change was different from zero (t = 2.74, p = 0.007), whereas the later change was not (t = −0.32, p = 0.75). In contrast, in the placebo group, the early change did not differ from zero (t = −0.70, p = 0.483), but the late change did (t = 3.44, p < 0.001). Hence, the group difference in the late change was due to an increase in loss learning rate from baseline to week 6 in the placebo group. The aversive learning rate is strongly driven by switching after losses in the early part of the learning curve. Indeed, there was an elevated switching probability after losses during the first eight trials in the sertraline group (MD = 0.21, CI [0.0–0.41], p = 0.048; averaged across the go-to-avoid and nogo-to-avoid conditions). Finally, the loss learning rate was also positively associated with the anxiety scores (at week 2: β = 0.01, CI[0.0 − 0.02], p = 0.047; at week 6: β = 0.02, CI[0.0 − 0.03], p = 0.016; across all sessions: β = 0.02, CI[0.01 − 0.03], p = 0.001). However, there was no evidence for an association between anxiety symptoms and either the loss learning rate at baseline (β = 0.01, CI [−0.0 to 0.02], p = 0.24) or the early change in loss learning rate (week 2 – baseline; β = −0.02, CI[ − 0.04 to 0.01], p = 0.164). Additionally, this effect could not be shown based on the early switch probability described above (β = 0.0, CI[ − 0.02 to 0.02], p = 0.73).

Repeating these analyses on the complete sample including all task runs resulted in a broadly consistent pattern of effects (c.f. Supplementary Materials C Findings in the Whole Sample).

In post hoc analyses, we adjusted for the use of other antidepressants and/or psychotherapy, adherence score, and the number of tablets, yielding consistent results as detailed in the Supplementary Materials G Post-hoc Analyses.

Task reliability

Parameters showed poor to moderate reliability (ICC(3,1) ranging from 0 to 0.53; cf. Supplementary Materials B.6 Test-Retest Reliability). The aversive Pavlovian bias was the most reliable parameter (ICC(3, 1) = 0.53, CI[0.41 − 0.64], p < 0.001). The Pavlovian parameters and the go bias also significantly changed over time. The Pavlovian biases decreased (aversive: β = −0.1, CI[ − 0.16 to  − 0.04], p = 0.001; appetitive: β = −0.08, CI[ − 0.13 to  − 0.04], p < 0.001) and the go bias increased (β = 0.13, CI[0.05 − 0.21], p < 0.001) over sessions which likely led to an increase in task accuracy (β = 0.02, CI[0.01 − 0.03], p < 0.001). We note that age reduced accuracy (β = −0.03, CI[ − 0.04 to  − 0.02], p < 0.001), most likely due to increasing Pavlovian biases (aversive: β = 0.19, CI[0.11 − 0.26], p < 0.001; appetitive: β = 0.16, CI[0.11 − 0.21], p < 0.001) and reducing go bias with age (β = −0.4, CI [−0.49 to−0.32], p < 0.001).

Discussion

We investigated the effects of the SSRI sertraline on reinforcement learning mechanisms in the PANDA trial, a pragmatic multicenter, double-blind, placebo-controlled, randomized clinical trial. SSRIs are first-line pharmacological treatments for depression and anxiety, but the mechanism of SSRI action is still unknown. A better understanding of how SSRIs work could lead to improved response predictions and new, refined treatments. Our goal was to identify clinically relevant mechanisms to link receptor action to cognition and affective processing. Reinforcement learning enables such links and hence is a promising framework for investigating the mechanisms of SSRI action. The PANDA trial was the largest individual placebo-controlled trial not funded by the pharmaceutical industry. The sample was recruited in primary care based on clinical equipoise, and depressive symptoms ranged from mild to severe. Findings might therefore be of relevance to the broader primary care population. As sertraline acts through similar mechanisms as other SSRIs (Cipriani et al., Reference Cipriani, Furukawa, Salanti, Chaimani, Atkinson, Ogawa and Geddes2018), the findings may also be relevant for other SSRIs.

Due to the poor task performance, almost half of the performed task runs were excluded. Early on (at week two) non-informative data was more prevalent in the sertraline group, suggesting that patients in the active group may have responded more randomly. Such randomness can be a signature of low overall motivation to perform the task. One possibility is that such a broad motivational reduction could be a signature of SSRI-induced affective blunting (Barnhart, Makela, & Latocha, Reference Barnhart, Makela and Latocha2004; Marazziti et al., Reference Marazziti, Mucci, Tripodi, Carbone, Muscarella, Falaschi and Baroni2019; McCabe, Cowen, & Harmer, Reference McCabe, Cowen and Harmer2009; Price, Cole, & Goodwin, Reference Price, Cole and Goodwin2009). However, there were no discernible differences in symptoms between patients who provided informative and non-informative data at week two, and sertraline had a positive impact on learning at week two in the included sample. These findings speak against a broad blunting effect.

The primary goal of this study was to test whether aversive Pavlovian bias mediates the effect of sertraline on anxiety. We found no evidence supporting an influence of sertraline on aversive Pavlovian bias. This result contrasts with previous research suggesting that Pavlovian inhibition is sensitive to serotonin (Crockett et al., Reference Crockett, Clark and Robbins2009, Reference Crockett, Clark, Apergis-Schoute, Morein-Zamir and Robbins2012; Geurts et al., Reference Geurts, Huys, den Ouden and Cools2013b; Hebart & Gläscher, Reference Hebart and Gläscher2015). There are several possible reasons for this discrepancy. First, it may be that serotonin manipulations have different effects on Pavlovian inhibition in samples with and without depression and/or anxiety. While the current study was performed in a clinical population, previous studies primarily examined healthy volunteers. Second, previous research focused on acute changes via tryptophan depletion (Crockett et al., Reference Crockett, Clark and Robbins2009, Reference Crockett, Clark, Apergis-Schoute, Morein-Zamir and Robbins2012; Geurts et al., Reference Geurts, Huys, den Ouden and Cools2013a; Hebart & Gläscher, Reference Hebart and Gläscher2015) or a single administration of an SSRI citalopram (Guitart-Masip et al., Reference Guitart-Masip, Duzel, Dolan and Dayan2014) rather than the chronic administration examined here. It has long been posited that acute and chronic SSRI administration have opposite effects (e.g. Harmer, Cowen, & Goodwin, Reference Harmer, Cowen and Goodwin2011; Harmer et al., Reference Harmer, O'Sullivan, Favaron, Massey-Chase, Ayres, Reinecke and Cowen2009b). Third, we cannot rule out that some of the Pavlovian inhibition signal is conflated with the loss learning signal as there are non-negligible correlations between parameters (cf. Supplementary Materials Fig. B.6). This is likely compounded by broader issues with data quality, which in turn reduce the ability of models to distinguish aversive Pavlovian inhibition and learning from losses. Subsample analyses attempting to identify either test–retest or symptom loads as reasons for the null results did not yield clear results.

Exploratory analyses identified relationships between sertraline, aversive processing, and symptoms. First, sertraline affected learning from losses but not from rewards. This finding is in keeping with well-supported empirical evidence demonstrating that serotonin modulation impacts learning (Bari et al., Reference Bari, Theobald, Caprioli, Mar, Aidoo-Micah, Dalley and Robbins2010; Brigman et al., Reference Brigman, Mathur, Harvey-White, Izquierdo, Saksida, Bussey and Holmes2010; Michely et al., Reference Michely, Eldar, Martin and Dolan2020; Scholl et al., Reference Scholl, Kolling, Nelissen, Browning, Rushworth and Harmer2017), and specifically punishment learning (Chamberlain et al., Reference Chamberlain, Müller, Blackwell, Clark, Robbins and Sahakian2006; Cools, Roberts, & Robbins, Reference Cools, Roberts and Robbins2008; Tanaka et al., Reference Tanaka, Schweighofer, Asahi, Shishida, Okamoto, Yamawaki and Doya2007, Reference Tanaka, Shishida, Schweighofer, Okamoto, Yamawaki and Doya2009). Prolonged serotonin alterations have downstream effects including augmented learning and plasticity (Dayer, Reference Dayer2014; Kraus, Castrén, Kasper, & Lanzenberger, Reference Kraus, Castrén, Kasper and Lanzenberger2017). In the current dataset, the learning rate from losses increased over the first two weeks of sertraline treatment relative to placebo. The placebo group then ‘caught up’, removing the group differences in loss learning rate at six weeks. Changes in the performance of learning tasks are frequently observed and thought to represent a type of meta-learning, i.e. learning more broadly about the strategy of performing a task rather than learning within the task itself (Botvinick et al., Reference Botvinick, Ritter, Wang, Kurth-Nelson, Blundell and Hassabis2019; Doya, Reference Doya2002; Langdon et al., Reference Langdon, Botvinick, Nakahara, Tanaka, Matsumoto and Kanai2022; Vanschoren, Reference Vanschoren, Hutter, Kotthoff and Vanschoren2019). As such, the late change in performance in the placebo group compared to the early change in the sertraline group suggests that sertraline may have increased the speed at which this meta-learning may have occurred and may have done so by specifically altering behavioral adaptation after losses within the task. One complication is that, at two weeks, there was already some evidence for changes in anxiety symptoms, and an inverse causal path (with anxiety mediating the effect of sertraline) cannot be excluded.

The loss learning rate was correlated with anxiety symptoms at both follow-up time points and over all measurement points. This is, in principle, in line with previous research outlined in a recent meta-analysis reporting higher punishment learning rates and slightly lower reward learning rates in patients (Pike & Robinson, Reference Pike and Robinson2022). Yet, this is difficult to reconcile with, first, the SSRI-induced increase in learning from punishment, and second the fact that both anxiety and depression are treated by SSRIs and are linked to heightened punishment learning themselves. Interestingly, a similar conundrum was present in the literature on learned helplessness, which was associated with increased levels of serotonin (Petty, Kramer, & Moeller, Reference Petty, Kramer and Moeller1994), but could also be reversed as a response to SSRIs (Hajszan et al., Reference Hajszan, Szigeti-Buck, Sallam, Bober, Parducz, Maclusky and Duman2010; Kirby, Reference Kirby2006; Malberg & Duman, Reference Malberg and Duman2003; Maudhuit et al., Reference Maudhuit, Prévot, Dangoumau, Martin, Hamon and Adrien1997). Hence, coupling increases in serotonin levels with a simple account of serotonin levels on behavior is unlikely to be able to explain SSRI effects. Indeed, the serotonin system is known to be exquisitely complex, with many different serotonin receptors distinctively distributed (Hansen et al., Reference Hansen, Shafiei, Markello, Smart, Cox, Nørgaard and Misic2022). A possible explanation could be that SSRIs facilitate learning faster in a punishing environment, thus leading to less negative and more positive (or neutral) feedback. It is interesting to consider how this bias towards learning from losses might be linked to mood. Self-reports of happiness are linked to positive prediction errors (Rutledge, Skandali, Dayan, & Dolan, Reference Rutledge, Skandali, Dayan and Dolan2014), suggesting that negative prediction errors might similarly influence negative affective states. In other words, SSRIs might gradually improve mood by enhancing negative expectations through faster loss learning, thereby giving rise to less disappointing and more rewarding experiences.

Finally, improvements in depressive symptoms in the sertraline group were preceded by an early decrease in the aversive Pavlovian bias. In other words, patients on sertraline showed a higher increase in their tendency to withhold an action when facing a loss between baseline and the 2-week follow-up, the higher their depressive symptoms were after 12 weeks.

Overall, the findings draw a complex picture involving aversive processing, sertraline, and symptoms, possibly reflecting the known complexity of the serotonin system. Despite the methodological limitations and the failure to support the preregistered hypotheses, the exploratory data suggest alterations in the processing of losses. A tentative possibility is that SSRIs alter the speed of learning from losses early on, inducing a shift from Pavlovian to instrumental learning when confronted with losses. The alteration in aversive Pavlovian bias was not directly linked to sertraline. However, sertraline appeared to modulate the association between Pavlovian inhibition and future treatment outcomes. Reducing aversive Pavlovian control might hence promote approach responses in a punishing environment, facilitating unexpected rewarding experiences and thus helping to alleviate depressive symptoms.

Limitations

Inclusion in the trial was based on clinical equipoise, i.e. inclusion was based on an uncertainty whether medication could clinically be helpful for a particular person. This may have decreased the power to detect differences from placebo. For mechanistic studies such as the current one, it could be better to study a cohort of typical responders, i.e. patients who are prescribed medication with clinical confidence.

Extensive validation analyses showed that task performance was frequently objectively poor resulting in a large fraction of the task runs being non-informative. Non-informative task runs had to be excluded from analyses because formally they cannot provide information about cognitive mechanisms. We attempted to address this by correcting for baseline variables that were significantly associated with non-informative task runs. The sertraline and the placebo group in the final informative sample continued to be matched on baseline characteristics. Nevertheless, the exclusion of data has severely curtailed the power in the study. Furthermore, because noninformative data was more common in the drug than the non-drug arm, a causal interpretation is no longer warranted.

The poor task performance has important implications for future mechanistic research in this domain. Although the task has been extensively used in laboratory studies (Guitart-Masip et al., Reference Guitart-Masip, Huys, Fuentemilla, Dayan, Duzel and Dolan2012), combined with neuroimaging (Guitart-Masip et al., Reference Guitart-Masip, Fuentemilla, Bach, Huys, Dayan, Dolan and Duzel2011), pharmacological (Guitart-Masip et al., Reference Guitart-Masip, Duzel, Dolan and Dayan2014) and other interventions and adapted (Millner, Gershman, Nock, & den Ouden, Reference Millner, Gershman, Nock and den Ouden2018; Moutoussis et al., Reference Moutoussis, Bullmore, Goodyer, Fonagy, Jones, Dolan and Dayan2018; Swart et al., Reference Swart, Froböse, Cook, Geurts, Frank, Cools and den Ouden2017), it did not prove effective in a longitudinal clinical trial. This reinforces the paramount importance of acceptability and effectiveness testing of cognitive measurements for translational research and calls for an involvement of stakeholders in the design of research tasks.

The relationships between cognitive mechanisms and symptoms were weak. This probably reflects more general findings in the field (Eisenberg et al., Reference Eisenberg, Bissett, Zeynep Enkavi, Li, MacKinnon, Marsch and Poldrack2019), but also the specific limitations around data quality mentioned above which limit the strength of possible associations (Spearman, Reference Spearman1904). We also note that our computational modeling approach was very conservative in that all parameters were allowed to change freely between participants and sessions, with no constraints for within-participant data.

Exploratory results were presented based on passing a conservative significance threshold and their relevance to the preregistered hypotheses. Nevertheless, they should be treated with caution prior to replication.

Conclusion

This study represents a significant exploration of specific reinforcement learning processes in a pragmatic RCT for depression. Specific reinforcement learning mechanisms did show a relationship to aspects of depression and anxiety and its treatment with SSRIs, but this was weak and not as hypothesized a priori. Sertraline influenced aversive processing in the first two treatment weeks by altering how participants learn to execute a passive or active action to avoid loss. Moreover, symptoms were associated with aversive processing but how this relationship relates to SSRI appears complex. The fact that almost half of the data was non-informative emphasizes the importance of developing patient-acceptable task probes.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291724000837.

Acknowledgements

We would like to thank all the members of the Trial Committee, the Data Management Committee and the Public and Patient Involvement representatives. We would also like to thank colleagues who have contributed to the study through recruitment, administrative help and other advice. Finally, we would like to thank Agnes Norbury, Anahit Mkrtchian, Tore Erdmann, Jiazhou Chen, Jenny Fielder, Anna Hall, Jakub Onysk, Jade Serfaty, and Lana Tymchyk for their feedback on the manuscript.

Author contributions

Gl. L. secured the funding for the clinical trial. Gl. L. and L. D. were responsible for writing the detailed protocol, trial management, and data collection. M. M. provided the software for Go/NoGo experiment and helped with data management. M. M. and L. D. contributed to the training of the researchers. J. M. and Q. J. M. H. performed the computational modeling of the Go/NoGo task and planned the analyses with input from all authors. J. M. and Q. J. M. H. wrote the initial draft of the manuscript. All authors contributed to and approved the final manuscript.

Funding statement

This article summarizes independent research funded by the National Institute for Health Research (NIHR) under its Program Grants for Applied Research, Reference Number RP-PG-061010048. We acknowledge support from the UCLH NIHR BRC. J. M. was supported by an International Max Planck Research School on Computational Methods in Psychiatry and Ageing Research (IMPRS COMP2PSYCH) fellowship.

Competing interests

Q. J. M. H. has obtained a research grant from Koa Health, and consultancy fees from Aya Technologies Limited and Alto Neuroscience. All other authors report no competing interest.

Footnotes

For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) license to any Author Accepted Manuscript version arising.

References

Abela, A. R., Browne, C. J., Sargin, D., Prevot, T. D., Ji, X. D., Li, Z., … Fletcher, P. J. (2020). Median raphe serotonin neurons promote anxiety-like behavior via inputs to the dorsal hippocampus. Neuropharmacology, 168, 107985. doi: 10.1016/j.neuropharm.2020.107985CrossRefGoogle Scholar
Ahmed, N., Bone, J. K., Lewis, G., Freemantle, N., Harmer, C. J., Duffy, L., & Lewis, G. (2022). The effect of sertraline on emotional processing: Secondary analyses of the PANDA randomised controlled trial. Psychological Medicine, 52(13), 28142821. doi: 10.1017/S0033291720004985CrossRefGoogle ScholarPubMed
Alvarez, R. P., Chen, G., Bodurka, J., Kaplan, R., & Grillon, C. (2011). Phasic and sustained fear in humans elicits distinct patterns of brain activity. NeuroImage, 55(1), 389400. doi: 10.1016/j.neuroimage.2010.11.057CrossRefGoogle ScholarPubMed
Amemori, K., & Graybiel, A. M. (2012). Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making. Nature Neuroscience, 15(5), 776785. doi: 10.1038/nn.3088CrossRefGoogle Scholar
Amo, R., Fredes, F., Kinoshita, M., Aoki, R., Aizawa, H., Agetsuma, M., … Okamoto, H. (2014). The habenulo-raphe serotonergic circuit encodes an aversive expectation value essential for adaptive active avoidance of danger. Neuron, 84(5), 10341048. doi: 10.1016/j.neuron.2014.10.035CrossRefGoogle ScholarPubMed
Bandelow, B., Reitt, M., Röver, C., Michaelis, S., Görlich, Y., & Wedekind, D. (2015). Efficacy of treatments for anxiety disorders: A meta-analysis. International Clinical Psychopharmacology, 30(4), 183192. doi: 10.1097/YIC.0000000000000078CrossRefGoogle ScholarPubMed
Bari, A., Theobald, D. E., Caprioli, D., Mar, A. C., Aidoo-Micah, A., Dalley, J. W., & Robbins, T. W. (2010). Serotonin modulates sensitivity to reward and negative feedback in a probabilistic reversal learning task in rats. Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology, 35(6), 12901301. doi: 10.1038/npp.2009.233CrossRefGoogle Scholar
Barnhart, W. J., Makela, E. H., & Latocha, M. J. (2004). SSRI-induced apathy syndrome: A clinical review. Journal of Psychiatric Practice, 10(3), 196199. doi: 10.1097/00131746-200405000-00010CrossRefGoogle ScholarPubMed
Beck, A. T., Steer, R. A., & Brown, G. K. (1996). Manual for the Beck depression inventory-II. San Antonio, TX: Psychological Corporation.Google Scholar
Bogowicz, P., Curtis, H. J., Walker, A. J., Cowen, P., Geddes, J., & Goldacre, B. (2021). Trends and variation in antidepressant prescribing in English primary care: A retrospective longitudinal study. BJGP Open, 5(4), BJGPO.2021.0020. doi: 10.3399/BJGPO.2021.0020CrossRefGoogle ScholarPubMed
Bolles, R. C. (1970). Species-specific defense reactions and avoidance learning. Psychological Review, 77(1), 3248. doi: 10.1037/h0028589CrossRefGoogle Scholar
Botvinick, M., Ritter, S., Wang, J. X., Kurth-Nelson, Z., Blundell, C., & Hassabis, D. (2019). Reinforcement learning, fast and slow. Trends in Cognitive Sciences, 23(5), 408422. doi: 10.1016/j.tics.2019.02.006CrossRefGoogle ScholarPubMed
Boureau, Y. L., & Dayan, P. (2011). Opponency revisited: Competition and cooperation between dopamine and serotonin. Neuropsychopharmacology, 36(1), 7497. doi: 10.1038/npp.2010.151CrossRefGoogle ScholarPubMed
Brigman, J. L., Mathur, P., Harvey-White, J., Izquierdo, A., Saksida, L. M., Bussey, T. J., … Holmes, A. (2010). Pharmacological or genetic inactivation of the serotonin transporter improves reversal learning in mice. Cerebral Cortex, 20(8), 19551963. doi: 10.1093/cercor/bhp266CrossRefGoogle ScholarPubMed
Bylsma, L. M., Morris, B. H., & Rottenberg, J. (2008). A meta-analysis of emotional reactivity in major depressive disorder. Clinical psychology review, 28(4), 676691. doi: 10.1016/j.cpr.2007.10.001CrossRefGoogle ScholarPubMed
Chamberlain, S. R., Müller, U., Blackwell, A. D., Clark, L., Robbins, T. W., & Sahakian, B. J. (2006). Neurochemical modulation of response inhibition and probabilistic learning in humans. Science, 311(5762), 861863. doi: 10.1126/science.1121218CrossRefGoogle ScholarPubMed
Cipriani, A., Furukawa, T. A., Salanti, G., Chaimani, A., Atkinson, L. Z., Ogawa, Y., … Geddes, J. R. (2018). Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: A systematic review and network meta-analysis. Lancet, 391(10128), 13571366. doi: 10.1016/S0140-6736(17)32802-7CrossRefGoogle ScholarPubMed
Cohen, J. Y., Amoroso, M. W., & Uchida, N. (2015). Serotonergic neurons signal reward and punishment on multiple timescales. eLife, 4, e06346. doi: 10.7554/eLife.06346CrossRefGoogle ScholarPubMed
Cools, R., Roberts, A. C., & Robbins, T. W. (2008). Serotoninergic regulation of emotional and behavioural control processes. Trends in Cognitive Sciences, 12(1), 3140. doi: 10.1016/j.tics.2007.10.011CrossRefGoogle ScholarPubMed
Crockett, M. J., Clark, L., Apergis-Schoute, A. M., Morein-Zamir, S., & Robbins, T. W. (2012). Serotonin modulates the effects of Pavlovian aversive predictions on response vigor. Neuropsychopharmacology, 37(10), 22442252. doi: 10.1038/npp.2012.75CrossRefGoogle ScholarPubMed
Crockett, M. J., Clark, L., & Robbins, T. W. (2009). Reconciling the role of serotonin in behavioral inhibition and aversion: Acute tryptophan depletion abolishes punishment-induced inhibition in humans. The Journal of Neuroscience, 29(38), 1199311999. doi: 10.1523/JNEUROSCI.2513-09.2009CrossRefGoogle ScholarPubMed
Cuthbert, B. N., & Insel, T. R. (2013). Toward the future of psychiatric diagnosis: The seven pillars of RDoC. BMC Medicine, 11, 126. doi: 10.1186/1741-7015-11-126CrossRefGoogle ScholarPubMed
Dayan, P., & Huys, Q. J. M. (2008). Serotonin, inhibition, and negative mood. PLoS Computational Biology, 4(2), e4. doi: 10.1371/journal.pcbi.0040004CrossRefGoogle ScholarPubMed
Dayan, P., & Huys, Q. J. M. (2009). Serotonin in affective control. Annual Review of Neuroscience, 32, 95126. doi: 10.1146/annurev.neuro.051508.135607CrossRefGoogle ScholarPubMed
Dayan, P., Niv, Y., Seymour, B., & Daw, N. D. (2006). The misbehavior of value and the discipline of the will. Neural Networks, 19(8), 11531160. doi: 10.1016/j.neunet.2006.03.002CrossRefGoogle ScholarPubMed
Dayer, A. (2014). Serotonin-related pathways and developmental plasticity: Relevance for psychiatric disorders. Dialogues in Clinical Neuroscience, 16(1), 2941. doi: 10.31887/DCNS.2014.16.1/adayerCrossRefGoogle ScholarPubMed
Doya, K. (2002). Metalearning and neuromodulation. Neural Networks, 15(4–6), 495506. doi: 10.1016/s0893-6080(02)00044-8CrossRefGoogle Scholar
Doya, K., Miyazaki, K. W., & Miyazaki, K. (2021). Serotonergic modulation of cognitive computations. Current Opinion in Behavioral Sciences, 38, 116123. doi: 10.1016/j.cobeha.2021.02.003CrossRefGoogle Scholar
Drevets, W. C., Savitz, J., & Trimble, M. (2008). The subgenual anterior cingulate cortex in mood disorders. CNS Spectrums, 13(8), 663681. doi: 10.1017/s1092852900013754CrossRefGoogle ScholarPubMed
Duffy, L., Bacon, F., Clarke, C. S., Donkor, Y., Freemantle, N., Gilbody, S., … Lewis, G. (2019). A randomised controlled trial assessing the use of citalopram, sertraline, fluoxetine and mirtazapine in preventing relapse in primary care patients who are taking long-term maintenance antidepressants (ANTLER: ANTidepressants to prevent reLapse in dEpRession): Study protocol for a randomised controlled trial. Trials, 20(1), 319. doi: 10.1186/s13063-019-3390-8CrossRefGoogle ScholarPubMed
Eisenberg, I. W., Bissett, P. G., Zeynep Enkavi, A., Li, J., MacKinnon, D. P., Marsch, L. A., & Poldrack, R. A. (2019). Uncovering the structure of self-regulation through data-driven ontology discovery. Nature Communications, 10(1), 2319. doi: 10.1038/s41467-019-10301-1CrossRefGoogle ScholarPubMed
Eshel, N., & Roiser, J. P. (2010). Reward and punishment processing in depression. Biological Psychiatry, 68(2), 118124. doi: 10.1016/j.biopsych.2010.01.027CrossRefGoogle Scholar
Evans-Lacko, S., Aguilar-Gaxiola, S., Al-Hamzawi, A., Alonso, J., Benjet, C., Bruffaerts, R., … Thornicroft, G. (2018). Socio-economic variations in the mental health treatment gap for people with anxiety, mood, and substance use disorders: Results from the WHO World Mental Health (WMH) surveys. Psychological Medicine, 48(9), 15601571. doi: 10.1017/S0033291717003336CrossRefGoogle Scholar
Garakani, A., Murrough, J. W., Freire, R. C., Thom, R. P., Larkin, K., Buono, F. D., & Iosifescu, D. V. (2020). Pharmacotherapy of anxiety disorders: Current and emerging treatment options. Frontiers in Psychiatry, 11, 595584. doi: 10.3389/fpsyt.2020.595584CrossRefGoogle ScholarPubMed
Geurts, D. E., Huys, Q. J. M., den Ouden, H. E., & Cools, R. (2013a). Aversive Pavlovian control of instrumental behavior in humans. Journal of Cognitive Neuroscience, 25(9), 14281441. doi: 10.1162/jocn_a_00425CrossRefGoogle Scholar
Geurts, D. E., Huys, Q. J. M., den Ouden, H. E., & Cools, R. (2013b). Serotonin and aversive Pavlovian control of instrumental behavior in humans. The Journal of Neuroscience, 33(48), 1893218939. doi: 10.1523/JNEUROSCI.2749-13.2013CrossRefGoogle ScholarPubMed
Guitart-Masip, M., Duzel, E., Dolan, R., & Dayan, P. (2014). Action versus valence in decision making. Trends in Cognitive Sciences, 18(4), 194202. doi: 10.1016/j.tics.2014.01.003CrossRefGoogle ScholarPubMed
Guitart-Masip, M., Fuentemilla, L., Bach, D. R., Huys, Q. J. M., Dayan, P., Dolan, R. J., & Duzel, E. (2011). Action dominates valence in anticipatory representations in the human striatum and dopaminergic midbrain. The Journal of Neuroscience, 31(21), 78677875. doi: 10.1523/JNEUROSCI.6376-10.2011CrossRefGoogle ScholarPubMed
Guitart-Masip, M., Huys, Q. J. M., Fuentemilla, L., Dayan, P., Duzel, E., & Dolan, R. J. (2012). Go and no-go learning in reward and punishment: Interactions between affect and effect. NeuroImage, 62(1), 154166. doi: 10.1016/j.neuroimage.2012.04.024CrossRefGoogle Scholar
Hajszan, T., Szigeti-Buck, K., Sallam, N. L., Bober, J., Parducz, A., Maclusky, N. J., … Duman, R. S. (2010). Effects of estradiol on learned helplessness and associated remodeling of hippocampal spine synapses in female rats. Biological Psychiatry, 67(2), 168174. doi: 10.1016/j.biopsych.2009.08.017CrossRefGoogle Scholar
Halahakoon, D. C., Kieslich, K., O'Driscoll, C., Nair, A., Lewis, G., & Roiser, J. P. (2020). Reward-processing behavior in depressed participants relative to healthy volunteers: A systematic review and meta-analysis. JAMA Psychiatry, 77(12), 12861295. doi: 10.1001/jamapsychiatry.2020.2139CrossRefGoogle Scholar
Hansen, J. Y., Shafiei, G., Markello, R. D., Smart, K., Cox, S. M. L., Nørgaard, M., … Misic, B. (2022). Mapping neurotransmitter systems to the structural and functional organization of the human neocortex. Nature Neuroscience, 25(11), 15691581. doi: 10.1038/s41593-022-01186-3CrossRefGoogle Scholar
Harmer, C. J. (2013). Emotional processing and antidepressant action. Current Topics in Behavioral Neurosciences, 14, 209222. doi: 10.1007/7854_2012_210CrossRefGoogle Scholar
Harmer, C. J., Cowen, P. J., & Goodwin, G. M. (2011). Efficacy markers in depression. Journal of Psychopharmacology, 25(9), 11481158. doi: 10.1177/0269881110367722CrossRefGoogle Scholar
Harmer, C. J., Duman, R. S., & Cowen, P. J. (2017). How do antidepressants work? New perspectives for refining future treatment approaches. The Lancet Psychiatry, 4(5), 409418. doi: 10.1016/S2215-0366(17)30015-9CrossRefGoogle ScholarPubMed
Harmer, C. J., Goodwin, G. M., & Cowen, P. J. (2009a). Why do antidepressants take so long to work? A cognitive neuropsychological model of antidepressant drug action. The British Journal of Psychiatry, 195(2), 102108. doi: 10.1192/bjp.bp.108.051193CrossRefGoogle Scholar
Harmer, C. J., O'Sullivan, U., Favaron, E., Massey-Chase, R., Ayres, R., Reinecke, A., … Cowen, P. J. (2009b). Effect of acute antidepressant administration on negative affective bias in depressed patients. The American Journal of Psychiatry, 166(10), 11781184. doi: 10.1176/appi.ajp.2009.09020149CrossRefGoogle ScholarPubMed
Hasler, G., Fromm, S., Alvarez, R. P., Luckenbaugh, D. A., Drevets, W. C., & Grillon, C. (2007). Cerebral blood flow in immediate and sustained anxiety. The Journal of Neuroscience, 27(23), 63136319. doi: 10.1523/JNEUROSCI.5369-06.2007CrossRefGoogle ScholarPubMed
Hebart, M. N., & Gläscher, J. (2015). Serotonin and dopamine differentially affect appetitive and aversive general Pavlovian-to-instrumental transfer. Psychopharmacology, 232(2), 437451. doi: 10.1007/s00213-014-3682-3CrossRefGoogle ScholarPubMed
Hirschfeld, R. M. (2001). The comorbidity of major depression and anxiety disorders: Recognition and management in primary care. Primary Care Companion to the Journal of Clinical Psychiatry, 3(6), 244254. doi: 10.4088/pcc.v03n0609Google ScholarPubMed
Huys, Q. J. M., Browning, M., Paulus, M. P., & Frank, M. J. (2021). Advances in the computational understanding of mental illness. Neuropsychopharmacology, 46(1), 319. doi: 10.1038/s41386-020-0746-4CrossRefGoogle ScholarPubMed
Huys, Q. J. M., Cools, R., Gölzer, M., Friedel, E., Heinz, A., Dolan, R. J., & Dayan, P. (2011a). Disentangling the roles of approach, activation and valence in instrumental and Pavlovian responding. PLoS Computational Biology, 7(4), e1002028. doi: 10.1371/journal.pcbi.1002028CrossRefGoogle ScholarPubMed
Huys, Q. J. M., Eshel, N., O'Nions, E., Sheridan, L., Dayan, P., & Roiser, J. P. (2012). Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLoS Computational Biology, 8(3), e1002410. doi: 10.1371/journal.pcbi.1002410CrossRefGoogle ScholarPubMed
Huys, Q. J. M., Gölzer, M., Friedel, E., Heinz, A., Cools, R., Dayan, P., & Dolan, R. J. (2016). The specificity of Pavlovian regulation is associated with recovery from depression. Psychological Medicine, 46(5), 10271035. doi: 10.1017/S0033291715002597CrossRefGoogle ScholarPubMed
Huys, Q. J. M., Moutoussis, M., & Williams, J. (2011b). Are computational models of any use to psychiatry? Neural Networks, 24(6), 544551. doi: 10.1016/j.neunet.2011.03.001CrossRefGoogle ScholarPubMed
Huys, Q. J. M., Russek, E. M., Abitante, G., Kahnt, T., & Gollan, J. K. (2022). Components of behavioral activation therapy for depression engage specific reinforcement learning mechanisms in a pilot study. Computational Psychiatry, 6(1), 238255. doi: 10.5334/cpsy.81CrossRefGoogle ScholarPubMed
Kendrick, T., Stuart, B., Newell, C., Geraghty, A. W., & Moore, M. (2015). Did NICE guidelines and the Quality Outcomes Framework change GP antidepressant prescribing in England? Observational study with time trend analyses 2003–2013. Journal of Affective Disorders, 186, 171177. doi: 10.1016/j.jad.2015.06.052CrossRefGoogle Scholar
Kirby, K. J. (2006). The effects of subchronic and chronic SSRI treatments on learned helplessness behavior in rats [Master's thesis]. University of Colorado, Boulder, CO, USA. ISBN: 9780542599736.Google Scholar
Kraus, C., Castrén, E., Kasper, S., & Lanzenberger, R. (2017). Serotonin and neuroplasticity – links between molecular, functional and structural pathophysiology in depression. Neuroscience and Biobehavioral Reviews, 77, 317326. doi: 10.1016/j.neubiorev.2017.03.007CrossRefGoogle ScholarPubMed
Kroenke, K., Spitzer, R. L., & Williams, J. B. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606613. doi: 10.1046/j.1525-1497.2001.016009606.xCrossRefGoogle ScholarPubMed
Lally, N., Huys, Q. J. M., Eshel, N., Faulkner, P., Dayan, P., & Roiser, J. P. (2017). The neural basis of aversive Pavlovian guidance during planning. The Journal of Neuroscience, 37(42), 1021510229. doi: 10.1523/JNEUROSCI.0085-17.2017CrossRefGoogle ScholarPubMed
Lan, D. C. L., & Browning, M. (2022). What can reinforcement learning models of dopamine and serotonin tell us about the action of antidepressants? Computational Psychiatry, 6(1), 166188. doi: 10.5334/cpsy.83CrossRefGoogle Scholar
Langdon, A., Botvinick, M., Nakahara, H., Tanaka, K., Matsumoto, M., & Kanai, R. (2022). Meta-learning, social cognition and consciousness in brains and machines. Neural Networks, 145, 8089. doi: 10.1016/j.neunet.2021.10.004CrossRefGoogle Scholar
LeDoux, J. E., Moscarello, J., Sears, R., & Campese, V. (2017). The birth, death and resurrection of avoidance: A reconceptualization of a troubled paradigm. Molecular Psychiatry, 22(1), 2436. doi: 10.1038/mp.2016.166CrossRefGoogle ScholarPubMed
Lewis, G., Duffy, L., Ades, A., Amos, R., Araya, R., Brabyn, S., … Lewis, G. (2019). The clinical effectiveness of sertraline in primary care and the role of depression severity and duration (PANDA): A pragmatic, double-blind, placebo-controlled randomised trial. The Lancet Psychiatry, 6(11), 903914. doi: 10.1016/S2215-0366(19)30366-9CrossRefGoogle Scholar
Maia, T. V., & Frank, M. J. (2011). From reinforcement learning models to psychiatric and neurological disorders. Nature Neuroscience, 14(2), 154162. doi: 10.1038/nn.2723CrossRefGoogle ScholarPubMed
Malberg, J. E., & Duman, R. S. (2003). Cell proliferation in adult hippocampus is decreased by inescapable stress: Reversal by fluoxetine treatment. Neuropsychopharmacology, 28(9), 15621571. doi: 10.1038/sj.npp.1300234CrossRefGoogle ScholarPubMed
Marazziti, D., Mucci, F., Tripodi, B., Carbone, M. G., Muscarella, A., Falaschi, V., & Baroni, S. (2019). Emotional blunting, cognitive impairment, bone fractures, and bleeding as possible side effects of long-term use of SSRIs. Clinical Neuropsychiatry, 16(2), 7585.Google ScholarPubMed
Martell, C. R., Dimidjian, S., & Herman-Dunn, R. (2010). Behavioral activation for depression: A clinician's guide. New York: Guilford Press.Google Scholar
Maudhuit, C., Prévot, E., Dangoumau, L., Martin, P., Hamon, M., & Adrien, J. (1997). Antidepressant treatment in helpless rats: Effect on the electrophysiological activity of raphe dorsalis serotonergic neurons. Psychopharmacology, 130(3), 269275. doi: 10.1007/s002130050239CrossRefGoogle ScholarPubMed
Mayberg, H. S., Lozano, A. M., Voon, V., McNeely, H. E., Seminowicz, D., Hamani, C., … Kennedy, S. H. (2005). Deep brain stimulation for treatment-resistant depression. Neuron, 45(5), 651660. doi: 10.1016/j.neuron.2005.02.014CrossRefGoogle ScholarPubMed
McCabe, C., Cowen, P. J., & Harmer, C. J. (2009). Neural representation of reward in recovered depressed patients. Psychopharmacology, 205(4), 667677. doi: 10.1007/s00213-009-1573-9CrossRefGoogle ScholarPubMed
McGraw, K. O., & Wong, S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 3046. doi: 10.1037/1082-989X.1.1.30CrossRefGoogle Scholar
Michely, J., Eldar, E., Erdman, A., Martin, I. M., & Dolan, R. J. (2022). Serotonin modulates asymmetric learning from reward and punishment in healthy human volunteers. Communications Biology, 5(1), 812. doi: 10.1038/s42003-022-03690-5CrossRefGoogle ScholarPubMed
Michely, J., Eldar, E., Martin, I. M., & Dolan, R. J. (2020). A mechanistic account of serotonin's impact on mood. Nature Communications, 11(1), 2335. doi: 10.1038/s41467-020-16090-2CrossRefGoogle ScholarPubMed
Millner, A. J., Gershman, S. J., Nock, M. K., & den Ouden, H. E. M. (2018). Pavlovian control of escape and avoidance. Journal of Cognitive Neuroscience, 30(10), 13791390. doi: 10.1162/jocn_a_01224CrossRefGoogle Scholar
Mkrtchian, A., Aylward, J., Dayan, P., Roiser, J. P., & Robinson, O. J. (2017). Modeling avoidance in mood and anxiety disorders using reinforcement learning. Biological Psychiatry, 82(7), 532539. doi: 10.1016/j.biopsych.2017.01.017CrossRefGoogle Scholar
Morris, S. E., Sanislow, C. A., Pacheco, J., Vaidyanathan, U., Gordon, J. A., & Cuthbert, B. N. (2022). Revisiting the seven pillars of RDoC. BMC Medicine, 20(1), 220. doi: 10.1186/s12916-022-02414-0CrossRefGoogle ScholarPubMed
Moutoussis, M., Bullmore, E. T., Goodyer, I. M., Fonagy, P., Jones, P. B., Dolan, R. J., Dayan, P., & Neuroscience in Psychiatry Network Research Consortium (2018). Change, stability, and instability in the Pavlovian guidance of behaviour from adolescence to young adulthood. PLoS Computational Biology, 14(12), e1006679. doi: 10.1371/journal.pcbi.1006679CrossRefGoogle ScholarPubMed
Nord, C. L., Lawson, R. P., Huys, Q. J. M., Pilling, S., & Roiser, J. P. (2018). Depression is associated with enhanced aversive Pavlovian control over instrumental behaviour. Scientific Reports, 8(1), 12582. doi: 10.1038/s41598-018-30828-5CrossRefGoogle ScholarPubMed
Ohmura, Y., Tanaka, K. F., Tsunematsu, T., Yamanaka, A., & Yoshioka, M. (2014). Optogenetic activation of serotonergic neurons enhances anxiety-like behaviour in mice. The International Journal of Neuropsychopharmacology, 17(11), 17771783. doi: 10.1017/S1461145714000637CrossRefGoogle Scholar
Petty, F., Kramer, G., & Moeller, M. (1994). Does learned helplessness induction by haloperidol involve serotonin mediation? Pharmacology, Biochemistry, and Behavior, 48(3), 671676. doi: 10.1016/0091-3057(94)90330-1CrossRefGoogle ScholarPubMed
Pike, A. C., & Robinson, O. J. (2022). Reinforcement learning in patients with mood and anxiety disorders vs control individuals: A systematic review and meta-analysis. JAMA Psychiatry, 79(4), 313322. doi: 10.1001/jamapsychiatry.2022.0051CrossRefGoogle ScholarPubMed
Pizzagalli, D. A., Jahn, A. L., & O'Shea, J. P. (2005). Toward an objective characterization of an anhedonic phenotype: A signal-detection approach. Biological Psychiatry, 57(4), 319327. doi: 10.1016/j.biopsych.2004.11.026CrossRefGoogle ScholarPubMed
Pizzagalli, D. A., Smoski, M., Ang, Y. S., Whitton, A. E., Sanacora, G., Mathew, S. J., … Krystal, A. D. (2020). Selective kappa-opioid antagonism ameliorates anhedonic behavior: Evidence from the Fast-fail Trial in Mood and Anxiety Spectrum Disorders (FAST-MAS). Neuropsychopharmacology, 45(10), 16561663. doi: 10.1038/s41386-020-0738-4CrossRefGoogle ScholarPubMed
Price, J., Cole, V., & Goodwin, G. M. (2009). Emotional side-effects of selective serotonin reuptake inhibitors: Qualitative study. The British Journal of Psychiatry, 195(3), 211217. doi: 10.1192/bjp.bp.108.051110CrossRefGoogle ScholarPubMed
Ramirez-Mahaluf, J. P., Perramon, J., Otal, B., Villoslada, P., & Compte, A. (2018). Subgenual anterior cingulate cortex controls sadness-induced modulations of cognitive and emotional network hubs. Scientific Reports, 8(1), 8566. doi: 10.1038/s41598-018-26317-4CrossRefGoogle ScholarPubMed
Robinson, O. J., Overstreet, C., Allen, P. S., Letkiewicz, A., Vytal, K., Pine, D. S., & Grillon, C. (2013). The role of serotonin in the neurocircuitry of negative affective bias: Serotonergic modulation of the dorsal medial prefrontal-amygdala ‘aversive amplification’ circuit. NeuroImage, 78, 217223. doi: 10.1016/j.neuroimage.2013.03.075CrossRefGoogle ScholarPubMed
Roiser, J. P., Elliott, R., & Sahakian, B. J. (2012a). Cognitive mechanisms of treatment in depression. Neuropsychopharmacology, 37(1), 117136. doi: 10.1038/npp.2011.183CrossRefGoogle ScholarPubMed
Roiser, J. P., Levy, J., Fromm, S. J., Goldman, D., Hodgkinson, C. A., Hasler, G., … Drevets, W. C. (2012b). Serotonin transporter genotype differentially modulates neural responses to emotional words following tryptophan depletion in patients recovered from depression and healthy volunteers. Journal of Psychopharmacology, 26(11), 14341442. doi: 10.1177/0269881112442789CrossRefGoogle ScholarPubMed
Rutledge, R. B., Skandali, N., Dayan, P., & Dolan, R. J. (2014). A computational and neural model of momentary subjective well-being. Proceedings of the National Academy of Sciences of the United States of America, 111(33), 1225212257. doi: 10.1073/pnas.1407535111CrossRefGoogle ScholarPubMed
Salaminios, G., Duffy, L., Ades, A., Araya, R., Button, K. S., Churchill, R., … Lewis, G. (2017). A randomised controlled trial assessing the severity and duration of depressive symptoms associated with a clinically significant response to sertraline versus placebo, in people presenting to primary care with depression (PANDA trial): Study protocol for a randomised controlled trial. Trials, 18(1), 496. doi: 10.1186/s13063-017-2253-4CrossRefGoogle ScholarPubMed
Scholl, J., Kolling, N., Nelissen, N., Browning, M., Rushworth, M. F., & Harmer, C. J. (2017). Beyond negative valence: 2-week administration of a serotonergic antidepressant enhances both reward and effort learning signals. PLoS Biology, 15(2), e2000756. doi: 10.1371/journal.pbio.2000756CrossRefGoogle ScholarPubMed
Scholz, V., Hook, R. W., Kandroodi, M. R., Algermissen, J., Ioannidis, K., Christmas, D., … den Ouden, H. E. M. (2022). Cortical dopamine reduces the impact of motivational biases governing automated behaviour. Neuropsychopharmacology, 47(8), 15031512. doi: 10.1038/s41386-022-01291-8CrossRefGoogle ScholarPubMed
Spearman, C. (1904). The proof and measurement of association between two things. The American Journal of Psychology, 15(1), 72101. doi: 10.2307/1422689CrossRefGoogle Scholar
Spitzer, R. L., Kroenke, K., Williams, J. B., & Löwe, B. (2006). A brief measure for assessing generalized anxiety disorder: The GAD-7. Archives of Internal Medicine, 166(10), 10921097. doi: 10.1001/archinte.166.10.1092CrossRefGoogle ScholarPubMed
Steele, J. D., Kumar, P., & Ebmeier, K. P. (2007). Blunted response to feedback information in depressive illness. Brain, 130(Pt 9), 23672374. doi: 10.1093/brain/awm150CrossRefGoogle Scholar
Swart, J. C., Froböse, M. I., Cook, J. L., Geurts, D. E., Frank, M. J., Cools, R., & den Ouden, H. E. (2017). Catecholaminergic challenge uncovers distinct Pavlovian and instrumental mechanisms of motivated (in)action. eLife, 6, e22169. doi: 10.7554/eLife.22169CrossRefGoogle ScholarPubMed
Tanaka, S. C., Schweighofer, N., Asahi, S., Shishida, K., Okamoto, Y., Yamawaki, S., & Doya, K. (2007). Serotonin differentially regulates short- and long-term prediction of rewards in the ventral and dorsal striatum. PLoS One, 2(12), e1333. doi: 10.1371/journal.pone.0001333CrossRefGoogle Scholar
Tanaka, S. C., Shishida, K., Schweighofer, N., Okamoto, Y., Yamawaki, S., & Doya, K. (2009). Serotonin affects association of aversive outcomes to past actions. The Journal of Neuroscience, 29(50), 1566915674. doi: 10.1523/JNEUROSCI.2799-09.2009CrossRefGoogle ScholarPubMed
Vanschoren, J. (2019). Meta-learning. In Hutter, F., Kotthoff, L., & Vanschoren, J. (Eds.), Automated machine learning: Methods, systems, challenges, the Springer series on challenges in machine learning (pp. 3561). Cham: Springer International Publishing. doi: 10.1007/978-3-030-05318-5_9CrossRefGoogle Scholar
Figure 0

Figure 1. Task and performance. (a) The Go/NoGo task consisted of four different conditions. On each trial one of four possible fractal images was shown. Actions were required in response to a circle that followed the fractal image after a variable delay. After a brief delay, the outcome was presented: a green upward arrow for a win, a red downward arrow for a loss, or a horizontal bar for a neutral outcome. In the go-to-win condition, pressing the key (‘go’) led to a reward with 80% and a neutral outcome with 20% probability, vice versa if they did not press the key (‘nogo’). In the go-to-avoid condition, pressing the key (‘go’) led to a neutral outcome with 80% and a loss with 20% probability. In the nogo-to-win, not pressing the key (‘nogo’) led to a reward with 80% and a neutral outcome with 20% probability. In the nogo-to-avoid condition, not pressing the key (‘nogo’ response) led to a neutral outcome with 80% and a loss with 20% probability. Each task administration consisted of 96 trials, with 24 trials per condition. (b) Mean percentage of correct responses in each of the four conditions. Black dots depict participants and black error bars depict standard deviation of the mean (s.d.). Dashed lines depict chance level. Post hoc comparisons were implemented by means of repeated measures t tests showing a significant difference in accuracy between Pavlovian congruent (got to win and nogo to avoid) and incongruent conditions (go to avoid and nogo to win). Significance ∗ ≤ 0.05, ∗∗ ≤ 0.01, ∗∗∗ ≤ 0.001, ∗∗∗∗ ≤ 0.0001.

Figure 1

Table 1. Baseline characteristics for participants providing informative Go/NoGo task data (N = 435)

Figure 2

Figure 2. Computational modeling of the Go/NoGo task. (a) shows the differences in integrated Bayesian Information Criterion (iBIC) scores for all models tested compared to the most parsimonious model (red star), where a smaller iBIC score indicates a more parsimonious model. All models are modified Q-learning models (Rescorla Wagner – RW) with two pairs of action-values (‘go’ and ‘nogo’) for each stimulus. The y-axis shows the number of free parameters for each model. The most parsimonious model includes separate learning rates for rewards and punishments, win and loss sensitivities, appetitive and aversive Pavlovian biases, irreducible noise, and a constant bias factor added to the action-value for ‘go’. (b) shows the histogram of the difference between the integrated loglikelihood (iLL) of the most parsimonious model and the iLL of the random baseline model. Datasets were declared as informative if the data was more than three times more likely to have occurred under the most parsimonious model (vertical red dashed line). (c) The four subplots show the average learning curves in blue (averaged over participants; solid line) for each condition separately. Each row of the raster images shows the choices of each participant. ‘Go’ responses are depicted in white, and ‘nogo’ responses are depicted in grey. Additionally, the average ‘go’ probability was separated into included datasets (orange) and excluded datasets (green). The solid line refers to empirical data and the dashed line to simulated data from the most parsimonious model. Informative datasets (orange) show that participants, on average, seem to learn over trials, which can be captured qualitatively well by the most parsimonious model. In contrast, the average ‘go’ probability of non-informative/excluded datasets (green) appears to have no temporal relation, hence showing no learning over trials. Further, it is well captured by the random baseline model.

Figure 3

Table 2. Mixed-effect linear models testing our pre-registered hypotheses (only informative data Ntask runs = 886; Npatients = 435, 66% of those randomized)

Figure 4

Figure 3. Effects of sertraline on RL parameters. (a) Shows the aversive Pavlovian bias at baseline and at the follow-ups separated into drug groups (blue, left = placebo; red, right = sertraline). (b) Shows the change in aversive Pavlovian bias between sessions separately for the drug groups. (c) Early changes in the aversive Pavlovian bias predict treatment outcome. This figure shows the relation between the change from baseline to week two in the aversive Pavlovian bias and log-transformed BDI total score (only of participants who had an informative task run at baseline and week 2). In blue the placebo group and in red the sertraline group. An interaction effect was observed between group and early change in the aversive Pavlovian predicting depression at 12 weeks driven by a significant association between the early change and log-transformed BDI total score at 12 weeks. (blue, left = placebo; red, right = sertraline). (d) Shows the loss learning rate at baseline and at the follow-ups separated into drug groups (blue, left = placebo; red, right = sertraline). (e) Shows the change in loss learning rate between sessions separately for the drug groups. Significance *  ≤  0.05, **  ≤  0.01, ***  ≤  0.001, ****  ≤  0.0001.

Supplementary material: File

Malamud et al. supplementary material

Malamud et al. supplementary material
Download Malamud et al. supplementary material(File)
File 1.7 MB