Hostname: page-component-68c7f8b79f-m4fzj Total loading time: 0 Render date: 2025-12-24T16:53:31.421Z Has data issue: false hasContentIssue false

Economic evaluation of digitally supported therapy for people with psychosis who hear distressing voices: the AVATAR2 trial

Published online by Cambridge University Press:  15 December 2025

Paul McCrone*
Affiliation:
Institute for Lifecourse Development, University of Greenwich, London, UK
Evdoxia Gkaintatzi
Affiliation:
Institute for Lifecourse Development, University of Greenwich, London, UK Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
Thomas Ward
Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK South London & Maudsley NHS Foundation Trust, London, UK
Clementine J. Edwards
Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK South London & Maudsley NHS Foundation Trust, London, UK
Hassan Jafari
Affiliation:
Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Richard Emsley
Affiliation:
Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Mark Huckvale
Affiliation:
Department of Speech, Hearing and Phonetic Sciences, University College London, London, UK
Mar Rus-Calafell
Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK Mental Health Research and Treatment Center, Faculty of Psychology, Ruhr University Bochum, Bochum, Germany
Miriam Fornells-Ambrojo
Affiliation:
Research Department of Clinical, Educational and Health Psychology, University College London, London, UK North East London NHS Foundation Trust, London, UK
Andrew Gumley
Affiliation:
School of Health & Wellbeing, University of Glasgow, Glasgow, UK NHS Greater Glasgow & Clyde, Glasgow, UK
Gillian Haddock
Affiliation:
Division of Psychology and Mental Health, School of Health Sciences, University of Manchester and the Manchester Academic Health Sciences Centre, Manchester, UK Greater Manchester Mental Health NHS Foundation Trust and the Manchester Academic Health Sciences Centre, Manchester, UK
Sandra Bucci
Affiliation:
Division of Psychology and Mental Health, School of Health Sciences, University of Manchester and the Manchester Academic Health Sciences Centre, Manchester, UK Greater Manchester Mental Health NHS Foundation Trust and the Manchester Academic Health Sciences Centre, Manchester, UK
Hamish J. McLeod
Affiliation:
School of Health & Wellbeing, University of Glasgow, Glasgow, UK NHS Greater Glasgow & Clyde, Glasgow, UK
Jeffrey McDonnell
Affiliation:
Research Department of Clinical, Educational and Health Psychology, University College London, London, UK North East London NHS Foundation Trust, London, UK
Moya Clancy
Affiliation:
School of Health & Wellbeing, University of Glasgow, Glasgow, UK NHS Greater Glasgow & Clyde, Glasgow, UK
Michael Fitzsimmons
Affiliation:
Division of Psychology and Mental Health, School of Health Sciences, University of Manchester and the Manchester Academic Health Sciences Centre, Manchester, UK Greater Manchester Mental Health NHS Foundation Trust and the Manchester Academic Health Sciences Centre, Manchester, UK
Alice Montague
Affiliation:
Research Department of Clinical, Educational and Health Psychology, University College London, London, UK North East London NHS Foundation Trust, London, UK
Nikos Xanidis
Affiliation:
School of Health & Wellbeing, University of Glasgow, Glasgow, UK NHS Greater Glasgow & Clyde, Glasgow, UK
Hannah Ball
Affiliation:
Division of Psychology and Mental Health, School of Health Sciences, University of Manchester and the Manchester Academic Health Sciences Centre, Manchester, UK Greater Manchester Mental Health NHS Foundation Trust and the Manchester Academic Health Sciences Centre, Manchester, UK
Thomas K. J. Craig
Affiliation:
South London & Maudsley NHS Foundation Trust, London, UK Department of Health Service and Population Research, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
Philippa A. Garety
Affiliation:
Department of Psychology, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK South London & Maudsley NHS Foundation Trust, London, UK
*
Correspondence: Paul McCrone. Email: p.mccrone@greenwich.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Background

AVATAR therapy, a digitally supported intervention, utilises avatars to promote recovery in people who experience distressing auditory hallucinations. This approach was recently evaluated in a multicentre randomised controlled trial comparing brief (AV-BRF) and extended (AV-EXT) forms of therapy with treatment as usual (TAU). There was evidence for the effectiveness of therapy, particularly for AV-EXT. However, value for money needs to be assessed.

Aims

To compare separately the cost utility of the brief and extended forms of AVATAR therapy with TAU.

Method

In a three-arm randomised controlled trial the use of health services was measured, and costs (2021/2022; pounds sterling) calculated from a health and social care perspective over a 28-week follow-up period. Quality-adjusted life years (QALYs; derived from the 5-level version of the EuroQol 5-Dimension questionnaire) were combined with costs.

Results

AV-BRF resulted in extra costs of £319 (95% CI, −£1558 to £2496), and AV-EXT in lower costs of £1965 (95% CI, −£1912 to £1519), compared with TAU. Over the follow-up, AV-BRF resulted in 0.0159 (95% CI, −0.0103 to 0.0422) and AV-EXT in 0.0173 (95% CI, −0.0049 to 0.0395) more QALYs than TAU. The cost per QALY for AV-BRF compared with TAU was £20 016, while AV-EXT dominated TAU (lower costs and more QALYs).

Conclusions

Neither version of AVATAR had a substantial impact on QALYs. However, AV-EXT did result in reduced care costs − albeit not statistically significant − and was potentially cost-effective compared with TAU. AV-BRF had an incremental cost-effectiveness ratio that indicated lower potential cost-effectiveness. These findings are uncertain, but could still inform decision-making regarding interventions in this field.

Information

Type
Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Royal College of Psychiatrists

The prevalence of schizophrenia has been estimated at 0.28%, and results in impaired quality of life and high levels of disability. Reference Charlson, Ferrari, Santomauro, Diminic, Stockings and Scott1 In addition to these impacts, schizophrenia and related conditions result in substantial economic costs associated with service use and lost work. Reference McDaid and Park2,Reference McCrone, Dhanasiri, Patel, Knapp and Lawton-Smith3 Symptoms of schizophrenia and schizophrenia spectrum disorders include auditory hallucinations (hereafter ‘voices‘), and these affect up to 70% of those with the disorders. Reference Waters, Allen, Aleman, Fernyhough, Woodward and Badcock4,Reference Mawson, Cohen and Berry5 Voices can persist for long periods of time and cause significant distress. While therapy, either psychological or medication-based, can be effective for some individuals, many show an incomplete response. Reference Turner, Burger, Smit, Valmaggia and van der Gaag6,Reference Bighelli, Huhn, Schneider-Thoma, Krause, Reitmeir and Wallis7

Interventions have been developed, and are being evaluated, that specifically address auditory hallucinations. These include an approach (Talking with Voices) that encourages understanding through dialogue with the voices, Reference Longden, Corstens, Bowe, Pyle, Emsley and Peters8 and another (Relating Therapy) that focuses on assertive responses to voices using role-play methods. Reference Lincoln, Pillny, Schlier and Hayward9 Over the past decade, a digitally supported approach (AVATAR therapy) has emerged as an innovative way to help individuals experiencing distressing auditory hallucinations. Reference Leff, Williams, Huckvale, Arbuthnot and Leff10 It involves a therapist and voice-hearer working together to create a digital representation (‘avatar’) as a close match to the main distressing voice. The voice-hearer then engages in a series of active dialogues with their avatar, supported by the therapist, in which the aim is to reduce voice-related distress and build empowerment. The first fully powered trial (AVATAR1) compared seven sessions of AVATAR therapy against supportive counselling. AVATAR therapy showed significant reductions in the severity of voice-hearing experiences at 12 weeks post-therapy (the primary end-point), with no adverse events related to the therapy. Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth11

Based on learning from the AVATAR1 trial, the first three-arm, multi-site randomised controlled trial (RCT) of AVATAR therapy has recently evaluated the brief (AV-BRF) and extended (AV-EXT) forms of AVATAR therapy, each against treatment as usual (TAU) alone. Reference Garety, Edwards, Jafari, Emsley, Huckvale and Rus-Calafell12 The AVATAR2 trial is the first to test effectiveness when therapy was delivered by a large cohort of therapists across geographically diverse sites. The AVATAR2 study found that both versions of AVATAR therapy significantly reduced voice-related distress at post-therapy (16-week) follow-up compared with TAU. AV-EXT therapy exceeded the prespecified threshold for a clinically significant post-treatment change on the primary outcome, while AV-BRF was slightly below this threshold. These effects were not maintained at 28 weeks, thus providing only partial support for the primary hypothesis. There were improvements in the key secondary outcomes of voice severity for both AV-BRF and AV-EXT at 16 weeks, and reduced voice frequency for AV-EXT at both time points. AVATAR therapy was found to be safe, in that there were no serious adverse events determined to be definitely related to receiving therapy.

Assessing the efficacy and effectiveness of an intervention such as AVATAR therapy is essential if it is to be recommended as a treatment for patients with distressing voices in routine care settings. However, it is also crucial to assess value for money to weigh the ‘opportunity cost’ of deployment of AVATAR therapy against alternative uses of available resource. This paper reports an economic evaluation with a primary aim of comparing the cost utility of brief and extended AVATAR therapy each against TAU in terms of quality-adjusted life years (QALYs).

Method

Trial design

Details of the study methods are given elsewhere. Reference Garety, Edwards, Ward, Emsley, Huckvale and McCrone13 This is a three-arm, parallel-group, superiority RCT (ISRCTN Registry no. ISRCTN55682735), with a 1:1:1 allocation and blinded assessors, that has been conducted to evaluate the effectiveness of AVATAR therapy (both brief and extended versions) in individuals diagnosed with schizophrenia spectrum disorder or affective disorder with psychosis (bipolar, or depression with psychosis), when added to TAU compared with TAU alone. The trial received Health Research Authority ethical approval from London-Camberwell St Giles Research Ethics Committee (no. 20/LO/0657). All participants in the trial provided written informed consent.

Setting and sample

Four research sites in the UK were involved in the trial: the Institute of Psychiatry, Psychology & Neuroscience (KCL), the University of Manchester, the University of Glasgow and University College London. Each of these research sites was affiliated with two local National Health Service (NHS) sites, and therapy sessions were primarily conducted in community settings. An important aim of the trial was to test effectiveness when therapy was delivered by a wider workforce, across geographically and demographically diverse locations including front-line mental health services. The study inclusion criteria were: (a) be aged 18 years or over; (b) currently under the care of a specialist mental health team (in- and out-patient settings); (c) have current frequent and distressing voices (as measured by a score of at least 1 on each of the intensity of distress and frequency items of the Psychotic Symptoms Rating Scale (PSYRATS-AH)), persisting for at least 6 months and spoken in English; (d) speak and read English to a sufficient level to provide consent and complete the assessment procedures; and (e) have a clinical diagnosis of schizophrenia spectrum disorder (ICD10 F20–29) or affective disorder with psychotic symptoms (ICD-10 F30–39, subcategories with psychotic symptoms), as determined through clinical records and additional consultation with the clinical team if unclear. Full details of the inclusion and exclusion criteria and flow through the trial are provided in the published protocol and clinical results. Reference Garety, Edwards, Jafari, Emsley, Huckvale and Rus-Calafell12,Reference Garety, Edwards, Ward, Emsley, Huckvale and McCrone13

Interventions

Participants were randomised to one of three groups: AV-EXT plus TAU, AV-BRF plus TAU or TAU alone. Both forms of AVATAR start with an initial assessment session including the avatar creation. AV-BRF then comprises six individual, face-to-face sessions delivered by trained therapists, which involve active avatar dialogues using the AVATAR therapy software where the avatar is voiced by the therapist using voice-transformation software. AV-BRF focuses on reducing anxiety through exposure to the avatar and targeting increased power, control and self-esteem. AV-EXT consists of up to 12 active dialogues and comprises 2 phases. The first phase (typically the first four to five sessions) mirrors AV-BRF. The second phase aims to build a more personalised understanding of the voice within the context of the person’s life history, and includes a wider range of potential therapy targets. Within both AV-BRF and AV-EXT, important therapeutic work also occurs pre- and post-dialogue to support positive change.

Outcomes, service use and costs

The primary clinical outcome measure was the distress dimension of the Psychotic Symptoms Rating Scale (PSYRATS-AH). Reference Haddock, McCarron, Tarrier and Faragher14 For the cost-effectiveness analysis, the combination of health and social care costs with QALYs was undertaken. QALYs were calculated based on the 5-level version of the EuroQol 5-Dimension questionnaire (EQ-5D-5L) scores at 3 specific time points: baseline, week 16 and week 28. The EQ-5D-5L tool assesses 5 domains (mobility, self-care, usual activities, pain/discomfort and anxiety/depression) on a scale from 1 (indicating no problems) to 5 (signifying extreme problems). Reference Herdman, Gudex, Lloyd, Janssen, Kind and Parkin15 The resulting 5-digit score represents a distinct health state, which is then converted to a value ranging from 1 (full health) to 0 (death). We used weights from the earlier EQ-5D-3L instrument utilising the eq5dmap function in Stata. Reference Hernández Alava, Pudney and Wailoo16 Using the area under the curve method, these weights were combined at baseline and each subsequent follow-up point to generate QALYs. This approach assumes a linear change between quality-of-life scores. The costs were measured from a health and social care perspective. Service use was measured with the researcher-administered Client Service Receipt Inventory (CSRI) at baseline, week 16 and week 28. Reference Beecham and Knapp17 Participants were requested to indicate whether, in the period leading up to baseline or since the last assessment, they had utilised particular services including primary care, hospital-based secondary care or social care. If participants reported using these services, they were further queried about the frequency of utilisation. Regarding in-patient care, participants were asked to specify the number of days spent in hospital. Electronic clinical notes were reviewed by unblinded trial coordinators, and additional service use included in the CSRI at each time point. Costs were computed by integrating the information of therapy time received with the relevant national unit cost data for 2021/2022. Reference Jones, Weatherly, Birch, Castelli, Chalkley and Dargan18,19 The cost for the therapist’s time was based on a Band 8 healthcare professional, with an assumed 51% increase to account for non-contact time (based on discussions with colleagues conducting similar work). Given the probable importance of this unit cost, we conducted sensitivity analyses around it (see below). The unit costs used are available in the supplementary information available at https://doi.org/10.1192/bjo.2025.10925. The hardware and software cost depends on the number of potential users, and was estimated at £50 in the base case analysis (varied in sensitivity analyses).

Analysis

Analysis was conducted using Stata 18.0 for Windows (Stata, College Station, Texas, USA; https://www.stata.com/) and R Studio for Windows (R Foundation for Statistical Computing, Vienna, Austria; https://www.R-project.org/). We followed recent recommendations for conducing trial-based cost-effectiveness analyses using R according to Ben et al. Reference Ben, van Dongen, El Alili, Esser, Broulíková and Bosmans20 The main analyses were conducted with multiple imputation for missing data. Imputations were based on chained equations and predictive mean matching from the nearest five neighbours. A comparison was made between the three groups regarding the utilisation of services and their associated costs. The total healthcare costs over the follow-up period were compared between (a) the AV-EXT and TAU groups and (b) the AV-BRF and TAU groups, using seemingly unrelated regression models controlling for baseline costs and study site. QALYs were similarly compared, controlling for baseline utility. (We did not directly compare AV-BRF and AV-EXT groups.) If either arm had lower (higher) costs and produced more (fewer) QALYs than TAU, it was then considered to be ‘dominant’ (‘dominated’). If it had higher (lower) costs and produced more (fewer) QALYs, incremental cost-effectiveness ratios (ICERs) were then produced by dividing incremental costs by incremental QALYs to show the cost incurred (saved) by AV-EXT or AV-BRF to produce one more (fewer) QALY compared with TAU.

Uncertainty was addressed using cost-effectiveness planes and cost-effectiveness acceptability curves (CEACs). These were derived by repeating the multiple imputation procedure for each of 1000 bootstrapped resamples and plotting the incremental cost and QALY combinations against each other, and by generating incremental net benefits. These were defined as the value placed on a QALY (ranging from £0 to £100 000, in £2000 increments) multiplied by the incremental QALY and then subtracting the incremental cost. For each of the 1000 resamples and each QALY value, the groups were ranked in terms of their net benefits, and these data then used to produce the CEACs.

Sensitivity analyses were conducted by (a) basing analyses on available data without multiple imputation using ordinary least-squares regression models (given different numbers with complete cost and QALY data); (b) changing the fixed element of the therapy cost (i.e. hardware, maintenance, etc.) from £50 per recipient of therapy to £75, £100, £125 and £150; and (c) increasing and decreasing therapist costs by 25 and 50% to reflect the possibility of staff of different grades delivering therapy or extra non-contact time being required.

Results

Out of 642 people who were assessed, 345 eligible participants were randomised: 114 to the AV-EXT group, 116 to the AV-BRF group and 115 to TAU. Demographic and clinical characteristics at randomisation were similar for each group, both for the randomised sample and the complete case sample (Table 1). Follow-up service use and cost data were available for all participants at baseline. At 16-week follow-up these were available for 98 (84.5%) AV-BRF, 96 (84.2%) AV-EXT and 104 (90.4%) TAU participants. At 28-week follow-up the figures were 98 (84.5%) AV-BRF, 91 (79.8%) AV-EXT and 102 (88.7%) TAU. EQ-5D-5L baseline data were available for 115 (99.1%) AV-BRF, 112 (98.2%) AV-EXT and 115 (100%) TAU participants. At 16-week follow-up the figures were 96 (82.8%), AV-BRF, 92 (80.7%) AV-EXT and 99 (86.1%) TAU. At 28-week follow-up the figures were 94 (81.0%) AV-BRF, 83 (72.8%) AV-EXT and 99 (86.1%) TAU.

Table 1 Sample characteristics

TAU, treatment as usual; AV-BRF, brief form of AVATAR therapy; AV-EXT, extended form of AVATAR therapy; PSYRATS-AH, Psychotic Symptoms Rating Scale.

Tables 2 –4 show service use and costs (other than for mental health admissions and the intervention) for available cases at each time point. In the period prior to baseline, the most commonly used services were contacts with general practitioners (GPs), psychiatrists and mental health nurses (Table 2). Around a third of participants had contacts with both psychologists and early intervention teams. There were no substantial differences between the three groups in terms of numbers of participants using services. In the period between baseline and 16- and 28-week follow-ups, GPs, psychiatrists and mental health nurses continued to be used more than other services. Use of psychologists fell in each group, although it was slightly higher for TAU participants. The mean number of service contacts among participants with at least one contact at baseline and each follow-up indicated that intensity of service use did not differ noticeably between groups (Table 3). The highest non-in-patient and non-therapy costs at each time point were observed for psychiatrists (Table 4). Most other costs were relatively low, given the small numbers using these services (most participants had zero costs for most services). However, the cost of GP contacts was low even though upwards of half of the participants used this service.

Table 2 Number (%) of participants using services (excluding AVATAR therapy) at each time point

TAU, treatment as usual; AV-BRF, brief form of AVATAR therapy; AV-EXT, extended form of AVATAR therapy; EI, early intervention.

Table 3 Mean (s.d.) number of service contacts for those using services at each time point

TAU, treatment as usual; AV-BRF, brief form of AVATAR therapy; AV-EXT, extended form of AVATAR therapy; EI, early intervention.

Means and standard deviations apply only to those with at least one contact.

Table 4 Mean service costs (pounds sterling) at each time point (complete case sample)

TAU, treatment as usual; AV-BRF, brief form of AVATAR therapy; AV-EXT, extended form of AVATAR therapy; EI, early intervention.

The AV-BRF group had a mean (s.d.) 5.1 (2.4) sessions of therapy, and the AV-EXT had a mean (s.d.) of 8.2 (4.4) sessions. The mean (s.d.) costs of therapist time were £420 (£198) for AV-BRF and £674 (£365) for AV-EXT. Over the combined follow-up period, mental health in-patient admissions occurred for a few participants (8 AV-BRF, 7 AV-EXT and 9 Tau). For those who were admitted, the mean number of days during the follow-up period was 70.0 for AV-BRF, 27.9 for AV-EXT and 76.7 for TAU. Mean costs for this in-patient use were substantially less for AV-EXT (£583) than for either AV-BRF (£1646) or TAU (£2046).

Total costs at baseline were very similar between groups (Table 5). Across all participants with cost data, mean follow-up costs were £4271 for TAU, £4624 for AV-BRF and £4070 for AV-EXT. Following multiple imputation, the incremental costs compared with TAU, controlling for baseline costs and site, were £319 for AV-BRF (95% CI, –£1558 to £2496) and –£196 for AV-EXT (95% CI, –£1912 to £1519). The incremental costs for those with available cost data were £435 for AV-BRF (95% CI, –£1770 to £3319) and £208 for AV-EXT (95% CI, –£1583 to £1929).

Table 5 Cost-effectiveness results

TAU, treatment as usual; AV-BRF, brief form of AVATAR therapy; AV-EXT, extended form of AVATAR therapy; QALYs, quality-adjusted life years; SUR, seemingly unrelated regression; OLS, ordinary least squares.

a. Adjusting for baseline costs and study site.

b. Adjusting for baseline 5-level version of the EuroQol 5-Dimension questionnaire score and study site.

Primary health economic analyses

At baseline, the EQ-5D-5L quality of life scores were comparable across the three groups (Table 6). By the 16-week follow-up, both AVATAR groups showed an improvement in their scores. However, by the 28-week mark there was a slight decrease in scores for both AVATAR groups compared with the 16-week point. EQ-5D-5L scores for TAU remained relatively constant. Based on the multiple imputation analysis, over the combined follow-up, AV-BRF resulted in 0.0159 (95% CI, −0.0103 to 0.0422) more QALYs than TAU, and AV-EXT in 0.0173 (95% CI, −0.0049 to 0.0395) more QALYs than TAU (Table 5). The analysis based on available cases showed incremental QALY gains of 0.0171 (95% CI, −0.0089 to 0.0416) for AV-BRF and 0.0165 (95% CI, −0.0077 to 0.0389) for AV-EXT.

Table 6 Mean (s.d.) health-related quality of life scores derived from the 5-level version of the EuroQol 5-Dimension questionnaire

TAU, treatment as usual; AV-BRF, brief form of AVATAR therapy; AV-EXT, extended form of AVATAR therapy.

Following multiple imputation for missing data, AV-BRF resulted in higher costs and more QALYs than TAU, with an ICER of £20 016 per QALY. AV-EXT dominated TAU (it had lower costs and produced more QALYs). Figures 1 and 2 show cost-effectiveness planes indicating uncertainty in the cost-effectiveness results. In Figure 2, for AV-EXT the most incremental cost–QALY combinations are shown at the top-right (higher costs and better outcomes) and bottom-right (lower cost and better outcomes) quadrants. A similar pattern emerged for AV-BRF, although the incremental costs were generally higher. The cost-effectiveness acceptability curves (Figs. 3 and 4) show that AV-EXT has a higher likelihood of being the most cost-effective option compared with TAU for all values placed on a QALY. AV-BRF has a greater likelihood of being the most cost-effective option compared with TAU for values placed on a QALY exceeding around £20 000.

Fig. 1 Cost-effectiveness plane, brief form of AVATAR therapy versus treatment as usual. QALYs, quality-adjusted life years.

Fig. 2 Cost-effectiveness plane, extended form of AVATAR therapy versus treatment as usual. QALYs, quality-adjusted life years.

Fig. 3 Cost-effectiveness acceptability curve (brief form of AVATAR therapy versus treatment as usual). QALYs, quality-adjusted life years.

Fig. 4 Cost-effectiveness acceptability curve (extended form of AVATAR therapy versus treatment as usual). QALYs, quality-adjusted life years.

At a threshold of £20 000 per QALY (the lower threshold used by the National Institute for Health and Care Excellence (NICE)), AV-BRF had a probability of 0.4943 of being more cost-effective than TAU, and this increased to 0.5534 for a threshold of £30 000. For AV-EXT the probabilities were 0.7188 and 0.7662, respectively.

Sensitivity analyses

When the non-staff costs of therapy (equipment, software, maintenance, etc.) were decreased from £50 per participant to £25, the ICER for AV-BRF compared with TAU fell to £18 657. If these costs increased to £75–£150, the ICER for AV-BRF compared with TAU then increased to £21 374–£25 449. For these alternative non-therapist costs, AV-EXT always dominated TAU.

If therapist costs are reduced by 50%, the ICER for AV-BRF then becomes £52; if these are reduced by 25%, the ICER for AV-BRF then becomes £10 034; and if they are increased by 25 and 50% then the ICER is £29 997 and £39 979, respectively. AV-EXT dominates TAU regarding changes to therapy costs unless these are increased by 50%, when ICER becomes £18 173.

Discussion

This study reports evidence on the cost utility of the brief and extended forms of AVATAR therapy, compared with TAU. We have shown that AV-EXT is cost-effective compared with TAU, in that it has lower costs and produces more QALYs. AV-BRF results in higher costs and has an ICER slightly above £30 000 (the upper threshold used by NICE in England and Wales). These findings offer supplementary evidence to the main trial outcomes, in which AV-EXT therapy exceeded the prespecified threshold for a clinically significant post-treatment change on the primary outcome (AV-BRF was just below), and demonstrated improvements in secondary outcomes prioritised by voice-hearers, including measures of well-being and recovery. However, it is important to note that while AV-EXT emerged as the more cost-effective option compared with TAU, the analysis is based on sample averages meaning that, for some, AV-BRF may be more cost-effective and indeed for others TAU may be more appropriate.

These results were almost entirely influenced by cost differences. Both forms of AVATAR therapy resulted in more QALYs than TAU, but these incremental gains were very small. Although AV-EXT is a more expensive form of therapy than AV-BRF, the AV-EXT trial arm had lower psychiatric in-patient costs during the follow-up period, which resulted in overall cost savings. The use of QALYs and EQ-5D-5L to generate these has long been a source of debate regarding mental health studies. Reference Chisholm, Healey and Knapp21,Reference McCrone, Patel, Knapp, Schene, Koeter and Amaddeo22 The very small QALY gains for AV-EXT and AV-BRF compared with TAU mean that either health-related quality of life was largely unaffected by receipt of the interventions, or that EQ-5D-5L was not sensitive to change in this study. In support of the latter explanation, the lack of observed change in EQ-5DL stands in contrast to the pattern of relevant secondary outcomes reported in the trial, which included a significant positive impact on well-being. The main outcomes from this trial show that, at 16-week follow-up, AB-BRF and AV-EXT improved significantly on the Warwick–Edinburgh Mental Wellbeing Scale, by 2.19 and 6.83 points, respectively, compared with TAU. Reference Garety, Edwards, Jafari, Emsley, Huckvale and Rus-Calafell12 At 28 weeks, AV-EXT improved by 5.1 points compared with TAU while the improvement for AV-BRF (2.03) was not statistically significant. Self-reported recovery showed significant improvements for both groups compared with TAU at both time points. Given that the primary aim was to reduce distress caused by voices, it is perhaps expected to see the biggest impacts on the anxiety and depression domains of EQ-5D-5L. EQ-5D-5L continues to be used widely in mental health studies, although mental health-specific alternatives such as the Recovering Quality of Life questionnaire are also used. Reference Keetharuth, Brazier, Connell, Bjorner, Carlton and Taylor Buck23

Although AV-EXT produced gains compared with TAU and appeared to be cost-effective, it is important to note that completion of treatment for this group was 57.9% compared with 81.9% for AV-BRF. This did not affect the results, in that we attached costs to actual therapy time; however, if completion rates were higher so would be the costs. Of course, it may also then be the case that outcomes could be improved further.

The current study was not designed to consider the complexities involved when moving beyond clinical research trials into real-world deployment of AVATAR therapy. Implementation is recognised as a challenge that many digital interventions, regardless of efficacy and cost-effectiveness evidence, fail to overcome. Reference Greenhalgh, Wherton, Papoutsi, Lynch, Hughes and A’Court24Reference Borges Do Nascimento, Abdulazeem, Vasanthan, Martinez, Zucoloto and Østengaard26 In this study, therapy was delivered by a relatively small number of therapists (19 therapists delivering therapy across 8 NHS trusts), while sustained roll-out of therapy will require training of more people.

AVATAR therapy has recently been recommended for NHS deployment to support the collection of real-world evidence. 27 Therefore, the next steps will be to test the effectiveness and cost-effectiveness of AVATAR therapy when delivered in diverse routine care settings, and to understand the barriers to, and facilitators of, wider implementation.

Limitations

Although this was a large trial and with a reasonable rate of follow-up, there are limitations in regard to the economic evaluation. First, to estimate care costs we partly relied on self-report data from participants. This was unavoidable given the lack of linked data covering all services that were relevant. However, the use of mental health services was checked against records. Second, while the study was large, the use of in-patient care may have been unbalanced due to the small numbers receiving this. Findings for AV-BRF may have been impacted by a small number of in-patient high-cost admissions exerting a disproportionate influence on service costs. This is frequently an issue with economic evaluations, where cost data are inevitably skewed. Third, the main outcome measure used in the analysis was QALYs derived from EQ-5D-5L (see discussion above).

In conclusion, this study has shown that providing AVATAR therapy in an extended form could be cost-effective compared with TAU. There is, however, much uncertainty in the results, and the finding in favour of the therapy is almost entirely due to the lower costs in that group. There is limited evidence currently to suggest that the brief form of AVATAR therapy is cost-effective, although the costs for AV-BRF may have been disproportionately influenced by a relatively small number of high-cost admissions. Based on the recent NICE-EVA recommendation and associated evidence generation plan, the next steps are to test the implementation of AVATAR therapy, including the provision of further evidence of effectiveness and cost-effectiveness, when delivered in diverse routine care settings. The evidence presented in this paper converges with main trial outcomes to suggest that future implementation is primarily guided by the AV-EXT protocol.

Supplementary material

The supplementary material is available online at https://doi.org/10.1192/bjo.2025.10925

Data availability

Data will be made accessible following publication of this paper. A request can be made by academic or clinical researchers to for the purpose of conducting non-commercial, ethically approved research. The research data team will review the request against the conditions set out in the data access agreement. An initial response to requests will be formulated within 1 month. A data access agreement will be drawn up before data can be shared.

Acknowledgements

We thank all participants who took part in the trial and provided their data as well as the clinical teams who referred and supported the participants. We thank the research team who collected data. We are grateful to the members of the Trial Steering Committee and the Data Monitoring and Ethics Committee.

Author contributions

P.A.G., T.W., T.K.J.C., R.E., M.H., M.R.-C. and P.M. conceived and designed the study. P.A.G., T.K.J.C., R.E., M.H., A.G., G.H., M.F.-A. and P.M. obtained the funding. P.A.G. was the study’s principal investigator. T.K.J.C. was the study’s co-principal investigator. P.M. led the health economic evaluation. T.W. led, and T.K.J.C. co-led, treatment provision and training. C.J.E. coordinated, with local site leads J.M., M.C. and M.F., all aspects of trial management, recruitment, data collection and processing. M.H. designed the software. M.F.-A., A.G., G.H., S.B. and H.J.M. were the local site principal investigators. H.B., A.M. and N.X. led the local site treatment training and provision. E.G., R.E., C.J.E. and H.J. had full access to all data. E.G. and P.M. were responsible for the health economic analyses and wrote the first draft of the paper, and comments and approval were received by other authors.

Funding

This research was funded by a Wellcome grant (no. 215471/Z/19/Z). The study was also supported by the National Institute for Health and Care Research (NIHR), the Wellcome Trust King’s Clinical Research Facility, the NIHR Maudsley Biomedical Research Centre and Maudsley National Health Service (NHS) Foundation Trust, King’s College London (P.A.G. and R.E., no. NIHR203318), the Manchester Biomedical Research Centre (S.B. and G.H., no. NIHR203308), NHS Research Scotland through the Chief Scientist Office and the NHS Scotland Mental Health Network (A.G. and H.J.M., no. NRSMHN/2021/01). The funding bodies and supporters did not have any role in study design, data collection and analysis, data interpretation or writing of the manuscript.

Declaration of interest

M.H. is a shareholder of Avatar Therapy Ltd. T.K.J.C. and P.A.G. are unpaid scientific advisers to Avatar Therapy Ltd. S.B. is a director and shareholder of CareLoop Health Ltd, which develops and markets digital therapeutics for schizophrenia and a digital screening app for postnatal depression. S.B. discloses support for publication of this work from the NIHR research professorship (no. NIHR300794). R.E. discloses support for publication of this work from the NIHR research professorship (no. NIHR300051). M.R.-C. discloses support for the publication of this work from the Sofja Kovalevskaja Award from the Alexander von Humbold Foundation and the Ministry of Education and Research in Germany (SKP2020RusCalafell). The other authors declare no competing interests.

Footnotes

*

Joint last authors.

References

Charlson, FJ, Ferrari, AJ, Santomauro, DF, Diminic, S, Stockings, E, Scott, JG, et al. Global epidemiology and burden of schizophrenia: findings from the Global Burden of Disease Study 2016. Schizophr Bull 2018; 44: 1195–203.CrossRefGoogle ScholarPubMed
McDaid, D, Park, A-L. The Economic Case for Investing in the Prevention of Mental Health Conditions in the UK. Care Policy and Evaluation Centre, Department of Health Policy, London School of Economics and Political Science, 2022 (https://www.mentalhealth.org.uk/sites/default/files/2022-06/MHF-Investing-in-Prevention-Full-Report.pdf).Google Scholar
McCrone, P, Dhanasiri, S, Patel, A, Knapp, M, Lawton-Smith, S. Paying the Price: The Cost of Mental Health Care in England to 2026. Kings Fund, 2008.Google Scholar
Waters, F, Allen, P, Aleman, A, Fernyhough, C, Woodward, TS, Badcock, JC, et al. Auditory hallucinations in schizophrenia and nonschizophrenia populations: a review and integrated model of cognitive mechanisms. Schizophr Bull 2012; 38: 683–93.10.1093/schbul/sbs045CrossRefGoogle ScholarPubMed
Mawson, A, Cohen, K, Berry, K. Reviewing evidence for the cognitive model of auditory hallucinations: the relationship between cognitive voice appraisals and distress during psychosis. Clin Psychol Rev 2010; 30: 248–58.10.1016/j.cpr.2009.11.006CrossRefGoogle ScholarPubMed
Turner, DT, Burger, S, Smit, F, Valmaggia, LR, van der Gaag, M. What constitutes sufficient evidence for case formulation-driven CBT for psychosis? Cumulative meta-analysis of the effect on hallucinations and delusions. Schizophr Bull 2020; 46: 1072–85.CrossRefGoogle ScholarPubMed
Bighelli, I, Huhn, M, Schneider-Thoma, J, Krause, M, Reitmeir, C, Wallis, S, et al. Response rates in patients with schizophrenia and positive symptoms receiving cognitive behavioural therapy: a systematic review and single-group meta-analysis. BMC Psychiatry 2018; 18: 380.10.1186/s12888-018-1964-8CrossRefGoogle ScholarPubMed
Longden, E, Corstens, D, Bowe, S, Pyle, M, Emsley, R, Peters, S, et al. A psychological intervention for engaging dialogically with auditory hallucinations (Talking with Voices): a single-site, randomised controlled feasibility trial. Schizophr Res 2022; 250: 172–9.10.1016/j.schres.2022.11.007CrossRefGoogle ScholarPubMed
Lincoln, TM, Pillny, M, Schlier, B, Hayward, M. RELATE – a randomised feasibility trial of a Relating Therapy module for distressing auditory verbal hallucinations: a study protocol. BMJ Open 2021; 11: e046390.10.1136/bmjopen-2020-046390CrossRefGoogle ScholarPubMed
Leff, J, Williams, G, Huckvale, MA, Arbuthnot, M, Leff, AP. Computer-assisted therapy for medication-resistant auditory hallucinations: proof-of-concept study. Br J Psychiatry 2013; 202: 428–33.10.1192/bjp.bp.112.124883CrossRefGoogle ScholarPubMed
Craig, TK, Rus-Calafell, M, Ward, T, Leff, JP, Huckvale, M, Howarth, E, et al. AVATAR therapy for auditory verbal hallucinations in people with psychosis: a single-blind, randomised controlled trial. Lancet Psychiatry 2018; 5: 3140.CrossRefGoogle ScholarPubMed
Garety, P, Edwards, CJ, Jafari, H, Emsley, R, Huckvale, M, Rus-Calafell, M, et al. Digital AVATAR therapy for distressing voices in psychosis: the phase 2/3 AVATAR2 trial. Nat Med 2024; 30: 3658–68.10.1038/s41591-024-03252-8CrossRefGoogle ScholarPubMed
Garety, P, Edwards, CJ, Ward, T, Emsley, R, Huckvale, M, McCrone, P, et al. Optimising AVATAR therapy for people who hear distressing voices: study protocol for the AVATAR2 multi-centre randomised controlled trial. Trials 2021; 22: 366.10.1186/s13063-021-05301-wCrossRefGoogle ScholarPubMed
Haddock, G, McCarron, J, Tarrier, N, Faragher, EB. Scales to measure dimensions of hallucinations and delusions: the psychotic symptom rating scales (PSYRATS). Psychol Med 1999; 29: 879–89.10.1017/S0033291799008661CrossRefGoogle ScholarPubMed
Herdman, M, Gudex, C, Lloyd, A, Janssen, M, Kind, P, Parkin, D, et al. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res 2011; 20: 1727–36.10.1007/s11136-011-9903-xCrossRefGoogle ScholarPubMed
Hernández Alava, M, Pudney, S, Wailoo, A. Estimating the relationship between EQ-5D-5L and EQ-5D-3L: results from a UK population study. PharmacoEconomics 2023; 41: 199207.CrossRefGoogle ScholarPubMed
Beecham, J, Knapp, MM. Costing psychiatric interventions. In Measuring Mental Health Needs: 163–83. Gaskell, 1992.Google Scholar
Jones, KC, Weatherly, H, Birch, S, Castelli, A, Chalkley, M, Dargan, A, et al. Unit Costs of Health and Social Care 2022 Manual. Personal Social Services Research Unit, University of Kent and Centre for Health Economics, University of York, 2022 (https://kar.kent.ac.uk/100519/).Google Scholar
NHS England. National Cost Collection for the NHS. NHS England, 2025 (https://www.england.nhs.uk/costing-in-the-nhs/national-cost-collection/).Google Scholar
Ben, AJ, van Dongen, JM, El Alili, M, Esser, JL, Broulíková, HM, Bosmans, JE. Conducting trial-based economic evaluations using R: a tutorial. PharmacoEconomics 2023; 41: 1403–13.CrossRefGoogle Scholar
Chisholm, D, Healey, A, Knapp, M. QALYs and mental health care. Soc Psychiatry Psychiatr Epidemiol 1997; 32: 6875.CrossRefGoogle ScholarPubMed
McCrone, P, Patel, A, Knapp, M, Schene, A, Koeter, M, Amaddeo, F, et al. A comparison of SF-6D and EQ-5D utility scores in a study of patients with schizophrenia. J Ment Health Policy Econ 2009; 12: 2731.Google Scholar
Keetharuth, AD, Brazier, J, Connell, J, Bjorner, JB, Carlton, J, Taylor Buck, E, et al. Recovering Quality of Life (ReQoL): a new generic self-reported outcome measure for use with people experiencing mental health difficulties. Br J Psychiatry 2018; 212: 42–9.10.1192/bjp.2017.10CrossRefGoogle ScholarPubMed
Greenhalgh, T, Wherton, J, Papoutsi, C, Lynch, J, Hughes, G, A’Court, C, et al. Beyond adoption: a new framework for theorizing and evaluating nonadoption, abandonment, and challenges to the scale-up, spread, and sustainability of health and care technologies. J Med Internet Res 2017; 19: e367.CrossRefGoogle Scholar
Eisner, E, Faulkner, S, Allan, S, Ball, H, Di Basilio, D, Nicholas, J, et al. Barriers and facilitators of user engagement with digital mental health interventions for people with psychosis or bipolar disorder: systematic review and best-fit framework synthesis. JMIR Ment Health 2025; 12: e65246.10.2196/65246CrossRefGoogle ScholarPubMed
Borges Do Nascimento, IJ, Abdulazeem, H, Vasanthan, LT, Martinez, EZ, Zucoloto, ML, Østengaard, L, et al. Barriers and facilitators to utilizing digital health technologies by healthcare professionals. NPJ Digit Med 2023; 6: 161.10.1038/s41746-023-00899-4CrossRefGoogle ScholarPubMed
National Institute for Health and Care Excellence. Digital Health Technologies to Help Manage Symptoms of Psychosis and Prevent Relapse in Adults and Young People: Early Value Assessment. National Institute for Health and Care Excellence, 2024 (www.nice.org.uk/guidance/hte17).Google Scholar
Figure 0

Table 1 Sample characteristics

Figure 1

Table 2 Number (%) of participants using services (excluding AVATAR therapy) at each time point

Figure 2

Table 3 Mean (s.d.) number of service contacts for those using services at each time point

Figure 3

Table 4 Mean service costs (pounds sterling) at each time point (complete case sample)

Figure 4

Table 5 Cost-effectiveness results

Figure 5

Table 6 Mean (s.d.) health-related quality of life scores derived from the 5-level version of the EuroQol 5-Dimension questionnaire

Figure 6

Fig. 1 Cost-effectiveness plane, brief form of AVATAR therapy versus treatment as usual. QALYs, quality-adjusted life years.

Figure 7

Fig. 2 Cost-effectiveness plane, extended form of AVATAR therapy versus treatment as usual. QALYs, quality-adjusted life years.

Figure 8

Fig. 3 Cost-effectiveness acceptability curve (brief form of AVATAR therapy versus treatment as usual). QALYs, quality-adjusted life years.

Figure 9

Fig. 4 Cost-effectiveness acceptability curve (extended form of AVATAR therapy versus treatment as usual). QALYs, quality-adjusted life years.

Supplementary material: File

McCrone et al. supplementary material

McCrone et al. supplementary material
Download McCrone et al. supplementary material(File)
File 16.3 KB
Submit a response

eLetters

No eLetters have been published for this article.