‘… much of the meaning of a represented piece of information derives from the context in which the information is encoded and decoded. This can be a tremendous advantage. To the extent that the two thinking beings are sharing a common rich context, they may utilize terse signals to communicate complex thoughts.’ Douglas Lenat (1975)[1]
We will not review the entire history of artificial intelligence developments here, but it is clear that recent advances have outpaced initial predictions of when human-level cognitive processing would emerge, and may even have accelerated progress towards general artificial intelligence.[2] For clinicians seeking a comprehensive overview of how the new ‘transformer’-based large language models (LLMs) such as GPT-4 work, we would suggest reading Wolfram[3] as an adjunct to the ideas contained within this paper. By way of definition, an LLM is a type of algorithm that generates text appropriate to a particular context by drawing on patterns in the vast quantities of data on which it has been trained. These learned patterns give the model an apparent ability to reason when it is presented with new information.
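For readers who find a concrete example helpful, the sketch below shows, in simplified form, what an LLM does at inference time: given a context, it repeatedly predicts plausible next tokens on the basis of patterns learned during training. It is a minimal illustration only, using a small open-source model via the Hugging Face transformers library; it is not how GPT-4 or any clinical system is accessed.

```python
# A minimal sketch of what an LLM does: given a context, it repeatedly
# predicts likely next tokens based on patterns learned during training.
# Uses the open-source Hugging Face 'transformers' library and a small
# open model (gpt2) purely for illustration; clinical-grade models such
# as GPT-4 are accessed differently and are far more capable.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

context = "A brief clinical summary: the patient reports low mood and"
completion = generator(context, max_new_tokens=30, do_sample=False)

print(completion[0]["generated_text"])
```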
Use cases for artificial intelligence in mental healthcare remain nascent and limited in scope compared with those in physical medicine, reflecting the differences between the two fields. For example, artificial intelligence is already established for diagnostic purposes in radiology, where initial models have been deployed and consensus governance frameworks developed.[4] We would argue that this disparity is due to both the ontological and the procedural structure of mental healthcare, which we discuss below.
Method
Mental healthcare has unique features
It is important to delineate the distinct features of mental healthcare before considering how a conceptual model might support the development and governance of artificial intelligence systems that can be used safely in the clinical domain. It is also of note that the majority of conversations about the applications of artificial intelligence in mental health services have, to date, focused on clinical ‘expert models’ that replicate the (psychological) therapeutic task via chatbot interfaces. This paper starts from the position that ‘artificial intelligence therapy’ is not likely to be the main use case for generative artificial intelligence in complex mental health provision. Rather, we believe that the use case will centre on expert models that undertake ‘cognitive space’ work such as case formulation, and therefore augment clinician cognition, to improve productivity and increase the time available for direct clinical contact. This is particularly relevant in secondary and tertiary care services, where information load and clinical administration burden are high, and where patient safety is a function of good global awareness of historical and current clinical factors. We also envisage that areas of the world with significant shortages of trained mental health clinicians would benefit from access to a generative artificial intelligence cognitive space that mirrors that of a highly trained clinician, particularly in the context of specialism-specific demand.
Mental healthcare is ontologically and theoretically diverse
Mental healthcare delivery relies on the application of theories of mind and behaviour that are often contested and are pragmatic in nature, to a greater extent than those in physical medicine.[5] Clinical constructs, derived from and constitutive of these theories, are used and manipulated in cognitive space by clinicians. These constructs do not typically map onto the world in the same way as those manipulated by clinicians working in physical medicine. Many, if not most, commonly used mental health ontological constructs (for example, ‘schemata’, ‘automatic thoughts’, ‘overvalued ideas’, ‘primary affect’ and ‘executive function’) are, we would argue, heterotopic to reality. In other words, these ontologies map reality onto often speculative or inferred psychophysical states, and/or onto higher-level factor-analytically, neuropsychologically and/or biologically derived constructs from the fields of clinical psychology and psychiatry. Furthermore, constructs may be ontologically distinct across multiple theoretical areas (e.g. ‘schemata’ is used with variable meaning across different kinds of cognitive therapy and neuropsychological theory). It is therefore challenging to map a negative mental or emotional experience to a measurable causal chain. In contrast, physical medical constructs are often more measurable, visible and homotopic to reality (such as measures of neoplasm cell division in oncology). This makes it far simpler to map a negative experience such as a painful lump in the neck to a causal chain – for example, a neoplastic growth. We recognise that we are, to some extent, generalising and that, for certain medical conditions (e.g. fibromyalgia and some chronic pain conditions), such clear causal mapping is often not possible. However, it is arguable that in mental healthcare, even quantitative psychometric and neuropsychometric data are not, at root, measuring ‘states of the world’ analogous to such observational measures in physical medicine. This feature of mental healthcare has profound implications for the safe clinical use of artificial intelligence. Given that clinical decision-making in mental healthcare relies, in almost all cases, on these ‘heterotopic’ constructs, the transparency of a system’s ontology becomes critically important. So does a governance framework that ensures the model ‘knows’ the ontological status of the constructs it is representing when it constructs and maps a high-dimensional cognitive space for the cognitive tasks relevant to mental healthcare.
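One way to make this requirement concrete is for any such system to carry explicit ontological metadata alongside each construct it manipulates. The sketch below (in Python) is purely illustrative: the status labels, fields and example values are our own assumptions, not an existing standard or implementation.

```python
# Illustrative only: a minimal way a system might record the ontological
# status of the clinical constructs it manipulates, so that downstream
# reasoning (and its governance) can distinguish inferred psychological
# constructs from directly measurable observations. The labels and fields
# are assumptions for the purpose of illustration, not an existing standard.
from dataclasses import dataclass
from enum import Enum


class OntologicalStatus(Enum):
    OBSERVABLE = "directly measurable state of the world"
    PSYCHOMETRIC = "derived from standardised self-report or test scores"
    THEORETICAL = "inferred construct; meaning varies across theories"


@dataclass
class ClinicalConstruct:
    name: str
    theoretical_origin: str          # e.g. cognitive therapy, neuropsychology
    status: OntologicalStatus
    notes: str = ""


schema_construct = ClinicalConstruct(
    name="schemata",
    theoretical_origin="cognitive therapy / neuropsychology",
    status=OntologicalStatus.THEORETICAL,
    notes="Used with variable meaning across theories; treat as heterotopic.",
)

print(schema_construct.status.value)
```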
Patient safety in mental healthcare is often a function of ‘weak signal’ detection in unstructured data-sets
Serious adverse events related to mental health, such as suicide and homicide, are rare in the general population. Although their incidence is higher in specific clinical mental health populations, general-purpose language models may not be equipped to identify the relevant precursors correctly in unstructured data-sets. The clinical record in mental health is dominated by unstructured data. ‘Weak signals’ in these data, such as a single sentence in a clinical note that reports a particular risk indicator, or patterns in collective clinical data across a single site, may in hindsight be seen as drivers of significant harm.[6] Serious incident reviews in the field are replete with such examples.[7] Arguably, automation of clinical history review with artificial intelligence is likely to improve signal detection. However, given the known variability in clinicians’ ability to digest and detect weak signals in large data-sets, accountability would still rest with the clinician post hoc (for example, in a serious incident investigation), whether a weak signal was missed by an artificial or a human agent. When applying artificial intelligence to mental health clinical review, a transparent governance framework is therefore critical to ensure sufficient clinician ‘reach’ into the processes of signal definition, signal detection and the activation of a response to both weak and strong signals in the data, in order to maintain patient safety.[8]
Conversely, overemphasising weak signals through lack of clinical experience can lead to iatrogenic harm through overtreatment and stigmatisation – for example, the misinterpretation of normative yet subjectively severe psychological distress, followed by precipitous diagnosis and treatment. This can occur in grief or adjustment disorder states, which may be misconstrued as severe depressive episodes. Any governance framework for the use of autonomous clinical agents will also need to protect against this risk.
Large language models make it possible, depending on the interface used, for clinicians to question the information presented to them in vivo. This would permit trained clinicians to ask the right questions and to examine weak signals properly. Furthermore, the domain knowledge that practising clinicians hold would allow them to question the absence of signals where they might otherwise be expected. Taking this a step further, the clinician could explore, with relative ease, the relationships between weak signals and related biopsychosocial factors as established in the literature.
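As a purely illustrative sketch of what surfacing such signals for clinician interrogation might look like, the Python fragment below flags candidate ‘weak signals’ in free-text notes together with their source sentences. The indicator phrases and the flagging rule are assumptions for illustration only; any real system would require clinically governed signal definitions, validation and evaluation.

```python
# A minimal sketch of 'weak signal' surfacing in unstructured notes, kept
# deliberately simple: candidate signals are flagged with their source
# sentence so a clinician can interrogate them in context. The indicator
# phrases and the flagging rule are illustrative assumptions only.
import re

RISK_INDICATORS = [          # illustrative phrases, not a validated list
    "stopped taking medication",
    "gave away possessions",
    "recent bereavement",
    "access to means",
]


def flag_weak_signals(clinical_notes: list[str]) -> list[dict]:
    """Return candidate weak signals with enough context for clinician review."""
    flags = []
    for note_id, note in enumerate(clinical_notes):
        for sentence in re.split(r"(?<=[.!?])\s+", note):
            for phrase in RISK_INDICATORS:
                if phrase in sentence.lower():
                    flags.append({
                        "note_id": note_id,
                        "indicator": phrase,
                        "source_sentence": sentence.strip(),
                    })
    return flags


notes = [
    "Reviewed in clinic. Reports poor sleep. Says he has stopped taking medication.",
    "Routine follow-up, no concerns raised.",
]
for flag in flag_weak_signals(notes):
    print(flag)
```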

Fig. 1 An outline proposal: the Clinical Reach into the Cognitive Space (CRITiCS) framework for generative artificial intelligence patient safety in mental healthcare. LLM, large language model.
Mental healthcare delivery is driven by idiographic case formulation
There are some components of mental healthcare that yield to diagnostically based heuristics (for example, protocols for antipsychotic use in schizophrenia and other psychotic illnesses). However, in the majority of cases the overall treatment heuristic incorporates some degree of idiographic case formulation that drives ongoing treatment and risk management. Case formulation may be theory naive (for example, generic ‘5P’ models as outlined by Winiarski[9]), single-theory dependent (for example, cognitive behavioural formulation models[10]) or procedurally theoretically integrated (for example, process-based therapy[11]).
The National Institute for Health and Care Excellence in the UK provides guidance on the broad theoretical constraints within which treatment for specific conditions should be provided. However, given that multimorbidity is the norm in mental healthcare, clinicians often undertake case formulation from a theory-diverse perspective in order to construct an individual map of the presenting difficulty, alongside the patient wherever possible. This supports treatment selection, clinical staging and outcome evaluation. Formulation is a highly idiographic task that depends heavily on the clinician’s available knowledge and level of training, and on both clinician and patient preference. Idiographic formulation has limited predictive power, particularly in complex cases, such that a hindsight review can rarely conclude definitively that a formulation was ‘wrong’ in a manner analogous to a treatment or assessment omission in the management of a physical disease. This has implications for the use of case formulations as ‘fine-tuning’ material for both clinicians and generative artificial agents. Further, as treatment histories lengthen, multiple formulations typically come to coexist in the clinical record.
Additionally, the passage of time may invalidate formulations, simply because they lack more recent data points. Case formulations are substantially approximate representations of the clinical situation. Suitably skilled mental health clinicians can distil a sometimes vague sense of meaning into more concrete signals via learned clinical reasoning (for example, by noticing particular signs, inflections or case history details). This skill is hard to operationalise within the terms of a formal cognitive space in the context of mental healthcare, perhaps even more so than in physical healthcare. Finally, human mental health is profoundly influenced by stochastic variables external to the person, such as social and economic circumstances and life events, which may shape the genesis and course of illness and, in many instances, define it. However, there have been recent advances in the literature supporting improved definition of this cognitive space.[12,13]
Results
The future – safe agentive reach into the clinical cognitive space
Computational models of complex human cognition that account for all of the cognitive processes underpinning mental health clinical work remain a matter of active investigation.[14] Nonetheless, mapping and creating a high-dimensional computational space that broadly (albeit not fully transparently) models the cognitive space of a clinician constructing a mental health case formulation is now technologically achievable, as is a system that automates the process of clinical formulation. However, there is a clear need to ensure that the ‘agentive reach’ of clinicians into the finalisation of hypothesis-driven case formulations constructed by artificially intelligent agents is significant and sufficient. There is also a need to ensure that objective and transparent confidence metrics are built into such processes, to reduce the risk of automation bias, that is, clinician overconfidence in case formulations generated by such agents. Furthermore, future clinicians will need to be able to explain these metrics and their meaning to their patients. Development of confidence metrics and paradigms using current ‘explainable artificial intelligence’ protocols is an urgent task. Hauser et al,[15] writing from the viewpoint of computational psychiatry, refer to this move towards increased process transparency as the development of ‘grey box’ models. Regulators and clinicians developing such tools may usefully develop objective measures of ‘reach’ using the proposed model as a general frame. The question of what constitutes sufficient clinician agentive reach into such automated clinical systems in mental healthcare, and by what measure, remains open. We would suggest that all those seeking to develop new mental health clinical technologies based on LLMs should expect to be asked to demonstrate governance principles and measures across each of the domains of the framework (Fig. 1).
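As an illustration of how a confidence metric and clinician sign-off might be carried alongside each machine-generated formulation element, the sketch below (in Python) shows one possible record structure. The field names, confidence scale and workflow states are assumptions for the purpose of illustration, not a validated explainability protocol or a description of any existing system.

```python
# Illustrative sketch only: one way an automated formulation element could
# carry an explicit confidence estimate and provenance, so that the clinician
# retains 'reach' into how the hypothesis was produced and can accept,
# amend or reject it. Field names and the confidence scale are assumptions.
from dataclasses import dataclass


@dataclass
class FormulationHypothesis:
    statement: str                   # the hypothesis offered by the model
    supporting_evidence: list[str]   # record excerpts the model relied on
    model_confidence: float          # 0.0-1.0, surfaced to the clinician
    clinician_decision: str = "pending"   # 'accepted', 'amended' or 'rejected'
    clinician_comment: str = ""


hypothesis = FormulationHypothesis(
    statement="Low mood appears maintained by social withdrawal following job loss.",
    supporting_evidence=["Note 2024-03-12: 'has not left the flat for two weeks'"],
    model_confidence=0.62,
)

# The clinician, not the model, finalises the formulation.
hypothesis.clinician_decision = "amended"
hypothesis.clinician_comment = "Add recent bereavement as a precipitating factor."
```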
Figure 1 visualises an outline framework that could be used by practising clinicians, without advanced knowledge of LLMs or other artificial intelligence technologies, to conceptualise the interaction between the traditional cognitive space of the mental health clinician and the new cognitive (clinical) space of the intelligent artificial agent.
The area of each triangle represents the degree of cognitive space occupied by each agent (human and artificial intelligence); together, they represent a combined human/machine cognitive space. As the area of cognitive space controlled by an agent decreases, the degree of ‘supervisory’ attention that agent must pay to the decision-making process increases, both to support weak signal detection and to ensure human penetrance into areas of case consideration that require, for example, empathic or derived relational responses originating in human (clinician) learning. Similarly, machine attention is deployed to some degree throughout the space, to support safety in areas where human cognitive and psychological weaknesses may affect the heuristic process.
In sum, solving the ‘alignment problem’ as it applies to artificial intelligence in mental healthcare will involve both ensuring adequate human reach into machine cognitive spaces and using current technological advances to introduce intelligent agent ‘reach’ into the cognitive space of human clinical decision-making. On this view, artificial intelligence reach is a potential safety-enhancing process, given that human errors make such a large contribution to patient safety incidents in mental healthcare.
Table 1 provides an overview of the tasks, and the respective agentive reach of each agent, across the range of cognitive spaces that would operate in combined artificial intelligence/human clinical work in mental healthcare.
Table 1 The Clinical Reach into the Cognitive Space (CRITiCS) model: nature and degree of agentive reach by cognitive space

LLM, large language model.
Discussion
The framework – which we have called ‘CRITiCS’ – suggests a conceptual basis through which to map existing artificial intelligence healthcare governance principles, such as those outlined by Reddy et al,[17] to support the safe application of the new LLM technologies in the field of mental health. We have argued that mental healthcare needs such a framework, given its ontological and epistemological status as associated with, but distinct from, physical medicine. The clinical vision for such technologies must be a human one. As such, it is important that practising clinicians and users of mental health services – not just those with an understanding of technology – are involved in the development of the new computational tools. We believe that these tools will inevitably transform clinical practice in mental healthcare in the coming years. Clinicians and people receiving mental health services are, to paraphrase Lenat (1975), the ‘thinking beings’ most likely to ensure that this new artificial cognitive space is used safely and responsibly, and in a manner that preserves the sound clinical reasoning, wisdom and interpersonal process that we would argue are the hallmarks of all good mental healthcare. We strongly advocate that users of mental health services must be involved as equal partners in the transformative changes yet to come. Psychological distress remains uniquely human. Most people either use, or love those who use, mental health services, so it is in all our interests for human partnerships to be the fulcrum of this process.
Data availability
Data availability is not applicable to this article as no new data were created or analysed in this study.
Author contributions
A.H. wrote the initial draft of the paper. L.W. revised and refined the content. J.N. supported figure production.
Funding
This research received no specific grant from any funding agency, commercial or not-for-profit sectors.
Declaration of interest
A.H. and L.W. are both involved in designing artificial intelligence systems for potential use in clinical settings.