Hostname: page-component-54dcc4c588-rz4zl Total loading time: 0 Render date: 2025-09-12T02:54:00.275Z Has data issue: false hasContentIssue false

Generative Pre-trained Transformer-4 (GPT-4) generated psychological reports in psychodynamic perspective: a pilot study on quality of report, risk of hallucination, and client satisfaction

Published online by Cambridge University Press:  26 August 2025

T.-S. Kim*
Affiliation:
Department of Psychiatry, Seoul St. Mary’s Hospital, The Catholic University of Korea, College of Medicine
N. Kim
Affiliation:
Department of Clinical Medical Sciences, Seoul National University, Seoul
J. Lee
Affiliation:
Department of Psychiatry, Uijeongbu Eulji Medical Center, Eulji University School of Medicine, Uijeongbu, Korea, Republic Of
*
*Corresponding author.

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.
Introduction

Recently, there has been growing interest in leveraging large language models (LLMs) in psychiatry and counseling. Specifically, there is a need to develop LLM-based programs that generate psychodynamic assessments, helping individuals gain self-insight and evaluate the quality of such services. However, research in this area remains limited.

Objectives

This pilot study aims to evaluate quality, risk of hallucination, and client satisfaction with psychodynamic psychological reports generated by GPT-4.

Methods

The reports consisted of five components: psychodynamic formulation, psychopathology, parental influence, defense mechanisms, and client strengths. Participants experiencing distress from recurring interpersonal issues were recruited for the study, which followed three stages: 1) GPT-4 generated tailored questions for participants to infer psychodynamic formulations, then used the responses to create psychological reports. 2) Seven psychiatry professors from various university hospitals assessed the reports for quality and hallucination risk, comparing GPT-4-generated reports with expert-inferred reports. 3) Participants evaluated their satisfaction with the psychological reports. All assessments were conducted using self-report questionnaires based on a Likert scale developed for this study.

Results

Ten participants were recruited for the study, with an average age of 32 years. The median response indicated that the quality across all five components of the psychological report aligned closely with expert evaluations. The risk of hallucination was assessed to be minimal, ranging from unlikely to minor. In the satisfaction evaluation, over 90% of participants agreed or strongly agreed that the report was clear, insightful, credible, useful, satisfying, and recommendable.

Conclusions

These findings suggest that artificial intelligence may have the potential to provide expert-level psychodynamic interpretations with minimum face-to-face interaction.

Disclosure of Interest

None Declared

Information

Type
Abstract
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of European Psychiatric Association
Submit a response

Comments

No Comments have been published for this article.