No CrossRef data available.
Published online by Cambridge University Press: 26 August 2025
Psychiatric diagnosis plays a key rol in the mental health care. One of the critical factors that influence the diagnostic process is inter-rater reliability, the degree to which different raters agree on the diagnosis when assessing the same patient. Despite the availability of standardized diagnostic manuals, variability in psychiatric diagnoses persists. The assessment of inter-rater reliability involves calculating statistical measures which quantify the level of agreement between raters beyond what would be expected by chance. Improving inter-rater reliability in psychiatric diagnoses is necessary for optimizing both patient care and research quality in mental health.
Assess inter-rater reliability across main psychiatric disorders and identify the sources of variability.
This study was performed according to the PRISMA guidelines and a total of ninety-three studies were included. Regarding inclusion criteria, (1) the articles had to focus on inter-rater reliability, (2) study participants had to have an average age greater than 18 years, and (3) the reported diagnoses had to refer to a diagnostic manual. Quality scores were assessed for all included studies (Armijo-Olivo S et al. J Eval Clin Pract. 2012; 18 12-8). Seven different meta-analysis were conducted, one for each psychiatric diagnosis detected. The heterogeneity between studies was quantified using Cochran’s Q and I2. Funnel plots was analyzed to assess the possible influence of publication and location biases (Higgins&Green. BMJ. 2011;343). To account for publication bias, the Eggers’ test and the Fail-Safe Number31 was applied.
Psychotic disorder: (k=0.70; 95% CI: 0.66-0.75) (I²=97%).
Anxiety Disorders: (k=0.65; 95% CI: 0.60-0.70) (I²=91%).
Obsessive-Compulsive Disorder (OCD): (k=0.73; 95% CI: 0.64-0.82) (I²=76%)
PTSD: (k=0.60; 95% CI: 0.52-0.66) (I² =84%).
DCA: (k=0.73; 95% CI: 0.67-0.79) (I² =76%).
two different meta-analysis were conducted because many studies used Intraclass Correlation Coefficient (ICC) as a value to express inter-rater agreement. [k=0.65 (95% CI: 0.59, 0.7)] (I2= 96%) (ICC=0.85; 95% CI: 0.82-0.87) (I² = 66%).
Image 1:
The lower Kappa for schizoaffective disorder (Kappa < 0.7) compared to other psychiatric disorders underscores the diagnostic challenges posed by this category, given its overlapping symptoms with both mood and psychotic disorders. As regards personality disorder, antisocial and borderline PD showed highest agreement potentially due to its well-defined diagnostic criteria. The lowest agreement (k=0.60) of PTSD emphasizes the variability of his clinical presentation. In conclusion, studies show variability across disorders, highlighting the need for further research to improve diagnostic accurac (Regier et al. Am J Psychiatry. 2009;166 645-50) thereby enhancing clinical and research outcomes.
None Declared
Comments
No Comments have been published for this article.