Introduction
Cognitive impairment is a core feature of psychotic disorders, present from the early stages of illness and often persisting throughout its course [Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman1]. Longitudinal studies have shown that deficits in cognitive domains such as attention, memory, and executive functioning predict a broad range of functional outcomes in schizophrenia, including social functioning, occupational adjustment, independent living, self-care, quality of life, and overall community integration [Reference Rajji, Miranda and Mulsant2]. Consequently, accurate and efficient assessment of cognitive functioning is essential for early identification of difficulties, individualized treatment planning, and monitoring of therapeutic interventions [Reference Nuechterlein, Nasrallah and Velligan3].
Numerous instruments are currently used to assess cognition in individuals with psychotic disorders, including standardized cognitive assessment batteries such as the MATRICS Consensus Cognitive Battery, which is considered the gold standard for evaluating cognitive impairment in schizophrenia [Reference Nuechterlein, Green, Kern, Baade, Barch and Cohen4]. While these tools are well-validated and widely used in research, they present notable limitations in clinical practice due to their administration time and need for staff time and training. To address these barriers, the Brief Assessment of Cognition in Schizophrenia (BACS) was developed as a rapid, reliable, and portable alternative that focuses on cognitive domains most impaired and most strongly related to functional outcomes in schizophrenia, including verbal memory, working memory, attention, executive functions, and motor speed [Reference Keefe, Goldberg, Harvey, Gold, Poe and Coughenour5]. Requiring approximately 30 minutes to complete, the BACS has demonstrated high test–retest reliability, minimal practice effects, and strong concurrent validity with standard batteries.
The growing availability of digital tools has fostered the development of assessments that offer automated scoring, standardized instruction delivery, and reduced rater-related variability, making them particularly useful in both research and clinical contexts. Several brief computerized neurocognitive test batteries have been validated in individuals with psychotic disorders [Reference Domen, van de Weijer, Jaspers, Denys and Nieman6–Reference Pietrzak, Olver, Norman, Piskulic, Maruff and Snyder8]. Within this context, the Brief Assessment of Cognition (BAC) app was developed as a fully digital adaptation of the BACS, aiming to retain the original instrument’s psychometric strengths while taking advantage of the practical benefits of tablet-based administration. The BAC app includes digital versions of the six BACS subtests assessing verbal memory (list learning and recall), working memory (digit sequencing), motor speed (token motor task), verbal fluency (semantic and phonemic), attention and processing speed (symbol coding), and executive functioning (Tower of London). Each task is self-administered on a tablet device and features standardized audio and visual instructions, automated timing, and immediate scoring, reducing examiner bias and enhancing feasibility in routine settings. This format minimizes the need for trained examiners, eliminates rater-related variability, and supports consistent administration across settings. Its validation study has shown high concordance with the paper version, preserved sensitivity to cognitive impairment, and maintained strong associations with functional outcomes [Reference Atkins, Tseng, Vaughan, Twamley, Harvey and Patterson9].
Beyond its psychometric properties, the BAC app exemplifies a broader movement toward the modernization of cognitive assessment in schizophrenia and related disorders. This transition reflects a growing interest in tools that combine psychometric rigor with practical benefits such as remote administration, ecological validity, and reduced burden on clinicians and patients [Reference Schulz and Murray10–Reference Horan, Moore, Belanger and Harvey12]. Recent studies have demonstrated the feasibility and utility of mobile-based cognitive tools to monitor functioning longitudinally and across diverse settings, enhancing accessibility and enabling scalable, cross-cultural applications [Reference Cohen, Joshi, Bondre, Chand, Chaturvedi and Choudhary13, Reference Lakhtakia, Bondre, Chand, Chaturvedi, Choudhary and Currey14]. The European Psychiatric Association emphasizes the necessity of such innovation in guiding more accurate and comprehensive assessments, suggesting that these digital adaptations can greatly improve both research and clinical practice by integrating real-world cognitive assessments [Reference Vita, Gaebel, Mucci, Sachs, Erfurth and Barlati11]. In this context, the BAC App, represents a practical and innovative solution to the logistical barriers of traditional assessment approaches.
However, available evidence of the psychometric properties of the BAC app is limited to a single study conducted in a small clinical sample and did not differentiate between distinct stages of illness, such as first-episode psychosis (FEP) and chronic schizophrenia, limiting generalizability of its findings. Moreover, no normative data are currently available for Spanish-speaking populations nor has the influence of key demographic factors such as age, sex, and education been examined. Notably, the original validation did not assess internal consistency, leaving the reliability of the total composite score unverified—a crucial aspect for its use in clinical interpretation and longitudinal monitoring.
Thus, the present study aimed to provide a comprehensive validation of the BAC app in a Spanish-speaking sample that includes individuals with FEP, chronic schizophrenia, and healthy controls (HCs). Specifically, we seek to (i) examine the internal consistency of the BAC app; (ii) assess discriminative validity by comparing mean performance across groups (HCs, FEP, and schizophrenia) and evaluating the classification accuracy of the BAC app through receiver operating characteristic (ROC) curve analyses; (iii) evaluate convergent validity by examining correlations between BAC app scores and estimated intelligence quotient (IQ) and a functional capacity measure, and test the independence of BAC app scores from symptom severity; (iv) analyze the influence of key demographic variables (sex, age, and education) on BAC app performance within each group; and(v) subsequently develop stratified normative data for each BAC app subtest and the composite score based on the observed effects of these demographic variables.
Methods
Participants and procedure
Participants were recruited as part of a multicenter study in 15 public mental health centers and university hospitals across Spain to validate digital tools for assessing cognitive and functional capacity in individuals with psychotic disorders. The final sample included individuals diagnosed with chronic schizophrenia or FEP, identified through clinician referral based on convenience sampling, as well as HCs recruited to be comparable in age, sex, and educational level. A total of 403 participants were initially recruited, including 117 individuals with FEP, 125 with chronic schizophrenia, and 161 HCs. After excluding one control with missing BAC app data, the final sample comprised 402 individuals.
Participants were eligible if they: (1) were aged 18–60 years; (2) met diagnostic criteria for a schizophrenia spectrum disorder according to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5); (3) had adequate Spanish proficiency; and (4) could provide informed consent. Exclusion criteria included (1) organic brain pathology or neurological illness, (2) intellectual disability (DSM-5), and (3) current or recent (past 6 months) substance dependence, assessed via the Comprehensive Assessment of Symptoms and History (CASH [Reference Andreasen, Flaum and Arndt15]). Participants were classified as FEP if they had initiated antipsychotic treatment within the past 3 years, a widely accepted threshold based on the “critical period” hypothesis, which posits that the first 3 years after illness onset represent a pivotal window for intervention [Reference Birchwood, Todd and Jackson16, Reference McGorry, Killackey and Yung17]; otherwise, they were classified as having chronic schizophrenia.
HCs met the same age and language criteria and were required to provide informed consent. Exclusion criteria included any current or past mental or neurological disorder, organic brain pathology, intellectual disability, substance use disorder, or psychotropic medication use.
Data were collected between July 2022 and December 2024. Participants with psychosis were recruited from routine clinical care settings across 15 public university-affiliated hospitals and mental health centers in Spain. These included early intervention programs for FEP, day hospitals, outpatient clinics, and acute inpatient units. Although recruitment was based on convenience sampling, the diversity of participating public sector services reflects typical clinical contexts of psychosis care in Spain. HCs were also recruited using convenience sampling strategies, including outreach to university students, hospital staff, and acquaintances of patients or clinicians. The absence of mental or neurological disorders was operationally defined as having no current or past mental disorders (including mood, anxiety, or psychotic disorders), no history of neurological illness, no substance use disorder (past or present), and no use of psychotropic medication. These criteria were confirmed through the CASH structured interview and participant self-report.
Trained research staff administered the assessments in a single session following standardized instructions. Diagnoses were established according to DSM-5 criteria through semistructured face-to-face clinical interviews conducted by trained clinicians at each site. Although no formal blinding was used, the BAC app was generally administered by a different team member than the one responsible for diagnosis. Written informed consent was obtained, and the study received ethical approval from the ethics committee of the principal center (code: PI20/00066) and from the corresponding ethics committees at each participating site.
Measures
BAC app
The BAC app is a tablet-based version of the BACS [Reference Keefe, Goldberg, Harvey, Gold, Poe and Coughenour5, Reference Atkins, Tseng, Vaughan, Twamley, Harvey and Patterson9], designed to assess six cognitive domains relevant to clinical populations: episodic memory, working memory, verbal fluency, processing speed, executive functioning, and psychomotor speed. The full battery required approximately 30 minutes to complete. Raw scores from each subtest were converted into T-scores based on the HC group distribution. These T-scores were then averaged to generate a cognitive composite score reflecting overall performance. Higher scores indicate better cognitive performance across all subtests.
Virtual Reality Functional Capacity Assessment Tool (VRFCAT)
The VRFCAT is a computerized performance-based measure developed to assess functional capacity through the simulation of everyday tasks in a realistic virtual environment [Reference Keefe, Davis, Atkins, Vaughan, Patterson and Narasimhan18]. A Spanish-translated version of the VRFCAT was administered on a tablet. In the present study, only the total time to completion was used for analysis, with higher times indicating poorer functional capacity. Internal consistency of this composite score was acceptable to good across groups: for the total sample, ω = 0.80; for the clinical group, ω = 0.76; and for controls, ω = 0.71.
Positive and Negative Syndrome Scale (PANNS)
In the present study, the validated Spanish version of the PANSS was used [Reference Kay, Fiszbein and Opler19, Reference Peralta and Cuesta20]. The PANSS includes three subscales: positive symptoms (seven items), negative symptoms (seven items), and general psychopathology (16 items).
Vocabulary subtest of the Wechsler Adult Intelligence Scale – Third Edition (WAIS-III)
The vocabulary subtest from the Spanish version of the WAIS-III was used as an estimate of premorbid IQ [Reference Wechsler21]. Participants are required to define a series of words that increase in difficulty, providing a measure of verbal knowledge and crystallized intelligence. Raw scores were converted to age-adjusted scaled scores using normative data from the WAIS-III manual.
Sociodemographic and clinical variables
Sociodemographic data were collected, including age, sex, ethnicity, educational level, and current employment status. Clinical status was obtained through a structured interview that included mental diagnosis based on DSM-5 criteria, age of onset, duration of untreated illness (DUI), and duration of untreated psychosis (DUP).
Data analysis
Descriptive statistics were computed by group (FEP, schizophrenia, and HCs), including means with standard deviations, medians, interquartile ranges, skewness, and kurtosis. Group comparisons used χ2 for categorical and t-tests, analyses of variance (ANOVAs), or nonparametric tests (Mann–Whitney U, Kruskal–Wallis) for continuous variables.
Raw scores on each BAC app subtest were converted into T-scores (mean = 50, SD = 10) using the HC sample as a normative reference group. This standardization facilitated the computation of a composite cognitive score, calculated as the average of the six subtest T-scores.
Internal consistency of the BAC app was estimated using Cronbach’s alpha for each group and the total sample. Intercorrelations between subtests were also examined to assess construct coherence.
To evaluate group differences in cognitive performance, one-way ANOVAs were conducted on T-scores and the composite score. Post hoc comparisons were conducted using Tukey’s HSD test when the assumption of homogeneity of variances (Levene’s test, p > 0.05) was met and the Games–Howell test when it was not (p < 0.05). ROC curve analyses were performed to assess the discriminative validity of BAC app scores in distinguishing between the clinical and the control groups, as well as between FEP and schizophrenia subgroups. Area under the curve (AUC), sensitivity, specificity, and optimal cutoff points (Youden Index) were reported.
To develop normative data, we first examined the effects of sex, age, and years of education on BAC app performance within each group. Based on the observed associations, quantile regression analyses were conducted on the HC sample to generate normative percentile scores (10th, 25th, 50th, 75th, and 90th) for the six BAC app subtests. Percentiles were stratified by sex, age group, and educational level. Age and education were categorized into three levels each to ensure clinically meaningful distinctions and adequate sample sizes.
Finally, we analyzed the relationships between estimated IQ, functional capacity (VRFCAT total time to completion), and clinical symptoms, assessed through the positive, negative, and general subscales of the PANSS. All statistical tests were two-tailed, with the significance threshold set at p < 0.05. Analyses were conducted using IBM SPSS Statistics (version 26).
Results
Sociodemographic and clinical characteristics of the sample
Table 1 presents the demographic and clinical characteristics of participants in the FEP, schizophrenia, and HC groups. Significant differences were found across groups in age, years of education, ethnicity, marital status, and employment status. In contrast, sex distribution did not differ significantly between groups.
Table 1. Demographic and clinical characteristics of FEP, schizophrenia, and healthy controls

Abbreviations: DUI, duration of untreated illness (in months); DUP, duration of untreated psychosis (in months); FEP, first-episode psychosis; H, Kruskal–Wallis test statistic; HC,= healthy controls; IQR, interquartile range (25th–75th percentile); Md, median; PANSS, Positive and Negative Syndrome Scale; U, Mann–Whitney U test statistic.
Descriptive statistics for each BAC app subtest and the composite score are presented in Table 2 for each group. Distributions across variables were generally symmetric, although some subtests (e.g., tower of London) showed notable skewness and kurtosis, particularly in the control group.
Table 2. Descriptive statistics for BAC app scores in FEP, schizophrenia, and healthy control groups

Abbreviations: BAC app, Brief Assessment of Cognition app; FEP, first-episode psychosis; IQR, interquartile range (25th–75th percentile); SD, standard deviation.
Note: Subtest scores are raw scores. The composite score is the average of T-scores from the six BAC App domains, standardized using healthy control group data.
Internal consistency
Internal consistency was acceptable to good across groups. For the FEP group, Cronbach’s α = 0.83 (95% confidence interval [CI] [0.77–0.87]); for the schizophrenia group, α = 0.81 (95% CI [0.73–0.86]); for the control group, α = 0.76 (95% CI [0.68–0.82]); and for the total sample, α = 0.87 (95% CI [0.85–0.89]).
Additionally, all BAC app subtests showed statistically significant positive correlations across the three groups (FEP, schizophrenia, and controls), with the strongest associations observed between each subtest and the global composite score (r = 0.63–0.82). Intersubtest correlations were generally moderate to high (r = 0.28–0.64).
Group differences in BAC app performance
As shown in Table 3, there were statistically significant group effects for all subtests and the composite score (all p < 0.001). Post hoc comparisons revealed that HCs outperformed both clinical groups across all domains, whereas participants with FEP showed significantly better performance than those with schizophrenia on most subtests, a pattern that was especially evident in the composite score.
Table 3. ANOVA results and post hoc comparisons of BAC app subtests and composite score across FEP, SCZ, and HC groups

Abbreviations: FEP first-episode psychosis; SCZ, schizophrenia; HC, healthy controls; SD, standard deviation.
Note: Subtest scores are raw scores. The CS is the average of T-scores from the six BAC app domains, standardized using healthy control group data.
a Post hoc comparisons were conducted using Tukey’s HSD test when the assumption of homogeneity of variances (Levene’s test, p > 0.05) was met, and Games–Howell test when it was not (p < 0.05).
* p < 0.05.
** p < 0.01.
*** p < 0.001.
ROC curve analyses
ROC curve analyses were conducted to evaluate the discriminative validity of the BAC app subtest T-scores and composite score. As shown in Figure 1, when comparing the clinical group (individuals with FEP and schizophrenia combined) to HCs, the composite score showed good classification accuracy, yielding an AUC of 0.862 (95% CI [0.826–0.899]). The optimal cutoff point of 47, identified using the Youden index (J = 0.63), resulted in a sensitivity of 77.3% and a specificity of 85.0%. Among the individual subtests, symbol coding demonstrated the highest discriminative power (AUC = 0.814).

Figure 1. Receiver operating characteristic (ROC) curves for Brief Assessment of Cognition (BAC) app subtests and composite score: clinical (first-episode psychosis [FEP] + schizophrenia) group versus controls. ROC curves comparing individuals with psychosis (FEP and schizophrenia combined) to healthy controls on BAC app subtest T-scores and the composite score. The composite score showed the highest discriminative validity (area under the curve [AUC] = 0.862), followed by symbol coding (AUC = 0.814), verbal fluency (AUC = 0.794), token motor (AUC = 0.785), verbal memory (AUC = 0.774), digit sequencing (AUC = 0.772), and tower of London (AUC = 0.741). The diagonal reference line (AUC = 0.50) represents chance-level performance.
To examine the ability of the BAC app to distinguish between individuals with FEP and those with schizophrenia, a second set of ROC analyses was performed within the clinical sample (see Figure 2). The composite score showed poor discriminative ability (AUC = 0.649, 95% CI [0.580–0.718]), with an optimal cutoff of 45 (Youden index = 0.30), yielding a sensitivity of 68.6% and a specificity of 61.6%. The token motor subtest provided the highest AUC in this comparison (0.702), while the remaining subtests ranged from 0.541 to 0.650, indicating limited accuracy in differentiating between these clinical subgroups.

Figure 2. Receiver operating characteristic (ROC) curves for Brief Assessment of Cognition (BAC) app subtests and composite score: first-episode psychosis (FEP) versus schizophrenia.
Note. ROC curves comparing individuals with FEP to those with schizophrenia on BAC app subtest T-scores and the composite score. The composite score showed poor discriminative validity (area under the curve [AUC] = 0.649), with the token motor subtest yielding the highest AUC (0.702) followed by verbal memory (AUC = 0.650), symbol coding (AUC = 0.632), tower of London (AUC = 0.580), verbal fluency (AUC = 0.553), and digit sequencing (AUC = 0.541). The diagonal reference line (AUC = 0.50) represents chance-level performance.
Associations with functional capacity performance, IQ, and clinical symptoms
As shown in Table 4, BAC app scores were significantly associated with both estimated IQ and functional capacity across all groups, with stronger correlations generally observed in clinical groups, indicating that better cognitive performance was associated with higher IQ and better functional capacity.
Table 4. Correlations between BAC app and estimated IQ/VRFCAT

Abbreviations: BAC app, Brief Assessment of Cognition app; IQ, estimated intelligence quotient (vocabulary subtest); VRFCAT, Virtual Reality Functional Capacity Assessment Tool.
Note: Spearman’s rho correlations. * p < 0.05, ** p < 0.01, *** p < 0.001. Higher VRFCAT Time indicates poorer functional capacity.
Regarding clinical symptoms, negative associations were observed between BAC app performance and all three PANSS dimensions, with correlation magnitudes ranging from small to moderate. While not all correlations reached statistical significance, those that did suggest that greater symptom severity is linked to poorer cognitive functioning. Overall, associations tended to be stronger in the FEP group than in the schizophrenia group.
Normative performance on the BAC app
As a preliminary step to determine whether stratification of normative data was warranted, we examined the effects of sex, age, and education on BAC app performance using the composite score as the primary outcome. For this type of analysis, we opted to use the composite score rather than analyzing the six subtests separately, as it provides a reliable and sensitive global indicator of cognitive performance and minimizes multiple testing issues. In the FEP group, women obtained significantly higher composite scores than men (t(106.0) = −2.36, p = 0.020), whereas no significant sex differences were found in the schizophrenia or control groups. Age was negatively correlated with the composite score in both the schizophrenia (ρ = −0.30, p = 0.001) and control (ρ = −0.24, p = 0.003) groups. Years of education were positively associated with the composite score across all groups: FEP (ρ = 0.34, p < 0.001), schizophrenia (ρ = 0.41, p < 0.001), and controls (ρ = 0.40, p < 0.001). These findings support the need to stratify normative data according to these key demographic variables. Table 5 shows normative percentile scores (10th, 25th, 50th, 75th, and 90th) derived from the HC sample and stratified by sex, age group, and years of education.
Table 5. Normative percentile scores for the BAC app subtests and CS, stratified by sex, age group, and education level

Abbreviations: CS, composite score; DS, digit sequencing; SC, symbol coding; TL, tower of London; TM, token motor; VF, verbal fluency; VM, verbal memory.
Note: Subtest scores are raw scores. The CS is the average of T-scores from the six BAC app domains, standardized using healthy control group data. Percentile scores were calculated independently using quantile regression models. Minor reversals in percentile order (e.g., P90 < P75) were observed only in the tower of London domain and do not indicate data errors. Clinicians are advised to interpret these cases cautiously and consider the overall pattern of scores rather than isolated percentile differences in this domain.
Discussion
This study aimed to validate the BAC app in a Spanish-speaking sample, including individuals with FEP, chronic schizophrenia, and HCs. The BAC app showed good discriminative validity in distinguishing individuals with psychosis from HCs, showing a clear gradient of impairment from controls to FEP to individuals with chronic schizophrenia. Internal consistency of the composite score was acceptable to good across all groups, supporting its use as a global index of cognitive performance. The BAC app showed small-to-moderate positive associations with higher IQ and better functional capacity, and negative associations with greater symptom severity. In addition, because performance on the BAC app varied significantly by sex, age, and education, normative data were stratified accordingly to support meaningful interpretation in clinical settings.
These findings are broadly consistent with the initial validation study of the BAC app by Atkins et al. [Reference Atkins, Tseng, Vaughan, Twamley, Harvey and Patterson9], which reported strong concordance with the paper-based BACS and significant associations with functional outcomes. Individuals with FEP performed better than those with chronic schizophrenia across most BAC app subtests, except digit sequencing and verbal fluency. While this cross-sectional gradient might suggest cumulative cognitive deterioration, longitudinal studies have produced mixed findings regarding the course of cognitive functioning following illness onset [Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman1, Reference Rajji, Miranda and Mulsant2, Reference Bozikas and Andreou22–Reference Shmukler, Gurovich, Agius and Zaytseva24]. Some short-term studies report gains likely due to practice effects [Reference Szoke, Trandafir, Dupont, Meary, Schurhoff and Leboyer25]. Medium- to long-term studies show stability in most domains, except for declines in verbal memory, and mixed findings in executive functions [Reference Bozikas and Andreou22]. Studies with follow-up periods beyond 10 years report modest deterioration in verbal and visuospatial memory, while processing speed and executive functions tend to remain stable [Reference Fett, Velthorst, Reichenberg, Ruggero, Callahan and Fochtmann26–Reference Rodriguez-Sanchez, Setien-Suero, Suarez-Pinilla, Mayoral Van Son, Vazquez-Bourgon and Gil Lopez28]. Together with findings from data-driven subgroup analyses – such as latent class or growth mixture modeling – identifying stable, improving, or declining trajectories [Reference Habtewold, Rodijk, Liemburg, Sidorenkov, Boezen and Bruggeman29], this suggests that the differences observed in our study are more likely due to between-group variability (e.g., illness duration, treatment, and comorbidities) rather than uniform cognitive decline within individuals over time.
ROC curve analyses confirmed that the BAC app had good discriminative accuracy in differentiating individuals with psychosis from HCs (AUC = 0.862), with the composite score offering the highest classification performance. Among the individual subtests, symbol coding, a task that strongly depends on processing speed, yielded the highest AUC (0.814), suggesting that it may be particularly sensitive to illness-related cognitive impairment. This is in line with previous findings indicating that symbol coding is especially sensitive to cognitive deficits in psychotic disorders [Reference Dickinson, Ramsey and Gold30]. However, when attempting to distinguish between FEP and schizophrenia, the composite score showed only modest discriminative ability (AUC = 0.649), reflecting poor classification accuracy. As reported in prior literature, cognitive deficits are already evident in the early stages of psychosis [Reference Mesholam-Gately, Giuliano, Goff, Faraone and Seidman1] and, together with the heterogeneity of cognitive trajectories over time, may help to explain the modest discriminative power of the BAC app in separating early- from late-stage illness, despite statistically significant differences in group means.
When compared to other brief computerized batteries validated in psychosis, including the CogState Battery [Reference Pietrzak, Olver, Norman, Piskulic, Maruff and Snyder8], the MyCognition Quotient [Reference Domen, van de Weijer, Jaspers, Denys and Nieman6], and PsyCog [Reference Gifford, Cullen, Vieira, Searle, McCutcheon and Modinos7], the BAC app shows a pattern of results that overlaps in key areas, such as discriminative validity and sensitivity to cognitive deficits while differing in terms of the specific cognitive domains assessed, the mode of task presentation, and the rationale underlying their development. The convergence of psychometric properties across tools with different designs may reflect a shared capacity to detect broad cognitive impairment in psychosis, rather than the measurement of distinct cognitive processes. Consistent with previous findings suggesting a predominant general cognitive factor in psychosis [Reference Dickinson, Ramsey and Gold30], the high internal consistency and intersubtest correlations of the BAC App support its validity as a global measure of cognitive functioning. Although BAC app scores were only moderately correlated with estimated IQ, this is consistent with the fact that vocabulary-based estimates primarily reflect crystallized intelligence and may not fully represent the broader range of abilities encompassed by the general factor, such as processing speed, working memory, and executive function. In line with this interpretation, performance on a single task, such as symbol coding, has been found to explain a substantial proportion of global cognitive variance in FEP [Reference Dickinson, Ramsey and Gold30–Reference Rodriguez-Sanchez, Crespo-Facorro, Gonzalez-Blanch, Perez-Iglesias, Vazquez-Barquero and Study32], reinforcing the idea that commonly used tasks may be more effective at detecting generalized impairment than at identifying distinct cognitive profiles [Reference Blanchard and Neale33, Reference Dickinson, Ragland, Gold and Gur34].
Regarding associations with clinical symptoms, in individuals with FEP, the BAC app composite score showed significant negative correlations with all three PANSS dimensions – positive, negative, and general – indicating that greater symptom severity across domains was associated with poorer cognitive performance. In contrast, among patients with chronic schizophrenia, only negative symptoms were significantly correlated with the composite score. This divergence suggests that the relationship between symptomatology and cognitive functioning may be more dynamic and multifaceted in the early stages of psychosis, whereas in chronic stages, cognitive impairments appear more stable and primarily linked to enduring negative symptoms. Importantly, the generally low magnitude of these correlations supports the discriminant validity of the BAC app, suggesting that it captures cognitive functioning beyond the influence of symptom severity.
Demographic variables such as sex, age, and education had an impact on BAC app performance across groups. The finding that women scored higher than men in FEP suggests that sex-related cognitive advantages, as previously described in psychosis [Reference Ochoa, Usall, Cobo, Labad and Kulkarni35], may be more evident in early stages of illness, but their impact may be outweighed by cumulative illness-related neurobiological burden, long-term antipsychotic treatment, and social disengagement, leading to a reduction in sex-related cognitive differences in chronic schizophrenia. The absence of sex differences among HCs may reflect ceiling effects in a cognitively intact population. Similarly, older age was associated with poorer performance across groups, with the most pronounced decline observed in the schizophrenia group, consistent with prior evidence of accelerated brain aging in psychosis [Reference Ballester, Romano, de Azevedo Cardoso, Hassel, Strother and Kennedy36]. Consistent with prior research, higher education was associated with better BAC app performance across groups, supporting its role as a cognitive reserve factor [Reference Ayesa-Arriola, de la Foz, Murillo-Garcia, Vazquez-Bourgon, Juncal-Ruiz and Gomez-Revuelta37]. Given these influences, stratified norms by sex, age, and education are essential for accurate score interpretation.
Several limitations should be acknowledged. First, although the overall sample size was adequate for group comparisons, stratification of the normative data by sex, age, and education resulted in reduced cell sizes for some percentile estimations. Second, participants were recruited through convenience sampling, which may limit the generalizability of the findings. Although the control sample was carefully matched and screened to exclude individuals with mental, neurological, or substance-related disorders, participants were not selected through random population-based sampling procedures. This methodological limitation constrains the generalizability of the normative data to the broader Spanish population. Therefore, while the normative percentiles derived from this sample can be valuable for research- and group-level comparisons, they should be interpreted with caution when used for clinical purposes. Furthermore, given the limited evidence linking positive symptom severity to cognitive performance, and the practical limitations of assessing cognition during acute episodes, our findings primarily reflect cognitive functioning in nonacute stages of psychosis. Further studies are needed to evaluate the validity and reliability of the BAC app in highly symptomatic patients during acute phases, although it remains unclear whether cognitive performance under such conditions can meaningfully reflect a patient’s underlying cognitive functioning. Third, group comparisons revealed significant sociodemographic differences in age and educational attainment. While these differences represent a limitation, they are largely unavoidable in studies comparing individuals with FEP, chronic schizophrenia, and HCs. Fourth, convergent validity was assessed using the vocabulary subtest of the WAIS-III. Although vocabulary performance is an imperfect estimate of general cognitive ability, it remains a well-validated and stable proxy [Reference Murillo-Garcia, Soler, Ortiz-Garcia de la Foz, Miguel-Corredera, Barrio-Martinez and Setien-Suero38]. Finally, potential covariates such as medication use, age, and education were not controlled, as the aim was to capture real-world performance and provide ecologically valid normative data.
In conclusion, this study provides robust evidence supporting the reliability, validity, and clinical relevance of the BAC app as a brief, accessible digital tool for assessing cognitive functioning in Spanish-speaking populations with psychotic disorders, offering practical advantages for its use in clinical settings such as minimal staff training, automatic scoring, and standardized administration. Its ability to detect group differences, its associations with key clinical and functional outcomes, and the generation of stratified normative data all reinforce its utility for both research and applied practice. The BAC app remains a valuable instrument for screening cognitive impairment, tracking changes over time, and evaluating treatment response.
Data availability statement
The datasets generated and/or analyzed during the current study are available from the corresponding author upon reasonable request.
Acknowledgments
We would like to extend our sincere gratitude to all the participants who generously contributed their time and effort to this research. We are especially grateful to the 15 collaborating hospitals and research centers across Spain whose support and collaboration made this work possible. The dedication of the clinical and research teams at each site was essential to the successful execution of this multicenter study. We also thank the developers of the VRFCAT for their support and assistance during the validation process.
Author contribution
CG-B: Data curation, formal analysis, visualization, writing – original draft, writing – review & editing; VO-G: Data curation, writing – review & editing; RA-A: Conceptualization, methodology, funding acquisition, project administration, writing – review & editing; RL-C, TB-B, AC, CA, RR-J, AG-Z, OD-M, AG-P, GC-E, MZ, DG, Md-C-S, EV, LG-A, SO, PC-G, VB-M, JVS-O, EP, SM, CF, PR-P, RF-M, IG-C, CC-R, JMC, JC, AY-R: Investigation, writing – review & editing.
Financial support
This study was supported by the Instituto de Salud Carlos III (ISCIII) through project PI20/00066, co-funded by the European Union. AG-P acknowledges the support of the Spanish Ministry of Science, Innovation and Universities, integrated into the National R&D&I Plan and co-funded by the ISCIII–Subdirección General de Evaluación and the European Regional Development Fund (FEDER; grants PI18/01055 and PI21/00713), as well as the CIBERSAM, the Basque Government (grant 2022111054), and the University of the Basque Country (grant IT1631-22). RR-J acknowledges the support of the Madrid Regional Government (R&D activities in Biomedicine S2022/BMD-7216 (AGES 3-CM), and CIBERSAM-ISCIII. DG acknowledges the support of the Instituto de Salud Carlos III (ISCIII) (PI24/01065) and the European Commission (HORIZON-HLTH-2023-ENVHLTH-02/ID:101137256). CA was supported by the Alicia Koplowitz Foundation. AY-R is funded by a predoctoral contract (PFIS FI/00162) from the Carlos III Health Institute. RA-A was funded by a Miguel Servet contract from the Carlos III Health Institute (CP18/00003) and a Consolidator Grant from the Spanish Ministry of Science and Innovation (CNS2022-136110).
Ethics approval and consent to participate
This study was conducted in accordance with the Declaration of Helsinki. Ethical approval was obtained from the Ethics Committee of the Hospital Universitario Marqués de Valdecilla (approval code: PI20/00066) and from the corresponding ethics committees of each participating site. Written informed consent was obtained from all participants prior to study enrollment.
Competing interests
RR-J has served as a consultant, has given lectures, or has received funding from Janssen-Cilag, Lundbeck, Otsuka, Pfizer, Ferrer, Juste, Takeda, Exeltis, Casen-Recordati, Angelini, and Rovi. These relationships are unrelated to the present work.
Comments
No Comments have been published for this article.