Screening for a “trifecta” of executive function patterns in a large cohort of individuals with Parkinson’s disease

Rachel N. Schade; Katie Rodriguez; Lauren E. Kenney; Adrianna M. Ratajska; Kelly D. Foote; Justin D. Hilliard; Michael S. Okun; Dawn Bowers

doi:10.1017/S1355617725101239

Screening for a “trifecta” of executive function patterns in a large cohort of individuals with Parkinson’s disease

Published online by Cambridge University Press: 18 August 2025

Rachel N. Schade

Katie Rodriguez

Lauren E. Kenney

Adrianna M. Ratajska ,

Michael S. Okun and

Rachel N. Schade*: Affiliation:
Department of Clinical and Health Psychology, University of Florida, Gainesville, FL, USA.
Katie Rodriguez: Affiliation:
Department of Clinical and Health Psychology, University of Florida, Gainesville, FL, USA.
Lauren E. Kenney: Affiliation:
Department of Clinical and Health Psychology, University of Florida, Gainesville, FL, USA.
Adrianna M. Ratajska: Affiliation:
Department of Clinical and Health Psychology, University of Florida, Gainesville, FL, USA.
Kelly D. Foote: Affiliation:
Department of Neurosurgery, Norman Fixel Institute for Neurological Diseases, University of Florida Health, Gainesville, FL, USA.
Justin D. Hilliard: Affiliation:
Department of Neurosurgery, Norman Fixel Institute for Neurological Diseases, University of Florida Health, Gainesville, FL, USA.
Michael S. Okun: Affiliation:
Department of Neurology, Norman Fixel Institute for Neurological Diseases, University of Florida Health, Gainesville, FL, USA.
Dawn Bowers: Affiliation:
Department of Clinical and Health Psychology, University of Florida, Gainesville, FL, USA. Department of Neurology, Norman Fixel Institute for Neurological Diseases, University of Florida Health, Gainesville, FL, USA.
*: Corresponding author: Rachel N. Schade, email: rschade1@ufl.edu

Article contents

Abstract
Objective:
Methods:
Results:
Conclusion:
Statement of Research Significance
Introduction
Materials and methods
Statistical analyses
Results
Discussion
Funding statement
Competing interests
References

Rights & Permissions

Abstract

Objective:

This study examined three neurocognitive patterns or “clinical pearls” historically viewed as evidence for executive dysfunction in Parkinson disease (PD): 1) letter < category fluency; 2) word list < story delayed recall; 3) word list delayed recall < recognition. The association between intraindividual magnitudes of each neuropsychological pattern and individual performance on traditional executive function tests was examined.

Methods:

A clinical sample of 772 individuals with PD underwent neuropsychological testing including tests of verbal fluency, word list/story recall, recognition memory, and executive function. Raw scores were demographically normed (Heaton) and converted to z-scores for group-level analyses.

Results:

Letter fluency performance was worse than category fluency (d = −0.12), with 28% of participants showing a discrepancy of ≥ −1.0 SD. Delayed recall of a list was markedly poorer than story recall (d = −0.86), with 52% of the sample exhibiting ≥ −1.0 SD deficits. Lastly, delayed free recall was worse than recognition memory (d = −0.25), with 24% showing a discrepancy of ≥ −1.0 SD. These patterns did not consistently correlate with executive function scores. The word list < story recall pattern was more common in earlier than later PD stages and durations.

Conclusion:

Among the three pearls, the most pronounced was stronger memory performance on story recall than word lists, observed in more than half the sample. Only ¼ the participants exhibited all three neurocognitive patterns simultaneously. The variability in patterns across individuals highlights the heterogeneity of cognitive impairment in PD and suggests that intra-individual comparisons may offer a more nuanced insight into cognitive functioning.

Keywords

Neuropsychology Parkinson’s disease cognition executive function neuropsychological tests memory

Information

Type: Research Article
Information: Journal of the International Neuropsychological Society , First View , pp. 1 - 10

DOI: https://doi.org/10.1017/S1355617725101239 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of International Neuropsychological Society

Statement of Research Significance

Research Question(s) or Topic(s):

This study aimed to explore whether three cognitive patterns, or “clinical pearls,” of executive dysfunction could be confirmed in a large group of people with Parkinson’s disease (PD). The focus was three key cognitive pearls: letter < category fluency, delayed recall of a word list < story recall, and delayed recall < recognition.

Main Findings:

Statistically, the findings supported these patterns on average. However, significant variation was seen across individuals and the effect sizes were small for two pearls. This variability highlights the complexity of cognitive changes in PD.

Study Contributions:

This research emphasizes the need to carefully evaluate common patterns of cognitive weaknesses in Parkinson’s disease before routinely using them in clinical settings. These patterns may serve as supportive indicators of executive dysfunction in PD and should be a part of a more comprehensive cognitive assessment when identifying executive function problems in individuals with PD.

Introduction

Clinical decision-making in neuropsychological assessment often involves two complementary interpretive approaches. The first examines absolute impairment, typically through comparison to normative data, though this method has limitations related to individual variability and contextual factors. A more individualized approach assesses change relative to estimated or documented premorbid functioning, capturing meaningful decline that may not meet traditional impairment thresholds. The second approach focuses on intra-individual patterns of strengths and weaknesses across cognitive domains, aiding in the differentiation of global versus selective deficits. Together, these interpretive strategies enhance diagnostic precision and inform targeted intervention planning.

In this study, we examined whether individual differences in three specific neuropsychological patterns, commonly observed in Parkinson’s disease (PD), are present in a large PD sample and how these patterns relate to executive functioning. Executive dysfunction is one of the most well-recognized cognitive sequelae of PD, often emerging early due to dopamine depletion and degeneration within frontal-subcortical circuits, particularly the associative loop connecting the basal ganglia and prefrontal cortex (PFC) (Alexander et al., Reference Alexander, DeLong and Strick1986; Brown et al., Reference Brown, Hakun, Lewis, De Jesus, Du, Eslinger, Kong and Huang2023; Hirano, Reference Hirano2021). The PFC itself consists of multiple subregions that may be differentially affected in PD, contributing to varied profiles of executive dysfunction (Foerde & Shohamy, Reference Foerde and Shohamy2011). Despite substantial functional overlap among these regions, differences in vulnerability likely underlie the considerable heterogeneity observed in executive dysfunction across individuals with PD (Arrigoni et al., Reference Arrigoni, Antoniotti, Bellocchio, Veronelli, Corbo and Pisoni2024; Devignes et al., Reference Devignes, Lopes and Dujardin2022; Kehagia et al., Reference Kehagia, Barker and Robbins2010). While traditional, standard executive function tests are used in neuropsychological assessments to gauge cognitive impairment in PD (e.g., Wisconsin Card Sorting Test, WCST) (Heaton & Staff, Reference Heaton and Staff1993), previous research has identified certain performance patterns that may indirectly reflect executive dysfunction in PD and offer additional interpretive value beyond traditional executive tasks.

One neuropsychological pattern researched in PD is the differential performance on letter versus category fluency tasks (Azuma et al., Reference Azuma, Cruz, Bayles, Tomoeda and Montgomery2003; Pettit et al., Reference Pettit, McCarthy, Davenport and Abrahams2013). Both types of verbal fluency have been localized to regions within the frontal and temporal lobes, but not equally: letter fluency is more dependent on frontal executive processes, whereas category fluency relies more heavily on temporal-lobe-based semantic retrieval (Troyer et al., Reference Troyer, Moscovitch, Winocur, Alexander and Stuss1998). This dissociation is supported by findings that frontal lobe damage disproportionately impairs letter fluency, while temporal lobe damage – as seen in Alzheimer’s disease – more strongly affects category fluency (Monsch et al., Reference Monsch, Bondi, Butters, Paulsen, Salmon, Brugger and Swenson1994; Rosser & Hodges, Reference Rosser and Hodges1994; Troyer et al., Reference Troyer, Moscovitch, Winocur, Alexander and Stuss1998). Structural and functional neuroimaging support this distinction, with letter fluency activating primarily frontal regions and category fluency involving both frontal and temporo-parietal areas (Tupak et al., Reference Tupak, Badewien, Dresler, Hahn, Ernst, Herrmann, Fallgatter and Ehlis2012; Vonk et al., Reference Vonk, Rizvi, Lao, Budge, Manly, Mayeux and Brickman2018). In PD, even without dementia, frontal dysfunction is thought to underlie worse performance on letter fluency compared to category fluency (Azuma et al., Reference Azuma, Bayles, Cruz, Tomoeda, Wood, McGeagh and Montgomery1997). However, findings have been mixed, with some studies reporting equivalent or even greater impairment in category fluency – often in older studies lacking cognitive stratification or normative comparison (Auriacombe et al., Reference Auriacombe, Grossman, Carvell, Gollomp, Stern and Hurtig1993; Azuma et al., Reference Azuma, Bayles, Cruz, Tomoeda, Wood, McGeagh and Montgomery1997; Henry & Crawford, Reference Henry and Crawford2004).

A second observed cognitive pattern in PD is a performance discrepancy in delayed recall between word list and story memory tasks. Differences in these tasks are also thought to relate to frontal versus temporal localization of functions. Delayed recall of stories benefits from built in organization through semantically-related information, which is largely supported by temporal lobe structures (Helmstaedter et al., Reference Helmstaedter, Wietzke and Lutz2009; Lezak, Reference Lezak2004; Tremont et al., Reference Tremont, Halpert, Javorsky and Stern2000). In contrast, word-list learning and recall rely more on self-generated organizational strategies of semantically related or unrelated words, which are more dependent on frontal lobe executive functions (Broadway et al., Reference Broadway, Rieger, Campbell, Quinn, Mayer, Yeo, Wilson, Gill, Fratzke and Cavanagh2019; Kopelman & Stanhope, Reference Kopelman and Stanhope1998; Tremont et al., Reference Tremont, Halpert, Javorsky and Stern2000). As Cummings (Reference Cummings1990) noted, individuals with subcortical dementias like PD are thought to perform better with structured information, whereas individuals with cortical dementias may not show a difference in performance with structured vs unstructured information recall. Accordingly, individuals with PD typically show greater impairment on word list recall due to executive dysfunction, while story recall remains relatively preserved (Helkala et al., Reference Helkala, Laulumaa, Soininen and Riekkinen1989; O’Brien et al., Reference O’Brien, Wadley, Nicholas, Stover, Watts and Griffith2009; Zahodne et al., Reference Zahodne, Bowers, Price, Bauer, Nisenzon, Foote and Okun2011).

The third cognitive pattern concerns the difference between delayed free recall and recognition memory. Poor performance on both often suggests encoding or storage deficits, associated with a “cortical” memory profile. Conversely, impaired recall with preserved recognition has been viewed as a “retrieval deficit,” suggesting intact storage but difficulty accessing information without cues – a pattern often described in PD (Cummings, Reference Cummings1990). Squire, Wixted, and Clark (Reference Squire, Wixted and Clark2007) challenged this view however, noting that intact recognition does not equate to intact encoding or storage as recognition may rely on familiarity-based processes that demand less robust encoding than the recollection for recall. Thus, preserved recognition can occur with shallow encoding, complicating the attribution of poor recall solely to retrieval deficits. Indeed, studies have found both encoding deficits and impaired recognition in PD (Beatty et al., Reference Beatty, Ryder, Gontkovsky, Scott, McSwan and Bharucha2003; Hartikainen et al., Reference Hartikainen, Helkala, Soininen and Riekkinen1993; Higginson et al., Reference Higginson, Wheelock, Carroll and Sigvardt2005; Tanner et al., Reference Tanner, Mareci, Okun, Bowers, Libon and Price2015). Nonetheless, the more typical pattern in PD involves impaired delayed recall with relatively preserved recognition (E. Helkala et al., Reference Helkala, Laulumaa, Soininen and Riekkinen1988), reflecting frontostriatal dysfunction affecting both encoding strategies and retrieval, rather than medial temporal lobe-related storage impairments (Carlesimo et al., Reference Carlesimo, Taglieri, Zabberoni, Scalici, Peppe, Caltagirone and Costa2022; Weintraub et al., Reference Weintraub, Moberg, Culbertson, Duda and Stern2004).

Although these three neuropsychological performance patterns have been frequently described in the literature, findings have been inconsistent - likely due to differences in methodologies and small sample sizes. Despite inconsistent empirical support, this “trifecta” of patterns has been described anecdotally in clinical practice and may be informally referenced as “clinical pearls” to support hypotheses of executive dysfunction. We clarify that our use of the term “trifecta” refers to commonly seen and practically useful patterns in clinical work with individuals with PD, not to formally established or universally accepted diagnostic criteria. The informal use of these patterns highlights the need for stronger empirical validation. Therefore, the primary aim of this study is to evaluate each of the three neuropsychological patterns in a large sample of individuals with PD. We also aimed to quantify the extent to which this “trifecta” of patterns co-occurred within individuals with PD. Specifically, we aimed to assess the percent of patients who exhibit these patterns overall, examine the degree of severity in the observed differences in test performance, and determine whether clinically meaningful differences are common within these neuropsychological patterns. Lastly, we explored the relationship between this “trifecta” and other executive function tests and other PD characteristics.

Materials and methods

Design

This study involved a retrospective chart review of individuals diagnosed with idiopathic PD who were seen at the University of Florida (UF) Health Norman Fixel Institute for Neurological Diseases. Study procedures were approved by the UF Institutional Review Board, with informed consent obtained in accordance with the Declaration of Helsinki and University and Federal standards. Data included demographic, clinical, and neuropsychological information.

Participants

Participants included a clinical convenience sample of 772 individuals with idiopathic PD drawn from a large IRB-approved prospectively acquired clinical-research database (INFORM). Most participants were candidates for deep brain stimulation (DBS) surgery, indicating that motor symptoms were sufficiently bothersome and not well controlled by medication management. Inclusion criteria included: 1) a diagnosis of idiopathic PD made by a fellowship-trained movement disorders specialist and 2) neuropsychological evaluation between 2002 and 2022. Exclusion criteria included: 1) previous brain surgery (e.g., deep brain stimulation, pallidotomy); 2) history of epilepsy, stroke, brain injury, or other neurological diagnosis with ongoing cognitive sequela; and 3) evidence of significant cognitive impairment based on scores below 125 on the Dementia Rating Scale-2 (DRS-2; Johnson-Greene, Reference Johnson-Greene2004). Demographic and clinical data were obtained from the UF INFORM clinical-research database.

Clinical measures

All participants underwent a comprehensive neuropsychological evaluation, including a cognitive screener and measures to assess functioning in multiple cognitive domains. The specific tests drawn from the full neuropsychological assessment for this study are highlighted in Table 1, as well as the raw scores used for analyses. Raw scores were converted to normed z-scores based on test-specific manuals or previously published norms (Heaton, Reference Heaton2004). Self-report mood and motivation scales were also included as part of a standardized neuropsychological battery as listed in Table 1.

Table 1. Neuropsychological tests and self-report measures within each cognitive domain composite

Note: WMS-III = Wechsler Memory Scale-Version III (Wechsler, Reference Wechsler1997); HVLT-R = Hopkin’s Verbal Learning Test-Revised (Brandt, Reference Brandt1991); Letter Fluency (FAS) (Tombaugh et al., Reference Tombaugh, Kozak and Rees1999); Stroop Test is the Golden version (Golden, Reference Golden1978); TMT-B = Trail Making Test Part B (Reitan, Reference Reitan1992); Wisconsin Carding Sorting Test-64 (hand administration) (Kongs et al., Reference Kongs, Thompson, Iverson and Heaton2000); Category Fluency (Animals) (Tombaugh et al., Reference Tombaugh, Kozak and Rees1999); BDI-II (Beck et al., Reference Beck, Steer and Brown1996; Leentjens et al., Reference Leentjens, Verhey, Luijckx and Troost2000), AS (Leentjens et al., Reference Leentjens, Dujardin, Marsh, Martinez-Martin, Richard, Starkstein, Weintraub, Sampaio, Poewe and Rascol2008; Starkstein et al., Reference Starkstein, Mayberg, Preziosi, Andrezejewski, Leiguarda and Robinson1992), STAI (Knight et al., Reference Knight, Waal-Manning and Spears1983; Spielberger, Reference Spielberger1983).

All participants were “on” dopaminergic medication as part of routine clinical care. Most participants received PD-specific scales for disease staging (Hoehn & Yahr staging) and for gauging motor severity in response to dopamine medications (Unified Parkinson Disease Rating Scale-Part III) (Fahn & Elton, Reference Fahn and Elton1987; Goetz et al., Reference Goetz, Poewe, Rascol, Sampaio, Stebbins, Counsell, Giladi, Holloway, Moore and Wenning2004). UPDRS motor scores should be interpreted in the context of a pre-DBS cohort, in which many participants were referred due to suboptimal response to medication. As such, scores may not fully capture treatment efficacy.

Statistical analyses

All analyses were conducted using SPSS Version 28 (IBM Corp, 2021). Data were examined for normality and outliers. Paired-sample t-tests were used to compare performance within each neuropsychological pearl, and we calculated the percentage of participants showing lower performance (z-difference <0) and clinically meaningful differences at z-score thresholds of ≤ −0.5, ≤ −1.0, and ≤ −1.5, representing increasing levels of severity in intra-individual performance discrepancies. This method, commonly used in the absence of anchor-based criteria, defines minimal clinically important difference as a change of ≥|0.5| SD and provides a standardized way to approximate meaningful intra-individual change. While clinical significance also depends on patient perceptions and score distributions, this approach offers a practical framework for interpretation.

We defined a meaningful difference of −1.0 standard deviation between tests, consistent with procedures for interpreting subtest discrepancies outlined in comprehensive assessments, including the Wechsler Adult Intelligence Scale–Fourth Edition. While comparison of separate tests is not the same as comparing subtests, this procedure aligns with evidence that such differences often exceed what is expected due to measurement error or normal variability, supporting their relevance for identifying significant cognitive strengths or weaknesses. For participants meeting the ≥ −1.0 SD threshold, we used independent t-tests and Chi-square tests to determine if there were demographic or clinical differences between those who did and did not show each neuropsychological pattern. FDR corrections (Benjamini & Hochberg, Reference Benjamini and Hochberg1995) were applied to control for multiple comparisons in analyses comparing demographic and clinical variables between those who did and did not exhibit a ≥ 1.0 SD difference within each cognitive pattern.

Lastly, associations between neuropsychological pearls and executive function were explored via correlations and Chi-squared tests in a sub-sample of participants (N = 548) who had received three “executive” tasks assessing executive processes including cognitive inhibition (Stroop Color-Word (Golden, Reference Golden1978), speeded set-shifting (Trail Making Test, Part B; (Reitan, Reference Reitan1992), and novel problem solving (Wisconsin Card Sorting Test-64 cards (Kongs et al., Reference Kongs, Thompson, Iverson and Heaton2000), total errors). The z-score difference for each individual pearl was correlated with z-score performance on each classic task and an executive function z-score composite, as described in methods. Z-scores from each task were derived from test manuals and averaged to create a total executive function composite z-score. For each neuropsychological pattern, we correlated the z-score discrepancy (within-subject test pair difference) with individual executive task z-scores and a composite executive function z-score (calculated by averaging the three task specific z-scores). All correlation analyses were FDR corrected (Benjamini & Hochberg, Reference Benjamini and Hochberg1995). The multiple correlational analyses between were FDR corrected. Exploratory Chi-squared analyses examined whether the prevalence of each neuropsychological pattern differed across PD clinical characteristics, including disease duration (categorized as early [ ≤ 5 years] vs. late [>5 years]) and Hoehn & Yahr disease stage.

Results

Sample characteristics

Demographic characteristics, scores on cognitive testing, and disease-related measures of the sample (n = 772) are depicted in Table 2. Overall, participants were largely non-Hispanic White (93.6%), male (72%), well-educated (15.2 ± 2.7), and in their mid-60s (65.0 ± 9.3). Most had tremor-dominant PD at the time of diagnosis with motor symptoms well-controlled with medication (Martínez-Martín et al., Reference Martínez-Martín, Rodríguez-Blázquez, Alvarez, Arakaki, Arillo, Chaná, Fernández, Garretto, Martínez-Castrillo, Rodríguez-Violante, Serrano-Dueñas, Ballesteros, Rojo-Abuin, Chaudhuri and Merello2015). Motor severity, based on Hoehn and Yahr staging (subset, n = 540), ranged from stage 1 to 5 (mean = 2.35, SD = 0.62): Stage 1 (n = 13), 1.5 (n = 8), 2 (n = 289), 2.5 (n = 98), 3 (n = 108), 4 (n = 17), and 5 (n = 7). (Table 3)

Table 2. Sample demographic, clinical, and cognitive (z-score) characteristics

Note: UPDRS III = Unified Parkinson’s Disease Rating Scale motor scale, BDI-II = Beck Depression Inventory-II, STAI = State Trait Anxiety Inventory, WMS-III = Wechsler Memory Scale-Version III (Wechsler, Reference Wechsler1997); HVLT-R = Hopkin’s Verbal Learning Test-Revised (Brandt, Reference Brandt1991); Letter Fluency (FAS) (Tombaugh et al., Reference Tombaugh, Kozak and Rees1999); Category Fluency (Animals) (Tombaugh et al., Reference Tombaugh, Kozak and Rees1999); BDI-II (Beck et al., Reference Beck, Steer and Brown1996; Leentjens et al., Reference Leentjens, Verhey, Luijckx and Troost2000), AS (Leentjens et al., Reference Leentjens, Dujardin, Marsh, Martinez-Martin, Richard, Starkstein, Weintraub, Sampaio, Poewe and Rascol2008; Starkstein et al., Reference Starkstein, Mayberg, Preziosi, Andrezejewski, Leiguarda and Robinson1992), STAI (Knight et al., Reference Knight, Waal-Manning and Spears1983; Spielberger, Reference Spielberger1983). Cognitive tests scores provided are z-scores as described in methods.

Table 3. Sample average performance on executive measures (z-scores)

Note: Stroop Test is the Golden version (Golden, Reference Golden1978); Trail Making Test Part B (Reitan, Reference Reitan1992); Wisconsin Carding Sorting Test-64 (hand administration) (Kongs et al., Reference Kongs, Thompson, Iverson and Heaton2000).

Clinical pearls

Pearl 1: letter fluency vs. Category fluency

Letter fluency (z = −0.45 ± 1.1) was significantly lower than category fluency (z = −0.29 ± 1.2), though with small effect size (t (765) = −3.4, p = 0.001, Cohen’s d = −0.12). Among the 765 participants with both scores, 53% (n = 408) performed worse on letter than category fluency. Differences of ≤ −0.5 SD, ≤ −1.0 SD, and ≤ −1.5 SD were observed in 40% (n = 310), 28% (n = 212), and 16% (n = 120), respectively. Conversely, 44% (n = 333) performed better on letter fluency, with 28% (n = 214), 17% (n = 133), and 9% (n = 66) showing positive differences of ≥0.5 SD, ≥1.0 SD, and ≥1.5 SD, respectively.

Individuals who showed a ≤ −1.0 SD difference on Pearl 1 were younger at the time of testing (t (764) = 2.6, p = 0.01, d = 0.21) and had fewer years of education (t (764) = 3.2, p = 0.001, d = 0.26). No differences emerged in sex, race/ethnicity, DRS-2 scores, UPDRS motor scores, or self-reported mood symptoms after FDR-corrections (Table 4).

Table 4. Descriptive characteristics between individuals with and without at least a -1.0 standard deviation difference in performance on pearls 1-3

Note: Z refers to z-score difference between tests within each neuropsychological pearl (Pearl 1 = Letter-Category Fluency; Pearl 2 = HVLT-R Delay - WMS-III LM Delay; Pearl 3 = HVLT-R Delay – HVLT-R Recognition).

Pearl 2: delayed word list recall vs. Story recall

Delayed recall of the Hopkins Verbal Learning Test-Revised (HVLT-R) word list (z = −0.93 ± 1.3) was significantly worse than delayed story recall from the WMS-III Logical Memory test (z = 0.15 ± 1.1), with a large effect size (t (752) = 23.7, p < 0.001, Cohen’s d = −0.86). Deficits of ≤ −0.5 SD, ≤ −1.0 SD, and ≤ −1.5 SD were found in 68% (n = 512), 52% (n = 388), and 34% (n = 254) of the 752 individuals with both scores, respectively. In contrast, 17% (n = 130) performed better on list recall, with differences of ≥0.5 SD (n = 79, 11%), ≥1.0 SD (n = 32, 4%), and ≥1.5 SD (n = 10, 1%).

Those with a ≤ −1.0 SD difference on Pearl 2 were more often male (χ²(1, n = 753) = 13.0, p < 0.001). No differences emerged in age, education, sex, race/ethnicity, DRS-2 scores, UPDRS motor scores, or self-reported mood symptoms after FDR-corrections (Table 4).

Pearl 3: delayed recall vs. Recognition discrimination

Delayed recall performance on the HVLT-R (z = −0.93 ± 1.3) was significantly worse than recognition discrimination (calculated as the number of true positives minus false positives) on the yes-no recognition trial (z = −0.65 ± 1.2; t (724) = −6.7, p < 0.001) with small effect size (Cohen’s d = −0.25). Differences of ≤ −0.5 SD, ≤ −1.0 SD, and ≤ −1.5 SD were observed in 39% (n = 286), 24% (n = 172), and 14% (n = 105) of the sample respectively. Conversely, 40% (n = 291) performed better on delayed recall, with 21% (n = 151), 9% (n = 67), and 4% (n = 29) showing higher scores by ≥0.5, ≥1.0, and ≥1.5 SD, respectively.

Group-level analysis revealed that individuals with a ≤ −1.0 SD difference on Pearl 3 were significantly younger (t (723) = 2.8, p = 0.006; Cohen’s d = 0.26) compared to individuals who performed better than −1.0 SD (Table 4). There were no other group differences in demographics of PD clinical variables after FDR-corrections.

Co-occurrence of executive function trifecta

In addition to calculating the percentage of our sample displaying each neurocognitive pearl (Figure 1), we examined the co-occurrence of these three neuropsychological pearls within our sample (n = 716). For consistency, we used a z-score of difference of −1.0 between the two tests as a “yes” or “no” variable to indicate whether they did or did not demonstrate each cognitive pattern. Participants were grouped into all possible permutations, which came to 8 different groups, and we examined the frequency of the different groups (Figure 2). The largest percentage of the sample (28.3%, n = 203) did not demonstrate any of the three cognitive patterns. The second most frequent co-occurrence was for Pearl 1 and 2 (25.5%, n = 183), which appeared to be driven largely by Pearl 2 as 14.2% (n = 102) of this sample demonstrated this pattern only.

Figure 1. Cumulative percentage of the sample (n = 772) demonstrating z-score differences for each neuropsychological pearl.

Figure 2. Frequency of co-occurrence of a trifecta of neuropsychological patterns (n = 716) with −1.0 SD z-score difference.

Comparison to classic executive function tests

A subset of 548 participants completed three executive function tests, the Stroop Color-Word test (average z = −0.34 ± 1.0, range = −3.00 to 3.00), Trail Making Test Part B (TMT-B; average z = −0.95 ± 1.4, range = −3.00 to 3.00), and Wisconsin Card Sorting Test (WCST) total errors (average z = −0.58 ± 1.2, range = −3.10 to 2.50) (Table 3). The distribution of scores across each of these tasks reflects a range from impaired to strong performances. To explore the relationship between performance on these executive function tests and the presence of neuropsychological pearls, we first conducted FDR-corrected correlations between the z-score differences within each pearl, z-score performance on each classic task, and the executive function z-score composite, as described in methods. Only Pearl 3 demonstrated a significant, albeit small, correlation with executive function composite scores (r = 0.132, p = 0.003), suggesting limited overlap.

With this sample subset, we then categorized participants into three groups based on their executive function composite scores: a “low” executive function group (z ≤ −1.0) (n = 148), a “within normal limits” (WNL) group (−1.0<z<1.0) (n = 376), and a “high” executive function group (z ≥1.0) (n = 24). Chi-square tests revealed no significant differences in the proportion of individuals displaying a≥1.0 SD discrepancy in any of the three pearls among the low, WNL, and high executive groups. These findings suggest that traditional executive tasks did not predict the neuropsychological patterns observed in this sample.

Relationship of PD Variables to Executive Function Trifecta

Given that executive dysfunction often emerges early in PD, we explored whether disease duration or severity was associated with the presence of neuropsychological pearls. Disease duration (years since symptom onset) was unrelated to the executive function composite and each neuropsychological pearl’s z-score difference, suggesting that longer disease duration was not associated with greater likelihood of these patterns. When participants were divided into two groups based on disease duration: “early” (≤5 years) and “late” (>5 years), chi-square analyses revealed that a greater percentage of individuals in the early group exhibited Pearl 2 compared to the late group (X²(1, n = 741) = 4.0, p = 0.044).

We also examined pearl prevalence across stages of disease severity using Hoehn & Yahr (H&Y) ratings by dividing participants into “early-stage” (H&Y 1–2) and “mid-stage” (H&Y 2.5–3). Individuals in H&Y stages 4–5 (N = 23) were excluded due to small sample size. Chi-squared tests revealed more early-stage participants demonstrated Pearl 2 compared to mid-stage participants (X² (1, n = 506) = 3.9, p = 0.049). Together, these results suggest that worse delayed recall of unstructured information compared to structured information is more common in the earlier stages and durations of PD. This may reflect the early impact of executive dysfunction, which may later be overshadowed by broader impairments in memory or other cognitive domains as the disease progresses.

Discussion

This study sought to learn whether a “trifecta” of previously identified neuropsychological patterns could be validated in a large cohort of individuals with Parkinson’s disease. The focus was three key cognitive pearls: letter fluency < category fluency, delayed recall of a word list < story recall, and delayed recall < recognition. Statistically, our results supported these hypothesized cognitive patterns. On average, individuals with PD performed statistically worse on letter fluency, delayed recall of a word list, and delayed recall compared to recognition.

When going beyond statistical significance of average scores, our results revealed that the “trifecta” of neuropsychological patterns or “clinical pearls” were not uniformly present, and many did not show clinically meaningful differences between tests within a given neuropsychological pattern (Figure 1, Figure 2). Indeed, only Pearl 2 (word list vs. story memory) showed a large effect size with at least half the sample showing a clinically meaningful difference (−1SD) in performance. The other two patterns yielded small effect sizes, and only a minority exhibited differences of one standard deviation or greater. This variability suggests that while certain cognitive patterns may emerge at the group level, individual differences in cognitive decline and neuropsychological profiles in Parkinson’s disease are substantial, underscoring the need for individualized assessment.

We further explored how these patterns related to broader executive functioning (Stroop Color-Word, TMT B, WCST). There was marked individual variability in performance across these executive function tasks, in line with recent reports about differing rates of cognitive decline (Fereshtehnejad et al., Reference Fereshtehnejad, Moqadam, Azizi, Postuma, Dadar, Lang, Marras and Zeighami2025). Pearls were either weakly or unrelated to performance on classic executive function tests. This discrepancy may reflect differences in task-specific demands, the multifaceted nature of executive function (i.e., different neural substrates involved in different tasks), and the varied use of compensatory strategies or cognitive reserve. Furthermore, stratifying participants by executive function status revealed no significant differences in the prevalence of any pearl, except Pearl 3 (recall < recognition), which was significantly correlated with executive performance. Notably, Pearl 2 (word list vs story) was more prevalent in individuals with shorter disease duration and lower Hoehn & Yahr stage, suggesting it may be a more prominent early disease marker that diminishes with broader cognitive decline as more diffuse or non-frontal cognitive deficits arise.

Our findings, based on what is perhaps the largest sample of individuals with PD to date, align with the broader literature in emphasizing the complex and heterogeneous nature of cognitive impairments in PD. Prior studies have yielded mixed results, with both confirmative and contradictory findings regarding neuropsychological patterns, particularly in verbal fluency. For example, there are at least eight studies that found worse performance on letter vs category fluency (Barbosa et al., Reference Barbosa, Voos, Chen, Francato, Souza, Barbosa, Chien and Mansur2017; Bayles et al., Reference Bayles, Trosset, Tomoeda, Montgomery and Wilson1993; Gabrieli et al., Reference Gabrieli, Singh, Stebbins and Goetz1996; Galtier et al., Reference Galtier, Nieto, Lorenzo and Barroso2017; Jaywant et al., Reference Jaywant, Musto, Neargarder, Stavitsky Gilbert and Cronin-Golomb2014; Monsch et al., Reference Monsch, Bondi, Butters, Paulsen, Salmon, Brugger and Swenson1994; Rosser & Hodges, Reference Rosser and Hodges1994; Suhr & Jones, Reference Suhr and Jones1998; Troyer et al., Reference Troyer, Moscovitch, Winocur, Alexander and Stuss1998). In contrast, over ten other studies have found the opposite, with better performance on category compared to letter fluency (Auriacombe et al., Reference Auriacombe, Grossman, Carvell, Gollomp, Stern and Hurtig1993; Beatty et al., Reference Beatty, Staton, Weir, Monson and Whitaker1989; Koerts et al., Reference Koerts, Meijer, Colman, Tucha, Lange and Tucha2013; Matison et al., Reference Matison, Mayeux, Rosen and Fahn1982; Raskin et al., Reference Raskin, Sliwinski and Borod1992) or no difference between the two types of fluency at all (Azuma et al., Reference Azuma, Bayles, Cruz, Tomoeda, Wood, McGeagh and Montgomery1997; Dadgar et al., Reference Dadgar, Khatoonabadi and Bakhtiyari2013; Flowers et al., Reference Flowers, Robertson and Sheridan1995; Gurd & Ward, Reference Gurd and Ward1989; Hanlly et al., Reference Hanlly, Dewick, Davies, Playeer and Turnbull1990; McDowd et al., Reference McDowd, Hoffman, Rozek, Lyons, Pahwa, Burns and Kemper2011; Obeso et al., Reference Obeso, Ray, Antonelli, Cho and Strafella2011; Piatt et al., Reference Piatt, Fields, Paolo, Koller and Tröster1999; Troyer et al., Reference Troyer, Moscovitch, Winocur, Alexander and Stuss1998). Notably, a meta-analysis of verbal fluency performance in PD showed more impairment on category fluency than letter, though both were found to be related to psychomotor speed more than executive dysfunction (Henry & Crawford, Reference Henry and Crawford2004), which has been corroborated in at least two other studies (Koerts et al., Reference Koerts, Meijer, Colman, Tucha, Lange and Tucha2013; McDowd et al., Reference McDowd, Hoffman, Rozek, Lyons, Pahwa, Burns and Kemper2011). Some of the aforementioned studies also did not directly compare letter vs category fluency performance within a group, only noting that both verbal fluencies were impaired compared to controls (Azuma et al., Reference Azuma, Bayles, Cruz, Tomoeda, Wood, McGeagh and Montgomery1997; Dadgar et al., Reference Dadgar, Khatoonabadi and Bakhtiyari2013; Gurd & Ward, Reference Gurd and Ward1989; Obeso et al., Reference Obeso, Ray, Antonelli, Cho and Strafella2011; Troyer et al., Reference Troyer, Moscovitch, Winocur, Alexander and Stuss1998). Methodological differences, such as the number of fluency trials, the specific letter or categories tested, and the norming standards use, may account for inconsistent findings across studies. In the current study, participants on average performed worse on three letter fluency trials compared to a single trial of category fluency, which raises multiple possible considerations. One possibility is that the cumulative demand of multiple trials, especially on a task potentially more reliant on executive control, may reveal subtle impairments that a single trial cannot capture. Alternatively, poorer performance across three trials may reflect fatigue or difficulty maintaining verbal output under increasing cognitive load. has shown that in healthy older adults, category fluency typically declines more with age than letter fluency (Gladsjo et al., Reference Gladsjo, Schuman, Evans, Peavy, Miller and Heaton1999) suggesting that the opposite pattern found in our sample is unlikely to be explained by normative aging effects alone. Moreover, both letter and category fluency tasks were co-normed and standardized using the revised Heaton norms, allowing for a direct, demographically adjusted comparison in performance.

At least three studies have demonstrated worse word list delayed memory compared to spared delayed recall of stories in PD (Hartikainen et al., Reference Hartikainen, Helkala, Soininen and Riekkinen1993; Lafo et al., Reference Lafo, Jones, Okun, Bauer, Price and Bowers2015; Zahodne et al., Reference Zahodne, Bowers, Price, Bauer, Nisenzon, Foote and Okun2011) and in individuals with “significant” executive dysfunction (Tremont et al., Reference Tremont, Halpert, Javorsky and Stern2000). A similar study found worse performance on short delay recall of a word list in a group “with executive dysfunction” compared to the group without executive dysfunction, but no difference in long delay (Brooks et al., Reference Brooks, Weaver and Scialfa2006). One PD study did not find a difference in delayed recall performance between list and stories using the Repeatable Battery for the Assessment of Neuropsychological Status (RBANS), but statistical analyses were only conducted using a difference score, rather than directly comparing list and story performances (Beatty et al., Reference Beatty, Ryder, Gontkovsky, Scott, McSwan and Bharucha2003). These mixed findings may in part reflect differences in test sensitivity and normative frameworks across studies. For instance, the California Verbal Learning Test (CVLT) includes more trials, structured semantic categories, and greater demands on learning and retrieval compared to other word list tasks like the HVLT, potentially making it more sensitive to executive dysfunction. Differences in demographic norming between the HVLT-R and the WMS-III Logical Memory may have also influenced performance classification. However, the study by Zahodne and colleagues demonstrated the same pattern (poorer performance on a word list versus stories) when both memory tasks were co-normed together from the same reference group (i.e., Weschler Memory). Nevertheless, discrepancies in memory performance patterns may be shaped not only by underlying cognitive deficits but also by the psychometric properties and normative context of the measures used.

Lastly, at least five studies have provided evidence supporting the third clinical pearl, namely impaired verbal delayed recall with relatively spared recognition (Auriacombe et al., Reference Auriacombe, Grossman, Carvell, Gollomp, Stern and Hurtig1993; Brooks et al., Reference Brooks, Weaver and Scialfa2006; Taylor et al., Reference Taylor, Saint-Cyr and Lang1986, Reference Taylor, Saint-Cyr and Lang1990). Conversely, six studies found similar impairments in delayed recall and recognition compared to controls suggesting additional dysfunction in encoding (Beatty et al., Reference Beatty, Ryder, Gontkovsky, Scott, McSwan and Bharucha2003; Brønnick et al., Reference Brønnick, Alves, Aarsland, Tysnes and Larsen2011; Hartikainen et al., Reference Hartikainen, Helkala, Soininen and Riekkinen1993; Higginson et al., Reference Higginson, Wheelock, Carroll and Sigvardt2005; Tanner et al., Reference Tanner, Mareci, Okun, Bowers, Libon and Price2015). However, most of these studies did not directly compare performances in the PD cohort alone (Beatty et al., Reference Beatty, Ryder, Gontkovsky, Scott, McSwan and Bharucha2003; Brønnick et al., Reference Brønnick, Alves, Aarsland, Tysnes and Larsen2011; Hartikainen et al., Reference Hartikainen, Helkala, Soininen and Riekkinen1993; Higginson et al., Reference Higginson, Wheelock, Carroll and Sigvardt2005; Tanner et al., Reference Tanner, Mareci, Okun, Bowers, Libon and Price2015). Further support for encoding and retrieval difficulties comes from a meta-analysis showing impaired delayed recall and recognition in PD, even in the absence of dementia, though with low effect sizes (Whittington et al., Reference Whittington, Podd and Kan2000). Importantly, Squire and colleagues have cautioned against interpreting preserved recognition as definitive evidence of intact encoding or storage, as recognition memory can often rely on familiarity-based processes that do not require deep or elaborative encoding (Squire et al., Reference Squire, Wixted and Clark2007). Thus, relatively intact recognition can occur even when encoding is shallow, complicating the inference that poor recall reflects a pure retrieval deficit. Moreover, effective retrieval of episodic information is thought to reactivate encoding engrams, underscoring the interdependence of these processes and the challenges in using neuropsychological tests to disentangle them cleanly (Alvarez & Squire, Reference Alvarez and Squire1994; Nyberg et al., Reference Nyberg, Habib, McIntosh and Tulving2000; Squire & Kandel, Reference Squire and Kandel2000).

Overall, the conflicting research findings are likely a result of the interplay of methodological and sample-related factors. Possible methodological differences include the smaller sample sizes, determination of dementia based on varying criteria, use of different neuropsychological tests and process scores to assess memory and verbal learning, and use of varied statistical methods. Sample characteristics may also account for discrepancies observed. Smaller studies, particularly earlier studies, may be constrained by a narrower range of demographic (e.g., age) or PD characteristics (e.g., motor severity, disease duration) and likely lack the statistical power to capture the full diversity of cognitive profiles. In our study, participants’ age ranged from 30–90, which is an exceptionally large age range compared to many studies that focus on older adults with PD. It is believed that young onset PD individuals tend to have less severe motor progression and cognitive impact compared to late onset PD (Diederich et al., Reference Diederich, Moore, Leurgans, Chmura and Goetz2003; Pagano et al., Reference Pagano, Ferrara, Brooks and Pavese2016; Santos-García et al., Reference Santos-García, de Deus Fonticoba, Cores Bartolomé, Feal Painceiras, García Díaz, Íñiguez Alvarado, Paz, Jesús, Cosgaya and García Caldentey2023). As a result, the inclusion of younger individuals in our sample may have diluted the overall prevalence or severity of neuropsychological differences typically observed in older PD cohorts. Additionally, our sample demonstrated relatively mild motor severity and a wide range of executive dysfunction overall, which may also explain the lower frequency of robust or clinically meaningful differences in performance across the patterns we examined. It is also likely that at least some individuals have co-occurring neuropathology like Alzheimer’s disease or limbic-predominant age-related TDP-43 encephalopathy, which could also be impacting the presenting neurocognitive profiles (Fan et al., Reference Fan, Liu and Wu2021; Nelson et al., Reference Nelson, Dickson, Trojanowski, Jack, Boyle, Arfanakis, Rademakers, Alafuzoff, Attems, Brayne, Coyle-Gilchrist, Chui, Fardo, Flanagan, Halliday, Hokkanen, Hunter, Jicha, Katsumata and Schneider2019). Lastly, few studies have examined clinically meaningful differences in neuropsychological patterns to determine the percentage of individuals who display a certain pattern. Our findings suggest that the broader age range and larger sample size employed here allowed for a potentially more comprehensive representation of cognitive variability.

This study has several limitations. First, the use of a convenience sample – primarily individuals undergoing neuropsychological evaluations for DBS candidacy – introduces selection bias. These patients often represent a specific subset of PD (e.g., tremor dominant, less cognitively impaired), limiting generalizability. Second, we lacked data on cognitive diagnoses (e.g., amnestic vs. non-amnestic MCI), which precluded analyses by MCI subtype. Third, we used test-specific normative data consistent with clinical practice, which introduced variation in comparison groups – particularly affecting Pearl 2. The sample was also predominantly non-Hispanic White, well-educated individuals, reducing. This not only significantly reduces applicability of these findings to more diverse sociocultural populations, but future studies in more diverse groups would be potentially limited by the specific normative groups themselves, as these lack consideration of individuals from diverse backgrounds and sociocultural factors that can influence cognitive performance in meaningful ways (Byrd & Rivera-Mindt, Reference Byrd and Rivera-Mindt2022). Additionally, all PD participants were assessed while “on” their standard dopaminergic medications, but their medication usage was not formally tracked during the 2–3-hour evaluation. Variability in medication levels, including potential “wearing off” effects or concerns with excessive dopamine (e.g., overdose hypothesis), may have influenced cognitive performance and should be considered when interpreting the results. Lastly, there is limited research on direct comparisons of tests and common neurocognitive patterns in healthy, cognitively intact older adults. Future studies should prioritize exploring typical patterns of relative strengths and weaknesses in aging to better contextualize findings in PD and other neurodegenerative conditions.

Our findings suggest that incorporating pattern-based interpretation – focusing on within-person variability and relative cognitive strengths and weaknesses – may provide more nuanced insights into the cognitive changes associated with Parkinson’s disease. However, given the lack of robust findings, this study underscores the importance of critically evaluating commonly cited neuropsychological patterns of relative impairment – especially those derived from small or older studies. The presence or absence of these patterns alone should not be viewed as a definitive indicator of cognitive status or diagnosis. Instead, when assessing executive dysfunction in PD, such patterns should be interpreted within the broader context of a comprehensive neuropsychological evaluation.

Funding statement

Funding was provided by the National Institute of Health: T32-NS082168, T32-AG061892, F31-NS131000, F31-AG081047, UF Fixel Institute of Neurological Diseases.

Competing interests

The authors declare that there are no conflicts of interest relevant to this work.

References

Alexander, G. E., DeLong, M. R., & Strick, P. L. (1986). Parallel organization of functionally segregated circuits linking basal ganglia and cortex. Annual Review of Neuroscience, 9, 357–381.10.1146/annurev.ne.09.030186.002041CrossRef Google Scholar PubMed

Alvarez, P., & Squire, L. R. (1994). Memory consolidation and the medial temporal lobe: A simple network model. Proceedings of The National Academy of Sciences of The United States of America, 91, 7041–7045.10.1073/pnas.91.15.7041CrossRef Google Scholar PubMed

Arrigoni, E., Antoniotti, P., Bellocchio, V., Veronelli, L., Corbo, M., & Pisoni, A. (2024). Neural alterations underlying executive dysfunction in Parkinson’s disease: A systematic review and coordinate-based meta-analysis of functional neuroimaging studies. Ageing Research Reviews, 95, 102207.10.1016/j.arr.2024.102207CrossRef Google Scholar PubMed

Auriacombe, S., Grossman, M., Carvell, S., Gollomp, S., Stern, M. B., & Hurtig, H. I. (1993). Verbal fluency deficits in Parkinson’s disease. Neuropsychology, 7, 182.10.1037/0894-4105.7.2.182CrossRef Google Scholar

Azuma, T., Bayles, K. A., Cruz, R. F., Tomoeda, C. K., Wood, J. A., McGeagh, A., & Montgomery, E. B. (1997). Comparing the difficulty of letter, semantic, and name fluency tasks for normal elderly and patients with Parkinson’s disease. Neuropsychology, 11, 488–497.10.1037/0894-4105.11.4.488CrossRef Google Scholar PubMed

Azuma, T., Cruz, R. F., Bayles, K. A., Tomoeda, C. K., & Montgomery, E. B. Jr. (2003). A longitudinal study of neuropsychological change in individuals with Parkinson’s disease. International Journal of Geriatric Psychiatry, 18, 1043–1049.10.1002/gps.1015CrossRef Google Scholar PubMed

Barbosa, A. F., Voos, M. C., Chen, J., Francato, D. C. V., Souza, C.de O., Barbosa, E. R., Chien, H. F., & Mansur, L. L. (2017). Cognitive or cognitive-motor executive function tasks? Evaluating verbal fluency measures in people with Parkinson’s disease. BioMed Research International, 2017, 7893975.10.1155/2017/7893975CrossRef Google Scholar PubMed

Bayles, K. A., Trosset, M. W., Tomoeda, C. K., Montgomery, E. B. Jr., & Wilson, J. (1993). Generative naming in Parkinson disease patients. Journal of Clinical and Experimental Neuropsychology, 15, 547–562.10.1080/01688639308402578CrossRef Google Scholar PubMed

Beatty, W. W., Ryder, K. A., Gontkovsky, S. T., Scott, J. G., McSwan, K. L., & Bharucha, K. J. (2003). Analyzing the subcortical dementia syndrome of Parkinson’s disease using the RBANS. Archives of Clinical Neuropsychology: The Official Journal of the National Academy of Neuropsychologists, 18, 509–520.Google Scholar PubMed

Beatty, W. W., Staton, R. D., Weir, W. S., Monson, N., & Whitaker, H. A. (1989). Cognitive disturbances in Parkinson’s disease. Journal of Geriatric Psychiatry and Neurology, 2, 22–33.10.1177/089198878900200106CrossRef Google Scholar PubMed

Beck, A. T., Steer, R. A., & Brown, G. K. (1996). Manual for the beck depression inventory-II (10–1037). Psychological Corporation.Google Scholar

Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B (Methodological), 57, 289–300.10.1111/j.2517-6161.1995.tb02031.xCrossRef Google Scholar

Brandt, J. (1991). The Hopkins verbal learning test: Development of a new memory test with six equivalent forms. The Clinical Neuropsychologist, 5, 125–142.10.1080/13854049108403297CrossRef Google Scholar

Broadway, J. M., Rieger, R. E., Campbell, R. A., Quinn, D. K., Mayer, A. R., Yeo, R. A., Wilson, J. K., Gill, D., Fratzke, V., & Cavanagh, J. F. (2019). Executive function predictors of delayed memory deficits after mild traumatic brain injury. Cortex, 120, 240–248.10.1016/j.cortex.2019.06.011CrossRef Google Scholar PubMed

Brønnick, K., Alves, G., Aarsland, D., Tysnes, O.-B., & Larsen, J. P. (2011). Verbal memory in drug-naive, newly diagnosed Parkinson’s disease. The retrieval deficit hypothesis revisited.. The retrieval deficit hypothesis revisited. Neuropsychology, 25, 114–124.Google Scholar PubMed

Brooks, B. L., Weaver, L. E., & Scialfa, C. T. (2006). Does impaired executive functioning differentially impact verbal memory measures in older adults with suspected dementia? The Clinical Neuropsychologist, 20, 230–242.10.1080/13854040590947461CrossRef Google Scholar PubMed

Brown, G., Hakun, J., Lewis, M. M., De Jesus, S., Du, G., Eslinger, P. J., Kong, L., & Huang, X. (2023). Frontostriatal and limbic contributions to cognitive decline in Parkinson’s disease. Journal of Neuroimaging, 33, 121–133.10.1111/jon.13045CrossRef Google Scholar PubMed

Byrd, D. A., & Rivera-Mindt, M. G. (2022). Neuropsychology’s race problem does not begin or end with demographically adjusted norms. Nature Reviews Neurology, 18, 125–126.10.1038/s41582-021-00607-4CrossRef Google Scholar PubMed

Carlesimo, G. A., Taglieri, S., Zabberoni, S., Scalici, F., Peppe, A., Caltagirone, C., & Costa, A. (2022). Subjective organization in the episodic memory of individuals with Parkinson’s disease associated with mild cognitive impairment. Journal of Neuropsychology, 16, 161–182.10.1111/jnp.12256CrossRef Google Scholar PubMed

Cummings, J. L. (1990). Subcortical dementia. Oxford University Press.Google Scholar

Dadgar, H., Khatoonabadi, A. R., & Bakhtiyari, J. (2013). Verbal fluency performance in patients with non-demented Parkinson’s disease. Iranian Journal of Psychiatry, 8, 55.Google Scholar PubMed

Devignes, Q., Lopes, R., & Dujardin, K. (2022). Neuroimaging outcomes associated with mild cognitive impairment subtypes in Parkinson’s disease: A systematic review. Parkinsonism & Related Disorders, 95, 122–137.10.1016/j.parkreldis.2022.02.006CrossRef Google Scholar PubMed

Diederich, N. J., Moore, C. G., Leurgans, S. E., Chmura, T. A., & Goetz, C. G. (2003). Parkinson disease with old-age onset: A comparative study with subjects with middle-age onset. Archives of Neurology, 60(4), 529–533.10.1001/archneur.60.4.529CrossRef Google Scholar PubMed

Fahn, S., & Elton, R. (1987). UPDRS program members. Unified Parkinsons disease rating scale. Recent Developments in Parkinson’s Disease, 2, 153–163.Google Scholar

Fan, T. S., Liu, S. C. H., & Wu, R. M. (2021). Alpha-synuclein and cognitive decline in Parkinson disease. Life, 11, 1239.10.3390/life11111239CrossRef Google Scholar PubMed

Fereshtehnejad, S., Moqadam, R., Azizi, H., Postuma, R. B., Dadar, M., Lang, A. E., Marras, C., & Zeighami, Y. (2025),, Distinct Longitudinal Clinical-Neuroanatomical Trajectories in Parkinson’s Disease Clinical Subtypes: Insight toward Precision Medicine. Movement Disorders. Movement Disorders.10.1002/mds.30229CrossRef Google Scholar

Flowers, K. A., Robertson, C., & Sheridan, M. R. (1995). Some characteristics of word fluency in Parkinson’s disease. Journal of Neurolinguistics, 9, 33–46.10.1016/0911-6044(95)00004-6CrossRef Google Scholar

Foerde, K., & Shohamy, D. (2011). The role of the basal ganglia in learning and memory: Insight from Parkinson’s disease. Neurobiology of Learning and Memory, 96, 624–636.10.1016/j.nlm.2011.08.006CrossRef Google Scholar PubMed

Gabrieli, J. D., Singh, J., Stebbins, G. T., & Goetz, C. G. (1996). Reduced working memory span in Parkinson’s disease: Evidence for the role of frontostriatal system in working and strategic memory. Neuropsychology, 10, 322.10.1037/0894-4105.10.3.321CrossRef Google Scholar

Galtier, I., Nieto, A., Lorenzo, J. N., & Barroso, J. (2017). Mild cognitive impairment in Parkinson’s disease: Clustering and switching analyses in verbal fluency test. Journal of the International Neuropsychological Society, 23, 511–520.10.1017/S1355617717000297CrossRef Google Scholar PubMed

Gladsjo, J. A., Schuman, C. C., Evans, J. D., Peavy, G. M., Miller, S. W., & Heaton, R. K. (1999). Norms for Letter and Category Fluency: Demographic Corrections for Age, Education, and Ethnicity. Assessment, 6, 147–178.10.1177/107319119900600204CrossRef Google Scholar PubMed

Goetz, C. G., Poewe, W., Rascol, O., Sampaio, C., Stebbins, G. T., Counsell, C., Giladi, N., Holloway, R. G., Moore, C. G., & Wenning, G. K. (2004). Movement disorder society task force report on the Hoehn and yahr staging scale: Status and recommendations the movement disorder society task force on rating scales for Parkinson’s disease. Movement Disorders, 19, 1020–1028.10.1002/mds.20213CrossRef Google Scholar

Golden, C. J. (1978). Stroop Color and Word Test. Stoelting Company.Google Scholar

Gurd, J. M., & Ward, C. D. (1989). Retrieval from semantic and letter-initial categories in patients with Parkinson’s disease. Neuropsychologia, 27, 743–746.10.1016/0028-3932(89)90120-6CrossRef Google Scholar PubMed

Hanlly, J., Dewick, H., Davies, A., Playeer, J., & Turnbull, C. (1990). Verbal fluency in Parkinson’s disease. Neuropsychologia, 28, 737–741.10.1016/0028-3932(90)90129-CCrossRef Google Scholar

Hartikainen, P., Helkala, E. L., Soininen, H., & Riekkinen, P. (1993). Cognitive and memory deficits in untreated Parkinson’s disease and amyotrophic lateral sclerosis patients: A comparative study. Journal of Neural Transmission-Parkinson’s Disease and Dementia Section, 6, 127–137.10.1007/BF02261006CrossRef Google Scholar PubMed

Heaton, R. K. (2004). Revised comprehensive norms for an expanded Halstead-Reitan Battery: Demographically adjusted neuropsychological norms for African American and Caucasian adults, professional manual. Psychological Assessment Resources.Google Scholar

Heaton, R. K., & Staff, P. (1993). Wisconsin card sorting test: Computer version 2, (pp. 1–4). Psychological Assessment Resources.Google Scholar

Helkala, E., Laulumaa, V., Soininen, H., & Riekkinen, P. J. (1988). Recall and recognition memory in patients with Alzheimer’s and Parkinson’s diseases. Annals of Neurology: Official Journal of the American Neurological Association and the Child Neurology Society, 24, 214–217.10.1002/ana.410240207CrossRef Google Scholar PubMed

Helkala, E.-L., Laulumaa, V., Soininen, H., & Riekkinen, P. J. (1989). Different error pattern of episodic and semantic memory in Alzheimer’s disease and Parkinson’s disease with dementia. Neuropsychologia, 27, 1241–1248.10.1016/0028-3932(89)90036-5CrossRef Google Scholar PubMed

Helmstaedter, C., Wietzke, J., & Lutz, M. T. (2009). Unique and shared validity of the, Wechsler logical memory test”, the “California verbal learning test”, and the “verbal learning and memory test, in patients with epilepsy. Epilepsy Research, 87, 203–212.10.1016/j.eplepsyres.2009.09.002CrossRef Google Scholar

Henry, J. D., & Crawford, J. R. (2004). Verbal fluency deficits in Parkinson’s disease: A meta-analysis. Journal of the International Neuropsychological Society, 10, 608–622.10.1017/S1355617704104141CrossRef Google Scholar PubMed

Higginson, C. I., Wheelock, V. L., Carroll, K. E., & Sigvardt, K. A. (2005). Recognition memory in Parkinson’s disease with and without dementia: Evidence inconsistent with the retrieval deficit hypothesis. Journal of Clinical and Experimental Neuropsychology, 27, 516–528.10.1080/13803390490515469CrossRef Google Scholar PubMed

Hirano, S. (2021). Clinical implications for dopaminergic and functional neuroimage research in cognitive symptoms of Parkinson’s disease. Molecular Medicine, 27, 40.10.1186/s10020-021-00301-7CrossRef Google Scholar PubMed

IBM Corp (2021), IBM SPSS Statistics for Windows (Version 28.0) IIBM Corp. [Computer software].Google Scholar

Jaywant, A., Musto, G., Neargarder, S., Stavitsky Gilbert, K., & Cronin-Golomb, A. (2014). The effect of Parkinson’s disease subgroups on verbal and nonverbal fluency. Journal of Clinical and Experimental Neuropsychology, 36, 278–289.10.1080/13803395.2014.889089CrossRef Google Scholar PubMed

Johnson-Greene, D. (2004). Dementia Rating Scale-2 (DRS-2) By P.J. Jurica, C.L. Leitten, and S. Mattis: Psychological Assessment Resources, 2001. Archives of Clinical Neuropsychology, 19, 145–147.10.1016/j.acn.2003.07.003CrossRef Google Scholar

Kehagia, A. A., Barker, R. A., & Robbins, T. W. (2010). Neuropsychological and clinical heterogeneity of cognitive impairment and dementia in patients with Parkinson’s disease. The Lancet Neurology, 9, 1200–1213.10.1016/S1474-4422(10)70212-XCrossRef Google Scholar PubMed

Knight, R. G., Waal-Manning, H. J., & Spears, G. F. (1983). Some norms and reliability data for the state-trait anxiety inventory and the zung self-rating depression scale. British Journal of Clinical Psychology, 22, 245–249.10.1111/j.2044-8260.1983.tb00610.xCrossRef Google Scholar PubMed

Koerts, J., Meijer, H. A., Colman, K. S., Tucha, L., Lange, K. W., & Tucha, O. (2013). What is measured with verbal fluency tests in Parkinson’s disease patients at different stages of the disease? Journal of Neural Transmission, 120, 403–411.10.1007/s00702-012-0885-9CrossRef Google Scholar PubMed

Kongs, S. K., Thompson, L. L., Iverson, G. L., & Heaton, R. K. (2000). WCST-64: Wisconsin Card Sorting Test-64 Card Version, Professional Manual. PAR. https://books.google.com/books?id=3cONPgAACAAJ Google Scholar

Kopelman, M. D., & Stanhope, N. (1998). Recall and recognition memory in patients with focal frontal, temporal lobe and diencephalic lesions. Neuropsychologia, 36, 785–796.10.1016/S0028-3932(97)00167-XCrossRef Google Scholar PubMed

Lafo, J. A., Jones, J. D., Okun, M. S., Bauer, R. M., Price, C. C., & Bowers, D. (2015). Memory similarities between essential tremor and Parkinson’s disease: A final common pathway? The Clinical Neuropsychologist, 29, 985–1001.10.1080/13854046.2015.1118553CrossRef Google Scholar PubMed

Leentjens, A. F., Dujardin, K., Marsh, L., Martinez-Martin, P., Richard, I. H., Starkstein, S. E., Weintraub, D., Sampaio, C., Poewe, W., & Rascol, O. (2008). Apathy and anhedonia rating scales in Parkinson’s disease: Critique and recommendations. Movement Disorders: Official Journal of the Movement Disorder Society, 23, 2004–2014.10.1002/mds.22229CrossRef Google Scholar PubMed

Leentjens, A. F., Verhey, F. R., Luijckx, G., & Troost, J. (2000). The validity of the beck depression inventory as a screening and diagnostic instrument for depression in patients with Parkinson’s disease. Movement Disorders: Official Journal of the Movement Disorder Society, 15, 1221–1224.10.1002/1531-8257(200011)15:6<1221::AID-MDS1024>3.0.CO;2-H3.0.CO;2-H>CrossRef Google Scholar PubMed

Lezak, M. D. (2004). Neuropsychological assessment. Oxford University Press.Google Scholar

Martínez-Martín, P., Rodríguez-Blázquez, C., Alvarez, Mario, Arakaki, T., Arillo, V. C., Chaná, P., Fernández, W., Garretto, N., Martínez-Castrillo, J. C., Rodríguez-Violante, M., Serrano-Dueñas, M., Ballesteros, D., Rojo-Abuin, J. M., Chaudhuri, K. R., & Merello, M. (2015). Parkinson’s disease severity levels and MDS-unified Parkinson’s disease rating scale. Parkinsonism & Related Disorders, 21, 50–54.10.1016/j.parkreldis.2014.10.026CrossRef Google Scholar PubMed

Matison, R., Mayeux, R., Rosen, J., & Fahn, S. (1982). Tip-of-the-tongue, phenomenon in Parkinson disease. Neurology, 32, 567–567.10.1212/WNL.32.5.567CrossRef Google Scholar PubMed

McDowd, J., Hoffman, L., Rozek, E., Lyons, K. E., Pahwa, R., Burns, J., & Kemper, S. (2011). Understanding verbal fluency in healthy aging, Alzheimer’s disease, and Parkinson’s disease. Neuropsychology, 25, 210.10.1037/a0021531CrossRef Google Scholar PubMed

Monsch, A. U., Bondi, M. W., Butters, N., Paulsen, J. S., Salmon, D. P., Brugger, P., & Swenson, M. R. (1994). A comparison of category and letter fluency in Alzheimer’s disease and huntington’s disease. Neuropsychology, 8, 25.10.1037/0894-4105.8.1.25CrossRef Google Scholar

Nelson, P. T., Dickson, D. W., Trojanowski, J. Q., Jack, C. R., Boyle, P. A., Arfanakis, K., Rademakers, R., Alafuzoff, I., Attems, J., Brayne, C., Coyle-Gilchrist, I. T. S., Chui, H. C., Fardo, D. W., Flanagan, M. E., Halliday, G., Hokkanen, S. R. K., Hunter, S., Jicha, G. A., Katsumata, Y.…Schneider, J. A. (2019). Limbic-predominant age-related TDP-43 encephalopathy (LATE): Consensus working group report. Brain, 142, 1503–1527.10.1093/brain/awz099CrossRef Google Scholar PubMed

Nyberg, L., Habib, R., McIntosh, A. R., & Tulving, E. (2000). Reactivation of encoding-related brain activity during memory retrieval. Proceedings of the National Academy of Sciences of the United States of America, 97, 11120–11124.10.1073/pnas.97.20.11120CrossRef Google Scholar PubMed

O’Brien, T. J., Wadley, V., Nicholas, A. P., Stover, N. P., Watts, R., & Griffith, H. R. (2009). The contribution of executive control on verbal-learning impairment in patients with Parkinson’s disease with dementia and Alzheimer’s disease. Archives of Clinical Neuropsychology, 24, 237–244.10.1093/arclin/acp029CrossRef Google Scholar PubMed

Obeso, I., Ray, N. J., Antonelli, F., Cho, S. S., & Strafella, A. P. (2011). Combining functional imaging with brain stimulation in Parkinson’s disease. International Review of Psychiatry (Abingdon, England), 23, 467–475.10.3109/09540261.2011.621414CrossRef Google Scholar PubMed

Pagano, G., Ferrara, N., Brooks, D. J., & Pavese, N. (2016). Age at onset and Parkinson disease phenotype. Neurology, 86, 1400.10.1212/WNL.0000000000002461CrossRef Google Scholar PubMed

Pettit, L., McCarthy, M., Davenport, R., & Abrahams, S. (2013). Heterogeneity of letter fluency impairment and executive dysfunction in Parkinson’s disease. Journal of the International Neuropsychological Society, 19, 986–994.10.1017/S1355617713000829CrossRef Google Scholar PubMed

Piatt, A. L., Fields, J. A., Paolo, A. M., Koller, W. C., & Tröster, A. I. (1999). Lexical, semantic, and action verbal fluency in Parkinson’s disease with and without dementia. Journal of Clinical and Experimental Neuropsychology, 21, 435–443.10.1076/jcen.21.4.435.885CrossRef Google Scholar PubMed

Raskin, S. A., Sliwinski, M., & Borod, J. C. (1992). Clustering strategies on tasks of verbal fluency in Parkinson’s disease. Neuropsychologia, 30, 95–99.10.1016/0028-3932(92)90018-HCrossRef Google Scholar PubMed

Reitan, R. M. (1992). Trail Making Test: Manual for Administration and Scoring. Reitan Neuropsychology Laboratory.Google Scholar

Rosser, A., & Hodges, J. R. (1994). Initial letter and semantic category fluency in Alzheimer’s disease, Huntington’s disease, and progressive supranuclear palsy. Journal of Neurology, Neurosurgery & Psychiatry, 57, 1389–1394.10.1136/jnnp.57.11.1389CrossRef Google Scholar PubMed

Santos-García, D., de Deus Fonticoba, T., Cores Bartolomé, C., Feal Painceiras, M. J., García Díaz, I., Íñiguez Alvarado, M. C., Paz, J. M., Jesús, S., Cosgaya, M., & García Caldentey, J. (2023). Cognitive impairment and dementia in young onset Parkinson’s disease. Journal of Neurology, 270, 5793–5812.10.1007/s00415-023-11921-wCrossRef Google Scholar PubMed

Spielberger, C. D. (1983). State-Trait Anxiety Inventory for Adults (STAI-AD) [Database record]. APA PsycTests. https://doi.org/10.1037/t06496-000.CrossRef Google Scholar

Squire, L. R., & Kandel, E. (2000). Memory. From Mind to Molecules. Owl Books.Google Scholar

Squire, L. R., Wixted, J. T., & Clark, R. E. (2007). Recognition memory and the medial temporal lobe: a new perspective. Nature Reviews Neuroscience, 8, 872–883.10.1038/nrn2154CrossRef Google Scholar PubMed

Starkstein, S. E., Mayberg, H. S., Preziosi, T., Andrezejewski, P., Leiguarda, R., & Robinson, R. (1992). Reliability, validity, and clinical correlates of apathy in Parkinson’s disease. J Neuropsychiatry Clin Neurosci, 4, 134–139.Google Scholar PubMed

Suhr, J. A., & Jones, R. (1998). Letter and semantic fluency in Alzheimer’s, Huntington’s, and Parkinson’s dementias. Archives of Clinical Neuropsychology, 13, 447–454.10.1093/arclin/13.5.447CrossRef Google Scholar PubMed

Tanner, J. J., Mareci, T. H., Okun, M. S., Bowers, D., Libon, D. J., & Price, C. C. (2015). Temporal lobe and frontal-subcortical dissociations in non-demented Parkinson’s disease with verbal memory impairment. PloS One, 10, e0133792.10.1371/journal.pone.0133792CrossRef Google Scholar PubMed

Taylor, A. E., Saint-Cyr, J., & Lang, A. (1990). Memory and learning in early Parkinson’s disease: Evidence for a, frontal lobe syndrome. Brain and Cognition, 13, 211–232.10.1016/0278-2626(90)90051-OCrossRef Google Scholar PubMed

Taylor, A. E., Saint-Cyr, J. A., & Lang, A. E. (1986). Frontal lobe dysfunction in Parkinson’s disease: The cortical focus of neostriatal outflow. Brain, 109, 845–883.10.1093/brain/109.5.845CrossRef Google Scholar PubMed

Tombaugh, T. N., Kozak, J., & Rees, L. (1999). Normative data stratified by age and education for two measures of verbal fluency: FAS and animal naming. Archives of Clinical Neuropsychology, 14, 167–177.Google Scholar PubMed

Tremont, G., Halpert, S., Javorsky, D. J., & Stern, R. A. (2000). Differential impact of executive dysfunction on verbal list learning and story recall. The Clinical Neuropsychologist, 14, 295–302.10.1076/1385-4046(200008)14:3;1-P;FT295CrossRef Google Scholar PubMed

Troyer, A. K., Moscovitch, M., Winocur, G., Alexander, M. P., & Stuss, D. (1998). Clustering and switching on verbal fluency: The effects of focal frontal- and temporal-lobe lesions. Neuropsychologia, 36, 499–504.10.1016/S0028-3932(97)00152-8CrossRef Google Scholar PubMed

Tupak, S. V., Badewien, M., Dresler, T., Hahn, T., Ernst, L. H., Herrmann, M. J., Fallgatter, A. J., & Ehlis, A. C. (2012). Differential prefrontal and frontotemporal oxygenation patterns during phonemic and semantic verbal fluency. Neuropsychologia, 50, 1565–1569.10.1016/j.neuropsychologia.2012.03.009CrossRef Google Scholar PubMed

Vonk, J. M. J., Rizvi, B., Lao, P. J., Budge, M., Manly, J. J., Mayeux, R., & Brickman, A. M. (2018). Letter and category fluency performance correlates with distinct patterns of cortical thickness in older adults. Cerebral Cortex (New York, NY), 29, 2694.Google Scholar

Wechsler, D. (1997). Wechsler adult intelligence scale. Frontiers in Psychology.Google Scholar

Weintraub, D., Moberg, P. J., Culbertson, W. C., Duda, J. E., & Stern, M. B. (2004). Evidence for impaired encoding and retrieval memory profiles in Parkinson disease. Cognitive and Behavioral Neurology, 17, 195–200.Google Scholar PubMed

Whittington, C. J., Podd, J., & Kan, M. M. (2000). Recognition memory impairment in Parkinson’s disease: Power and meta-analyses. Neuropsychology, 14, 233–246.10.1037/0894-4105.14.2.233CrossRef Google Scholar PubMed

Zahodne, L. B., Bowers, D., Price, C. C., Bauer, R. M., Nisenzon, A., Foote, K. D., & Okun, M. S. (2011). The case for testing memory with both stories and word lists prior to DBS surgery for Parkinson’s disease. The Clinical Neuropsychologist, 25, 348–358.10.1080/13854046.2011.562869CrossRef Google Scholar PubMed

Table 1. Neuropsychological tests and self-report measures within each cognitive domain composite

Table 2. Sample demographic, clinical, and cognitive (z-score) characteristics

Table 3. Sample average performance on executive measures (z-scores)

Table 4. Descriptive characteristics between individuals with and without at least a -1.0 standard deviation difference in performance on pearls 1-3

Figure 1. Cumulative percentage of the sample (n = 772) demonstrating z-score differences for each neuropsychological pearl.

Figure 2. Frequency of co-occurrence of a trifecta of neuropsychological patterns (n = 716) with −1.0 SD z-score difference.

Article contents

Screening for a “trifecta” of executive function patterns in a large cohort of individuals with Parkinson’s disease

Abstract

Keywords

Information

Statement of Research Significance

Introduction

Materials and methods

Design

Participants

Clinical measures

Statistical analyses

Results

Sample characteristics

Clinical pearls

Pearl 1: letter fluency vs. Category fluency

Pearl 2: delayed word list recall vs. Story recall

Pearl 3: delayed recall vs. Recognition discrimination

Co-occurrence of executive function trifecta

Comparison to classic executive function tests

Relationship of PD Variables to Executive Function Trifecta

Discussion

Funding statement

Competing interests

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests