Hostname: page-component-68c7f8b79f-lqrcg Total loading time: 0 Render date: 2025-12-18T04:58:55.724Z Has data issue: false hasContentIssue false

The Danish Need for Recovery Scale: Spanish Validation and Cultural Adaptation of a Full Form and a Short Form

Published online by Cambridge University Press:  12 December 2025

Francisco A. Burgos-Julián
Affiliation:
Departamento de Psicología, Universidad Rey Juan Carlos, Alcorcón, Spain Servicio de Psicología, Guardia Civil , Madrid, Spain
Cintia Díaz-Silveira
Affiliation:
Departamento de Psicología, Universidad Rey Juan Carlos, Alcorcón, Spain
Raquel Ruiz-Íñiguez
Affiliation:
Departamento de Ciencias de la Salud y Biomédicas, Universidad Loyola —Campus de Sevilla , Spain
Miguel Ángel Santed-German
Affiliation:
Departamento de Psicología de la Personalidad, Evaluación y Tratamiento Psicológico, Universidad Nacional de Educacion a Distancia , Spain
Carlos-María Alcover*
Affiliation:
Departamento de Psicología, Universidad Rey Juan Carlos, Alcorcón, Spain
*
Corresponding author: Carlos-María Alcover; Email: carlosmaria.alcover@urjc.es
Rights & Permissions [Opens in a new window]

Abstract

The increasing levels of job demands and work stress experienced daily by workers in multiple contexts have increased the experience of the need for recovery after work. This study aims to provide validity evidence for the Danish Need for Recovery (NFR) Scale. We analyzed the psychometric properties of the scale, its factor structure, internal consistency, measurement invariance, and validity based on relationships with other variables (job stress, general health, and affective states) in two samples of Spanish workers. The results obtained through exploratory and confirmatory factor analyses strongly support the unidimensional structure of the Spanish version of the NFR in its full eight-item version after eliminating one item not applicable in our cultural context, with good data fit. The values obtained regarding internal consistency, besides measurement invariance across groups, ensure the scale’s reliability and applicability. The validity and associations with other constructs tested were also confirmed in the hypothesized relationships with the respective variables analyzed. However, the short three-item version does not present a better fit compared to the full version in its Spanish validation. The Spanish adaptation of the Danish NFR scale in its full eight-item version constitutes a reliable and valid instrument for the assessment of this construct. The methodological and practical implications of the Spanish version of the NFR scale are detailed in the discussion.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Universidad Complutense de Madrid and Colegio Oficial de la Psicología de Madrid

Introduction

The increasing levels of demand, intensity, and stress that characterize work activity today, accentuated by the changes in work associated with the postpandemic COVID-19 period, have significantly increased the perceptions and negative experiences related to work activity, and with them, the need for recovery from work (De Bloom, Reference De Bloom2020; Wentz et al., Reference Wentz, Gyllensten, Sluiter and Hagberg2020). Work-related fatigue, work stress, personal resources depletion, and work exhaustion are basic processes analyzed by occupational health research for decades (Bennett et al., Reference Bennett, Gabriel, Calderwood, Dahling and Trougakos2016; Van Veldhoven, Reference van Veldhoven, Houdmont and Leka2008), directly related to the processes of recovery from work (Sonnentag et al., Reference Sonnentag, Cheng and Parker2022). The recovery concept refers to “unwinding and restoration processes during which a person’s strain level that has increased as a reaction to a stressor or any other demand returns to its prestressor level” (Sonnentag et al., Reference Sonnentag, Venz and Casper2017, p. 366). Accumulating research (Sonnentag et al., Reference Sonnentag, Stephan, Wendsche, de Bloom, Syrek and Vahle-Hinz2021; Steed et al., Reference Steed, Swider, Keem and Liu2021; Wendsche & Lohmann-Haislah, Reference Wendsche and Lohmann-Haislah2017) has shown that effective recovery from work has positive consequences for well-being and health, as well as continued performance and job and life satisfaction. In contrast, inadequate recovery leads to the reverse consequences.

Difficulty or inability to recover from work is often antecedent to adverse working conditions (de Croon et al., Reference de Croon, Sluiter and Frings-Dresen2006), such as excessive workloads, extended workdays, and work schedules (Jansen et al., Reference Jansen, Kant, van Amelsvoort, Nijhuis and van den Brandt2003); emotional work demands (Xanthopoulou et al., Reference Xanthopoulou, Bakker, Oerlemans and Koszucka2018); low job autonomy or difficulty engaging in job crafting behaviors (Shi et al., Reference Shi, She, Li, Zhang and Niu2021); high-stress levels (Sonnentag, Reference Sonnentag2018); physical occupational activity (Stevens et al., Reference Stevens, Crowley, Rasmussen, Hallman, Mortensen, Nygård and Holtermann2020); or abusive supervision (Tu & Chi, Reference Tu and Chi2023). Unsurprisingly, absence or insufficient recovery from work can have serious consequences for mental health and physical health, both in the short and medium terms (de Croon et al., Reference de Croon, Sluiter and Frings-Dresen2006) and increases the risk of long-term or work-incapacitating illnesses (Peters et al., Reference Peters, Dennerlein, Wagner and Sorensen2022).

One of the indicators for identifying perceptions of difficulties in recovery is the measure of the need for recovery from work, which aims to quantify the difficulties experienced by individuals in achieving effective recovery from work (de Croon et al., Reference de Croon, Sluiter and Frings-Dresen2006). More precisely, the construct of the need for recovery after work describes “the extent that the work task induces a collection of symptoms, characterized by temporary feelings of overload, irritability, social withdrawal, lack of energy for new effort, and reduced performance” (van Veldhoven, Reference van Veldhoven, Houdmont and Leka2008, p. 3). Specifically, in the whole process of strain–stress consequences/effects, need for recovery refers to a “very early stage of a long-term strain process” (Sonnentag & Zijlstra, Reference Sonnentag and Zijlstra2006, p. 331), or in other words, as a “precursor of prolonged fatigue or psychological distress” (Jansen et al., Reference Jansen, Kant and van den Brandt2002, p. 324). The feeling of needing to recover is experienced to a greater extent in the latter part of the workday and especially in the hours immediately after the end of the workday, which in turn can lead to difficulties in effective recovery (Chan et al., Reference Chan, Howard, Eva and Tse2022; van Veldhoven, Reference van Veldhoven, Houdmont and Leka2008; Wentz et al., Reference Wentz, Gyllensten, Sluiter and Hagberg2020). In fact, the higher the level of fatigue and stress experienced, and consequently the greater the need to recover from work, the more the recovery process tends to be deficient and ineffective, an experience that has led to the identification of the so-called the recovery paradox (Sonnentag, Reference Sonnentag2018).

In sum, if the goal of occupational health is to achieve effective recovery from work that reduces the levels of daily stress and the negative consequences of the workers’ work activity and, subsequently, decreases the risk of long-term adverse effects in both their health and future permanence in the labor market (Gommans et al., Reference Gommans, Jansen, Stynen, de Grip and Kant2015), correctly identifying and measuring the need for recovery as a subjective experience before possible recovery activities is crucial in facilitating the effectiveness of these nonwork activities.

The measurement of the need for recovery began in the 1980s in the Netherlands by constructing different scales (van Veldhoven, Reference van Veldhoven, Houdmont and Leka2008). Between 1992 and 1994, this author carried out the refinement of an instrument composed of 11 dichotomous response items (yes/no), with questions on the severity and duration of the symptoms experienced, considered as indicators that the person is not fully recovered from the effort of the work activity (Van Veldhoven & Meijman, Reference Van Veldhoven and Meijman1994). The Need for Recovery (NFR) scale was subsequently translated into English (de Croon et al., Reference de Croon, Sluiter and Frings-Dresen2006; Sluiter et al., Reference Sluiter, van der Beek and Frings-Dresen1999; Van Veldhoven & Broersen, Reference Van Veldhoven and Broersen1999), Portuguese (Moriguchi et al., Reference Moriguchi, Alem, van Veldhoven and Coury2010), Italian (Pace et al., Reference Pace, Lo Cascio, Civilleri, Guzzo, Foddai and Van Veldhoven2013), Taiwanese (Lin et al., Reference Lin, Chen, Hsieh and Chen2015), Danish (Gupta et al., Reference Gupta, Wåhlin-Jacobsen, Abildgaard, Henriksen, Nielsen and Holtermann2018), and French (Dupret et al., Reference Dupret, Bocéréan, Feltrin, Chemolle and Lebon2018). Stevens et al. (Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019) reported that the full version of the NFR exhibits adequate psychometric properties in terms of internal consistency, construct validity, and test–retest reliability. However, some aspects of this work are somewhat unclear. For example, their analyses suggested potential multidimensional aspects depending on the method employed (factor analysis versus IRT), as well as certain limitations regarding the clarity of discriminant validity and the interpretation of confirmatory results. These discrepancies highlight the importance of continuing to evaluate these properties, especially in specific contexts of cultural validation, such as this study.

Also, from the original scale, the Danish NFR scale (Stevens et al., Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019) reduced the original 11 items to 9 items, with a Likert-type measurement of five response anchors (1 = never, 5 = always). These same authors present a short version of the NFR consisting of three items, which is claimed to have good psychometric properties, primarily based on an intraclass correlation coefficient (ICC) value as evidence of criterion validity (Stevens et al., Reference Stevens, Crowley, Rasmussen, Hallman, Mortensen, Nygård and Holtermann2020). However, this approach does not comprehensively address the essential phases of the recommended psychometric validation process, such as the analysis of factorial structure, convergent and discriminant validity, and reliability. In this regard, while the studies by Stevens et al. (Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019, Reference Stevens, Crowley, Rasmussen, Hallman, Mortensen, Nygård and Holtermann2020) represent an initial step in the validation of the NFR, particularly in its shorter version, they exhibit certain methodological limitations that should be addressed in line with best practices described in the literature (Boateng et al., Reference Boateng, Neilands, Frongillo, Melgar-Quiñonez and Young2018), ensuring a more rigorous validation process aligned with international standards. Although other instruments, such as the Maslach Burnout Inventory (MBI; Maslach & Jackson, Reference Maslach and Jackson1981), and other procedures, such as interviews, exist, the NFR was selected for its specificity in assessing recovery after the workday. Unlike the MBI, which measures broader dimensions such as burnout, the NFR focuses exclusively on temporary symptoms of fatigue and disengagement. Furthermore, this instrument has been validated in multiple contexts, and along with advantages such as its ease of administration, it becomes a practical and effective tool.

In general, the Danish NFR scale in its two versions has shown evidence of convergent and discriminant validity through its associations with subjective indicators of fatigue, workability (physical and mental), and exhaustion (i.e., burnout), and high criterion validity with classic work outcomes and health variables, such as perceived health, stress, vitality, anxiety symptoms, and depressive symptoms (Dupret et al., Reference Dupret, Bocéréan, Feltrin, Chemolle and Lebon2018; Moriguchi et al., Reference Moriguchi, Alem, van Veldhoven and Coury2010; Stevens et al., Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019). Although the predictive value of the NFR scale of sickness absence has been proven in some studies (e.g., de Croon et al., Reference de Croon, Sluiter and Frings-Dresen2003), no data are yet available to assess its ability to predict challenging and long-term indicators, such as long-term absenteeism, retirement intention, or disability retirement (Stevens et al., Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019).

To have a reliable and valid measure of the need for recovery from work in Spanish, this study set out to carry out its adaptation and validation in the Spanish working population. For this purpose, the Danish NFR scale (Stevens et al., Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019, Reference Stevens, Crowley, Rasmussen, Hallman, Mortensen, Nygård and Holtermann2020), originally developed in English and composed of nine items, was used in its culturally adapted and translated version, with the five response anchors specified later in the “Method” section.

The purpose of this study is to analyze the factor structure of the Danish NFR using an exploratory factor analysis (EFA) followed by a confirmatory factor analysis (CFA) and then to analyze the internal consistency of the scale and the measurement invariance in two samples on the NFR. Finally, to provide validity evidence based on relationships with other variables, measures of perceived work stress, perceived general health, and positive and negative affect were included.

As already mentioned, there is evidence of a direct relationship between the need for recovery and work stress (Jansen et al., Reference Jansen, Kant and van den Brandt2002; Sonnentag, Reference Sonnentag2018; Sonnentag et al., Reference Sonnentag, Kuttler and Fritz2010). Therefore, it is to be expected that the more work-related stress experienced during the workday, the greater the difficulty in recovering (i.e., the greater the need for recuperation) after the workday is over.

Conversely, previous research has found that an increased need for recovery has a negative relationship with general health (Sluiter et al., Reference Sluiter, van der Beek and Frings-Dresen1999, Reference Sluiter, de Croon, Meijman and Frings-Dresen2003; Wentz et al., Reference Wentz, Gyllensten, Sluiter and Hagberg2020), since this need is a direct indicator of work-related fatigue (Jansen et al., Reference Jansen, Kant and van den Brandt2002), associated with adverse work factors acting as an antecedent. Therefore, a greater need for recuperation after work (which in turn is associated with greater work stress) will result in the individual experiencing poorer general health.

Based on these theoretical relationships, and to examine the validity of the NFR scale through its associations with other constructs, we proposed the following hypothesis:

H1: There will be a positive relationship between the need for recovery and job stress levels, and a negative relationship between the need for recovery and general health levels.

Finally, previous evidence shows that psychological distancing is an antecedent of affective states, both high levels of positive affect (Rodríguez-Muñoz et al., Reference Rodríguez-Muñoz, Sanz-Vergel, Antino, Demerouti and Bakker2018; Sonnentag et al., Reference Sonnentag, Mojza, Binnewies and Scholl2008; Wendsche & Lohmann-Haislah, Reference Wendsche and Lohmann-Haislah2017) and low levels of negative affect (Feuerhahn et al., Reference Feuerhahn, Sonnentag and Woll2014; Sonnentag et al., Reference Sonnentag, Mojza, Binnewies and Scholl2008; Wendsche & Lohmann-Haislah, Reference Wendsche and Lohmann-Haislah2017). These results have been confirmed in both correlational and experimental studies (Sonnentag & Niessen, Reference Sonnentag and Niessen2020), demonstrating the strength of this relationship. Similarly, we postulate that an increased need for recovery will be antecedent to inverse affective states (i.e., high negative affect and low positive affect), as also is postulated by the recovery paradox hypothesis (Sonnentag, Reference Sonnentag2018). Based on this evidence and rationales, and to test the predictive validity of the NFR scale, we postulate the following hypotheses:

H2: There will be a positive relationship between the need for recovery and negative affect levels, and a negative relationship between the need for recovery and positive affect levels.

Method

This validation study was conducted using data from a study composed of two workplace interventions. One was a randomized controlled trial on internal recovery from work in the private sector. The other study was a nonrandomized trial on external recovery from work in the public sector. The study was registered in ClinicalTrial.gov (NCT05557266). Ethical approval was provided by the Ethical Committee for the Universidad Nacional de Educación a Distancia (UNED, Spain), 10-PSI-2021.

Participants

We used a convenience, nonprobability sampling technique to collect the two subsamples. The sample consisted of 445 participants, of whom, 201 (45.17%) belonged to the public sector (Civil Guard) and 244 (54.83%) to the private sector (telecommunications and civil works). The participants’ mean age was 39.18 years (SD = 10.68), with a significant difference between groups in pretest measures: public sector participants had a younger mean age (M = 31.94, SD = 8.40) compared to the private sector (M = 45.15, SD = 8.42), t(443) = −16.48, p < .01. Regarding the NFR scores, the total mean was 22.32 (SD = 5.45), significantly lower in the public sector group (M = 20.37, SD = 5.08) than in the private sector (M = 23.92, SD = 5.23), t(443) = −7.22, p < .001. The effect size was moderate to large, with Cohen’s d = 0.69, 95% CI (0.50, 0.88). Additionally, significant differences were found in the gender distribution between groups: in the public sector, 82.04% of participants were men, while this proportion was 31.38% in the private sector, χ2(1) = 114.64, p < .01. The inclusion criteria for eligible workers was to work office based at least 35 hours per week. Exclusion criteria were to be free of any severe physical or mental pathology.

Instruments

Need for Recovery Scale (NFR; De Croon et al., Reference de Croon, Sluiter and Frings-Dresen2006).

The level of need for recovery was evaluated using the Spanish version of the NFR scale, which, in a preliminary version, consisted of eight items rated on a 5-point Likert scale (1 = never and 5 = always). The total score was obtained by summing up the value of each item. The total score can range from 8 to 40 points. A higher score indicates a higher degree of need for recovery after work. The internal consistency for the original scale was alpha = .87 (van Veldhoven & Broersen, Reference van Veldhoven and Broersen2003).

Due to the characteristics of both the Spanish work context and the occupational activities of the participants in the two samples, one of the nine items of the full NFR scale was eliminated (“I do not normally relax if I have only had one day without work”), since it is usual in Spain to have two consecutive days of rest in the work week (usually the weekend), so this item would not be applicable in most cases. Tables 1 and 2 show the eight items of the Danish NFR scale in its English and Spanish versions (see “Results” section).

Table 1. KMO index, homogeneity index, means, standard deviations, and correlations with confidence intervals

Note: M and SD are used to represent mean and standard deviation, respectively. Values in square brackets indicate the 95% confidence interval for each correlation. The confidence interval is a plausible range of population correlations that could have caused the sample correlation (Cumming, Reference Cumming2014). **p < .01. HI = homogeneity index.

Table 2. Item content and factor loadings from the exploratory and confirmatory factor analyses of the Danish need for recovery scale

Note: The table presents the content of the items in both English and Spanish, along with the standardized factor loadings obtained from the exploratory factor analysis (EFA) and confirmatory factor analysis (CFA).

Job Stress (General Nordic Questionnaire, QPSNordic; Dallner et al., Reference Dallner, Elo, Gamberale, Hottinen, Knardahl, Lindström, Skogstad and Ørhede2000) in its Spanish Version (Carral & Alcover, Reference Carral and Alcover2019) // Stress ([Nordic Age Discrimination Scale, NADS]; Furunes & Mykletun, Reference Furunes and Mykletun2010).

A single item was used: Stress refers to a person’s situation when they feel tense, restless, nervous, or anxious or cannot sleep at night because their mind is constantly preoccupied with work-related issues. Please indicate the extent to which you currently feel this type of stress. It has five possible responses on a Likert-type scale from 1 to 5 (1 = nothing and 5 = a lot).

General Health Questionnaire (GHQ-12; Goldberg, Reference Goldberg1978, Goldberg & Williams, Reference Goldberg and Williams1988) in its Spanish Version (Rocha et al., Reference Rocha, Pérez, Rodríguez-Sanz, Borrell and Obiols2011).

The scale consists of 12 items and is widely used as a unidimensional measure of psychological distress, despite ongoing discussions about its factorial structure. Numerous studies support its application as a unidimensional tool in practical settings, highlighting that the general factor explains most of the variance (Hystad & Johnsen, Reference Hystad and Johnsen2020). It is answered on a Likert-type scale from 1 to 4 (1 = never and 4 = always), with the total score ranging from 12 to 48. A higher score indicates a higher level of distress. The internal consistency for this sample was Omega = .81.

Positive and Negative Affect Schedule (PANAS; Watson et al., Reference Watson, Clark and Tellegen1988) in its Spanish Adaptation (Sandín et al., Reference Sandín, Chorot, Lostao, Joiner, Santed and Valiente1999).

The scale consists of 20 items, divided into two subscales designed to measure positive affect (PA) and negative affects (NA). The bidimensional structure of the PANAS has been validated in Spanish populations, demonstrating robust psychometric properties (Sandín et al., Reference Sandín, Chorot, Lostao, Joiner, Santed and Valiente1999; López-Gómez et al., Reference López-Gómez, Hervas and Vazquez2015). Each item is rated on a Likert-type scale from 1 (very slightly or not at all) to 5 (extremely), with higher scores indicating greater levels of positive or negative affect. Within the scope of this study, the reliability through the Omega coefficient was excellent for the PANAS N (ω = .96) and for the PANAS P (ω = .93).

Procedure

The study was conducted in Spain between 2021 and 2023. This period encompasses the entire research process, including the design, translation, adaptation of the instruments, as well as the data collection and analysis phases. The procedure can be divided into two steps. First, a translation/back-translation method integrated with expert committee review and pretesting was applied. Before translating each item, the conceptual and content equivalence of the underlying construct should be considered (Flaherty et al., Reference Flaherty, Gaviria, Pathak, Mitchell, Wintrob, Richman and Birz1988). Therefore, two researchers involved in the back-translation process, who were experts in the field and fluent in both English and Spanish, agreed that the meaning of the need for recovery construct underlying the NFR scale was meaningful in Spanish culture.

The translation used the backward translation strategy (Hambleton & Patsula, Reference Hambleton and Patsula1998). In the first step, two independent translators, native English speakers and proficient Spanish speakers, translated the original scale from English to Spanish. In the second step, they synthesized the two translations into one. In the third step, two independent translators, native Spanish speakers and English proficient, performed the backward translation. In a fourth step, a three-member expert committee compared the backward translations with the original scale and consented to a prefinal version of the translated questionnaire. The committee members, who were experts in Organizational Psychology, work stress, and burnout, guaranteed that the items were well designed to measure the construct meant to be measured and retain the original meaning. In the fifth and final step, the committee ensured good comprehension of each question of the scale and concluded with the final version of the Spanish NFR.

Some minor changes were made during this process to adjust the items’ ability to fit into contemporary Spanish. Thus, we used the wording «jornada laboral» when «work day» was used in English items because the exact translation for this term (día de trabajo) would have sounded too informal for the context in Spanish (items 1, 2, and 8).

Second, once the scale was fully adapted to the Spanish population, we proceeded with recruitment. Participants were recruited through the respective human resources departments of the companies that participated in the study. Participants were asked to complete the newly translated NFR. They signed the informed agreement and received instructions from a trained interviewer. The scale was administered, emphasizing that the responses were anonymous and there were no right or wrong responses. The items appeared in the same order as the English-language NFR (except for the eliminated item). There was no time limit. During the administration of the scale, any doubts arising during the process about the meaning of the items were clarified.

Data Analysis

Using an exploratory and descriptive analysis, the adequacy of the data to a multivariate normal distribution is analyzed, as well as the sociodemographic characteristics of the sample. Additionally, univariate normality was assessed for each item through skewness and kurtosis indices, which indicated that all items fall within acceptable ranges (skewness: −0.21 to 0.30; kurtosis: −1.01 to −0.56). These results suggest minimal deviations from normality at the univariate level. The study of the dimensionality of the questionnaire is analyzed using a split-sample validation procedure, dividing the sample into two halves, using a simple randomization procedure.

Given the cultural adaptation process and the potential for semantic or contextual variability, we conducted an exploratory factor analysis (EFA) with the first subsample (n = 222) using the minimum residuals method (MINRES; Barendse et al., Reference Barendse, Oort and Timmerman2015). While EFA is typically used in the development of new scales, its application in cross-cultural adaptations can serve as a conservative strategy to verify whether the hypothesized unidimensional structure is appropriate for the target population (Brown, Reference Brown2015). To determine the number of factors to retain, we performed a parallel analysis (PA; Timmerman & Lorenzo-Seva, Reference Timmerman and Lorenzo-Seva2011), which compares the eigenvalues obtained from the actual data with those derived from simulated data. The retained factors were those whose eigenvalues exceeded the threshold from the simulated data. As the parallel analysis supported a one-factor solution, no rotation was applied in the exploratory factor analysis. Pearson’s correlation matrices were used as input for both the EFA and PA. In our case, the multivariate normality criterion, as assessed by the Royston test (Royston < .01), was not met, supporting the use of these nonparametric estimation methods.

The criterion for item retention is based on the presence of a factor loading greater than .30 (Lloret-Segura et al., Reference Lloret-Segura, Ferreres-Traver, Hernández-Baeza and Tomás-Marco2014). Kaiser’s (Reference Kaiser1970) Kaiser–Meyer–Olkin (KMO) measure is used as a criterion for the goodness of fit of the data to the EFA. This procedure allows for determining whether the correlation matrix fits the EFA. In general, KMO values > .80 indicate a satisfactory degree of fit.

A confirmatory factor analysis (CFA) was carried out with the second subsample (n = 223) to evaluate the factor structure obtained in the EFA (Model 1). A second model (Model 2) was estimated based on the reduced factor structure of three items (2, 6, and 7). For Model 2, the factor structure was tested while addressing potential overestimation of parameters by constraining the loading of item 2 to 1 as an anchor point, following methodological recommendations (Brown, Reference Brown2015). Confirmatory factor analysis was conducted using multiple estimation methods to ensure the robustness of the findings. The robust maximum likelihood estimator (MLM) was first applied, providing robust standard errors and a Satorra–Bentler scaled chi-square. Additionally, the model was tested using WLSMV (weighted least squares mean and variance adjusted), an estimator specifically recommended for ordinal data. Although the items are ordinal, they were treated as approximately continuous based on the presence of five response categories, minimal skewness and kurtosis. This approach is supported by prior research suggesting that robust ML methods can be appropriately applied to ordinal variables with five or more categories under these conditions (Rhemtulla et al., Reference Rhemtulla, Brosseau-Liard and Savalei2012; Li, Reference Li2016). Model fit was assessed using the comparative fit index (CFI), the Tucker–Lewis index (TLI), the root mean square error of approximation (RMSEA), and the standardized root mean square residual (SRMR), following recommendations by Brown (Reference Brown2015). Values above 0.90 for CFI and TLI, below 0.08 for SRMR, and between 0.05 and 0.08 for RMSEA indicate an acceptable fit.

After analyzing the internal structure of the instrument, reliability was assessed using omega total, calculated from the same sample used for the exploratory factor analysis. Values above .70 are considered acceptable (Gadermann et al., Reference Gadermann, Guhn and Zumbo2012). Next, measurement invariance was assessed to determine whether the set of items measures the same construct across different groups and whether observed differences between groups are not due to biases in the items themselves (Oppong et al., Reference Oppong, Liu, Wang, Xie, Lei, Zhou, Peng, Li and Xing2023). This analysis was conducted using a multigroup CFA to evaluate the degree to which the instrument’s factor structure is invariant across the groups compared: sample origin, gender, and age. The grouping variables were operationalized as follows: sample origin was categorized into public sector (Guardia Civil) and private sector (telecommunications and civil works), while age was dichotomized based on the sample’s median (Mdn = 39), dividing participants into two groups: younger than 39 years and 39 years or older.

To assess invariance, four levels of measurement invariance were tested sequentially. Configural invariance: this level examines whether the same factor structure holds across groups without imposing additional constraints. This is achieved if the model fit is acceptable, and the factor loadings are significant across all groups. Metric invariance: this level evaluates whether factor loadings are equal across groups. This is tested by constraining the factor loadings to equality and comparing the fit with the configural model. Scalar invariance: at this level, the equality of item intercepts across groups is tested by further constraining the intercepts in addition to the factor loadings. The model fit is then compared to the metric invariance model. Strict invariance: this final level of invariance tests whether residual variances are equal across groups by imposing additional constraints on the residuals. The model fit is compared to the scalar invariance model. The criteria for evaluating invariance were based on the comparison of RMSEA and CFI indices, following Cheung and Rensvold’s (Reference Cheung and Rensvold2002) guidelines, where ΔCFI ≤0.01 and ΔRMSEA ≤0.015 indicate invariance. Additionally, chi-square difference tests were performed to compare nested models, as recommended by Brown (Reference Brown2015), to further support the assessment of measurement invariance. To complement the multigroup CFA and provide additional insight into group-related effects on the latent construct, we conducted a Multiple Indicators Multiple Causes (MIMIC) model. This approach is particularly useful when the results of measurement invariance testing are inconclusive or only partially supported, as it allows for the inclusion of covariates to assess their influence on the latent factor without imposing the strict constraints of group-based models.

Once the assumption of measurement invariance is assessed, a multiple regression analysis is conducted to explore the relationships between age, sex, and origin of the sample with the scores derived from the instrument based on the resulting factorial structure.

To examine the theoretical consistency of the construct in the context of occupational health, we included external variables associated with job strain and recovery. These included general mental health and job stress. While general health is a broader construct, it has been shown to reflect the accumulated fatigue and psychological strain resulting from work overload (e.g., Bridger & Brasher, Reference Bridger and Brasher2011; Gommans et al., Reference Gommans, Jansen, Stynen, de Grip and Kant2015). Predictive association analysis was performed on a specific subsample of public sector participants from the overall study (n = 201). These participants were involved in a controlled trial that included pretest and posttest measurements over a 4-week period.

All statistical analyses were conducted using R software (R Core Team, 2023) and the following packages: mvnTest (Royston, Reference Royston1995), psych (Revelle, Reference Revelle2021a), EFA.MRFA (Timmerman & Lorenzo-Seva, Reference Timmerman and Lorenzo-Seva2011), psychTools (Revelle, Reference Revelle2021b), car (Fox & Weisberg, Reference Fox and Weisberg2019), factoextra (Kassambara & Mundt, Reference Kassambara and Mundt2020), lavaan (Rosseel, Reference Rosseel2012), semPlot (Epskamp, Reference Epskamp2015), and semTools (Jorgensen et al., Reference Jorgensen, Pornprasertmanit, Schoemann and Rosseel2023).

Results

Exploratory Factor Analysis

The results of the PA, as a method for estimating factors to be retained, are shown in Figure 1. The abscissa axis shows the number of factors. In contrast, the ordinate axis shows the percentage of common variance explained—the similarity of the actual and simulated data points to the presence of a unifactorial structure.

Figure 1. Results of the parallel analysis following the procedure of Timmerman and Lorenzo-Seva (Reference Timmerman and Lorenzo-Seva2011). The ordinate axis shows the percentage of variance explained, while the abscissa axis shows the number of factors. The convergence or similarity between the red (real data) and green (simulated data) lines indicates the presence of a factor.

The Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy showed satisfactory results, indicating the overall adequacy of the data for exploratory factor analysis (global KMO = .89). Additionally, the item-specific KMO values exceeded 0.80, confirming the adequacy of the items for the model (see Table 1). The same table shows the correlation matrix between items, their mean, and standard deviation. After confirming the dimensionality of the questionnaire through the EFA, the reliability of the total scale was evaluated. Corrected item-total correlations (HI: homogeneity indexes) for each item showed values above .72, indicating that all items contributed meaningfully to the internal consistency of the questionnaire. This is further supported by the reliability coefficients for the total scale, with Omega (ω = .89), demonstrating adequate levels of internal consistency, with values equal to or above .70 (Nunnally & Bernstein, Reference Nunnally and Bernstein1994).

Table 2 (see next) presents the factor loadings obtained for both the EFA. Notably, Item 3 displayed a negative factor loading, indicating a reversed relationship with the latent factor. This is consistent with its reverse wording, designed to reduce potential response biases (Podsakoff et al., Reference Podsakoff, MacKenzie, Lee and Podsakoff2003). While this approach may slightly lower the overall reliability estimates, it does not compromise the validity of the instrument or the adequacy of the model. As can be seen, all items maintain loading between 0.37 and 0.83, higher than the minimum retention criterion (>.30) (Lloret-Segura et al., Reference Lloret-Segura, Ferreres-Traver, Hernández-Baeza and Tomás-Marco2014). The only extracted factor explains 47% of the variance. Therefore, the interpretability of the factorial solution suggests the possibility of considering a one-factor structure provisionally.

Confirmatory Factor Analysis

Continuing with the split-sample validation procedure and the second subsample (N = 223), the CFA was carried out to contrast the hypothesis of the factorial structure of only one dimension obtained in the CFA (Model 1). Figure 2 shows the proposed model with the standardized estimated parameters (see also Table 1).

Figure 2. The standardized parameters of the model, the covariance parameter between the latent variables, and the standard errors.

The RMSEA value of 0.08 falls at the threshold of acceptable fit. The 90% confidence interval for the RMSEA ranged from 0.05 to 0.11, suggesting that while the lower bound indicates a good fit, the upper bound reflects some degree of misfit. This result should be interpreted in conjunction with other fit indices (CFI = 0.94, TLI = 0.91, SRMR = 0.041), which collectively suggest that the model provides a reasonable representation of the data.

To ensure the robustness of the findings, we also conducted the CFA using the WLSMV estimator (weighted least squares mean and variance adjusted), recommended for ordinal data. The WLSMV analysis yielded similar results, with CFI = 0.95, TLI = 0.93, RMSEA = 0.08, and SRMR = 0.044. These consistent findings across estimation methods confirm the stability of the unidimensional structure of the scale.

Complementarily, a reduced single-factor model based on items 2, 6, and 7 was tested to explore whether a shorter version of the scale could maintain acceptable psychometric properties. However, this model presented a lower fit, with only one acceptable index (CFI = 0.90), while other indicators, such as TLI (0.71) and RMSEA (0.29), fell outside the recommended thresholds (see Table 3).

Table 3. Evaluation of the goodness of fit of Models 1 and 2

* p < .05.

Analysis of Measurement Invariance

Measurement invariance was evaluated using multiple-group confirmatory factor analysis (MG-CFA) across the variables sample origin, gender, and age. Configural invariance was supported, indicating that the overall factor structure is consistent across groups. However, metric, scalar, and strict invariance showed partial support, as some significant differences were observed in chi-square tests and ΔCFI values exceeded the recommended threshold of −0.01 in specific comparisons, particularly for sample origin and gender (see Table 4).

Table 4. Evaluation of measurement invariance by estimating the configural (Model 0), metric (Model 1), scalar (Model 2), and strict invariance (Model 3) models through the variables sample origin, age, and sex

Note: Values.

* Model fit is statistically significant at p < .01.

a Dichotomous variable with median-based values (Mdn = 39). Group 0 with Mdn values < 39; Group 1 with Mdn values > 39.

The MIMIC model showed an acceptable fit: CFI = 0.883, RMSEA = 0.109, 90% CI (0.095, −0.124), and SRMR = 0.067. These results suggest that the model adequately captures the relationships between covariates and the latent variable, though the RMSEA indicates some degree of misfit.

Once the metric invariance was assessed, the predictive capacity of the variables sex, age, and the sample’s origin was analyzed. For this purpose, a multiple regression model is estimated, where these variables are introduced as predictors, while the total scores in the instrument constitute the dependent variable. The results show statistically significant values for age and group variables, showing that older age predicts lower scores. In contrast, the group of workers from private companies predicts higher scores on the instrument (see Table 5).

Table 5. Regression results using the total score of the instrument as a criterion

Note: A significant b weight indicates that the semipartial correlation is also significant. b represents unstandardized regression weights. sr2 represents the semipartial correlation squared. LL and UL indicate a confidence interval’s lower and upper limits, respectively.

**p < .01.

Validity Based on the Relationship With Other Variables

Table 6 shows the results of the analysis of validity based on the relationship with other variables. The correlation between the need for recovery and work stress was positive and statistically significant (r = .61, p < .01, 95% CI [0.55, 0.66]), which corresponds to a large effect according to Cohen’s (Reference Cohen1988) criteria. This result is theoretically consistent, as greater work stress during the day is expected to lead to more difficulty recovering at the end of the workday, and therefore to a greater need for recovery. In contrast, the association between the need for recovery and general health was negative (r = −.32, p < .01, 95% CI [−0.40, −0.23]), reflecting a moderate effect size. This result supports the idea that higher levels of recovery need are linked to poorer perceived general health.

Table 6. Means, standard deviations, and correlations with confidence intervals

Note: M and SD are used to represent mean and standard deviation, respectively. Values in square brackets indicate the 95% confidence interval for each correlation. The confidence interval is a plausible range of population correlations that could have caused the sample correlation (Cumming, Reference Cumming2014).

**p < .01.

Additionally, the two external variables, work stress and general health, were negatively correlated (r = −.35, p < .01, 95% CI [−0.43, −0.27]), which is consistent with the previous findings. The fact that the NFR shows opposite correlations with these two variables (positive with stress and negative with health) is theoretically coherent, given the inverse relationship between perceived stress and general well-being.

Regarding affective states, the NFR was positively associated with negative affect (r = .63, p < .01, 95% CI [0.54, 0.71]), and negatively associated with positive affect (r = −.74, p < .01, 95% CI [−0.74, −0.59]). According to Cohen’s (Reference Cohen1988) benchmarks, the first represents a large effect size and the second a very large effect size. These results indicate that a greater need for recovery is related to higher levels of negative affect and lower levels of positive affect.

Together, these findings support Hypotheses H1 and H2, and provide validity evidence based on relationships with other theoretically related variables.

Discussion

This study delves into the methodological understanding of adapting psychometric instruments to specific cultural contexts, emphasizing the importance of considering local particularities in instrument validation.

The results of the adaptation and validation of the Danish NFR scale (Stevens et al., Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019) in the Spanish population allow us to conclude that the scale, in its full eight-item version after eliminating an item not applicable in our context, has adequate psychometric properties according to the usual indicators in the literature, confirming its one-dimensional structure. The factor analysis results support the predominantly unidimensional structure of the Spanish version of the Danish NFR, with acceptable data fit, underscoring the scale’s potential for assessing recovery needs in the Spanish work context. While the values obtained for internal consistency confirm the reliability of the scale, the findings regarding measurement invariance suggest that the instrument is stable across some, but not all, group comparisons. These results highlight the applicability of the scale while also pointing to areas for further exploration in future studies.

The findings of this study highlight significant differences among the analyzed subgroups and point to the need to include additional variables related to work and personal characteristics that could act as moderators or mediators. For example, factors such as workload, type of contract, or family responsibilities could more comprehensively explain the observed differences in NFR scores.

The results of this study highlight the practical relevance of the NFR scale in understanding how recovery needs relate to affective states. Specifically, higher levels of need for recovery were associated with higher negative affect and lower positive affect, with both associations reaching moderate to large effect sizes. Although no specific thresholds were defined in the hypotheses, the observed effect sizes (r = .51 and r = .71, respectively) support the theoretical consistency of these associations, in line with current recommendations for interpreting the strength of psychological relationships (Cohen, Reference Cohen1988). These findings, together with the validity evidence based on relations with other variables, reinforce the conceptual value of the NFR scale as a meaningful indicator of both psychological distress and well-being.

However, unlike the results obtained by Stevens et al. (Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019, Reference Stevens, Crowley, Rasmussen, Hallman, Mortensen, Nygård and Holtermann2020), the short version of three items does not present a better fit compared to the full version in its Spanish adaptation. The comparison of single-factor models versus the reduced three-item version reveals a superiority in the fit of the complete model, reflecting a more accurate and detailed capture of the recovery need construct and underscores the value of a rigorous methodological approach to psychometric evaluation in the workplace. These results allow us to conclude that, provisionally, the underlying construct is more adequately captured by the full version in its Spanish adaptation. Although the cited authors (Stevens et al., Reference Stevens, Crowley, Garde, Mortensen, Nygård and Holtermann2019) suggest that even a single-item version would be able to represent the need for recovery construct, our results suggest caution in this direction.

The trend in psychology and related sciences to develop increasingly shorter scales is well known (e.g., Rammstedt & Beierlein, Reference Rammstedt and Beierlein2014; Smith et al., Reference Smith, McCarthy and Anderson2000), and especially in the field of work and organizations (e.g., van Veldhoven et al., Reference van Veldhoven, Prins, Laken and Dijkstra2015), based on criteria of convenience in application, on reducing response biases caused by participant fatigue or loss of attention caused by long questionnaires, and on obtaining psychometric properties equivalent and sometimes even superior to full versions. However, this tendency cannot lead to an oversimplification in the measurement of constructs that, though one-dimensional, require the formulation of a sufficient number of items to capture their real meaning.

In conclusion, the Spanish adaptation of the Danish NFR scale, in its full eight-item version, provides evidence of reliability and validity for the assessment of the need for recovery in the evaluated sample and context and represents a helpful complement for the measurement of stress and work recovery experiences, for which the Spanish adaptation (Sanz-Vergel et al., Reference Sanz-Vergel, Sebastián, Rodríguez-Muñoz, Garrosa, Moreno-Jiménez and Sonnentag2010) of the questionnaire developed by Sonnentag and Fritz (Reference Sonnentag and Fritz2007) was already available. The daily depletion of personal resources at work, manifested by the level of need for recovery in the last part of the workday and after its end (Bennett et al., Reference Bennett, Gabriel, Calderwood, Dahling and Trougakos2016; Wentz et al., Reference Wentz, Gyllensten, Sluiter and Hagberg2020), significantly reduces the resources available for investment in activities that facilitate recovery, at home or in other social or leisure contexts (Sonnentag et al., Reference Sonnentag, Cheng and Parker2022; Ten Brummelhuis & Bakker, Reference Ten Brummelhuis and Bakker2012), and lead to the aforementioned recovery paradox (Sonnentag, Reference Sonnentag2018). The accurate identification of difficulties in achieving effective post-workday recovery through the measurement of the need for recovery with the instrument validated in Spanish in this study may contribute to improving such recovery processes (Chawla et al., Reference Chawla, MacGowan, Gabriel and Podsakoff2020; de Croon et al., Reference de Croon, Sluiter and Frings-Dresen2006) and thus achieve an increase in well-being and health associated with work and quality of life in general.

Limitations, Future Research, and Practical Implications

Despite its strengths, this study is not without limitations, several of which point to valuable directions for future research. First, certain contextual and demographic variables—such as number and age of children, working hours, contract type, salary, and commuting distance—were not considered in this study. These factors could influence the perception of recovery needs and should be incorporated into future studies to better contextualize individual differences.

Second, the use of a Likert scale with extreme anchors (“never” to “always”) may have limited the variability of responses. Exploring alternative response formats (e.g., “rarely” to “very frequently”) may enhance the scale’s sensitivity and discriminatory power across occupational settings.

In addition, while the NFR scale is conceptually distinct from chronic stress constructs such as burnout, some overlap—particularly with emotional exhaustion and depersonalization—may exist. The absence of these variables in this study limits the possibility of examining how the NFR construct can be empirically distinguished from related constructs such as burnout. Future research should include such measures to better delineate the nomological network of the construct.

The study also employed a single-item indicator to assess perceived work stress. Although this approach offers simplicity and has empirical precedent, it does not permit the estimation of internal consistency and may attenuate observed associations due to measurement error. Multi-item validated scales are recommended in future applications. Finally, while the test–retest reliability of the NFR scale showed moderate stability over time, the absence of an analysis of potential changes in participants between measurements makes it difficult to fully interpret these results. Future studies should aim to clarify this issue by controlling for possible changes in external or psychological factors between administrations.

In terms of practical implications, as already mentioned, the need for recovery is considered an initial or early stage in the long-term strain process (Sonnentag & Zijlstra, Reference Sonnentag and Zijlstra2006). Therefore, preventing or detecting the need for recovery at the earliest possible stage can facilitate the recovery process, particularly within work (Chan et al., Reference Chan, Howard, Eva and Tse2022; Gommans et al., Reference Gommans, Jansen, Stynen, de Grip and Kant2015), and reduce work-related stress and its associated adverse consequences (van Veldhoven, Reference van Veldhoven, Houdmont and Leka2008; Sonnentag et al., Reference Sonnentag, Kuttler and Fritz2010). Designing organizational interventions that can detect the need for recovery early on and eliminate the factors that cause it can, in turn, reduce long-term fatigue, the effects of psychological distress, and burnout (Sinval et al., Reference Sinval, Veldhoven, Oksanen, Azevedo, Atallah, Melnik and Marôco2021). In summary, these interventions aim to reduce or eliminate factors that contribute to the need for recovery, thereby enhancing well-being and health at work (Wentz et al., Reference Wentz, Gyllensten, Sluiter and Hagberg2020). They can serve as a foundation for the design of restorative work environments (Hilbert et al., Reference Hilbert, Binnewies and Berkemeyer2024) and contribute to a healthier and more sustainable work design (Parker & Knight, Reference Parker and Knight2025).

Data availability statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

We thank the Servicio de Psicología of the Guardia Civil for facilitating this research by granting access to its facilities and the sample of professionals who participated.

Author contribution

F.A.B.-J.: conceptualization, formal analysis, methodology, writing—original draft, writing—review and editing. C.D.-S.: data curation, investigation, funding acquisition. R.R.-Í.: data curation, investigation, project administration. M.-Á.S.G.: conceptualization, formal analysis, methodology, funding acquisition, project administration, supervision. C.-M.A.: conceptualization, writing—original draft, funding acquisition, supervision, writing—review and editing.

Funding statement

This research was carried out within the framework of the research project funded by the Ministerio de Ciencia e Innovación/AEI/10.13039/501100011033, Grant PID2019-110490RB-I00.

Competing interests

The authors declare none.

References

Barendse, M. T., Oort, F. J., & Timmerman, M. E. (2015). Using exploratory factor analysis to determine the dimensionality of discrete responses. Structural Equation Modeling, 22(1), 87101. https://doi.org/10.1080/10705511.2014.934850CrossRefGoogle Scholar
Bennett, A. A., Gabriel, A. S., Calderwood, C., Dahling, J. J., & Trougakos, J. P. (2016). Better together? Examining profiles of employee recovery experiences. Journal of Applied Psychology, 101(12), 16351654. https://doi.org/10.1037/apl0000157CrossRefGoogle ScholarPubMed
Boateng, G. O., Neilands, T. B., Frongillo, E. A., Melgar-Quiñonez, H. R., & Young, S. L. (2018). Best practices for developing and validating scales for health, social, and behavioral research: A primer. Frontiers in Public Health, 6, Article 149. https://doi.org/10.3389/fpubh.2018.00149CrossRefGoogle ScholarPubMed
Bridger, R. S., & Brasher, K. (2011). Cognitive task demands, self-control demands and the mental well-being of office workers. Ergonomics, 54(9), 830839. https://doi.org/10.1080/00140139.2011.596948CrossRefGoogle ScholarPubMed
Brown, T. A. (2015). Confirmatory factor analysis for applied research (2nd ed.). The Guilford Press.Google Scholar
Carral, P., & Alcover, C. M. (2019). Measuring age discrimination at work: Spanish adaptation and preliminary validation of the Nordic Age Discrimination Scale (NADS). International Journal of Environmental Research and Public Health, 16(8), Article 1431. https://doi.org/10.3390/ijerph16081431CrossRefGoogle ScholarPubMed
Chan, P. H. H., Howard, J., Eva, N., & Tse, H. H. M. (2022). A systematic review of at-work recovery and a framework for future research. Journal of Vocational Behavior, 137, 119. https://doi.org/10.1016/j.jvb.2022.103747CrossRefGoogle Scholar
Chawla, N., MacGowan, R. L., Gabriel, A. S., & Podsakoff, N. P. (2020). Unplugging or staying connected? Examining the nature, antecedents, and consequences of profiles of daily recovery experiences. Journal of Applied Psychology, 105(1), 1939. https://doi.org/10.1037/apl0000423CrossRefGoogle ScholarPubMed
Cheung, G. W., & Rensvold, R. B. (2002). Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling, 9(2), 233255. https://doi.org/10.1207/S15328007SEM0902_5CrossRefGoogle Scholar
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.), ErlbaumGoogle Scholar
Cumming, G. (2014). The new statistics: why and how. Psychological science, 25(1), 729. https://doi.org/10.1177/0956797613504966CrossRefGoogle Scholar
Dallner, M., Elo, A.-L., Gamberale, F., Hottinen, V., Knardahl, S., Lindström, K., Skogstad, A., & Ørhede, E. (2000). Validation of the General Nordic Questionnaire (QPSNordic) for psychological and social factors at work. Nordic Council of Ministers.Google Scholar
De Bloom, J. (2020). How to recover during and from a pandemic. Industrial Health, 58(3), 197199. https://doi.org/10.2486/indhealth.58_300CrossRefGoogle ScholarPubMed
de Croon, E. M., Sluiter, J. K., & Frings-Dresen, M. H. (2003). Need for recovery after work predicts sickness absence: A 2-year prospective cohort study in truck drivers. Journal of Psychosomatic Research, 55(4), 331339. https://doi.org/10.1016/s0022-3999(02)00630-xCrossRefGoogle ScholarPubMed
de Croon, E. M., Sluiter, J. K., & Frings-Dresen, M. H. (2006). Psychometric properties of the Need for Recovery After Work Scale: Test-retest reliability and sensitivity to detect change. Occupational and Environmental Medicine, 63(3), 202206. https://doi.org/10.1136/oem.2004.018275CrossRefGoogle ScholarPubMed
Dupret, É., Bocéréan, C., Feltrin, M., Chemolle, É., & Lebon, J. (2018). L’échelle de Besoin de Récupération: adaptation et validation françaises [The Need for Recovery Scale: French adaptation and validation]. Sante Publique, 30(4), 445454. https://doi.org/10.3917/spub.185.0445CrossRefGoogle Scholar
Epskamp, S. (2015). semPlot: Unified visualizations of structural equation models. Structural Equation Modeling: A Multidisciplinary Journal, 22(3), 474483. https://doi.org/10.1080/10705511.2014.937847CrossRefGoogle Scholar
Feuerhahn, N., Sonnentag, S., & Woll, A. (2014). Exercise after work, psychological mediators, and affect: A day-level study. European Journal of Work and Organizational Psychology, 23(1), 6279. https://doi.org/10.1080/1359432X.2012.709965CrossRefGoogle Scholar
Flaherty, J. A., Gaviria, F. M., Pathak, D., Mitchell, T., Wintrob, R., Richman, J. A., & Birz, S. (1988). Developing instruments for cross-cultural psychiatric research. The Journal of Nervous and Mental Disease, 176(5), 257263.10.1097/00005053-198805000-00001CrossRefGoogle ScholarPubMed
Fox, J., & Weisberg, S. (2019). An R companion to applied regression, 3rd ed. Sage.Google Scholar
Furunes, T., & Mykletun, R. J. (2010). Age discrimination in the workplace: Validation of the Nordic Age Discrimination Scale (NADS). Scandinavian Journal of Psychology, 51(1), 2330. https://doi.org/10.1111/j.1467-9450.2009.00738.xCrossRefGoogle ScholarPubMed
Gadermann, A., Guhn, M., & Zumbo, B. (2012). Estimating ordinal reliability for Likert-type and ordinal item response data: A conceptual, empirical and practical guide. Practical Assessment, Research and Evaluation, 17(3), 113. https://doi.org/10.7275/n560-j767Google Scholar
Goldberg, D. (1978). Manual of general health questionnaire. N. Publishing (ed.).Google Scholar
Goldberg, D., & Williams, P. (1988). A user’s guide to the General Health Questionnaire. NFER-Nelson.Google Scholar
Gommans, F. G., Jansen, N. W., Stynen, D., de Grip, A., & Kant, I. (2015). Need for recovery across work careers: The impact of work, health and personal characteristics. International Archives of Occupational and Environmental Health, 88(3), 281295. https://doi.org/10.1007/s00420-014-0956-3CrossRefGoogle ScholarPubMed
Gupta, N., Wåhlin-Jacobsen, C. D., Abildgaard, J. S., Henriksen, L. N., Nielsen, K., & Holtermann, A. (2018). Effectiveness of a participatory physical and psychosocial intervention to balance the demands and resources of industrial workers: A cluster-randomized controlled trial. Scandinavian Journal of Work, Environment & Health, 44(1), 5868. https://doi.org/10.5271/sjweh.3689CrossRefGoogle ScholarPubMed
Hambleton, R. K., & Patsula, L. (1998). Adapting tests for use in multiple languages and cultures. Social Indicators Research, 45(1/3), 151171. http://www.jstor.org/stable/27522340CrossRefGoogle Scholar
Hilbert, M., Binnewies, C., & Berkemeyer, L. (2024). Synergies between recovery from work and restorative environments for sustainable development: An integrative theoretical perspective. Gruppe. Interaktion. Organisation. Zeitschrift für Angewandte Organisationspsychologie (GIO), 55(2), 201207. https://doi.org/10.1007/s11612-024-00736-6CrossRefGoogle Scholar
Hystad, S. W., & Johnsen, B. H. (2020). The dimensionality of the 12-item General Health Questionnaire (GHQ-12): Comparisons of factor structures and invariance across samples and time. Frontiers in Psychology, 11, 1300. https://doi.org/10.3389/fpsyg.2020.01300CrossRefGoogle ScholarPubMed
Jansen, N. W. H., Kant, I., & van den Brandt, P. A. (2002). Need for recovery in the working population: Description and associations with fatigue and psychological distress. International Journal of Behavioral Medicine, 9(4), 322340. https://doi.org/10.1207/S15327558IJBM0904_03CrossRefGoogle ScholarPubMed
Jansen, N., Kant, I., van Amelsvoort, L., Nijhuis, F., & van den Brandt, P. (2003). Need for recovery from work: Evaluating short-term effects of working hours, patterns and schedules. Ergonomics, 46(7), 664680. https://doi.org/10.1080/0014013031000085662CrossRefGoogle ScholarPubMed
Jorgensen, T. D., Pornprasertmanit, S., Schoemann, A. M., & Rosseel, Y. (2023). semTools: Useful tools for structural equation modeling. https://cran.r-project.org/package=semToolsGoogle Scholar
Kaiser, H. F. (1970). A second generation little jiffy. Psychometrika, 35 ( 4), 401 415. https://doi.org/10.1007/BF02291817CrossRefGoogle Scholar
Kassambara, A., & Mundt, F. (2020). Factoextra: Extract and visualize the results of multivariate data analyses. https://cran.r-project.org/package=factoextraGoogle Scholar
Li, C. H. (2016). Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares. Behavior Research Methods, 48, 936949. https://doi.org/10.3758/s13428-015-0619-7CrossRefGoogle ScholarPubMed
Lin, Y. C., Chen, Y. C., Hsieh, H. I., & Chen, P. C. (2015). Risk for work-related fatigue among the employees on semiconductor manufacturing lines. Asia-Pacific Journal of Public Health, 27(2), 18051818. https://doi.org/10.1177/1010539513490788CrossRefGoogle ScholarPubMed
Lloret-Segura, S., Ferreres-Traver, A., Hernández-Baeza, A., & Tomás-Marco, I. (2014). El análisis factorial exploratorio de los ítems: una guía práctica, revisada y actualizada [Exploratory factor analysis of items: A practical guide, revised and updated]. Anales de Psicología / Annals of Psychology, 30(3), 11511169. https://doi.org/10.6018/analesps.30.3.199361Google Scholar
López-Gómez, I., Hervas, G., & Vazquez, C. (2015). An adaptation of the Positive and Negative Affect Schedules (PANAS) in a Spanish general sample. Behavioral Psychology/Psicologia Conductual, 23(3), 529548.Google Scholar
Maslach, C., & Jackson, S. E. (1981), The measurement of experienced burnout. Journal of Organizational Behavior, 2, 99113. https://doi.org/10.1002/job.4030020205CrossRefGoogle Scholar
Moriguchi, C. S., Alem, M. E., van Veldhoven, M., & Coury, H. J. (2010). Cultural adaptation and psychometric properties of Brazilian Need for Recovery Scale. Revista de Saude Publica, 44(1), 131139. https://doi.org/10.1590/s0034-89102010000100014CrossRefGoogle ScholarPubMed
Nunnally, J., & Bernstein, I. (1994). Psychometric Theory (3rd ed.). McGraw-HillGoogle Scholar
Oppong, J., Liu, X., Wang, Q., Xie, C., Lei, Y., Zhou, Y., Peng, Y., Li, H., & Xing, C. (2023). Assessing measurement invariance of the SEGUE framework among Chinese and Ghanaian patients: A cross-sectional study. PEC Innovation, 3, Article 100183. https://doi.org/10.1016/j.pecinn.2023.100183CrossRefGoogle ScholarPubMed
Pace, F., Lo Cascio, V., Civilleri, A., Guzzo, G., Foddai, E., & Van Veldhoven, M. (2013). The Need for Recovery Scale: Adaptation to the Italian context. European Review of Applied Psychology / Revue Européenne de Psychologie Appliquée, 63(4), 243249. https://doi.org/10.1016/j.erap.2013.05.001CrossRefGoogle Scholar
Parker, S. K., & Knight, C. (2025). Design work to prevent burnout. MIT Sloan Management Review, 66 (2) 18.Google Scholar
Peters, S. E., Dennerlein, J. T., Wagner, G. R., & Sorensen, G. (2022). Work and worker health in the post-pandemic world: A public health perspective. The Lancet. Public health, 7(2), e188e194. https://doi.org/10.1016/S2468-2667(21)00259-0CrossRefGoogle ScholarPubMed
Podsakoff, P. M., MacKenzie, S. B., Lee, J. Y., & Podsakoff, N. P. (2003). Common method biases in behavioral research: A critical review of the literature and recommended remedies. Journal of Applied Psychology, 88(5), 879903. https://doi.org/10.1037/0021-9010.88.5.879CrossRefGoogle ScholarPubMed
Rammstedt, B., & Beierlein, C. (2014). Can’t we make it any shorter? The limits of personality assessment and ways to overcome them. Journal of Individual Differences, 35(4), 212220. https://doi.org/10.1027/1614-0001/a000141CrossRefGoogle Scholar
Revelle, W. (2021a). psych: Procedures for psychological, psychometric, and personality research. Northwestern University, Evanston, Illinois. https://cran.r-project.org/package=psychGoogle Scholar
Revelle, W. (2021b). psychTools: Tools to accompany the psych package for psychological research. Northwestern University, Evanston, Illinois. https://cran.r-project.org/package=psychToolsGoogle Scholar
Rhemtulla, M., Brosseau-Liard, P. É., & Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17(3), 354373. https://doi.org/10.1037/a0029315CrossRefGoogle ScholarPubMed
Rocha, K. B., Pérez, K., Rodríguez-Sanz, M., Borrell, C., & Obiols, J. E. (2011). Propiedades psicométricas y valores normativos del General Health Questionnaire (GHQ-12) en población general española. [Psychometric properties and normative values of General Health Questionnaire (GHQ-12) in Spanish population.]. International Journal of Clinical and Health Psychology, 11, 125139.Google Scholar
Rodríguez-Muñoz, A., Sanz-Vergel, A. I., Antino, M., Demerouti, E., & Bakker, A. B. (2018). Positive experiences at work and daily recovery: Effects on couple’s well-being. Journal of Happiness Studies: An Interdisciplinary Forum on Subjective Well-Being, 19(5), 13951413. https://doi.org/10.1007/s10902-017-9880-zCrossRefGoogle Scholar
Rosseel, Y. (2012). lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(2), 136. https://doi.org/10.18637/jss.v048.i02CrossRefGoogle Scholar
Royston, P. (1995). Remark AS R94: A remark on algorithm AS 181: The W-test for normality. Applied Statistics, 44(4), 547551. https://doi.org/10.2307/2986146CrossRefGoogle Scholar
R Core Team. (2023). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. https://www.r-project.org/Google Scholar
Sandín, B., Chorot, P., Lostao, L., Joiner, T. E., Santed, M. A., & Valiente, R. M. (1999). Escalas PANAS de afecto positivo y negativo: validación factorial y convergencia transcultural. Psicothema, 11(1), 3751.Google Scholar
Sanz-Vergel, A. I., Sebastián, J., Rodríguez-Muñoz, A., Garrosa, E., Moreno-Jiménez, B., & Sonnentag, S. (2010). Adaptación del “Cuestionario de Experiencias de Recuperación” a una muestra Española [Adaptation of the “Recovery Experience Questionnaire” in a Spanish sample]. Psicothema, 22(4), 990996.Google Scholar
Shi, Y., She, Z., Li, D., Zhang, H., & Niu, K. (2021). Job crafting promotes internal recovery state, especially in jobs that demand self-control: a daily diary design. BMC Public Health, 21, Article number 1889. https://doi.org/10.1186/s12889-021-11915-1CrossRefGoogle ScholarPubMed
Sinval, J., Veldhoven, M., Oksanen, T., Azevedo, L. F., Atallah, Á. N., Melnik, T., & Marôco, J. (2021). Interventions for improving recovery from work. The Cochrane Database of Systematic Reviews, 2021(12), CD014518. https://doi.org/10.1002/14651858.CD014518Google Scholar
Sluiter, J. K., de Croon, E. M., Meijman, T. F., & Frings-Dresen, M. H. (2003). Need for recovery from work related fatigue and its role in the development and prediction of subjective health complaints. Occupational and Environmental Medicine, 60(Suppl 1), i62–i70. https://doi.org/10.1136/oem.60.suppl_1.i62CrossRefGoogle ScholarPubMed
Sluiter, J. K., van der Beek, A. J., & Frings-Dresen, M. H. (1999). The influence of work characteristics on the need for recovery and experienced health: A study on coach drivers. Ergonomics, 42(4), 573583. https://doi.org/10.1080/001401399185487CrossRefGoogle Scholar
Smith, G. T., McCarthy, D. M., & Anderson, K. G. (2000). On the sins of short-form development. Psychological Assessment, 12(1), 102111. https://doi.org/10.1037/1040-3590.12.1.102CrossRefGoogle ScholarPubMed
Sonnentag, S. (2018). The recovery paradox: Portraying the complex interplay between job stressors, lack of recovery, and poor well-being. Research in Organizational Behavior, 38, 169185. https://doi.org/10.1016/j.riob.2018.11.002CrossRefGoogle Scholar
Sonnentag, S., Cheng, B. H., & Parker, S. L. (2022). Recovery from work: Advancing the field toward the future. Annual Review of Organizational Psychology and Organizational Behavior, 9, 3360. https://doi.org/10.1146/annurev-orgpsych-012420-091355CrossRefGoogle Scholar
Sonnentag, S., & Fritz, C. (2007). The Recovery Experience Questionnaire: Development and validation of a measure for assessing recuperation and unwinding from work. Journal of Occupational Health Psychology, 12(3), 204221. https://doi.org/10.1037/1076-8998.12.3.204CrossRefGoogle ScholarPubMed
Sonnentag, S., Kuttler, I., & Fritz, C. (2010). Job stressors, emotional exhaustion, and need for recovery: A multi-source study on the benefits of psychological detachment. Journal of Vocational Behavior, 76(3), 355365. https://doi.org/10.1016/j.jvb.2009.06.005CrossRefGoogle Scholar
Sonnentag, S., Mojza, E. J., Binnewies, C., & Scholl, A. (2008). Being engaged at work and detached at home: A week-level study on work engagement, psychological detachment, and affect. Work & Stress, 22(3), 257276. https://doi.org/10.1080/02678370802379440CrossRefGoogle Scholar
Sonnentag, S., & Niessen, C. (2020). To detach or not to detach? Two experimental studies on the affective consequences of detaching from work during non-work time. Frontiers in Psychology, 11, Article 560156. https://doi.org/10.3389/fpsyg.2020.560156CrossRefGoogle ScholarPubMed
Sonnentag, S., Stephan, U., Wendsche, J., de Bloom, J., Syrek, C., & Vahle-Hinz, T. (2021). Recovery in occupational health psychology and human resource management research: An interview with Prof. Sabine Sonnentag and Prof. Ute Stephan. German Journal of Human Resource Management, 35(2), 274281. https://doi.org/10.1177/23970022211004599Google Scholar
Sonnentag, S., Venz, L., & Casper, A. (2017). Advances in recovery research: What have we learned? What should be done next? Journal of Occupational Health Psychology, 22(3), 365380. https://doi.org/10.1037/ocp0000079CrossRefGoogle ScholarPubMed
Sonnentag, S., & Zijlstra, F. R. H. (2006). Job characteristics and off-job activities as predictors of need for recovery, well-being, and fatigue. Journal of Applied Psychology, 91(2), 330350. https://doi.org/10.1037/0021-9010.91.2.330CrossRefGoogle ScholarPubMed
Steed, L. B., Swider, B. W., Keem, S., & Liu, J. T. (2021). Leaving work at work: A meta-analysis on employee recovery from work. Journal of Management, 47(4), 867897. https://doi.org/10.1177/0149206319864153CrossRefGoogle Scholar
Stevens, M. L., Crowley, P., Garde, A. H., Mortensen, O. S., Nygård, C. H., & Holtermann, A. (2019). Validation of a short-form version of the Danish Need for Recovery Scale against the full scale. International Journal of Environmental Research and Public Health, 16(13), Article 2334. https://doi.org/10.3390/ijerph16132334CrossRefGoogle ScholarPubMed
Stevens, M. L., Crowley, P., Rasmussen, C. L., Hallman, D. M., Mortensen, O. S., Nygård, C. H., & Holtermann, A. (2020). Accelerometer-measured physical activity at work and need for recovery: A compositional analysis of cross-sectional data. Annals of Work Exposures and Health, 64(2), 138151. https://doi.org/10.1093/annweh/wxz095CrossRefGoogle ScholarPubMed
Ten Brummelhuis, L. L., & Bakker, A. B. (2012). A resource perspective on the work-home interface: The work-home resources model. American Psychologist, 67(7), 545556. https://doi.org/10.1037/a0027974CrossRefGoogle ScholarPubMed
Timmerman, M. E., & Lorenzo-Seva, U. (2011). Dimensionality assessment of ordered polytomous items with parallel analysis. Psychological Methods, 16(2), 209220. https://doi.org/10.1037/a0023353CrossRefGoogle ScholarPubMed
Tu, M., & Chi, N.-W. (2023). How and when abusive supervision leads to recovery activities: The recovery paradox and the conservation of resources perspectives. Journal of Organizational Behavior, 45(4), 558575. https://doi.org/10.1002/job.2757CrossRefGoogle Scholar
van Veldhoven, M., & Broersen, S. (2003). Measurement quality and validity of the “need for recovery scale”. Occupational and Environmental Medicine, 60(Suppl 1), i3i9. https://doi.org/10.1136/oem.60.suppl_1.i3CrossRefGoogle ScholarPubMed
van Veldhoven, M., Prins, J., Laken, P. V., & Dijkstra, L. (2015). QEEW2.0: 42 short scales for survey research on work, well-being and performance. SKB.Google Scholar
van Veldhoven, M. J. P. M. (2008). Need for recovery after work: An overview of construct, measurement and research. In Houdmont, J., & Leka, S. (Eds.), Occupational health psychology: European perspectives on research, education and practice (pp. 125). Nottingham University Press.Google Scholar
Van Veldhoven, M. J. P. M., & Broersen, J. P. J. (1999). Psychosociale arbeidsbelasting en werkstress in Nederland. [Psychosocial job demands and job stress in the Netherlands]. Foundation for Quality Improvement in Occupational Health Care.Google Scholar
Van Veldhoven, M. J. P. M., & Meijman, T.F. (1994). Het meten van psychosociale arbeidsbelasting met een vragenlijst: de vragenlijst beleving en beoordeling van de arbeid (VBBA). [The measurement of psychosocial job demands with a questionnaire: The questionnaire on the experience and evaluation of work (QEEW)]. Dutch Institute for Working Conditions.Google Scholar
Watson, D., Clark, L. A., & Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: The PANAS Scales. Journal of Personality and Social Psychology, 54(6), 10631070. https://doi.org/10.1037/0022-3514.54.6.1063CrossRefGoogle ScholarPubMed
Wendsche, J., & Lohmann-Haislah, A. (2017). A meta-analysis on antecedents and outcomes of detachment from work. Frontiers in Psychology, 7, Article 2072. https://doi.org/10.3389/fpsyg.2016.02072CrossRefGoogle ScholarPubMed
Wentz, K., Gyllensten, K., Sluiter, J. K., & Hagberg, M. (2020). Need for recovery in relation to effort from work and health in four occupations. International Archives of Occupational and Environmental Health, 93(2), 243259. https://doi.org/10.1007/s00420-019-01476-7CrossRefGoogle ScholarPubMed
Xanthopoulou, D., Bakker, A. B., Oerlemans, W. G. M., & Koszucka, M. (2018). Need for recovery after emotional labor: Differential effects of daily deep and surface acting. Journal of Organizational Behavior, 39(4), 481494. https://doi.org/10.1002/job.2245CrossRefGoogle Scholar
Figure 0

Table 1. KMO index, homogeneity index, means, standard deviations, and correlations with confidence intervals

Figure 1

Table 2. Item content and factor loadings from the exploratory and confirmatory factor analyses of the Danish need for recovery scale

Figure 2

Figure 1. Results of the parallel analysis following the procedure of Timmerman and Lorenzo-Seva (2011). The ordinate axis shows the percentage of variance explained, while the abscissa axis shows the number of factors. The convergence or similarity between the red (real data) and green (simulated data) lines indicates the presence of a factor.

Figure 3

Figure 2. The standardized parameters of the model, the covariance parameter between the latent variables, and the standard errors.

Figure 4

Table 3. Evaluation of the goodness of fit of Models 1 and 2

Figure 5

Table 4. Evaluation of measurement invariance by estimating the configural (Model 0), metric (Model 1), scalar (Model 2), and strict invariance (Model 3) models through the variables sample origin, age, and sex

Figure 6

Table 5. Regression results using the total score of the instrument as a criterion

Figure 7

Table 6. Means, standard deviations, and correlations with confidence intervals