Hostname: page-component-7f64f4797f-7vqc4 Total loading time: 0 Render date: 2025-11-04T11:47:35.810Z Has data issue: false hasContentIssue false

Examining intra- and inter-device reliability of pressure-mediated reflection spectroscopy in a multi-state sample of healthy adults

Published online by Cambridge University Press:  25 September 2025

Susan B. Sisson*
Affiliation:
Department of Nutrition Sciences, University of Oklahoma Health Sciences Center , Oklahoma City, OK, USA
Emily Helms
Affiliation:
Department of Nutrition Sciences, University of Oklahoma Health Sciences Center , Oklahoma City, OK, USA
Shanon Casperson
Affiliation:
USDA Agricultural Research Services, Grand Forks Human Nutrition Research Center, Grand Forks, ND, USA
Saima Hasnin
Affiliation:
Department of Food Science and Human Nutrition, College of Agricultural, Consumer and Environmental Sciences, University of Illinois Urbana-Champaign, Urbana, IL, USA
Stephanie Jilcott Pitts
Affiliation:
Department of Public Health, East Carolina University, Greenville, NC, USA
Virginia C. Stage
Affiliation:
Department of Agricultural and Human Sciences, NC State University, Raleigh, NC, USA
Christopher R. Long
Affiliation:
Center for Nutrition and Health Impact, Omaha, NE, USA
Taren Massey-Swindle
Affiliation:
Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, AR; Arkansas Children’s Research Institute, Little Rock, AR; Arkansas Children’s Nutrition Center, Little Rock, AR, USA
Dipti A. Dev
Affiliation:
Department of Child, Youth and Family Studies, College of Education and Human Sciences, University of Nebraska-Lincoln, Lincoln, NE, USA
Ashlea Braun
Affiliation:
Department of Nutritional Sciences, College of Education & Human Sciences, Oklahoma State University, Stillwater, OK, USA TSET Health Promotion Research Center, Stephenson Cancer Center, University of Oklahoma Health Sciences, Tulsa, OK, USA
Jodi D. Stookey
Affiliation:
Maternal, Child & Adolescent Health, San Francisco Department of Public Health, San Francisco, CA 94103, USA
Rowena Cape
Affiliation:
Maternal, Child & Adolescent Health, San Francisco Department of Public Health, San Francisco, CA 94103, USA
Jonathan Baldwin
Affiliation:
College of Allied Health, University of Oklahoma Health Sciences, Oklahoma City, OK, USA
*
Corresponding author: Susan B. Sisson; Email: susan-sisson@ouhsc.edu
Rights & Permissions [Opens in a new window]

Abstract

Objective:

To examine the intra- and inter-device reliability of devices using pressure-mediated reflection spectroscopy (the Veggie Meter®).

Design:

A cross-sectional research study was conducted across eight sites in the USA. Using two Veggie Meters® at each site, participants completed five, counter-balanced pairs of finger scans. Intra-device comparisons included intra-class correlation coefficients (ICC) and calculation of the CV and 95 % CI of each device/site; hypothesised to be ≤ 6 %. Inter-device comparisons included ICC, absolute relative differences (ARD) and 95 % CI, and equivalence; both hypothesised to be ≤ 10 %.

Setting:

Eight sites across the USA.

Participants:

Across sites, participants’ (n 282) average age ranged 24·7–39·0 years; sex ranged 60·0–85·7 % women and Non-Hispanic White ranged 20·0–94·3 %.

Results:

Intra-device ICC ranged from 0·77 to 0·99. The CV ranged from 6·2 to 14·2 %, with an average of 8·8 %. A majority (63 %; n 10) of the Veggie Meter® devices had significantly higher CV from the hypothesised 6 %. Inter-device ICC ranged from 0·58 to 0·94. The ARD ranged from 7·5 to 22·0 %, with an average of 13·9 %. ARD in a majority (n 5) of sites was significantly higher than the hypothesised 10 %. Five sites (63 %) demonstrated equivalence below the hypothesised 10 %.

Conclusions:

Our study demonstrates the intra-device and inter-device reliability to be moderate to high, as per ICC. The observed margin of difference within a device was up to 14 %, with an average of 9 %. The observed margin of difference between devices was up to 22 %, with an average of 14 % between devices.

Information

Type
Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Nutrition Society

Accurate and non-invasive assessment of fruit and vegetable intake is needed to overcome challenges associated with self-report measures. Technologies that measure skin carotenoids have emerged as a valid non-invasive method of assessing fruit and vegetable intake using resonance Raman spectroscopy and reflection spectroscopy (RS). The latter, RS, is relatively lower cost, portable and provides a rapid (less than 1 min) assessment of skin carotenoids. An RS device that is frequently used in recent public health nutrition studies is the Veggie Meter®(Reference Akingbule, Teran-Garcia and Okoye1Reference Whiteside-Mansell, Swindle and Davenport8).

There is extremely high construct validity between the Veggie Meter®-assessed skin carotenoid score (SCS) and the score derived from the earlier established resonance Raman spectroscopy (r = 0·86–0·94)(Reference Ermakov, Ermakova and Sharifzadeh9,Reference Jahns, Johnson and Conrad10) . Veggie Meter®-assessed SCS in adults is highly correlated with total serum carotenoids (r = 0·81, P < 0·001)(Reference Ermakov, Ermakova and Sharifzadeh9) and total plasma carotenoids (r = 0·71, P < 0·001)(Reference Jahns, Johnson and Conrad10,Reference Jilcott Pitts, Moran and Wu11) and is sensitive to changes in dietary intake of carotenoids over relatively short time frames (e.g. 13·1 mg consumed daily for 3 weeks)(Reference Ermakov, Ermakova and Sharifzadeh9,Reference Casperson, Roemmich and Larson12,Reference Jilcott Pitts, Moran and Laska13) . Veggie Meter®-assessed SCS is also significantly associated with self-reported dietary recall in adults and children using a variety of recalls representing varied recall time frames(Reference Jilcott Pitts, Moran and Wu11,Reference Rush, Jalili-Moghaddam and Diep14Reference Casperson, Scheett and Palmer19) . Veggie Meter®-assessed SCS has high stability with sd of sequential measurements ranging between 3·4 % and 4·1 %(Reference Ermakov, Ermakova and Sharifzadeh9).

A better understanding of the reliability of the Veggie Meter® is needed considering its use in a variety of public health nutrition research and community-based settings. It is important to note that reliability needs to be examined in two manners: (1) intra-device reliability, or repeatability, as the similarity in SCS on the same individual, on the same day, on the same device and (2) inter-device reliability as the similarity in SCS on the same individual, on the same day and on a different device. The intra-device reliability of the Veggie Meter® has been examined in only two studies(Reference Jilcott Pitts, Jahns and Wu18,Reference Obana, Asaoka and Takayanagi20) . The CV ranged from 4·8 % to 9·1 % in these studies(Reference Jilcott Pitts, Jahns and Wu18,Reference Obana, Asaoka and Takayanagi20) . Inter-device reliability has been examined in only one study with a high reported intraclass correlation coefficient (ICC = 0·99)(Reference Obana, Asaoka and Takayanagi20). Further, the mean actual differences between devices ranged from 24·9 to 44·6 indicating 7·4–11·8 % difference between devices(Reference Obana, Asaoka and Takayanagi20). All participants were of similar ancestry and in a single geographic location limiting generalisability.

What is unknown is if multiple Veggie Meter® devices will produce similar SCS within the same person and how these scores can be compared between devices in geographically diverse populations. Understanding the equivalence between devices is an important aspect of inter-device reliability and can have research and practice implications (e.g. in cases where a research team has multiple Veggie Meter® devices and measures individuals longitudinally using either device). Therefore, the purpose of this study was to examine the intra-device reliability and inter-device reliability of Veggie Meter® devices in a sample of adults at eight sites across the USA. We hypothesised that the intra-device and inter-device ICC will be high (above 0·75). We expected the CV in SCS from multiple scans of the same participant on the same device would be <6 %(Reference Jilcott Pitts, Jahns and Wu18,Reference Obana, Asaoka and Takayanagi20) . We anticipated the absolute relative difference (ARD) in SCS from multiple Veggie Meter® devices would be within 10 %(Reference Obana, Asaoka and Takayanagi20). Further equivalence in SCS of the two devices was hypothesised to be 10 % or less.

Methods

Study design

This cross-sectional research study included a target of thirty-five participants per site across eight sites in the USA: University of Oklahoma Health Sciences in Oklahoma City, OK (coordinating site; OUHSC); University of Nebraska in Lincoln, Nebraska (UNL); East Carolina University and North Carolina State University collaborative, North Carolina (ECU/NC State); University of Arkansas for Medical Sciences (UAMS) campuses at Fayetteville (UAMS – Fayetteville) and Little Rock (UAMS – Little Rock); Oklahoma State University in Stillwater, OK (OSU); the Grand Forks Human Nutrition Research Center in Grand Forks, North Dakota (GFHNRC) and the San Francisco Department of Public Health (SFDPH). All sites participated in protocol development, study design and data collection. OUHSC was the coordinating site and developed the online survey data collection tools, housed de-identified data and conducted data analyses. Sample size was determined based on power to determine a 5 % absolute magnitude of difference within devices (n 21) and between devices (n 29). The absolute magnitude of difference between scores within a participant for our actual pilot data of 4 people was 9·1 %. These pilot data suffer from a very wide CI with such a small sample. If the actual difference was modified to 6·5 %, as would be more likely with a larger dataset, twenty-one participants would be needed. The absolute magnitude of difference between devices in our pilot data was 9·6 %. If the actual difference was modified to 7 %, as would be more likely with a larger dataset, twenty-nine participants would be needed. Enrollment was increased to account for variability and missing data.

Following consent, participants completed a questionnaire and surveys on demographic characteristics, skin light reactivity(Reference Fitzpatrick21) and a carotenoid-rich food source screener(Reference Casperson, Roemmich and Larson12). Using two Veggie Meters® at each site, participants completed five pairs of skin carotenoid scans using the single scan function for a total of ten scans of their non-dominant ring finger(Reference Radtke, Poe and Stookey22). Each University’s Institutional Review Board approved each site’s study (OUHSC #14488; UNL #20221222110EP; ECU #22-001293; UAMS #274841; OSU #IRB-22-449-STW; GFNHRC reviewed by University of North Dakota #0005363; SFDPH #20-30320). Following participation, some participants received compensation of varying ($0, $15, $25 and $35) amounts based on funding availability and site-level IRB approval. Data collection occurred from July 2022 through November 2023.

Recruitment/Participants

Participants (n 282) were recruited to research laboratories at each site (OUHSC n 35; UNL n 35; ECU n 35; UAMS-Fayetteville n 33; UAMS-Little Rock n 37; OSU n 37; GFHNRC n 35 and SFDPH n 35). Participants were English-speaking, healthy adults 18–65 years of age and had no impairments or deformities to his/her non-dominant ring finger. Participants were recruited by convenience, word of mouth and email. Due to possible skin discolouration, participants who had eaten Doritos, Cheetos or Takis in the 3 days prior to their appointment or who used self-tanning lotion in the 2 weeks prior to their scheduled Veggie Meter® appointment were rescheduled or excluded from the study if they did not wish to reschedule.

Interested participants were screened for eligibility and scheduled until recruitment goals were met. Eligibility exclusions included finger scarring (OUHSC n 1) and using self-tanning lotion (UNL n 1).

Procedures

Veggie Meter® equipment descriptions and any protocol variations across sites are described in online supplementary material, Supplemental Table 1. Prior to participant arrival, Veggie Meters® were calibrated using the factory recommended procedures. Veggie Meters® are re-calibrated every 60 min of data collection at all sites except UNL in which they are calibrated every 30 min. Veggie Meters® are identified within each laboratory as device 1 or 2 to ensure counterbalancing of finger scans.

Upon arrival at the research laboratory, participants provided informed consent. Height and weight were measured at only five of the sites. Anthropometric measures were excluded at UAMS Fayetteville due to privacy concerns as many participants were faculty and staff in the department during data collection and self-reported at two sites for cultural preferences (SFDPH) and privacy (UAMS Little Rock).

For the finger scans, participants inserted their non-dominant ring finger into one Veggie Meter®. Upon completion of the first scan, participants inserted the same finger into the other Veggie Meter® to complete the first pair of scans. Each scan took less than a minute to produce an SCS. For each pair of scans, the order of the devices was counterbalanced. For example, for the first pair of scans, the participant was scanned on device 1 then device 2 (arbitrarily assigned at each site but remained consistent throughout the study); for the second pair of scans, the participant was scanned on device 2 then device 1. Participants were alternately assigned to start scan 1 of the first pair of scans on either device 1 or 2. For example, participant 1 started their first scan with device 1; participant 2 started their first scan with device 2, and so on. Participants rested 5 minutes between pairs of scans to allow for full blood flow to return to the finger. During the 5-minute breaks between scan sets, participants completed questionnaires. Thus, each participant had a total of five pairs of SCS representing a total of ten SCS, five scores from device 1 and 5 scores from device 2.

Most sites collected data on a single research participant at a time; however, two sites (UAMS-Little Rock and SFPHD) collected data on two participants simultaneously and switched participants back and forth across both devices. For those sites collecting data on a single participant, the participants washed hands prior to starting scans, and the device was wiped with alcohol prior to scans. For those sites collecting data on multiple participants simultaneously, the device was wiped with alcohol and allowed to dry between each participant’s finger scan.

Spurious values were occasionally observed and handled differently across sites. All but two sites recorded each value as displayed by the equipment. However, two sites used different protocols. When values were observed greater than 100 points higher or lower than the immediately preceding score, GFNHRC would remove the participant’s finger, reposition and rescan. Spurious values were attributed to incorrect finger placement on the Veggie Meter® lens. At UNL, when values >10 % different from the previous scan were observed, the device was recalibrated and the participant rescanned.

Measures

Veggie Meter®

The Veggie Meter® device is a tool used to measure skin carotenoids using pressure-mediated RS. The spectroscopy-based device utilise white LED light and reflection optical signaling(Reference Ermakov, Ermakova and Sharifzadeh9,Reference Ermakov and Gellermann23) . Slight pressure (∼1 atm) temporarily moves blood from the fingertip for optical assessment of the reflected light to be identified by the spectrograph(Reference Ermakov, Ermakova and Sharifzadeh9,Reference Ermakov and Gellermann23) .

Veggie Meters® used across sites varied by the date of purchase. While manufacturers indicated no change in the equipment or anticipated scores over time, investigators observed minor nuances in the instructional manuals regarding warm up time and calibration reference numbers (see online supplementary material, Supplemental Table 1). Further, Obana 2023(Reference Obana, Asaoka and Takayanagi20) indicated variation in SCS with devices purchased over different time frames.

Anthropometric measures

For those sites measuring height (cm) and weight (kg), each participant’s anthropometric measures were collected in bare or socked feet. Height was measured to the nearest half centimeter. Weight was recorded without heavy clothes and to the nearest hundredth of a kilogram. Tools used to collect height and weight varied by site and described in online supplementary material, Supplemental Table 2. The questionnaire obtained participant information regarding sex, education, marital status, race/ethnicity, age, smoking/vaping status and Fitzpatrick skin reactivity(Reference Fitzpatrick21,Reference Roberts24) .

Statistical analysis

Sites were analysed separately since examination of these intra-individual and inter-individual variability will require the same individual being scanned on each device. Further, slight deviations in the protocol around spurious values may introduce lower intra-device and inter-device variability. Demographic characteristics were calculated for participants at each site. To fully examine intra- and inter-device reliability, multiple analyses were conducted and are described herein. Mathematical calculations, respectively, described below, were used to quantify the magnitude of the variance within and between devices. SAS version 9·4 (SAS Institutes) was used for all calculations.

Intra-device comparisons

ICC are a statistic that describes how strongly units within two groups resemble each other and is often used as a measure of intra- and inter-rater agreement(Reference Koo and Li25,Reference Liljequist, Elfving and Skavberg Roaldsen26) . A higher ICC indicates higher agreement across devices or raters; an ICC of <0·5 is poor, 0·5–0·75 is moderate, 0·75–0·9 is good and above 0·9 is excellent(Reference Koo and Li25). An ICC ≥ 0·75 was hypothesised. The 95 % CI describe the range (i.e. variance) around the point estimate (ICC, in this case). ICC and 95 % CI were calculated across all five scores within each device for each participant to determine the reliability of scores within an individual.

CV (CV = sd/mean) is a measure of the precision of a sample mean (i.e. data dispersion). CV presents the standard deviation (sd) as a percent of the mean and is calculated by CV = sd/mean. CV for each device were calculated to determine the relative measure of variability within an individual’s five scores, yielding two CV for each participant, one for each Veggie Meter® device. CV for each device (Veggie Meter® 1 and Veggie Meter® 2) were calculated from averaging all respective device CV across all site participants. CV across each site were averaged. A 95 % CI was calculated to determine if the variation of each site’s CV differs from the hypothesised 6 %, based on previous studies(Reference Jilcott Pitts, Jahns and Wu18,Reference Obana, Asaoka and Takayanagi20) . If the upper and lower values of the 95 % CI do not cross the hypothesised value, then it can be determined that the CV for that device is significantly lower or higher than the hypothesised value. A bootstrapped approach was used to determine the 95 % CI since the CV were not normally distributed. Since analysis involved separate computations for each clinical site, normality was assessed for data from each of the eight sites. Six (ECU/NC State, UAMS-Fayetteville, UAMS-Little Rock, OSU, GFNHRC and SFDPH) site’s CV were not normally distributed. In the case of a normally distributed CV, that site’s 95 % CI was computed using parametric methods; however, if the site’s CV was not normally distributed, a bootstrap procedure (5000 bootstrap samples) was used to obtain 95 % CI from the 2·5th percentile and 97·5th percentile of the bootstrap distribution. We chose to use both ICC and CV because they describe intra-device reliability differently: the ICC estimates intra-device variability as a fraction of the total variability, and CV measures intra-device variability as a fraction of the mean measure.

Inter-device comparisons

To determine the level of agreement between devices at each respective site, ICC were calculated across each pair of SCS taken from both devices (Veggie Meter® 1 and Veggie Meter® 2) at a given site.

The ARD of the SCS from device 1 and device 2 was determined relative to the mean of the two scores in the pair for each of the five pairs (│device 2 – device 1│/mean of device 1 and device 2). The participant’s mean relative difference was determined by averaging the relative difference for each of the five pairs of scans. The 95 % CI describe the range (i.e. variance) around the point estimate (ARD, in this case). If the upper and lower values of the 95 % CI do not cross the hypothesised value, then it can be determined that the ARD is significantly lower or higher than the hypothesised value. A 95 % CI was calculated to determine if the variation of each site’s ARD differs from the hypothesised 10 %(Reference Obana, Asaoka and Takayanagi20). Much like CV above, six site’s (UNL, ECU/NC State, UAMS-Fayetteville, UAMS-Little Rock, OSU, GFNHRC) ARD was not normally distributed. If normally distributed, the 95 % CI was computed using parametric methods, and if non-normally distributed then a 5000-sample bootstrap procedure was used to compute the 95 % CI.

Skin carotenoid score equivalence between the two devices was determined using CI intervals on the percent deviation, relative to their mean ([Device 1 – Device 2]/[mean of Devices 1 and 2] × 100), of the two devices per site. Equivalence was examined at 10 %.

Results

Descriptive characteristics of participants enrolled at each site are included in Table 1. Across all eight sites, the participants were mostly young adults (average 31 years), female, Non-Hispanic White, had college or graduate/professional degrees, non-smokers and about half of the sample was classified as overweight/obese.

Table 1 Mean, sd, frequency of participant descriptive characteristics across 8 sites (2022–2023)

OUHSC, Oklahoma Health Sciences in Oklahoma City; UNL, University of Nebraska in Lincoln, Nebraska; ECU, East Carolina University; NC, North Carolina; UAMS, University of Arkansas for Medical Sciences; OSU, Oklahoma State University; SFDPH, San Francisco Department of Public Health.

Intra-device ICC can be found in Table 2. Intra-device ICC ranged from 0·77 to 0·99, with an average of 0·89 indicating good to excellent agreement within SCS on the same Veggie Meter® device. Table 3 displays each device’s five individual and average of five SCS. CV was hypothesised to be 6 % based on previous literature(Reference Jilcott Pitts, Jahns and Wu18,Reference Obana, Asaoka and Takayanagi20) and is a measure of the dispersion of the five scores per device within an individual participant. CV for devices 1 and 2 can be found in Figure 1(a) and (b) and Table 2. The CV ranges from 6·2 % to 14·2 %, with an average of 8·8 %. A majority (63 %; n 10) of the Veggie Meter® devices had significantly higher CV from the hypothesised 6 %. Further, we observed no patterns of association between equipment variability reported in online supplementary material, Supplemental Table 1 and measures of intra-device reliability.

Figure 1 (a) Forest plot demonstrating CV and 95 % CI for Veggie Meter® device 1 at each site. Bold, dashed line indicated the hypothesised CV. (b) Forest plot demonstrating CV and 95 % CI for Veggie Meter® Device 2 at each site. Bold, dashed line indicated the hypothesised CV. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center.

Table 2 Intra- and inter-device reliability output across 8 sites

ICC, intra-class correlation coefficients; OUHSC, Oklahoma Health Sciences in Oklahoma City; UNL, University of Nebraska in Lincoln, Nebraska; ECU, East Carolina University; NC, North Carolina; UAMS, University of Arkansas for Medical Sciences; OSU, Oklahoma State University; SFDPH, San Francisco Department of Public Health.

Table 3 Individual Veggie Meter® skin carotenoid scores and average for each site

OUHSC, Oklahoma Health Sciences in Oklahoma City; UNL, University of Nebraska in Lincoln, Nebraska; ECU, East Carolina University; NC, North Carolina; UAMS, University of Arkansas for Medical Sciences; OSU, Oklahoma State University; GFHNRC, Grand Forks Human Nutrition Research Center in Grand Forks, North Dakota; SFDPH, San Francisco Department of Public Health.

Inter-device ICC across both devices at each site can be found in Figure 2 and Table 2. The range of ICC indicating degree of agreement between Veggie Meter® devices for a single individual varies between 0·58 and 0·94, indicating moderate to excellent agreement. ARD was hypothesised to be 10 % based on previous research(Reference Obana, Asaoka and Takayanagi20). Figure 3 and Table 2 show the ARD across devices. The ARD ranged from 7·5 % to 22·0 %, with an average of 13·9 %. A majority (n 5) of sites had an ARD significantly higher than the hypothesised 10 % ranging from 12·7 % to 22 %; one site had a significantly lower ARD (7·5 %) and the ARD for two sites (25 %) did not significantly differ from the hypothesised 10 %. Equivalence at each threshold including 5 % CI is in Figure 4. Five sites (63 % of all sites) demonstrated equivalence below the hypothesised 10 % threshold indicating SCS from one device would be expected to be within 10 % of the other device. Further, we observed no patterns of association between equipment variability reported in online supplementary material, Supplemental Table 1 and measures of inter-device reliability.

Figure 2 Inter-device intra class correlation coefficients and 95 % CI for each site. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center.

Figure 3 Inter-device absolute relative difference and 95 % CI for each site. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center. Bold, dashed line indicated the hypothesized absolute relative difference.

Figure 4 Equivalence of both Veggie Meter® devices and 95 % CI for each site. Bold, dashed line indicated the hypothesized equivalence. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center.

Discussion

The primary findings of this study indicate that within Veggie Meter® devices, there is good to excellent agreement across SCS. However, even with high correlations within each device, the variation within an individual was larger than the predicted 6 % in the majority (63 %) of sites ranging from 6 % to 14 %, with an average of 8·8 %. These values are slightly higher compared with the degree of variation between scores using resonance Raman spectroscopy (5·7 %) reported in another study(Reference Hwang, Park and Jung27), although the study did not report variation for RS to compare. Education, behavioural and community environmental interventions have reported intervention changes in Veggie Meter® assessed SCS ranging from 3 % to 58 %(Reference Akingbule, Teran-Garcia and Okoye1Reference Obana, Asaoka and Miura4,Reference Jilcott Pitts, Moran and Laska13,Reference Casperson, Scheett and Palmer19,Reference Yamane, Mochiji and Ichioka28Reference Ling, Suriyawong and Robbins36) , while supplementation studies generally have larger effects (range 5–58 %)(Reference Jilcott Pitts, Moran and Laska13,Reference Casperson, Scheett and Palmer19,Reference Yamane, Mochiji and Ichioka28Reference Amoah, Cairncross and Rush31) . Our results demonstrate the need to interpret changes in Veggie Meter®-assessed SCS findings with consideration for the margin of difference.

Studies examining resonance Raman spectroscopy variability between devices report CV ranges of 0·5–14·6 %, with an average of 5·1 %(Reference Ermakov and Gellermann37,Reference Bernstein, Zhao and Wintch38) . This is smaller than the average CV of 8·8 % observed in our study. There was moderate to high agreement across different Veggie Meter® SCS for a single individual. While the majority (63 %) of sites demonstrated equivalence below 10 %, the ARD, sites demonstrated variability ranging from 7·5 % to 22·0 % with an average of 13·9 %. This degree of inter-device variability necessitates caution when using multiple devices on a single participant over time. While the maximum difference between devices was 22 %, the average difference across all sites was approximately 14 %.

Practical implications

Several practical implications have emerged from this study. The highest CV was 14 %, conservatively indicating that actual changes observed may need to exceed 14 % to be clearly attributed to the intervention and outside the device margin of difference within an individual. Agreement across devices was moderate to high, with the range of difference being 12–22 %. Thus, to reduce artifacts and noise in the data, it is recommended that the same Veggie Meter® be used to measure an individual over time. If tracking devices is not feasible in the context or multiple devices are used across a study, ARD across devices can be calculated per site and reported to assure that change observed is attributable to the intervention effects and larger than the margin of difference between devices. The variance between scores should also be considered when comparing Veggie Meter® scores across papers and studies.

Manufacturer (Longevity Link, LLC) recommendations state that field ‘recalibration’ using the light and dark calibration sticks should be conducted every hour of continual use(Reference Radtke, Poe and Stookey22). One site (UNL) used light and dark calibration sticks every 30 min. One could hypothesise that perhaps that due to the more frequent use of the recalibration process, the degree of variability within and between participants would be decreased. However, this was not the case and the degree of variability within and between at UNL was equal to or greater than other sites. Further, the optimal frequency or regularity of service or calibration at Longevity Link headquarters is unknown. Specifically, it is currently unknown what impact servicing or re-calibration when devices are returned to Longevity Link has on the variability of SCS. Researchers may wish to avoid returning a device for service in the middle of a study or document such occurrences if they are unavoidable. In efforts to promote transparency and standardise reporting of SCS, dissemination efforts could include age of the device, equipment use patterns and date of last calibration.

Research opportunities

Based on our findings, the calculation of a correction factor for more accurate translation across multiple devices may be beneficial. Furthermore, while we found a 14 % margin of difference, an acceptable range of difference should also be defined. The FDA has determined that all self-monitoring blood glucose test kits may have a maximum of ±15 % error rate for about 95 % of the measurements, including the kits which use spectroscopy(39). Skin roughness and oxidative exposure to the sun are both reported to impact skin carotenoids(40,Reference Madore, Hwang and Park41) and developing greater understanding of these impacts and how to navigate them in controlled research and public health settings would be needed. Additional needs for research in a variety of populations include accuracy, validity, reliability and translation of Veggie Meter®-assessed SCS.

Treatment of spurious values was not addressed in the recommended standard Veggie Meter® research protocol(Reference Radtke, Poe and Stookey22). While the majority of the sites in this study recorded each value without any repositioning, cleaning equipment or recalibration, two sites had a more conservative approach to spurious values (UNL and GFNHRC). These sites rescanned the finger if the score was >10 % (UNL) or 100 points (GFNHRC) from the previous scan and discarded spurious values. UNL went so far as to recalibrate the Veggie Meter® then reposition the finger. With this more conservative approach, one would hypothesise that the intra- and inter-device reliability would be higher. However, this was not the case, as the CV and ADR for both sites were in the middle of the range across sites. While it seems prudent to reposition the finger if value appears spurious, the appropriate protocol for that process needs to be determined. It does not appear that discarding spurious values at these sites impacted the reliability of the SCS.

The development of field calibration phantoms or sticks that can be scanned with a known carotenoid score would be a critically important development to ensure Veggie Meter® devices are indeed accurately recording in the field. If multiple Veggie Meters® SCS are not regularly compared, accuracy issues could be overlooked. The ability for users to determine the accuracy of a device in the field, without another Veggie Meter®, is a critical next step to ensuring the integrity of Veggie Meter® utilisation in research projects.

These findings should be considered in light of the strengths and limitations of this study. Strengths of our study design include numerous sites across the USA representing different geographic regions, climates, sun exposure and month of the year. However, this is also a limitation since the Veggie Meter® devices used at each site were not the same and purchased at different times, had been serviced at different times and used in a variety of settings. This variability may have introduced bias or error into the data collected. The demographic and racial representation at each of these sites varied, which may contribute to the intra- and inter-device reliability at each site. While previous research demonstrates no differences in SCS by race(Reference Jilcott Pitts, Jahns and Wu18), more research is needed to understand these relationships more clearly. Another strength of our study was the diversity of ethnicities, age, skin reactivity categories and weight status. All efforts were taken to minimise systematic bias introduced by data collection procedures such as rigorous timing of scans, cleaning of participant fingers, counter-balancing and a series of five scans. Additional limitations include minor deviations on site procedures. Site investigators met virtually to develop the protocol and throughout data collection. A single IRB was considered across sites and not pursued given the cross-sectional nature of this project and varying institutional review board protocol requirements.

Conclusion

Veggie Meters® is an increasingly popular method of non-invasive dietary assessment in community populations and to evaluate intervention effectiveness. Our study demonstrates the intra-device and inter-device reliability to be moderate to high, as per ICC. We offer guidance that the observed margin of difference within a device was up to 14 %, with an average of 9 %. We offer guidance that the observed margin of difference between devices was up to 22 % with an average of 14 % between devices. Practical implications support previous work(Reference Radtke, Poe and Stookey22) and include utilisation of the same equipment for a participant across an intervention timeframe and the utilisation of multiple finger scans (i.e. triplicate mode or average multiple individual scans). More research is needed to refine the accurate use and ability to translate Veggie Meter (R)-assessed SCS in public health nutrition research studies.

Supplementary material

For supplementary material accompanying this paper visit https://doi.org/10.1017/S136898002510116X

Acknowledgements

Investigators would like to thank all the participants who donated time to contribute to this study.

Authorship

S.B.S. was the multi-site study Principal Investigator and organiser, contributed to study design, supervised site-level data collection and lead author on the manuscript. E.H. was the coordinating site study coordinator and developed the data collection databases, cleaned and processed all data and contributed to design and writing. S.C., S.J.P., V.C.S., C.R.L., T.M.S., D.A.D., A.B. and J.D.S. were site Principal Investigators and contributed to study design, data interpretation, literature development and writing. S.H. and R.C. were site study coordinators and contributed to study design, data collection, literature review, interpretation and writing. J.B. was the study statistician and contributed to design, conducted analyses and contributed to interpretation and writing.

Financial support

There were no extramural funding sources for OUHS, OSU or UNL. The work performed at the Grand Forks Human Nutrition Research Center was funded by USDA ARS #3062-10 700-001-000D. The collaborative work performed at East Carolina University with NC State University was funded by the National Institutes of Health R01 HL142544 and the Department of Agricultural and Human Sciences. The work performed at the University of Arkansas for Medical Sciences was supported by the National Institute on Minority Health and Health Disparities of the National Institutes of Health grant number 1P50MD017319, DK138153 and P20 GM109096. The work performed at Oklahoma State University was supported by a College of Education & Human Sciences Proposal for Student Technology Seed Grant and internal start-up support. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funders.

Competing interests

There are no conflicts of interest for authors.

Ethics of human subject participation

This study was conducted according to the guidelines laid down in the Declaration of Helsinki, and all procedures involving research study participants were approved by the respective sites. The coordinating site was the University of Oklahoma Health Sciences. Written informed consent was obtained from all participants.

References

Akingbule, O, Teran-Garcia, M, Okoye, S et al. (2023) Impact of a nutrition education and physical activity intervention on fruits and vegetable intake of Nigerian immigrants. J Nutr Educ Behav 55, 7980. doi: 10.1016/j.jneb.2023.05.173.Google Scholar
Di Noia, J, Monica, D, Sikorskii, A et al. (2021) Pilot study of a farm-to-special supplemental nutrition program for Women, Infants, and Children (WIC) intervention promoting vegetable consumption. J Acad Nutr Diet 121, 20352045. doi: 10.1016/j.jand.2020.12.020.Google Scholar
Laviolette, C, Johnson, CM, Butler, JL et al. (2023) Nutrition effects of a family-centered health promotion program for Mexican-Heritage children in the lower Rio Grande Valley of Texas. Nutrients 15, 1600. doi: 10.3390/nu15071600.Google Scholar
Obana, A, Asaoka, R, Miura, A et al. (2022) Improving skin carotenoid levels in young students through brief dietary education using the Veggie Meter. Antioxidants 11, 1570. doi: 10.3390/antiox11081570.Google Scholar
Bakirci-Taylor, AL, Reed, DB, McCool, B et al. (2019) mHealth improved fruit and vegetable accessibility and intake in young children. J Nutr Educ Behav 51, 556566. doi: 10.1016/j.jneb.2018.11.008.Google Scholar
Caparello, G, Ceraudo, F, Meringolo, F et al. (2024) Eating habits and carotenoid skin content among children based on their attendance at the school meals: a cross-sectional pilot study. J Clin Transl Endocrinol 38, 100378. doi: 10.1016/j.jcte.2024.100378.Google Scholar
Hill, CM, Paschall, MJ, O’Brien, DM et al. (2021) Characterizing vegetable and fruit intake in a remote Alaska native community using reflection spectroscopy and 24-hour recalls. J Nutr Educ Behav 53, 712718. doi: 10.1016/j.jneb.2021.02.002.Google Scholar
Whiteside-Mansell, L, Swindle, TM & Davenport, K (2019) Evaluation of ‘Together, We Inspire Smart Eating’ (WISE) nutrition intervention for young children: assessment of fruit and vegetable consumption with parent reports and measurements of skin carotenoids as biomarkers. J Hunger Environ Nutr. Published online: 19 August 2019. doi: 10.1080/19320248.2019.1652127.Google Scholar
Ermakov, IV, Ermakova, M, Sharifzadeh, M et al. (2018) Optical assessment of skin carotenoid status as a biomarker of vegetable and fruit intake. Arch Biochem Biophys 646, 4654. doi: 10.1016/j.abb.2018.03.033.Google Scholar
Jahns, L, Johnson, LK, Conrad, Z et al. (2019) Concurrent validity of skin carotenoid status as a concentration biomarker of vegetable and fruit intake compared to multiple 24-h recalls and plasma carotenoid concentrations across one year: a cohort study. Nutr J 18, 78. doi: 10.1186/s12937-019-0500-0.Google Scholar
Jilcott Pitts, SB, Moran, NE, Wu, Q et al. (2022) Pressure-mediated reflection spectroscopy criterion validity as a biomarker of fruit and vegetable intake: a 2-site cross-sectional study of 4 racial or ethnic groups. J Nutr 152, 107116. doi: 10.1093/jn/nxab349.Google Scholar
Casperson, SL, Roemmich, JN, Larson, KJ et al. (2023) Sensitivity of pressure-mediated reflection spectroscopy to detect changes in skin carotenoids in adults without obesity in responses to increased carotenoid intake: a randomized controlled trial. J Nutr 153, 588597. doi: 10.1016/j.tjnut.2023.01.002.Google Scholar
Jilcott Pitts, S, Moran, NE, Laska, MN et al. (2023) Reflection spectroscopy-assessed skin carotenoids are sensitive to change in carotenoid intake in a 6-week randomized controlled feeding trial in a racially/ethnically diverse sample. J Nutr 153, 11331142. doi: 10.1016/j.tjnut.2023.02.017.Google Scholar
Rush, E, Jalili-Moghaddam, S, Diep, T et al. (2019) Who is eating their veggies. Proceedings 37, 14. doi: 10.3390/proceedings2019037014.Google Scholar
Martinelli, S, Acciai, F, Tasevska, N et al. (2021) Using the Veggie Meter in elementary schools to objectively measure fruit and vegetable intake: a pilot study. Methods Protoc 4, 33. doi: 10.3390/mps4020033.Google Scholar
Varghese, V, Cepni, AB, Change, J et al. (2023) Skin carotenoids measures by reflection spectroscopy correlate wtih dietary carotenoid intake in racially and ethnically diverse US toddlers from Houston, Texas. J Acad Nutr Diet 124, 628635.e1. doi: 10.1016/j.jand.2023.10.015.Google Scholar
Di Noia, J & Gellermann, W (2021) Use of the spectroscopy-based Veggie Meter((R)) to objectively assess fruit and vegetable intake in low-income adults. Nutrients 13, 2270. doi: 10.3390/nu13072270.Google Scholar
Jilcott Pitts, SB, Jahns, L, Wu, Q et al. (2018) A non-invasive assessment of skin carotenoid status through reflection spectroscopy is a feasible, reliable and potentially valid measure of fruit and vegetable consumption in a diverse community sample. Public Health Nutr 21, 16641670. doi: 10.1017/S136898001700430X.Google Scholar
Casperson, SL, Scheett, A, Palmer, DG et al. (2023) Biochemical validation of a self-administered carotenoid intake screener to assess carotenoid intake in nonobese adults. Curr Dev Nutr 7, 100024. doi: 10.1016/j.cdnut.2022.100024.Google Scholar
Obana, A, Asaoka, R, Takayanagi, Y et al. (2023) Inter-device concordance of Veggie Meter-a reflection spectroscopy to measure skin carotenoids. J Biophotonics 16, e202300071. doi: 10.1002/jbio.202300071.Google Scholar
Fitzpatrick, TB (1988) The validity and practicality of sun-reactive skin types I through VI. Arch Dermatol 124, 869871. doi: 10.1001/archderm.124.6.869.Google Scholar
Radtke, MD, Poe, M, Stookey, J et al. (2021) Recommendations for the use of the Veggie Meter(R) for spectroscopy-based skin carotenoid measurements in the research setting. Curr Dev Nutr 5, nzab104. doi: 10.1093/cdn/nzab104.Google Scholar
Ermakov, IV & Gellermann, W (2012) Dermal carotenoid measurements via pressure mediated reflection spectroscopy. J Biophotonics 5, 559570. doi: 10.1002/jbio.201100122.Google Scholar
Roberts, WE (2009) Skin type classification systems old and new. Dermatol Clin 27, 529533, viii. doi: 10.1016/j.det.2009.08.006.Google Scholar
Koo, TK & Li, MY (2016) A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med 15, 155163. doi: 10.1016/j.jcm.2016.02.012.Google Scholar
Liljequist, D, Elfving, B & Skavberg Roaldsen, K (2019) Intraclass correlation – a discussion and demonstration of basic features. PloS One 14, e0219854. doi: 10.1371/journal.pone.0219854.Google Scholar
Hwang, JE, Park, JY, Jung, MH et al. (2023) Evaluation of a commercial device based on reflection spectroscopy as an alternative to resonance Raman spectroscopy in measuring skin carotenoid levels: randomized controlled trial. Sens 23, 7654. doi: 10.3390/s23177654.Google Scholar
Yamane, Y, Mochiji, M, Ichioka, S et al. (2022) Effects of water chestnut (Tarpa bispinosa Roxb.) extract/lutein on fingertip-measured advanced glycation endproduct/carotenoid levels. Free Radic Res 56, 282289. doi: 10.1080/10715762.2022.2085098.Google Scholar
Obana, A, Gohto, Y, Nakazawa, R et al. (2020) Effect of an antioxidant supplement containing high dose lutein, zeaxanthin on macular pigment, skin carotenoid levels. Sci Rep 10, 10262. doi: 10.1038/s41598-020-66962-2.Google Scholar
Martell, SG, Kim, J, Cannavale, CN et al. (2023) Randomized, placebo-controlled, single-blind study of lutein supplementation on carotenoid status and cognition in persons with multiple sclerosis. J Nutr 153, 22982311. doi: 10.1016/j.tjnut.2023.06.027.Google Scholar
Amoah, I, Cairncross, C & Rush, E (2023) Vegetable-enriched bread: pilot and feasibility study of measurement of changes in skin carotenoid concentrations by reflection spectroscopy as a biomarker of vegetable intake. Food Sci Nutr 11, 33763384. doi: 10.1002/fsn3.3327.Google Scholar
Jilcott Pitts, SB, Wu, Q, Truesdale, KP et al. (2018) One-year follow-up examination of the impact of the north carolina healthy food small retailer program on healthy food availability, purchases, and consumption. Int J Environ Res Public Health 15, 2681. doi: 10.3390/ijerph15122681.Google Scholar
Biddle, MJ, Kang, JJ, Brewer, D et al. (2021) Reducing cardiovascular risk among participants in a community supported agriculture program. Circulation 144, A10560. doi: 10.1161/circ.144.suppl_1.10560.Google Scholar
Khalil, M & Moore, C (2022) The impact of an after-school Garden Enhanced Education (GENE) on skin carotenoids as a measure of vegetable consumption on diverse, low-income children. Cur Dev Nutr 6, 60. doi: 10.1093/cdn/nzac049.006.Google Scholar
Bayles, J, Peterson, AD, Jilcott Pitts, S et al. (2021) Food-based Science, Technology, Engineering, Arts, and Mathematics (STEAM) learning activities may reduce decline in preschoolers’ skin carotenoid status. J Nutr Educ Behav 53, 343351. doi: 10.1016/j.jneb.2020.10.017.Google Scholar
Ling, J, Suriyawong, W, Robbins, LB et al. (2024) FirstStep2Health: a cluster randomised trial to promote healthy behaviours and prevent obesity amongst low-income preschoolers. Pediatr Obes 19, e13122. doi: 10.1111/ijpo.13122.Google Scholar
Ermakov, IV & Gellermann, W (2010) Validation model for Raman based skin carotenoid detection. Arch Biochem Biophys 504, 4049. doi: 10.1016/j.abb.2010.07.023.Google Scholar
Bernstein, PS, Zhao, DY, Wintch, SW et al. (2002) Resonance Raman measurement of macular carotenoids in normal subjects and in age-related macular degeneration patients. Ophthalmology 109, 17801787. doi: 10.1016/s0161-6420(02)01173-9.Google Scholar
Food and Drug Administration (2020) Blood Glucose Monitoring Test Systems for Prescription Point-Of-Care Use; Guidance for Industry and Food and Drug Administration Staff. Rockville, MD: Food and Drug Administration.Google Scholar
Longevity Link Corporation (2016) The Veggie Meter. Salt Lake City, UT: Longevity Link Corporation. http://www.longevitylinkcorporation.com/products.html (accessed october 2025).Google Scholar
Madore, MP, Hwang, JE, Park, JY et al. (2023) A narrative review of factors associated with skin carotenoid levels. Nutrients 15, 2156. doi: 10.3390/nu15092156.Google Scholar
Figure 0

Table 1 Mean, sd, frequency of participant descriptive characteristics across 8 sites (2022–2023)

Figure 1

Figure 1 (a) Forest plot demonstrating CV and 95 % CI for Veggie Meter® device 1 at each site. Bold, dashed line indicated the hypothesised CV. (b) Forest plot demonstrating CV and 95 % CI for Veggie Meter® Device 2 at each site. Bold, dashed line indicated the hypothesised CV. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center.

Figure 2

Table 2 Intra- and inter-device reliability output across 8 sites

Figure 3

Table 3 Individual Veggie Meter® skin carotenoid scores and average for each site

Figure 4

Figure 2 Inter-device intra class correlation coefficients and 95 % CI for each site. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center.

Figure 5

Figure 3 Inter-device absolute relative difference and 95 % CI for each site. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center. Bold, dashed line indicated the hypothesized absolute relative difference.

Figure 6

Figure 4 Equivalence of both Veggie Meter® devices and 95 % CI for each site. Bold, dashed line indicated the hypothesized equivalence. AR, Arkansas; OSU, Oklahoma State University; OUHSC, University of Oklahoma Health Sciences Center.

Supplementary material: File

Sisson et al. supplementary material

Sisson et al. supplementary material
Download Sisson et al. supplementary material(File)
File 34.5 KB