Introduction
Using information-based labels to inform consumers can help consumers make better choices (Roe and Sheldon Reference Roe and Sheldon2007; Bonroy and Constantatos Reference Bonroy and Constantatos2014; Lusk and McCluskey Reference Lusk and McCluskey2018). However, the lack of knowledge regarding certification standards can reduce the informativeness (Harbaugh et al. Reference Harbaugh, Maxwell and Roussillon2011). For example, the United States Department of Agriculture (USDA) provides the definition for natural meat products as those that do not contain artificial ingredients, added colors, and is minimally processed (Fortin Reference Fortin2016), but more than 60% of consumers wrongly believed meat labeled as ‘natural’ was raised without antibiotics, growth hormones, and genetically modified organisms during production (Syrengelas et al. Reference Syrengelas, DeLong, Grebitus and Nayga2018). For this reason, there has been a push among consumer groups and other organizations to have more transparent and stricter food labeling rules.
Existing research highlights consumers’ misperception and confusion surrounding food labels (Zepeda et al. Reference Zepeda, Sirieix, Pizarro, Corderre and Rodier2013; Garcia and de-Magistris Reference Gracia and de-Magistris2016; Brécard Reference Brécard2017; Ellison et al. Reference Ellison, Brooks and Mieno2017; Syrengelas et al. Reference Syrengelas, DeLong, Grebitus and Nayga2018; Lim et al. Reference Lim, Hu and Nayga2020). Consumers tend to overinterpret the quality that a label represents or struggle to understand one (Brécard Reference Brécard2017; McFadden and Lusk Reference McFadden and Lusk2018; Bernard et al. Reference Bernard, Duke and Albrecht2019; Asioli et al. Reference Asioli, Aschermann-Witzel and Nayga2020). For example, people believe that the grass-fed label means superior food safety (Lim et al. Reference Lim, Hu and Nayga2020). If consumers are expecting unsupported food safety benefits from such labels, then it is necessary to adjust the distortion created by the misperception. On the other hand, having strict labeling rules could weaken firms’ incentives to provide quality. Scott and Sesmerro (Reference Scott and Sesmero2022) find that misperception about labels can in fact benefit consumers and enhance efficiency due to firms’ strategic reactions to it. However, the relationship between misperception and welfare hinges upon the direction of misperception.
Given this, it is necessary to investigate whether adding additional clarification and certification could reduce misperception. We define misperception as the alignment between the labels and the values they represent. We focused on sustainability-related food labels because they are among the most frequently misinterpreted labels on the market. This study was conducted as an online survey with three information treatments: (1) label only; (2) label and description; or (3) label, description, and certification statement. Using a best-worst scaling (BWS) approach, we first sought to understand if preference shares would change based on information. We then calculated the correlation between food labels and food values to examine whether additional information could improve the alignment and reduce misperception. The rest of the paper is structured as follows: a review of selected label and claims, methodology and econometric models, survey and data, results, and conclusions.
Review of labels and food values
Consumer Reports conducted extensive market research on what consumers expect from a food label claim or seal. Consumers see claims as “words or phrases printed on the label such as ‘humanely raised’ or ‘no GMOs’”; and seals as “graphics combining a logo or an image with a short claim, such as the USDA organic seal” (Consumer Reports 2019, 1). We selected nine seals mentioned in the Consumer Reports (2019) and Ecolabel Index (2022). The nine seals are American Grassfed, Animal Welfare Approved, Non-GMO Project Verified, USDA Organic, Certified Humane Raised and Handled, American Humane Certified, One Health Certified,Footnote 1 Certified B Corporation, and Food Alliance Certified. The three claims included in this study were “All Natural or Natural,” “No Antibiotics,” and “Non-GMO.” Each seal or claim has different criteria for obtaining certification. The top four seals on the market with clear rules and rigorous verification include American Grassfed, Animal Welfare Approved, USDA Organic, and Non-GMO Project Verified. See Table A1 in the Appendix for a summary of the prior literature on the seals or claims included in this study.
Consumer Reports (2019) also conducted research on labeling by focusing on the aspects of food production highlighted on food labels that cause the most confusion. These aspects include reducing pesticides, reducing the use of drugs in farm animals, what farm animals eat, animal welfare, and reducing the use of genetically modified organisms.
Food values
Researchers have determined that consumer purchasing decisions are influenced by their preferred food values or food quality attributes. For example, Bazzani et al. (Reference Bazzani, Gustavsen, Nayga and Rickertsen2018) identified 12 food values to capture the main food quality attributes consumers focus on when making purchasing decisions. The food values were naturalness, safety, environmental impact, origin, animal welfare, fairness, nutrition, taste, appearance, convenience, novelty, and price. The study conducted by Bazzani et al. (Reference Bazzani, Gustavsen, Nayga and Rickertsen2018) was then replicated by Cerroni et al. (Reference Cerroni, Nayga, Pappalardo and Yang2021) to observe the malleability of food values. Our study expands upon Bazzani et al. (Reference Bazzani, Gustavsen, Nayga and Rickertsen2018) and Cerroni et al. (Reference Cerroni, Nayga, Pappalardo and Yang2021) by identifying if food value preference shares align with consumer preferences for 12 labels. Our study also expands on Cerroni et al. (Reference Cerroni, Nayga, Pappalardo and Yang2021) by identifying if food values were malleable based on the amount of information provided with the labels. Bazzani et al. (Reference Bazzani, Gustavsen, Nayga and Rickertsen2018) explained that food values are categorized into credence, experience, and price attributes. Credence attributes are characteristics that consumers cannot decipher by looking at the product, for example, sustainability and ethical issues (Fortin Reference Fortin2016; Bazzani et al. Reference Bazzani, Gustavsen, Nayga and Rickertsen2018). Experience attributes are characteristics that consumers can personally experience, for example, taste and convenience (Bazzani et al. Reference Bazzani, Gustavsen, Nayga and Rickertsen2018). Cerroni et al. (Reference Cerroni, Nayga, Pappalardo and Yang2021) created a table to visually represent the 12 food values in Table 1.
Table 1. Food values presented in the best-worst scaling survey

Note. Reprinted from Cerroni et al. (Reference Cerroni, Nayga, Pappalardo and Yang2021, 8).
Using the label definitions shown to participants during the study, the background research pertaining to each label, and the definitions for the food values (or attributes) as defined in the study, it was determined that the labels represented five of the 12 food values, as shown in Table A2 in the Appendix. For this study, the food value Naturalness was represented by American Grassfed, Non-GMO project verified, and USDA Organic. Safety was represented by USDA Organic. Environmental impact was represented by USDA Organic, One Health Certified, and Certified B Corporation. Animal Welfare was represented by Food Alliance Certified, Animal Welfare Approved, Certified Humane Raised & Handled, and American Humane Certified. Fairness was represented by Certified B Corporation and Food Alliance Certified.
Methodology
Best-Worst Scaling (BWS)
The BWS approach uses a series of choice sets made up of a subset of statements, attributes, or items to identify preference shares for the items in the subset. Respondents are asked to choose their most important (or preferred) and least important (or preferred) attribute, statement, or item among the choice set. The BWS approach was made popular by Finn and Louviere (Reference Finn and Louviere1992) and has been used by researchers from many research disciplines (e.g., Auger et al. Reference Auger, Devinney and Louviere2007; Flynn et al. Reference Flynn, Louviere, Peters and Coast2007; Lusk and Briggeman Reference Lusk and Briggeman2009). The BWS approach allows researchers to identify preference shares for each issue under consideration and conduct accurate comparisons of the preference shares. Following Bazzani et al. (Reference Bazzani, Gustavsen, Nayga and Rickertsen2018), this study uses the Case 1 mechanism of the BWS approach, where respondents are asked to select their most important and least important item among each choice set.
Treatment design and research objectives
Respondents were randomly assigned to one of three groups to determine the effect different types of information have on preference shares for different labels. The first group is the control group, where in the food label best-worst choice sets they only see a picture of the label. The second group is treatment one (T1), where in the food label best-worst choice sets they see a picture of the label and a description of what the label means. The third group is treatment two (T2), where in the food label best-worst choice sets they see a picture of the label, a description of what the label means, and a statement explaining if the label is verified. Table A3 includes each food label image, description, and verification statement included in the study. All three groups were asked the same food value questions, environmental questions, and a variation of the food sustainability label questions based on which group they were assigned to. See Figure A1 in Appendix for an example of a food label choice set shown to participants.
Survey design
In our study, the BWS is employed to evaluate food value and environmental labels applied on food products in the market. Twelve food values related to the main issues of food consumption are used: appearance, price, nutrition, novelty, convenience, origin, taste, naturalness, fairness, safety, animal welfare, and environmental impact (Bazzani et al. Reference Bazzani, Gustavsen, Nayga and Rickertsen2018; Cerroni et al. Reference Cerroni, Nayga, Pappalardo and Yang2021). The approach of partially balanced incomplete design (BIBD) is used to generate a design with an equal number of items, where each item is repeated the same number of times across the choice tasks. The same approach generates the experimental design for evaluating environmental food labels. The 12 environmental food labels commonly used in the United States were selected from the food label database, Ecolabel Index (2022), the largest global online directory of ecolabels, and were separated into 12 choice tasks.
The questionnaire is composed of four sections. The first section was comprised of 12 food label choice sets. Four labels were presented in each choice set, and each label was displayed four times in the first section. The order of choice sets was randomized across respondents to control for position bias (Campbell and Erdem Reference Campbell and Erdem2015). The second section comprised 12 food attribute (also called food value) choice sets. Four food attributes were presented in each choice set, and each attribute was displayed four times in the second section. The order of the first and second sections was randomized across respondents to control for order bias. The label choice options and the food attribute chose options within each choice set were also randomized to control for position bias to help prevent respondents from selecting only the higher positioned items in a choice set. The third section comprised the 15 revised New Ecological Paradigm (NEP) scale statements measuring a population’s environmental worldview (Anderson Reference Anderson2012). The NEP Scale questions cover five factors of the relationship between humans and the environment: balance, limits, anti-anthropocentrism, anti-exceptionalism, and eco-crisis (Dunlap et al. Reference Dunlap, Van Liere, Mertig and Jones2000). Respondents were asked to indicate their level of agreement or disagreement with each statement using a 5-point Likert-type scale format with 1 = strongly agree to 5 = strongly disagree. The final section of the survey included sociodemographic questions and food purchase behavior questions.
We targeted our sample from the general US population by using two screening questions: 1) Are you 18-year-old or older? 2) Have you purchased chicken in the last 6 months? Only participants who responded “Yes” to both questions were considered valid respondents. Purchasing chicken was chosen as a screening variable because it was reported as the most consumed type of meat in the US, which would encompass a diverse participant pool (Shahbandeh Reference Shahbandeh2021). At the beginning of our survey, we asked each individual to complete an online consent form and asked them to promise to read all questions and information carefully and provide their best answers. A text “cheap talk” was provided to every respondent before starting choice tasks to reduce the hypothesis bias (Tonsor and Shupp Reference Tonsor and Shupp2011; Ellis et al. Reference Ellis, Delong, Jensen and Griffith2021). In order to control for order effect, we randomized the order of food value BWS and environmental food label BWS choice tasks. Attention check questions, including instructed response attention check questions, were included in the survey to ensure all respondents included in the analysis were attentive throughout the survey (Gummer et al. Reference Gummer, Roßmann and Silber2021).
Econometric model
 Responses from the BWS of food value and environmental food label are analyzed using a random parameter logit (RPL) model following Lusk and Briggeman (Reference Lusk and Briggeman2009) and Cerroni et al. (Reference Cerroni, Nayga, Pappalardo and Yang2021). In the model, we assume that there are J items presented in each choice task set t, then the number of possible pairs of items is J(J−1). We define the observable level of importance of the item j as 
 ${\lambda _j}$
, and then the unobservable level of importance is
${\lambda _j}$
, and then the unobservable level of importance is 
 ${I_{ij}} = {\lambda _j} + {\varepsilon _{ij}}$
, where i stands for respondent i and
${I_{ij}} = {\lambda _j} + {\varepsilon _{ij}}$
, where i stands for respondent i and 
 ${\varepsilon _{ij}}$
 is a random error.
${\varepsilon _{ij}}$
 is a random error.
 All the models in our study are consistent with random utility theory (McFadden Reference McFadden1974). The idiosyncratic error 
 ${\varepsilon _{ij}}$
 is independent and identically distributed extreme value type 1. The probability of respondent i selects item j as the most important and item k as the least important in choice task t compared to other M = J(J−1)−1 possible pairs can be presented by:
${\varepsilon _{ij}}$
 is independent and identically distributed extreme value type 1. The probability of respondent i selects item j as the most important and item k as the least important in choice task t compared to other M = J(J−1)−1 possible pairs can be presented by:
 $${P_{ijkt}} = {\rm{exp}}\left( {{\lambda _{ijt}} - \;{\lambda _{ikt}}} \right)/\mathop \sum \limits_{l = 1}^J \mathop \sum \limits_{m = 1}^J \exp \left( {{\lambda _{ilt}} - \;{\lambda _{imt}}} \right) - J$$
$${P_{ijkt}} = {\rm{exp}}\left( {{\lambda _{ijt}} - \;{\lambda _{ikt}}} \right)/\mathop \sum \limits_{l = 1}^J \mathop \sum \limits_{m = 1}^J \exp \left( {{\lambda _{ilt}} - \;{\lambda _{imt}}} \right) - J$$
 Our models allow heterogeneity in preferences for the various food labels and food values, and assume that estimated parameters 
 ${\lambda _j}$
are following a multinormal distribution. The RPL models are estimated using the gmnl package in R version 1.3.1073 (Sarrias and Daziano Reference Sarrias and Daziano2017). The share of preference for each value or label, the predicted probability of that value or label selected as the most important one, is calculated by:
${\lambda _j}$
are following a multinormal distribution. The RPL models are estimated using the gmnl package in R version 1.3.1073 (Sarrias and Daziano Reference Sarrias and Daziano2017). The share of preference for each value or label, the predicted probability of that value or label selected as the most important one, is calculated by:
 $$P{S_j} = {\rm{\;}}{{{e^{{{\widehat \lambda }_j}}}} \over {\mathop \sum \nolimits_{k = 1}^j {e^{{{\widehat \lambda }_l}}}}}$$
$$P{S_j} = {\rm{\;}}{{{e^{{{\widehat \lambda }_j}}}} \over {\mathop \sum \nolimits_{k = 1}^j {e^{{{\widehat \lambda }_l}}}}}$$
where 
 $\widehat \beta $
 is the mean of estimated individual parameter. The total of the share of preferences must be one.
$\widehat \beta $
 is the mean of estimated individual parameter. The total of the share of preferences must be one.
Data
The questionnaire was administered online between October 21, 2021, and November 1, 2021, via Dynata. Our sample consists of 1,200 US consumers. Ninety percent of respondents were able to complete the survey in under 24.2 minutes with the average time being 14 minutes. Respondents who spent less than 5 minutes or more than 60 minutes on the survey were excluded from the analysis. Participants who spent less than 5 minutes on the survey may not have thoroughly read the questions or provided sincere responses, given that the survey typically took around 14 minutes to complete. On the other hand, respondents who spent more than 60 minutes might also have been less likely to provide accurate answers due to the long period taken to finish the survey. Respondents who did not answer the attention check questions correctly were also removed. The final analysis sample contained 1,158 surveys. Demographic and socioeconomic characteristics of our sample and the US population are shown in Table 2. Overall, our sample is representative of the US population. However, the percentage of Hispanic or Latino individuals in the population (18%) is higher than our sample (7.5%). The distribution of gender, place of residence, and education was fairly similar in both the sample and US population. Approximately 67% of the sample had an annual income equal or below the US median income of $69,717.
Table 2. Demographic and socioeconomic distribution

Note. US population data were extracted from the United States Census Bureau (United States Census Bureau 2021).
 Table 3 provides the balance test across treatment groups. Out of the sample of 1,158 US consumers, the majority were female (52%), white (62%), married (48%), earned a 4-year college degree (25%), and had a gross household income of less than $69,717 (67%). The political views identified by respondents included democrat (41%), followed by independent (30%), republican (26%), and other (3%), respectively. A 
 ${\chi ^2}$
 is performed between the control group and treatment groups to detect any significant difference. P values higher than 5% indicate that the sample is well balanced.
${\chi ^2}$
 is performed between the control group and treatment groups to detect any significant difference. P values higher than 5% indicate that the sample is well balanced.
Table 3. Balance test across treatment groups

Results
Identify consumer preference ranking for the 12 food labels
Results from the RPL model and preference share estimates for food labels are reported in Table 4. The most selected least important label, B corporation, was used as the baseline for the food labels. See Table A4 in the Appendix for a summary table of the percentage breakdown of labels chosen as most important and least important by group and for the full sample. The ranking of food labels across treatment groups was dissimilar as expected. The top three labels for the control group were “No antibiotics,” “Natural,” and Non-GMO Project Verified. For groups T1 and T2, the USDA Organic label was ranked first, followed by One Health Certified. T1 ranked American Grassfed third, while T2 ranked Non-GMO Project Verified third. Figure 1 provides a graphical representation of the preference shares attributed to the food labels by treatment group.
Table 4. Random parameter logit models for labels by treatment group a

Note. Although a lower AIC or BIC value is preferred, they are not absolute measures of model goodness-of-fit, but rather relative measures of model fit. The same goes for log-likelihood values. Since we do not compare model fit across different groups, we do not focus on these measures and their interpretations.
**p < 0.01, * p < 0.05.
a Standard error in brackets.

Figure 1. Preference shares for food labels with 95% confidence interval by treatment group.
As shown in Fig. 1, the food labels with the most preference shares from the control group included two claims, “no antibiotics” and “Natural”, and two seals USDA Organic and Non-GMO Project Verified. The 95% confidence intervals are calculated by using Bootstrap. Among these four labels, the preference share of “Natural” was not significantly different from others at the 0.05 level. However, “no antibiotics” was significantly different from the two seals at the 0.05 level. Treatment 1 respondents, who were shown the label picture and description, attributed the most preference shares to four seals, USDA Organic, One Health Certified, American Grassfed, and Food Alliance Certified, respectively. There was no significant difference in preference shares between USDA Organic and One Health Certified at the 0.05 level. But both of them were significantly different from American Grassfed and Alliance Certified. The food labels with the most preference shares from the Treatment 2 group were USDA Organic, Non-GMO Project Verified, One Health Certified, and Food Alliance Certified. In the Treatment 2 group, the preference share of USDA Organic was significantly different from the other 3 most preference shares. The B Corporation seal and “Non-GMO” claim were ranked among the lowest across all treatment groups. The control group ranked American Grassfed much lower than the other two groups, while T1 and T2 ranked “Natural” much lower than the control group.
Determine if consumer preference ranking changes by providing more information with the labels, including descriptions and verification statements
 As shown in Table 5, significant changes in preference shares were observed for different food labels across treatment groups (see appendix Table A5 for preference shares by treatment group). Interest in the Food Alliance Certified and American Grassfed labels increased as more information was provided. When compared to the control group, consumers in the T1 group increased interest in the Food Alliance Certified label (
 $\Delta S$
 = 0.055; p < 0.01), as did consumers in the T2 group (
$\Delta S$
 = 0.055; p < 0.01), as did consumers in the T2 group (
 $\Delta S$
 = 0.057; p < 0.01). When compared to the control group, consumers in the T1 group increased interest in the American Grassfed label (
$\Delta S$
 = 0.057; p < 0.01). When compared to the control group, consumers in the T1 group increased interest in the American Grassfed label (
 $\Delta S$
 = 0.079; p < 0.01), as did consumers in the T2 group (
$\Delta S$
 = 0.079; p < 0.01), as did consumers in the T2 group (
 $\Delta S$
 = 0.060; p < 0.01). Four labels lost importance as more information was provided, three of which were the claims included in the study. First, the claim “No Antibiotics” lost importance in the T2 group (
$\Delta S$
 = 0.060; p < 0.01). Four labels lost importance as more information was provided, three of which were the claims included in the study. First, the claim “No Antibiotics” lost importance in the T2 group (
 $\Delta S$
 = −0.102; p < 0.01). Second, “Natural” lost importance significantly in the T1 group (
$\Delta S$
 = −0.102; p < 0.01). Second, “Natural” lost importance significantly in the T1 group (
 $\Delta S$
 = −0.110; p < 0.01) and the T2 group (
$\Delta S$
 = −0.110; p < 0.01) and the T2 group (
 $\Delta S$
 = −0.085; p < 0.01). Finally, the claim “Non-GMO” lost importance significantly in the T1 group (
$\Delta S$
 = −0.085; p < 0.01). Finally, the claim “Non-GMO” lost importance significantly in the T1 group (
 $\Delta S$
 = −0.033; p < 0.01) and the T2 group (
$\Delta S$
 = −0.033; p < 0.01) and the T2 group (
 $\Delta S$
 = −0.026; p < 0.05). We observed that when all the information was provided about a label, T2 had less interest in “No Antibiotics,” American Humane Certified, and American Grassfed labels compared to T1.
$\Delta S$
 = −0.026; p < 0.05). We observed that when all the information was provided about a label, T2 had less interest in “No Antibiotics,” American Humane Certified, and American Grassfed labels compared to T1.
Table 5. Change in preference shares (
 $\Delta {\rm{S}}$
) for food labels across treatment groups
a
$\Delta {\rm{S}}$
) for food labels across treatment groups
a

**p < 0.01, * p < 0.05.
a Statistical significance levels are related to the results from Bootstrap and the Poe test (Poe et al. 2005).
Identify consumer preference ranking for the food attributes or values
Results from the RPL model and preference share estimates for food values are reported in Table 6. Fig. 2 provides a graphical representation of the food value preference shares by treatment group. The least important food value selected by most respondents, Novelty, was used as the baseline. All groups ranked Safety and Taste the highest compared to the other food values. Environmental impact is ranked slightly lower than convenience in T2. The ranking for all other food values was similar across treatment groups.
Table 6. Random parameter logit models for food values by treatment group a

Note. Although a lower AIC or BIC value is preferred, they are not absolute measures of model goodness-of-fit, but rather relative measures of model fit. The same goes for log-likelihood values. Since we do not compare model fit across different groups, we do not focus on these measures and their interpretations.
**p < 0.01, * p < 0.05.
a Standard error in brackets.

Figure 2. Preference shares for food values by treatment group.
As shown in Fig. 2, the three food values with the most preference shares across the groups include Safety, Taste, and Nutrition. The food values with the lowest preference shares across treatment groups were Convenience and Novelty.
Determine if there is a connection between food labels and food attributes
The correlation values between each food label and food attribute by treatment group are displayed in Table 7. The food attributes represented by the food labels were identified using the label and attribute definitions and the background research for the label (see Appendix Table A2 for more information). Overall, there were correlations among all three groups between USDA Organic and Naturalness; B Corporation and Fairness; and Animal Welfare Approved and Animal Welfare. Highly significant correlations were found between six food labels and food attributes within the control group (p < 0.01). Highly significant correlations were found between seven food labels and food attributes within the T1 group, followed by only two significant correlations in the T2 group (p < 0.01). Participants in the T1 and T2 groups were influenced by the information provided about the label. However, the certification statement shown to the T2 group shows that it will sometimes prove harmful to the perception of the label because consumers want to make their own decision. These correlations proved that information provided to the consumers could be beneficial, but too much information is unnecessary, as shown in the correlations between some labels in which the correlation coefficients decreased as more information was provided for the label. The labels and attributes with no significant correlation were: American Grassfed and Naturalness; USDA Organic and Environment; and, One Health Certified and Environment.
Table 7. Correlation between food labels and food attributes by treatment group

Note. Food label and food attribute combinations are based on the food attribute represented by the food label. For more information on these combinations see Table A2 in the Appendix.
**p < 0.01, * p < 0.05.
Heterogeneity in treatment effects
Two tests were used to determine if there was heterogeneity in treatment effects by looking at NEP scores and shopping frequency. The first test determined if preference shares were influenced by NEP scores (see Fig. 3). We separated the sample into two NEP categories: Low NEP group, in which their total scores of NEP items were less than the median of total scores in the whole sample; otherwise, they were classified in the High NEP group (Dsouza et al., Reference Dsouza, Fang, Yang, Kemper and Nayga2023; Zheng et al., Reference Zheng, Nayga, Yang and Tokunaga2023). The food labels with the most preference shares from the control group with low NEP scores included two claims (“No antibiotics” and “Natural”) and two seals (USDA Organic and Non-GMO Project Verified). The same was found in the control group using the pooled data set. However, the two seals with the most preference shares from the control group with high NEP scores were Certified Humane Raised & Handled and One Health Certified instead of USDA Organic and Non-GMO Project Verified. The respondents with high NEP scores from the T1 group had the same most preferred labels compared to all respondents from T1, such as One Health Certified, American Grassfed, USDA Organic, and Food Alliance Certified. There was a slight difference for respondents with low NEP scores from T1. The Food Alliance Certified seal had a slightly lower preference share among respondents with low NEP scores compared to all respondents from T1, which removed Food Alliance Certified from the top four labels. Similar results also were found in the T2 group. Respondents with high and low NEP scores from T2 had the same top four food labels compared to the whole sample from treatment 2: USDA Organic, One Health Certified, Non-GMO Project, and Food Alliance Certified. The preference shares for the claims “No antibiotics” and “Natural” in the control group were higher than in the other two groups regardless of low NEP and high NEP scores. This result was the same as what we found in the control group by using the pooled data set. The respondents in the control group with high NEP scores and low NEP scores had a higher preference share for the Non-GMO Project Verified seal than the respondents in treatment 1, who were provided a label description. There was no significant difference between the control group and treatment 2, who were provided label information and verification.

Figure 3. Preference shares for food labels by treatment group based on high vs. low NEP scores.
We also examined if there is heterogeneity in treatment effects across shopping frequency. Participants who indicated shopping for groceries more than once a week were considered frequent shoppers; otherwise, they were considered as infrequent shoppers. As shown in Fig. 4, the frequency of shopping had minimal effect on the preference shares by treatment group when compared to the pooled preference shares for each group. B Corporation had a significant difference in preference shares between infrequent and frequent shoppers for participants in the control and T1 groups (Table 8, preference shares see Table A6 in appendix). The T2 group had the most labels with significant changes in preference shares (p < 0.05), including the labels Food Alliance Certified, American Grassfed, Non-GMO Project Verified, B Corporation, and the “Natural” claim. When compared to the pooled preference shares attributed to the labels by each treatment group, the most preferred labels were similar even when taking shopping frequency into account. The control group’s top four labels for frequent shoppers were the same as the control group’s pooled label ranking. The top four labels for infrequent shoppers within the control group were similar, but organic was removed from the top four and replaced with Certified Humane Raised and Handled. The top four labels for frequent and infrequent shoppers in the T1 and T2 groups were the same as their groups pooled label ranking.

Figure 4. Preference shares for food labels by treatment group based on shopping frequency.
Table 8. Change in preference shares (
 $\Delta {\rm{S}}$
) for food labels based on infrequent vs. frequent shopping by treatment groups
a
$\Delta {\rm{S}}$
) for food labels based on infrequent vs. frequent shopping by treatment groups
a

**p < 0.01, * p < 0.05.
a Statistical significance levels are related to the results from the Poe test (Poe et al. Reference Poe, Giraud and Loomis2005).
In summation, the heterogeneity checks revealed variation in preference for the labels based on high and low NEP scores. However, shopping frequency was found to have minimal influence on preference share by treatment group when compared to the pooled preferences.
Conclusions
The goal of the study was to identify consumer preference shares for 12 sustainability-related food labels and 12 food values (or attributes). Participants were separated into three groups to identify the effect of more information on food label preference shares. A BWS approach was applied to identify consumer preferences for food labels and food values. The other goal of the study was to identify if a correlation exists between food labels and food values.
The labels included in this study are popular in the U.S. market, and most have been studied as the labels contributing to consumer misperception. Our results imply that consumers do not fully understand the standards or verification process of a label by simply seeing the logo or image. As more information was provided for the T1 and T2 groups, the preference shares changed for each label (seal or claim). The claims “Natural” and “No Antibiotics” are among the most misinterpreted labels on the market. The results of this study provide further support for that statement by showing the extent to which importance decreased for those claims as more information was provided to groups T1 and T2. The USDA Organic label received the most preference shares across our participant pool. This finding does not align with Ellison et al. (Reference Ellison, Brooks and Mieno2017), where the ‘Product is certified organic’ claim received the least preference shares, which highlighted consumer confusion surrounding the standards required for that claim. This finding could be attributed to the USDA doing a better job over the last 6 years of explaining the meaning of their label and the criteria for obtaining the label. Based on the observed change in preference shares, Food Alliance Certified, American Grassfed, and B Corporation should increase efforts focusing on consumer literacy for their labels to increase consumer interest.
As indicated by consumers in this study, the most important food values were Safety and Taste. These findings align with previous research on food values which determined that Safety and Taste are among the most important attributes for consumers when purchasing products (Lusk and Briggeman Reference Lusk and Briggeman2009; Bazzani et al. Reference Bazzani, Gustavsen, Nayga and Rickertsen2018; Cerroni et al. Reference Cerroni, Nayga, Pappalardo and Yang2021). The food attributes with the lowest preference shares across treatment groups were Convenience and Novelty.
The correlation values between food values and food labels within groups determined that perceived authoritative certification statements can harm the perception of the label because consumers want to make their own decisions based on the label image and description. The correlation values supported the idea that more information is useful as shown by the increase in preference share for labels when the definition of the label was added. Too much information, on the other hand, is unnecessary and can have an adverse effect on consumer perception of the label, consistent with the findings of Jacoby et al. (Reference Jacoby, Berning and Dietvorst1977), Lusk and Marette (Reference Lusk and Marette2012), McCluskey and Swinnen (Reference McCluskey and Swinnen2004), Salaün and Flores (Reference Salaün and K. Flores2001), Verbeke (Reference Verbeke2005). The researchers believe that consumer fatigue related to the number of certified labels displaying too much information can be overwhelming for consumers and will not positively affect their food value preferences.
This study was the first of its kind to determine consumer preferences for a large number of environmental food labels. This study should guide further research on the connection between food labels and food values. Future studies should test the robustness of our findings in other contexts (e.g., other countries).
Policy implications
This study further supports the notion that consumers could benefit from clear label standards to make informed purchasing decisions. Food policy efforts should require strict, clear label standards. Promoting clear labeling standards for sustainability-related ecolabels will benefit the environment and influence companies to adopt better practices. Companies will be more likely to adopt new standards if the certification will increase consumer preference for their product. Developing clear labeling standards could encourage companies to adopt sustainable practices because the consumers would be more likely to understand the standards needed to receive certification for a specific label.
Enhancing consumer understanding of food labels offers benefits to consumers, retailers, and producers alike. When consumers demonstrate a preference or comprehension of a label, companies are more likely to pursue certification as it distinguishes their products in the market. As evidenced in this study, consumer preferences can shift with the provision of additional information for certain labels. However, it is important to strike a balance, as excessive information may overwhelm consumers. Thus, labels should provide clear descriptions of certification requirements to enhance consumer knowledge. Transparent and effective communication about these requirements can also foster consumer trust in a company or brand (Smith, Reference Smith2019). Pierce and Hartt (Reference Pierce and Hartt2019) suggest that certifications should be regarded as a strategic investment in marketing rather than solely driving sales, as they enable companies to better target specific consumer segments. The authors further highlight that certifications establish trust by signaling the “presence or absence of qualities that consumers seek or avoid” (Pierce and Hartt Reference Pierce and Hartt2019, 23–24).
Consumer fatigue related to food labeling is a concern (Fang et al. Reference Fang, Nayga, Snell, West and Bazzani2019). Increasingly, more media outlets are citing the conflicting label and nutrition messages as the source of stress and fatigue on the consumer (Badaracco Reference Badaracco2012; Loria Reference Loria2017; Nunes Reference Nunes2017; and Visser Reference Visser2019). Consumer fatigue has also been linked to causing voters to avoid labeling initiatives (Gunlock Reference Gunlock2015).
The Food and Drug Administration (FDA) has recently proposed a new definition and regulations for labeling products as “healthy,” indicating their commitment to addressing consumer confusion and misperception of food labels (Reiley Reference Reiley2022). However, policy makers should not limit their efforts to redefining terms solely for front-of-package labels. It is crucial to design labels with the underlying food values in mind. Our study findings reveal that consumers perceive a label as less significant when it fails to align with their core food values. Therefore, policy makers should consider this aspect when developing guidelines to mitigate consumer misperception and misinterpretation of labels, including seals and claims.
Food labeling alone is no longer sufficient to drive lasting consumer behavior change towards sustainable and healthy foods, primarily due to the growing consumer fatigue and skepticism surrounding food labels. To foster lasting behavior change, food labeling can be complemented by other interventions, such as taxes and subsidies, community-based initiatives, and targeted interventions tailored to individual lifestyles, in order to avoid relying on a one-size-fits-all approach (Osman and Jenkins Reference Osman and Jenkins2021, 44). Another effective strategy for influencing consumer behavior is to raise awareness about the potential risks associated with certain products. For instance, Chile implemented a black stop sign as a front-of-package label on unhealthy foods in 2016 to discourage their consumption or purchase (Correa et al. Reference Correa, Fierro, Reyes, Taillie, Carpentier and Corvalán2022). When designing labels to alert consumers to product risks, it is crucial to provide clear explanations of the risk levels to prevent individuals from either becoming overly cautious or disregarding the warnings entirely, as recommended by Robinson et al. (Reference Robinson, Viscusi and Zeckhauser2016). One approach to indicate varying levels of risk is by using traffic light labeling, which employs a color-coded system (e.g., green, yellow, and red) to convey low, medium, and high levels of potential harm to individuals or the environment (e.g., sugar or sodium content, carbon emissions). This method helps differentiate and communicate different levels of risk effectively.
Limitations
This study has a limitation in that it does not allow for the calculation of the extent to which preference is attributed solely to the picture (logo) compared to the accompanying text description included in the logo. Future research could address this limitation by incorporating a control group that includes text descriptors in the design. It would be valuable to explore how preference shares are influenced by various logo design elements, such as color, esthetic, and the specific text used in the logo. Another potential limitation of the study is related to survey length, which could contribute to survey fatigue. Since the average completion time is approximately 14 minutes, it is unlikely that our survey had a negative impact on some participants’ responses due to fatigue.
Supplementary material
The supplementary material for this article can be found at [https://doi.org/10.1017/age.2023.28].
Data availability statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Funding statement
This work is supported by AFRI Sustainable Agricultural Systems Coordinated Agricultural Project (SAS-CAP) grant no. 2021-68012-35917 from the USDA National Institute of Food and Agriculture.
Competing interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
 
 













