1 Introduction
Dietary choice has been a focal interest in many areas of research. Many people struggle to find the correct balance between taste and health considerations in these routine decisions. Goal-directed decision-making requires the decision maker to value each option based on relevant factors such as hunger state and health goals (Reference RangelRangel, 2013). Previous studies have shown that external cues are important determinants of product valuation and choice (Reference Borgmeier and WestenhoeferBorgmeier & Westenhoefer, 2009; Reference Bruce, Bruce, Black, Lepping, Henry, Cherry and SavageBruce et al., 2014; Reference Enax, Weber, Ahlers, Kaiser, Diethelm, Holtkamp and KerstingEnax, Weber et al., 2015; Reference Enax, Weber, Ahlers, Kaiser, Diethelm, Holtkamp and KerstingEnax et al., 2015; Fernqvist & Ekelund Axelson, 2013; Reference Hübl and TriftsHübl & Trifts, 2000; Reference Moser, Raffaelli and ThilmanyMoser et al., 2011; Reference Thaler and SunsteinThaler & Sunstein, 2008; Reference Trudel and MurrayTrudel & Murray, 2011). In the realm of dietary choice, nutrition information labels have been intensively studied, as they serve a decisive role in conveying health attributes of a product (Reference Sonnenberg, Gelsomin, Levy, Riis, Barraclough and ThorndikeSonnenberg et al., 2013; Reference Temple and FraserTemple & Fraser, 2014) and are important cornerstones of successful public policy interventions (Reference Hawkes, Smith, Jewell, Wardle, Hammond, Friel and KainHawkes et al., 2015). Nutrition labels are perceived as highly credible and are used to guide food selections (Reference Campos, Doxey and HammondCampos et al., 2011), especially when a food’s healthfulness is ambiguous (Reference Graham and JefferyGraham & Jeffery, 2012). Providing health information has been shown to increase hedonic liking ratings of products (Reference Annett, Muralidharan, Boxall, Cash and WismerAnnett et al., 2008; Reference Sabbe, Verbeke, Deliza, Matta and Van DammeSabbe et al., 2009), however, other studies provide opposing evidence (Reference Ng, Stice, Yokum and BohonNg et al., 2011; Reference Raghunathan, Naylor and HoyerRaghunathan et al., 2006; Reference Wansink and ChandonWansink & Chandon, 2006). A systematic literature review concluded that consumers can more easily interpret and select healthier choices when confronted with front-of-package labels that incorporate text as well as symbolic color to indicate nutrient levels rather than labels that only include numeric information (Reference Hersey, Wohlgenant, Arsenault, Kosa and MuthHersey et al., 2013). While consumer interest in nutrition information on foods is high (Reference Grunert and WillsGrunert & Wills, 2007), the actual use of nutrition labels in real-world settings is more ambiguous, and gaps between reported and actual use are likely (Reference Cowburn and StockleyCowburn & Stockley, 2005; Reference Gorton, Ni Mhurchu, Chen and DixonGorton et al., 2009; Reference Grunert, Fernández-Celemín, Wills, Storcksdieck genannt Bonsmann and NureevaGrunert et al., 2010). Nutrition label use varies between product types, socioeconomic status, education, and demographic characteristics (Reference Graham and JefferyGraham & Jeffery, 2012). Further, the visual saliency of the label itself influences label use, with higher saliency leading to increased fixation likelihood (Reference Graham, Orquin and VisschersGraham et al., 2012; Reference Orquin, Scholderer and JeppesenOrquin et al., 2012).
When comparing color-coded traffic light (TL) labels directly with other labeling methods, TL labels scored higher in terms of enabling the identification (Reference Borgmeier and WestenhoeferBorgmeier & Westenhoefer, 2009; Reference Gorton, Ni Mhurchu, Chen and DixonGorton et al., 2009; Reference Hawley, Roberto, Bragg, Liu, Schwartz and BrownellHawley et al., 2013; Reference Jones and RichardsonJones & Richardson, 2007; Reference Kelly, Hughes, Chapman, Louie, Dixon and CrawfordKelly et al., 2009; Reference Roberto, Bragg, Schwartz, Seamans, Musicus, Novak and BrownellRoberto et al., 2012) and selection (Reference Thorndike, Sonnenberg, Riis, Barraclough and LevyThorndike et al., 2012; Reference van Herpen and Trijpvan Herpen & Trijp, 2011) of healthier food items, possibly by prompting individuals to consider the health costs of products more strongly (Reference Sonnenberg, Gelsomin, Levy, Riis, Barraclough and ThorndikeSonnenberg et al., 2013; Reference Trudel, Murray, Kim and ChenTrudel et al., 2015). Importantly, consumers’ health evaluations of products have been shown to predict consumption (Reference Trudel, Murray, Kim and ChenTrudel et al., 2015). Critically, TL labeling has been shown to increase sales of healthy items and decrease sales of unhealthy items in longitudinal field studies, across socioeconomic status and ethnicity (Reference Levy, Riis, Sonnenberg, Barraclough and ThorndikeLevy et al., 2012; Reference Sonnenberg, Gelsomin, Levy, Riis, Barraclough and ThorndikeSonnenberg et al., 2013; Reference Thorndike, Sonnenberg, Riis, Barraclough and LevyThorndike et al., 2012).
While this evidence suggests that TL labeling is effective for increasing health considerations, it still remains unclear exactly how this is occurring. It is important to understand how these labels are changing people’s behavior, in order to develop even better interventions and to address similar problems in other domains. However, it is difficult to draw conclusions about the cognitive mechanisms underlying decisions using only choice data (Reference Helfer and ShultzHelfer & Shultz, 2014) because there are often multiple theories that can accommodate the same choice results, but that make different predictions about other measures. Therefore, researchers have begun to study the mechanism by which TL labels influence food choice. For example, TL labels may attract more attention and thus receive more weight in the choice process (Reference Fehr and RangelFehr & Rangel, 2011; Reference Hare, Malmaud and RangelHare et al., 2011) similar to how salient smoking warnings are more effective at reducing smoking (Reference Borland, Wilson, Fong, Hammond, Cummings, Yong and McNeillBorland et al., 2009). Indeed, a recent eye-tracking study revealed that the use of colors in nutrient-specific labels increases attention to the labels, which are thus more salient, compared to monochromatic labels (Reference Becker, Bello, Sundar, Peltier and BixBecker et al., 2015). Critically, attention has been shown to mediate the effect of nutrition labels on choice (Reference Bialkova, Grunert, Juhl, Wasowicz-Kirylo, Stysko-Kunkowska and van TrijpBialkova et al., 2014). Other research using a functional magnetic resonance imaging (fMRI) study has shown that TL labels seem to enhance the coupling between brain regions associated with valuation and self-control (Reference Enax, Hu, Trautner and WeberEnax, Hu et al., 2015).
What this previous work leaves unanswered, is how exactly health information and taste preferences are integrated in the choice process, and how the comparison process between items is influenced by the more salient TL display of information. A choice bias could reflect that the healthiness of the items is considered throughout the choice process, a single thought of “I should choose the healthier item”, or a response bias caused by, perhaps, an experimenter demand effect. It would be difficult to distinguish between these explanations with just choice behavior, but with response-time (RT) data we can do exactly that. Inspired by computational simulations of the choice process in decisions between food items along with nutrition labels (Reference Helfer and ShultzHelfer & Shultz, 2014), we chose to actually study choice and RT data using a drift diffusion model (DDM) of the choice process. The DDM decomposes choice and RT data into psychologically meaningful parameters, which can be used to infer cognitive processes (Reference Voss, Nagler and LercheVoss et al., 2013) such as response caution, response bias, and noise.
Recent modeling of binary choice experiments has suggested that decisions are formed by the continuous accumulation of evidence towards one of two decision thresholds, which is consistent with the framework of sequential sampling models such as the DDM (Reference BogaczBogacz, 2007; Reference Busemeyer and TownsendBusemeyer & Townsend, 1993; Reference RatcliffRatcliff, 1978). The DDM assumes that information is sampled continuously until sufficient evidence is accumulated for one of the available options, relative to the other. In detail, the information sampling is described by a Wiener Diffusion Process characterized by a constant rate of evidence accumulation (i.e., drift) towards one of two boundaries, combined with Gaussian noise (Reference Miller and CassadyRatcliff & Smith, 2004). While DDMs have been traditionally applied in perceptual decision-making, recent studies have used evidence accumulation models to also analyze value-based decisions (Reference Busemeyer and TownsendBusemeyer & Townsend, 1993; Reference De Martino, Fleming, Garrett and DolanDe Martino et al., 2013; Reference Krajbich, Armel and RangelKrajbich et al., 2010). Research suggests that using RTs, in addition to choice data, can improve predictions of subjects’ preferences and shed light on the mechanism how different attributes are incorporated in the decision (Reference Krajbich, Oud and FehrKrajbich et al., 2014; Reference Taubinsky, Morris, Schuldt, Chabris and LaibsonTaubinsky et al., 2009). The stochasticity in value-based decision-making is thought to arise, at least in part, from the noise in how our brain represents the choice options (Reference Krajbich, Oud and FehrKrajbich et al., 2014).
Here we build on previous applications of DDM to food choice by Krajbich, Rangel, and colleagues. In that work they study how people choose between food items based on independently collected “liking ratings” on a simple Likert scale. They assume that in value-based decisions, the decision makers cannot immediately access their preferences for each option, but slowly determine their preferences by accumulating and comparing evidence for the options until a predetermined level of confidence is reached. As before, this evidence accumulation process is modeled as Wiener diffusion of a net evidence variable, referred to as the relative decision value (RDV). Importantly, the average rate of evidence accumulation depends on the underlying subjective value difference of the two options, and is in our case dependent on the sensitivity to health and the weight on taste attributes.
To investigate the relative effects of health and taste concerns, we follow the approach used by Reference Philiastides and RatcliffPhiliastides and Ratcliff (2013) who investigated the effects of brand labels on clothing choice. In that work, the authors assumed that the binary relative quality of the brand label (better vs. worse) would influence the rate of evidence accumulation of the items (lower for the worse brand, higher for the better brand). By analogy, here we assumed that the binary relative quality of the nutritional content (healthier vs. unhealthier) would influence the rate of evidence accumulation of the foods.
We tested the influence of two different nutrition labels, that is, a color-coded, and thereby more salient (Reference Becker, Bello, Sundar, Peltier and BixBecker et al., 2015), TL label and a purely information-based, guideline daily amount (GDA) label. Both labels are similar in size, have been compared in previous studies (Reference Gorton, Ni Mhurchu, Chen and DixonGorton et al., 2009; Reference Hamlin, McNeill and MooreHamlin et al., 2015; Reference Jones and RichardsonJones & Richardson, 2007; Reference Maubach and HoekMaubach & Hoek, 2008; Reference Savoie, Barlow Gale, Harvey, Binnie and PasutSavoie et al., 2013), and are preferred over very simplified health information, such as health logos (Reference Grunert and WillsGrunert & Wills, 2007). Based on previous studies (e.g., Thorndike et al., 2012; Reference van Herpen and Trijpvan Herpen & Trijp, 2011), we predicted that color-coded nutrition labels would increase healthy choices in a binary choice task. We then used the DDM to analyze the underlying choice process. If the nutrition information directly influences subjects’ preferences, this should be seen as changes in drift rate, but not in starting point biases or non-decision time (Reference Philiastides and RatcliffPhiliastides & Ratcliff, 2013), see Figure 1 (H1). An alternative hypothesis is that the salient TL labels might simply produce a stimulus-response association, irrespective of the items’ features. This would manifest as a bias in the starting point of the choice process, that is, a bias towards the decision threshold for the healthier item (H2). Further, we expect that salient nutrition labels decrease the weight on taste attributes and increase the sensitivity to health features.
2 Method
2.1 Subjects
44 subjects completed the experiment (mean age=23.72, SD=4.4). The sample size was chosen based on the assumed effect size of 0.4 from a prior study (Reference Enax, Hu, Trautner and WeberEnax, Hu, et al., 2015). For a hierarchical multiple regression analysis, and two levels of the predictor (TL vs. GDA), a sample size of 40 subjects would provide 90% power to detect a significant effect tested at α =0.05. We also conducted two additional experiments with single-nutrient nutrition labels, which are included in the Supplement. All subjects had normal or corrected-to-normal vision. In line with previous studies (Reference Hare, Malmaud and RangelHare et al., 2011; Reference Maier, Makwana and HareMaier et al., 2015), subjects were tested at varying times during the day but asked to fast four hours prior to the experiment to increase the value of food items (Reference Epstein, Truesdale, Wojcik, Paluch and RaynorEpstein et al., 2003) and standardize hunger levels. Subjects received €15 endowment for participation as well as their chosen product from a randomly selected choice trial.
2.2 Stimuli
A set of 50 healthy and 50 unhealthy packaged products were obtained from the internet and presented on a black background (resolution: 1920 × 1200 pixel). Nutrition labels were taken from the producer’s nutrition information for the product and included sugar, fat, saturated fat, salt, and calories. The labels were presented either numerically with percentages (GDA) or more saliently, using the color-coded TL; see Figure 2 for the stimuli. The GDA percentage values were extracted from the CIAA (CIAA (EU Food and Drink Confederation), 2014) and the TL guidance values from the Food Standards Agency’s website (Department of Health & Food Standards Agency (FSA), 2013). Note that calories were not colored, as no guidance values from the FSA exist. GDA and TL labels were of the same size and denoted the respective nutrition value per 100g. We used the classification of healthy versus unhealthy products as described in (Reference Enax, Hu, Trautner and WeberEnax, Hu, et al., 2015) based on the TL color classification scheme. Specifically, an item was considered healthy if it contained at least one green and no red coded nutrient, and unhealthy if it contained at least one red and no more than one green coded nutrient. No difference between naturally occurring sugar (e.g., fructose) and added sugar (e.g., sucrose) was made. We used the correct nutrition values of products, therefore, nutrition information could be “mixed”, in that a label was not completely green or red, but rather green (healthy) or rather red (unhealthy). The design was presented using z-tree (version 3.4.7; Reference FischbacherFischbacher, 2007)
2.3 Behavioral paradigm
In line with previous studies investigating the process of how people make choices between food items based on independently collected taste “liking ratings” on a simple Likert scale (Reference Krajbich, Armel and RangelKrajbich et al., 2010; Reference Maier, Makwana and HareMaier et al., 2015; Reference Philiastides and RatcliffPhiliastides & Ratcliff, 2013), we adopted this design but incorporated nutrition labels as additional modulators of value. Therefore, subjects first rated the taste of each product on a discrete Likert scale from –5 to 5 (–5= do not like at all, 5= like very much) in increments of 1. The items were presented in the center of the screen without any nutrition information. In the main task, subjects made binary choices between healthy and unhealthy food products on the left and right side of the display, see Figure 2. 350 pairs of healthy and unhealthy products were randomly generated. For each individual, once a product was coupled with a TL (or GDA) label, consecutive presentations of this product also occurred with that label. Product-label combinations were randomized across subjects. Whether the healthy product appeared on the left or right side was randomized. TL and GDA trials were interleaved. Trials were separated by an inter-trial interval of 1000 ms (showing a white fixation cross on a black background). No time limits were imposed on these tasks, but subjects were told to make a response as soon as they formed a decision. Subjects indicated their choice by clicking on a button labeled “left” or “right” below the products corresponding to the screen position with the preferred index finger on a standard computer mouse. Items were removed from the screen as soon as a choice was made.
2.4 Data analysis
Behavioral data were analyzed using R (R Core Team, 2013).
Data cleaning:
For each individual, we excluded trials in which the RT was two standard deviations above the individual mean RT, as those trials were likely contaminated by non-attention or distraction and are thus problematic for further analyses. As mean RTs are very sensitive to outliers, we first applied a cutoff of 30 s on the RTs. On average, 17 (SD = 5.4, range: 4–41 trials) out of 350 trials were excluded per subject in this experiment. (The effect of label [model “Label”, see below] on healthy choices is also significant when using all choices; Z=2.9, p=0.0037).
Regression analyis:
To analyze the overall effect of label (GDA versus TL) on healthy choices, a maximal logistic mixed-effects regression analysis was performed with healthy choice as the dependent variable, label type as an independent variable and subjects as random effects, to account for idiosyncratic variation due to individual differences (Reference WinterWinter, 2013), fit by maximum likelihood (Laplace approximation, model “Label”). We then also controlled for liking by adding rating as a covariate in the model (model “Label + Liking”). Subsequently, we tested for an interaction effect between subjective taste ratings and nutrition labels by adding ratings and the interaction between rating and label to the model (model “Label × Liking”). Further, RT data were analyzed using a maximal linear-mixed model. As RT distributions in these binary choices are highly skewed, we log-transformed the RT data. We used label (TL vs. GDA) as a fixed effect, subject as random effect and log-RT as dependent variable.
Diffusion model fits:
Diffusion modeling was performed using fast-dm (fast-dm-30, Heidelberg, Germany) as well as the RWiener package implemented in R (Reference WabersichWabersich, 2014) for analyzing the drift rate as a function of preferences as this analysis is currently not supported in fast-dm. We used the chi-square (χ2 ) algorithm for diffusion model fitting. In the DDM analyses, we were specifically interested in the following two research questions (RQ1 and RQ2):
RQ1:
If TL labels increase the drift rate towards healthier options, or alternatively if they induce a starting point bias, and
RQ2:
if TL labels increase the weight on health, and decrease the weight on taste attributes in the comparison process, compared to the GDA labels (see ω in the DDM equation below).
For all models, a positive drift indicates accumulation towards the “healthy” boundary, whereas a negative drift indicates that information is generally accumulated towards the “unhealthy” boundary. Similarly, a starting point parameter value greater than 0.5 indicates a starting point bias towards the “healthy” boundary, whereas a value below 0.5 indicates a starting point bias towards the “unhealthy” boundary.
For RQ1, we investigated whether TL labels increase the drift rate towards the healthier option. On a single-subject level, data were modeled across taste ratings. Because we presented TL and GDA trials in a random sequence, subjects could not anticipate which type of label would occur on the next trial, and decision boundaries could not be set beforehand. The model “Drift” included two drift rate parameters (for GDA and TL). We further included label-specific inter-trial variability in drift rates to account for the fact that each decision involves a unique pair of items (Reference Krajbich and SmithKrajbich & Smith, 2015; Reference Philiastides and RatcliffPhiliastides & Ratcliff, 2013). We let non-decision time and starting point vary across both labels and set the parameter accounting for variability in non-decision time and inter-trial variability in relative starting point to zero because this makes the estimation of the remaining parameters more robust, even in presence of inter-trial-variability in our data (Reference Voss, Voss and LercheVoss et al., 2015). We then compared the two drift rates for GDA and TL using a paired-samples t-test.
Testing for model fit:
Model fit was assessed using Monte Carlo simulations, which has, in comparison to graphical inspection, the advantage that it leads to a clear criterion for model fit to each subject (Reference Voss, Voss and LercheVoss et al., 2015). 1000 parameter sets from a multidimensional normal distribution defined by the covariance matrix of estimated parameter values were drawn using the mvtnorm package for R (Reference Genz, Bretz, Miwa, Mi, Leisch, Scheipl and HothornGenz et al., 2014). Then, for each of the 1000 parameter sets, a data set was simulated using the construct-sample tool of fast-dm and then re-fit with the same settings as used for the empirical data. The parameters from the empirical fit were then compared to these distributions of simulated data fits. Any subjects with parameter fits lying outside of the 95% confidence intervals from the simulated fits were excluded from further analysis (Reference Voss, Voss and LercheVoss et al., 2015). For completeness, we also present the quantile probability plots across subjects (Figure S1).
Alternative models:
As the behavioral effect could also be explained by other diffusion model parameters, suggesting a different mechanism for how labels are processed, we tested three alternative models. The model “Drift + Starting Point” included separate parameter estimates for drift rate and starting point bias for each label. The model “Drift + Non-decision” included separate parameter estimates for drift rate and non-decision time for each label. The model “Drift + Starting Point + Non-decision” included separate estimates for drift rate, starting point, and non-decision time for each label. All alternative models accounted for drift rate variability. Variability in non-decision time and inter-trial variability in relative starting point were again set to zero. We then tested for significant differences between TL and GDA using a paired-samples t-test.
Drift as a function of taste ratings:
For RQ2, we allowed the drift rate to vary as a function of the taste ratings on a single-subject level. We assumed that
where RDV is the relative decision value at time t, healthS is the sensitivity to health (intercept), and ω is the weight on the difference between the taste ratings of the healthy (H) and unhealthy (U) food item. ω multiplies the taste value difference between the healthy and unhealthy option and determines the relative importance of taste in the mean drift rate. The model assumes that it takes time to accumulate and compare evidence for the options until a pre-specified level of confidence is reached. The rate of evidence accumulation depends linearly on the difference between the underlying subjective taste values. For estimation purposes (because we did not have enough data in each bin to properly fit the model), we further binned the taste ratings into three coarse bins: unhealthy preferred [rating difference from –10 to –4], roughly equally liked [–3 to 3] and healthy preferred [4 to 10]. We also included Gaussian noise (ε ). See the Supplement for further logit analyses investigating whether the labels change the absolute or the relative weight of taste and health attributes, as well as rating-specific drift rates, which were calculated using a jackknifing procedure.
3 Results
3.1 Choice and reaction time data analyses
We found a significant effect of label on healthy choice (model “Label”, estimate (standard error, SE): 0.25 (0.08); Z=2.82, p <0.01, intercept: –0.09), with higher proportions of healthy choices in the TL compared to the GDA condition. The effect of label was still significant, and even larger (0.33), when statistically controlling for liking (model “Label + Liking”). As expected we found that liking ratings significantly affected choices (main effect label: estimate (SE): 0.33 (0.10); Z=3.43, p <0.001; main effect liking: estimate (SE): 0.55 (0.03); Z=17.15, p <0.001, intercept: 0.17).
Further, we found an almost significant interaction between ratings and label (model “Label × Liking”, interaction effect: estimate (SE): –0.05 (0.02) Z=1.75, p=0.08; main effect label: estimate (SE): 0.30 (0.10); Z=3.12, p=0.002; main effect liking: estimate (SE): 0.58 (0.03); Z=16.7, p <0.001; intercept: 0.18); see Figure 3. Note that the almost significant interaction term probably does not reflect a true psychological difference in the effect of the TL labels when taste healthy > unhealthy, but is rather the product of a ceiling effect where the healthy item is almost always being chosen, and so there is little room for the TL labels to have an additional effect. In other words, this is likely a removable interaction (Reference LoftusLoftus 1978; Reference Wagenmakers, Krypotos, Criss and IversonWagenmakers et al. 2012).
We also analyzed whether there was a difference in RT depending on the label using a mixed-effects linear regression analysis using log-transformed RT data. We found a trending effect of label on RTs in that subjects were somewhat faster in the GDA condition (t=1.43, p=0.16; mean log-RT for GDA=0.766, SD=0.55; mean log-RT for TL=0.78, SD=0.53)
3.2 Diffusion model analyses
For RQ1, we investigated whether drift rates differ between the two labels at a single-subject level (model “Drift”). The drift rate towards the healthy option is significantly higher for the TL label, compared to the numeric GDA label (t(43)=2.3, p=0.029; drift rate mean GDA=-0.10, TL=0.05)). Monte-Carlo simulations as well as quantile-probability plots were used to assess model fit. Since fast-dm minimizes the χ2 value, high χ2 values are indicative of a poor fit. We used the 95% quantile of the χ2 distribution and determined whether our values were below this criterion. All of our fits were below the obtained critical value, indicating an acceptable model fit in all cases; therefore, no subjects were excluded. See also the quantile probability plot across subjects (Figure S1 in the Supplement).
We then tested alternative models to investigate whether other diffusion model parameters can capture the observed behavioral effect, which would suggest a different underlying psychological process. We only find significant differences between drift rates but not in non-decision time or starting point bias for TL and GDA; see Table 1 and Figure 4.
a Standard error of the mean.
b Does not account for model complexity.
For RQ2, we let the drift rate vary as a function of the relative desirability of the taste of the products, that is, the difference between the taste of the healthy product and the taste of the unhealthy product. As expected, the salient TL labels increase the sensitivity (s) to health attributes (mean healthS GDA=0.002, SEM=0.012; mean healthS TL=0.093, SEM=0.013, t(43)=2.60, p=0.013). Also, salient labels reduce the weight (ω ) subjects place on taste attributes (mean ω GDA=0.77, SEM=0.01; mean ω TL=0.71, SEM=0.012; t(43)=2.331, p=0.021); see Figure 5 and also the additional analyses in the Supplement, where we further investigated whether the labels change the relative or absolute weight on health and taste attributes. Rating-specific drift rates were analyzed using a jackknifing procedure (see Supplement).
4 Discussion
In this study, we investigated the cognitive mechanism underlying value-based decisions with nutrition labels as modulators of value. As expected, the percentage of healthy choices increased when the product was labeled with a color-coded, compared to a purely numeric label. We further used drift diffusion modeling to draw conclusions about the underlying cognitive mechanism, which has not been addressed in previous studies. We found that the drift rate towards the healthier option is increased in case of color-coded labeling, compared to the purely numerical counterpart, suggesting that health information and taste preferences are integrated in the decision process. In contrast, we do not find evidence for a simple stimulus-response bias due to color-coded labels irrespective of the items’ features. Last, our data suggests that subjects put less weight on taste attributes, and more weight on health attributes when choosing between color-coded labeled products.
Manipulating the amount of attention paid to health features, for example via overt instruction (Reference Hare, Malmaud and RangelHare et al., 2011) or salient cigarette warnings (Reference Borland, Wilson, Fong, Hammond, Cummings, Yong and McNeillBorland et al., 2009), can increase the weight placed on health features, and thereby alter the choice process (Reference Fehr and RangelFehr & Rangel, 2011). Our traditional regression analyses revealed a higher probability to choose the healthy product when presented with more salient, color-coded labels. This is in line with previous studies that showed that color-coded labels increase the identification and choice of healthier options (Reference Borgmeier and WestenhoeferBorgmeier & Westenhoefer, 2009; Reference Hawley, Roberto, Bragg, Liu, Schwartz and BrownellHawley et al., 2013; Reference Hersey, Wohlgenant, Arsenault, Kosa and MuthHersey et al., 2013; Reference Kelly, Hughes, Chapman, Louie, Dixon and CrawfordKelly et al., 2009; Reference Roberto, Bragg, Schwartz, Seamans, Musicus, Novak and BrownellRoberto et al., 2012; Reference van Herpen and Trijpvan Herpen & Trijp, 2011). Schulte-Mecklenbeck and colleagues (2013) analyzed strategy use in information acquisition during food choices and found that choices are often based on very simple heuristics, which reduce computation time. As GDA labels are cognitively more demanding than TL labels, they likely provide information that is harder to process, which is in turn utilized less. In this study, we did not classify subject’s overt behavior into choice strategies. Therefore, future studies using strategy analysis in combination with actual process tracing data (e.g., eye-tracking or mouse-tracking) would be valuable to analyze, for example, if salient labels interfere with an automatic preference-based choice heuristic or actually promote a fully-informed choice strategy.
To our knowledge, this is the first study to also analyze how exactly health information is integrated into the decision-making process, and how this is changed by the more salient TL display, using empirical choice and RT data in a DDM. This type of decision is interesting because subjects need to combine information from pictorial stimuli (food products) as well as symbolic and numeric information (labels). Although DDMs have been used before in consumer contexts, it was not known a priori whether the DDM could account for the impact of nutrition information on the valuation process. Importantly, the DDM provides information above and beyond traditional logit analyses, as it estimates different parameters accounting for various decisional processes, informing us not only whether health information influences choices but also how exactly health information is incorporated into the decision. In particular, we investigated whether the salient health information influences the valuation process, or whether it induces a simple response bias. Our data support the hypothesis that salient, color-coded nutrition information directly influences the valuation process in favor of healthier options, as the behavioral effect of nutrition labels could only be explained by changes in drift rate, but not in starting point bias. This finding provides evidence that nutrition label information and taste preferences are incorporated into the valuation process, ruling out the alternative mechanism that these labels only induce an automatic stimulus-response choice bias. Further, we find that for saliently labeled products the weight on taste gets discounted, while the sensitivity to health increases.
In two additional experiments, subjects made the same binary choices, but products were labeled with simplified nutrition information, displaying only the amount of one nutrient, that is, sugar. Overall, subjects made less healthy choices when confronted with information on only one nutrient (sugar). The effects of simplified nutrition information were weaker, suggesting that more comprehensive, salient information is more effective (see Supplement and Table S1).
It is possible, given that we used the actual nutritional information for each product, that healthiness could be correlated with other features of the products. Thus the changes in behavior due to the TL vs. GDA labels cannot unambiguously be attributed to an increase in the weight on health information, though we do see this as the most likely explanation. Importantly, our use of real products paired with real nutritional information implies that, in any case, the use of TL labels in real-world applications should promote choosing healthy products.
As many food decisions occur automatically or habitually (Reference RangelRangel, 2013; Reference Wansink and SobalWansink & Sobal, 2007), nutrition labels may have a decisive role in triggering goal-directed decisions that incorporate not only taste considerations, but also long-term health outcomes. We demonstrate that salient labels increase the integration of health considerations into the decision process; salient nutrition labels may therefore interfere with automatic decision processes and trigger re-evaluation of the choice options. The results have obvious implications for public policy interventions. Environmental nudges, including understandable nutrition labels, are important pillars of public policy interventions aiming at improving dietary preferences and choices (Reference Hawkes, Smith, Jewell, Wardle, Hammond, Friel and KainHawkes et al., 2015). Salient TL nutrition labels seem to be a feasible option to increase the consideration of health attributes in every-day choice situations to encourage consumers to purchase the healthier product. Of course, the unnatural size and placement of nutrition labels may have influenced the valuation process. Previous studies have shown that display size is an important determinant of attention (Reference Bialkova and van TrijpBialkova & van Trijp, 2010), therefore, future studies with a more natural design are necessary. In addition, real-world choice alternatives include many other product attributes, next to nutrition labeling, as well as subjects’ individual characteristics, which were shown to influence nutrition label use and understanding (Reference Miller and CassadyMiller & Cassady, 2012). The impact of these factors and their interaction with nutrition labels warrants further investigation.
In sum, the results presented in this study provide insights into the nature of computational processes that take place during simple choices between two food products along with health attribute labels. Our results suggest that health information can be successfully coalesced with taste-preferences based on representational values during the decision-making process.