In the realm of public health, menu labelling (ML) is considered an important tool to inform consumers of the energy content of meals in the eating-out environment and to encourage favourable reformulation of menu items by the food-service industry, as a low-cost and broad-reaching obesity prevention strategy( 1 , Reference Nikolaou, Lean and Hankey 2 ). The initiative applies the principles of nutrition labelling to foods consumed outside the home through the disclosure of energy content at the point of purchase( Reference Mah, Vanderlinden and Mamatis 3 , Reference Pang and Hammond 4 ). ML applies to standardized food and beverage items sold at chain food establishments (with generally fifteen or more outlets). More than ever, ML is deemed necessary as the consumption of food outside the home increases( 1 , Reference Andersen 5 , 6 ) and most consumers underestimate the energy content of such foods( Reference Block, Condon and Kleinman 7 ), which are associated with increased energy intake( Reference Rosenheck 8 – Reference Lachat, Nago and Verstraeten 10 ) as well as unhealthy weight gain( Reference Pereira, Kartashov and Ebbeling 11 ) due to increased portion sizes and energy density. Overweight and obesity put individuals at greater risk of developing type 2 diabetes, CVD and certain cancers( 12 ), all of which incur great financial and social costs( 13 – Reference Hammond and Levine 15 ). Furthermore, the disclosure of energy values at the point of purchase has overwhelming public support in several countries, including those where ML has been implemented( Reference Mah, Vanderlinden and Mamatis 3 , Reference Liu, Roberto and Liu 16 – 19 ). Consumers otherwise rarely notice or access energy declarations provided by food-service establishments in any other format, including tray liners, food packaging or company websites( Reference Krieger and Saelens 17 , Reference Roberto, Agnew and Brownell 20 ), despite energy being sought-after nutrition information( Reference Vanderlee and Hammond 21 – Reference Storcksdieck and Wills 24 ).
Current European Union legislation on food labelling only requires nutrition declarations on pre-packed foods and beverages. According to a modelling study conducted by the Organisation for Economic Co-operation and Development, mandatory nutrition labelling of processed foods could reduce European obesity rates by 2·5 % and avert up to 15 million disability-adjusted life years( Reference Sassi, Cecchini and Lauer 25 ). The European Union legislation, however, does not extend to menu items prepared and served in catering outlets, despite the increase in foods eaten outside the home and strong consumer support for ML. A global survey conducted in 2011, including 250 000 respondents from fifty-six countries, revealed that 80 % desire ML either all the time or sometimes in fast-food establishments and full-service chain restaurants, with particularly high support for these measures in Latin America, North America and Europe( 26 ). Voluntary ML by the restaurant industry exists, however, in Great Britain, Northern Ireland and the Republic of Ireland( 19 , 27 , 28 ). The USA was the first country to implement nationwide mandatory ML in 2010, and the Food and Drug Administration issued a final rule to guide labelling for qualifying chain restaurants and food-retail establishments to sell away-from-home foods that will take effect in December 2016( 29 ). ML is also mandatory in some states of Australia( 30 – 32 ) and one single province in Canada( 33 ), while the initiative is being debated in Denmark and Asia( 34 ).
In addition to fast-food outlets, ML has evolved to incorporate standardized foods and beverages served in cafés, bakeries, juice bars, ice-cream parlours, table-service restaurants, convenience stores, cinemas and supermarkets, since such establishments increasingly contribute to overall energy intake( Reference Andersen 5 , 35 ). The expansion of ML, both geographically and with regard to the food outlets it applies to, has generated an array of new evidence as to its effects on consumer behaviour. Recent studies, often in real-world settings, add to the earlier literature, once predominated by studies conducted in experimental settings on fast-food selections in the USA.
Objective
The present review and meta-analysis aims to provide an update of the most recent evidence (studies published only between 2012 and 2014) of the effects of ML on energy consumed, ordered or selected by the general population, in both real-world and experimental settings. The review also aims to assess the importance of ML format and ML noticing by consumers as aspects impacting its effectiveness. The rationale for narrowing the inclusion dates was an attempt to capture results which may more accurately reflect consumers’ response to ML as they grow accustomed to the initiative and as industry adaptation to legislation is more widespread and compliant.
For the purpose of the present article, menu labelling (ML) is defined as the prominent display of energy values appearing on menus (or food tags, retail shelf displays and other promotional material), so it clearly relates to the item and its respective price. In accordance with other ML literature, the term ‘healthier’ refers to lower-energy food choices, since energy content is the most important nutrition aspect in relation to overweight and obesity( 1 ).
Methods
Literature search
The systematic literature review was conducted in two stages between May and November 2014. The first stage was carried out by one reviewer (J.A.L.) in May–June 2014 and consisted of a systematic literature search of the MEDLINE database. Two reviewers (J.A.L. and S.L.) carried out the second stage in October–November 2014, extending the MEDLINE search to Web of Science and Science Direct databases using the same search strategy.
A structured approach was used to identify studies assessing the effects of ML on energy selected, ordered or consumed. First, the search terms ‘menu board label*’, ‘menu label*’, ‘calorie label*’ and ‘energy label*’ were applied in all databases. Then, titles and abstracts of retrieved studies were screened for relevance in accordance with the defined inclusion criteria (see below). Reference lists of identified studies were also perused for relevant additional publications. Finally, in an effort to obtain evidence beyond the USA, government agencies, such as the Food Safety Authority of Ireland, and non-government organizations known to advocate ML, such as Dietitians of Canada and the Heart Foundation of Australia, were contacted. This review is not registered in any systematic review databases.
Inclusion/exclusion criteria
Peer-reviewed articles met the inclusion criteria if they were: (i) full text; (ii) in English; (iii) published after 1 January 2012; (iv) primary studies; (v) investigating the impact of ML on purchase behaviour or purchase intentions; and (vi) with outcomes measuring the effect of ML on energy consumed, ordered or selected, as quantified kJ/kcal differences. Energy selected referred to hypothetical choices of ‘intended’ purchase. Energy ordered referred to actual food and beverage items ordered from a menu, substantiated by participant recalls, purchase receipts or sales records. Energy consumed took into account uneaten portions of meals, established through either researchers weighing leftovers or participants self-reporting the uneaten portion of their meal.
Studies were excluded if they assessed only: (i) customer/participant awareness of ML; (ii) self-reported use of ML; (iii) consumer energy-based knowledge; (iv) consumer attitude towards the provision of ML; and/or (v) consumer preference for various labelling formats.
Data extraction, synthesis and quality assessment
A structured approach was applied to extract pertinent information from each of the fifteen included studies and systematic across-study comparisons were conducted based on a matrix including: year and authors; country; aim; study design; sampling; setting; type of outcome (including meals consumed, ordered or selected, nutrients and interpretative guidance, as well as meal type); covariates measured; results; and limitations of study design. Results from studies presented as kilocalories were converted to kilojoules based on 1 kcal=4·184 kJ.
The quality of the included studies was assessed using a rating scheme inspired by previous reviews( Reference Krieger and Saelens 17 , Reference Sinclair, Cooper and Mansfield 36 , Reference Swartz, Braxton and Viera 37 ). Articles were rated according to: study setting; sample size; extent of displaying ML; ML noticing rate; randomization (for experimental studies) or case–control match (for real-world studies); and degree of blinding (see online supplementary material, Supplemental Table 1). Two reviewers (J.A.L. and S.L.) independently rated all studies. The final scores were awarded after conferring and discussing any discrepancies, and consensus was reached.
ML, menu labelling; RT, randomized trial; RCT, randomized controlled trial; TL, traffic light; RV, reference values; PAE, physical activity equivalent; %DI, percentage of daily intake.
Meta-analysis
A meta-analysis was conducted with twelve of the fifteen studies, stratified according to type of outcome: changes in energy consumed, ordered or selected. The analyses conducted compare controls (no labels) v. interventions presenting energy labels alone; results from intervention groups reporting the effect of interpretive guidance, such as traffic lights or physical activity equivalents (PAE), were not included in the meta-analysis. All twelve studies reporting mean and standard deviation for energy consumed, ordered or selected were included in the analysis. When such data were not reported, the corresponding authors were contacted in order to retrieve the missing data. Two studies reporting results on energy ordered( Reference Holmes, Serrano and Machin 38 , Reference Ellison, Lusk and Davis 39 ) and one study reporting results on energy selected( Reference Roseman, Mathe-Soulek and Higgins 40 ) were excluded due to data not being provided. No original raw data sets were available.
A random-effects meta-analysis of mean differences was conducted to assess the overall effect of ML, applying the DerSimonian–Laird estimator to evaluate the between-study variance. Publication bias was assessed via funnel plots and Egger’s test where appropriate, and inspected visually. Data were analysed with the statistical software R version 3·03 and the package meta 4·1–0( 41 ). In order to quantify the effect of heterogeneity we used the I 2 statistic as a measure of the degree of inconsistency in the studies’ results. The quantity I 2 describes the percentage of total variation across studies that is due to heterogeneity rather than chance( Reference Higgins, Thompson and Deeks 42 ).
Results
Search results
The online database search identified 141 articles, fifteen of which were retrieved for full-text review (Fig. 1). Three studies considered the effects of ML on energy ordered and subsequently consumed( Reference Vanderlee and Hammond 21 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 ). Six studies reported on energy ordered( Reference Auchincloss, Mallya and Leonberg 22 , Reference Holmes, Serrano and Machin 38 , Reference Ellison, Lusk and Davis 39 , Reference Krieger, Chan and Saelens 45 – Reference Brissette, Lowenfels and Noble 47 ), while six assessed the effects on energy selected( Reference Pang and Hammond 4 , Reference Liu, Roberto and Liu 16 , Reference Roseman, Mathe-Soulek and Higgins 40 , Reference Dodds, Wolfenden and Chapman 48 – Reference Dowray, Swartz and Braxton 50 ). Seven studies were conducted in real-world settings( Reference Vanderlee and Hammond 21 , Reference Auchincloss, Mallya and Leonberg 22 , Reference Holmes, Serrano and Machin 38 , Reference Ellison, Lusk and Davis 39 , Reference Krieger, Chan and Saelens 45 – Reference Brissette, Lowenfels and Noble 47 ) and eight studies were conducted in experimental settings( Reference Pang and Hammond 4 , Reference Liu, Roberto and Liu 16 , Reference Roseman, Mathe-Soulek and Higgins 40 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 , Reference Dodds, Wolfenden and Chapman 48 – Reference Dowray, Swartz and Braxton 50 ). Eight studies reported on the proportion of participants noticing ML( Reference Liu, Roberto and Liu 16 , Reference Vanderlee and Hammond 21 , Reference Auchincloss, Mallya and Leonberg 22 , Reference James, Adams-Huet and Shah 43 – Reference Elbel, Mijanovich and Dixon 46 , Reference Dodds, Wolfenden and Chapman 48 ).
Studies’ characteristics and quality
Overall, the effects of ML were assessed on 17 859 participants spanning three countries. Despite similar aims and objectives, the studies varied greatly in design, most notably in sample size, data collection methods and ML format. Sample sizes ranged from under 50( Reference Ellison, Lusk and Davis 39 ) to over 2000( Reference Krieger, Chan and Saelens 45 ) participants. Data collection methods varied from telephone surveys collecting meal selections at random times of the day( Reference Dodds, Wolfenden and Chapman 48 ), to complete sales records from private club bistros( Reference Holmes, Serrano and Machin 38 ), to calculating energy consumption by obtaining calorie content and weight information from the fast-food website and deducting weighed leftovers( Reference James, Adams-Huet and Shah 43 ). While all studies compared the absence of ML as a control condition with the presence of ML, the format and provision of additional contextual or interpretive guidance differed. The characteristics of the included studies are presented in Table 1. The rating process yielded three good-quality studies, nine of fair quality and three of weak quality (see online supplementary material, Supplemental Table 2).
TL, traffic light; ML, menu labelling; RV, reference values; PAE, physical activity equivalent; %DI, %DI, percentage of daily intake; RT, randomized trial; RCT, randomized controlled trial.
The effects of menu labelling
In nine of the fifteen articles reviewed, participants exposed to ML in various formats consumed, ordered or selected significantly less energy( Reference Pang and Hammond 4 , Reference Vanderlee and Hammond 21 , Reference Auchincloss, Mallya and Leonberg 22 , Reference Holmes, Serrano and Machin 38 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 , Reference Brissette, Lowenfels and Noble 47 , Reference Morley, Scully and Martin 49 , Reference Dowray, Swartz and Braxton 50 ) than participants not exposed to ML. Statistically significant reductions ranged from as few as 96·2 kJ (P=0·05) when choosing snack items( Reference Pang and Hammond 4 ) to 648·5 kJ (P=0·018) for meals ordered at a table-service chain restaurant( Reference Auchincloss, Mallya and Leonberg 22 ). Only three articles reported no effects of ML( Reference Roseman, Mathe-Soulek and Higgins 40 , Reference Elbel, Mijanovich and Dixon 46 , Reference Dodds, Wolfenden and Chapman 48 ). The overall results are somewhat mixed, so closer attention was paid to research setting, type of outcome assessed, the extent to which participants noticed ML, ML format and sociodemographic characteristics of study participants. Where ML was partial and only applied to some items, the results of interest focused on the changes in energy consumed, ordered or selected for the labelled items. Table 2 provides a summary of results.
Research setting
Seven studies were carried out in real-world settings, one of which resulted in a 531·4 kJ (P<0·001) reduction in energy consumed when customers were offered a range of ‘healthier’ reformulated options and were exposed to ML in a cafeteria( Reference Vanderlee and Hammond 21 ) compared with no ML. The provision of ML, compared with no ML, also resulted in statistically significant reductions of energy ordered by 92 kJ in coffee chains (P=0·002)( Reference Krieger, Chan and Saelens 45 ); between 234·3 kJ (P<0·05)( Reference Holmes, Serrano and Machin 38 ) and 648·5 kJ (P=0·018)( Reference Auchincloss, Mallya and Leonberg 22 ) in table-service restaurants; and a reduction of 251 kJ (P=0·05) in fast-food outlets( Reference Brissette, Lowenfels and Noble 47 ). One further study resulted in 539·7 fewer kJ (P=0·033) being ordered at a table-service restaurant for main meals, but no statistically significant changes were observed for ‘extras’ that were not consistently labelled( Reference Ellison, Lusk and Davis 39 ). Only one real-world study showed ML to have no statistically significant impact( Reference Elbel, Mijanovich and Dixon 46 ).
Eight studies were conducted in experimental settings where consumers participated online, by post, by telephone or in laboratory settings. Two studies showed that the provision of ML on fast-food menus significantly reduced energy consumed by 397·5 kJ (P=0·048)( Reference Hammond, Goodman and Hanning 44 ) and 200·8–405·8 kJ (P=0·04) across ML formats( Reference James, Adams-Huet and Shah 43 ). Of the remaining six studies, three resulted in overall statistically significant reductions in energy selected when participants were presented with ML on fast-food menus: 510·4 kJ (P<0·05)( Reference Morley, Scully and Martin 49 ) and 398·1–811·7 kJ (P=0·02)( Reference Dowray, Swartz and Braxton 50 ); or on a snack menu: 96·2–146·4 kJ (P=0·02)( Reference Pang and Hammond 4 ). One other study showed a statistically significant reduction of 644·3 kJ (P=0·013) for a meal selected in a family-style restaurant with a ranked energy-label condition (although other ML formats tested resulted in statistically non-significant energy reductions)( Reference Liu, Roberto and Liu 16 ). ML had no impact in the remaining two studies based on fast-food selections( Reference Roseman, Mathe-Soulek and Higgins 40 , Reference Dodds, Wolfenden and Chapman 48 ).
Energy consumed, ordered or selected
All three studies measuring energy consumption reported statistically significant overall energy reductions (200·8–535·6 kJ)( Reference Vanderlee and Hammond 21 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 ). Three of six studies measuring the effects of ML on energy ordered resulted in overall statistically significant decreases (234·3–631·8 kJ)( Reference Auchincloss, Mallya and Leonberg 22 , Reference Holmes, Serrano and Machin 38 , Reference Brissette, Lowenfels and Noble 47 ). Another two studies resulted in statistically significant reductions in energy ordered for energy-labelled main meals, but not partially labelled ‘extras’( Reference Ellison, Lusk and Davis 39 ), and for coffee chains, but not food chains( Reference Krieger, Chan and Saelens 45 ). Three of six studies assessing the effects of ML on energy selected resulted in overall statistically significant reductions for meals (389·1–711·3 kJ)( Reference Morley, Scully and Martin 49 , Reference Dowray, Swartz and Braxton 50 ) and snack selections (129·7 kJ)( Reference Pang and Hammond 4 ).
Noticing of menu labelling by consumers
Eight of the included studies reported on the rate of noticing energy information on menus. ML had no statistically significant impact where less than 70 % of participants noticed the information( Reference Krieger, Chan and Saelens 45 , Reference Elbel, Mijanovich and Dixon 46 ). The remaining studies all reported a high ML noticing rate (≥70 %); of these, three studies reported statistically significant reductions in energy ordered and consumed( Reference Vanderlee and Hammond 21 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 ), one study reported a statistically significant reduction in energy ordered( Reference Auchincloss, Mallya and Leonberg 22 ) and two studies reported statistically significant reductions in energy selected( Reference Liu, Roberto and Liu 16 , Reference Dodds, Wolfenden and Chapman 48 ).
Menu-labelling format
A number of studies also tested the effects of different ML formats which offered interpretive or contextual guidance. The most common variation was the addition of traffic light colour coding to help consumers interpret if the menu item was low (green), medium (amber) or high (red) in energy. Traffic lights enhanced the noticeability of ML( Reference Hammond, Goodman and Hanning 44 , Reference Dodds, Wolfenden and Chapman 48 ), facilitated comprehension of energy information( Reference Ellison, Lusk and Davis 39 , Reference Morley, Scully and Martin 49 ), and in some instances significantly reduced total energy selected by 497·9 kJ (P<0·05)( Reference Morley, Scully and Martin 49 ) or energy ordered for main meals by 539·7 kJ (P=0·033)( Reference Ellison, Lusk and Davis 39 ) compared with no ML.
The use of PAE, which present energy information in terms of how much physical activity is required to expend the energy content of a specified menu item, was also effective. ML formats with PAE significantly reduced energy by 96·2 kJ (P=0·05) when selecting a snack( Reference Pang and Hammond 4 ), and by 811·7 kJ (P=0·0007) when selecting a fast-food meal( Reference Dowray, Swartz and Braxton 50 ), compared with no ML. The use of PAE also resulted in significantly lower energy ordered (−581·6 kJ, P=0·002) and consumed (−405·8 kJ, P=0·01) for a fast-food lunch compared with no ML( Reference James, Adams-Huet and Shah 43 ).
Reference values (RV) offer contextual guidance as to how the energy content of menu items compares to a full day’s energy requirement, and appear in statements such as ‘The average adult daily energy intake is 8700 kJ’. Five experimental studies explicitly included RV in their ML formats, one of which compared the effects of ‘energy labelling’ with ‘energy labelling with RV’. This study showed that RV enhanced the effects of energy labelling for energy selected, that it was the most effective of all ML formats and was rated by participants as the most ‘understandable’( Reference Pang and Hammond 4 ). The other four studies tested ML with RV against other formats. Two of these studies showed that ML+RV significantly reduced energy selected compared with no ML( Reference James, Adams-Huet and Shah 43 , Reference Morley, Scully and Martin 49 ), while one showed ML+RV had no statistically significant impact unless the menu items were also ranked according to ascending energy content( Reference Liu, Roberto and Liu 16 ); one study showed RV had no impact at all( Reference Dodds, Wolfenden and Chapman 48 ).
Sociodemographic differences
No detailed analyses were conducted, but it appears that variances in BMI( Reference Liu, Roberto and Liu 16 , Reference Vanderlee and Hammond 21 , Reference Morley, Scully and Martin 49 ), ethnicity( Reference Vanderlee and Hammond 21 , Reference Krieger, Chan and Saelens 45 , Reference Brissette, Lowenfels and Noble 47 ), age( Reference Vanderlee and Hammond 21 , Reference Krieger, Chan and Saelens 45 , Reference Brissette, Lowenfels and Noble 47 , Reference Morley, Scully and Martin 49 ) and socio-economic status( Reference Vanderlee and Hammond 21 , Reference Krieger, Chan and Saelens 45 , Reference Brissette, Lowenfels and Noble 47 , Reference Morley, Scully and Martin 49 ) did not substantially influence the degree of effectiveness of ML. There was a tendency for those who have an interest in health, and were self-reported ‘users’ of labelling when purchasing food, to be statistically significantly more attentive to ML( Reference Vanderlee and Hammond 21 , Reference Roseman, Mathe-Soulek and Higgins 40 , Reference Brissette, Lowenfels and Noble 47 ). In one study, however, the level of ‘interest in health’ had no impact( Reference Morley, Scully and Martin 49 ), and another study showed those who were least health conscious to be statistically significantly more responsive to energy information( Reference Ellison, Lusk and Davis 39 ). Females were found to be statistically significantly more attentive to ML compared with males in two studies( Reference Vanderlee and Hammond 21 , Reference Krieger, Chan and Saelens 45 ), although another two studies reported no statistically significant differences between genders( Reference Morley, Scully and Martin 49 , Reference Dowray, Swartz and Braxton 50 ).
Meta-analysis findings
The meta-analysis showed that the effect of ML was statistically significant for all outcomes measured. For studies measuring energy consumption, the overall estimated effect of ML was a mean reduction of 419·50 (95 % CI −613·25, −225·76) kJ (P<0·0001; Fig. 2). Studies measuring energy consumed showed no statistically significant heterogeneity.
For studies measuring energy ordered conducted in real-world settings, the effect of ML was estimated to be a mean reduction of 325·66 (95 % CI −508·60, −142·71) kJ (P=0·0005), with significant heterogeneity (I 2=63·8 %; P=0·026; n 5; Fig. 3). When studies conducted in experimental settings were included, ML reduced the overall mean energy ordered by an estimated 313·16 (95 % CI −453·14, −173·19) kJ (P<0·0001) and heterogeneity was no longer statistically significant (Fig. 4).
For studies assessing the effect of ML on energy selected, the estimated effect of ML was a mean reduction of 201·71 (95 % CI −343·66, −59·76) kJ (P=0·005), with significant heterogeneity (I 2=60·2 %; P=0·028; n 6; Fig. 5).
No publication bias was detected.
Discussion
Consumption of food outside the home is becoming more frequent( 1 , Reference Andersen 5 , 6 ) and since this practice is associated with increased energy intake( Reference Rosenheck 8 – Reference Lachat, Nago and Verstraeten 10 ) and unhealthy weight gain( Reference Pereira, Kartashov and Ebbeling 11 ), it poses challenges of public health concern. The current review and meta-analysis aimed to assess whether ML is effective in reducing energy selected, ordered or consumed in both real-world and experimental settings, and to assess the importance of ML noticing, according to the most recent published literature.
Is menu labelling effective?
Previous reviews cast doubt over the effectiveness of ML in changing consumer behaviour with regard to reducing energy ordered and/or consumed. A review from 2008, considering six studies published between 1976 and 2006, concluded that the effects may be ‘limited in magnitude’( Reference Harnack and French 51 ). Another review from 2012, based on seven studies published between 2008 and 2011, suggested that energy labelling had ‘no effect or only a modest effect’( Reference Swartz, Braxton and Viera 37 ). A review from 2014, including thirty-one studies from 2007 to 2013, concluded that ‘calorie labels do not have the desired effect’( Reference Kiszko, Martinez and Abrams 52 ). More recently in 2015, a review and meta-analysis by Long et al., including nineteen studies spanning from 2008 to 2013, found a small but statistically significant energy reduction in energy ordered per meal, but this was associated with statistically significant heterogeneity across studies( Reference Long, Tobias and Cradock 53 ). Despite no restrictions on the published date of the studies included, only those published since 2008 were deemed acceptable, due to the eligibility criteria applied, which excluded studies with ML formats not consistent with the US federal ML laws.
The present review includes only studies published between 2012 and 2014 based on suggestions from other authors that ML may become more effective over time, as it might more accurately reflect consumers’ response to ML as they grow accustomed to the initiative and as industry adaptation to legislation becomes more consistent, widespread and compliant( 18 , 19 , Reference Vanderlee and Hammond 21 , Reference Swartz, Braxton and Viera 37 , Reference Krieger, Chan and Saelens 45 , Reference Harnack and French 51 , Reference Dumanovsky, Huang and Bassett 54 – Reference Larson and Story 57 ). Further, the present review and meta-analysis take ten new primary studies into account, not included in previous reviews( Reference Pang and Hammond 4 , Reference Vanderlee and Hammond 21 , Reference Auchincloss, Mallya and Leonberg 22 , Reference Holmes, Serrano and Machin 38 , Reference Ellison, Lusk and Davis 39 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 , Reference Elbel, Mijanovich and Dixon 46 – Reference Dodds, Wolfenden and Chapman 48 ). A crude tally of results, based on predominantly fair- and good-quality studies, provides a more positive outlook compared with earlier reviews. Nine studies showed statistically significant reductions in energy consumed, ordered or selected. Four studies reported positive effects, depending on consumer characteristics, type of establishment or the format of ML. Only three articles reported no effect of ML. Furthermore, the meta-analysis confirmed that ML is effective in reducing energy consumed and energy ordered, particularly in real-world settings, as well as reducing energy selected in experimental settings.
This positive trend may be attributable to new studies emerging from countries other than the USA where ML policy was initially strongly opposed by industry and gained negative press( Reference Roberto, Schwartz and Brownell 58 – Reference Farley, Caffarelli and Bassett 62 ). Furthermore, implementation of ML legislation was sporadic and inconsistent across jurisdictions, pending national legislation( 63 ), and practical utility and policy compliance were low( Reference Cohn, Larson and Araujo 64 ). The US scenario can be contrasted to the swift implementation of ML in New South Wales, Australia, where the policy was shaped by a consultation with stakeholders, while implementation was supported with extensive social marketing and educational efforts, and industry was provided with implementation assistance. Here, an outcome evaluation resulted in statistically significant decreases (−518·8 kJ) in energy ordered and 100 % industry compliance to legislation, 18 months after implementation( 18 ).
The strengthening of the evidence over time could also be attributed to consumers now being used to see and expect ML. The notion that repeated exposure is needed for consumers to notice and understand ML is demonstrated by studies collecting multiple sets of post-labelling data. In such studies ML had a greater impact after 1 year, compared with 6 months( 18 , Reference Krieger, Chan and Saelens 45 , Reference Nikolaou, Hankey and Lean 65 ). These results, however, may also be attributable to reformulation or the addition of ‘healthier’ options appearing on menus as a food industry response to ML( Reference Bruemmer, Krieger and Saelens 66 , Reference Namba, Auchincloss and Leonberg 67 ), although some research indicates that ML has little effect on menu items’ overall energy content( Reference Bauer, Hearst and Earnest 68 ). The positive effects of repeated exposure, or consumer familiarity with ML, are further supported by findings from studies reporting on the level of ML noticing by participants. ML had no impact where less than 70 % of participants noticed the information( Reference Krieger, Chan and Saelens 45 , Reference Elbel, Mijanovich and Dixon 46 ), but ML had a statistically significant positive impact with regard to energy ordered and/or consumed for the majority of studies where at least 70 % of participants noticed the information( Reference Vanderlee and Hammond 21 , Reference Auchincloss, Mallya and Leonberg 22 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 ). This social normalization of ML in restaurants could hence lead to a critical mass of businesses that offer the information necessary to drive industry reformulation to reduce the energy content of menu items.
The effects of menu labelling in real-world and experimental settings
Other reviews often excluded studies measuring energy hypothetically selected or studies conducted in laboratory settings( Reference Swartz, Braxton and Viera 37 , Reference Harnack and French 51 ). Such studies were included in the present review since they can control variables such as ML formats that can be held constant for items offered, prices and portion sizes, and can more accurately capture the effects of ML v. no ML, while internal validity can be strengthened by randomization. Such study designs, however, do not take into account the strong influences of price (where few participants actually pay for the meal), food preferences, participant hunger/satiety level, or the social and environmental contexts of eating behaviour( Reference Booth, Sallis and Ritenbaugh 69 , Reference Eertmans, Baeyens and Van den Bergh 70 ). To compensate, the quality assessment scheme awarded more points to real-world studies.
Four out of seven studies conducted in real-world settings resulted in ML statistically significantly reducing overall energy ordered and consumed( Reference Vanderlee and Hammond 21 , Reference Auchincloss, Mallya and Leonberg 22 , Reference Holmes, Serrano and Machin 38 , Reference Brissette, Lowenfels and Noble 47 ). Another two real-world studies resulted in statistically significant reductions in energy ordered for energy-labelled main meals( Reference Ellison, Lusk and Davis 39 ) and for coffee chains( Reference Krieger, Chan and Saelens 45 ). This is a progressive and positive step compared with the reviews by Krieger and Saelens (2013) and Long et al. (2015), which found that only those studies conducted in experimental settings resulted in statistically significant energy reductions( Reference Krieger and Saelens 17 , Reference Long, Tobias and Cradock 53 ). The positive impact of ML on consumer behaviour in real-world settings is further highlighted by the results of an outcome evaluation of ML implementation in New South Wales, Australia, where the median energy purchased decreased significantly by 15 % from May 2011 to January 2013( 18 ), and by the results of the current meta-analysis which found a statistically significant reduction in energy ordered in real-world settings. Our results are further supported by a recent study conducted in dining facilities in a university residential hall – the first, to the best of our knowledge, reporting the effects of ML on BMI. In that study, prolonged exposure to prominent ML was associated with halving the likelihood of weight gain in young adults( Reference Nikolaou, Hankey and Lean 65 ). Collectively, these findings dismiss that positive outcomes are limited to experimental settings or that these results should be largely due to response bias.
The effects of menu labelling in different types of outlet
Most of the studies included in the present review were based on ‘typical’ fast-food menus and the majority of these resulted in statistically significant reductions of at least 251 kJ consumed, ordered or selected( Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 , Reference Brissette, Lowenfels and Noble 47 , Reference Morley, Scully and Martin 49 , Reference Dowray, Swartz and Braxton 50 ). This should be considered very encouraging since the consumption of fast food is frequent, primarily due to its affordability and convenience( Reference Rydell, Harnack and Oakes 71 , Reference Schindler, Kiszko and Abrams 72 ). The provision of point-of-purchase energy information can help overcome passive overconsumption( Reference Swartz, Dowray and Braxton 73 ) and counter pricing incentives to ‘up-size’ or ‘bundle’.
The present review also included studies conducted in table-service restaurants, given their increasing contribution to total energy intake( Reference Ray, Clegg and Davidson 74 , Reference Powell and Nguyen 75 ) and that their meals are high in energy, even compared with fast food( Reference Auchincloss, Leonberg and Glanz 76 , Reference Urban, Lichtenstein and Gary 77 ). Although most patrons appreciate energy information on table-service restaurant menus( Reference Liu, Roberto and Liu 16 , 19 , Reference Ray, Clegg and Davidson 74 ), the impact on meal choices was not consistent in the current review( Reference Liu, Roberto and Liu 16 , Reference Auchincloss, Mallya and Leonberg 22 , Reference Holmes, Serrano and Machin 38 , Reference Ellison, Lusk and Davis 39 ). This may reflect attitudes towards allowing the occasional indulgence( Reference Ray, Clegg and Davidson 74 ), evidenced by some consumers ordering about three-quarters of the recommended average daily energy intake in a single meal( Reference Liu, Roberto and Liu 16 , Reference Auchincloss, Mallya and Leonberg 22 ). The energy content of similar dishes served at leading family-style restaurants can vary considerably( Reference Hammond, Goodman and Hanning 44 ), largely due to portion sizes( Reference Scourboutakos and L’Abbe 78 , Reference Schwartz, Riis and Elbel 79 ), where, for example, a ‘small burger and fries’ can range from 2008·3 to 4602·4 kJ between chains( Reference Brissette, Lowenfels and Noble 47 ).
There were some instances when energy labelling alone reduced energy ordered by 288·7–389·1 kJ( Reference Liu, Roberto and Liu 16 , Reference Ellison, Lusk and Davis 39 , Reference Dowray, Swartz and Braxton 50 ), although these decreases were statistically insignificant. Such reductions could, however, be considered relevant in terms of public health outcomes, especially for regular out-of-home eating, since sustained energy decreases as small as about 210 kJ per restaurant visit can avert weight gain( Reference Elbel, Kersh and Brescoll 80 , Reference Bassett, Dumanovsky and Huang 81 ). These findings support the need for ML to extend beyond the fast-food setting into table-service/family-style chain restaurants, to enable consumers to make within- and between-chain comparisons. Furthermore, ML could encourage chain outlets to reduce energy content through product reformulation. Introducing healthier options or reducing the energy content is a new trend in various food-service outlets, which concurred with the introduction of ML, and can save consumers hundreds of kilojoules per restaurant visit( Reference Bruemmer, Krieger and Saelens 66 , 82 ).
The effects of menu labelling on energy consumption
Energy consumption, it could be argued, most accurately determines the effects of ML, since consumers could respond to ML by eating less( Reference Hammond, Goodman and Hanning 44 ) and it helps control for social desirability bias( Reference Harnack, French and Oakes 83 ). All three studies measuring energy consumption resulted in statistically significant energy reductions( Reference Vanderlee and Hammond 21 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 ), in one case as high as 21 %, although in that particular study the energy-labelled menu also contained ‘healthier’ reformulated items compared with the control menu, which might have contributed to its impact( Reference Vanderlee and Hammond 21 ). The meta-analysis showed a statistically significant reduction of 419·5 kJ in the mean energy consumed. Thus, current evidence supports that ML can contribute to obesity prevention by prompting people to choose more energy-appropriate meals or eating less.
The impact of noticing menu labelling
While the format of ML was once proposed to be a possible explanation for the mixed results as to the effect of ML( Reference Liu, Roberto and Liu 16 , Reference Hammond, Goodman and Hanning 44 , Reference Morley, Scully and Martin 49 ), the present review suggests the level of visibility or noticeability of the information as a more important factor. Earlier studies have identified that ML has both a salience effect, whereby the provision of prominent energy disclosure incites consumers to have a higher regard for health when making selections, and a learning effect, whereby consumers are better able to recall or estimate the energy content of their food and beverage choices( Reference Bollinger, Leslie and Sorensen 84 ). The current review found evidence of the learning effect, when participants presented with ML were better able to estimate and recall the energy content of their menu choices compared with those whose menu contained no energy information( Reference Liu, Roberto and Liu 16 , Reference James, Adams-Huet and Shah 43 , Reference Hammond, Goodman and Hanning 44 ). Previous reviews have often neglected the importance of this factor.
The effects of menu-labelling formats and reference values
More than half the studies included in the current review tested the effects of ML offering interpretive guidance. While traffic light labelling and PAE may have enhanced noticeability of ML and improved customer understanding of the information, this did not translate to improved ML use compared with energy labelling alone. Only one of five studies testing traffic light labelling( Reference Ellison, Lusk and Davis 39 ) and two of three studies testing PAE( Reference James, Adams-Huet and Shah 43 , Reference Dowray, Swartz and Braxton 50 ) were more effective in reducing energy than the plain disclosure of energy information. While other studies have found that traffic lights ‘drew attention to’ and ‘facilitated understanding’( Reference Feldman, Harwell and Brusca 85 ) in various settings and across sociodemographic groups( Reference Harnack, French and Oakes 83 , Reference Sonnenberg, Gelsomin and Levy 86 ), and overcome barriers to ML use such as low numeracy skills and time needed to consider the information( Reference Elbel, Mijanovich and Dixon 46 , Reference Thorndike, Sonnenberg and Riis 87 ), the colour codes can be considered ‘coercive’ or ‘directive’, which may deter their use( Reference Grunert and Wills 88 ). Other studies have found PAE to be effective in reducing energy consumed or ordered( Reference Campos, Doxey and Hammond 23 , Reference Fitch, Harnack and Neumark-Sztainer 89 , Reference Blumenthal and Volpp 90 ) by framing energy information in a more familiar and tangible way, allowing consumers to comprehend the tradeoffs between energy consumption and expenditure( Reference Swartz, Dowray and Braxton 73 ). A difference of 251 kJ between items, for example, was considered trivial, but an extra 20 min of walking effectively prompted participants to consider menu options more carefully( Reference Swartz, Dowray and Braxton 73 ). PAE, however, are more often popular with younger consumers( Reference Pang and Hammond 4 ) and those of normal weight( Reference Brissette, Lowenfels and Noble 47 ), and may be considered irrelevant by the elderly or those who do not habitually partake in the illustrated physical activity( Reference Swartz, Dowray and Braxton 73 ).
It is difficult to isolate the effects of RV from the studies considered in the present review due to considerable differences in study designs; most notably, only one tested the effects of ML+RV compared with plain energy labelling( Reference Pang and Hammond 4 ). While the review provides no consistent evidence as to the effects of RV, a recent meta-analysis found that disclosing energy only had marginal effects, while adding contextual information in the form of RV assisted consumers to significantly reduce energy consumed by 338·9 kJ and selected by 280·3 kJ( Reference Sinclair, Cooper and Mansfield 36 ).
Other reviews have excluded studies where menus were only partially labelled( Reference Swartz, Braxton and Viera 37 , Reference Harnack and French 51 ), whereas such studies were included in the current review. While partial labelling does not accurately capture the effects of ML in terms of an entire meal, such designs offer important insights into substitution effects and shifts in purchasing patterns between labelled and unlabelled items. The rating scheme, however, took into account the limitation of selective labelling (online supplementary material, Supplemental Table 1). When brand-name non-alcoholic drinks( Reference Auchincloss, Mallya and Leonberg 22 ), for example, and à la carte meals( Reference Holmes, Serrano and Machin 38 ) remained unlabelled, energy purchased increased for these menu items, while there was a corresponding decrease for total energy purchased( Reference Auchincloss, Mallya and Leonberg 22 ) and for labelled combo meals( Reference Holmes, Serrano and Machin 38 ) attributable to ML. All menu items therefore need to be labelled to ensure shifts in purchase patterns are fully informed.
Sociodemographic aspects of menu labelling
Sociodemographic differences such as socio-economic status, BMI, race and age had little effect on ML use, supporting the observation by Morley et al. ( Reference Morley, Scully and Martin 49 ) that ML is an equitable initiative. Particularly encouraging is that low socio-economic status was earlier associated with little or no response to ML in studies originating from the USA( Reference Sarink, Peeters and Freak-Poli 91 ). The present review quells previous health disparity concerns by showing that ML can be effective across social grades.
Strengths and limitations
Mixed results for the efficacy of ML from studies conducted prior to 2012 may partly be explained by consumers not noticing or understanding the inclusion of energy values on menus, industry non-compliance, ineffective labelling presentations or limited consumer exposure to ML. The current review is, to the best of our knowledge, the first to focus exclusively on articles published in very recent years (2012–2014) and despite the narrow time period, fifteen articles still met the inclusion criteria. The potential strength of the review design thereby restricts the evidence to better reflect the evolution and progressive improvements in ML design and implementation. This limited scope can, however, introduce a bias whereby only studies with positive results of ML have been submitted and accepted for publication in more recent years.
A clear limitation of the present review, as well as previous reviews, is that several of the included studies had methodological shortcomings, such as incomplete ML( Reference Vanderlee and Hammond 21 , Reference Holmes, Serrano and Machin 38 , Reference Ellison, Lusk and Davis 39 ), medium or small sample size( Reference Pang and Hammond 4 , Reference Liu, Roberto and Liu 16 , Reference Ellison, Lusk and Davis 39 , Reference Roseman, Mathe-Soulek and Higgins 40 , Reference James, Adams-Huet and Shah 43 , Reference Dodds, Wolfenden and Chapman 48 ), lack of case–control match( Reference Vanderlee and Hammond 21 , Reference Auchincloss, Mallya and Leonberg 22 ), and statistically significant differences between comparison groups not adjusted for or not reported( Reference Pang and Hammond 4 , Reference Ellison, Lusk and Davis 39 , Reference Roseman, Mathe-Soulek and Higgins 40 , Reference Brissette, Lowenfels and Noble 47 ). Only three studies were rated as good quality.
Seven studies included in the review were conducted in real-world settings. It could be argued that results from such settings provide the strongest evidence of the effectiveness of ML, since experimental settings do not accurately reflect the complexity of the eating-out-environment where sights, sounds, smells, price and promotional influence, among other factors, affect consumer choices( Reference Booth, Sallis and Ritenbaugh 69 ). While studies from experimental settings were also included in the present review, the rating criteria scheme awarded this type of setting fewer points, compared with real-world studies, and analysis of results was independent for both setting types to control for this limitation. The meta-analysis conducted on studies measuring energy ordered in real-world settings shows a statistically significant mean reduction of 325·6 kJ, which contradicts the findings of the meta-analysis by Long et al. ( Reference Long, Tobias and Cradock 53 ). However, only five studies were included in the present meta-analysis, there was statistically significant heterogeneity between them and one of the studies was classified as weak. Furthermore, one of the studies included offered ‘healthier’ reformulated options in the menus, which could have accounted for its positive impact( Reference Vanderlee and Hammond 21 ). Thus, this calls for a revision of the analysis in the near future to include emerging real-world studies measuring energy ordered.
Similarly, the consumption of energy can be argued to be the best proxy measure of ML efficacy given the extremely limited evidence available measuring changes in population weight. The present meta-analysis showed a statistically significant reduction in energy consumed, and it was based on two studies of good quality and one of fair quality. However, the scarcity of studies measuring this outcome is a limitation to the analysis and since reliable methods for measuring energy consumption are more intrusive and costly, it might be some time before enough good-quality studies are available for a follow-up analysis. Additionally, studies measuring only consumption of energy in a single meal do not allow us to rule out possible compensatory energy consumption behaviour in subsequent meals.
Reductions in energy ordered still provide valuable information in the sense that energy consumption is limited by the amount of energy ordered. Studies measuring energy selected and ordered were included in the present review, but were awarded fewer points than studies measuring energy consumed, while the outcomes were analysed separately. Energy selected as an outcome yielded the lowest rating since the absence of price and fabricated ‘generic’ menus are likely to affect hypothetical choices. Moreover, some hypothetical meal selections were made at random times of the day that did not correspond to a regular mealtime( Reference Dodds, Wolfenden and Chapman 48 ). Still, the meta-analysis showed that the effect of ML was statistically significant regardless of the measured outcome, although it should be noted that there was statistically significant heterogeneity present in the analysis for both energy selected and energy ordered, and the number of studies included is limited for all outcomes. Furthermore, even though no publication bias was detected, these tests are generally underpowered and as such publication bias might not have been detected in our sample of studies.
More reliable research is needed to determine the impact of ML on reducing energy ordered and consumed in real-world settings. It is important for future studies to report on the level of ML awareness, the extent to which ML exists (partial or full compliance), the time of data collection relative to ML implementation, the inclusion of contextual or interpretive information, and the intensity and duration of educational and social marketing efforts undertaken to support ML introduction, to more accurately reflect potential reductions of energy consumed, ordered or selected attributable to the presence of ML. Just as the most effective format of presenting nutrition labelling on pre-packaged foods is yet to be determined( 34 ), further studies are required to establish the most effective way of displaying ML.
Notwithstanding that ML places the burden on the individual to choose energy-appropriate menu items and that information-based obesity prevention strategies alone have not stopped or reversed the global trend of increased population weight, it is still conceivable that ML could prevent excess energy consumption in the eating-out environment and prevent unhealthy weight gain, given the results of the current review. ML can complement other obesity prevention strategies and it provides information that is desired by consumers. It is the most sought-after nutrition information in restaurant settings,( Reference Yoon and George 92 ) which can enhance consumers’ motivation to use it. Furthermore, product reformulation and ‘healthier’ options as an industry response to ML can reduce energy consumption even without consumer knowledge, alleviating the individual burden.
Conclusions
The present review found that ML is effective in reducing energy ordered and consumed in real-world settings. ML was shown to be effective in various types of food-service outlet and appears to positively affect a large proportion of the population. It further provides evidence that prolonged, prominent and noticeable labelling of all menu items, preferably with reference values, is important to increase ML use by heterogeneous populations and to enable shifts in purchasing patterns guided by the prominent display of energy values. Time is needed for the food-service industry to comply with regulations and for consumers to become familiar with ML. As the eating-out environment increasingly contributes to a larger proportion of total energy intake, providing energy information at the point of purchase is essential for consumers to make informed food choices, and ML should be considered an important component in a multifaceted approach to reduce the burden of overweight and obesity.
Acknowledgements
Acknowledgements: The authors would like to thank Elin Raakvad for constructive feedback on the manuscript. Financial support: This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors. Conflict of interest: None. Authorship: J.A.L. and S.L. formulated the research questions, designed the study, carried out the systematic review and analysed the data for the systematic review. The two authors wrote the article. C.L.I. carried out the meta-analysis and made the meta-analysis figures. G.L.H. provided critical review and suggestions for the text and provided the final approval of the version to be published. Ethics of human subject participation: Not applicable.
Supplementary Material
To view supplementary material for this article, please visit http://dx.doi.org/10.1017/S1368980015003468