Research across several disciplines has established that food choices and dietary habits of most U.S. Americans are out of sync with recommendations. According to the National Health and Nutritional Examination Survey (NHANES), the consumption of fruits and vegetables, lean meats, seafood, and whole grains is too low, while the intake of refined grains, added sugars, and saturated fats is too high (USDA 2015). These insights are also readily available from the United States Department of Agriculture's (USDA) long-running Healthy Eating Index (HEI), which assesses the dietary quality based on a representative sample of the U.S. population. The Center for Nutrition Policy and Promotion (CNPP) of the USDA used the HEI-2010 to compare U.S. diet quality between 2001 and 2002 and 2007 and 2008 and determined that average HEI scores increased slightly from 51.9 to 53.5 (CNPP 2013). In both time periods, average HEI scores reflected considerable room for improvement.
Coinciding with poor average U.S. dietary quality, the incidence of overweight and obesity has been on the rise. Over one-third of U.S. adults, or 76 million Americans, were obese as of 2012 (Ogden et al. Reference Ogden, Carroll, Kit and Flegal2015). The total medical care costs related to obesity were estimated at approximately $147 billion as of 2008 (Finkelstein et al. Reference Finkelstein, Trogdon, Cohen and Dietz2009). There is little question that more work is needed to understand the causes and consequences of obesity as well as the means by which policy and industry might influence the problem.
Our study examines linkages between fruit and vegetable spending patterns, an important component of a healthy diet, and body weight or obesity. An extensive scientific literature (see Van Duyn and Pivonka Reference Van Duyn and Pivonka2000) has demonstrated the health-protective benefits of fruit and vegetable consumption. In addition, the 2010 Dietary Guidelines for Americans (DGA) strongly emphasizes increased fruit and vegetable consumption for all Americans (CNPP 2010). We focus specifically on the purchase patterns for fruits and vegetables among U.S. households, and use expenditure shares for these foods as our key measurement of interest to tie our findings to policy recommendations.
Using a large dataset on U.S. household grocery purchases, demographic information and self-reported health outcomes, we test for relationships between the grocery expenditure share on fruits and vegetables and obesity as well as the average body mass index (BMI) among adults. Previous studies have examined the impacts of dietary quality or even fruit and vegetable consumption specifically on health outcomes such as body weight (Chen et al. Reference Chen, Jaenicke and Volpe2016; Kaiser et al. Reference Kaiser, Brown, Bohan Brown, Shikany, Mattes and Allison2014). Our study breaks new ground and contributes to the literature in two important ways.
First, we use a large household-level dataset that allows us to assess severa; lifestyle choices related to health and weight, such as attitude toward physical exercise (Sarma et al. Reference Sarma, Zaric, Campbell and Gilliland2014). To establish the relationship between fruit and vegetable purchases and health outcomes, we link the 2008–2012 Information Resources Incorporated (IRI) household data on food purchases with the 2010–2012 IRI MedProfiler data. The MedProfiler data consists of self-reported health outcomes for all household members. Fruit and vegetable expenditures are readily observable using point-of-sale scanner data, and they provide a longitudinal aspect to our analysis that is rarely available in studies of this nature, many of which have relied on dietary recall data. The breadth and granularity of household- and individual-level controls in our data also lends support to our identification strategy.
Second, we aim to contribute to ongoing policy discussions about low fruit and vegetable consumption among Americans. The federal government has established several policy initiatives aimed at promoting a higher intake level of fruits and vegetables. These include the DGA, the Let's Move! Campaign, MyPlate, 5-A-Day and its successor, Fruits & Veggies-More Matters. Hence, there is an evident market for information about recommended fruit and vegetable consumption habits and the associated linkages to health outcomes. In our analysis, we quantify our results by showing the impact of a 1 percentage point increase in fruit and vegetable purchases on the likelihood of obesity (or on BMI). By quantifying the associations between fruit and vegetable purchasing decisions and health outcomes that bear significant medical and social costs, our findings may inform the extent to which the government invests in educational efforts or policy tools intended to increase fruit and vegetable purchases in the United States.
Using a sample of over 18,000 households, we seek to quantify the association between fruit and vegetable expenditures and weight-related health outcomes by gender, by controlling for lifestyle choices, age, income, race, marital status, education, and geographic region to illustrate the relative importance of fruits and vegetables among food categories. In our sample, the same households appear in both the IRI household data and the MedProfiler data. In addition, only those households who participate in the panel consistently over time are used in our fixed effects estimation, resulting in approximately 18,000 households each year between 2010 and 2012. Our findings indicate that increased fruit and vegetable expenditures, as a share of the household food-at-home budget, are associated with a significantly reduced obesity rate as well as BMI. The estimated relationships are generally consistent to specification choice. However, potentially important differences in our key associations emerge when comparing results across gender.
Background
Diet, body weight, and health outcomes are intimately linked. According to the Centers for Disease Control and Prevention (CDC), obesity is associated with a large number of ailments and health risks, including type 2 diabetes, high blood pressure, high cholesterol, cardiovascular diseases, and morbidity (Avenell et al. Reference Avenell, Broom, Brown, Poobalan, Aucott, Stearns and Smith2004; Ahima and Lazar Reference Ahima and Lazar2013). For the prevention and treatment of obesity, the CDC recommend eating a healthy diet and limiting caloric intake. Accordingly, researchers have applied numerous methods to investigating and quantifying the relationships between food intake and obesity. Examples of impactful meta-analyses on these topics include the impact of dietary sugars on body weight (Te Morenga et al. Reference Te Morenga, Mallard and Mann2013), the health impacts of soft drink consumption (Vartanian et al. Reference Vartanian, Schwartz and Brownell2007), and understanding the impacts of low-carbohydrate diets (Nordmann et al. Reference Nordmann, Nordmann, Briel, Keller, Yancy, Brehm and Bucher2006).
Our interest in the determinants of obesity and body weight also stems from the incidence of related health care costs. In addition to the aforementioned health care costs directly associated with obesity and overweight, several related ailments among U.S. adults are responsible for substantial health care costs. Approximately 29 million U.S. adults have diabetes, which led to an estimated $176 billion in direct medical costs in 2012 (CDC 2014). The CDC estimates that one in three U.S. adults has high blood pressure, and one in three U.S. adults has high cholesterol, where these cases are defined by exceeding the respective established medical thresholds for concern. High cholesterol and high blood pressure are associated with several heart ailments that are estimated to be over $300 billion in health care costs (CDC 2011).
Economists have been interested in these issues for decades. Many studies have applied economic modeling to questions surrounding food choices and health. A handful of studies (Howard et al. Reference Howard, Van Horn, Hsia, Manson, Stefanick, Wassertheil-Smoller and Kuller2006; Powell Reference Powell2009; Oyebode et al. Reference Oyebode, Gordon-Dseagu, Walker and Mindell2014) have even sought to identify empirically linkages between dietary quality, food choices, and health outcomes in panel settings. Estimated effects in this respect have typically been small but significant. For example, using data very similar to ours, Chen et al. (Reference Chen, Jaenicke and Volpe2016) find that a one-unit increase in overall dietary quality is associated with a 7 percent lower likelihood of obesity. Our key finding with respect to obesity is similar in magnitude.
We discuss the studies and findings that have been most influential to the present research, as they have informed the controls we have used in our econometric modeling. Morgan (Reference Morgan1986) and Wilde et al. (Reference Wilde, McNamara and Ranney1999) identified a significant income effect, with higher-income people generally consuming healthier diets. Income has been shown to be an important determinant of food expenditures, by broad category, as well (Okrent and MacEwan Reference Okrent and MacEwan2014). Adelaja et al. (Reference Adelaja, Nayga and Lauderbach1997) identified important differences across races, e.g., that most minorities groups consume less calcium than whites. Gao et al. (Reference Gao, Yu and Lee2013) is among the more recent studies to demonstrate the positive impact of education level on food choices. Blaylock et al. (Reference Blaylock, Smallwood, Kassel, Variyam and Aldrich1999) and Mancino and Newman (Reference Mancino and Newman2007) studied how time constraints influence food preparation and food choices, finding that working longer hours is associated with less food preparation at home.
In studies of both health outcomes, food choices, and diet quality, gender has emerged as a significant consideration. Among U.S. adults, obesity rates as well as changes in those rates over time differ for men and women (Wang and Beydoun Reference Wang and Beydoun2007; Flegal et al. Reference Flegal, Carroll, Kit and Ogden2012, Reference Flegal, Carroll, Kit and Ogden2016). Moreover, there may be gender differences in the relationship between dietary quality and health outcomes such as obesity (Mozaffarian et al. Reference Mozaffarian, Hao, Rimm, Willett and Hu2011). Dietary quality may differ significantly by gender, with research showing that women have overall healthier diets than men (Wang et al. Reference Wang, Leung, Li, Ding, Chiuve, Hu and Willett2014). Finally, households with a single female head are considerably more likely to be food insecure or to lack access to the foods necessary to meet the DGA, compared with households with male heads or married household heads (Coleman-Jensen et al. Reference Coleman-Jensen, Gregory and Singh2014). We, therefore, conduct a detailed analysis by gender.
From a nutritional perspective, fruits and vegetables are the foundation of a healthy diet. Van Duyn and Pivonka (Reference Van Duyn and Pivonka2000) conducted a comprehensive review for the epidemiological profession, by providing an overview of the scientific literature on the health effects of fruits and vegetables. They concluded that the scientific evidence supports the contention that fruits and vegetables have vast and varied protective effects for many major organs and against a range of diseases and ailments. These include obesity and overweight, as well as diabetes, cancer, and hypertension. Mandal and Powell (Reference Mandal and Powell2014) noted that fruits and vegetables are key “low-energy dense” foods that are negatively and importantly associated with obesity rates among children. Capacci and Mazzocchi (Reference Capacci and Mazzocchi2011) showed that the 5-a-day program increased fruit and vegetable consumption in the United Kingdom by 0.3 portions.
But U.S. fruit and vegetable consumption lags behind other developed nations and is widely out of sync with dietary recommendations. For example, Canadians consume more than twice the per-capita quantities of bananas, apples, and oranges than Americans (Richards and Patterson Reference Richards and Patterson2005). Kimmons et al. (Reference Kimmons, Gillespie, Seymour, Serdula and Blanck2009) used the 2003–2004 NHANES and determined that fewer than one in 10 Americans met their caloric recommendations for fruit and vegetable intake. Similar findings on fruit and vegetable consumption have also been found when looking at specific demographic subgroups, such as college students (Huang et al. Reference Huang, Harris, Lee, Nazir, Born and Kaur2003), children (Krebs-Smith et al. Reference Krebs-Smith, Cook, Subar, Cleveland, Friday and Kahle1996), low-income households (Cassady et al. Reference Cassady, Jetter and Culp2007), and pregnant women (Bodnar and Siega-Riz Reference Bodnar and Siega-Riz2002).
From a policy perspective, fruit and vegetable intake has been the focus of several studies on food access and food security. These works reflect efforts to inform policy makers regarding actual fruit and vegetable purchases. Rose and Richards (Reference Rose and Richards2004) and Bustillos et al. (Reference Bustillos, Sharkey, Anding and McIntosh2008) used survey data to show that low-income households are more likely to have lower dietary quality and a higher average incidence of adverse health outcomes, relative to higher-income households. A separate line of research, which does not measure or account for dietary quality, has investigated the role of food assistance programs, particularly the Supplemental Nutrition Assistance Program (SNAP), on fruit and vegetable expenditures. Dong and Lin (Reference Dong and Lin2009) as well as Leung et al. (Reference Leung, Ding, Catalano, Villamor, Rimm and Willett2012) have found a small impact of SNAP participation in this regard. Leibtag and Kumcu (Reference Leibtag and Kumcu2011) demonstrated that regional food price differences may affect the purchasing power of SNAP for fruits and vegetables.
Data and Statistics
Our study departs from much of the existing research on food choices and diet quality, in that we use household-level scanner data. Most previous work has relied upon dietary recall data, such as the NHANES data. As we discuss the properties of our data, we take care to describe the advantages and disadvantages of our approach, relative to the use of dietary recall data.
We use the 2008–2012 IRI Household Panel Data (henceforth called the “Panel”). The Panel is based on a sample of over 100,000 households nationwide. Household Panel members could either be volunteers or recruited by IRI. Participating households use hand-held scanners to scan the dates of their product purchases, universal product code, purchase volume, and total expenditures.Footnote 1 As a result, our data includes information about the comprehensive grocery purchases for these households, including the location of grocery purchases and the prices paid.
Household purchases set the data frequency in the raw Panel data. Households participating in the Panel may make hundreds of shopping trips per year, and we aggregate to the annual level for all variables. We have several years of data for many thousands of households. This is in contrast to dietary recall data, which typically consists of 2 days' worth of dietary intake. The purchase data are accompanied by household-level demographics, which allow us to control for a wide range of important variables, including income, race and ethnicity, age, education, gender, marital status, and geographic location. The Panel does not include information about food preparation or actual consumption, or foods purchased and consumed outside of the home such as at restaurants or other institutions. We assume that at-home purchases are a close proxy for at-home consumption.
We measure fruit and vegetable consumption by means of expenditure shares, as reflected by grocery purchases. Our key variable of interest, FruitVegShare, includes fresh, frozen, and canned fruits and vegetables, as defined within the IRI data dictionary. We argue that increases in the expenditure share attributed to fruits and vegetables indicate unambiguous increases in dietary quality. By using shares, we preclude the possibility that households increase their fruit and vegetable expenditures in lockstep with other foods that are less aligned with the DGA. That is, we are not simply capturing changes in aggregate household food spending. USDA has devised food spending guidelines for shoppers on any budget, to meet the DGA (Carlson et al. Reference Carlson, Lino and Fungwe2007). Preliminary statistics indicated that average household in the Panel far exceeds recommendations for refined grains, red meat, sugars and sweets and falls far short of recommendations for whole grains, lean protein sources, and all fruit and vegetable categories.Footnote 2 Therefore, as expenditure shares increase for fruits and vegetables, households are unlikely to be substituting spending in other, relatively healthful categories, but rather, are substituting away from categories with less healthful foods.Footnote 3
To assess health outcomes, we merge the Panel with the 2010–2012 IRI MedProfiler data. The MedProfiler data consists of a subset of the Panel, approximately 40,000 households, who also respond to a detailed annual survey regarding health and well-being. We focus our analysis only on those households participating in both datasets, and, therefore, providing purchase records and demographics matched with self-reported health outcomes. Respondents indicate on a five-point scale the extent to which they suffer from obesity. The possible responses include: suffer but do not treat, suffer and treat with over-the-counter medication, suffer and treat with prescription medication, suffer and treat with both over-the-counter and prescription medication, and do not suffer. Restricting our attention to adults,Footnote 4 we calculate the binary Obesity, which serves as one of our key variables of interest, as being equal to one for households in which any adults report suffering from obesity.Footnote 5
Researchers have long recognized the biases associated with self-reported health data. Rowland (Reference Rowland1990) found that survey respondents were prone to underreport their weight, and that this bias increased among overweight people. It is, therefore, worth comparing the MedProfiler statistics to findings from NHANES. Among the MedProfiler sample, we find that the share of adults reporting obesity is approximately 13 percent. However, the average adult BMI for the sample was 28.8, with values ranging from 18.9 to 46.8, and a standard deviation of 7.5. Flegal et al. (Reference Flegal, Carroll, Kit and Ogden2012) used 2009–2010 NHANES data and found that 35 percent of adults were obese, and the average adult BMI was 28.7, as calculated based on reported height and weight. Therefore, MedProfiler respondents are accurately reporting their height and weight, but are underreporting or misunderstanding the extent to which they are obese.
Owing to this potential issue with the self-reported incidence of obesity, we also calculate and analyze adult BMI in our MedProfiler dataset (AdultBMI). Therefore, we are able to determine the impact of fruit and vegetable purchase behavior on the discrete obesity classification as well as on incremental changes in adult BMI. This widens the implications of our potential findings, given that fruit and vegetable purchase and consumption may affect the body weight of individuals who are not obese. Additionally, we recognize that survey respondents have the potential to underreport their status as obese or to misreport their weight. We are able to investigate the robustness of our findings by measuring weight outcomes in two ways. In our dataset, across all years, the correlation between average adult BMI and fruit and vegetable expenditure share is −0.021. The average fruit and vegetable expenditure share for households with Obesity equal to 1 is 13.31 percent and for households with Obesity equal to 0 is 13.9 percent .
The MedProfiler data also includes several survey questions that we use to measure lifestyle choices, which are important as controls. We calculate household scores for exercise activity, vitamin intake, and omega-3 supplement usage. The possible responses for these questions are 1 = Never, 2 = Sometimes, and 3 = Frequently. We average these responses across all adults within households to calculate these lifestyle scores. Table 1 provides summary statistics and definitions for the variables used in our analysis.
a A very small percentage of households in the IRI Panel report having multiple household heads of the same gender. In these cases, the gender-specific obesity dummy is calculated as 1 if any of the household heads for the relevant gender report being obese.
Source: Authors' calculations using IRI Household Panel and IRI MedProfiler Data.
Methodology and Results
Our empirical strategy for identifying a relationship between fruit and vegetable purchases and health outcomes is two-pronged. Our empirical models for the binomial logistic estimation and ordinary least squares (OLS) estimation are as follows:
where the odds ratio of household-level incidence of obesity in equation (1) or average AdultBMI in equation (2) for household i is a function of fruit and vegetable expenditure shares (FruitVegShare), a vector of variables measuring lifestyle choices (Lifestyle), a vector of geographic effects (Region), a vector of educational attainment variables (Education), a vector of race and ethnicity dummy variables (Race), and a vector of additional household demographic controls (Demographics). In terms of dietary choices, FruitVegShare is calculated as the share of total annual grocery expenditures attributed to fruits and vegetables.
Our key explanatory variable of interest is FruitVegShare, and we expect to estimate negative coefficients for this variable, owing to the body of evidence supporting the health protectiveness of these foods. Our identification strategy hinges on the notion that body weight and overweight/obesity status can change in a matter of years. Given that the average age in the MedProfiler sample is 55, our data is skewed toward an age above the general U.S. population. Wang et al. (Reference Wang, Yi, Roebothan, Colbourne, Maddalena, Wang and Sun2016) tracked the body weight status of over 3,000 adults aged between 40 and 55 years at the start of the study. The authors identified four distinct body-weight trajectories within the sample, indicating that body weight and obesity status can change, even among middle-aged adults. We control for age, although due to the nature of our sample, our expectations for this variable are ambiguous.
Given that we lack access to a comprehensive or consistent measure of nutritional content or serving size across products, we are not able to measure fruit and vegetable consumption as a share of calories in our analysis. Therefore, it is possible that changes in fruit and vegetable expenditure shares could be due to changes in relative prices. However, we argue that measurement error due to relative food prices is only an issue if households systematically purchase fruits and vegetables at stores different from those at which they buy other groceries. Statistics from our dataset suggest that this is not the case. For example, we compared fruit and vegetable purchases of households that primarily shop at supermarkets with those of households that shop predominantly elsewhere. Households in the highest quartile of supermarket shoppers have an average FruitVegShare of 14.05 percent. For households in the bottom quartile, this value is 12.55 percent.Footnote 6 This indicates a modest difference between these two groups of shoppers.
The vectors of controls include variables calculated using the Panel and the MedProfiler datasets. The Lifestyle vector includes household scores based on the responses to questions on exercise activity, vitamin intake, and omega-3 supplementation. We model these as measures of lifestyle choices or the unobservable demand for health, and thus we expect each of these factors to be negatively associated with measures of body weight.
The Region vector indicates if households are located in the Northeast, Midwest, South, or West. Through this vector we aim to capture geographic differences in obesity rates, attitudes toward health, average dietary quality, and food prices. As an example, the CDC calculates and reports obesity rates by state each year. It is readily apparent as of 2016 that obesity rates are the highest on average in the southern and Midwestern states and the lowest on average in the western and northeastern states (CDC 2017).
The remaining controls consist of demographic variables. The Education vector indicates, by gender of the household heads, the highest level of education attained. Owing to the literature cited in support of this control, we expect that education will be negatively associated with obesity and BMI. The Race vector indicates if households categorize themselves as White, Black, Asian, or other.Footnote 7 We have no clear expectations for these variables, particularly given that the Panel is predominantly white. And, finally, the Demographics vector includes measures of household income and the aforementioned average age of adults. We expect income to be negatively associated with obesity and BMI.
Our logistic and OLS approaches seek to identify the impacts of purchasing behavior by means of cross-sectional variation across households. Therefore, we model the 2012 Obesity and AdultBMI values as functions of the average values for FruitVegShare for all available years (maximum of 6) of data for the respective households. We do not assume that 1 year of observed purchase behavior is likely to have a meaningful impact on body weight or health outcomes generally. We also follow this approach to address concerns regarding dual-causality, as health outcomes are a plausible function of earlier behavior but the reverse cannot be true. Nevertheless, we recognize that unobservable household characteristics, e.g., the demand for health and longevity, may be correlated with both the error term and fruit and vegetable purchases. Using the average values for FruitVegShare, we are able to capture the multiyear effect of households' past fruit and vegetable purchases, measured as measures of past behavior. Similarly, we believe that the lifestyle variables tend to have the same type of long-term impact on health as the food purchase behavior; therefore, we use the average values for these variables as well in equations (1) and (2).
Furthermore, we revise equations (1) and (2) by including both the cross sectional and longitudinal effects. Our dependent variable is again adult Obesity or AdultBMI at time t. This way, we are able to estimate our regression in a panel setting, using household fixed effects. In equations (3) and (4) below, we use Obesity and AdultBMI for all available years, 2010–2012, and model these as functions of FruitVegShare using the most recent 3 years of data. For example, 2012 health outcomes are modeled as a function of average fruit and vegetable expenditures for 2010–2012.
Similar to the variables used in equations (1) and (2), the odds ratio of household-level incidence of obesity in equation (3) or average AdultBMI in equation (4) for household i at time t is a function of household fruit and vegetable expenditure shares at time t (FruitVegShare), household lifestyle choices at time t (Lifestyle), a vector of geographic effects (Region), a vector of educational attainment variables (Education), a vector of race and ethnicity dummy variables (Race), and a vector of additional household demographic controls (Demographics) such as age that vary with time.
Through this approach we increase our sample size substantially. In addition, the usage of household-level fixed effects controls for unobservable characteristics that might relate to household purchasing decisions and health outcomes, mitigating potential concerns regarding dual causality or endogeneity that cannot be addressed using a lag structure. In all of our models, we checked for multicollinearity using the variance inflation factor, and found no values above two. We also estimate using robust standard errors, to account for potential heteroscedasticity resulting from variation in total fruit and vegetable expenditures across households.
Results and Discussion
We begin by estimating equations (1) and (2) using logistic regression for the binary Obesity variable and OLS estimation for the continuous AdultBMI variable.Footnote 8 Selected results are reported in Table 2, and the complete set of results are available from the authors upon request. Importantly, we find that increased fruit and vegetable expenditure shares are associated with significant decreases in the incidence of high obesity among adults. We also find a negative and significant relationship with AdultBMI.
Robust standard errors in parentheses.
*** Denotes that the coefficient estimate is significant at the 0.01 level. **: At the 0.05 level. *: At the 0.10 level.
a Marginal effects are calculated based on the estimated coefficient on FruitVegShare for the logit regressions.
b Elasticities are calculated at the sample mean values for all variables. Given our regression specification, this corresponds to ${\hat \beta} _{FruitVegShare}{\ast}\left({\textstyle{{\overline {FruitVegShare}} \over {\overline {AdultBMI}}}} \right)$, for the AdultBMI coefficient.
c The AdultBMI is the average BMI among the household heads, as calculated using their reported height and weight. For the male and female estimations, this usually corresponds to the BMI of a single adult. However, some households feature two (or more) household heads of the same gender, and in this case AdultBMI is also an average.
The interpretation of the estimated coefficients on FruitVegShare vary between the logit and OLS estimations. To aid with interpretation, we calculate and report the estimated marginal effect for FruitVegShare with respect to Obesity and elasticities with respect to AdultBMI. We find that a one percentage point increase in FruitVegShare is associated with an approximately 9 percent decrease in the probability of being obese, within households. With respect to BMI, the estimated impact is a 1.4 percent decrease in average BMI among adults. These associations are both statistically and economically significant. Moreover, given the gap between the average American's consumption of fruit and vegetables and federal recommendations, they suggest that considerable reductions in the U.S. obesity rate can be achieved through dietary changes.
In most cases, the coefficient estimates for our controls have the expected signs. The lifestyle factors (exercise, vitamin intake, and omega-3 supplement intake), which are not intended to demonstrate causality, but rather to control for preferences toward health and well-being, are generally negative and significant. We corroborate the findings of previous studies (e.g., Schroeter et al. Reference Schroeter, Anders and Carlson2013), in that our lifestyle vector seems to serve as another marker for healthy eating.
Income is negatively associated with adult obesity and BMI, supporting the body of work on socioeconomic status and health outcomes (e.g., Stewart et al. Reference Stewart, Blisard and Joliffe2003). We find evidence that average age is negatively associated with adult body weight. Recall that age in the Panel averages well above the general population and exhibits relatively little variation, and, therefore, estimation results regarding age ought to be interpreted with caution.Footnote 9 Given the age distribution of our data, our results indicate that as adults advance through middle age, they are more likely to lose than gain weight, all else equal.
Across gender, the results for equations (1) and (2) show some intriguing differences that may warrant further investigation. The estimated coefficient on Obesity is negative and not significant for male household head, but for female household head it is highly significant. The coefficient on BMI for male household head is negative and significant, indicating that a 1 percent increase in FruitVegShare is associated with a decrease in BMI of nearly five units, which translates into approximately a 5 percent decrease based on the elasticity. However, the coefficient on BMI for female household head is insignificant. Across other covariates, no systematic patterns emerge between the genders, although in some cases estimated coefficients differ significantly in magnitude.
We also estimate equations (3) and (4) using the full MedProfiler dataset, with both year effects and household fixed effects. This enables us to use a considerably larger sample size and examine how the key variables influence the health outcomes of the same households over time, as we are no longer restricted to using ailment data from 2012 alone. Therefore, when using the 2010–2012 ailment scores, we estimate equations (3) and (4) with the fixed effects Panel approach to control for the un-observables such as household genetics and the overall demand for health. To eliminate the incidental parameter problem in the fixed effects logit model specified in equation (3), we used the conditional maximum likelihood estimation method suggested in Cameron and Trivedi (Reference Cameron and Trivedi2005). Our dasataset is strongly balanced, as we include only those households that participated consistently from 2010–2012. In addition, we detected no autocorrelations with the errors.Footnote 10 Table 3 reports the results with fixed effects.
Robust standard errors in parentheses.
*** Denotes that the coefficient estimate is significant at the 0.01 level. **: At the 0.05 level. *: At the 0.10 level.
a Marginal effects are calculated based on the estimated coefficient on FruitVegShare for the logit regressions.
b Elasticities are calculated at the sample mean values for all variables. Given our regression specification, this corresponds to ${\hat \beta} _{FruitVegShare}{\ast} \left({\textstyle{{\overline {FruitVegShare}} \over {\overline {AdultBMI}}}} \right) $, for the AdultBMI coefficient.
c The AdultBMI is the average BMI among the household heads, as calculated using their reported height and weight. For the male and female estimations, this usually corresponds to the BMI of a single adult. However, some households feature two (or more) household heads of the same gender, and in this case AdultBMI is also an average.
d In the fixed effects logit model, neither the intercept term nor the pseudo-R2 are estimated or calculated.
The fixed effect findings lend robustness to our logistic and OLS results and reinforce our key estimated relationships. Specifically, we find that increased fruit and vegetable expenditure shares are associated with statistically significant reductions in both Obesity and AdultBMI, for the full samples. The same association is also found in female head Obesity and male head BMI. The FruitVegShare coefficients indicate larger impacts for women than the entire sample with respect to obesity status, which may help inform the optimal implementation strategies for educational or promotional efforts aimed at improving fruit and vegetable intake. In contrast with the results of (2), the impact of FruitVegShare on BMI for female household head is found to be negative and significant.
Furthermore, consistent with the logistic and OLS results, the ExerciseScore coefficients show negative and significant impact on both Obesity and AdultBMI in the fixed effects results. Interestingly, HHAge is positively associated with our measures of body weight. Thus, aging can result in higher incidence of obesity and increased average BMI among Americans.
The gender-specific results call for a closer examination, ideally using data with more diverse demographics. We consistently estimate negative and significant impacts for the overall samples, for both Obesity and AdultBMI. When estimating equations (1) and (2), we find no significant impact of fruit and vegetable expenditures on obesity for men or for average BMI for women. When estimating equations (3) and (4), with household fixed effects, neither of the gender-based estimations yield significant associations with FruitVegShare. The lack of significant findings for male head Obesity and female head BMI may be due to the fact that the MedProfiler data are self-reported. There may be important systematic differences between men and women with respect to reporting weight or obesity status, and this possibility warrants investigation to inform future research. Epidemiological research on the mechanisms affecting body weight and how these differ by gender, as well as other interdisciplinary approaches, may help inform these results.
Our results indicate that, when controlling for lifestyle choices, and a host of demographic factors, there is a strong association between fruit and vegetable expenditures and obesity incidence or BMI. Therefore, policy and educational efforts aimed toward increasing fruit and vegetable purchases at the retail level, particularly in supermarkets, may be worth the cost of implementation, owing to the expected decrease in health care expenditures over the coming years. Additional research may help demonstrate whether such expenditures at the national level are justified, given that we find a robust and significant impact on average adult BMI. However, efforts may also be targeted to areas of above-average obesity levels, given that we find the discrete classification of obesity is more sensitive to changes in fruit and vegetable purchases than is BMI. Richards and Hamilton (Reference Richards and Hamilton2012) argue that obesity policy would be best aimed at informing consumers about the long-term implications of their behavior, based on people's effective discount rate in making decisions. Our research provides a unique contribution by quantifying the impacts of multiple years of household food purchase behavior on adult obesity outcomes and BMI.
Conclusions and Future Work
We use a rich and granular dataset on household food expenditures, paired with self-reported medical outcome data. The results indicate a robust, negative, and significant relationship between the share of total food expenditures on fruits and vegetables and the incidence of obesity as well as average adult BMI. We measure fruit and vegetable expenses as shares of total recorded food expenditures. We find that increasing household fruit and vegetable expenditure shares by one percentage point decreases the long run, multiyear incidence of adult obesity by approximately 9 percent and average adult BMI by 1.4 percent. These estimates are robust to specification choice.
We argue that the relatively poor average U.S. dietary quality motivates additional policy and educational efforts aimed toward increasing fruit and vegetable purchases among U.S. households. Marginal changes and improvements to fruit and vegetable purchasing behavior can be executed directly at the retail level too. Estimating the potential impacts of increased fruit and vegetable expenditures on health care costs or on the incidence of morbidity in the United States is beyond the scope of this study. However, such work is strongly motivated in future research as a means to estimate the expected returns from government or industry efforts to increase fruit and vegetable purchases.
Our research is not without limitations. Chief among them is the limited demographic variation in the IRI Panel, which is weighted toward older higher-income, well-educated, and white families. Given this restriction, we only have a limited capacity to observe food choices and health outcomes among low-income households and those eligible for food assistance programs. We are also, as noted, unable to observe food choices and expenditures outside the home, which have the potential to impact diet quality and health. Because we use point-of-sale data, we measure fruit and vegetable expenditures, not consumption. Therefore, we are unable to control for waste or food preparation methods, both of which may affect the amount of fruit and vegetable actually being consumed following the observed purchases. Lastly, we recognize there is a degree of inconsistency with respect to statistical significance of the gender-based results. Given our current resources, we are unable to provide a specific reason for that. An interdisciplinary research to study these gender effects in more detail might be helpful to inform the results.
Our analysis could be extended in a variety of ways. It would be useful to understand how increased expenditures on other food and beverage categories influence body weight. Certain food categories such as fruit juices or red meats have health implications that have yet to be resolved by the nutrition or epidemiological literature. Our dataset allows for investigations in these questions that extend beyond dietary recall data. Another potential application of this research is to examine if the impacts of fruit and vegetable purchases on body weight are equal, regardless of point of purchase. For example, it has been suggested in consumer surveys that fruit and vegetable quality is inferior in supercenters relative to supermarkets. If this were to be the case, we would expect smaller impacts of fruit and vegetable purchases on health, among supercenter shoppers.