
Accuracy and reliability of self-reported weight and height in the Sister Study

Published online by Cambridge University Press:  09 December 2011

Cynthia J Lin
Affiliation:
Epidemiology Branch, National Institute of Environmental Health Sciences, PO Box 12233, MD A3-05, Research Triangle Park, NC 27709, USA
Lisa A DeRoo
Affiliation:
Epidemiology Branch, National Institute of Environmental Health Sciences, PO Box 12233, MD A3-05, Research Triangle Park, NC 27709, USA
Sara R Jacobs
Affiliation:
Epidemiology Branch, National Institute of Environmental Health Sciences, PO Box 12233, MD A3-05, Research Triangle Park, NC 27709, USA
Dale P Sandler*
Affiliation:
Epidemiology Branch, National Institute of Environmental Health Sciences, PO Box 12233, MD A3-05, Research Triangle Park, NC 27709, USA
*Corresponding author: Email sandler@niehs.nih.gov

Abstract

Objective

To assess the accuracy and reliability of self-reported weight and height and identify the factors associated with reporting accuracy.

Design

Analysis of self-reported and measured weight and height from participants in the Sister Study (2003–2009), a nationwide cohort of 50 884 women aged 35–74 years in the USA with a sister with breast cancer.

Setting

Weight and height were reported via computer-assisted telephone interview (CATI) and self-administered questionnaires, and measured by examiners.

Subjects

Early enrolees in the Sister Study. There were 18 639 women available for the accuracy analyses and 13 316 for the reliability analyses.

Results

Using weighted kappa statistics, comparisons were made between CATI responses and examiner measures to assess accuracy and CATI and questionnaire responses to assess reliability. Polytomous logistic regression evaluated factors associated with over- or under-reporting. Compared with measured values, agreement was 96 % for reported height (±1 inch (±2·5 cm); weighted κ = 0·84) and 67 % for weight (±3 lb (±1·36 kg); weighted κ = 0·92). Obese women (BMI ≥ 30 kg/m2) were more likely than normal-weight women to under-report weight by ≥5 % and underweight women (BMI < 18·5 kg/m2) were more likely to over-report. Among normal-weight and overweight women (18·5 kg/m2 ≤ BMI < 30 kg/m2), weight cycling and lifetime weight difference ≥50 lb (≥22·68 kg) were associated with over-reporting.

Conclusions

US women in the Sister Study were reasonably reliable and accurate in reporting weight and height. Women with normal-range BMI reported most accurately. Overweight and obese women and those with weight fluctuations were less accurate, but even among obese women, few under-reported their weight by >10 %.

Type
Research paper
Creative Commons
This is a work of the US Government and is not subject to copyright protection in the United States.
Copyright
Copyright © The Authors 2011

Many studies have found an association between high or low BMI and risk of adverse health outcomes using self-reported data on weight and height. With an increasing prevalence of overweight and obesity in the USA(Reference Ogden, Carroll and Curtin1), the effect of anthropometric characteristics on reporting accuracy is a concern. Studies have examined the accuracy of self-reported v. directly measured height and weight but findings have varied and many studies were small or otherwise limited(Reference Engstrom, Paterson and Doherty2, Reference Paradis, Perusse and Godin3). In a meta-analysis of weight reporting in thirty-four studies, only eighteen were from the USA, sample sizes varied from eighteen to 9000, ages varied from 12 to 84 years, and measurement protocols differed or were not described(Reference Engstrom, Paterson and Doherty2). While many studies suggest that women tend to under-report their weight, less is known about the factors associated with reporting accuracy.

Current weight has been shown to influence weight reporting accuracy. The overweight and obese tend to under-report their weight and the underweight tend to over-report(Reference Gorber, Tremblay and Moher4, Reference Roberts5). Studies of selected populations, including adult women in the USA, have also suggested that age and race contribute to reporting bias(Reference Bostrom and Diderichsen6–Reference Rowland8).

The impact of weight fluctuation and weight cycling on weight reporting accuracy has not been thoroughly examined in the existing literature. Weight cycling is not uncommon. Among Finnish women, the prevalence of weight cycling (defined as losing and then regaining ≥5 kg) was reported to be 29 %(Reference Lahti-Koski, Mannisto and Pietinen9). Strohacker et al. estimated that 38 % of US women weight cycle at least once in their lifetime(Reference Strohacker and McFarlin10), and 20 % of women in the Nurses’ Health Study reported at least three weight cycling episodes (defined as losing and then regaining ≥10 lb (≥4·54 kg))(Reference Field, Byers and Hunter11). Among obese bariatric surgery candidates, frequent weight cycling was associated with greater reporting accuracy, suggesting that frequent weight cycling might increase attentiveness to weight, leading to heightened accuracy in reporting(Reference White, Masheb and Burke-Martindale12). Weight cycling and fluctuation and weight reporting accuracy have not yet been examined in a large sample of the general population.

A tendency to over-report height has been observed, particularly among people who are older, shorter and/or overweight(Reference Rowland8), but under-reporting has been observed in higher income categories for certain age groups(Reference Merrill and Richardson13). Fewer studies have assessed reliability of self-reported measures and results were inconsistent(Reference Perez-Cueto and Verbeke14, Reference Probst, Faraji and Batterham15).

The present study assessed the accuracy and reliability of self-reported weight and height in a large cohort of US women and identified characteristics associated with reporting accuracy. We compared self-reported height and weight with examiner-measured values, and separately compared two self-reports obtained using different approaches, allowing us to consider design features affecting data quality.

Methods

Data collection and study population

We used data from the Sister Study, a nationwide volunteer cohort of 50 884 US (including Puerto Rico) women aged 35–74 years with a sister with breast cancer; enrolment occurred from September 2003 to March 2009. The present analysis examines early enrolees who completed baseline activities by 21 September 2007 (n 31 409). To avoid errors influenced by eating disorders(Reference McCabe, McFarlane and Polivy16–Reference Meyer, McPartlan and Sines18), participants who reported ever having anorexia or bulimia were excluded (n 1066). Pregnant women delayed baseline activities until at least three months after the end of pregnancy.

Study participants reported weight (pounds) and height (feet and inches) in a computer-assisted telephone interview (CATI) and separately on a self-administered scannable diet questionnaire. During a home visit, trained examiners used digital self-calibrating scales to measure weight and metal tape measures to measure height. The order of completing the CATI, questionnaire and home visit varied; self-reports could be completed before or after the home visit. All measurements were taken three times without shoes. Measurements were rounded to the nearest whole pound for weight and quarter of an inch for height. Other variables examined from the baseline CATI were weight cycling (frequency of losing and then gaining ≥20 lb (≥9·07 kg)), lowest weight since age 20 years, heaviest non-pregnant/breast-feeding weight, age, race, education level, perceived health status, marital status, household income, smoking, alcohol, physical activity, gravidity, regular multivitamin intake, recency of last medical examination, history of depression and use of antidepressant medications.

BMI was categorized using the Centers for Disease Control and Prevention definitions(19). Lifetime weight difference was calculated by subtracting lowest weight since age 20 years from heaviest non-pregnant/breast-feeding weight. All statistical analyses were performed using the STATA/IC for Windows statistical software package version 10·1 (StataCorp LP, College Station, TX, USA).
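The derived variables described above can be sketched as follows. This is a minimal illustration, not the authors' STATA code: the function names are hypothetical, the 18·5 and 30 kg/m2 cut-offs are those quoted in the abstract, and the 25 kg/m2 boundary between normal weight and overweight is assumed from the standard adult BMI categories.

```python
LB_TO_KG = 0.45359237  # exact definition of the avoirdupois pound in kg
IN_TO_M = 0.0254       # exact definition of the inch in metres


def bmi_from_us_units(weight_lb: float, height_in: float) -> float:
    """Return BMI in kg/m^2 from weight in pounds and height in inches."""
    weight_kg = weight_lb * LB_TO_KG
    height_m = height_in * IN_TO_M
    return weight_kg / height_m ** 2


def bmi_category(bmi: float) -> str:
    """Classify BMI; the 25 kg/m^2 cut-off is an assumption (standard adult categories)."""
    if bmi < 18.5:
        return "underweight"
    if bmi < 25.0:
        return "normal"
    if bmi < 30.0:
        return "overweight"
    return "obese"


def lifetime_weight_difference(heaviest_lb: float, lowest_lb: float) -> float:
    """Heaviest non-pregnant weight minus lowest weight since age 20, in pounds."""
    return heaviest_lb - lowest_lb


# Hypothetical participant: 134 lb, 64 inches, heaviest 150 lb, lowest 120 lb.
example_bmi = bmi_from_us_units(134, 64)
example_category = bmi_category(example_bmi)
example_difference = lifetime_weight_difference(150, 120)
```

The example participant is invented for illustration and does not come from the study data.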

Accuracy of telephone interview

To assess the accuracy of self-reported weight and height, we first compared CATI-reported values with examiner measures among women who completed the CATI within 30 d of the home visit (n 18 639). The primary source of Sister Study data is the telephone interview, which had less missing data and fewer structural errors (see below) for height and weight. For this analysis, examiner measures were treated as the true value. Percentage agreement and weighted kappa statistics were calculated for each variable of interest. Kappa statistics were weighted according to a standard weight in STATA to account for the degree of disagreement. Polytomous logistic regression was used to calculate odds ratios and 95 % confidence intervals for reporting accuracy by age, race, education level, perceived health status, marital status and measured BMI.
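A weighted kappa of this kind can be sketched in a few lines. The paper does not say which weighting scheme or category bins were used, so the sketch below assumes linear weights, w = 1 − |i − j|/(k − 1), and ordered categories coded 0 to k − 1; the function name is hypothetical.

```python
from collections import Counter
from typing import Sequence


def weighted_kappa(rater_a: Sequence[int], rater_b: Sequence[int], n_categories: int) -> float:
    """Linear-weighted Cohen's kappa for ordered categories coded 0..n_categories-1.

    Assumption: linear agreement weights w_ij = 1 - |i - j| / (k - 1).
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    if n_categories < 2:
        raise ValueError("need at least two categories")

    n = len(rater_a)
    k = n_categories
    joint = Counter(zip(rater_a, rater_b))  # observed cell counts
    row = Counter(rater_a)                  # marginal counts, first report
    col = Counter(rater_b)                  # marginal counts, second report

    def weight(i: int, j: int) -> float:
        return 1.0 - abs(i - j) / (k - 1)

    # Weighted observed agreement.
    p_obs = sum(weight(i, j) * count / n for (i, j), count in joint.items())
    # Weighted agreement expected by chance from the marginals.
    p_exp = sum(
        weight(i, j) * (row[i] / n) * (col[j] / n)
        for i in range(k)
        for j in range(k)
    )
    if p_exp == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_obs - p_exp) / (1.0 - p_exp)
```

With linear weights, a report one category away from the measured value is penalized less than a report two categories away, which is what "accounting for the degree of disagreement" means in practice.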

To be consistent with the existing literature, we first examined the absolute difference between self-reported and measured weight. Differences between measured and self-reported weight were categorized as under-reporting by ≥7 lb (≥3·18 kg), under-reporting by 4–6 lb (1·81–2·72 kg), reporting within 3 lb (1·36 kg) and over-reporting by ≥4 lb (≥1·81 kg). Because the relative impact of a specific weight difference will be greater in smaller than larger women, we also calculated the percentage of weight misreported; self-reports that differed by less than 5 % from measured weights were the referent category. Polytomous logistic regression models explored the effects of measured BMI, weight cycling, lifetime weight difference and current antidepressant use on under- and over-reporting, adjusting for age, race, education, perceived health status and marital status as potential confounders. Models examining weight cycling, lifetime weight difference or current antidepressant use also adjusted for measured BMI. Differences between measured and self-reported height were categorized as under-reporting by ≥1 inch (≥2·5 cm), reporting within 1 inch and over-reporting by ≥1 inch.
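The two weight-misreporting classifications described above could be coded as in the sketch below. It assumes both weights are whole pounds (as recorded in the study), defines the difference as measured minus reported, and uses hypothetical function and label names.

```python
def absolute_category(measured_lb: int, reported_lb: int) -> str:
    """Classify misreporting by absolute difference in whole pounds."""
    diff = measured_lb - reported_lb  # positive means weight was under-reported
    if diff >= 7:
        return "under_7_or_more"
    if diff >= 4:
        return "under_4_to_6"
    if diff >= -3:
        return "within_3"
    return "over_4_or_more"


def percent_category(measured_lb: float, reported_lb: float) -> str:
    """Classify misreporting as a percentage of measured weight.

    Reports differing by less than 5 % from measured weight form the referent category.
    """
    pct = 100.0 * (reported_lb - measured_lb) / measured_lb
    if pct <= -5.0:
        return "under_5pct_or_more"
    if pct >= 5.0:
        return "over_5pct_or_more"
    return "within_5pct"
```

The percentage version illustrates why both scales were used: a 10 lb shortfall is a 5 % under-report at 200 lb but a 10 % under-report at 100 lb.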

To determine the effect of misreporting on BMI categories, we compared categories calculated from CATI-reported data with categories based on examiner-measured data using percentage agreement and weighted kappa statistics for all women and stratified by categories of age, race, education level, perceived health status and marital status. We also determined the sensitivity and specificity of self-reported overweight/obese classification relative to examiner-measured data. To further explore the potential for bias in BMI we stratified on measured BMI and examined the percentage of CATI-determined BMI values that over- or underestimated BMI calculated from examiner-measured values.
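Sensitivity and specificity of a self-reported classification, with the examiner-based classification treated as truth, can be computed as in this minimal sketch. The function name and the toy data in the usage lines are hypothetical.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    measured_positive: Sequence[bool], reported_positive: Sequence[bool]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) of the reported class against the measured class."""
    if len(measured_positive) != len(reported_positive):
        raise ValueError("inputs must have equal length")

    pairs = list(zip(measured_positive, reported_positive))
    true_pos = sum(1 for m, r in pairs if m and r)
    false_neg = sum(1 for m, r in pairs if m and not r)
    true_neg = sum(1 for m, r in pairs if not m and not r)
    false_pos = sum(1 for m, r in pairs if not m and r)

    if true_pos + false_neg == 0 or true_neg + false_pos == 0:
        raise ValueError("need at least one measured positive and one measured negative")

    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Toy data: True means overweight/obese (BMI >= 25 kg/m^2).
measured = [True, True, True, False, False]
reported = [True, True, False, False, True]
sens, spec = sensitivity_specificity(measured, reported)
```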

We carried out additional analyses stratifying by or adjusting for which measure came first, the home visit or CATI.

Accuracy of self-completed questionnaire

Using data from the subset of women with CATI and questionnaire completed within 30 d of the home visit (n 13 985), we carried out similar analyses to assess the accuracy of weight and height reported in the self-completed questionnaire compared with examiner-measured data. We then compared the accuracy of the two self-report measures by calculating ratios of odds ratios from models assessing reporting by CATI or questionnaire v. measured data. An analysis including all women (n 21 935) completing the diet questionnaire within 30 d of the home visit had similar results and is not shown.

Reliability

Reliability of self-reported weight and height was assessed using percentage agreement and weighted kappa statistics to compare self-reported data from the CATI and diet questionnaires. Analyses were limited to women who completed the CATI within 30 d of submitting their questionnaire (n 13 316) and had non-missing questionnaire data for weight (n 11 585) and height (n 11 885). Similar to the accuracy analysis, we stratified and adjusted analyses by reporting order with respect to each other and with respect to the examiner measurement.

Correcting structural errors

Prior to analyses, we identified and corrected several problems inherent to the reporting method. Both random and systematic errors occurred with the self-administered diet questionnaire. About 1 % of respondents appeared to make frameshift bubbling errors for weight and/or height by mistaking the bubbles in one or more columns as starting at 1 instead of 0. Figure 1 shows a frameshift error in which the respondent filled in the wrong value for weight in the tens place and the wrong values for height in the feet and inches columns. Frameshift errors occurred frequently in the hundreds place of weight and were detected when an unreasonable weight (<100 lb) was marked (e.g. 34 lb instead of 134 lb). We corrected obvious frameshift errors (0·7 % of weight values and 0·1 % of height values) when questionnaire values differed from both the CATI and examiner reports by >60 lb (>27·22 kg) or >11 inches (>27·9 cm).
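The detection rule for weight frameshift errors can be sketched as below. The >60 lb threshold is taken from the text; the repair step, which adds 100 lb when the hundreds digit appears to have been marked one bubble too low, is an assumption made for illustration, since the paper does not describe how corrected values were derived. All function names are hypothetical.

```python
def is_weight_frameshift_candidate(
    questionnaire_lb: float, cati_lb: float, measured_lb: float, threshold_lb: float = 60.0
) -> bool:
    """Flag a questionnaire weight that differs from BOTH other sources by more than the threshold."""
    return (
        abs(questionnaire_lb - cati_lb) > threshold_lb
        and abs(questionnaire_lb - measured_lb) > threshold_lb
    )


def repair_hundreds_frameshift(
    questionnaire_lb: float, cati_lb: float, measured_lb: float, threshold_lb: float = 60.0
) -> float:
    """Hypothetical repair: shift the hundreds digit up by one if that resolves the conflict.

    Returns the original value when it is not flagged or when the shift does not help.
    """
    if not is_weight_frameshift_candidate(questionnaire_lb, cati_lb, measured_lb, threshold_lb):
        return questionnaire_lb
    shifted = questionnaire_lb + 100
    if not is_weight_frameshift_candidate(shifted, cati_lb, measured_lb, threshold_lb):
        return shifted
    return questionnaire_lb


# Example from the text: 34 lb marked instead of 134 lb (CATI and examiner values are invented).
repaired = repair_hundreds_frameshift(34, cati_lb=135, measured_lb=136)
```

Requiring disagreement with both the CATI and the examiner value keeps the rule from overwriting a questionnaire value merely because one other source is itself in error.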

Fig. 1 Example of frameshift errors on the self-administered diet questionnaire

Some errors were related to the choice of unit. In the diet questionnaires, a small percentage of respondents appeared to report height in total inches rather than feet and inches as instructed. For example, instead of 5 feet 4 inches, a respondent marked the total inch equivalent (64 inches) which was then mistakenly interpreted as 6 feet 4 inches. We corrected these unit errors in about 0·8 % of all responses by checking suspiciously high reports and confirming corrections with CATI and examiner reports. Although these errors occurred for units (inches, pounds) used in the USA, similar errors could occur for those used in other countries (e.g. metres, kilograms).
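The unit error can be illustrated with a short sketch. It assumes the questionnaire records one feet digit and an inches value, reads the same marks both as feet and inches and as total inches, and keeps whichever reading agrees with the CATI and examiner values. The 2-inch tolerance and the function name are assumptions, not values from the paper.

```python
def resolve_height_in(
    feet_mark: int, inches_mark: int, cati_in: float, measured_in: float, tolerance_in: float = 2.0
) -> float:
    """Return questionnaire height in inches, correcting a likely total-inches entry.

    feet_mark and inches_mark are the values scanned from the feet and inches columns.
    """
    as_instructed = feet_mark * 12 + inches_mark   # 6 and 4 read as 6 ft 4 in = 76 in
    as_total_inches = feet_mark * 10 + inches_mark  # 6 and 4 read as 64 in

    def agrees(value: float) -> bool:
        return (
            abs(value - cati_in) <= tolerance_in
            and abs(value - measured_in) <= tolerance_in
        )

    # Only override the instructed reading when it conflicts with both other
    # sources and the total-inches reading is consistent with them.
    if not agrees(as_instructed) and agrees(as_total_inches):
        return float(as_total_inches)
    return float(as_instructed)


# Example from the text: marks "6" and "4" from a woman who is 5 ft 4 in (other values invented).
corrected = resolve_height_in(6, 4, cati_in=64, measured_in=64.25)
```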

There were considerable missing values for weight (13 %), height (11 %) or both (8 %) in the self-administered diet questionnaires. Non-response did not substantially vary by age or BMI category. Missing weight and height were uncommon in the CATI (<1 %).

There seemed to be a tendency to round to 0 or 5 when reporting weight in the CATI (59 %) and questionnaires (52 %), whereas an end digit of 0 or 5 occurred in 27 % of examiner measures. We did not correct for this apparent rounding.
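End-digit preference of this kind is simple to quantify. The sketch below, with a hypothetical function name and invented sample values, computes the share of whole-pound weights ending in 0 or 5; if end digits were uniformly distributed, that share would be 20 %.

```python
from typing import Sequence


def share_ending_in_0_or_5(weights_lb: Sequence[int]) -> float:
    """Return the proportion of whole-pound weights whose last digit is 0 or 5."""
    if not weights_lb:
        raise ValueError("need at least one weight")
    hits = sum(1 for w in weights_lb if abs(w) % 5 == 0)
    return hits / len(weights_lb)


# Invented sample: two of four values end in 0 or 5.
share = share_ending_in_0_or_5([150, 152, 135, 161])
```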

We detected infrequent random reporting errors for all modes of reporting. In self-administered questionnaires, the questionnaire scanner was sensitive to random bubbling errors such as pencil smudges. For the CATI, there were occasional data-entry errors by interviewers and, for examiners, some inconsistencies in following measurement protocols. We corrected CATI values if they greatly differed from both examiner and questionnaire values (≥100 lb (≥45·36 kg) for weight; ≥11 inches (≥27·9 cm) for height).

Results

Participant characteristics

Participants were predominantly white (93 %), aged 45–64 years (70 %), college-educated (>50 %), and married or living as married (77 %; Table 1). Over half (58 %) were overweight or obese; 78 % perceived themselves as being in very good or excellent health.

Table 1 Characteristics of participants by weight reporting accuracy: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)

CATI, computer-assisted telephone interview.

Accuracy of telephone interview weight

Measured and self-reported (CATI) weight were highly correlated (correlation coefficient, r = 0·99). Overall, women under-reported their weight by an average of 1·6 lb (0·73 kg). The mean absolute difference between measured and CATI weight was 3·3 (sd 4·1; range 0–50) lb (1·50 (sd 1·86; range 0–22·68) kg). Mean self-reported weight was 160·2 (sd 35·5; range 82–402) lb (72·67 (sd 16·10; range 37·20–182·34) kg); mean examiner-measured weight was 161·8 (sd 36·4; range 80–425) lb (73·39 (sd 16·51; range 36·29–192·78) kg). The average absolute time between the CATI and examiner home visit was 12·6 (sd 8·7) d.

Overall, 66·5 % of women reported their weight within 3 lb (1·36 kg) of measured values (Table 1) with overall weighted κ = 0·92. Agreement within 3 lb increased with age and perceived health status and was greater for women who were married, had a college degree and had normal measured BMI. Agreement was lower for black women, obese women, women who weight cycled ≥3 times and women who completed the CATI before the physical exam.

Under-reporting weight

The crude odds ratio for under-reporting by ≥7 lb (≥3·18 kg) decreased with increasing age: OR = 0·84 (95 % CI 0·75, 0·94) for women aged 55–64 years and OR = 0·62 (95 % CI 0·54, 0·73) for women over 65 years, compared with those aged 45–54 years (Table 2). Compared with non-Hispanic whites, blacks had higher odds of under-reporting weight (OR = 1·26; 95 % CI 1·00, 1·59 for under-reporting by 4–6 lb (1·81–2·72 kg) and OR = 1·72; 95 % CI 1·36, 2·17 for under-reporting by ≥7 lb). Never married (OR = 1·41; 95 % CI 1·16, 1·72) and widowed/divorced/separated women (OR = 1·25; 95 % CI 1·11, 1·40) had increased odds of under-reporting weight by ≥7 lb compared with married women. The odds ratio for under-reporting by ≥7 lb increased from 3·82 (95 % CI 3·29, 4·43) for overweight women to 8·92 (95 % CI 7·74, 10·29) for obese women, relative to normal-weight women. Associations remained after adjusting for age, race and education (Table 2); further adjustment for perceived health and marital statuses did not substantially change estimates. Results from analyses stratified by reporting order (CATI before or after physical exam) were similar.

Table 2 Association of participant characteristics with weight reporting accuracy – CATI-reported v. examiner-measured weight: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)

CATI, computer-assisted telephone interview; OR, crude odds ratio; aOR, adjusted odds ratio (adjusted for age, race and education); Ref., referent group.

Participants reporting within 3 lb (1·36 kg) are the referent group.

The effect of weight cycling differed by BMI status, mainly affecting reporting accuracy among underweight and normal-weight women (Table 3).

Table 3 Association of weight cycling with weight reporting accuracy, stratified by BMI: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)

aOR, adjusted odds ratio (adjusted for age, race and education); Ref., referent group.

Participants reporting within 3 lb (1·36 kg) are the referent group.

About 8 % of all women (n 1439) under-reported weight by ≥5 %. Compared with normal-weight women, in adjusted analyses, the odds of under-reporting weight by ≥5 % were higher among overweight (OR = 2·38; 95 % CI 2·05, 2·77) and obese women (OR = 4·10; 95 % CI 3·54, 4·76; Fig. 2). A lifetime weight difference of 25–49 lb (11·34–22·23 kg) was also associated with under-reporting (OR = 1·35; 95 % CI 1·11, 1·65; Fig. 3). Stratifying by BMI, overweight and obese women with a lifetime weight difference of ≥50 lb (≥22·68 kg) had decreased odds of under-reporting weight by ≥5 % compared with those with a smaller weight difference (overweight OR = 0·65; 95 % CI 0·54, 0·78; obese OR = 0·52; 95 % CI 0·39, 0·70). Conversely, underweight and normal-weight women who weight cycled at least once had increased odds of under-reporting weight compared with those who never weight cycled (OR = 1·35; 95 % CI 1·02, 1·78).

Fig. 2 The association between BMI and the accuracy of weight reported in a computer-assisted telephone interview: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009). Odds ratios were adjusted for age, race, education, perceived health status and marital status; 95 % confidence intervals are represented by error bars

Fig. 3 The association between weight cycling and lifetime weight difference and the accuracy of weight reported in a computer-assisted telephone interview: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009). Odds ratios were adjusted for age, race, education, perceived health status, marital status and BMI; 95 % confidence intervals are represented by error bars

Over-reporting weight

Only 2 % (n 465) of all women over-reported weight by ≥5 %. In adjusted analyses, the most important factor associated with over-reporting weight by ≥5 % was being underweight: OR = 5·30 (95 % CI 3·67, 7·66; Fig. 2). Weight cycling and increasing lifetime weight difference were also associated with over-reporting weight (Fig. 3).

After excluding currently underweight and obese women, the increased odds of over-reporting by ≥5 % remained for those having a lifetime difference of ≥75 lb (≥34·02 kg; OR = 2·89; 95 % CI 1·76, 4·75; data not shown). However, the increased odds of over-reporting among women with ≥3 episodes of weight cycling was no longer significant (OR = 1·30; 95 % CI 0·89, 1·90; data not shown). After stratifying by BMI, lifetime weight difference ≥50 lb (≥22·68 kg) was associated with over-reporting among currently normal-weight (OR = 1·73; 95 % CI 1·22, 2·46) and overweight women (OR = 1·58; 95 % CI 1·05, 2·38).

Factors not associated with weight reporting

While current antidepressant use seemed to have some effect on weight reporting accuracy (Table 2), the associations were attenuated after adjusting for BMI. Household income, perceived stress, physical activity (total MET-h/week; MET = metabolic equivalent task), regular multivitamin use, gravidity, recency of last medical examination, smoking and alcohol were not associated with over- or under-reporting weight (data not shown).

Accuracy of self-reported height

Measured and self-reported height were highly correlated (r = 0·96); the average absolute difference between self-reported (CATI) and examiner-measured height was only 0·5 (sd 0·6; range 0–5·9) inches (1·3 (sd 1·5; range 0–15) cm). Slight variations between the CATI and examiner were likely due to different rounding conventions. Mean self-reported height was 64·6 (sd 2·6; range 50–75) inches (164·1 (sd 6·6; range 127·0–190·5) cm) and mean examiner-measured height was 64·7 (sd 2·5; range 50·7–75·1) inches (164·3 (sd 6·4; range 128·8–190·8) cm).

Over-reporting of height increased slightly with age and BMI. The odds of under-reporting height were higher among black women compared with whites. Also, women with less than a bachelor's degree had increased odds of misreporting their height compared with women with a bachelor's degree. No other factor was associated with differences in self-reported and measured height.

Accuracy of BMI based on telephone interview weight and height

The classification of overweight or obese BMI using self-reported measures was highly sensitive (0·95) and specific (0·96). For obese classification alone, sensitivity was 0·90 and specificity was 0·98.

BMI values based on CATI-reported and examiner-measured data were very close. The mean absolute difference between CATI-reported and examiner-measured BMI was only 0·7 (sd 0·8) kg/m2; the correlation was very high (r = 0·98; Fig. 4).

Fig. 4 Comparison of BMI calculated from weight and height reported in a computer-assisted telephone interview (CATI) v. examiner measures: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009); —— indicates fitted values

Among women with normal-range examiner-based BMI, BMI values calculated from CATI reports were within 4 % of measured BMI 83·4 % of the time (Table 4). However, despite an overall high correlation between BMI values from self-reported and examiner-measured data, there were noticeable discrepancies among women with lower and higher BMI. As shown, self-reported BMI was ≥5 % greater than measured BMI for about a quarter of underweight women. Also, BMI based on CATI-reported values was under-reported by ≥5 % for about 12 % of overweight women and 17 % of obese women.

Table 4 Percentage discrepancy between BMI based on CATI-reported values and examiner-measured values, by BMI (based on examiner-measured weight and height): US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)

CATI, computer-assisted telephone interview.

Accuracy of self-completed questionnaire

Restricting to participants who completed both the questionnaire and CATI within 30 d of examiner assessment (n 13 985), the average absolute differences between CATI and measured height and weight were 0·4 (sd 0·6) inches (1·0 (sd 1·5) cm) and 3·2 (sd 4·0) lb (1·45 (sd 1·81) kg), respectively. The average absolute differences between questionnaire and measured height and weight were 0·5 (sd 0·6) inches (1·3 (sd 1·5) cm) and 3·4 (sd 3·6) lb (1·54 (sd 1·63) kg), respectively.

The tendency to under-report weight increased with BMI for both questionnaire and CATI, although the differences were greater for telephone reports. For example, obese women were almost twice as likely to under-report by telephone as by self-completed questionnaire (ratio of odds ratios = 1·86). Other differences were similarly magnified with telephone-reported data. Interestingly, while most trends suggested overweight women under-report their weight while underweight women over-report, women with large differences between heaviest and lowest weight also tended to over-report their weight when compared with examiner measurements, especially when reporting by telephone (see Appendix).

Reliability of self-reported weight and height

There were high correlations between the self-reported values for weight (r = 0·99) and height (r = 0·98). The average absolute difference between weight reported in the CATI and questionnaire was 2·0 (sd 3·3; range 0–55) lb (0·91 (sd 1·50; range 0–24·95) kg). The absolute difference in height was 0·2 (sd 0·5; range 0–5) inches (0·5 (sd 1·3; range 0–12·7) cm). The absolute difference in time between self-reports was 15 (sd 9) d. For weight, 80 % were within 3 lb (1·36 kg). For height, 99 % were within 1 inch (2·5 cm). The overall weighted κ was 0·95 for weight and 0·92 for height.

Factors associated with agreement in self-reported weight and height were largely similar to those for accuracy. Whereas height agreement decreased with age, weight agreement within 3 lb (1·36 kg) increased with age. Percentage agreement for weight and height increased with better perceived health status. Reporting agreement was inversely associated with BMI, weight cycling and lifetime weight difference. Findings were similar in analyses stratified by reporting order.

Discussion

Overall, women in the Sister Study reported weight and height accurately. Although participants were slightly leaner (on average 2 kg/m2 lower in BMI) than middle-aged non-Hispanic white women in a smaller, nationally representative sample from the National Health and Nutrition Examination Survey (NHANES) 2003–2006(Reference McDowell, Fryar and Ogden20), we confirmed previous findings that errors in reporting weight were associated with specific weight characteristics. Besides current weight status, we found that reporting accuracy was affected by excessive weight cycling (≥3 times) and extreme lifetime weight differences in adulthood (≥75 lb (≥34·02 kg)).

The present study is among the first to examine weight cycling and lifetime weight difference and reporting accuracy in a general population of women. Since weight cycling and lifetime weight difference both involve weight fluctuation, the extent to which the two variables were related was a concern. Weight cycling was associated with a lifetime weight difference of ≥30 lb (≥13·61 kg; χ², P < 0·001). However, 44 % of those who had a lifetime weight difference of ≥30 lb had never weight cycled, thus large changes in weight were not entirely explained by weight cycling.

Similar to previous studies, BMI values calculated from self-reported data were similar to those using measured data and there was high sensitivity for classifying a participant as overweight/obese or obese. Among adult women in the NHANES (1999–2004), there was substantial agreement between self-reported and measured BMI categories(Reference Craig and Adams7). In an overweight Dutch sample, self-reported BMI was found to be reasonably accurate for the assessment of overweight/obesity prevalence(Reference Dekkers, van Wier and Hendriksen21). Even with high correlation, there is still a potential for bias when examining associations between BMI based on self-reported measures and risk of disease and mortality(Reference Keith, Fontaine and Pajewski22). Similar to our results, self-reported BMI in NHANES 2001–2006 and the National Health Interview Survey overestimated measured BMI values at the low end of the BMI scale (<22 kg/m2) and underestimated values at the high end (>28 kg/m2), and respondent sociodemographic characteristics were associated with some misclassification of obese people as overweight(Reference Merrill and Richardson13, Reference Stommel and Schoenborn23). In our study, although BMI was underestimated by ≥5 % for over 10 % of overweight and obese women, only 3 % of obese women under-reported their weight by ≥10 % and fewer than 1 % of women in any BMI category under- or over-reported by ≥15 %. Furthermore, the average examiner-measured weight among obese women was 207 (sd 32) lb (93·90 (sd 14·52) kg) and the average amount under-reported by these women was only 3·3 (sd 6·8) lb (1·50 (sd 3·08) kg). Only 126 obese women under-reported by ≥20 lb (≥9·07 kg). For obese women, in particular, a 5 % difference in weight may have a negligible impact on associations with health outcomes.

Depression was of interest because it is associated with low self-esteem(Reference Orth, Robins and Meier24, Reference Orth, Robins and Trzesniewski25) and therefore could affect accuracy of weight reporting. However, diagnosis of depression or current use of antidepressant medication was not significantly associated with under- or over-reporting weight.

Several studies have suggested that respondents give more socially desirable answers in interviews than on self-administered questionnaires(Reference Okamoto, Ohsuka and Shiraishi26). Despite finding a high correlation between CATI and questionnaire responses and seeing similar trends in accuracy for CATI and questionnaire, overweight and obese women reported weight more accurately on the questionnaire. While this finding might suggest that the anonymity of the self-completed questionnaire promotes more honest reporting, it is also possible that women weighed themselves while completing the form at home. Access to a scale while completing the form may facilitate accurate reporting. Our participants may have been more motivated than others to do this because of the pending home visit during which they knew they would be weighed. Since women were asked to have their questionnaire ready for the examiner to collect, it is also possible that these forms were completed just before the home visit, increasing the likelihood of similar results. Thus our data may provide a ‘best case’ assessment of the validity of weight data reported on self-completed questionnaires.

Response rates and data quality can be higher in telephone interviews than mailed questionnaires(Reference Siemiatycki27, Reference Brogger, Bakke and Eide28). CATI item non-response may have been minimized because interviewers asked each question, although women could refuse to answer. Having examiners physically collect the self-administered questionnaires may have helped reduce overall non-response for that form.

The current analysis has some unique caveats. Participants were told they would be weighed and measured during a home visit and therefore may have reported more accurately than they would have otherwise. Some variation between self-reported and measured weight may have occurred because examiners weighed women with clothing whereas women may have reported their weight without clothes. There was the potential for a learning effect caused by the order of the home visit and CATI self-report. Women who had the home visit first may have remembered their measured weight and height and later reported the same values in the CATI (59 % had home visit first; 37 % completed CATI first; 4 % completed both on same day). However, when we stratified the analyses by which measure came first, we found no evidence that the order of reporting influenced accuracy. Similarly, timing of the CATI in relation to filling out the questionnaire had little impact on reliability. Data were collected by many different examiners using different scales. Although all examiners were trained, we could not verify that measurement protocols were consistently followed.

Conclusions

US women in the Sister Study were reasonably reliable and accurate in reporting weight and height. Women with normal-range BMI reported most accurately. Overweight and obese women and those with fluctuations in their weight were less accurate, but even among obese women, few women under-reported their weight by ≥10 %. Nevertheless, even though self-reported and measured weight and height are highly correlated, bias can still exist in studies relying on self-reported data due to the tendency of overweight women to under-report and underweight women to over-report their weight. The present study is among the first to show that repeated weight cycling and large weight changes in adulthood are also associated with less accurate weight reporting in a general population of women.

Acknowledgements

This research was supported by the Intramural Research Program of the National Institute of Environmental Health Sciences (Z01 ES044005), National Institutes of Health. The authors have no conflict of interest regarding this manuscript. C.J.L. carried out the data analysis and drafted the paper. L.A.D. supervised data collection, contributed to the development of the research topic and analysis strategy, supervised the analysis and edited the paper. D.P.S., Principal Investigator of the Sister Study, collected and provided the primary data, contributed to the development of the research topic and analysis strategy, supervised the analysis and edited the paper. S.R.J. participated in early evaluation of the data, literature review, data analysis and drafting of the paper.

Appendix

Comparing two types of self-reported weight (CATI and self-administered questionnaire) with examiner-measured weight among those who completed both self-reports within 30 d of the examiner measurement: US women aged 35–74 years (n 13 985), the Sister Study (2003–2009)

CATI, computer-assisted telephone interview; aOR, adjusted odds ratio (adjusted for age, race, education, perceived health status, marital status and BMI); Ref., referent category.

References

1. Ogden, CL, Carroll, MD, Curtin, LR et al. (2006) Prevalence of overweight and obesity in the United States, 1999–2004. JAMA 295, 1549–1555.
2. Engstrom, JL, Paterson, SA, Doherty, A et al. (2003) Accuracy of self-reported height and weight in women: an integrative review of the literature. J Midwifery Womens Health 48, 338–345.
3. Paradis, AM, Perusse, L, Godin, G et al. (2008) Validity of a self-reported measure of familial history of obesity. Nutr J 7, 27.
4. Gorber, SC, Tremblay, M, Moher, D et al. (2007) A comparison of direct vs. self-report measures for assessing height, weight and body mass index: a systematic review. Obes Rev 8, 307–326.
5. Roberts, RJ (1995) Can self-reported data accurately describe the prevalence of overweight? Public Health 109, 275–284.
6. Bostrom, G & Diderichsen, F (1997) Socioeconomic differentials in misclassification of height, weight and body mass index based on questionnaire data. Int J Epidemiol 26, 860–866.
7. Craig, BM & Adams, AK (2009) Accuracy of body mass index categories based on self-reported height and weight among women in the United States. Matern Child Health J 13, 489–496.
8. Rowland, ML (1990) Self-reported weight and height. Am J Clin Nutr 52, 1125–1133.
9. Lahti-Koski, M, Mannisto, S, Pietinen, P et al. (2005) Prevalence of weight cycling and its relation to health indicators in Finland. Obes Res 13, 333–341.
10. Strohacker, K & McFarlin, BK (2010) Influence of obesity, physical inactivity, and weight cycling on chronic inflammation. Front Biosci (Elite Ed) 2, 98–104.
11. Field, AE, Byers, T, Hunter, DJ et al. (1999) Weight cycling, weight gain, and risk of hypertension in women. Am J Epidemiol 150, 573–579.
12. White, MA, Masheb, RM, Burke-Martindale, C et al. (2007) Accuracy of self-reported weight among bariatric surgery candidates: the influence of race and weight cycling. Obesity (Silver Spring) 15, 2761–2768.
13. Merrill, RM & Richardson, JS (2009) Validity of self-reported height, weight, and body mass index: findings from the National Health and Nutrition Examination Survey, 2001–2006. Prev Chronic Dis 6, A121.
14. Perez-Cueto, FJ & Verbeke, W (2009) Reliability and validity of self-reported weight and height in Belgium. Nutr Hosp 24, 366–367.
15. Probst, YC, Faraji, S, Batterham, M et al. (2008) Computerized dietary assessments compare well with interviewer administered diet histories for patients with type 2 diabetes mellitus in the primary healthcare setting. Patient Educ Couns 72, 49–55.
16. McCabe, RE, McFarlane, T, Polivy, J et al. (2001) Eating disorders, dieting, and the accuracy of self-reported weight. Int J Eat Disord 29, 59–64.
17. Meyer, C, Arcelus, J & Wright, S (2009) Accuracy of self-reported weight and height among women with eating disorders: a replication and extension study. Eur Eat Disord Rev 17, 366–370.
18. Meyer, C, McPartlan, L, Sines, J et al. (2009) Accuracy of self-reported weight and height: relationship with eating psychopathology among young women. Int J Eat Disord 42, 379–381.
19. Centers for Disease Control and Prevention (2009) About BMI for Adults. http://www.cdc.gov/healthyweight/assessing/bmi/adult_BMI/index.html (accessed December 2009).
20. McDowell, MA, Fryar, CD, Ogden, CL et al. (2008) Anthropometric Reference Data for Children and Adults: United States, 2003–2006. National Health Statistics Reports no. 10. Hyattsville, MD: National Center for Health Statistics.
21. Dekkers, JC, van Wier, MF, Hendriksen, IJ et al. (2008) Accuracy of self-reported body weight, height and waist circumference in a Dutch overweight working population. BMC Med Res Methodol 8, 69.
22. Keith, SW, Fontaine, KR, Pajewski, NM et al. (2011) Use of self-reported height and weight biases the body mass index–mortality association. Int J Obes (Lond) 35, 401–408.
23. Stommel, M & Schoenborn, CA (2009) Accuracy and usefulness of BMI measures based on self-reported weight and height: findings from the NHANES & NHIS 2001–2006. BMC Public Health 9, 421.
24. Orth, U, Robins, RW & Meier, LL (2009) Disentangling the effects of low self-esteem and stressful events on depression: findings from three longitudinal studies. J Pers Soc Psychol 97, 307–321.
25. Orth, U, Robins, RW, Trzesniewski, KH et al. (2009) Low self-esteem is a risk factor for depressive symptoms from young adulthood to old age. J Abnorm Psychol 118, 472–478.
26. Okamoto, K, Ohsuka, K, Shiraishi, T et al. (2002) Comparability of epidemiological information between self- and interviewer-administered questionnaires. J Clin Epidemiol 55, 505–511.
27. Siemiatycki, J (1979) A comparison of mail, telephone, and home interview strategies for household health surveys. Am J Public Health 69, 238–245.
28. Brogger, J, Bakke, P, Eide, GE et al. (2002) Comparison of telephone and postal survey modes on respiratory symptoms and risk factors. Am J Epidemiol 155, 572–576.

Fig. 1 Example of frameshift errors on the self-administered diet questionnaire

Table 1 Characteristics of participants by weight reporting accuracy: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)

Table 2 Association of participant characteristics with weight reporting accuracy – CATI-reported v. examiner-measured weight: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)

Table 3 Association of weight cycling with weight reporting accuracy, stratified by BMI: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)

Fig. 2 The association between BMI and the accuracy of weight reported in a computer-assisted telephone interview: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009). Odds ratios were adjusted for age, race, education, perceived health status and marital status; 95 % confidence intervals are represented by error bars

Fig. 3 The association between weight cycling and lifetime weight difference and the accuracy of weight reported in a computer-assisted telephone interview: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009). Odds ratios were adjusted for age, race, education, perceived health status, marital status and BMI; 95 % confidence intervals are represented by error bars

Fig. 4 Comparison of BMI calculated from weight and height reported in a computer-assisted telephone interview (CATI) v. examiner measures: US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009); —— indicates fitted values

Table 4 Percentage discrepancy between BMI based on CATI-reported values and examiner-measured values, by BMI (based on examiner-measured weight and height): US women (n 18 639) aged 35–74 years, the Sister Study (2003–2009)