A multivariate model for predicting segmental body composition

Simiao Tian; Laurence Mioche; Jean-Baptiste Denis; Béatrice Morio

doi:10.1017/S0007114513001803

A multivariate model for predicting segmental body composition

Published online by Cambridge University Press: 11 July 2013

Simiao Tian ,

Laurence Mioche ,

Jean-Baptiste Denis and

Béatrice Morio

Show author details

Simiao Tian*: Affiliation:
INRA, Unité de Recherche MIA, F-78352Jouy-en-Josas, France Unité de Nutrition Humaine, Clermont Université, Université d'Auvergne, BP 10448, F-63000Clermont-Ferrand, France INRA, UMR 1019, UNH, F-63000Clermont-Ferrand, France
Laurence Mioche: Affiliation:
INRA, UMR 1019, UNH, F-63000Clermont-Ferrand, France
Jean-Baptiste Denis: Affiliation:
INRA, Unité de Recherche MIA, F-78352Jouy-en-Josas, France
Béatrice Morio: Affiliation:
Unité de Nutrition Humaine, Clermont Université, Université d'Auvergne, BP 10448, F-63000Clermont-Ferrand, France INRA, UMR 1019, UNH, F-63000Clermont-Ferrand, France
*: *Corresponding author: S. Tian, email simiao.tian@jouy.inra.fr

Article contents

Abstract
Subjects and methods
Results
Discussion
References

Rights & Permissions

Abstract

The aims of the present study were to propose a multivariate model for predicting simultaneously body, trunk and appendicular fat and lean masses from easily measured variables and to compare its predictive capacity with that of the available univariate models that predict body fat percentage (BF%). The dual-energy X-ray absorptiometry (DXA) dataset (52 % men and 48 % women) with White, Black and Hispanic ethnicities (1999–2004, National Health and Nutrition Examination Survey) was randomly divided into three sub-datasets: a training dataset (TRD), a test dataset (TED); a validation dataset (VAD), comprising 3835, 1917 and 1917 subjects. For each sex, several multivariate prediction models were fitted from the TRD using age, weight, height and possibly waist circumference. The most accurate model was selected from the TED and then applied to the VAD and a French DXA dataset (French DB) (526 men and 529 women) to assess the prediction accuracy in comparison with that of five published univariate models, for which adjusted formulas were re-estimated using the TRD. Waist circumference was found to improve the prediction accuracy, especially in men. For BF%, the standard error of prediction (SEP) values were 3·26 (3·75) % for men and 3·47 (3·95) % for women in the VAD (French DB), as good as those of the adjusted univariate models. Moreover, the SEP values for the prediction of body and appendicular lean masses ranged from 1·39 to 2·75 kg for both the sexes. The prediction accuracy was best for age < 65 years, BMI < 30 kg/m2 and the Hispanic ethnicity. The application of our multivariate model to large populations could be useful to address various public health issues.

Keywords

Multivariate models Body composition Dual-energy X-ray absorptiometry Predictions

Type: Full Papers
Information: British Journal of Nutrition , Volume 110 , Issue 12 , 28 December 2013 , pp. 2260 - 2270

DOI: https://doi.org/10.1017/S0007114513001803 [Opens in a new window]
Copyright: Copyright © The Authors 2013

The assessment of human body composition is important for evaluating health and nutritional status. Among health issues, overweight and obesity are worldwide problems. Increased fat mass, especially in the trunk location⁽Reference Carr, Utzschneider and Hull¹^–Reference Vandervoot and Symons⁴⁾, has been associated with an increased risk of metabolic diseases, such as type 2 diabetes and CVD. The amount of lean body mass, especially of appendicular muscle mass, is also directly correlated with health and particularly with the mortality rate⁽Reference Greenlund and Nair³^, Reference Vandervoot and Symons⁴⁾. Accurate measurements of body composition can be obtained from different methods, such as underwater weighing, dilution techniques and dual-energy X-ray absorptiometry (DXA). However, their applications are not always convenient for large populations, because they require fixed equipment and they are also time consuming and expensive.

The potential uses of statistical methods for body composition assessment have been highlighted⁽Reference Snijder, Van Dam and Visser⁵⁾, and several attempts to predict body composition, particularly body fat percentage (BF%), using linear models with simple predictor variables have been made. A summary of the body composition prediction models published between 1985 and 2003 has been given by Sun & Chumlea⁽Reference Sun, Chumlea, Heymsfield, Lohman and Wang⁶⁾. They pointed out that (1) a general model for two sexes, different ethnicities and wide age ranges may lose its accuracy due to increased heterogeneity; (2) cross-validation of prediction models was needed to assess their generalisability; (3) for validation studies, accuracy should be standardised for the mean of the predicted variable; (4) few prediction models were derived from datasets using DXA.

The advantages of using sex, age, ethnicity and easily accessible anthropometric measurements, such as body weight and height, are simplicity and cost efficiency. Their use would allow access to large datasets to describe body composition characteristics. Previous published linear models have made univariate predictions⁽Reference Gallagher, Heymsfield and Heo⁷^–Reference Gómez-Ambrosi, Silva and Catalán¹¹⁾. Alternatively, a non-parametric model based on Bayesian networks that uses the same predictor variables has been proposed⁽Reference Mioche, Bidot and Denis¹²^, Reference Mioche, Brigand and Bidot¹³⁾. This Bayesian networks approach consists in selecting a subset of individuals so that their predictor variable characteristics are similar to those of the individuals to be predicted. This model allows simultaneous prediction of segmental compartments, but requires the availability of a reference dataset. To our knowledge, until now, no multivariate linear prediction model has been proposed for body composition assessment. The aim of the present study was, therefore, to develop sex-specific multivariate models for estimating some segmental compartments of metabolic importance (i.e. lean body mass, appendicular muscle mass and trunk fat (TF)) from age and easily accessible anthropometric variables. The usefulness of waist circumference was also investigated and combined with age, height and weight as predictor variables. These multivariate models, based on the reference dataset National Health and Nutrition Examination Survey (NHANES), were validated with two different populations in agreement with the principles proposed by Sun & Chumlea⁽Reference Sun, Chumlea, Heymsfield, Lohman and Wang⁶⁾.

Subjects and methods

Databases

All body composition values related to predictions were extracted from the NHANES website (http://www.cdc.gov/nchs/about/major/nhanes/) from the 1999–2004 period. Subjects were characterised by predictor variables, such as sex, ethnicity, age, height, weight and waist circumference. For the present study, we selected subjects aged 20–85 years, with BMI values ranging from 18 to 40 kg/m² and who belonged to one of the three considered ethnicity categories: White, Black and Hispanic. This selection resulted in a sample size of 3977 men (1984 White, 720 Black and 1273 Hispanic) and 3692 women (1830 White, 697 Black and 1165 Hispanic).

The study was conducted separately on men and women; therefore, the complete NHANES dataset was split by sex. For each sex, we randomly split the corresponding NHANES dataset into three sub-datasets: a training dataset (TRD); a test dataset (TED); a validation dataset (VAD).

As the number of individuals was high, the splitting was done at random as suggested by Hastie et al. ⁽¹⁴⁾ and Nivre⁽¹⁵⁾ in data-rich situations. The TRD was used as a reference dataset to fit the parameters of a series of possible models. The test dataset was used to estimate the prediction error of each fitted model to make model selection, and the VAD was used to perform a one-round validation calculation and to assess the prediction accuracy of the final chosen models.

An independent external dataset (French DB, French DXA dataset) was used to assess the performance of the prediction models in a different population context. The French DB was obtained from a routine examination at the Radiology Department of the Clermont-Ferrand University Hospital Centre between 1998 and 2008. It contains data on 1095 French subjects, 526 men and 569 women, aged between 20 and 85 years and with BMI values ranging between 18 and 40 kg/m². However, ethnicity was not mentioned and waist circumference was not measured during the examination.

The study carried out using the NHANES dataset complies with the Declaration of Helsinki, the National Center for Health Statistics Ethics Review Board approved the protocols, and written informed consent was obtained from each participant. Moreover, the study using the French dataset was conducted according to the guidelines laid down in the Declaration of Helsinki, and all procedures involving human subjects were approved by the Clermont-Ferrand University Hospital Centre, France, and by the local ethics committee. Written informed consent was obtained from all subjects at recruitment after being informed of the nature, purpose and possible risks of the protocols.

Measurement of body composition

Whole-body and segmental body compositions were assessed using DXA (Hologic QDR 4500A fanbeam densitometer for the NHANES dataset and Hologic QDR-4500 densitometer for the French DB; http://www.gmecorp-usa.com/IM/XR/BD/HOLLOGIC/4500/SV/Qdr4500dos.pdf). For the NHANES dataset, detailed descriptions have been published earlier⁽¹⁶⁾. Briefly, whole-body DXA scans were taken at the NHANES mobile examination centre for eligible participants during the 6-year period from 1999 to 2004; the participants with certain physical conditions were excluded from the DXA examination⁽¹⁷⁾. The DXA scans allow the quantification of multiple whole-body and regional components, including bone mineral content, fat and lean soft tissue. Body fat (BF) and body lean (BL) masses and TF and trunk lean masses were thus determined⁽Reference Mazess, Barden and Bisek¹⁸⁾. Appendicular composition was the sum of arm and leg fat (APF, appendicular fat) and lean (APL, appendicular lean) masses⁽Reference Wang, Visser and Ma¹⁹⁾. Body fat-free mass (BFF) was calculated as the sum of the BL mass and bone mineral content.

Statistical methods

Non-parametric approaches

First, several non-parametric approaches were evaluated to make absolute body composition predictions. The term ‘non-parametric’ implies that the number and nature of the parameters are flexible and not fixed in advance⁽Reference Sprent and Smeeton²⁰⁾. These non-parametric approaches followed the statistical methodology described by Mioche et al.⁽Reference Mioche, Bidot and Denis¹²⁾. The local prediction models included weighted linear regression, support vector machine regression⁽Reference Vapnik²¹^, Reference Lin and Wang²²⁾ and Bayesian regression⁽Reference Gelman, Carlin and Stern²³⁾. For a given individual to be predicted, these methods follow three steps: (1) dissimilarities are calculated between the individual to be predicted and each individual of the TRD based on the values of the predictor variables; (2) the dissimilarities are transformed into weights to give more importance to similar individuals; (3) a prediction model is developed from this weighted dataset. When weights are constrained to be 0 or 1, the method corresponds to the selection of a sub-dataset as performed by Mioche et al. ⁽Reference Mioche, Bidot and Denis¹²⁾.

Multivariate linear regression

In the present study, a multivariate multiple linear regression, supposed to satisfy linear model assumptions, was also used as a possible alternative to these sophisticated prediction models. Multiple univariate linear regression is easily extended to deal with situations where the response consists of P>1 different variables; this is termed ‘multivariate linear regression’⁽Reference Anderson²⁴⁾. Estimates of the regression parameters are determined by the least squares method. The fitting model in a multivariate model for each variable will be the same as that which would result from a univariate model. However, the constraint in the multivariate model consists in using identical predictor variables for all the predicted variables. The advantage of using the multivariate approach is that it takes the correlation structure between the responses into account, which is useful for a number of inference tasks, e.g. to give simultaneous confidence regions for all the responses together.

Validation analysis

The selection of models from previously described multivariate approaches was based on the prediction accuracy and complexity of the models. The accuracy was measured by the standard error of prediction (SEP) and the relative standard deviation (RSD, two criteria defined below), and the complexity of the models was assessed by the number of parameters and computing time.

Waist circumference usefulness analysis

The usefulness of waist circumference for prediction was investigated. To do so, prediction accuracy was checked for some categories of BMI (18–25, 25–30 and 30–40 kg/m²), age (20–35, 35–50, 50–65 and 65–80 years) and ethnicity (White, Black and Hispanic). This categorical analysis was performed only on the VAD. The prediction accuracy in this categorical study was expressed by a 100-scale score. A score of 100 denotes a baseline, i.e. the average level of prediction quality for all the categories; a score less than 100 denotes a better quality than the average level; and in contrast a score greater than 100 indicates a worse quality.

Comparison with published univariate models

In the literature, univariate linear regressions have been developed to primarily predict BF% from BMI, age and, occasionally, waist circumference or ethnicity as predictor variables⁽Reference Gallagher, Heymsfield and Heo⁷^–Reference Gómez-Ambrosi, Silva and Catalán¹¹⁾. Of the univariate models published between 2000 and 2012, five were retained with different combinations of predictor variables (Table 1). Gallagher's⁽Reference Gallagher, Heymsfield and Heo⁷⁾ and Larsson's⁽Reference Larsson, Henning and Lindroos⁹⁾ models were derived from a DXA dataset, Jackson's⁽Reference Jackson, Stanforth and Gagnon⁸⁾ model from a densitometry dataset from four clinical centres, Levitt's⁽Reference Levitt, Heymsfield and Pierson¹⁰⁾ model from a densitometry and water dilution dataset, and Gómez-Ambrosi's⁽Reference Gómez-Ambrosi, Silva and Catalán¹¹⁾ model from an air-displacement plethysmography dataset. Original and adjusted formulas were applied to the VAD and French DB. The adjusted formulas were derived by re-estimating the parameters of the published models using the TRD. Their prediction accuracies were considered as baseline values to evaluate those of our proposed combination of predictor variables in the multivariate models. The prediction of BF% from our multivariate model was calculated by dividing the predicted value of BF by body weight, multiplied by 100.

Table 1 Formulas of the five published prediction models for body fat percentage (BF%) for men and women*

* Adjusted formulas were estimated from the training dataset.

† For Gallagher's model, only the non-Asian model is reported.

‡ For Larsson's model, the parameter values are not provided; only the statistical formula is provided, which is as follows: $$y = a \times (1 - e ^{ - b (BMI - BMI_{o})}). $$

Assessment of the prediction accuracy

The accuracy of a prediction for a given variable was globally assessed using the SEP:

$$\begin{eqnarray} SEP = \sqrt {\frac { \sum _{ i = 1}^{ n }(measured_{ i } - predicted_{ i })^{2}}{ n },} \end{eqnarray}$$

where n is the number of subjects in the VAD or French DB. The unit of SEP is the same as the unit of the predicted variable (kg or %). The SEP is then detailed into bias and standard deviation: SEP²= bias²+sd² to investigate the trade-off between model bias and variance in prediction. The RSD provides another assessment of the prediction accuracy. It was calculated by dividing 100 × SEP by the mean of the predicted variable, and it is expressed in percentage of the global mean. Finally, the coefficient of determination R ² was used to assess the goodness of fit in the validation procedure.

Statistical test analyses

Population characteristics are expressed as means and standard deviations. Differences between each of the three subsets of the NHANES dataset and French DB were analysed using Student's t tests. These t tests aimed to assess the differences between the American and French samples. Only the SEP difference was analysed by a permutation test⁽Reference Röhmel²⁵⁾. Furthermore, paired t tests and Bland–Altman plots⁽Reference Bland and Altman²⁶⁾ were used to determine the difference and the limits of agreement between the published univariate models and the multivariate model. A CI for the mean of the difference was also calculated under a normality assumption. Statistical calculations and analyses were performed using version 2.12.2 of the R software (http://cran.r-project.org/doc/contrib/Lam-IntroductionToR_LHL.pdf)⁽²⁷⁾, a language and an environment for statistical computing.

Results

Sample characteristics

The means and standard deviations of age, anthropometric variables and DXA body composition for the different datasets are presented in Table 2 for men and women. Within the three subsets of the NHANES dataset, the men and women were of the same age, but some difference was observed for the French subjects. For men, except for height, all the variables were significantly different between the French DB and the three NHANES dataset subsets. For women, most of the variables were significantly different between the French DB and the three NHANES dataset subsets, except for trunk lean, BL and BFF.

Table 2 Age, anthropometric variables and dual-energy X-ray absorptiometry body composition characteristics for men and women in the National Health and Nutrition Examination Survey (NHANES) training dataset (TRD), test dataset (TED) and validation dataset (VAD) and in the French dataset (French DB) (Mean values and standard deviations)

* Mean values were significantly different from those of the TRD (P< 0·05; t test).

† Mean values were significantly different from those of the TED (P< 0·05; t test).

‡ Mean values were significantly different from those of the VAD (P< 0·05; t test).

§ Ethnicity was not mentioned in the French DB.

Prediction models

The study of the selection of models from the test dataset showed that the non-parametric approaches did not provide a significantly better SEP than the multivariate linear regression. Moreover, the non-parametric approaches need more parameters and computing times. Therefore, multivariate linear regression is the only model mentioned in the paper to predict segmental compartments. The parameters of this multivariate model are given in Table 3 for models with and without waist circumference (MWC and MWoC, respectively) as a predictor variable.

Table 3 Multivariate prediction model estimates of parameters for the seven segmental compartments (kg) including or not including waist circumference as a predictor variable*

TF, trunk fat; APF, appendicular fat; BF, body fat; TL, trunk lean; APL, appendicular lean; BL, body lean; BFF, body fat-free mass, calculated as the sum of the BL mass and bone mineral content.

* The parameters are, respectively, associated with the intercept, age (β_A), height (β_H), weight (β_W) and waist circumference (β_C). For the sake of presentation, all values have been multiplied by 100.

Inclusion of waist circumference

Tables 4 and 5 summarise the prediction accuracy for three categories of BMI and ethnicity and four age ranges when MWC and MWoC are applied to the VAD. Predictions were more accurate when waist circumference was included, especially for men with a BMI value that ranged from 18 to 30 kg/m² and whose age ranged from 25 to 65 years. Regarding ethnicity categories, a remarkable improvement in accuracy was found for Black men when waist circumference was used as a predictor variable. Compared with that of MWoC, the prediction accuracy of MWC was improved by a 45 % unit (in a 100-scale score) for TF and APL masses and by a 30 % unit (in a 100-scale score) for BF, BL and BFF masses. By contrast, for women in all the BMI, age and ethnicity categories, the quality of the predictions was similar between MWC and MWoC.

Table 4 Accuracy of the proposed prediction models with waist circumference (MWC) and without waist circumference (MWoC) as a predictor variable for the seven segmental compartments in different BMI, age and ethnicity categories for men in the National Health and Nutrition Examination Survey validation dataset*

TF, trunk fat; APF, appendicular fat; BF, body fat; TL, trunk lean; APL, appendicular lean; BL, body lean; BFF, body fat-free mass, calculated as the sum of the BL mass and bone mineral content.

* The accuracy is assessed by a 100-scale score: the smaller the score, the better the prediction. A value of 100 corresponds to the global standard error of prediction for all the categories with waist circumference as a predictor variable.

Table 5 Accuracy of the proposed prediction models with waist circumference (MWC) and without waist circumference (MWoC) as a predictor variable for the seven segmental compartments in different BMI, age and ethnicity categories for women in the National Health and Nutrition Examination Survey validation dataset*

TF, trunk fat; APF, appendicular fat; BF, body fat; TL, trunk lean; APL, appendicular lean; BL, body lean; BFF, body fat-free mass, calculated as the sum of the BL mass and bone mineral content.

For subjects of both sexes, the prediction by MWC was less reliable with BMI values in the range 30–40 kg/m² than for the other BMI categories. Indeed, the prediction accuracy of MWC was reduced by 35 and 25 % units for a BMI >30 kg/m² than for the BMI categories of 18–25 and 25–30 kg/m².

Regarding the three ethnicity categories, MWC provided the best quality of fit for Hispanic individuals, followed by White and Black individuals. More precisely, for the BF, BL and appendicular compartments in Hispanic individuals, the prediction accuracy of MWC was improved by 20 % unit in Hispanic men than in White and Black men. Similarly, it was improved by 10 and 30 % units in Hispanic women than in White and Black women, respectively.

Multivariate prediction models

The validation scores for the multivariate model were calculated using the VAD, and they are given in Table 6. For the prediction of BF and BL masses, a SEP value less than 2·8 kg was found for both men and women (men: 2·75 and 2·66 kg; women: 2·52 and 2·47 kg, respectively). By contrast, because of the differences in the compartment masses, the RSD values were much lower for the BL prediction than for the BF prediction (men: 4·41 and 13·08 %; women: 5·76 and 9·01 %, respectively). The corresponding R ² values averaged 0·9 for both the sexes (men: 0·88 and 0·92; women: 0·92 and 0·86, respectively).

Table 6 Accuracy of the multivariate prediction model calculated using the National Health and Nutrition Examination Survey validation dataset using waist circumference for the seven segmental compartments*

SEP, standard error of prediction; RSD, relative standard deviation; TF, trunk fat; APF, appendicular fat; BF, body fat; TL, trunk lean; APL, appendicular lean; BL, body lean; BFF, body fat-free mass, calculated as the sum of the BL mass and bone mineral content.

* The absolute value of the total weight of the segmental compartments is predicted. The accuracy is assessed by the SEP in kg and the RSD in %. RSD = 100 × (SEP/$$\overline{ y } $$) for a predicted variable Y and its mean $$\overline{ y } $$, and it is expressed as a percentage. For example, for BF of men, RSD = 100 × 2·75/21·02 = 13·08 %.

Regarding other segmental compartments such as trunk and APF and APL masses and BFF, the SEP values ranged from 1·65 to 2·75 kg for men and from 1·39 to 2·52 kg for women. Similarly, in both the sexes, because of the differences in the compartment sizes, the RSD values were lower for trunk and APL masses than for the corresponding fat masses. They varied from 5·54 to 8·76 % for trunk and APL masses and from 12·54 to 19·18 % for trunk and APF masses. The corresponding R ² values ranged from 0·8 to 0·9 for both the sexes.

The bias ranged approximately from 0·50 to 0·90 kg for both men and women, which were low in comparison with the model variance (Table 6). Comparisons of the predictions by models and the observations are shown in Fig. 1 for men and women. For men, segmental body compositions were globally well predicted, even if for extreme parts, some bias appeared: an underestimation for a high fat mass and an overestimation for low lean mass. For women, an underestimation for high APF and APL masses was observed.

Fig. 1 Scatter plot of the multivariate model for the prediction of different segmental body compositions against their observations in the validation dataset. Men are represented by × and women by ○. The first bisectors are drawn (). Men: (a) trunk fat (TF); (b) appendicular fat (APF); (c) body fat (BF); (d) trunk lean (TL); (e) appendicular lean (APL); (f) body lean (BL). Women: (g) TF; (h) APF; (i) BF; (j) TL; (k) APL; (l) BL.

When the multivariate prediction model without waist circumference was applied to the French DB, the predictions were still good (table not shown). For men, the SEP values were 2·95 kg (R ² 0·84) for BF mass and 2·84 kg (R ² 0·87) for BL mass, with the RSD values being equal to 17·01 and 4·83 %, respectively. For women, the corresponding SEP values were 2·86 kg (R ² 0·89) and 2·80 kg (R ² 0·84) with the respective RSD values of 12·27 and 6·62 %.

Comparison with published prediction models

When the published formulas with their predictor variables were re-adjusted in the TRD and then applied to the VAD and French DB, the quality of fit was improved in comparison with that of their original formulas. For the BF% of men, the prediction accuracy of the adjusted formula was increased in the VAD by 0·5 % unit for Gallagher's and Jackson's prediction models and by 1 % unit for Levitt's and Gómez-Ambrosi's prediction models. For the BF% of women, the prediction accuracy was improved, on average, by 1 % unit for all the models (Table 7). For the same compartment in French men and women, only a slight improvement in accuracy was found for the univariate models, except for Gómez-Ambrosi's prediction model, for which the prediction accuracy was improved by 1·5 % unit.

Table 7 Accuracy of the five published models, original and adjusted, and our proposed model for body fat percentage prediction calculated using the National Health and Nutrition Examination Survey (NHANES) validation dataset (VAD) and the French dataset (French DB)*

SEP, standard error of prediction; RSD, relative standard deviation.

* The accuracy is assessed by the SEP and RSD, and both are expressed in percentage. The R ² is also calculated.

† There is a significant difference in SEP values between the adjusted univariate prediction models and the multivariate prediction model with the permutation test (P< 0·05).

‡ The original parameter coefficients are not available.

The accuracy of our multivariate prediction model, based on age, height, weight and waist circumference, was compared with that of the five adjusted published prediction models. In the VAD, the multivariate prediction of BF% yielded one of the best accuracies, with SEP values of 3·26 and 3·74 %, respectively, for men and women. For men, our SEP values were 0·5 % unit better than those of Gallagher's, Levitt's and Gómez-Ambrosi's prediction models and 1 % unit better than those of Jackson's and Larsson's prediction models. By contrast, for women, the differences between SEP values of the various models were small (Table 7). The Bland–Altman plots are shown in Fig. 2 for men and women in the VAD. It appeared that the agreement between our model and the adjusted published models was better for women than for men. In addition, for all the paired t tests, P values ranged from 0·51 to 0·95 and 0·18 to 0·89 for men and women, respectively. Therefore, the difference in predictions was not statistically significant. With respect to the CI of the mean of the difference, it ranged from − 0·17 to 0·19 for men and − 0·03 to 0·13 for women. These results show that there is no systematic difference between our multivariate prediction model and each of the adjusted published prediction models.

Fig. 2 Bland–Altman plots for the difference between body fat percentage (BF%) prediction by the multivariate model and that by the five adjusted published models v. average BF% prediction by the two models. The three dashed lines represent the mean difference and the mean and 1·96 sd. (a–e) Men and (f–j) women.

In the French DB, the prediction of BF% was based on age, height and weight. The SEP values of our multivariate prediction model were 3·74 and 3·95 %. They were slightly higher than those of Gómez-Ambrosi's prediction model (3·63 %) in men and than those of Gallagher's and Levitt's prediction models (3·93 %) in women.

Discussion

BF, TF and other segmental compartments, such as appendicular muscle mass, are useful factors for assessing predisposition to metabolic risks; therefore, examinations of these segmental compartments provide interesting information. The proposed multivariate model aimed at simultaneously predicting them from age and easily measured anthropometric predictor variables, with a particular focus on the importance of waist circumference. It was built using a US dataset and validated independently using two different datasets. The present results showed that, with the proposed combination of four predictor variables, including waist circumference, the multivariate model enabled accurate predictions for segmental body compositions.

Waist circumference is a well-known predictor of abdominal accumulation of subcutaneous and visceral adipose tissues. In 2001, the National Cholesterol Education Program – Adult Treatment Panel III included waist circumference as a risk factor for the metabolic syndrome⁽Reference Carr, Utzschneider and Hull¹⁾. Waist circumference was then widely used to improve the prediction of BF% in combination with a weight-for-height index, such as BMI⁽Reference Jassen, Heymsfield and Allison²⁸^, Reference Aeberli, Gut-Knabenhans and Kusche-Ammann²⁹⁾. In the study by Lean et al. ⁽Reference Lean, Han and Deurenberg³⁰⁾, BF%, which was assessed by densitometry, was more closely related to waist circumference than to BMI, particularly for men. In another study related to BFF, Bosty-Westphal et al. ⁽Reference Bosty-Westphal, Danielzik and Geisler³¹⁾ found that waist circumference was a risk factor for decreased BFF and that it was a good anthropometric index for health risk assessment. Similarly in the present study, the accuracy of our multivariate model was improved when waist circumference was entered as a predictor variable. This was particularly meaningful for men for the segmental compartments, such as TF, APL, total BF and total BL masses. For men, a significant improvement in accuracy was observed in all the BMI categories and in the age categories of 20–35, 35–50 and 50–65 years. In addition, waist circumference was especially required to improve the prediction accuracy for Black men in comparison with the other two ethnicity categories. We thus concluded that waist circumference should be included in the multivariate prediction model for normal, overweight and obese subjects, although it is known in clinical practice that there is a physical difficulty in measuring waist circumference of the latter subjects.

One important aspect of our proposed model is that it is capable of predicting simultaneously several segmental compartments; to our knowledge, this is the first proposal made for a multivariate model. The joint use of several segmental body compositions has been justified in some metabolic disease risk studies. Indeed, an excess amount of TF is associated with a higher cardiometabolic risk, but in addition, after TF mass is controlled, a higher APF mass can be shown to be associated with a more favourable metabolic profile, particularly in women⁽Reference Snijder, Dekker and Visser³²^, Reference Van Pelt, Evans and Schechtman³³⁾. In a study on subjects aged 60–80 years, Saunders et al. ⁽Reference Saunders, Davidson and Janiszewski³⁴⁾ found that the absolute amount of TF and APF masses influenced the metabolic risk in elder men and women. Moreover, based on a study using DXA, BF was shown to be a complementary significant contributor to BMR in addition to BFF⁽Reference Johnstone, Murison and Duncan³⁵⁾. Some longitudinal studies in cohorts of older subjects⁽Reference Szulc, Munoz and Marchand³⁶^, Reference Lee, Boyko and Nielson³⁷⁾ have highlighted that the loss of APL mass, measured using DXA, was associated with a greater risk of all-cause mortality compared with individuals with stable APL mass. Furthermore, Kilgour et al. ⁽Reference Kilgour, Vigano and Trutschnigg³⁸⁾ found that in advanced cancer patients, an APL mass-for-height index, measured by DXA, had a significant impact on cancer-related fatigue in men. Therefore, in order to better assess the health status or the metabolic risks of individuals, it is beneficial to predict simultaneously several segmental compartments from the statistical models. In the present study, the results for different populations underline that our proposed model enables the accurate assessment of several segmental compartments for the three ethnicities studied. The reliable prediction for body, trunk and appendicular components may be used for further studies related to pathophysiological and metabolic issues.

Of the already published models, five were retained for evaluating the usefulness of the proposed combination of four predictor variables in the multivariate model. These published models mainly integrated BMI and age as predictor variables; some were derived from either densitometry-based or air-displacement plethysmography-based datasets. Original and adjusted formulas, derived from the TRD, were applied to the VAD and their prediction accuracies were used as baseline values for comparison. The results show that the prediction of BF% of predictor variables used in our multivariate model yields a competing accuracy in comparison with the five adjusted published models. This finding justifies the relevance of using age, height, weight and waist circumference for predicting body composition.

Measurements of body composition can be obtained using a variety of methods, each of which provides a different amount of information about body compartments. Each method has specific limitations and measurement errors⁽Reference Ellis³⁹^, Reference Lee and Gallagher⁴⁰⁾. DXA and the four-compartment models are usually designated as reference methods for assessing body composition⁽Reference Wellens, Chumlea and Guo⁴¹^–Reference Lohman, Chen, Heymsfield, Lohman and Wang⁴³⁾. For BF%, the precision is approximately 3 % for DXA and even lower than 3 % for the four-compartment models⁽Reference Plank⁴²⁾. If we take into account these measurement errors combined with the prediction accuracy of our model, we can calculate the model precision using the following formula:

$$\begin{eqnarray} \sqrt {DXA\ precision\ (\%)^{2} + model\ prediction\ accuracy\ (\%)^{2}.} \end{eqnarray}$$

In our model, the SEP values for BF% were 3·2 % for men in the VAD and less than 4 % for women in the VAD and French DB. Our model thus yields an interesting precision of 4·4 and 4·8 % for men and women in the VAD and 5·0 % in the French DB. Interestingly, Lohman⁽Reference Lohman⁴⁴⁾ developed standards for evaluating prediction errors (SEP) for BF%. He proposed that an ideal prediction would be denoted by a SEP value less than 2 %, a good prediction by a SEP value ranging from 3·5 to 4 % and a poor prediction by a SEP value greater than 5 %. According to these standards, our multivariate model with the four predictor variables yielded a good prediction error. Indeed, the SEP values for BF% were equal to 3·26 and 3·74 % in men and women, respectively. Even if our prediction model was shown to be good, it cannot replace a direct measurement such as DXA. Nevertheless, due to its easy application and cost efficiency, it appears to be a convenient tool to evaluate the need of DXA prescription. Besides this, the multivariate model enables to suggest a pathophysiological situation or detect a dangerous evolution in case of follow-up. Moreover, such applications could be of interest to educate patients with chronic metabolic diseases. Finally, from a research perspective, such a model could be highly relevant in predicting specific risks in large populations.

The present study was limited in some aspects. First, while working with the NHANES dataset, ethnic groups were limited to White, Black and Hispanic subjects for whom accurate predictions were provided. Furthermore, only subjects aged from 20 to 85 years with BMI values ranging from 18 to 40 kg/m² were examined. Subjects with a BMI >40 kg/m² were excluded because they are morbidly obese. Already for a BMI >30 kg/m², the accuracy of our model was lower than that for the other two BMI categories. Moreover, waist circumference has little incremental predictive power of disease risk for subjects with a BMI >35 kg/m²⁽⁴⁵⁾. Thus, a particular study should be conducted to predict body composition of morbidly obese individuals. Finally, since data on waist circumference were not available in the French DB, the prediction of body composition for this database only used the three other predictor variables, with the result being a lower accuracy compared with that of the VAD. This result strengthens the conclusions regarding the importance of including waist circumference as a predictor variable.

In summary, waist circumference is an important predictor variable for the prediction of segmental body composition, especially in men. When using age, height, weight and waist circumference, our multivariate model yields a competing accuracy compared with other published univariate models for the prediction of BF%. Compared with these published formulas, the originality and advantage of the proposed model consist in predicting simultaneously several segmental compartments (such as TF mass or APL mass) with a good accuracy; the multivariate outcomes might then be used in studies necessitating the assessment of metabolic risk factors in large populations.

Acknowledgements

We thank the Human Nutrition Department and Applied Mathematics and Informatics unit of the French National Institute for Agricultural Research for a fellowship that permitted us to conduct the study. The authors are grateful to Dr Ristori from the Radiology Department of the Clermont-Ferrand University Hospital for providing DXA data from the Clermont-Ferrand University Hospital dataset. The authors' responsibilities were as follows: S. T. was responsible for model computations, statistical analysis and the first draft of the manuscript; L. M. was responsible for data acquisition, design of the study and physiological interpretation; J.-B. D. was responsible for the design of the study, model computations and statistical analysis; B. M. was responsible for the design of the study and physiological interpretation. All authors read and agreed with the contents of the manuscript. None of the authors has any conflict of interest concerning the manuscript.

References

1Carr, DB, Utzschneider, KM, Hull, RL, et al. (2004) Intra-abdominal fat is a major determinant of the National Cholesterol Education Program Adult Treatment Panel III criteria for the metabolic syndrome. Diabetes 53, 2087–2094.Google Scholar

2Vega, GL, Adams-Huet, B, Peshock, R, et al. (2006) Influence of body fat content and distribution on variation in metabolic risk. J Clin Endocrinol Metab 91, 4459–4466.Google Scholar

3Greenlund, LJ & Nair, KS (2003) Sarcopenia – consequences, mechanisms, and potential therapies. Mech Ageing Dev 124, 287–299.Google Scholar

4Vandervoot, AA & Symons, TB (2001) Functional and metabolic consequences of sarcopenia. Can J Appl Physiol 26, 90–101.Google Scholar

5Snijder, MM, Van Dam, RM, Visser, M, et al. (2006) What aspects of body fat are particularly hazardous and how do we measure them? Int J Epidemiol 35, 83–92.CrossRef Google Scholar

6Sun, SS & Chumlea, WC (2005) Statistical methods. In Human Body Composition, 2nd ed., pp. 151–160 [Heymsfield, SB, Lohman, TG and Wang, Z, et al., editors]. Champaign, IL: Human Kinetics.Google Scholar

7Gallagher, D, Heymsfield, SB, Heo, M, et al. (2000) Healthy percentage body fat ranges: an approach for developing guidelines based on body mass index. Am J Clin Nutr 72, 694–701.Google Scholar

8Jackson, AS, Stanforth, PR, Gagnon, J, et al. (2002) The effect of sex, age and race on estimating percentage body fat from body mass index: The Heritage Family Study. Int J Obes 26, 789–796.Google Scholar

9Larsson, I, Henning, B, Lindroos, AK, et al. (2006) Optimized predictions of absolute and relative amounts of body fat from weight, height, other anthropometric predictors, and age. Am J Clin Nutr 83, 252–259.CrossRef Google Scholar PubMed

10Levitt, DG, Heymsfield, SB, Pierson, RN Jr, et al. (2007) Physiological models of body composition and human obesity. Nutr Metab (Lond) 4, 19–32.Google Scholar

11Gómez-Ambrosi, J, Silva, C, Catalán, V, et al. (2012) Clinical usefulness of a new equation for estimating body fat. Diabetes Care 35, 383–388.Google Scholar

12Mioche, L, Bidot, C & Denis, JB (2011) Body composition predicted with a Bayesian network from simple variables. Br J Nutr 105, 1265–1271.Google Scholar

13Mioche, L, Brigand, A, Bidot, C, et al. (2011) Fat-free mass predictions through a Bayesian network enable body composition comparisons in various populations. J Nutr 1411, 573–580.Google Scholar

14Hastie T, Tibshirani R and Friedman J (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed. New York: Springer.Google Scholar

15Nivre J (2006) Inductive Dependency Parsing. Dordrecht: Springer.Google Scholar

16Centers for Disease Control and Prevention (2000) National Health and Nutrition Examination Survey: body composition procedures manual. http://www.cdc.gov/nchs/data/nhanes/BC.pdf (accessed 27 September 2008).Google Scholar

17Centers for Disease Control and Prevention (2008) The 1999–2004 dual energy X-ray absorptiometry (DXA) multiple imputation data files and technical documentation. http://www.cdc.gov/nchs/about/major/nhanes/dxx/dxa.html (accessed January 2008).Google Scholar

18Mazess, RB, Barden, HS, Bisek, JP, et al. (1990) Dual-energy X-ray absorptiometry for total body and regional bone mineral and soft tissue composition. Am J Clin Nutr 51, 1106–1112.Google Scholar

19Wang, ZM, Visser, M, Ma, R, et al. (1996) Skeletal muscle mass: evaluation of neutron activation and dual-energy X-ray absorptiometry methods. J Appl Physiol 80, 824–831.Google Scholar

20Sprent, P & Smeeton, NC (2001) Applied Nonparametric Statistical Methods, 3rd ed.Boca Raton, FL: Chapman, Hall/CRC.Google Scholar

21Vapnik, VN (1998) Statistical Learning Theory. New York, NY: Wiley.Google Scholar

22Lin, CF & Wang, SD (2002) Fuzzy support vector machines. IEEE Trans Neural Net 13, 464–471.Google Scholar

23Gelman, A, Carlin, JB, Stern, HS, et al. (2003) Bayesian Data Analysis, 2nd ed.Boca Raton, FL: Chaplan & Hall/CRC.CrossRef Google Scholar

24Anderson, TW (1951) Estimating linear restrictions on regression coefficients for multivariate normal distributions. Ann Math Statist 22, 327–351.Google Scholar

25Röhmel, J (1996) Precision intervals for estimates of the difference in success rates for binary random variables based on the permutation principle. Biometrical J 38, 977–993.Google Scholar

26Bland, JM & Altman, DG (1996) Statistical method for assessing agreement between two methods of clinical measurement. Lancet i, 307–310.Google Scholar

27R Development Core Team (2006) R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing. http://www.R-project.org (accessed accessed January 2011).Google Scholar

28Jassen, I, Heymsfield, SB, Allison, DB, et al. (2002) Body mass index and waist circumference independently contribute to prediction of nonabdominal, abdominal subcutaneous, and visceral fat. Am J Clin Nutr 75, 683–688.CrossRef Google Scholar

29Aeberli, I, Gut-Knabenhans, M, Kusche-Ammann, RS, et al. (2012) A composite score combining waist circumference and body mass index more accurately predicts body fat percentage in 6- to 13-year-old children. Eur J Nutr 52, 247–253.CrossRef Google Scholar PubMed

30Lean, ME, Han, TS & Deurenberg, P (1996) Predicting body composition by densitometry from simple anthropometric measurements. Am J Clin Nutr 63, 4–14.Google Scholar

31Bosty-Westphal, A, Danielzik, S, Geisler, C, et al. (2006) Use of height³:waist circumference³ as an index for metabolic risk assessment. Br J Nutr 95, 1212–1220.CrossRef Google Scholar

32Snijder, MB, Dekker, JM, Visser, M, et al. (2004) Trunk fat and leg fat have independent and opposite association with fasting and postload glucose levels: the Hoorn study. Diabetes Care 27, 372–377.Google Scholar

33Van Pelt, RE, Evans, EM, Schechtman, KB, et al. (2002) Contribution of total and regional fat mass to risk for cardiovascular disease in older women. J Physiol Endocrinol Metab 282, 1023–1028.Google Scholar

34Saunders, TJ, Davidson, LE, Janiszewski, PM, et al. (2009) Association of the limb fat to trunk fat ratio with makers of cardiometabolic risk in elderly men and women. J Gerontol A Biol Sci Med Sci 64, 1066–1070.Google Scholar

35Johnstone, AM, Murison, SD, Duncan, JS, et al. (2005) Factors influencing variation in basal metabolic rate include fat-free mass, fat mass, age, and circulating thyroxine but not sex, circulating leptin, or triiodothyronine. Am J Clin Nutr 82, 941–948.Google Scholar

36Szulc, P, Munoz, F, Marchand, F, et al. (2010) Rapid loss of appendicular skeletal muscle mass is associated with higher all-cause mortality in older men: the prospective MINOS study. Am J Clin Nutr 91, 1227–1236.Google Scholar

37Lee, CG, Boyko, EJ, Nielson, CM, et al. (2011) Mortality risk in older men associated with changes in weight, lean mass, and fat mass. J Am Geriatr Soc 2, 233–240.Google Scholar

38Kilgour, RD, Vigano, A, Trutschnigg, B, et al. (2010) Cancer-related fatigue: the impact of skeletal muscle mass and strength in patients with advanced cancer. J Cachexia Sarcopenia Muscle 1, 177–185.Google Scholar

39Ellis, KJ (2000) Human body composition: in vivo methods. Physiol Rev 80, 649–680.CrossRef Google Scholar PubMed

40Lee, SY & Gallagher, D (2008) Assessment methods in human body composition. Curr Opin Clin Nutr Metab Care 11, 566–572.Google Scholar

41Wellens, R, Chumlea, WC, Guo, S, et al. (1994) Body composition in white adults by dual-energy X-ray absorptiometry, densitometry, and total body water. Am J Clin Nutr 59, 547–555.Google Scholar

42Plank, LD (2005) Dual-energy X-ray absorptiometry and body composition. Curr Opin Clin Nutr Metab Care 8, 305–309.CrossRef Google Scholar PubMed

43Lohman, TG & Chen, Z (2005) Dual-energy X-ray absorptiometry. In Human Body Composition, 2nd ed., pp. 63–78 [Heymsfield, SB, Lohman, TG and Wang, Z, et al., editors]. Champaign, IL: Human Kinetics.Google Scholar

44Lohman, TG (1992) Advances in Body Composition Assessment. Current Issues in Exercise Science Series (Monograph 3). Champaign, IL: Human Kinetics.Google Scholar

45National Institutes of Health (2000) The Practical Guide: Identification, Evaluation, and Treatment of Overweight and Obesity in Adults. Bethesda, MD: National Institutes of Health.Google Scholar

Table 1 Formulas of the five published prediction models for body fat percentage (BF%) for men and women*

Table 3 Multivariate prediction model estimates of parameters for the seven segmental compartments (kg) including or not including waist circumference as a predictor variable*

Article contents

A multivariate model for predicting segmental body composition

Abstract

Keywords

Subjects and methods

Databases

Measurement of body composition

Statistical methods

Non-parametric approaches

Multivariate linear regression

Validation analysis

Waist circumference usefulness analysis

Comparison with published univariate models

Assessment of the prediction accuracy

Statistical test analyses

Results

Sample characteristics

Prediction models

Inclusion of waist circumference

Multivariate prediction models

Comparison with published prediction models

Discussion

Acknowledgements

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests