The Sum Score Model: Specifying and Testing Equally Weighted Composites Using Structural Equation Modeling

Florian Schuberth; Tamara Schamberger; Ildikó Kemény; Jörg Henseler

doi:10.1017/psy.2024.5

Published online by Cambridge University Press: 03 January 2025

Florian Schuberth*: Department of Design, Production & Management, University of Twente, Enschede, The Netherlands
Tamara Schamberger: Department of Design, Production & Management, University of Twente, Enschede, The Netherlands; Faculty of Business Administration and Economics, University of Bielefeld, Bielefeld, Germany
Ildikó Kemény: Department of Design, Production & Management, University of Twente, Enschede, The Netherlands; Department of Digital Marketing, Corvinus University of Budapest, Budapest, Hungary
Jörg Henseler: Department of Design, Production & Management, University of Twente, Enschede, The Netherlands; Nova Information Management School, Universidade Nova de Lisboa, Lisbon, Portugal
*Corresponding author: Florian Schuberth; Email: f.schuberth@utwente.nl

Abstract

In principle, structural equation modeling (SEM) is capable of emulating all approaches based on the general linear model. Yet, modeling sum scores in a structural equation model is not straightforward. Existing approaches to studying sum scores in a structural equation model are limited in terms either of model specification or of model assessment. This paper introduces a specification to SEM that allows for directly modeling sum scores and that overcomes existing approaches’ limitations in dealing with sum scores in the SEM context. The sum score model we present builds on the recently proposed refined Henseler–Ogasawara (H–O) specification of composites. It allows us to estimate models with sum scores in an integrative way. It can mimic the results of existing approaches and provides a means of assessing whether a sum score fully transmits the effects of or on the variables that make up the sum score. In addition, it allows for taking into account random measurement error in the variables that form the sum score. Consequently, this model specification offers researchers an improved way of judging and defending the use of sum scores empirically and conceptually.

Keywords

composite model; equal weights; full transmission; Henseler–Ogasawara specification; sum scores

Type: Theory and Methods
Information: Psychometrika, First View, pp. 1-26

DOI: https://doi.org/10.1017/psy.2024.5
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society

1 Introduction

Sum scores have a long tradition in the social sciences (Kuder & Richardson, 1937; Richardson, 1936; Traub, 2005). They are calculated as the sum of observed variables and constitute a straightforward way of creating scores of theoretical constructs (Nunnally & Bernstein, 1994). Due to the ease of calculating them, sum scores are very popular in various disciplines, including psychology (McNeish & Wolf, 2020), biology (Cheniti et al., 2017), medical research (Wang & Reeve, 2021), and physics (Hardt et al., 2018). Arguably, sum scores owe their popularity to the fact that using them to build construct scores ensures comparability and simplifies the reproducibility of different studies (Widaman & Revelle, 2023). Further, the popularity of sum scoring is reflected in the ample use of various metrics that build on sum scores, such as Cronbach’s $\alpha$ (Cronbach, 1951; Sijtsma, 2009), McDonald’s (1999) $\omega\rho _c$, and the heterotrait-monotrait ratio of correlations (HTMT/HTMT2; Henseler et al., 2015; Roemer et al., 2021; Rönkkö & Cho, 2022).

Sum scores are used for various reasons. For instance, they are used in procedures such as item parceling (Little et al., 2002). Moreover, sum scores are often used as “fallible (i.e., imperfect) estimates of the relative position of individuals on the dimension implied by the sum scores and pretend to be nothing more” (Widaman & Revelle, 2023, p. 794). However, a sum score also constitutes a special type of composite, i.e., a weighted linear combination of variables, and thus can be used to represent theoretical constructs (Cohen et al., 1990; Grace & Bollen, 2008; Saris & Gallhofer, 2014). “If a construct is defined in a way such that the building of a sum score maps on this definition well (Lundberg et al., 2021), then its use is appropriate” (Edelsbrunner, 2022, p. 3). For example, job satisfaction can be considered as the sum of different aspects, which include salary and working hours, opportunities for advancement, job security, autonomy in doing the work, social contacts, and usefulness of the job for society (Saris & Gallhofer, 2014, Chapter 1). In this case, a sum score would summarize the effects of or on the variables forming the sum score, i.e., the collective effects of or on the sum score’s variables. In the literature, such a summary effect, i.e., the effect of or on the composite, is also known as the sheaf coefficient (Heise, 1972).

A widely used approach in the social and behavioral sciences for studying relationships between variables is structural equation modeling (SEM; Bollen, 1989). It provides researchers with several means by which to assess models, such as the $\chi^2$-test (Jöreskog, 1969), various fit indices (Hu & Bentler, 1998), and information criteria (Bozdogan, 1987). Moreover, SEM is a very versatile and holistic framework, which can in principle emulate all approaches based on the general linear model (Graham, 2008). Hence, it should be possible to seamlessly incorporate sum scores in SEM. In fact, several suggestions have been made. Specifically, analysts can use a two-step approach to include sum scores in a structural equation model or can rely on approaches to model sum scores in SEM, such as the one-step approach (e.g., Grace & Bollen, 2008) and the pseudo-indicator approach (Rose et al., 2019).

Unfortunately, the existing approaches that enable incorporating sum scores in SEM all come with limitations. The two-step approach does not take the formation of the sum score into account and thus omits the sum score’s components from the actual model. This prevents researchers from exploiting the full capabilities of SEM, such as applying the direct maximum likelihood approach for dealing with missing values in the variables making up the sum score (e.g., Allison, 2003). Similarly, the one-step approach (Grace & Bollen, 2008) can only model a sum score as a predictor of other variables in the model (MacCallum & Browne, 1993). Thus, this approach is limited in its flexibility for modeling sum scores. Further, although the pseudo-indicator approach permits the flexible modeling of sum scores in SEM, there is currently no available guidance on how to model a sum score that fully transmits the collective effects on its components. As a result, the current approaches do not allow for flexible modeling of sum scores in SEM or miss opportunities to assess them.

To overcome the limitations of the existing approaches to dealing with sum scores in a structural equation model, we introduce a new way of specifying sum scores in SEM: the sum score model. This model is a special case of the recently introduced Henseler–Ogasawara (H–O) specification of composites (Henseler, 2021; Schuberth, 2023; Yu et al., 2023). It allows researchers to flexibly integrate sum scores into a larger model that also includes other variables. Moreover, it is possible to assess whether a sum score fully transmits all collective effects of or on the variables that make up the sum score. This can be useful when a sum score is used to model a theoretical construct (Grace & Bollen, 2008). Further, we present two ways of accounting for random measurement error in the sum score model to avoid attenuation bias in the parameter estimates. Finally, as our proposed sum score model is based on the refined H–O specification, it is straightforward to replace a sum score with a composite whose weights are freely estimated.

The remainder of this article is structured as follows. Section 2 presents the existing approaches to specifying sum scores in SEM. Particularly, we discuss the various approaches’ advantages and disadvantages. In Section 3, we introduce the sum score model based on the refined H–O specification as an alternative, more flexible approach to model sum scores in SEM. Additionally, we show how to relax the assumption of the original H–O specification in the sum score model according to which the sum score fully transmits all collective effects of or on its components. Further, we present two ways of accounting for random measurement errors in the sum score model. By means of three illustrative examples, Section 4 compares the results of our sum score model with those of existing approaches. This section also highlights the sum score model’s capabilities. Finally, in Section 5, we close the paper with a discussion.

2 Existing approaches to dealing with sum scores in SEM

Several approaches have been proposed for dealing with sum scores in SEM. They can be divided into approaches that include sum scores and approaches that model sum scores in a structural equation model.

2.1 Approach to include sum scores

The two-step approach can be considered the classical approach for dealing with sum scores in SEM. As its name suggests, it includes a sum score in the model following two steps. In the first step, the sum score is calculated before the actual analysis, i.e., the variables forming the sum score are simply summed up. Subsequently, in the second step, the sum score is used as a new observed variable to replace the original variables in a structural equation model.¹

On the one hand, the two-step approach is very easy to implement. On the other hand, the fact that the creation of the sum score is not modeled, and thus the variables making up the sum score are not included in the final model, entails major disadvantages. Particularly, it is not possible to assess whether the sum score properly summarizes the collective effects of or on its components. In other words, it is not possible to examine whether the sum score fully transmits the effects of or on its components. Hence, this approach misses opportunities for model assessment. Further, researchers studying sum scores might not benefit from all of SEM’s capabilities. For instance, researchers cannot use the direct maximum likelihood approach (e.g., Allison, 1987, 2003), also known as the full information maximum likelihood approach, to deal with missing values in the variables making up a sum score and need to rely on alternatives such as the two-stage maximum likelihood approach (e.g., Chen et al., 2020; Savalei & Bentler, 2009).

2.2 Approaches to model sum scores

Researchers have started to develop approaches that can be used to model sum scores in SEM. In contrast to the two-step approach, these approaches model sum scores in a single step, thus considering the creation of a sum score within the model. Arguably, the most straightforward approach to model sum scores is the one-step approach. This approach models the sum score as a latent variable in a causal-formative measurement model (e.g., Bollen & Lennox, 1991). The effects of all indicators on the latent variable are fixed to one and the variance of the latent variable’s error term is fixed to zero. Consequently, the latent variable becomes an observed variable, i.e., the sum score. For more information on the one-step approach, we refer the interested reader to Grace and Bollen (2008).

The one-step approach allows for directly modeling sum scores in a structural equation model, thus overcoming the two-step approach’s limitations. It takes the creation of the sum score in the model into account, and thus allows researchers to employ the direct maximum likelihood approach to deal with missing values in the observed variables that make up the sum scores. Further, it can be used to assess whether a sum score fully transmits the effects of the variables that make up the sum score on some outcome variables (Grace & Bollen, 2008). However, this approach also has its limitations. For instance, it does not allow for modeling effects of other variables on the sum score, i.e., modeling the sum score as an outcome variable in the structural model is not possible. In such cases, the model would not be identified (MacCallum & Browne, 1993). Hence, it is not possible to model and assess whether a sum score properly summarizes the collective effects of other variables on the variables making up the sum score, i.e., whether the sum score fully transmits the effects on the variables that make up the sum score. Similarly, since this approach always models the sum score as a dependent variable, it is not possible to specify covariances between the sum score and other exogenous variables of the model. A putative solution would be to model covariances between the error term of the latent variable and other exogenous variables. However, since the error term’s variance is fixed to zero, this would be a fruitless endeavor.

A more flexible approach to model sum scores is the pseudo-indicator approach (Rose et al., 2019). This approach takes advantage of the fact that a sum score is the sum of observed variables. Specifically, one of the observed variables becomes the pseudo-indicator, which is expressed as the difference between the sum score and the remaining observed variables. Thereby, the sum score is modeled as a latent variable with a single indicator, i.e., the pseudo-indicator. To ensure that the sum score is in fact the sum of its variables, the variance of the resulting error term of the pseudo-indicator needs to be fixed to zero. In addition, the effects of the remaining observed variables on the pseudo-indicator are fixed to minus one. Consequently, the latent variable becomes the sum score. Finally, the covariances between the remaining observed variables and the sum score (and usually also the covariances between the remaining observed variables and potential other exogenous variables of the model) are specified. For more details on the pseudo-indicator approach, we refer the interested reader to Rose et al. (2019).
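As a brief illustration of the description above, the pseudo-indicator equation can be written out for K observed variables. The choice of $x_1$ as the pseudo-indicator and the symbol $\eta$ for the sum score are assumptions made here for illustration; any of the observed variables could play this role.

$$ x_1 = \eta - \sum_{i=2}^{K} x_i \qquad \Leftrightarrow \qquad \eta = \sum_{i=1}^{K} x_i $$

The loading of the pseudo-indicator on $\eta$ is one, the coefficients of $x_2, \dots, x_K$ are minus one, and there is no error term, which corresponds to the fixed parameters described in the text.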

The pseudo-indicator approach is an elegant way of flexibly modeling sum scores in SEM. It overcomes the drawbacks of the one-step approach, i.e., it can model a sum score as a predictor or an outcome variable, and it allows for specifying covariances between sum scores and other exogenous variables of the structural equation model. However, the extant literature on the pseudo-indicator approach currently lacks guidance on how to model a sum score that fully transmits the collective effects on its components. Originally, the pseudo-indicator approach models a sum score in such a way that the resulting model is equivalent to the target model. The target model is the model that includes the sum score and other variables of interest but not the observed variables that form the sum score. For this purpose, the constraints created by including the sum score’s observed variables are removed. Thereby, it is ensured that the model-implied variance-covariance matrix of the target model remains unaffected by the inclusion of the sum score’s observed variables. Specifically, various covariances between the sum score’s observed variables, i.e., all observed variables that make up the sum score, except the pseudo-indicator, and other exogenous variables of the model are specified. Although Rose et al. (2019) mention the possibility of fixing these covariances to zero, which allows one to assess whether a sum score fully transmits the collective effects of its components, they do not currently provide any information on how to fix these covariances to ensure that the sum score fully transmits the collective effects on its components. We must, therefore, conclude that this approach is currently limited in its ability to assess whether a sum score properly summarizes the collective effects.

3 A new approach for modeling sum scores in SEM

In this section, we present a new approach for modeling sum scores in SEM: the sum score model. The sum score model is based on the refined Henseler–Ogasawara (H–O) specification (Schuberth, 2023; Yu et al., 2023). As sketched by Henseler (2021), the H–O specification to flexibly model composites in SEM uses the idea, which Ogasawara (2007) also introduced in the context of canonical correlation analysis, of expressing the relationship between a composite and the variables making up the composite in terms of (composite) loadings. This explains the name of this specification. Subsequently, the H–O specification has been elaborated (Schuberth, 2023) and refined to reduce its complexity (Yu et al., 2023). Since a composite is a weighted linear combination of other variables (e.g., Cohen et al., 1990; Edwards & Bagozzi, 2000), a sum score can be viewed as a special type of composite, namely a composite created with unit weights. Therefore, the H–O specification seems to be well suited for modeling sum scores in SEM. In the following subsections, we present the refined H–O specification and modifications of it that are potentially relevant to researchers dealing with sum scores. Specifically, in the next subsection, we present the refined H–O specification for composites, where the composite weights are free model parameters. Next, we show the parameter constraints necessary to obtain unit weights and thus sum scores, i.e., the sum score model. Further, we demonstrate in the sum score model how to relax the assumption of the original refined H–O specification that all effects on or of the variables that make up the composite are fully transmitted by the composite. In this way, the sum score model can achieve the same results as the pseudo-indicator approach.
Finally, in the last subsection, we show how random measurement error in the variables that make up a sum score can be taken into account in the sum score model. To present the H–O specification and the sum score model, we use the SEM framework proposed by Jöreskog (1970); see also Jöreskog (1978). In particular, we assume that the observed variables follow a multivariate normal distribution, and for the sake of simplicity, we assume that the observed variables are mean centered.

3.1 Modeling composites using the refined H–O specification

The starting point is a set of observed variables $x_1, \dots, x_K$ with variance-covariance matrix $\boldsymbol{\Sigma}$ that composes a composite $\eta$: $\eta = \sum_{i=1}^{K}{w_i x_i}$.² The H–O specification exploits the fact that K distinct composites can be extracted from these observed variables. For this reason, in addition to the composite of interest $\eta$, $K-1$ further composites are extracted from the set of observed variables, as Equation (1) shows.

$$ \begin{pmatrix} \eta \\ \boldsymbol{\nu} \end{pmatrix} = \boldsymbol{W}' \boldsymbol{x}. \tag{1} $$

The additional composites $\boldsymbol{\nu} = \begin{pmatrix} \nu_1 & \dots & \nu_{K-1} \end{pmatrix}'$ are referred to as excrescent variables, and together with the composite of interest $\eta$ they span the entire space of the observed variables. The square matrix $\boldsymbol{W}$ of dimension K contains, in its columns, the weights to form the composite of interest and the $K-1$ excrescent variables.

As is known from principal component analysis, the relations between composites and their observed variables can be expressed both by means of weights and by means of composite loadings:

$$ \boldsymbol{x} = \left(\boldsymbol{W}'\right)^{-1} \begin{pmatrix} \eta \\ \boldsymbol{\nu} \end{pmatrix} = \boldsymbol{\Lambda} \begin{pmatrix} \eta \\ \boldsymbol{\nu} \end{pmatrix}, \tag{2} $$

where the square matrix $\boldsymbol{\Lambda}$ of dimension K contains the loadings of the composite $\eta$ and the excrescent variables $\boldsymbol{\nu}$ in its columns. As Equation (2) shows, the weights can, in principle, be obtained as the elements of the inverse of the transposed composite loading matrix: $\boldsymbol{W} = (\boldsymbol{\Lambda}')^{-1}$.
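The relation between weights and composite loadings can be checked numerically. The following is a minimal sketch, assuming NumPy and a purely hypothetical weight matrix `W` for K = 3 (the numbers are made up for illustration and do not come from the article):

```python
import numpy as np

# Hypothetical weight matrix for K = 3 observed variables (illustrative values only).
# Column 0: weights of the composite of interest; columns 1-2: excrescent variables.
W = np.array([
    [0.6,  1.0,  0.0],
    [0.3, -0.5,  1.0],
    [0.4,  0.0, -0.8],
])

# Equation (2): the loading matrix is the inverse of the transposed weight matrix.
Lambda = np.linalg.inv(W.T)

# Going back: W = (Lambda')^{-1}.
W_recovered = np.linalg.inv(Lambda.T)

# For an arbitrary realization x, forming the composites via W'x and mapping
# them back through Lambda reproduces x exactly.
x = np.array([1.2, -0.7, 0.4])
composites = W.T @ x          # (eta, nu_1, nu_2)
x_back = Lambda @ composites  # equals x

print(np.allclose(W, W_recovered), np.allclose(x, x_back))
```

Because the composite of interest and the excrescent variables together span the space of the observed variables, no information is lost when moving between the weight and the loading representation.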

To ensure that the H–O specification is identified and that the loading matrix is invertible, some parameters need to be fixed (Schuberth, 2023; Yu et al., 2023). To determine the scale of the composites, i.e., the composite of interest and the excrescent variables, one loading can be fixed for each composite. Usually, these composite loadings are set to one. Each observed variable may only be used once for scaling purposes. In addition, we need to determine how the excrescent variables are extracted from the observed variables. In general, there are various ways to do this, as long as the identification of the parameters is ensured. In this study, we use the parameterization of the refined H–O specification (Yu et al., 2023). Therefore, only two observed variables are allowed to load on each excrescent variable, and we have to ensure that no excrescent variables are connected to exactly the same observed variables. Further, no observed variable is allowed to be related to more than two excrescent variables. Finally, the excrescent variables are allowed to correlate freely with each other, but they have to be uncorrelated with the composite of interest $\eta$. This ensures that the variance-covariance matrix of the observed variables that form a composite can be perfectly reproduced, i.e., no constraints are imposed on the variance-covariance matrix $\boldsymbol{\Sigma}$ of the composite’s observed variables. Furthermore, as the weights are freely estimated in this H–O specification, the composite of interest $\eta$ must not be isolated, i.e., it must have a relationship, e.g., a path coefficient or covariance, with at least one other variable of the model besides its observed variables (see also Dijkstra, 2017).
This is supported by the fact that the H–O specification based on free weights without additional variables related to the composite of interest shows a negative number of degrees of freedom if more than one variable forms the composite. Similarly, this highlights that the freely estimated weights are context specific, i.e., the weights depend not only on the variables that make up the composite but also on the other variables of the model, including their metrics (Heise, 1972). The same holds for the one-step approach if the weights are freely estimated.
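The statement about negative degrees of freedom can be made concrete with a simple parameter count. The sketch below is our own tally, derived from the constraints listed above (one fixed loading per composite, two loadings per excrescent variable, free covariances among excrescent variables, none with the composite of interest); the function name `ho_free_weights_df` is hypothetical and the count is an illustration, not a formula taken from the article.

```python
def ho_free_weights_df(K: int) -> int:
    """Degrees of freedom of an isolated refined H-O block with free weights.

    Assumed parameter count (derived from the constraints described in the text):
    - K - 1 free loadings of the composite of interest (one is fixed for scaling),
    - K - 1 free loadings of the excrescent variables (two each, one fixed),
    - 1 + (K - 1) variances of the composite and the excrescent variables,
    - (K - 1)(K - 2) / 2 covariances among the excrescent variables.
    """
    moments = K * (K + 1) // 2
    free_composite_loadings = K - 1
    free_excrescent_loadings = K - 1
    variances = 1 + (K - 1)
    excrescent_covariances = (K - 1) * (K - 2) // 2
    n_params = (free_composite_loadings + free_excrescent_loadings
                + variances + excrescent_covariances)
    return moments - n_params


for K in range(1, 5):
    print(K, ho_free_weights_df(K))
```

Under this count, the degrees of freedom equal 1 - K, which is negative whenever more than one variable forms the composite.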

Figure 1 shows an example of the refined H–O specification for a composite $\eta$ made up of three observed variables $x_1$, $x_2$, and $x_3$. For the sake of clarity, the variances of the exogenous variables are not shown in this figure. In this example model, the six collective effects of the observed variables $\boldsymbol{x}$ making up the composite of interest $\eta$ on the two outcome variables $\boldsymbol{y}$ are calculated as $\boldsymbol{\gamma} \boldsymbol{w}'$, where the column vector $\boldsymbol{\gamma}$ contains the two summary effects of the composite on the outcome variables, and the column vector $\boldsymbol{w}$ contains the three weights used to form the composite of interest. We provide the derivation of the collective effects in Appendix A.1.
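To illustrate the outer product $\boldsymbol{\gamma} \boldsymbol{w}'$, the following minimal sketch uses NumPy with hypothetical values for the summary effects and weights (the numbers are assumptions chosen for illustration only):

```python
import numpy as np

# Hypothetical summary effects of the composite on y1 and y2 (2 x 1).
gamma = np.array([[0.5], [0.2]])
# Hypothetical weights of x1, x2, x3 in the composite (3 x 1).
w = np.array([[0.4], [0.3], [0.6]])

# Collective effects: entry (j, i) is the effect of x_i on y_j via the composite.
collective_effects = gamma @ w.T   # shape (2, 3): six effects in total

print(collective_effects)
```

Each row of the resulting matrix is proportional to the weight vector, which reflects that the composite transmits all effects of its components on the outcomes.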

Figure 1 Example of the refined Henseler–Ogasawara specification with free weights.

3.2 The sum score model based on the refined H–O specification

A sum score is a special type of composite, namely a composite created using unit weights. To account for this fact in the refined H–O specification, further constraints have to be imposed to ensure that the weights are equal to one. Therefore, we first change the scaling condition for the composite of interest. Instead of fixing one loading of the observed variables to one, we fix the sum of the loadings to one. In the context of latent variable models, this approach is also known as effects coding (e.g., Klopp & Klößner, 2021; Little et al., 2006). Next, we fix the loadings of each excrescent variable in such a way that their sum equals zero, e.g., by fixing one loading to one and the other loading to minus one. Figure 2 depicts an H–O specification in which the composite of interest $\eta$ constitutes a sum score, i.e., it depicts a sum score model. For the sake of clarity, the variances of the exogenous variables are not depicted in this figure. Note that, in contrast to the refined H–O specification with free weights, it is no longer necessary that the sum score be connected to other variables of the model, as the weights are fixed in the sum score model.

Figure 2 Example of a sum score model based on the refined Henseler–Ogasawara specification.

To demonstrate that these constraints lead to a sum score, we recall that in the H–O specification the weights are obtained as the inverse of the transposed composite loading matrix. Thus, the product of the transposed composite loading matrix and the weight matrix equals the identity matrix of dimension $K \times K$ :

$$ \boldsymbol{W} = (\boldsymbol{\Lambda}')^{-1} \qquad \Leftrightarrow \qquad \boldsymbol{\Lambda}' \boldsymbol{W} = \boldsymbol{I}, \tag{3} $$

where the first column of $\boldsymbol{\Lambda}$ contains the composite loadings of $\eta$ and the remaining columns contain the composite loadings of $\boldsymbol{\nu}$. Similarly, the first column of $\boldsymbol{W}$ contains the weights to form $\eta$, while the remaining columns contain the weights to form $\boldsymbol{\nu}$. Consequently, the inner product of the composite of interest’s loadings and its weights must equal one:

$$ \sum\limits^{K}_{i=1}{ \lambda_{i1} w_{i1}} = 1. \tag{4} $$

Similarly, the inner product of each individual excrescent variable’s loadings and the weights of the composite of interest must be equal to zero:

$$ \sum\limits^{K}_{i=1}{ \lambda_{ij} w_{i1}} = 0, \tag{5} $$

where j refers to the excrescent variables, i.e., $j=2,\dots,K$. In addition, each excrescent variable is related to exactly two observed variables $x_k$ and $x_l$. Therefore, Equation (5) simplifies to: $\lambda_{kj} w_{k1} + \lambda_{lj} w_{l1} = 0$. As the two loadings are fixed in such a way that they sum up to zero, the two weights $w_{k1}$ and $w_{l1}$ must be equal. Moreover, no observed variable is related to more than two excrescent variables, and no excrescent variables are connected to the same observed variables. This implies that the weights forming the composite of interest are all equal, i.e., $w_{11}=w_{21}=\dots=w_{K1}=w$. Consequently, Equation (4) becomes: $\sum^{K}_{i=1}{ \lambda_{i1} w} = 1$. Since the sum of the composite of interest’s loadings is constrained to be equal to one for identification purposes, the weights that form the composite of interest are all equal to one: $w = 1/\sum^{K}_{i=1}{ \lambda_{i1}}=1/1=1$.³ Consequently, the composite of interest constitutes a sum score.
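The argument above can be verified numerically for K = 3. The following minimal sketch assumes NumPy and hypothetical composite loadings `lam_eta` that sum to one; the excrescent variables are assumed to load on $(x_1, x_2)$ and $(x_2, x_3)$ with loadings fixed to one and minus one, which is one pattern that satisfies the constraints described in the text.

```python
import numpy as np

# Hypothetical loadings of the composite of interest; the only requirement is sum = 1.
lam_eta = np.array([0.5, 0.3, 0.2])

# Loading matrix: column 0 = composite of interest, columns 1-2 = excrescent variables.
Lambda = np.column_stack([
    lam_eta,
    [1.0, -1.0, 0.0],   # nu_1 loads on x1 and x2 (loadings sum to zero)
    [0.0, 1.0, -1.0],   # nu_2 loads on x2 and x3 (loadings sum to zero)
])

# Equation (3): W = (Lambda')^{-1}; the first column holds the weights of eta.
W = np.linalg.inv(Lambda.T)
w_eta = W[:, 0]

print(w_eta)
```

Changing `lam_eta` to any other values that sum to one leaves the first column of `W` at unit weights, in line with the derivation.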

The sum score model we have presented has several advantages. For instance, it provides a versatile way of integrating sum scores into structural equation models. Similar to the pseudo-indicator approach, it offers us the flexibility to model a sum score as a predictor or an outcome variable within a structural model. Further, our proposed sum score model can be used to model the sum score as a variable that transmits all collective effects of or on the variables that make up the sum score. In this case, the sum score $\eta$ accounts for all covariances between its observed variables $\boldsymbol{x}$ and other variables of the model $\boldsymbol{y}$, i.e., $\text{cov}(\boldsymbol{x}, \boldsymbol{y} \vert \eta)=\mathbf{0}$. In other words, the excrescent variables show no correlation with any other variable in the model. As a result, the covariance matrix of the sum score’s observed variables and all other variables of the model is of rank one, i.e., $\text{rank}(\text{cov}(\boldsymbol{x}, \boldsymbol{y}))=1$. We call this the full transmission assumption. As Grace and Bollen (2008) explained, for a composite with free weights, the effects of the components on the outcome variables, i.e., the collective effects, must be proportional; otherwise the effects of the composite on the outcome variables, i.e., the summary effects, will provide a distorted picture and the full transmission assumption will be violated.⁴ Consequently, the full transmission assumption depends on the scale of the variables forming the composite and the other variables in the model.
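The rank-one property can be illustrated with a small population-level calculation. The sketch below assumes NumPy, a hypothetical covariance matrix `Sigma` of three components, and outcome errors that are uncorrelated with the components; all numerical values are illustrative assumptions.

```python
import numpy as np

# Hypothetical covariance matrix of the three observed variables x.
Sigma = np.array([
    [1.0, 0.3, 0.2],
    [0.3, 1.0, 0.4],
    [0.2, 0.4, 1.0],
])
ones = np.ones((3, 1))

# Full transmission: y = gamma * eta + zeta with eta = 1'x, so cov(x, y) = Sigma 1 gamma'.
gamma = np.array([[0.5], [0.2]])
cov_xy_full = Sigma @ ones @ gamma.T

# Violation: components affect the outcomes directly with unequal effects, y = B x + zeta.
B = np.array([
    [0.5, 0.5, 0.5],
    [0.2, 0.4, 0.1],
])
cov_xy_violated = Sigma @ B.T

rank_full = np.linalg.matrix_rank(cov_xy_full)
rank_violated = np.linalg.matrix_rank(cov_xy_violated)
print(rank_full, rank_violated)
```

In the first case every column of the covariance matrix is a multiple of $\boldsymbol{\Sigma}\boldsymbol{1}$, so the rank is one; in the second case the unequal effects on the second outcome raise the rank to two.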

In the sum score context, the full transmission assumption implies that the collective effects of the components on an outcome variable must be equal. For our example model shown in Figure 2, the collective effects of the three variables $\boldsymbol{x}$ forming the sum score $\eta$ on the outcome variables $\boldsymbol{y}$ are calculated as $\boldsymbol{\gamma} \boldsymbol{1}'$, where the two column vectors $\boldsymbol{\gamma}$ and $\boldsymbol{1}$ contain the two summary effects and ones, respectively. Note that when there is only one outcome variable, the components can always be scaled so that the full transmission assumption is satisfied. However, if there is more than one outcome variable in the model, this is no longer necessarily the case. A similar result can be shown for the case in which a sum score is used to summarize the collective effects on the variables making up the sum score. The full transmission assumption offers researchers the opportunity to assess whether a sum score properly summarizes the collective effects. However, this assumption is in general not necessary for the use of sum scores in SEM, and its usefulness depends on the specific research context. Therefore, the following subsection shows how this assumption can be relaxed.

3.3 Relaxing the full transmission assumption in the sum score model

Our sum score model presented above implies that all covariances between the sum score’s observed variables and other variables of the model are accounted for by the sum score. Since the weights are fixed in the sum score model, this assumption can be relaxed. For this purpose, the covariances between the excrescent variables and other exogenous variables of the model need to be specified as free model parameters. These covariances account for the covariation between the sum score’s observed variables and other variables of the model that the sum score does not account for. In other words, these free covariances capture the covariation and/or effects that are not transmitted by the sum score. Relaxing the full transmission assumption allows us to mimic the results of the pseudo-indicator approach, as is also illustrated in Section 4. Therefore, this specification is particularly useful if the goal is to model sum scores without affecting the variance-covariance matrix implied by the target model (see also Rose et al., 2019). The target model is the model that does not contain the observed variables making up the sum scores, i.e., the model of the second step in the two-step approach.

To demonstrate how the full transmission assumption can be relaxed, we consider the example model from Figure 2, where a researcher is studying the effect of a sum score on two observed outcome variables $y_1$ and $y_2$ . As can be seen in Figure 3, to relax the assumption that the sum score fully transmits all collective effects of the observed variables making up the sum score on the two outcome variables $y_1$ and $y_2$ , we can specify free covariances between the excrescent variables $\nu _1$ and $\nu _2$ and the error terms $\zeta _1$ and $\zeta _2$ of the outcome variables. Note that the variances of the exogenous variables are omitted in the figure.

Figure 3 Example of a sum score model in which the full transmission assumption has been relaxed.

3.4 Taking into account random measurement error in the sum score model

In the sum score model, we assumed that the observed variables making up the sum score are free of random measurement error. However, this does not necessarily need to be the case, and in practice, the observed variables making up a sum score may be contaminated by random measurement error: $\boldsymbol {x} = \boldsymbol {x}^* + \boldsymbol {\varepsilon }$, where $\boldsymbol {x}^*$ are the variables free from random measurement error, and $\boldsymbol {\varepsilon }$ are the random measurement errors. The random measurement errors are assumed to be mutually uncorrelated and uncorrelated with $\boldsymbol {x}^*$ (and potential other variables of the model). In this case, the relationships between the sum score made up of measurement error contaminated variables and other variables of the model will most likely be distorted due to attenuation (Bollen & Lennox, 1991; Cohen et al., 1990). A similar problem is known from factor score regression, in which composites are used as approximations for latent variables (e.g., Devlieger & Rosseel, 2017; Schuberth et al., 2023; Skrondal & Laake, 2001). To correct for random measurement error, we follow an approach frequently mentioned in the literature (Cole & Preacher, 2014; Hayduk, 1996; Hayduk & Littvay, 2012; Savalei, 2019), i.e., to model the random measurement error contaminated variable as a single indicator of a latent variable with a fixed error term variance. Consequently, for the sum score model, random measurement error can be corrected in at least two ways: (1) on the sum score level or (2) on the observed variable level.⁵

To account for random measurement error on the sum score level, the sum score can be modeled as a single indicator of a latent variable. Specifically, the loading of the sum score on the single-indicator latent variable is fixed to one, and the variance $\theta $ of the resulting error term $\delta $ needs to be fixed to (1 $-$ reliability of the sum score) $\times $ the variance of the sum score $\eta $ (Nunnally & Bernstein, 1994, Equation 7-6). In this way, the variance in the sum score that is due to random measurement error is partialled out and the variance of the latent variable accounts for the remaining variance in the sum score, i.e., the variance that is not caused by random measurement error.
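The rule for fixing the error term variance can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not part of the model estimation; the function name `fixed_error_variance` and the numerical values (a reliability of 0.8 and a sum score variance of 5) are hypothetical and chosen for the example.

```python
def fixed_error_variance(reliability: float, variance: float) -> float:
    """Return (1 - reliability) * variance, the value to which the error
    term variance of a single-indicator latent variable is fixed."""
    if not 0.0 < reliability <= 1.0:
        raise ValueError("reliability must lie in (0, 1]")
    if variance < 0.0:
        raise ValueError("variance must be non-negative")
    return (1.0 - reliability) * variance


# Hypothetical values: reliability of the sum score and its variance.
theta = fixed_error_variance(reliability=0.8, variance=5.0)

# The remaining variance is attributed to the single-indicator latent variable.
true_score_variance = 5.0 - theta
print(theta, true_score_variance)
```

The same function applies unchanged to a correction on the observed variable level, where it would be called once per observed variable with that variable's reliability and variance.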

Figure 4 demonstrates this approach for our example model from Figure 2. If no correction takes place, i.e., if a researcher specifies the model from Figure 2, the collective effects of $\boldsymbol {x}$ on $\boldsymbol {y}$ are calculated as: $\hat {\boldsymbol {\gamma }} \boldsymbol {1}'$, where the estimated summary effects $\hat {\boldsymbol {\gamma }}$ (and thus the collective effects) will likely be distorted due to attenuation. Since the sum score $\eta \ (=\sum _{i=1}^{3}{(x_i^*+\varepsilon _i)}=\eta ^*+\delta )$ contains random measurement error, the estimated summary effects, i.e., the effects of $\eta $ on $\boldsymbol {y}$, will converge in probability to $\boldsymbol {\gamma } (1-\text {var}(\delta )/\text {var}(\eta ))$, where $\boldsymbol {\gamma }$ is the probability limit of the summary effects in case of no random measurement error. In contrast, if random measurement error is accounted for on the sum score level, the estimated summary effects are calculated based on $\eta ^*$, i.e., the sum score corrected for random measurement error. Given a correct reliability estimate of the sum score, these estimated summary effects will converge in probability to $\boldsymbol {\gamma }$. Consequently, the collective effects are corrected for attenuation and equal the effects of $\boldsymbol {x}^*$ on $\boldsymbol {y}$, i.e., the effects of the sum score’s components without random measurement error on the outcome variables.
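The attenuation factor can be traced in one line. As a sketch for a single outcome variable $y$ with summary effect $\gamma $, assume $y = \gamma \eta ^* + \zeta $ and $\eta = \eta ^* + \delta $, where $\delta $ is uncorrelated with $\eta ^*$ and $\zeta $, and $\zeta $ is uncorrelated with $\eta ^*$ (the scalar notation is introduced here for illustration):

$$\text {plim}\, \hat {\gamma } = \frac {\text {cov}(\eta , y)}{\text {var}(\eta )} = \frac {\gamma \, \text {var}(\eta ^*)}{\text {var}(\eta )} = \gamma \, \frac {\text {var}(\eta ) - \text {var}(\delta )}{\text {var}(\eta )} = \gamma \left (1 - \frac {\text {var}(\delta )}{\text {var}(\eta )}\right )$$

The ratio $\text {var}(\eta ^*)/\text {var}(\eta )$ is the reliability of the sum score, so the uncorrected summary effect is the error-free summary effect multiplied by that reliability.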

Figure 4 Accounting for random measurement error at the sum score level in the sum score model.

Although correcting for measurement error on sum score level can address the issue of distorted path coefficient estimates between the sum score and the other variables of the model, the composite loadings and thus the free weights, i.e., the weights to form the excrescent variables, remain uncorrected because the correction takes place on sum score level. This is particularly problematic if a sum score is used to summarize the collective effects on its components.

To address this issue, we can take random measurement error on observed variable level into account. Specifically, each contaminated observed variable can be specified as a single indicator of a latent variable. The loading of the single indicator is fixed to one and the variance of the resulting error term is fixed to (1 $-$ reliability of the observed variable) $\times $ the variance of the observed variable. In this way, the latent variable captures the measurement error adjusted variance of the corresponding observed variable. As a result, the composites, i.e., the sum score and the excrescent variables, are formed from measurement error corrected variables, i.e., the single-indicator latent variables. Consequently, not only the relationships between the sum score and other variables of the model but also the relationships between the composites, i.e., the sum score and the excrescent variables, and their observed variables are corrected for attenuation. Hence, the composite loadings, and therefore the free weights, are corrected for random measurement error.

Taking random measurement error in the ways described above into account requires reliability estimates, which might be difficult to obtain in practice. If the variables making up a sum score are assumed to be unidimensional measures of a construct, Mosier’s (1943, Equation 5) formula can be used to determine the reliability of the sum score. Note that this way of correcting for random measurement error requires reliability estimates for each observed variable that makes up the sum score. Alternatively, Cronbach’s $\alpha $ (Cronbach, 1951) can be used if the observed variables are assumed to be essentially tau-equivalent measures of a construct. Moreover, although the proposed corrections can address potential attenuation bias in the parameter estimates given a correct reliability estimate, fixing the variance of an error term to a value derived from a data-based reliability estimate such as Cronbach’s $\alpha $ ignores the uncertainty in the reliability estimate. This carries the risk of drawing incorrect statistical inference (e.g., Oberski & Satorra, 2013). Furthermore, the two ways of correcting for random measurement error introduce additional variables. In the case of correcting for random measurement error on the sum score level, the latent variable $\eta ^*$ and the corresponding error term $\delta $ need to be specified (see Figure 4). However, as the effect of the latent variable on the sum score and the variance of the error term are fixed, no additional free parameters are added to the model. The same holds for the proposed correction on the observed variable level. Consequently, the two approaches to accounting for random measurement error do not alter the number of free model parameters; thus, the model’s degrees of freedom remain unchanged. Yet, the fit of the model can be altered compared to the sum score model without a correction as the parameters are corrected for attenuation.
Further, taking random measurement error into account does not limit the flexibility of our sum score model. Particularly, it is still possible to model a sum score as an outcome variable and to allow for covariances between the sum score and other variables of the model. Similarly, it is possible to relax the full transmission assumption, i.e., to allow for free covariances between the excrescent variables and other exogenous variables of the model.
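To make the route from a reliability estimate to a fixed error term variance concrete, the following Python sketch computes Cronbach's $\alpha $ from a covariance matrix using the standard formula $\alpha = \frac {k}{k-1}\left (1 - \sum _i \text {var}(x_i) / \text {var}(\sum _i x_i)\right )$ and then derives the error variance for a correction on the sum score level. The function name `cronbach_alpha` and the covariance matrix are hypothetical; the sketch assumes essentially tau-equivalent components and ignores the uncertainty in the reliability estimate discussed above.

```python
from typing import Sequence


def cronbach_alpha(cov: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha computed from the covariance matrix of the components."""
    k = len(cov)
    if k < 2:
        raise ValueError("at least two components are required")
    # Sum of the component variances (diagonal elements).
    item_variances = sum(cov[i][i] for i in range(k))
    # Variance of the unit-weighted sum score (sum of all elements).
    sum_score_variance = sum(sum(row) for row in cov)
    return k / (k - 1) * (1.0 - item_variances / sum_score_variance)


# Hypothetical covariance matrix of three components.
cov = [
    [1.0, 0.5, 0.5],
    [0.5, 1.0, 0.5],
    [0.5, 0.5, 1.0],
]

alpha = cronbach_alpha(cov)
sum_score_variance = sum(sum(row) for row in cov)

# Error term variance to fix for the single-indicator latent variable.
theta = (1.0 - alpha) * sum_score_variance
print(alpha, theta)
```

For this hypothetical matrix the sum score variance is 6, $\alpha $ equals 0.75, and the error term variance would be fixed to 1.5.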

4 Illustrative example

We demonstrate the capabilities of the sum score model based on the H–O specification by means of three scenarios. In each scenario, we make use of a different population model. Scenarios 1 and 2 present a situation in which a researcher uses a sum score to summarize the collective effects of or on the sum score’s components. In both scenarios, we use a population model with the same structure. However, in Scenario 2, the components making up the sum score are contaminated by random measurement error. This allows us to demonstrate how random measurement error can be taken into account on observed variable level in the sum score model. Finally, in Scenario 3, we apply the sum score model to a latent variable population model, i.e., sum scores are used as approximations for latent variables. This scenario allows us to demonstrate how random measurement error can be taken into account on sum score level in the sum score model.

In each scenario, we use the corresponding population variance-covariance matrix as input for the model estimation. Hence, strictly speaking, the model parameters were retrieved, not estimated. This allows us to provide explanations for the source of potential model misfit in the absence of sampling uncertainty. The exact model specifications used in each scenario are illustrated in Appendix A.2. All calculations were carried out in the statistical programming environment R (R Core Team, 2022). The different models were estimated using the full-information maximum likelihood estimator as implemented in the R package lavaan (Rosseel, 2012) based on 200 observations.⁶

4.1 Scenario 1: Comparison of approaches to deal with sum scores

In the first scenario, we consider the population model shown in Figure 5.

Figure 5 Population model used in Scenarios 1 and 2.

This population model consists of five observed variables, i.e., one exogenous variable X, three mediator variables $m_1$ to $m_3$, and one outcome variable Y. All observed variables are mean centered. The collective effects of X on the three mediator variables are 0.9, 0.6, and 0.3, and the collective effects of the mediator variables on the outcome variable Y are 0.326, 0.619, and 0.186. The variance of X is set to 1, and the variances of error terms $\zeta _1$ to $\zeta _4$ are set to 4.610, 1.354, 1.111, and 1.000, respectively. Although not shown in Figure 5, the error terms of the mediator variables are allowed to covary as follows: $\mathrm{cov}(\zeta _1,\zeta _2)=-1.062$, $\mathrm{cov}(\zeta _1,\zeta _3)=-0.237$, and $\mathrm{cov}(\zeta _2,\zeta _3)=0.222$. Finally, in this scenario, we assume that all variables are free from random measurement error. Therefore, the variances of the random measurement errors $\varepsilon $ are set to 0, i.e., $\mathrm{var}(\varepsilon _i)=0, \forall i = 1,2,3$.

To summarize the collective effects on and of the three mediator variables $m_1$, $m_2$, and $m_3$, a researcher replaces them by a sum score. This could, for example, represent a situation in which a researcher studies a mother’s availability to interact with and monitor her children. Specifically, the mother’s availability can be regarded as the sum of the number of children, the mother’s illness, and hours of maternal employment (Cohen et al., 1990). For this purpose, the researcher used the following four approaches:

(1) The sum score model assuming full transmission,
(2) The sum score model not assuming full transmission,
(3) The pseudo-indicator approach using unit weights, and
(4) The two-step approach, which is the conventional way of dealing with sum scores in SEM, i.e., in the first step the sum score is created, and second, the sum score is used in a path analysis together with the other variables.

In addition, as a fifth approach, we consider the refined H–O specification. In this specification, the mediator variables $m_1$ to $m_3$ form a composite of which the weights are freely estimated. We did not include the one-step approach, i.e., we did not use the approach in which the sum score is modeled as a formatively measured latent variable because this approach does not permit modeling a sum score as an outcome variable.

Table 1 shows the results of the five approaches. For all approaches using the sum score, i.e., Approaches (1) to (4) above, the weights were equal to one. In contrast, in the refined H–O specification, where $m_1$, $m_2$, and $m_3$, instead of a sum score, form a composite with free weights, we obtained the following weights: $w_{m_1}=0.407$, $w_{m_2}=0.773$, and $w_{m_3}=0.232$. Also, Table 1 reports the results for the two summary effects, i.e., the path coefficient estimates between X and M, and M and Y, the direct effect of X on Y, the collective effects of X on $m_1$ to $m_3$ and $m_1$ to $m_3$ on Y, and various model fit statistics, i.e., the $\chi ^2$-test statistic with its degrees of freedom (df), the root mean square error of approximation (RMSEA, Hu & Bentler, 1999), and the standardized root mean square residual (SRMR, Hu & Bentler, 1998). The collective effects are not model parameters in the different sum score models and the refined H–O specification, but they can be derived as indirect effects of X on $m_1$ to $m_3$, and $m_1$ to $m_3$ on Y, respectively. The values in parentheses show the population values of the parameters (see also Figure 5). For the two-step approach, the collective effects cannot be derived as the mediator variables are not part of the model of the second step. Similarly, the collective effects are not reported for the pseudo-indicator approach.

Table 1 Results of Scenario 1

Note: The refined H–O specification does not create sum scores, but weighted composites; it is listed for comparative reasons only.

As can be seen from Table 1, all approaches using sum scores, i.e., Approaches (1) to (4), produced the same direct effect of X on Y and the same summary effects of X on M, and M on Y, i.e., 0.435, 1.800, and 0.325, respectively. Similarly, the corresponding standard errors (SEs) were the same. However, the approaches differed with respect to the model fit statistics, i.e., the $\chi ^2$-test statistic, RMSEA, and SRMR, and the collective effects.
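The two summary effects can be checked by hand from the population values of Scenario 1 given in the text. The following Python sketch is a plain population-level calculation, not the lavaan estimation used for Table 1, and the variable names are chosen here for illustration. Because the value of a direct effect of X on Y in the population model is not stated in the text, the sketch partials X out of the sum score; under the stated model, the resulting coefficient of M on Y does not depend on that direct effect.

```python
# Population values of Scenario 1 as stated in the text.
a = [0.9, 0.6, 0.3]        # collective effects of X on m1, m2, m3
b = [0.326, 0.619, 0.186]  # collective effects of m1, m2, m3 on Y
psi = [                    # covariance matrix of the error terms zeta_1..zeta_3
    [4.610, -1.062, -0.237],
    [-1.062, 1.354, 0.222],
    [-0.237, 0.222, 1.111],
]

# Summary effect of X on the sum score M = m1 + m2 + m3.
# With var(X) = 1, cov(X, M) / var(X) is the sum of the collective effects.
x_on_m = sum(a)

# After partialling X out of M, the remainder is zeta_1 + zeta_2 + zeta_3.
# The i-th row sum of psi is the covariance of zeta_i with that remainder.
row_sums = [sum(row) for row in psi]
var_m_given_x = sum(row_sums)

# Summary effect of M on Y controlling for X:
# cov(Y, remainder) / var(remainder), where cov(Y, remainder) = b' psi 1.
m_on_y = sum(b_i * c_i for b_i, c_i in zip(b, row_sums)) / var_m_given_x

print(round(x_on_m, 3), round(m_on_y, 3))
```

Rounded to three decimals, the two values agree with the summary effects of 1.800 and 0.325 reported for Approaches (1) to (4).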

The sum score model that does not assume full transmission showed no misfit, and in fact, it showed the exact same $\chi ^2$-test statistic, number of df, and the RMSEA value as the pseudo-indicator approach and the two-step approach. This was expected as this sum score model emulates the pseudo-indicator approach, which was designed to model the sum score in such a way that its inclusion does not affect the model-implied variance-covariance matrix of the target model (Rose et al., 2019).⁷ In our example, the target model is identical to the model of the second step of the two-step approach. Since this model is saturated, all three approaches show a perfect fit.

Considering the sum score model that assumes full transmission, the various model fit criteria showed a misfit. This is because the sum score does not fully transmit the collective effects of and on its components. This is also evidenced by the derived collective effects, which differ from the collective effects in the population model. Consequently, a researcher would likely draw the wrong conclusions from the summary effects. Finally, the refined H–O specification could perfectly reproduce the variables’ variance-covariance matrix as the overall model fit criteria highlight. Moreover, the derived collective effects are identical to the ones of the population model. Consequently, although it was not possible to properly summarize the collective effects of the mediator variables using a sum score, they could be properly summarized by a composite with free weights.

4.2 Scenario 2: Correcting for random measurement error on the observed variable level

Scenario 2 demonstrates how random measurement error can be taken into account on the observed variable level in the sum score model. For this reason, we use the population model of Scenario 1, see Figure 5. However, and in contrast to Scenario 1, in this scenario, the mediator variables $m_1$ , $m_2$ , and $m_3$ are contaminated by random measurement error, i.e., the variances of the random measurement errors $\varepsilon _i$ are positive. Specifically, we contaminated each mediator variable by random measurement error in such a way that the reliabilities of the mediator variables are 0.935, 0.852, and 0.659, respectively. In this scenario, we consider the following three approaches:

(6) The sum score model assuming full transmission and not taking measurement error into account,
(7) The sum score model assuming full transmission and taking random measurement error into account on the observed variable level, and
(8) The refined H–O specification with free weights, thus assuming full transmission, and taking random measurement error on the observed variable level into account.

Approaches (7) and (8) take random measurement error on the observed variable level into account. For this purpose, we model each mediator variable as a single indicator of a latent variable, as described in Subsection 3.4. Specifically, we use the population reliabilities and therefore fix the variances of the resulting error terms to 0.38, 0.3, and 0.6276. As a result, the sum score and composite, respectively, are made up of random measurement error-corrected variables. In contrast, in the sum score model that assumes full transmission and does not take random measurement error into account, i.e., in Approach (6), the sum score is made up of the original mediator variables. Consequently, due to attenuation, its parameter estimates are expected to be biased.

Table 2 shows the results for the three approaches. The two sum score models yielded unit weights and the refined H–O specification produced weights of 0.407, 0.773, and 0.232 for $m_1$ , $m_2$ , and $m_3$ , respectively. The sum score model that does not account for random measurement error, i.e., Approach (6), produced biased parameter estimates, i.e., a biased direct effect of X on Y and biased collective effects. In addition, the model fit criteria showed a model misspecification, which is caused by attenuation and the fact that the sum score cannot fully transmit the collective effects. Similarly, although Approach (7) corrects for random measurement error, it produced biased parameter estimates. As in Scenario 1, this is because the sum score cannot fully transmit the collective effects. This is also evidenced by the various model fit criteria which indicate a model misfit. Finally, the refined H–O specification taking random measurement error into account was able to retrieve the population parameters, and the model fit criteria showed no misfit.

Table 2 Results of Scenario 2

Note: The refined H–O specification does not create sum scores, but weighted composites; it is listed for comparative reasons only.

4.3 Scenario 3: Correcting for random measurement error on the sum score level

In the third scenario, we consider a latent variable population model as depicted in Figure 6. This population model consists of three latent variables $f_1$ , $f_2$ , and $f_3$ , where each is measured by three observed variables. The values of the population parameters are given in the figure.

Figure 6 Population model used in Scenario 3.

In this scenario, the researcher uses sum scores to approximate the latent variables. Therefore, the researcher’s main interest is in studying the relationships between the latent variables and not in summarizing collective effects. For this reason, we relax the full transmission assumption in all sum score models. In particular, we consider the following four approaches:

(9) The sum score model not assuming full transmission,
(10) The two-step approach,
(11) The sum score model not assuming full transmission and taking random measurement error on the sum score level into account, and
(12) The two-step approach with a correction for random measurement error.

Approaches (9) and (10) are expected to produce the same results, as are Approaches (11) and (12). For the sum score model and the two-step approach that do not correct for random measurement error, the estimated relationships between the latent variables are expected to be biased due to attenuation (Cohen et al., 1990; Schuberth et al., 2023). In contrast, Approaches (11) and (12) correct for random measurement error on the sum score level, i.e., each sum score is modeled as a single indicator of a latent variable with a fixed loading and a fixed error term variance. For more details, see Section 3.4 above. We have not included the pseudo-indicator approach in this scenario as it produces the exact same results as the sum score model that does not assume full transmission. Similarly, we have not included the one-step approach as it does not permit modeling a sum score as an outcome variable.

Table 3 presents the results for the various approaches. The sum score model and the two-step approach that do not correct for random measurement error produced the same standardized path coefficient estimates. As expected, the standardized path coefficient estimates differed from the standardized population path coefficients because of attenuation. Also, the $\chi ^2$-test statistic, the df, and the RMSEA were the same for the two approaches, indicating a model misfit. The SRMR differed for the two approaches because the number of residuals is different. Similarly, the sum score model and the two-step approach that correct for random measurement error showed the same results. However, in this case, the distortion in the standardized path coefficient estimates diminished when random measurement error was taken into account. Finally, the model fit criteria indicated no misfit.

Table 3 Results of Scenario 3

5 Discussion

Traditionally, sum scores are studied in SEM following a two-step procedure, which omits the creation of the sum score from the model and, therefore, does not permit researchers to rigorously assess their sum scores and exploit SEM’s full potential. More recent approaches to model sum scores in structural equation models address some of the drawbacks of the traditional approach (Grace & Bollen, 2008; Rose et al., 2019). However, they also show some limitations. Particularly, the one-step approach shows limited flexibility in modeling sum scores in a structural equation model. Moreover, the literature on the pseudo-indicator approach currently lacks guidance on modeling a sum score that fully transmits the collective effects on its components, making it difficult for applied researchers to generally assess the full transmission assumption using this approach.

To address this issue, we introduce the sum score model based on the refined H–O specification. The sum score model overcomes the limitations of the existing approaches. First, our sum score model explicates the creation of a sum score, i.e., it models the sum score, and allows us to specify a sum score as an outcome variable in the structural model. Thus, it overcomes the limitations of both the two-step approach and the one-step approach, which either do not model sum scores or have limited flexibility in modeling sum scores. Second, as our illustrative example shows, our sum score model can mimic the results of the pseudo-indicator approach, which was proposed to emulate the results of the two-step approach, and thus offers all the advantages of the pseudo-indicator approach. Particularly, it allows researchers to include a sum score retaining its components in the model without affecting the model-implied variance-covariance matrix of the target model, i.e., the model that contains the sum score and other variables of interest but not the sum score’s components. Third, our sum score model offers researchers the opportunity to assess whether the sum score fully transmits the collective effects of or on the variables that make up a sum score. This is not possible with the two-step approach because the components of a sum score are not modeled in this approach. Similarly, the one-step approach is limited in this regard because it does not allow a sum score to be modeled as a dependent variable. Moreover, considering the pseudo-indicator approach, Rose et al. (2019, p. 6) mention that “some of the rules may be relaxed to simplify the model or to consider specific assumptions.” In addition, they provide an example in which they fix the covariances between the components of a sum score and the measurement error variances of the indicators of one or more latent variables to zero to ensure that the covariances between the sum scores’ components and the latent variables’ indicators are fully accounted for by the sum score. Although this way of specifying the covariance allows for modeling a sum score that fully transmits the collective effects of its components, there is currently no guidance on how to constrain these covariances to ensure that a sum score fully transmits the collective effects on its components. It is up to future research to show whether it is possible to assess the full transmission assumption in general using the pseudo-indicator approach and, if so, how it should be done. Note that the usefulness of assessing the full transmission assumption depends on the specific research context and is not a requirement for using composite scores including sum scores in SEM. Therefore, researchers are encouraged to carefully consider the exact specification of the sum score model. Fourth, the H–O specification allows us to freely estimate weights, which gives researchers more flexibility and overcomes a further limitation of the pseudo-indicator approach. Although our illustrative example and first studies provide arguments for using composites with free weights (e.g., Grace & Bollen, 2008; Heise, 1972), future research needs to provide sophisticated guidelines and further recommendations as to when free weights should be preferred over fixed weights such as unit weights. Against this background, the sum score model based on the H–O specification allows researchers to better judge and defend the use of sum scores. They can do this empirically, e.g., by means of model comparisons using a chi-square difference test or information criteria such as the Akaike information criterion (AIC, Akaike, 1998) or the Bayesian information criterion (BIC, Schwarz, 1978), as well as conceptually, in that researchers can better understand whether the model containing sum scores reflects their theoretical arguments. Table 4 juxtaposes the properties of the different approaches for dealing with sum scores in SEM.

Table 4 Properties of the different approaches for dealing with sum scores in SEM

Note: ✓: possible; 🔾: limitedly possible; ✗: not possible; ?: currently unclear

Further, our study presents two ways of accounting for random measurement error in the sum score model, i.e., on the observed variable or the sum score level. Note that these ways of dealing with random measurement errors are not unique to our sum score model, but can also be applied to the other approaches presented. Accounting for random measurement error allows researchers to address attenuation bias in the parameter estimates (Cohen et al., 1990). For this purpose, reliability estimates of the variables that make up a sum score or a reliability estimate of the sum score are required. Empirical researchers can obtain such reliability estimates from existing literature, or they could use the test-retest method (Guttman, 1945) to estimate the reliability of an observed variable or a sum score. Further, a closed-form formula for McDonald’s (1999) $\omega $ can be used to estimate the reliability of a sum score under unidimensionality without running a confirmatory factor analysis (Hancock & An, 2020). Similarly, Cronbach’s $\alpha $ can be used to determine the reliability of a sum score if its variables are essentially tau-equivalent measures (Novick & Lewis, 1967).⁸ Alternatively, if various measures for the variables that make up a sum score are available, the original variables can be replaced by latent variables using the measures as indicators. If possible, this approach is preferred because the researcher does not risk incorrect statistical inference by ignoring uncertainty in the reliability estimate (Oberski & Satorra, 2013).

A potential drawback of the presented sum score model could be its complexity as it additionally introduces new composites, i.e., the excrescent variables. Compared to the two-step approach, which omits the creation of the sum score from the model, this is certainly true. However, the two-step approach sacrifices technical rigor in favor of practicality (Li & Calantone, 1998). In comparison to the one-step approach, our sum score model is not more complex. Although additional variables need to be specified in the sum score model, the number of free model parameters and thus degrees of freedom remain the same. While in the one-step approach, the variances and covariances of the variables that make up a sum score are free model parameters, in our sum score model, these parameters are replaced by the same number of free model parameters, i.e., variances of the excrescent variables and the sum score, covariances between the excrescent variables, and composite loadings. Note that the use of more complicated parameterization to achieve a specific goal is not uncommon in SEM. For instance, Rindskopf (1984) introduced phantom and imaginary variables to model equality and inequality constraints in SEM.

A limitation of our sum score model is that we followed the SEM framework proposed by Jöreskog (1970) and therefore assumed that the observed variables of a model, including those making up a sum score, follow a multivariate normal distribution. However, in empirical studies, this assumption is likely to be violated. To account for this fact, ML estimation with robust standard errors and test statistics could be used (e.g., Satorra & Bentler, 1994). Further, the observed variables are often categorical in empirical research, e.g., as responses to five- or seven-point Likert scales (Rhemtulla et al., 2012). In such cases, treating the observed variables as continuous could lead to biased parameter estimates (Johnson & Creech, 1983). Although weighted least squares estimators have been proposed in the SEM context to deal with categorical observed variables (e.g., Lee et al., 1990; Muthén, 1984), future research needs to show whether these estimators are compatible with the sum score model.

Data availability statement

The results of the illustrative example including the R code to reproduce the results are available in the Open Science Framework repository at https://osf.io/y3m4r/?view_only=e2b016994d764bd28645c346a933409d

Acknowledgements

No acknowledgments to be submitted.

Funding statement

This work was supported by national funds through FCT (Fundação para a Ciência e a Tecnologia), under the project UIDB/04152/2020 (DOI: 10.54499/UIDB/04152/2020)—Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS.

Competing interests

The authors declare none.

APPENDIX

A.1 Collective effects for the example model from Figure 1

In the following, we derive the collective effects of the observed variables $\boldsymbol{x}$ making up the composite of interest $\eta$ on the outcome variables $\boldsymbol{y}$ for the example model shown in Figure 1. For this example model, the collective effects are calculated as the coefficients of the regression of $\boldsymbol{y}$ on $\boldsymbol{x}$, i.e., $\boldsymbol{y} = \boldsymbol{B} \boldsymbol{x} + \boldsymbol{u}$, where $\boldsymbol{y} = \boldsymbol{\gamma} \eta + \boldsymbol{\zeta}$, $\boldsymbol{x} = \boldsymbol{\Lambda} \begin{pmatrix} \eta \\ \boldsymbol{\nu} \end{pmatrix}$, and $\text{E}(\boldsymbol{u} \mid \boldsymbol{x})=\mathbf{0}$. For the sake of simplicity, it is assumed that all variables are mean centered. The regression coefficients are calculated as follows:

$$ \begin{align}
\boldsymbol{B} &= \operatorname{\mathrm{E}}(\boldsymbol{y} \boldsymbol{x'})\operatorname{\mathrm{E}}(\boldsymbol{x} \boldsymbol{x'})^{-1} \tag{A.1} \\
&= \operatorname{\mathrm{E}}\left((\boldsymbol{\gamma} \eta + \boldsymbol{\zeta})\begin{pmatrix}\eta & \boldsymbol{\nu'} \end{pmatrix} \boldsymbol{\Lambda'}\right)\operatorname{\mathrm{E}}\left(\boldsymbol{\Lambda} \begin{pmatrix} \eta \\ \boldsymbol{\nu} \end{pmatrix} \begin{pmatrix} \eta & \boldsymbol{\nu'} \end{pmatrix} \boldsymbol{\Lambda'}\right)^{-1} \tag{A.2} \\
&= \operatorname{\mathrm{E}}\left(\boldsymbol{\gamma} \eta\begin{pmatrix}\eta & \boldsymbol{\nu'} \end{pmatrix} \boldsymbol{\Lambda'}\right)\operatorname{\mathrm{E}}\left(\boldsymbol{\Lambda} \begin{pmatrix} \eta \\ \boldsymbol{\nu} \end{pmatrix} \begin{pmatrix} \eta & \boldsymbol{\nu'} \end{pmatrix} \boldsymbol{\Lambda'}\right)^{-1} + \tag{A.3} \\
&\phantom{=}\; \operatorname{\mathrm{E}}\left(\boldsymbol{\zeta}\begin{pmatrix}\eta & \boldsymbol{\nu'} \end{pmatrix} \boldsymbol{\Lambda'}\right)\operatorname{\mathrm{E}}\left(\boldsymbol{\Lambda} \begin{pmatrix} \eta \\ \boldsymbol{\nu} \end{pmatrix} \begin{pmatrix} \eta & \boldsymbol{\nu'} \end{pmatrix} \boldsymbol{\Lambda'}\right)^{-1} \tag{A.4} \\
&= \boldsymbol{\gamma} \begin{pmatrix} \text{var}(\eta) & \mathbf{0}' \end{pmatrix}\boldsymbol{\Lambda'}\boldsymbol{\Lambda}^{\prime-1} \text{diag}(\text{var}(\eta),\text{var}(\boldsymbol{\nu}))^{-1} \boldsymbol{\Lambda}^{-1} + \mathbf{0} \tag{A.5} \\
&= \boldsymbol{\gamma} \begin{pmatrix} \text{var}(\eta) & \mathbf{0}' \end{pmatrix} \text{diag}(\text{var}(\eta)^{-1},\text{var}(\boldsymbol{\nu})^{-1})\boldsymbol{W'} \tag{A.6} \\
&= \boldsymbol{\gamma} \begin{pmatrix} 1 & \mathbf{0}' \end{pmatrix}\boldsymbol{W'} \tag{A.7} \\
&= \boldsymbol{\gamma} \boldsymbol{w'} \tag{A.8}
\end{align} $$
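As a numerical sanity check of the result $\boldsymbol{B} = \boldsymbol{\gamma}\boldsymbol{w'}$, the following Python sketch builds a small model with three observed variables and two outcome variables and evaluates (A.1) in exact rational arithmetic. It uses $\boldsymbol{\Lambda} = (\boldsymbol{W'})^{-1}$, as the step from (A.5) to (A.6) implies. All numerical values (weights, variances, covariances, and $\boldsymbol{\gamma}$) and all helper names are hypothetical and chosen for illustration only.

```python
from fractions import Fraction


def matmul(a, b):
    """Matrix product of two matrices stored as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def inverse(a):
    """Gauss-Jordan inverse in exact arithmetic (assumes a is nonsingular)."""
    n = len(a)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [v - f * c for v, c in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


# Hypothetical weight matrix W': first row holds the weights w' of the
# composite eta, the remaining rows hold the weights of the excrescent variables.
W_t = [[2, 1, 3],
       [1, -1, 0],
       [0, 1, -1]]

# Hypothetical covariance matrix of (eta, nu): eta is uncorrelated with nu.
Phi = [[4, 0, 0],
       [0, 2, 1],
       [0, 1, 3]]

# Hypothetical effects of eta on two outcome variables y.
gamma = [[Fraction(1, 2)], [Fraction(-3, 4)]]

Lambda = inverse(W_t)                                   # Lambda = (W')^{-1}
Lambda_t = transpose(Lambda)
Sigma_xx = matmul(matmul(Lambda, Phi), Lambda_t)        # E(x x')
Sigma_yx = matmul(matmul(gamma, [Phi[0]]), Lambda_t)    # E(y x') = gamma (var(eta) 0') Lambda'
B = matmul(Sigma_yx, inverse(Sigma_xx))                 # (A.1)
gamma_w = matmul(gamma, [W_t[0]])                       # (A.8)

print(B == gamma_w)  # prints True
```

Because the arithmetic is exact, the comparison does not depend on a numerical tolerance; changing the covariance between $\eta$ and $\boldsymbol{\nu}$ to a nonzero value would break the equality.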

A.2 Model specifications used in the scenario analysis

A.2.1 Scenario 1

Figure A1 Approach 1) The sum score model assuming full transmission.

Figure A2 Approach 2) The sum score model relaxing the full transmission assumption.

Figure A3 Approach 3) The pseudo-indicator approach using unit weights.

Figure A4 Approach 4) The two-step approach.

Figure A5 Approach 5) The refined H–O specification.

A.2.2 Scenario 2

Figure A6 Approach 6) The sum score model assuming full transmission and not taking measurement error into account.

Figure A7 Approach 7) The sum score model assuming full transmission and taking random measurement error into account on the observed variable level.

Figure A8 Approach 8) The refined H–O specification taking into account random measurement error on the observed variable level.

A.2.3 Scenario 3

Figure A9 Approach 9) The sum score model relaxing the full transmission assumption.

Figure A10 Approach 10) The two-step approach.

Figure A11 Approach 11) The sum score model relaxing the full transmission assumption and taking into account random measurement error on the sum score level.

Figure A12 Approach 12) The two-step approach with a correction for random measurement error.

Footnotes

1 More recently, it has been proposed to calculate a sum score as the factor score for a parallel measurement model (McNeish & Wolf, 2020). This approach could also be considered a two-step approach, as the first step is to extract the factor scores, and the second step is to study the relationships between the factor scores and potential other variables. Although the factor score extracted from a parallel measurement model is perfectly correlated with the sum score, the factor score is not necessarily identical to the sum score because the factor score is likely to have a different variance depending on the parameterization of the parallel measurement model. This was also recognized by McNeish and Wolf (2020), who recommend the pseudo-indicator approach (Rose et al., 2019) for researchers interested in obtaining scores that are exactly equal to sum scores. As the pseudo-indicator approach is discussed in our paper, we do not discuss the approach based on the parallel measurement model.

2 We assume that the observed variables are not perfectly linearly dependent, i.e., their variance-covariance matrix $\boldsymbol {\Sigma }$ is of full rank.

3 As can be seen immediately, if the sum of the loadings of the composite of interest was fixed to the number of variables that make up the composite, the composite of interest would be the average of its components.

4 In the case of free weights, this assumption cannot be violated if there is only one other variable in the model besides the composite and its components.

5 In the literature on causal-formative measurement models, it has been proposed to partition the causal indicators into at least two composites and to use these composites as reflective indicators of a latent variable to allow for random measurement error in the composites (Treiblmaier et al., 2011). Although this approach no longer models the sum score as a single variable, it is also conceivable for our sum score model. For more details, we refer the interested reader to the original study of Treiblmaier et al. (2011).

6 The complete R code and the results can be accessed via the following link: https://osf.io/y3m4r/?view_only=e2b016994d764bd28645c346a933409d.

7 As Rose et al. (2019) explained, the pseudo-indicator approach, and thus the sum score model with relaxed full transmission assumption, produces the exact same $\chi^2$ test statistic, df, and RMSEA as the target model. However, the SRMR value can differ between the target model and the pseudo-indicator approach because the number of residuals is different. To ensure that the pseudo-indicator approach and the sum score model produce the SRMR of the target model, the calculation of the SRMR needs to be adjusted (Rose et al., 2019).

8 Otherwise, if these variables are congeneric measures, Cronbach’s $\alpha$ still provides a lower bound estimate for the reliability of a sum score (Sijtsma, 2009).

References

Akaike, H. (1998). Information theory and an extension of the maximum likelihood principle. In Parzen, E., Tanabe, K., & Kitagawa, G. (Eds.), Selected papers of Hirotugu Akaike (pp. 199–213). Springer Series in Statistics (Perspectives in Statistics). Springer. https://doi.org/10.1007/978-1-4612-1694-0_15

Allison, P. D. (1987). Estimation of linear models with incomplete data. Sociological Methodology, 17, 71. https://doi.org/10.2307/271029

Allison, P. D. (2003). Missing data techniques for structural equation modeling. Journal of Abnormal Psychology, 112(4), 545–557. https://doi.org/10.1037/0021-843x.112.4.545

Bollen, K. A. (1989). Structural equations with latent variables. Wiley.

Bollen, K. A., & Lennox, R. (1991). Conventional wisdom on measurement: A structural equation perspective. Psychological Bulletin, 110(2), 305–314. https://doi.org/10.1037/0033-2909.110.2.305

Bozdogan, H. (1987). Model selection and Akaike’s information criterion (AIC): The general theory and its analytical extensions. Psychometrika, 52(3), 345–370. https://doi.org/10.1007/BF02294361

Chen, L., Savalei, V., & Rhemtulla, M. (2020). Two-stage maximum likelihood approach for item-level missing data in regression. Behavior Research Methods, 52(6), 2306–2323. https://doi.org/10.3758/s13428-020-01355-x

Cheniti, M., Boukezzoula, N. E., & Akhtar, Z. (2017). Symmetric sum-based biometric score fusion. IET Biometrics, 7(5), 391–395. https://doi.org/10.1049/iet-bmt.2017.0015

Cohen, P., Cohen, J., Teresi, J., et al. (1990). Problems in the measurement of latent variables in structural equations causal models. Applied Psychological Measurement, 14(2), 183–196. https://doi.org/10.1177/014662169001400207

Cole, D. A., & Preacher, K. J. (2014). Manifest variable path analysis: Potentially serious and misleading consequences due to uncorrected measurement error. Psychological Methods, 19(2), 300–315. https://doi.org/10.1037/a0033805

Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16(3), 297–334. https://doi.org/10.1007/bf02310555

Devlieger, I., & Rosseel, Y. (2017). Factor score path analysis: An alternative for SEM? Methodology, 13, 31–38. https://doi.org/10.1027/1614-2241/a000130

Dijkstra, T. K. (2017). A perfect match between a model and a mode. In Latan, H., & Noonan, R. (Eds.), Partial least squares path modeling: Basic concepts, methodological issues and applications (pp. 55–80). Springer. https://doi.org/10.1007/978-3-319-64069-3_4

Edelsbrunner, P. A. (2022). A model and its fit lie in the eye of the beholder: Long live the sum score. Frontiers in Psychology, 13. https://doi.org/10.3389/fpsyg.2022.986767

Edwards, J. R., & Bagozzi, R. P. (2000). On the nature and direction of relationships between constructs and measures. Psychological Methods, 5(2), 155. https://doi.org/10.1037/1082-989X.5.2.155

Grace, J. B., & Bollen, K. A. (2008). Representing general theoretical concepts in structural equation models: The role of composite variables. Environmental and Ecological Statistics, 15(2), 191–213. https://doi.org/10.1007/s10651-007-0047-7

Graham, J. M. (2008). The general linear model as structural equation modeling. Journal of Educational and Behavioral Statistics, 33(4), 485–506. https://doi.org/10.3102/1076998607306151

Guttman, L. (1945). A basis for analyzing test-retest reliability. Psychometrika, 10(4), 255–282. https://doi.org/10.1007/bf02288892

Hancock, G. R., & An, J. (2020). A closed-form alternative for estimating ω reliability under unidimensionality. Measurement: Interdisciplinary Research and Perspectives, 18(1), 1–14. https://doi.org/10.1080/15366367.2019.1656049

Hardt, K., Hecht, M., Oud, J. H. L., et al. (2018). Where have the persons gone? – An illustration of individual score methods in autoregressive panel models. Structural Equation Modeling: A Multidisciplinary Journal, 26(2), 310–323. https://doi.org/10.1080/10705511.2018.1517355

Hayduk, L. A. (1996). LISREL issues, debates and strategies. Johns Hopkins University Press.

Hayduk, L. A., & Littvay, L. (2012). Should researchers use single indicators, best indicators, or multiple indicators in structural equation models? BMC Medical Research Methodology, 12(159), 1–17. https://doi.org/10.1186/1471-2288-12-159

Heise, D. R. (1972). Employing nominal variables, induced variables, and block variables in path analyses. Sociological Methods & Research, 1(2), 147–173. https://doi.org/10.1177/004912417200100201

Henseler, J. (2021). Composite-based structural equation modeling: Analyzing latent and emergent variables. Guilford Press.

Henseler, J., Ringle, C. M., & Sarstedt, M. (2015). A new criterion for assessing discriminant validity in variance-based structural equation modeling. Journal of the Academy of Marketing Science, 43(1), 115–135. https://doi.org/10.1007/s11747-014-0403-8

Hu, L.-t., & Bentler, P. M. (1998). Fit indices in covariance structure modeling: Sensitivity to underparameterized model misspecification. Psychological Methods, 3(4), 424–453. https://doi.org/10.1037/1082-989X.3.4.424

Hu, L.-t., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal, 6(1), 1–55. https://doi.org/10.1080/10705519909540118

Johnson, D. R., & Creech, J. C. (1983). Ordinal measures in multiple indicator models: A simulation study of categorization error. American Sociological Review, 48(3), 398–407. https://doi.org/10.2307/2095231

Jöreskog, K. G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika, 34(2), 183–202. https://doi.org/10.1007/bf02289343

Jöreskog, K. G. (1970). A general method for analysis of covariance structures. Biometrika, 57(2), 239–251. https://doi.org/10.2307/2334833

Jöreskog, K. G. (1978). Structural analysis of covariance and correlation matrices. Psychometrika, 43(4), 443–477. https://doi.org/10.1007/bf02293808

Klopp, E., & Klößner, S. (2021). The impact of scaling methods on the properties and interpretation of parameter estimates in structural equation models with latent variables. Structural Equation Modeling: A Multidisciplinary Journal, 28(2), 182–206. https://doi.org/10.1080/10705511.2020.1796673

Kuder, G. F., & Richardson, M. W. (1937). The theory of the estimation of test reliability. Psychometrika, 2(3), 151–160. https://doi.org/10.1007/bf02288391

Lee, S. Y., Poon, W. Y., & Bentler, P. (1990). A three-stage estimation procedure for structural equation models with polytomous variables. Psychometrika, 55(1), 45–51. https://doi.org/10.1007/BF02294742

Li, T., & Calantone, R. J. (1998). The impact of market knowledge competence on new product advantage: Conceptualization and empirical examination. Journal of Marketing, 62(4), 13–29. https://doi.org/10.1177/002224299806200402

Little, T. D., Cunningham, W. A., Shahar, G., et al. (2002). To parcel or not to parcel: Exploring the question, weighing the merits. Structural Equation Modeling: A Multidisciplinary Journal, 9(2), 151–173. https://doi.org/10.1207/s15328007sem0902_1

Little, T. D., Slegers, D. W., & Card, N. A. (2006). A non-arbitrary method of identifying and scaling latent variables in SEM and MACS models. Structural Equation Modeling: A Multidisciplinary Journal, 13(1), 59–72. https://doi.org/10.1207/s15328007sem1301_3

Lundberg, I., Johnson, R., & Stewart, B. M. (2021). What is your estimand? Defining the target quantity connects statistical evidence to theory. American Sociological Review, 86(3), 532–565. https://doi.org/10.1177/00031224211004187

MacCallum, R. C., & Browne, M. W. (1993). The use of causal indicators in covariance structure models: Some practical issues. Psychological Bulletin, 114(3), 533–541. https://doi.org/10.1037/0033-2909.114.3.533

McDonald, R. P. (1999). Test theory: A unified treatment. Mahwah, NJ: Lawrence Erlbaum.

McNeish, D., & Wolf, M. G. (2020). Thinking twice about sum scores. Behavior Research Methods, 52(6), 2287–2305. https://doi.org/10.3758/s13428-020-01398-0

Mosier, C. I. (1943). On the reliability of a weighted composite. Psychometrika, 8(3), 161–168. https://doi.org/10.1007/bf02288700

Muthén, B. (1984). A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators. Psychometrika, 49(1), 115–132. https://doi.org/10.1007/BF02294210

Novick, M., & Lewis, C. (1967). Coefficient alpha and the reliability of composite measurements. Psychometrika, 32(1), 1–13.

Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric theory (3rd ed.). McGraw-Hill.

Oberski, D. L., & Satorra, A. (2013). Measurement error models with uncertainty about the error variance. Structural Equation Modeling: A Multidisciplinary Journal, 20(3), 409–428. https://doi.org/10.1080/10705511.2013.797820

Ogasawara, H. (2007). Asymptotic expansions of the distributions of estimators in canonical correlation analysis under nonnormality. Journal of Multivariate Analysis, 98(9), 1726–1750. https://doi.org/10.1016/j.jmva.2006.12.001

R Core Team (2022). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/

Rhemtulla, M., Brosseau-Liard, P. E., & Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17(3), 354–373.

Richardson, M. W. (1936). Notes on the rationale of item analysis. Psychometrika, 1(1), 69–76. https://doi.org/10.1007/bf02287926

Rindskopf, D. (1984). Using phantom and imaginary latent variables to parameterize constraints in linear structural models. Psychometrika, 49(1), 37–47. https://doi.org/10.1007/bf02294204

Roemer, E., Schuberth, F., & Henseler, J. (2021). HTMT2–An improved criterion for assessing discriminant validity in structural equation modeling. Industrial Management & Data Systems, 121(12), 2637–2650. https://doi.org/10.1108/imds-02-2021-0082

Rönkkö, M., & Cho, E. (2022). An updated guideline for assessing discriminant validity. Organizational Research Methods, 25(1), 6–47. https://doi.org/10.1177/1094428120968614

Rose, N., Wagner, W., Mayer, A., et al. (2019). Model-based manifest and latent composite scores in structural equation models. Collabra: Psychology, 5(1). https://doi.org/10.1525/collabra.143

Rosseel, Y. (2012). lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(2), 1–36. https://doi.org/10.18637/jss.v048.i02

Saris, W. E., & Gallhofer, I. N. (2014). Design, evaluation, and analysis of questionnaires for survey research (2nd ed.). Wiley Series in Survey Methodology. John Wiley & Sons, Inc.

Satorra, A., & Bentler, P. M. (1994). Corrections to test statistics and standard errors in covariance structure analysis. In von Eye, A., & Clogg, C. C. (Eds.), Latent variable analysis: Applications to developmental research (pp. 399–419). SAGE Publications.

Savalei, V. (2019). A comparison of several approaches for controlling measurement error in small samples. Psychological Methods, 24(3), 352–370. https://doi.org/10.1037/met0000181

Savalei, V., & Bentler, P. M. (2009). A two-stage approach to missing data: Theory and application to auxiliary variables. Structural Equation Modeling: A Multidisciplinary Journal, 16(3), 477–497. https://doi.org/10.1080/10705510903008238

Schuberth, F. (2023). The Henseler–Ogasawara specification of composites in structural equation modeling: A tutorial. Psychological Methods, 28(4), 843–859. https://doi.org/10.1037/met0000432

Schuberth, F., Schamberger, T., & Henseler, J. (2023). More powerful parameter tests? No, rather biased parameter estimates. Some reflections on path analysis with weighted composites. Behavior Research Methods, 56, 4205–4215. https://doi.org/10.3758/s13428-023-02256-5

Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), 461–464. https://doi.org/10.1214/aos/1176344136

Sijtsma, K. (2009). On the use, the misuse, and the very limited usefulness of Cronbach’s alpha. Psychometrika, 74(1), 107–120. https://doi.org/10.1007/s11336-008-9101-0

Skrondal, A., & Laake, P. (2001). Regression among factor scores. Psychometrika, 66(4), 563–575. https://doi.org/10.1007/bf02296196

Traub, R. E. (2005). Classical test theory in historical perspective. Educational Measurement: Issues and Practice, 16(4), 8–14. https://doi.org/10.1111/j.1745-3992.1997.tb00603.x

Treiblmaier, H., Bentler, P. M., & Mair, P. (2011). Formative constructs implemented via common factors. Structural Equation Modeling: A Multidisciplinary Journal, 18(1), 1–17. https://doi.org/10.1080/10705511.2011.532693

Wang, M., & Reeve, B. B. (2021). Evaluations of the sum-score-based and item response theory-based tests of group mean differences under various simulation conditions. Statistical Methods in Medical Research, 30(12), 2604–2618. https://doi.org/10.1177/09622802211043263

Werts, C. E., Linn, R. L., & Jöreskog, K. G. (1974). Intraclass reliability estimates: Testing structural assumptions. Educational and Psychological Measurement, 34(1), 25–33. https://doi.org/10.1177/001316447403400104

Widaman, K. F., & Revelle, W. (2023). Thinking thrice about sum scores, and then some more about measurement and analysis. Behavior Research Methods, 55, 788–806. https://doi.org/10.3758/s13428-022-01849-w

Yu, X., Schuberth, F., & Henseler, J. (2023). Specifying composites in structural equation modeling: A refinement of the Henseler–Ogasawara specification. Statistical Analysis and Data Mining, 16(4), 348–357. https://doi.org/10.1002/sam.11608



