1 Introduction
Children live in multiple environments: home, school, neighborhood, community and society. Within the interaction between children and these environments are risk factors for victimization from bullying. Bullying in the school setting has become a significant public health issue in the United States (Perkins, Perkins & Craig, Reference Perkins, Perkins and Craig2009). Public concern about the pervasiveness and harmful consequences of bullying in its schools has spanned the U.S. political spectrum (Puhl, Luedicke & King, Reference Puhl, Luedicke and King2015). From 1999 to 2010, more than 120 anti-bullying bills were enacted by 46 U.S. state legislatures (U.S. Department of Education, 2011).
Bullying induces emotional and physical fear in students. A relentless threat of direct or indirect actions such as teasing, threatening to cause harm, intentional social exclusion, spreading rumors, or physical or verbal attacks pervades the lives of bullying victims. Student truancy and drop-out rates are higher in schools with higher bullying rates (Eriksen, Nielsen & Simonsen, Reference Eriksen, Nielsen and Simonsen2014). And although it is inherently difficult to assess the long-term human capital consequences of bullying separate from the effects of other genetic and environmental factors, an extensive multidisciplinary chronicle of evidence suggests that some of the emotional and/or physical damages from bullying can extend into adulthood. Bullying victims exhibit higher rates of unemployment and financial, mental, and physical health problems (Baldry, Reference Baldry2004; Arseneault et al., Reference Arseneault, Walsh, Trzesniewski, Newcombe, Caspi and Moffitt2006; Arseneault, Bowes & Shakoor, Reference Arseneault, Bowes and Shakoor2010; Brown & Taylor, Reference Brown and Taylor2008; Wolke, Copeland, Angold & Costello, Reference Wolke, Copeland, Angold and Costello2013; Sarzosa & Urzúa, Reference Sarzosa and Urzúa2015). This evidence suggests a tangible ex post human capital benefit of reducing children’s exposure to bullying.
In addition to the conditional or unconditional altruism parents experience from their children’s human capital formation, they may directly enjoy their own lives more when they can engage or participate in the lives of their children (Agee & Crocker, Reference Agee and Crocker2011). From an economic perspective, the value of a public anti-bullying program partly depends on whether parents themselves value the program since they value their children’s well-being. A program in which parents realize no personal value will obtain less political support. But we are unaware of any findings offering estimates of the value parents attach to reducing the risk their children will be victimized by bullies at school.
A well-established economic valuation literature (e.g., Agee & Crocker, Reference Agee and Crocker2007; Dickie & Gerking, Reference Dickie and Gerking2007) shows that parents’ ex ante preference-based values can be a significant component of the benefits of reducing environmental disamenities affecting children. Here our focus on school bullying suggests the analytical framework this literature embodies can monetize the benefits of a broad range of educational and social interventions to be set against program money costs. Also, our further focus on parental values takes account of the fact that children live in households. Household parents or caregivers make resource reallocations in response to changes in their children’s environments. Overlooking behavioral reactions and the endogeneity of parental reactions to these changes will undervalue them (Pitt, Rosenzweig & Hassan, Reference Pitt, Rosenzweig and Hassan1990). Rarely does the noneconomic literature on educational and social interventions systematically recognize these issues.
In this paper, we apply a unique data set to a discrete choice version of the household production framework (Becker, Reference Becker1965; Lancaster, Reference Lancaster1966) to assess parents’ ex ante preference-based value to reduce the risk of their child being bullied at school. We treat parents’ ex ante perceived quality of their child’s current school as conditional upon their child’s exposure to bullying, school and family characteristics, and the child protection efforts parents choose to make. Parents’ preference-based value is the monetary equivalent of the impact of the child’s bully victimization on parents’ discrete choice to change their child’s school. Section 2 outlines a simple household production model, derives value expressions, and describes our estimation and identification strategies. Section 3 details the data and presents empirical results based on bivariate probit estimates of parents’ conditional demand for a school change. Section 4 provides an illustrative comparison of parental values to the costs of several commonly used U.S. anti-bullying programs. Section 5 offers concluding remarks and some caveats.
2 Model
The household production framework (e.g., Becker, Reference Becker1965; Lancaster, Reference Lancaster1966 serves as the theoretical basis for deriving parents’ ex ante willingness to pay for reduced own child bully victimization. This framework emphasizes that households combine market good purchases with time and other nonmarket inputs to produce commodities for final consumption. Applications often posit a cooperative agreement among adult household members about the utility function to be maximized subject to a full-income budget constraint determined by members’ pooled resources (e.g., Scapecchi, Reference Scapecchi2005). Let parents’ expected utility, $U(\cdot )$ , be a quasiconcave, weakly separable function of consumption goods, $Z$ , that directly affect utility, and their perceived quality, $Q$ , of their child’s schooling:
where $U_{Z}>0$ and $U_{Q}>0$ . The variable $Q$ measures on a real interval parents’ perceptions of overall school quality. A subset of consumption goods, $Z$ , (e.g., interactions with their child; location of the family home) can also affect $Q$ . Perceived quality $Q(\cdot )$ is a concave, weakly separable function of parents’ known attributes, $\unicode[STIX]{x1D6FC}$ , of their child’s school (e.g., teacher qualifications, class composition, problems with bullying) and the set, $R$ , of protection inputs, which affect parental utility indirectly by way of their direct impact on perceived school quality (e.g., seeking support and help from teachers or school administrators, moving the child to a new school). The vectors, $\unicode[STIX]{x1D6FE}_{U}$ and $\unicode[STIX]{x1D6FE}_{Q}$ denote exogenous or predetermined household and school attributes that may influence parents’ preferences and/or their production or perceptions of school quality.
Households face a full-income budget constraint written as:
where full prices are $q_{j}=(m_{j}+w\unicode[STIX]{x1D70F}_{j})$ , $j=Z,R$ , and full income is $Y=I+w(T-\unicode[STIX]{x1D70F}_{L})$ . $m_{j}$ is the money price of commodity $j$ ; $w$ is the opportunity cost of parental time; $\unicode[STIX]{x1D70F}_{j}$ is the time required to produce and/or consume one unit of commodity $j$ ; and $\unicode[STIX]{x1D70F}_{L}$ is parental time spent away from work, including childcare time and personal leisure. Full income $Y$ is the sum of nonlabor income, $I$ , and labor income, $w(T-\unicode[STIX]{x1D70F}_{L})$ . Households allocate their time and market goods purchases to produce commodities, including perceived quality of child schooling, so as to maximize their joint utility function in (1) subject to their technology, $Q(\cdot )$ , and time and income constraints embedded in (2). Children are presumed passive regarding parents’ decisions.
If parents’ perceptions of school quality can be influenced by their personal choices, the abovementioned model can be used to derive a preference-based value associated with an exogenous change in any perceived school attribute contained in $\unicode[STIX]{x1D6FC}$ . Given the assumed curvature properties of the school quality production function, $Q(\cdot )$ , and household utility function in (1), parents’ maximization of (1) subject to (2) yields their indirect utility function, $v(\cdot )$ , which is continuous and strictly increasing in income, $Y$ . $v(\cdot )$ can be inverted to find the expenditure function, $E(\cdot )$ , which satisfies
Denoting $\mathit{Victim}\in \unicode[STIX]{x1D6FC}$ as an indicator of parents’ awareness that their child is being bullied at school, their marginal valuation, $MV_{\mathit{Victim}}$ , of a reduction in child bullying is given by:
where $\unicode[STIX]{x1D706}\equiv \unicode[STIX]{x2202}v(\cdot )/\unicode[STIX]{x2202}Y$ denotes the marginal utility of income. Expression (4) portrays parents’ marginal disutility of bullying, converted to monetary units by way of the marginal utility of income. Expression (4) is empirically intractable because actual utility levels are not observed. However, a consensus branch of the valuation literature has shown $MV_{\mathit{Victim}}$ to be estimable using parents’ conditional compensated demand for a remedial input linked to Victim (Freeman, Herriges & Kling, Reference Freeman, Herriges and Kling2014). Herein, a discrete choice version of $MV_{\mathit{Victim}}$ is framed as the marginal impact of a child’s risk of exposure to bullying upon parents’ decision to change their child’s school in a given period, conditional on family characteristics and attributes of the child’s current school. Throughout, we assume the distribution of the child’s school change opportunities to be independent of current school attributes, such as location and socioeconomic makeup which may help (e.g., busing) or hinder (e.g., exclusions) these opportunities. We thus consider only one of the two ways in which attributes of the child’s current school can affect the parents’ school change decision. That is, we account for the impact of the child’s current school upon parental utility but set aside any impact upon school change opportunities.
Observation of discrete choices reflects individuals’ optimization of “conditional” utility when their utility functions include a stochastic term comprised of personal tastes and/or tendencies unobserved by the investigator (e.g., McFadden, Reference McFadden1980). Small and Rosen (Reference Small and Rosen1981), Hau (Reference Hau1985), and others derive expressions for the ex ante utility-preserving tradeoff between money income and an exogenous increase (or decrease) in the price or quality of a particular activity, conditional on the individual’s choice to engage the activity. Bockstael and McConnell (Reference Bockstael and McConnell1983) examine this approach in the context of household production. Following these literature results, let
define household $i$ ’s conditional indirect utility, at time $t$ , associated with changing their child’s current school. As in McFadden (Reference McFadden1974, Reference McFadden and Berg1976), conditional utility in (5) can be split into two components: $\bar{v}_{it}(\cdot )$ represents the conditional benefit of a school change for the “average” parent; and $\unicode[STIX]{x1D707}_{it}$ is a stochastic term encompassing unobserved family-specific influences and random factors that separate parent $i$ from the average. The vector $\unicode[STIX]{x1D6FE}$ contains observable attributes of the family, the child, and the child’s school. Assuming parents base their decision rule on utility maximization, outcome probabilities can be derived specifying parents’ decision rules and a given distribution of utility levels (Hau, Reference Hau1985). Denoting $\bar{v}_{1it}$ as parents’ maximum attainable utility when a school change is chosen, and $\bar{v}_{0it}$ as maximum attainable utility when no change is chosen, parent $i$ at time $t$ will choose to change their child’s school if $\bar{v}_{1it}>\bar{v}_{0it}$ . The probability of a school change conditional on prices, income and observable characteristics, is
Following Small and Rosen (Reference Small and Rosen1981), if the compensated demand curve for a school change is approximated by its Marshallian counterpart, parents’ $MV_{\mathit{Victim}}$ in expression (4) is
Since the vast majority of sample families used in this study send their children to public school, following Willig (Reference Willig1976), tuition and fees for child schooling are small enough (see Table 1) to assume a reasonable compensated-Marshallian approximation.
Expression (7) complicates welfare measurement because parents’ choice of a school change for their child represents only one of plausibly several remedial actions available to parents to reduce their child’s exposure to bullying (e.g., seeking help from teachers). Bockstael and McConnell (Reference Bockstael and McConnell1983) showed in a general context that if positive parental benefits associated with reduced bullying exist even when a school change is not chosen, $MV_{\mathit{Victim}}$ in (7) will underestimate the true $MV_{\mathit{Victim}}$ . Thus, while some parents might find their bullied child’s current school unacceptable, other parents might opt for remedial actions whilst keeping their child’s current school or simply doing nothing. As a consequence, the empirical value estimates reported below are interpreted as a lower bound on parents’ true willingness to pay to reduce their child’s bully victimization.
Empirical specification
To specify the binary choice model for a school change, let parents’ conditional indirect utility function in (5) be expressed as a linear combination of observable variables:
where $\unicode[STIX]{x1D706}$ and $\unicode[STIX]{x1D6FD}_{V}$ , respectively, are interpreted as the conditional marginal utilities of income and bully victimization. The variable Victim enters (8) as a binary indicator of the child’s exposure to bullying, and is assumed endogenous according to:
where $\mathit{Victim}_{it}^{\ast }$ is a latent indicator of child $i$ ’s victimization from bullying at time $t$ , whereby $\mathit{Victim}_{it}=1$ if $\mathit{Victim}_{it}^{\ast }>0$ , and $X_{hit}$ is the $h$ th exogenous or predetermined explanatory variable related to being bullied. As discussed further below, some or all of the explanatory variables in (8) may or may not enter $X_{hit}$ in (9) depending on exclusion restrictions. Assuming the error terms $\unicode[STIX]{x1D707}_{it}$ and $\unicode[STIX]{x1D714}_{it}$ are distributed bivariate normal with mean zero, constant variance, and $\text{corr}(\unicode[STIX]{x1D707}_{it},\unicode[STIX]{x1D714}_{it})=\unicode[STIX]{x1D70C}$ , consistent estimates of (8) and (9) can be obtained using a recursive bivariate probit model. If $\unicode[STIX]{x1D70C}=0$ , $\mathit{Victim}_{it}$ and $\unicode[STIX]{x1D707}_{it}$ are uncorrelated implying $\mathit{Victim}_{it}$ in expression (8) is exogenous.
Inattention to $\unicode[STIX]{x1D70C}\neq 0$ may confound an estimate of the marginal effect of (9) upon (8) with unobserved parent, child, and school variables correlated with observed variables. Bhattacharya, Goldman and McCaffrey (Reference Bhattacharya, Goldman and McCaffrey2006) used Monte Carlo simulations to compare the performance of various estimators commonly used to estimate the effect of a binary treatment variable upon a binary outcome variable. They found the bivariate probit model performed best for generating consistent estimates of the marginal treatment effect. The recursive structure of the bivariate probit model builds on the reduced form equation (9) for the posited endogenous dummy variable, Victim, which then enters the structural form equation (8) determining parents’ conditional benefit of changing their child’s school. This model belongs to a general class of simultaneous equation models with discrete endogenous variables introduced by Heckman (Reference Heckman1978). Heckman stated in his more general context that full rank of the regressor matrix is sufficient for the identification of all model parameters; however, Maddala (Reference Maddala1986) argued that some variables contained in the reduced form equation must be excluded from the structural equation in order to produce consistent estimates. Wilde (Reference Wilde2000) showed that identification is indeed achieved as soon as both equations of the model contain a varying exogenous regressor. While Wilde’s result, commonly referred to as “identification by function form,” does not require availability of additional instruments in the reduced form equation, it is valid only in the context of a bivariate normal distribution. Thus, in the absence of additional instruments, identification relies heavily on the assumption of bivariate normality. Since bivariate normality may be a strong assumption, exclusion restrictions help in making estimation results more robust to distributional misspecification (Jones, Reference Jones2007), including exogeneity tests (Monfardini & Radice, Reference Monfardini and Radice2008). As a result, Table 2 reports bivariate probit estimates of expressions (8) and (9) with and without exclusion restrictions.
a Robust standard errors in parentheses.
Note: Significant at: *** less than 1%; ** less than 5%; * less than 10%.
3 Data and empirical implementation
Data for this study come from the National Institute of Child Health and Human Development Study of Early Child Care and Youth Development (NICHD-SECCYD, 2001), a longitudinal study of the linkages between child behavior and development, particularly as it relates to early child care, schooling, and after school care. Families were recruited shortly after the birth of the subject child in 1991 from 10 geographically dispersed areas of the United States, both urban and rural, and data were collected prospectively from birth to age 15. All sample children had a mother who was over 18 at the time of birth, had no disabilities or health conditions requiring a hospital stay exceeding 7 days postpartum, and lived in a home where English was the first language. In addition, all respondent families affirmed no future plans of moving from their current residence location. Details of the recruitment methods and sampling strategy are available at U.S. Department of Health and Human Services (2010). The initial sample included 1,364 children, very few of whom shared the same school. Although the sample is not nationally representative, it is representative of the demographics of the 10 areas from which the sample was recruited. The NICHD data includes extensive information about parenting, marriage, employment, income, and participation in public programs, as well as other relevant topics, such as detailed assessments of children’s health and scholastic abilities, social and behavioral attributes, and qualities of their home, school, and after school environments.
The present analysis uses data from phase 3 of the NICHD study, which followed sample children from grades two through six. The phase 3 data measures two variables relevant to the current study. First, at the end of each school year, at least one respondent parent or primary caregiver completed a questionnaire regarding attributes of their child’s schooling and, if any, after school care. In Table 1, the dichotomous variable Changed School equals one if the respondent parent moved their child to a different school during the indicated (Table 1) year, or planned a school change the following year for reasons other than ordinary grade progression. Table 1 shows that approximately 4%–5% of sample parents changed their child’s school in either third, fifth, or sixth grades.
Unique to the phase 3 data is the subsample of 895 children who completed (with at least one parent/caregiver present) a detailed questionnaire intended to measure each child’s degree of victimization from bullying at their current school. The questionnaire was given to children on three separate occasions: grade 3 (children approximately age 9), grade 5 (age 11), and grade 6 (age 12). Children’s self-reported victimization from bullying was measured using the Kids in My Class at School questionnaire (U.S. Department of Health and Human Services, 2010). Created specifically for use in the SECCYD, the Kids questionnaire was adapted from an instrument developed by Ladd, Kochenderfer and Coleman (Reference Ladd, Kochenderfer and Coleman1997) to study the impact of bullying victimization upon various measures of children’s elementary school adjustment. Levels of bully victimization measured by the Kids questionnaire have been found to be comparable to levels measured by the more widely used Olweus Bullying Survey (Kochenderfer & Ladd, Reference Kochenderfer and Ladd1996, Reference Kochenderfer and Ladd1997; Henrich & Shahar, Reference Henrich and Shahar2014). Respondent children were asked to choose a number from 1 to 5 (never; hardly ever; sometimes; most of the time; and always). The questionnaire asked children to consider the extent to which they experienced peer bullying or aggression in their class or around school using the following questions: Does anyone in your class ever: (1) pick on you at school; (2) say mean things to you at school; (3) say bad things about you to other kids at school; and/or (4) hit you at school? An overall bullying victim score was then calculated as the sum of the four items (Cronbach’s alpha $=$ 0.76–0.85 across the three waves indicating good internal reliability). In accordance with previous studies utilizing NICHD bullying victim data (e.g., Fanti & Georgiou, Reference Fanti and Georgiou2013; Fanti, Frick & Georgiou, Reference Fanti, Frick and Georgiou2009), this study measures child bully victimization using the constructed binary variable, Victim, which equals one in the indicated year if a sample child’s questionnaire score was at least one standard deviation above the sample mean, and zero otherwise. Based on this classification method, Table 1 shows that approximately 13% of sample children were victims of bullying in grade 3, 14% were victims in grade 5, and 17% were victims in grade 6. These rates are comparable to the 15%–20% U.S. prevalence rate reported by the U.S. Department of Education (NCES, 2014), and by Cook, Williams, Guerra, Kim and Sadek (Reference Cook, Williams, Guerra, Kim and Sadek2010).
Table 1 presents definitions, means, and standard deviations of all covariates used to estimate expressions (8) and (9). Our sample consists of 1,785 observations on the 595 children and families having no missing data for the years 2000, 2002, and 2003. In comparison to the 2003 average U.S. household, the average sample household in Table 1 is more educated (mother’s and father’s education is approximately 14.25 years vs. the U.S. average of 12.7) and earns more income ($77,000 vs. the U.S. average of $51,750), but differs only slightly according to ethnicity (12.9% African American vs. 12.5% U.S., and 6.1% Hispanic vs. 9.1% U.S.).
Variables identified in the literature as important determinants of a child’s risk of being bullied guide the specification of Victim in expression (9). While any child can be the victim of bullying, children who are different, either socially or physically, from their peers are more likely to be victimized. Children who are in the ethnic minority in a school are more likely to be bullied (Graham, Reference Graham2006). Children with special needs are also disproportionately victims of bullying. In particular, children with attention or hyperactivity disorders often report that they are bullied as a result of their disorder (Rose, Allison & Simpson, Reference Rose, Allison, Simpson and Hollar2012). Children with other different physical characteristics may be victimized, such as being overweight or underweight, short or tall, wear their hair differently, wear different clothing, wear glasses, or come from a different socioeconomic background (Cook et al., Reference Cook, Williams, Guerra, Kim and Sadek2010; Duncan, Reference Duncan, Espelage and Swearer2011; Due, Merlo, Harel-Fisch, Trab Damsgaard & Holstein, Reference Due, Merlo, Harel-Fisch, Trab Damsgaard and Holstein2009). In contrast, children who exhibit prosocial behavior (e.g., nonaggressive assertiveness, increased interaction, and conflict resolution skills) are at significantly lower risk of being bullied (Schwartz, Dodge & Coie, Reference Schwartz, Dodge and Coie1993; Egan & Perry, Reference Egan and Perry1998; Eisenberg, Fabes & Spinrad, Reference Eisenberg, Fabes and Spinrad2006). School characteristics also play a major role. Bullying problems proliferate in unsafe and/or unsupportive school environments (Kasen, Johnson, Chen, Crawford & Cohen, Reference Kasen, Johnson, Chen, Crawford, Cohen, Espelage and Swearer2011). One of the strongest predictors of bullying victimization is some form of delinquency occurring on or near school grounds, such as vandalism, carrying of weapons, or gang activity (Swearer et al., Reference Swearer, Peugh, Espelage, Siebecker, Kingsbury, Bevins, Jimerson, Swearer and Espelage2006). Other school characteristics associated with higher bullying rates include understaffed and less academically engaged classrooms, inexperienced teachers and administrators, and inadequate financial support or administrative leadership for teachers (Smedley & Willower, Reference Smedley and Willower1981; Short, Reference Short1988; Tirozzi, Reference Tirozzi2001; Nansel, Haynie & Simonsmorton, Reference Nansel, Haynie and Simonsmorton2003; Doll, Song & Siemers, Reference Doll, Song, Siemers, Espelage, Gutgsell and Swearer2004; Holt & Keyes, Reference Holt, Keyes, Espelage, Gutgsell and Swearer2004).
Parent and child variables entering expression (9) include parent education, income, household composition and ethnicity (identical to child ethnicity), the mother’s age at the birth of the subject child, the subject child’s birth order, height-for-age z-score (indicating relative stature), and gender. Since no sample household changed residence, children’s schooling was mandatory, and schooling options were nearly all public (and thus similar in cost), we use residual income (annual income minus annual school cost) as our income measure. Also entering expression (9) is a dummy variable indicating whether the subject child was diagnosed with an attention or hyperactivity disorder (ADHD), and the child’s total raw score on the Social Skills Rating System prosocial behavior subscale (Prosocial Behavior). The prosocial behavior subscale assesses multiple elements of a child’s general social skills in and out of school (Gresham & Elliott, Reference Gresham and Elliott1990). In Table 1, the Prosocial Behavior total raw score is the arithmetic mean of parents’ and teachers’ Likert scale responses about their observations of each child’s peer and group interactions, problem-solving/decision-making, self-management, communication, cooperation, and assertiveness. The Social Skills Rating System subscale factors for prosocial behavior have been found to be stable over early and middle childhood developmental stages among elementary school children in longitudinal studies (Diperna & Volpe, Reference Diperna and Volpe2005).
Variables used in expression (9) to characterize the subject child’s school include School Delinquencies, the number of student-related delinquencies in the indicated year reported by the school principal (from the NICHD principle questionnaire); Principal Age, a proxy for administrative experience; years of Teacher Education; and Teacher Support, a dummy variable indicating whether the subject child’s teacher (based on NICHD teacher survey responses) acknowledged at least “good” administrative support for teaching quality in the indicated year.
Empirical Results
Preliminary binary probit estimates of the probability of Victim are reported in column 1 of Table 2. Results indicate that subject children who reside in households with a father present, have a higher birth order (possibly the result of increased sibling socialization as in Hay, Payne & Chadwick, Reference Hay, Payne and Chadwick2004, and Swearer, Song, Cary, Eagle & Mickelson, Reference Swearer, Song, Cary, Eagle and Mickelson2001), and greater parent- and teacher-observed prosocial behavior are less likely to be bullied at school. Also less likely to be bullied are children who attend schools staffed with older principals and more educated teachers who acknowledge good or better administrative support for quality teaching. In agreement with the literature, children with attention and/or hyperactivity disorders are more likely to be bullied as are children who attend schools with student delinquency problems. Parental income and education, and mother’s age at birth, as well as child ethnicity, gender, stature, or grade level do not emerge as statistically significant influences on children’s likelihood of bully victimization. Victimization probabilities are assumed independent across grades.
Columns 2–5 in Table 2 report results from estimating two bivariate probit specifications of structural expressions (8) and (9). Both specifications exclude Mom Age at Birth and Child Height as these variables produced collinearity problems and were likewise individually and jointly insignificant in all specifications estimated. At the bottoms of columns 2–5, likelihood ratio (LR) $\unicode[STIX]{x1D712}^{2}$ test statistics reject with $p$ values less than 0.01 the null hypothesis of $\unicode[STIX]{x1D70C}=0$ (Victim is exogenous), where $\unicode[STIX]{x1D70C}$ is the coefficient of correlation between the residuals from expressions (8) and (9). Monfardini and Radice (Reference Monfardini and Radice2008) found the LR test of $\unicode[STIX]{x1D70C}=0$ performed better than Lagrange multiplier and conditional moment tests. Both models have a negative estimated residual correlation, suggesting that unobservable factors which decrease the probability of Victim also decrease the probability of Changed School.
We employ the exogenous or predetermined covariates Birth Order, Principal Age, and Prosocial Behavior as instruments to identify the single endogenous treatment covariate, Victim, in expression (8). Birth Order and Principal Age have no clearly compelling influence upon a parent’s decision to change a child’s school. As for the exclusion of Prosocial Behavior, since longitudinal studies of children’s assessed social skills using the Social Skills Rating System are stable over early to middle childhood, a time-varying school change will not affect a child’s Prosocial Behavior. The attribute likely therefore plays no direct role in a parent’s school change decision.
Statistical inference also supports the logic of the abovementioned exclusion restrictions. First, the reduced form results reported in the first column of Table 2 suggest the variations of these instruments in these data make them credible identifiers. Neither do these instruments appear weak. As Chernozhukov and Hansen (Reference Chernozhukov and Hansen2008) point out, because reduced form estimates are proportional to whatever causal effects are of interest, the statistical significance of these reduced form estimates for the instruments should be high if they are strong. They are high in Table 2. Staiger and Stock (Reference Staiger and Stock1997) propose a rule-of-thumb first-stage F-statistic of at least 10.0 from the OLS regression of a single endogenous variable for an instrument set to qualify as strong. Here, an OLS regression of Victim on our instrument set yields an F-statistic of 29.3, implying that our instrument set is not weak.
Finally, we cannot unequivocally rule out the possibility that our instruments directly affect the School Change outcome as well as the Victim treatment. Nor can we deny the chance they are systematically related to unobservables such as student groupings and their norms. Yet if either of these possibilities holds, the common parameter estimates for School Change in columns 3 and 5 of Table 2 should vary suggestively with inclusion or exclusion of the instrument set. Here this inclusion or exclusion generates no clearly suggestive differences. The signs for all common School Change coefficients in columns 3 and 5 are identical and only Income – School Cost, School Delinquencies, and the Grade 6 dummy variable display any appreciable difference in magnitude and statistical significance between these columns. Nevertheless, the difference between the restricted and the unrestricted estimates for each of these variables is only a tiny fraction of their standard errors. We conclude that our instruments drive Victim, the treatment variable but not School Change, the outcome variable. These empirical counters to weak identification and unobserved heterogeneity make us comfortable in relying on standard methods of statistical inference using the estimated coefficients and standard errors.
Results in Table 2 columns 2–5 indicate that parents’ knowledge of their child’s victimization from bullying is positively associated with their probability of changing schools. Household income net of any school cost is also positively associated with school change probability. Families with more educated mothers are less likely to change schools. Male children are more likely to change. Children less likely to change schools are those studying in schools where teachers obtain more administrative support for their teaching. Estimated coefficients from both bivariate probit specifications exhibit similar signs and magnitudes with a few exceptions. In the unrestricted model, the positive association between Changed School and Income – School Cost is slightly weaker. The unrestricted model in column 5 indicates that Birth Order, Prosocial Behavior, and Principal Age impart no statistically significant influence. Finally, several specifications not reported in Table 2 were estimated containing multiple combinations of interactions among model covariates. In particular, Victim and Income – School Cost were interacted with Child Age and/or Grade dummies to investigate any time-varying relationships between these variables and parents’ choice to change schools. No statistically significant time interactions emerged nor were there any significant interactions between parent, child, and school variables.
Using the model with exclusion restrictions in Table 2 columns 2–3 as the basis for value calculations, Table 3 reports mean estimates and 95% confidence intervals of sample parents’ probability of a school change together with their annual marginal valuations of reduced bully victimization, $MV_{\mathit{Victim}}$ . Table 3 values are calculated in accordance with expression (7): the direct product of the fitted probability of Changed School (for the indicated sample) and the ratio of estimated coefficients for Victim and Income – School Cost. Row 1 of Table 3 reports overall sample mean estimates. Rows 2–3 report estimates for subsamples of parents whose child was bullied and not bullied, and rows 4–6 report subsample estimates by household income. As earlier mentioned, estimated $MV_{\mathit{Victim}}$ constitutes a lower bound on the true value if Changed School is a nonessential input for the production of reduced bully victimization (Bockstael & McConnell, Reference Bockstael and McConnell1983). Since the essential input criterion cannot be guaranteed here, Table 3 figures are interpreted as lower bound estimates, expressed in 2003 dollars, of parents’ preference-based annual value to save one elementary school child from being bullied. In Table 3, the average sample parent is willing to pay $137 annually for a one-victim reduction. Sample parents of a child who was not bullied are willing to pay $55 each year; parents of a bullied child are willing to pay $664. Parental marginal valuations vary inversely with household income. This is due to the combined effect of a constant marginal utility of income and the negative estimated impact of Income – School Cost on the probability of Victim evident in column 2 (and 4) of Table 2.
a 2003 dollars; figures are rounded to the nearest dollar.
b Standard errors are in parentheses.
Source: Washington, DC: National Education Association (2003).
4 Benefits and costs of bullying prevention programs in U.S. schools
Although Table 3 benefit estimates likely understate parents’ true marginal willingness to pay to reduce bully victimization, they nonetheless suggest that U.S. households’ net return on investments in elementary and middle school bullying prevention programs could be substantial. This idea can be illustrated using a simple benefit-cost framework combined with some consensus literature estimates of key parameters. Suppose an elementary school with student population, $p$ , adopts a bullying prevention program with total cost, $p\times c$ , where $c$ denotes program cost per student. Given the school’s current rate of bullying, $b$ , the school has a total of $p\times b$ student victims. If the program has success rate, $s$ , defined as the fraction of total student victims reduced, then, by adopting the program, the school will save $p\times b\times s$ students from bullying.
As for the monetary benefit, since the bullying prevention program creates a school-wide public good (reduced victimization risk) which is nonexcludable and nondepletable, the aggregate marginal value of the public good equals the sum of marginal values of each of the affected individuals within the population (Johansson, Reference Johansson1991; Freeman et al., Reference Freeman, Herriges and Kling2014). In parental value terms (and using earlier notation), the aggregate benefit of preventing one student bullying victim within the elementary school is approximately $p\times \overline{MV}_{\mathit{Victim}}$ , where $\overline{MV}_{\mathit{Victim}}$ is the marginal value of the average parent. Multiplying the aggregate parental benefit by the number of student victims saved and differencing this product by total program cost (and extracting like terms) yields parents’ total net benefit (NB) of the school’s bullying prevention program:
where the difference in square brackets represents net benefit per student. Expression (10) says the bullying prevention program is more likely to be net beneficial if: the school’s parents attach a higher value to reduced bullying; the program is more successful ( $s$ ); the school’s bullying problem is worse ( $b$ ); or the program’s per student cost is lower ( $c$ ).
Expression (10) serves as a basis for evaluating some of the more researched and utilized bullying prevention programs in the United States. One of these is the Olweus Bullying Prevention Program (OBPP). To date, the OBPP has been implemented in several hundred U.S. schools and around the world (Olweus & Limber, Reference Olweus and Limber2007). The OBPP has been associated with reductions in bullying behavior as well as decreases in student reports of general antisocial behavior, such as school vandalism, fighting, theft, and truancy (Limber, Reference Limber2011). While OBPP cost data vary somewhat by region, grade level, and school size, according to Olweus.org (OBPP, 2015), for an elementary school with enrollment of 500 students (an enrollment close to the U.S. average; NCES, 2013), the total OBPP annual cost for materials, training, and other outlays (all figures hereafter expressed in 2003 dollars) is approximately $12,500, or $25 per student. Persson and Svensson (Reference Persson and Svensson2013) noted that this figure excludes the program’s indirect costs in the form of lost resources due to teacher and staff hours being used for program implementation. Since U.S. data on indirect costs of the OBPP are unavailable, based on Persson and Svensson (Reference Persson and Svensson2013), we assume a total OBPP annual cost range of $12,500–$75,000 ($25–$150 per student) for a 500-student U.S. elementary school. We interpret this range as a lower bound estimate of the annual cost of implementing the OBPP. This range is large enough to encompass the costs of other bullying prevention programs found to be effective in U.S. elementary schools, including Al’s Pals (Lynch, Geller & Schmidt, Reference Lynch, Geller and Schmidt2004), Second Step (Cooke et al., Reference Cooke, Ford, Levine, Bourke, Newell and Lapidus2007), and Steps to Respect (Brown, Low, Smith & Haggerty, Reference Brown, Low, Smith and Haggerty2011).
In their systematic review and meta-analysis of 44 studies evaluating the effectiveness of anti-bullying programs in schools, Ttofi and Farrington (Reference Ttofi and Farrington2011) found that school-based anti-bullying programs were, on average, 17%–20% effective in reducing students’ victimization from bullying. Assuming Ttofi and Farrington’s (Reference Ttofi and Farrington2011) most conservative 17% estimate as well as the U.S. Education Department’s (2014) most conservative 15% estimated prevalence of bully victimization in U.S. schools (referred to earlier in Section 3), together with the annual $130 $\overline{MV}_{\mathit{Victim}}$ estimate in Table 3, the NB per student of an annual $25–$150 per student investment in a bullying prevention program for a 500-student elementary school ranges from $1508 to $1633 per year with a total NB of $0.754 to $0.8165 million per year. Holding all parameters constant, parental $\overline{MV}_{\mathit{Victim}}$ would have to fall below $11.76 for the program to be unbeneficial at the annual $150 per student cost. Similarly, the program’s effectiveness could be as low as 1.54% and still be beneficial at the $150 per student cost. In addition, as evidenced by expression (10), the bullying prevention program’s NB falls as the school’s student enrollment falls. This is due to the fact that the aggregate marginal value of a public good declines with fewer users of the good. Thus for instance, at current parameter values and a $150 per student annual cost, the elementary school needs to have an enrollment of at least 46 students for the bullying prevention program to be net beneficial (8 or more students are required at a $25 per student cost). There is power in numbers, however. If the question of U.S. representativeness of the sample used in this study is set aside, for the 32.565 million children who attended a U.S. elementary school in 2003, a yearly $150 per student investment in school-based bullying prevention yields an annual NB per student $=$ $107.95 million with a total annual NB $=$ $3.5 billion.
5 Conclusions
This paper developed and implemented an approach to assess the ex ante monetary benefits parents attach to reductions in their children’s victimization from bullying at school. Though these benefits accrue to the parents alone, they add to the literature cited in Section 1 about the ex post private (e.g., increased adult earnings) and public (e.g., increased adult productivity) benefits the child may ultimately realize. Parental benefit measures are derived in the context of a household utility maximization model and reflect a point estimate of parents’ conditional marginal rate of substitution between parental income and their disutility of bully victimization. A lower bound estimate of this substitution rate is inferred from parents’ discrete choices to change their child’s school conditional on their child’s bully victimization status and parent, child, and school characteristics. The conditional choice model is estimated using a bivariate probit model and a unique panel of 595 families from the NICHD Study of Early Child Care and Youth Development for 2000 to 2003. Empirical results are consistent with the hypothesis that when children are victims of bullying at school, at least some parents alter their perceptions of school quality and engage in the remedial action of switching their child’s school. Sample lower bound estimates of parents’ ex ante marginal annual value of reduced child bully victimization average $130 and range from $54 for parents whose child was not bullied to $633 for parents whose child was bullied.
Though the technique used in this paper likely undervalues parents’ true willingness to pay for reduced bully victimization, when combined with literature estimates of U.S. bullying prevalence and the cost and effectiveness of currently available anti-bullying programs, in nearly all cases willingness to pay estimates produced positive, and most often substantial, net program benefits. To the extent that victims of bullying incur any additional ex post psychological, wealth, and punitive costs (Wolke et al., Reference Wolke, Copeland, Angold and Costello2013), U.S. investments in school-based anti-bullying programs could be welfare improving even at program costs well in excess of the upper range assumed in this paper. Some caveats are worthy of mention. First, while the positive net benefits presented in Section 4 are robust to parental marginal values substantially lower than those reported in Table 3, parameter estimates from qualitative choice models like bivariate probit can be sensitive to the assumed form of the underlying distribution. Second, we ignore possible influences of the attributes of the child’s current school upon school change opportunities. Third, all self-reported measures are subject to error and a variety of conscious and unconscious distortions of actual events. While the test–retest reliability of the Kids in My Class at School questionnaire to measure children’s victimization from bullying is favorable (Demaray & Malecki, Reference Demaray and Malecki2002), the rate of bully victimization measured from this instrument may not accurately reflect the true rate.