
Psychometric properties of the five-level EuroQoL-5 dimension and Short Form-6 dimension measures of health-related quality of life in a population of pregnant women with depression

Published online by Cambridge University Press:  07 October 2019

Margaret Heslin*
Affiliation:
Research Fellow, King's Health Economics, Health Service & Population Research Department, Institute of Psychiatry, Psychology and Neuroscience, Kings College London, UK
Kia-Chong Chua
Affiliation:
Lecturer in Applied Health Statistics, Centre for Implementation Science, Health Service & Population Research Department, Institute of Psychiatry, Psychology and Neuroscience, Kings College London; and Quality Improvement, South London and Maudsley NHS Foundation Trust, UK
Kylee Trevillion
Affiliation:
Lecturer in Mental Health Services Research, Section of Women's Mental Health, Health Service & Population Research Department, Institute of Psychiatry, Psychology and Neuroscience, Kings College London, UK
Selina Nath
Affiliation:
Research Fellow, Section of Women's Mental Health, Health Service & Population Research Department, Institute of Psychiatry, Psychology and Neuroscience, Kings College London; and Population, Policy and Practice Programme, UCL Great Ormond Street Institute of Child Health, UK
Louise M. Howard
Affiliation:
Professor of Women's Mental Health, Section of Women's Mental Health, Health Service & Population Research Department, Institute of Psychiatry, Psychology and Neuroscience, Kings College London, UK
Sarah Byford
Affiliation:
Professor of Health Economics, King’s Health Economics, Health Service & Population Research Department, Institute of Psychiatry, Psychology and Neuroscience, Kings College London, UK
*Correspondence: Margaret Heslin, King's Health Economics, Institute of Psychiatry, Psychology & Neuroscience at King's College London, Box 024, The David Goldberg Centre, De Crespigny Park, Denmark Hill, London SE5 8AF, UK. Email: margaret.heslin@kcl.ac.uk

Abstract

Background

Although evidence suggests that the EuroQoL-5 dimension (EQ-5D) and Short Form-6 dimension (SF-6D) have equivalent psychometric properties in people with depression, there is some evidence that the EQ-5D may lack responsiveness in certain populations with depression.

Aims

To examine the psychometric properties of the five-level EQ-5D (EQ-5D-5L) and SF-6D measures of health-related quality of life in a representative sample of pregnant women with depression.

Method

Data were taken from a cohort of pregnant women identified at or soon after the first antenatal care contact and followed up at 3 months postpartum. Health-related quality of life was measured using both the EQ-5D-5L and the SF-6D at baseline and follow-up. We examined acceptability and conducted psychometric validation of concurrent validity, convergent validity, known-group validity and responsiveness in 421 women with available data.

Results

The EQ-5D-5L and SF-6D have similarly high levels of acceptability. However, concurrent validation shows a lack of concordance between the EQ-5D-5L and SF-6D. The EQ-5D-5L tends to be higher than the SF-6D in individuals with better health states. The SF-6D tends to be higher than the EQ-5D-5L in individuals with poorer health states. Convergent and known-group validity are comparable between the two utility measures. Longitudinally, women who recovered show a larger increase in SF-6D utilities than those who did not recover at follow-up. With the EQ-5D-5L, this is not the case. Additionally, ceiling effects were more apparent in the EQ-5D-5L.

Conclusions

The effectiveness of perinatal mental health interventions may be better captured by the SF-6D than the EQ-5D-5L but this needs to be cross-validated in more studies.

Declaration of interest

L.M.H. chaired the National Institute for Health and Care Excellence CG192 guidelines development group on antenatal and postnatal mental health in 2012–2014. L.M.H. reports grants from NIHR, MRC, Nuffield and the Stefanou Foundation, UK. K.T., M.H. and S.B. report funding by NIHR and the Stefanou Foundation, UK.

Type
Papers
Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Authors 2019

Economic evaluations such as cost-utility and cost-effectiveness analyses are used to provide evidence on the value for money of new interventions for relevant decision-makers.Reference Glick, Doshi, Sonnad and Polsky1 One of the largest guideline development bodies in the UK is the National Institute for Health and Care Excellence (NICE). NICE provides national guidance and advice to improve health and social care2 and uses economic evidence of cost-effectiveness as well as effectiveness evidence in the development of guidelines. NICE provides methodological guidelines for organisations considering submitting evidence to NICE, outlining the principles and methods of health technology assessment and appraisal within the context of the NICE appraisal process. These guidelines describe the NICE ‘reference case’ for economic evaluation, which includes a preference for outcomes to be measured in terms of quality-adjusted life-years using the EuroQoL-5 dimension (EQ-5D) measure of health-related quality of life.3

The EQ-5D and Short Form-6 dimension (SF-6D) are generic preference-based patient-reported outcome measures that are used to derive health-related quality of life. Research suggests that the EQ-5D and SF-6D have equivalent psychometric properties when examined in people with depressionReference Peasgood, Brazier and Papaioannou4,Reference Mulhern, Mukuria, Barkham, Knapp, Byford and Brazier5 and are therefore equally suitable for producing utility values for economic evaluation. However, there is some evidence that the EQ-5D may lack responsiveness in certain populations with depression (for example in elderly populationsReference Peasgood, Brazier and Papaioannou4) and no work has been conducted on the psychometric properties of the EQ-5D and SF-6D in pregnant women with depression. We therefore aimed to explore the psychometric properties of the five-level EQ-5D (EQ-5D-5L) and SF-6D measures of health-related quality of life in a representative sample of pregnant women with depression, through the psychometric assessment of validity, responsiveness and acceptability.

Method

Data

Data were taken from the Wellbeing in pregnancy: identification and prevalence of common mental health problems (WENDY) cohort study.Reference Howard, Ryan, Trevillion, Anderson, Bick and Bye6 WENDY was a cohort study of pregnant women identified around the maternity booking appointment (approximately 10 weeks of pregnancy) and followed up to 3 months postpartum. The purpose of the WENDY study was to determine the prevalence of antenatal common mental disorders and to investigate the effectiveness and cost-effectiveness of the Whooley questions and the Edinburgh Postnatal Depression Scale (EPDS) in identifying antenatal depression.Reference Howard, Ryan, Trevillion, Anderson, Bick and Bye6

Ethical approval for the research was granted by the National Research Ethics Service, London Committee – Camberwell St Giles (ref no 14/LO/0075). Written informed consent was obtained.

Participants

Pregnant women aged 16 years or older attending an antenatal booking clinic in a South East London maternity service between 10 November 2014 and 30 June 2016 were recruited into the WENDY study. Data from all women, including those with and without depression, were included.

Outcome measures

Outcomes for the purpose of the current study were assessed at baseline (booking appointment) and 3 months post-delivery. The Structured Clinical Interview DSM-IV (SCID)Reference First, Spitzer, Gibbon and Williams7 was used to determine who met criteria for a DSM-IV-TRReference First, Spitzer, Gibbon and Williams7 diagnosis of current depression (mild, moderate or severe major depressive disorder, or mixed anxiety and depressive disorder).

Health-related quality of life was measured using both the EQ-5D-5L and the SF-6D. The EQ-5D-5L is measured on five dimensions (mobility, self-care, usual activities, pain/discomfort and anxiety/depression), each with five levels (no problems, slight problems, moderate problems, severe problems and extreme problems).Reference Herdman, Gudex, Lloyd, Janssen, Kind and Parkin8 This allows participants to be classified into one of 3125 health states. Appropriate utility weights can then be attached to these health states.Reference Devlin, Shah, Feng, Mulhern and van Hout9

The SF-6D is derived from the Short Form-36 item survey of health.Reference Ware, Kosinski, Dewey and Gandek10 The Short Form-36 measures health on eight dimensions: (a) limitations in physical activities because of health problems; (b) limitations in social activities because of physical or emotional problems; (c) limitations in usual role activities because of physical health problems; (d) bodily pain; (e) general mental health (psychological distress and well-being); (f) limitations in usual role activities because of emotional problems; (g) vitality (energy and fatigue); and (h) general health perceptions.Reference Ware, Kosinski, Dewey and Gandek10 The SF-6D draws on these to describe health on six dimensions, which allows participants to be classified into one of 18 000 health states. Appropriate utility weights can then be attached to these health states.Reference Brazier, Roberts and Deverill11
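The health-state counts quoted above follow from multiplying the number of response levels across dimensions. The short sketch below reproduces that arithmetic. The EQ-5D-5L level counts come from the text; the per-dimension level counts for the SF-6D are not given here and are assumed from the published SF-6D classification (physical functioning, role limitations, social functioning, pain, mental health and vitality).

```python
from math import prod

# EQ-5D-5L: five dimensions, each with five response levels (as described in the text).
eq5d5l_levels = [5, 5, 5, 5, 5]
eq5d5l_states = prod(eq5d5l_levels)

# SF-6D: levels per dimension. ASSUMPTION: these counts are not stated in the text;
# they are taken to be 6, 4, 5, 6, 5 and 5 for physical functioning, role limitations,
# social functioning, pain, mental health and vitality.
sf6d_levels = [6, 4, 5, 6, 5, 5]
sf6d_states = prod(sf6d_levels)

print(eq5d5l_states)  # 3125
print(sf6d_states)    # 18000
```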

The EPDS is used to measure depression symptoms. It is a ten-item screening tool for perinatal depression.Reference Cox, Holden and Sagovsky12 The 10 items correspond with various clinical depression symptoms, for example guilt, low energy and suicidal ideation. Studies have shown that it is sensitive to changes in severity of depression over time.Reference Cox, Holden and Sagovsky12

Analysis

Analysis included: acceptability; concurrent validity; convergent validity; known-group validity; and responsiveness. Acceptability was assessed descriptively in terms of completion rates at baseline and follow-up at 3 months post-delivery. Concurrent validity refers to the extent to which an outcome of interest (for example SF-6D utility scores) shows an expected association with other measures of the same target construct (for example EQ-5D utility scores). The association between EQ-5D and SF-6D was examined using the intraclass correlation coefficient.Reference Shrout and Fleiss13 Bland–Altman plotsReference Bland and Altman14 were also used to display the limits of agreement between EQ-5D and SF-6D measurements. The plots in this study were generated using the Stata module provided by Mander.Reference Mander15
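As an illustration of the limits-of-agreement calculation, the sketch below computes the mean paired difference and the classic 95% limits (mean difference plus or minus 1.96 standard deviations of the differences). It is not the Stata module used in the study, which also accounts for trend; the function name and the five pairs of utilities are hypothetical.

```python
from statistics import mean, stdev


def bland_altman_limits(a, b, z=1.96):
    """Return (mean difference, lower limit, upper limit) for paired scores a and b."""
    if len(a) != len(b):
        raise ValueError("paired inputs must have equal length")
    diffs = [x - y for x, y in zip(a, b)]
    d_bar = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the paired differences
    return d_bar, d_bar - z * sd, d_bar + z * sd


# Hypothetical utilities for five participants (illustrative only, not study data).
eq5d = [0.95, 0.88, 1.00, 0.70, 0.62]
sf6d = [0.75, 0.70, 0.80, 0.60, 0.58]

d_bar, lower, upper = bland_altman_limits(eq5d, sf6d)
print(round(d_bar, 3), round(lower, 3), round(upper, 3))
```

A positive mean difference here would indicate that the first measure tends to give higher utilities than the second for the same person.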

Convergent validity refers to the extent to which an outcome of interest (such as utility scores) shows an expected association with other logical outcomes (such as depression scores) measured at the same time point. Convergent validity was assessed by examining the correlation between baseline EQ-5D-5L or SF-6D scores and baseline scores on the EPDS using Spearman's rank correlation coefficient or Pearson's correlation coefficient as appropriate. A coefficient greater than 0.5 or less than −0.5 is considered strong, values between 0.3 and 0.49 or −0.3 and −0.49 are considered moderate and values between −0.3 and 0.3 are considered weak.Reference Fleiss16
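The thresholds above can be written as a small classification rule. This is a sketch only: the function name is hypothetical, and the handling of values that fall exactly on a boundary (0.3 or 0.5) is an assumption, because the ranges quoted in the text do not say which side those values belong to.

```python
def correlation_strength(r):
    """Label a correlation coefficient as strong, moderate or weak."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("correlation coefficient must lie in [-1, 1]")
    size = abs(r)  # the thresholds are symmetric around zero
    if size >= 0.5:  # ASSUMPTION: exactly 0.5 is treated as strong
        return "strong"
    if size >= 0.3:  # ASSUMPTION: exactly 0.3 is treated as moderate
        return "moderate"
    return "weak"


# Hypothetical coefficients, for illustration only.
print(correlation_strength(-0.6))   # strong
print(correlation_strength(0.35))   # moderate
print(correlation_strength(0.1))    # weak
```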

Known-group validity refers to the extent to which an outcome measure of interest helps distinguish between groups that are theoretically expected to differ. For example, people with depression would be expected to have lower levels of quality of life than people without depression. Using the SCID and EPDS, we grouped participants in the following ways before comparing their utility scores.

  (a) SCID: non-depressed versus any depression (mild, moderate or severe major depressive disorder, or mixed anxiety and depressive disorder).

  (b) SCID: mild depression versus moderate/severe depression.

  (c) EPDS: non-depressed (indicated by a score of 14 or less on the EPDSReference Matthey, Henshaw, Elliott and Barnett17) versus any depression (indicated by a score of 15 or more on the EPDSReference Matthey, Henshaw, Elliott and Barnett17).

  (d) EPDS: no/mild depressive symptoms (indicated by the EPDS cut-offs of 0–13 for no/mild depressionReference McCabe-Beane, Segre, Perkhounkova, Stuart and O'Hara18) versus moderate/severe depressive symptoms (indicated by the EPDS cut-offs of 14–30 for moderate/severe depressive symptomsReference McCabe-Beane, Segre, Perkhounkova, Stuart and O'Hara18).

In each case the baseline mean EQ-5D-5L and SF-6D scores were calculated for each group and tested for differences using t-tests (or non-parametric equivalent as appropriate).
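The two EPDS groupings use different cut-offs, which is easy to overlook. The sketch below encodes both rules as described in the list above; the function names are hypothetical, and the 0 to 30 score range reflects the ten EPDS items each being scored from 0 to 3.

```python
def _check_epds(score):
    """Validate that a total EPDS score lies in the possible range 0-30."""
    if not 0 <= score <= 30:
        raise ValueError("EPDS total score must be between 0 and 30")


def epds_any_depression(score):
    """Binary grouping: 15 or more indicates depression, 14 or less does not."""
    _check_epds(score)
    return score >= 15


def epds_severity_band(score):
    """Severity grouping: 0-13 is no/mild, 14-30 is moderate/severe."""
    _check_epds(score)
    return "moderate/severe" if score >= 14 else "no/mild"


# A score of 14 falls on different sides of the two rules.
print(epds_any_depression(14))  # False
print(epds_severity_band(14))   # moderate/severe
```

Under these rules a woman scoring exactly 14 is counted as non-depressed in the first grouping but as having moderate/severe symptoms in the second, so the two EPDS comparisons are not nested.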

Responsiveness refers to the ability of an outcome of interest to distinguish clinically important changes and was explored in a number of ways. Floor (lowest possible) and ceiling (highest possible) scores were examined at baseline and follow-up for the EQ-5D-5L and SF-6D. These affect a measure's ability to detect deterioration or improvement in health states: large numbers at the ceiling would suggest that the measure may not adequately capture an improvement in health status, and large numbers at the floor would suggest that it may not adequately capture a deterioration. The magnitude of change in EQ-5D-5L and SF-6D scores was examined and compared using the standardised response mean statistic, which is calculated by dividing the mean change on the measure by the standard deviation of the change. This was calculated for those who recovered (defined as those who were above the EPDS threshold for probable depression at baseline using the cut-off of 15 or moreReference Matthey, Henshaw, Elliott and Barnett17 but then below the cut-off at follow-up) versus those who did not recover (defined as those remaining above the EPDS threshold for probable depression at baseline and follow-up using the cut-off of 15 or moreReference Matthey, Henshaw, Elliott and Barnett17).
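A minimal sketch of the standardised response mean and the recovery rule described above is given below. The function names and the example utilities are hypothetical, and the use of the sample (n − 1) standard deviation is an assumption, as the text does not specify it.

```python
from statistics import mean, stdev


def standardised_response_mean(baseline, follow_up):
    """Mean change divided by the standard deviation of the change."""
    changes = [f - b for b, f in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)  # ASSUMPTION: sample s.d. (n - 1)


def recovered(epds_baseline, epds_follow_up, cutoff=15):
    """True if at or above the cut-off at baseline and below it at follow-up."""
    return epds_baseline >= cutoff and epds_follow_up < cutoff


# Hypothetical utilities for four participants (illustrative only, not study data).
baseline = [0.60, 0.55, 0.70, 0.65]
follow_up = [0.75, 0.65, 0.90, 0.70]
srm = standardised_response_mean(baseline, follow_up)
print(round(srm, 2))

print(recovered(17, 9))   # True: above the threshold, then below it
print(recovered(17, 16))  # False: remained above the threshold
```

Because the statistic scales the mean change by its variability, it can be compared across measures with different score ranges.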

Results

Participants

A total of 545 participants were recruited into the WENDY study. Of these, 27% (147/545) had a diagnosis of mild, moderate or severe major depressive disorder, or mixed anxiety and depressive disorder according to the SCID, and 73% (398/545) did not. Mild depression was the most common (14%, 74/545) followed by moderate depression (11%, 59/545), mixed anxiety and depression (2%, 11/545) and finally severe depression (1%, 3/545).

The baseline sociodemographic and clinical variables for the participants are described in Table 1. The mean age of the sample was 33 years (s.d. = 5.75). Of the 545 participants, 34% (n = 184) were White British, 25% were Black non-British (n = 138), and 18% (n = 100) were White other. In total, 48% (n = 262) were born in the UK. The majority were married or cohabiting (72%, n = 392), had a bachelor's degree or higher (60%, n = 326), and were in paid employment (64%, n = 349). The mean EQ-5D-5L utility score at baseline was 0.87 (s.d. = 0.16). The mean SF-6D utility score at baseline was 0.67 (s.d. = 0.12).

Table 1 Sociodemographic and clinical data for participants

EQ-5D-5L, five-level EuroQoL-5 dimension; SF-6D, Short Form-6 dimension.

Of the 545 included participants, 77% (421/545) had full data on the measures required for the current analyses. Table 2 describes the sociodemographic and clinical variables at baseline for those with and without full data. There were no differences in the follow-up of those with and without depression. Participants without full data appeared to be more likely to be of Black non-British ethnicity, to be from outside the UK, to have a qualification lower than a degree and to be unemployed. However, in terms of baseline EQ-5D-5L and SF-6D utility, the groups were very similar.

Table 2 Sociodemographic and clinical data for those with and without full data

SCID, Structured Clinical Interview DSM-IV; EQ-5D, EuroQoL-5 dimension; SF-6D, Short Form-6 dimension.

Utility scores

At baseline, mean EQ-5D-5L utility was 0.87 (s.d. = 0.16), ranging from −0.10 to 1, and mean SF-6D utility was 0.67 (s.d. = 0.12), ranging from 0.32 to 1. At 3-month follow-up, the mean EQ-5D-5L utility was 0.91 (s.d. = 0.12), ranging from 0.07 to 1 and mean SF-6D utility was 0.76 (s.d. = 0.13), ranging from 0.38 to 1.

Acceptability

The EQ-5D-5L was fully completed by 95.78% of all participants (522/545) and 93.88% (138/147) for those with mild, moderate or severe major depressive disorder, or mixed anxiety and depressive disorder. This compares with 92.48% (504/545) of all participants for the SF-6D and 91.16% (134/147) for those with mild, moderate or severe major depressive disorder, or mixed anxiety and depressive disorder. This suggests interview fatigue or lack of acceptability in only a very small proportion of respondents and little difference for those with and without depression.

Concurrent validity

Figure 1 presents the Bland–Altman plots depicting the agreement between the EQ-5D-5L and SF-6D utility scores at baseline. Under 5% of values lay outside the 95% limits of agreement. The mean difference in utility values was 0.195, with 95% limits of agreement of −0.054 to 0.445. The figure shows that, in general, the EQ-5D-5L overestimates utility in the higher utility range, and the SF-6D overestimates utility in the lower utility range. For lower utility values, there is a larger discrepancy between the EQ-5D-5L and the SF-6D.

There were 20 of 421 (4.75%) outside the limits of agreement. Mean difference 0.195, 95% limits of agreement –0.054 to 0.445. Averages lie between 0.110 and 1.000.

Fig. 1 Agreement between the five-level EuroQoL-5 dimension (EQ-5D-5L) and Short Form-6 dimension (SF-6D) utility at baseline.
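The reported limits of agreement can be sanity-checked from the summary figures alone. The sketch below assumes the limits were formed as the mean difference plus or minus 1.96 standard deviations of the differences (the usual Bland–Altman convention, not confirmed in the text), in which case they should sit symmetrically around the mean difference and imply a particular standard deviation.

```python
# Summary figures reported for Fig. 1.
mean_diff = 0.195
lower, upper = -0.054, 0.445

# ASSUMPTION: limits = mean difference +/- 1.96 * s.d. of the paired differences.
midpoint = (lower + upper) / 2
half_width = (upper - lower) / 2
implied_sd = half_width / 1.96

print(round(midpoint, 4))    # should be close to the reported mean difference
print(round(implied_sd, 3))  # s.d. of the differences implied by the limits

# Share of participants outside the limits, from the reported counts.
outside = 20 / 421
print(round(100 * outside, 2))
```

The midpoint of the two limits agrees with the reported mean difference to within rounding, which is only possible if the upper limit is positive.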

Convergent validity

Both the EQ-5D-5L utility score and the SF-6D utility score were significantly and negatively associated with EPDS scores, meaning that lower EPDS scores (fewer depression symptoms) were associated with higher utility scores (better quality of life). The EQ-5D-5L's correlation was strong according to Fleiss'sReference Fleiss16 threshold of 0.5/−0.5 (Spearman's rho = −0.558, P<0.001), as was the SF-6D's (Spearman's rho = −0.553, P<0.001). Both the EQ-5D-5L and SF-6D show a similar magnitude of association with the EPDS.

Known-group validity

Tests of known-group validity are reported in Table 3. There was a significant difference in EQ-5D-5L utility between those who had a SCID diagnosis of depression and those who did not (z = 9.362, P<0.001) with those with depression having a lower utility value. The same was found for the SF-6D utility (z = 10.372, P<0.001). The mean difference for the EQ-5D-5L was 0.16 compared with 0.14 for the SF-6D with an effect size of 1 and 1.17 respectively. Similarly, there was a significant difference in EQ-5D-5L utility (z = 8.995, P<0.001) and SF-6D utility (z = 8.725, P<0.001) for those who had a diagnosis of depression according to the EPDS. The mean difference for the EQ-5D-5L and SF-6D for this was 0.20 and 0.13, respectively, with effect sizes of 1.25 and 1.08, respectively. There was no difference in EQ-5D-5L utility (z = 1.040, P = 0.2982) or SF-6D utility (z = 1.927, P = 0.0552) between those with mild versus moderate/severe depression according to the SCID, but there was a difference between those with mild versus moderate/severe depression according to the EPDS (EQ-5D-5L: z = 6.237, P<0.001; SF-6D: z = 5.996, P<0.001) with mean differences of 0.16 for the EQ-5D-5L and 0.09 for the SF-6D and effect sizes of 1 and 0.75, respectively.
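The text does not state how the effect sizes were calculated. As a hedged check, the sketch below divides each reported mean difference by the full-sample baseline standard deviation of the corresponding measure (0.16 for the EQ-5D-5L and 0.12 for the SF-6D, as reported earlier). That choice of denominator is an inference, not a documented method, but it reproduces the reported values.

```python
# Baseline standard deviations reported for the full sample.
baseline_sd = {"EQ-5D-5L": 0.16, "SF-6D": 0.12}

# Mean differences between groups, as reported in the paragraph above.
mean_differences = {
    "SCID: none vs any depression": {"EQ-5D-5L": 0.16, "SF-6D": 0.14},
    "EPDS: none vs any depression": {"EQ-5D-5L": 0.20, "SF-6D": 0.13},
    "EPDS: no/mild vs moderate/severe": {"EQ-5D-5L": 0.16, "SF-6D": 0.09},
}

# ASSUMPTION: effect size = mean difference / baseline s.d. of the measure.
effect_sizes = {
    comparison: {m: round(diff / baseline_sd[m], 2) for m, diff in by_measure.items()}
    for comparison, by_measure in mean_differences.items()
}

for comparison, by_measure in effect_sizes.items():
    print(comparison, by_measure)
```

Read this way, the EQ-5D-5L shows larger raw differences in two of the three comparisons, but the gap narrows or reverses once each difference is scaled by its measure's spread.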

Table 3 Mean baseline EQ-5D-5L and SF-6D utility by known groups

EQ-5D-5L, five-level EuroQoL-5 dimension; SF-6D, Short Form-6 dimension; SCID, Structured Clinical Interview DSM-IV; EPDS, Edinburgh Postnatal Depression Scale.

*P<0.05; **P<0.01; ***P<0.001.

Responsiveness

At baseline and follow-up, there were no participants reporting the lowest possible score on the EQ-5D-5L or SF-6D. At baseline, 28.50% of participants (120/421) reported having the highest possible score on the EQ-5D-5L and at follow-up this rose to 42.28% (178/421). This compared with 0.24% (1/421) for the SF-6D at baseline and 1.43% (6/421) at follow-up. Figure 2 shows the distribution of EQ-5D-5L and SF-6D utility at baseline and follow-up. Figure 3 shows the EQ-5D-5L and SF-6D utility plotted between baseline and follow-up. Replicating these figures for the subsample of participants with a SCID diagnosis of depression found the same patterns (supplementary Figs S1 and S2 available at https://doi.org/10.1192/bjo.2019.71).

EQ-5D-5L at baseline (a) and follow-up (b) and SF-6D at baseline (c) and follow-up (d).

Fig. 2 Distribution of five-level EuroQoL-5 dimension (EQ-5D-5L) and Short Form-6 dimension (SF-6D) utility at baseline and follow-up.

Fig. 3 Scatterplot of (a) five-level EuroQoL-5 dimension (EQ-5D-5L) and (b) Short Form-6 dimension (SF-6D) utility plotted between baseline and follow-up.

At baseline there were 72 people who had a diagnosis of depression according to the EPDS cut-off of 15 and above.Reference Matthey, Henshaw, Elliott and Barnett17 At follow-up, 60 of these had recovered (EPDS score fell below 15) and 12 had not recovered (retained an EPDS score of 15 or above). The change in EQ-5D-5L utility for those who had recovered was 0.14 (s.d. = 0.21) compared with 0.24 (s.d. = 0.24) for those who did not. The standardised response mean was 0.67 for those who recovered versus 1.03 in those who did not. The change in SF-6D utility for those who had recovered was 0.14 (s.d. = 0.13) compared with 0.03 (s.d. = 0.14) for those who did not. The standardised response mean was 1.07 for those who recovered versus 0.22 in those who did not.
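The reported standardised response means can be approximately recovered from the rounded means and standard deviations given above. The sketch below does this; small discrepancies from the reported figures are to be expected because the published inputs are rounded to two decimal places.

```python
# (mean change, s.d. of change) as reported in the paragraph above.
reported_change = {
    ("EQ-5D-5L", "recovered"): (0.14, 0.21),
    ("EQ-5D-5L", "not recovered"): (0.24, 0.24),
    ("SF-6D", "recovered"): (0.14, 0.13),
    ("SF-6D", "not recovered"): (0.03, 0.14),
}

# Standardised response mean = mean change / s.d. of change.
srm_from_summary = {
    group: mean_change / sd_change
    for group, (mean_change, sd_change) in reported_change.items()
}

for (measure, group), value in srm_from_summary.items():
    print(f"{measure:9s} {group:14s} SRM ~ {value:.2f}")
```

The pattern matches the reported values: for the SF-6D the statistic is several times larger among women who recovered, whereas for the EQ-5D-5L it is larger among the 12 women who did not.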

Discussion

Main findings

This is the first comparison of the relevance of the EQ-5D-5L and SF-6D utility measures of health-related quality of life in a population of pregnant women with depression. With five and six items respectively, acceptability rates were high and comparable for both measures. However, the EQ-5D-5L tended to show higher utility than the SF-6D for the same individual. This means that study conclusions may differ depending on the choice of utility measure. Further, the EQ-5D-5L showed a substantial ceiling effect that was far less marked with the SF-6D. Despite this, we found that both measures show associations with depression symptomatology in the expected direction and at similar magnitudes. Both measures also show the utility differences that we would expect to find between groups with and without depression, and between groups with increasing depression severity. However, such group differences tended to be more apparent with EQ-5D-5L utility.

When examined alongside longitudinal changes in depression symptomatology, SF-6D utility showed a much larger increase among those who recovered than among those who did not. This was not the case for the EQ-5D-5L. In fact, EQ-5D-5L utility showed a much larger increase among those who did not recover. However, the change in EQ-5D-5L utility did not differ significantly between those who did and did not recover, and those who did not recover started with lower utilities and therefore had more room for an increase. This was not the case for the SF-6D, which showed a smaller increase in utility among those who did not recover than among those who did, and for which the baseline utilities of those who did and did not recover were similar. Many EQ-5D-5L utility values were at or near the maximum at baseline (almost 30%), meaning that there was little room for utility to improve over time. This was still the case even when only those with depression were examined, but was not the case for SF-6D utility. It is possible that the EQ-5D-5L is failing to pick up mental health issues to the same extent as the SF-6D. However, it is not clear whether the lower scores on the SF-6D are the result of mental health issues or the effect of pregnancy.

Various studies have examined the psychometric properties of the EQ-5D and SF-6D in common mental disorders such as depression. However, the majority of these studies have examined each measure in separate populations,Reference Mulhern, Mukuria, Barkham, Knapp, Byford and Brazier5 making direct comparisons of the psychometric properties difficult. Only one study has examined both measures in the same sample,Reference Lamers, Bouwmans, van Straten, Donker and Hakkaart19 but only a limited comparison was undertaken (covering only known-group validity and responsiveness), and none has been undertaken in a population of pregnant women with depression.

Strengths and limitations

The major strength of this study is that the psychometric properties of the EQ-5D-5L and SF-6D could be examined and compared against each other as they were collected at the same time in the same cohort. This collection of data at the same time allowed not only direct comparisons between the EQ-5D-5L and SF-6D in terms of known-group validity, convergent validity, responsiveness and acceptability, but it also allowed for the examination of concurrent validity, which cannot be done when data on different measures are collected from separate sources.

The main limitation of this study was the collection of data from a single site in inner-city London, meaning results may not be applicable to the rest of the UK. There was also a low response rate, with only 33% of those eligible for study inclusion agreeing to take part. However, the sample was still found to be representative of women in the catchment area, including women from very diverse backgrounds and those who did not speak English.Reference Howard, Ryan, Trevillion, Anderson, Bick and Bye6 Further, the non-depressed group included women with other mental disorders. However, this is a realistic sample of women without depression, which will include women with and without other disorders. Finally, the known-group validity result by SCID severity did not attain statistical significance. This could be explained by small subgroup sizes. Although sample size is a study limitation, our examination of longitudinal changes in utility values has been anchored on clinically significant changes in depression scores.

Implications

In conclusion, there may be an advantage of using the SF-6D rather than EQ-5D-5L in economic evaluations with pregnant women who are depressed because of the substantial ceiling effect shown by the EQ-5D-5L, but this finding needs to be cross-validated in more studies.

Funding

This paper summarises independent research funded by the National Institute for Health Research (NIHR) under the Programme Grants for Applied Research programme (ESMI Programme: grant reference number RP-PG-1210–12002) and the National Institute for Health Research (NIHR)/Wellcome Trust Kings Clinical Research Facility and the NIHR Biomedical Research Centre and Dementia Unit at South London and Maudsley NHS Foundation Trust and Kings College London. L.M.H. is also supported by a National Institute for Health Research (NIHR) Research Professorship (NIHR-RP-R32–011). The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health. The study team acknowledges the study delivery support given by the South London Clinical Research Network.

Acknowledgements

We gratefully acknowledge the advice received from our Patient and Public Advisory Group (Clare Dolman, Sarah Spring, Ceri Rose, Liberty Mosse, Amanda Grey, Henry Fay, Kathryn Grant, Maria Bavetta, Eleanor O'Sullivan, Jesse Hunt, Diana Rose, chair), our Programme Steering Committee (Professor Rona McCandlish (Chair), Dr Heather O'Mahen, Dr Pauline Slade, Ceri Rose, Sarah Spring and Rosemary Jones) and our Data Monitoring and Ethics Committee (Roch Cantwell (chair), Liz McDonald-Clifford, Marian Knight, Stephen Bremner). We also want to take the opportunity to thank the women who participated in this study. The data are not publicly available due to being of a sensitive nature but are available from the corresponding author on reasonable request.

Supplementary material

Supplementary material is available online at https://doi.org/10.1192/bjo.2019.71.

References

1 Glick, HA, Doshi, JA, Sonnad, SS, Polsky, D. Economic Evaluation in Clinical Trials. Oxford University Press, 2015.
2 National Institute for Health and Care Excellence. What We Do. NICE, 2018 (https://www.nice.org.uk/about/what-we-do).
3 National Institute for Health and Care Excellence. Guide to the Methods of Technology Appraisal 2013. NICE, 2013.
4 Peasgood, T, Brazier, J, Papaioannou, D. A Systematic Review of the Validity and Responsiveness of EQ-5D and SF-6D for Depression and Anxiety. HEDS Discussion Paper 12/15. White Rose University Consortium, 2012 (http://eprints.whiterose.ac.uk/74659/1/12.15.pdf).
5 Mulhern, B, Mukuria, C, Barkham, M, Knapp, M, Byford, S, Brazier, J. Using generic preference-based measures in mental health: psychometric validity of the EQ-5D and SF-6D. Br J Psychiatry 2014; 205: 236–43.
6 Howard, LM, Ryan, EG, Trevillion, K, Anderson, F, Bick, D, Bye, A, et al. Accuracy of the Whooley questions and the Edinburgh Postnatal Depression Scale in identifying depression and other mental disorders in early pregnancy. Br J Psychiatry 2018; 212: 50–6.
7 First, MB, Spitzer, RL, Gibbon, M, Williams, JB. Structured Clinical Interview for DSM-IV-TR Axis I Disorders, Research Version, Patient Edition. Biometrics Research, New York State Psychiatric Institute, 2002.
8 Herdman, M, Gudex, C, Lloyd, A, Janssen, MF, Kind, P, Parkin, D, et al. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res 2011; 20: 1727–36.
9 Devlin, N, Shah, K, Feng, Y, Mulhern, B, van Hout, B. Valuing Health-Related Quality of Life: An EQ-5D-5L Value Set for England. OHE Research Paper 16/01. Office of Health Economics, 2016.
10 Ware, JE, Kosinski, M, Dewey, JE, Gandek, B. SF-36 Health Survey: Manual and Interpretation Guide. Quality Metric Inc, 2000.
11 Brazier, J, Roberts, J, Deverill, M. The estimation of a preference-based measure of health from the SF-36. J Health Econ 2002; 21: 271–92.
12 Cox, JL, Holden, JM, Sagovsky, R. Detection of postnatal depression: development of the 10-item Edinburgh Postnatal Depression Scale. Br J Psychiatry 1987; 150: 782–6.
13 Shrout, PE, Fleiss, JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull 1979; 86: 420–8.
14 Bland, JM, Altman, DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986; 1: 307–10.
15 Mander, A. BATPLOT: Stata Module to Produce Bland-Altman Plots Accounting for Trend. Statistical Software Components, 2005 (http://ideas.repec.org/c/boc/bocode/s448703.html).
16 Fleiss, JL. Statistical Methods for Rates and Proportions. Wiley & Sons, 1982.
17 Matthey, S, Henshaw, C, Elliott, S, Barnett, B. Variability in use of cut-off scores and formats on the Edinburgh Postnatal Depression Scale: implications for clinical and research practice. Arch Womens Ment Health 2006; 9: 309–15.
18 McCabe-Beane, JE, Segre, LS, Perkhounkova, Y, Stuart, S, O'Hara, MW. The identification of severity ranges for the Edinburgh Postnatal Depression Scale. J Reprod Infant Psychol 2016; 34: 293–303.
19 Lamers, LM, Bouwmans, CA, van Straten, A, Donker, MC, Hakkaart, L. Comparison of EQ-5D and SF-6D utilities in mental health. Health Econ 2006; 15: 1229–36.