Introduction
In recent years mental health care services and social organizations increased their focus on implementing peer support initiatives to promote recovery and expand the availability of support for individuals coping with mental illness (Stratford et al., Reference Stratford, Halpin, Phillips, Skerritt, Beales, Cheng and Tiengtom2017). This growing interest in peer support is stimulated by the World Health Organization (WHO), as they consider it a feasible tool which adds a person-centered, recovery, and rights-based approach to biomedical practices in mental health services (WHO, 2021). Also, the (coronavirus disease 2019) COVID-19 pandemic increases the need for community-based interventions such as peer support (Suresh, Alam, & Karkossa, Reference Suresh, Alam and Karkossa2021), since mental health problems may have exacerbated and mental health services may be less accessible (Salari et al., Reference Salari, Hosseinian-Far, Jalali, Vaisi-Raygani, Rasoulpoor, Mohammadi and Khaledi-Paveh2020).
Peer support involves a mutual exchange of practical and emotional support, based on ‘shared understanding, respect, and mutual empowerment between people in similar situations’ (Mead, Hilton, & Curtis, Reference Mead, Hilton and Curtis2001) with critical ingredients such as shared responsibility (Mead, Reference Mead2003; Mead & MacNeil, Reference Mead and MacNeil2006), hope, self-determination over one's life, and the use of lived experience knowledge (Repper & Carter, Reference Repper and Carter2011; Slade et al., Reference Slade, Amering, Farkas, Hamilton, O'Hagan, Panther, Perkins and Whitley2014; Solomon, Reference Solomon2004). These aspects are embedded within the varying peer support programs implementing different structures, content, duration, and delivery formats, targeting different populations, and evaluating a wide range of outcomes (Chien, Clifton, Zhao, & Lui, Reference Chien, Clifton, Zhao and Lui2019; Lloyd-Evans et al., Reference Lloyd-Evans, Mayo-Wilson, Harrison, Istead, Brown, Pilling and Kendall2014).
Previous meta-analyses examining the effects of peer support interventions (PSIs) were focused on specific target groups, such as patients with (perinatal) (Huang et al., Reference Huang, Yan, Tian, Lei, Yang, Liu and Lei2020) depression (Bryan & Arkowitz, Reference Bryan and Arkowitz2015; Pfeiffer, Heisler, Piette, Rogers, & Valenstein, Reference Pfeiffer, Heisler, Piette, Rogers and Valenstein2011) or serious mental illness (SMI) (Chien et al., Reference Chien, Clifton, Zhao and Lui2019; Fuhr et al., Reference Fuhr, Salisbury, De Silva, Atif, van Ginneken, Rahman and Patel2014; Lloyd-Evans et al., Reference Lloyd-Evans, Mayo-Wilson, Harrison, Istead, Brown, Pilling and Kendall2014), or only analyze specific outcomes (e.g. cost-effectiveness; Chien et al., Reference Chien, Clifton, Zhao and Lui2019; Huang et al., Reference Huang, Yan, Tian, Lei, Yang, Liu and Lei2020) and empowerment (Burke, Pyle, Machin, Varese, & Morrison, Reference Burke, Pyle, Machin, Varese and Morrison2019) or included either one-to-one (White et al., Reference White, Foster, Marks, Morshead, Goldsmith, Barlow and Gillard2020) or group interventions (Lyons, Cooper, & Lloyd-Evans, Reference Lyons, Cooper and Lloyd-Evans2021).
To the best of our knowledge, no previous meta-analysis has examined the effects of peer support across all patient groups and intervention types. We conducted a comprehensive systematic review and meta-analysis of randomized controlled trials (RCTs) comparing the effects of any peer support intervention with control conditions. We focused on 3 pre-specified main outcomes – clinical, personal, and functional recovery – and, when possible, we also examined specific outcomes within these main categories (e.g. depressive symptoms, empowerment, and quality of life).
Methods
Protocol registration
This study adheres to the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) reporting guideline (Moher, Liberati, Tetzlaff, Altman, & The, Reference Moher, Liberati, Tetzlaff, Altman and The2009), and focuses on the effect of peer support for individuals with mental health disorders, corresponding to the main part of our protocol (https://osf.io/58urb). This protocol also includes our search for RCTs on peer support for relatives and caregivers of individuals with mental illness, which will not be reported here.
Search strategy
We searched PubMed, Embase, and PsycINFO up to December 18th 2020, without language restriction. We used index terms from database-specific thesauruses as well as free text words indicative of mental illness and peer support (search strings are available in Appendix A). References of included trials and previous systematic reviews were reviewed for eligibility.
Identification and selection of studies
Two authors (DS and CM) independently screened titles and abstracts to identify eligible papers for inclusion. To determine final inclusions, full texts of the selected papers were examined. We included studies: (a) that were RCTs; (b) comparing any PSI format; (c) for adults with a clinical or self-reported mental disorder diagnosis, or a score above a cut-off on a standardized mental disorder symptom measure; (d) with care-as-usual (CAU), waiting list (WL), or other active (e.g. clinician-led therapies) or inactive comparators (e.g. an attention control website) (Griffiths et al., Reference Griffiths, Mackinnon, Crisp, Christensen, Bennett and Farrer2012); and (e) outcomes focusing on at least one of 3 categories: clinical (i.e. symptomatic) recovery (Slade et al., Reference Slade, Amering, Farkas, Hamilton, O'Hagan, Panther, Perkins and Whitley2014; van Eck, Burger, Vellinga, Schirmbeck, & de Haan, Reference van Eck, Burger, Vellinga, Schirmbeck and de Haan2018); personal recovery (e.g. empowerment; Mueser et al., Reference Mueser, Meyer, Penn, Clancy, Clancy and Salyers2006; van Weeghel, van Zelst, Boertien, & Hasson-Ohayon, Reference van Weeghel, van Zelst, Boertien and Hasson-Ohayon2019); functional recovery (e.g. quality of life; Mueser et al., Reference Mueser, Meyer, Penn, Clancy, Clancy and Salyers2006). For a definition of the categories, see Appendix B. Peers are defined as individuals recovered or in recovery from a mental illness. We excluded trials when the intervention was partially or co-delivered by a non-peer (e.g. a lay health worker), targeting substance use, somatic disorder self-management, or including (ex-)employees with mental illness due to their job (e.g. veterans). Any disagreement was resolved with a third author (PC), and central issues were discussed in meetings with all authors.
Data extraction and risk of bias assessment
A standardized form was used by 2 authors (DS and CM) to extract data regarding study context, participants' and intervention characteristics, including diagnoses, intervention format, control condition, and outcome data. When multiple measurements or control groups were available, we followed our developed decision tool (see Appendix C).
Study authors DS and CM independently assessed included trials using the Cochrane Collaboration Risk of Bias (RoB) tool 2.0 (Higgins et al., Reference Higgins, Altman, Gøtzsche, Jüni, Moher, Oxman and Sterne2011), resolving any discrepancy with a third researcher (PC). Each of the following RoB-domains was rated as high risk, some concerns, or low risk: (a) the randomization process; (b) deviations from the intended interventions; (c) missing outcome data (up to 10% drop out was rated as low risk); (d) inappropriate measurement of the outcome; (e) selection of the reported result. An overall RoB score was calculated for each study, following our approach as presented in Appendix C.
Outcome measures
Outcomes included three pre-specified recovery categories: (1) clinical recovery, indicating the degree of psychiatric symptomatology (Slade et al., Reference Slade, Amering, Farkas, Hamilton, O'Hagan, Panther, Perkins and Whitley2014), with measures including the Brief Symptom Inventory (BSI), and Brief Psychiatric Rating Scale (BPRS); (2) personal recovery, concerning the extents of perceived recovery, sense of purpose, and personal agency [Mueser et al., Reference Mueser, Meyer, Penn, Clancy, Clancy and Salyers2006; e.g., Recovery Assessment Schedule (RAS), Empowerment Scale (ES)]; (3) functional recovery, referring to the quality of life and the degree of vocational and social functioning [Robinson, Woerner, McMeniman, Mendelowitz, & Bilder, Reference Robinson, Woerner, McMeniman, Mendelowitz and Bilder2004; e.g., World Health Organization Quality of Life (WHOQOL), EuroQoL 5D (EQ-5D)].
Also, we examined subcategories within the main categories of outcomes: clinical recovery (depressive symptoms), personal recovery (empowerment, RAS, hope), and functional recovery (quality of life, social support, and loneliness). These subcategories of specific outcomes were pooled when a minimum of five trials were available. In Appendix B, a comprehensive definition for each outcome category is provided, with details on data extraction per category described in Appendix C, and corresponding instruments in Appendix D.
Statistical analysis
We conducted separate meta-analyses comparing PSIs and control conditions for each main group of outcomes (clinical, functional, and personal recovery) as well as subcategories of outcomes within the main groups (e.g. hope, quality of life). Effects were estimated at post-test, and when possible, at long-term follow-ups (⩾6 months after randomization).
We calculated between-group effect sizes (Hedges' g) by using means, standard deviations and N. When these were not reported, we used dichotomous outcomes or other statistics (e.g. p value, t value) for calculating effect sizes. Intention-to-treat data were used. Effect sizes were pooled with a random-effects model, using the Hartung-Knapp-Sidik-Jonkman method (IntHout, Ioannidis, & Borm, Reference IntHout, Ioannidis and Borm2014). Heterogeneity was estimated with the I 2 statistic and its 95% confidence interval (CI). In addition, we included prediction intervals (PI), which represent 95% CI of the predictive distribution of effects in future comparable trials.
Categorical moderators of effects were explored in subgroup analyses by using a mixed-effects model. We conducted subgroup analyses when a minimum of three studies were available per subgroup.
We estimated publication bias through visual funnel plot inspection, Egger's test (Egger, Smith, Schneider, & Minder, Reference Egger, Smith, Schneider and Minder1997), and with Duval and Tweedie trim-and-fill procedure (Duval & Tweedie, Reference Duval and Tweedie2000). We conducted sensitivity analyses by: (a) excluding outliers (defined as studies whose 95% CI effect size did not overlap with the 95% CI of the pooled effect), and (b) exploring the influence of RoB in the results.
All meta-analyses were conducted in version 4.1.1 of R, using the packages meta (Balduzzi, Rücker, & Schwarzer, Reference Balduzzi, Rücker and Schwarzer2019), metafor (Viechtbauer, Reference Viechtbauer2010), and dmetar (Harrer, Cuijpers, Furukawa, & Ebert, Reference Harrer, Cuijpers, Furukawa and Ebert2019).
Results
Inclusion of studies
The PRISMA flowchart is presented in Fig. 1. We screened 3455 hits, and we examined the full-text of 133 studies. A total of 30 studies (for references, see Appendix E) were included, of which 28 trials and 4152 participants, were included in the meta-analysis. Three studies (Field, Diego, Delgado, & Medina, Reference Field, Diego, Delgado and Medina2013; Ludman et al., Reference Ludman, Simon, Grothaus, Luce, Markley and Schaefer2007; Mathews et al., Reference Mathews, Mackin, Chou, Uhm, Bain, Stark and Delucchi2018) included a clinician-led group as comparator [e.g. Interpersonal Psychotherapy (IPT) or Cognitive Behavioral Therapy (CBT)], including one overlapping trial (Ludman et al., Reference Ludman, Simon, Grothaus, Luce, Markley and Schaefer2007) which examined a control condition and a clinician-led comparator. Due to the limited number of studies, we did not pool trials with clinician-led comparators. A narrative description of these studies is presented in Appendix F.
Study characteristics
Selected characteristics of 30 included studies are presented in Appendix D. Two main subgroups were identified across the included trials: patients with SMI (20 trials) and individuals with depression (7 trials). SMI studies included a heterogeneous group of patients including but not limited to psychosis, depressive disorders, anxiety disorders, or bipolar disorders. The majority of depression studies (5 trials) focused on perinatal depression (Dennis, Reference Dennis2003; Dennis et al., Reference Dennis, Hodnett, Kenton, Weston, Zupancic, Stewart and Kiss2009; Gjerdingen, McGovern, Pratt, Johnson, & Crow, Reference Gjerdingen, McGovern, Pratt, Johnson and Crow2013; Letourneau et al., Reference Letourneau, Stewart, Dennis, Hegadoren, Duffett-Leger and Watson2011; Shorey et al., Reference Shorey, Chee, Ng, Lau, Dennis and Chan2019), with participants scoring above a cut-off on a questionnaire. One study focused on women with eating disorders (Ranzenhofer et al., Reference Ranzenhofer, Wilhelmy, Hochschild, Sanzone, Walsh and Attia2020). Most studies had CAU (16 trials) or WL (9 trials) as a control condition.
In 12 trials the PSI consisted of group meetings, 17 evaluated one-to-one peer support, and one trial implemented a mixed format. Face-to-face delivery was most common (16 trials), three trials evaluated telephone-based support, two trials examined internet support groups, and nine trials examined a mixed intervention, bringing together the latter formats. Intervention duration and frequency were heterogeneous and reported inconsistently, ranging from three weeks to six months with weekly meetings or a more flexible frequency.
Risk of bias
Overall, there is a high RoB in the majority of included studies: 21 trials were rated at high risk (21/30, 70%), six studies were judged as having some concerns for risk of bias (6/30, 20%), and only three studies met criteria for low risk of bias (3/30, 10%). Focusing on the separate RoB domains, twelve studies (12/30, 40%) were rated at low risk of bias for domain 1, due to reporting an adequate randomization process. Due to the unstructured naturalistic approach of peer support, 23 studies (23/30, 77%) were rated at low risk in domain 2 (deviations from the intended interventions). Ten trials (10/30, 33%) were rated as low RoB in domain 3 due to missing outcome data. Thirteen trials (13/30, 43%) were judged at low risk in domain 4 due to measurement of the outcome, using self-report measures only. For domain 5, only five studies (5/30, 17%) were prospectively registered and were rated at low risk (see Figures G1 and G2 in Appendix G, and Appendix H for RoB rating per domain and study).
Clinical recovery
The pooled effect size at post-test across 22 PSI studies measuring clinical recovery was significant, with g = 0.19, 95% CI (0.11–0.27) (see Table 1 and Fig. 2). Heterogeneity was low, I 2 = 10%, 95% CI (0–44). The PI was consistent with benefit, overlapping completely with the 95% CI.
CAU, care-as-usual; CI, confidence interval; NA, not applicable; PI, prediction interval; WL, waiting list.
a According to the random-effects model.
b k = 6 studies included individuals with depressive symptoms scoring above a cut-off on a standardized mental disorder symptom measure (of which k = 5 are on perinatal depression), and k = 1 study included adults with a clinical diagnosis.
c Egger's test was not significant (p = 0.99) and the number of imputed studies using Duvall and Tweedie trim-and-fill procedure was 23.
d The p value for the between-group effect sizes is significant (p = 0.02).
e Of the k = 8 studies only one study included 18 months follow-up data, the remaining studies reported 12 months follow-up data.
For the subgroup of patients with SMI (Boevink, Kroon, van Vugt, Delespaul, & van Os, Reference Boevink, Kroon, van Vugt, Delespaul and van Os2016; Cook et al., Reference Cook, Copeland, Floyd, Jonikas, Hamilton, Razzano and Boyd2012a; Corrigan et al., Reference Corrigan, Kraus, Pickett, Schmidt, Stellon, Hantke and Lara2017; Davidson et al., Reference Davidson, Shahar, Stayner, Chinman, Rakfeldt and Tebes2004; Johnson et al., Reference Johnson, Lamb, Marston, Osborn, Mason, Henderson and Christoforou2018; Kaplan, Salzer, Solomon, Brusilovskiy, & Cousounis, Reference Kaplan, Salzer, Solomon, Brusilovskiy and Cousounis2011; Mahlke et al., Reference Mahlke, Priebe, Heumann, Daubmann, Wegscheider and Bock2017; O'Connell et al., Reference O'Connell, Sledge, Staeheli, Sells, Costa, Wieland and Davidson2018; Pfeiffer et al., Reference Pfeiffer, King, Ilgen, Ganoczy, Clive, Garlick and Valenstein2019; Rivera, Sullivan, & Valenti, Reference Rivera, Sullivan and Valenti2007; Rogers et al., Reference Rogers, Maru, Johnson, Cohee, Hinkel and Hashemi2016; Russinova et al., Reference Russinova, Rogers, Gagne, Bloch, Drake and Mueser2014; Solomon & Draine, Reference Solomon and Draine1995), the effect size was significant, g = 0.18, 95% CI (0.10–0.26) (14 trials). However, for the subgroup of patients with depression (Dennis, Reference Dennis2003; Dennis et al., Reference Dennis, Hodnett, Kenton, Weston, Zupancic, Stewart and Kiss2009; Gjerdingen et al., Reference Gjerdingen, McGovern, Pratt, Johnson and Crow2013; Griffiths et al., Reference Griffiths, Mackinnon, Crisp, Christensen, Bennett and Farrer2012; Letourneau et al., Reference Letourneau, Stewart, Dennis, Hegadoren, Duffett-Leger and Watson2011; Ludman et al., Reference Ludman, Simon, Grothaus, Luce, Markley and Schaefer2007; Shorey et al., Reference Shorey, Chee, Ng, Lau, Dennis and Chan2019), no significant effects were detected, g = 0.19, 95% CI (−0.20 to 0.58) (7 trials). In the same line, no significant effects were found when pooling 12 trials that specifically reported depression outcomes. Subgroup analyses to examine potential moderators of intervention effects showed no significant differences between subgroups (see Appendix I). There were significant differences in effects based on RoB levels, p = 0.016; Q 2 = 8.30, with the three studies rated at low risk showing a significant effect of g = 0.52, 95% CI (0.29–0.76).
Inspection of funnel plots, Egger's test, p = 0.99, and the trim-and-fill procedure did not indicate significant publication bias (see Figure J1 in Appendix J). Removing one outlier (Letourneau et al., Reference Letourneau, Stewart, Dennis, Hegadoren, Duffett-Leger and Watson2011) did not have a substantial impact on the effect, g = 0.20, 95% CI (0.14–0.27).
Long-term effects for all clinical recovery outcomes indicated that the effect remained significant at six to nine months follow-up, g = 0.17, 95% CI (0.08–0.26), but not at 12 to 18 months follow-up, g = 0.10, 95% CI (−0.21 to 0.40).
Personal recovery
The pooled effect size at post-test across 19 PSI studies measuring personal recovery was significant, g = 0.15, 95% CI (0.04–0.27) (see Table 2 and Figure K1 in Appendix K). Heterogeneity was moderate, I2 = 43%, 95% CI (1–67), although the PI (−0.16–0.47) was wide and contained the null effect.
CAU, care-as-usual; CI, confidence interval; NA, not applicable; PI, prediction interval; WL, waiting list.
a According to the random-effects model.
b Both studies (k = 2) included individuals with perinatal depressive symptoms scoring above a cut-off on a standardized mental disorder symptom measure.
c Egger's test was not significant (p = 0.66) and the number of imputed studies using Duvall and Tweedie trim-and-fill procedure was 24.
d The p value for the between-group effect sizes is not significant (p = 0.79).
e Of the k = 7 studies, only one study included 18 months follow-up data, the remaining studies reported 12 months follow-up data.
For the subgroup of individuals with SMI (Boevink et al., Reference Boevink, Kroon, van Vugt, Delespaul and van Os2016; Castelein et al., Reference Castelein, Bruggeman, van Busschbach, van der Gaag, Stant, Knegtering and Wiersma2008; Cook et al., Reference Cook, Copeland, Floyd, Jonikas, Hamilton, Razzano and Boyd2012a, Reference Cook, Steigman, Pickett, Diehl, Fox, Shipley and Burke-Miller2012b; Corrigan et al., Reference Corrigan, Kraus, Pickett, Schmidt, Stellon, Hantke and Lara2017, Reference Corrigan, Sheehan, Morris, Larson, Torres, Lara and Doing2018; Davidson et al., Reference Davidson, Shahar, Stayner, Chinman, Rakfeldt and Tebes2004; Johnson et al., Reference Johnson, Lamb, Marston, Osborn, Mason, Henderson and Christoforou2018; Kaplan et al., Reference Kaplan, Salzer, Solomon, Brusilovskiy and Cousounis2011; Mahlke et al., Reference Mahlke, Priebe, Heumann, Daubmann, Wegscheider and Bock2017; Pfeiffer et al., Reference Pfeiffer, King, Ilgen, Ganoczy, Clive, Garlick and Valenstein2019; Rogers et al., Reference Rogers, Maru, Johnson, Cohee, Hinkel and Hashemi2016; Russinova et al., Reference Russinova, Rogers, Gagne, Bloch, Drake and Mueser2014; Rüsch et al., Reference Rüsch, Abbruzzese, Hagedorn, Hartenhauer, Kaufmann, Curschellas and Corrigan2014; Salzer et al., Reference Salzer, Rogers, Salandra, O'Callaghan, Fulton, Balletta and Brusilovskiy2016; van Gestel-Timmermans, Brouwers, van Assen, & van Nieuwenhuizen, Reference van Gestel-Timmermans, Brouwers, van Assen and van Nieuwenhuizen2012), the effect size was significant, g = 0.15, 95% CI (0.02–0.28) (17 trials). For individuals with depressive symptoms, the number of trials (Dennis, Reference Dennis2003; Griffiths et al., Reference Griffiths, Mackinnon, Crisp, Christensen, Bennett and Farrer2012) was too small to reliably detect effects. Pooling specific outcomes within personal recovery resulted in significant effects for hope outcomes, g = 0.13, 95% CI (0.03–0.22), but not for empowerment or the Recovery Assessment Scale. In subgroup analyses, we found no differences in the effect of PSIs among potential moderators (see Appendix I).
No indications of publication bias were observed, Egger's test, p = 0.66, see Figure J2 in Appendix J. The effect size did not substantially change when excluding one outlier (Salzer et al., Reference Salzer, Rogers, Salandra, O'Callaghan, Fulton, Balletta and Brusilovskiy2016), g = 0.13, 95% CI (0.05–0.21). Subgroup analyses did not detect differences in effects between RoB levels, although only one trial was rated at low risk and the impact of RoB is uncertain due to lack of power.
Long-term effects for all personal recovery outcomes showed nonsignificant results for both periods, six to nine months: g = 0.10, 95% CI (−0.10 to 0.30), and 12 to 18 months follow-up: g = 0.54, 95% CI (−0.33 to 1.41).
Functional recovery
The pooled effect size at post-test across 25 PSI studies measuring functional recovery was nonsignificant, g = 0.08, 95% CI (−0.02 to 0.18), with low heterogeneity, I2 = 36%, 95% CI (0–61) (see Table 3 and Figure K2 in Appendix K). The PI was wide and contained the null effect (−0.16 to 0.32).
CAU, care-as-usual; CI, confidence interval; NA, not applicable; PI, prediction interval; WL, waiting list.
a According to the random-effects model.
b k = 6 studies included individuals with depressive symptoms scoring above a cut-off on a standardized mental disorder symptom measure (of which k = 5 are on perinatal depression), and k = 1 study included adults with a clinical diagnosis.
c Egger's test was not significant (p = 0.74) and the number of imputed studies using Duvall and Tweedie trim-and-fill procedure was 26.
d The p value for the between-group effect sizes is not significant (p = 0.45).
e Of the k = 10 studies, only one study included 18 months follow-up data, the remaining studies reported 12 months follow-up data.
For the subgroup of patients with SMI (Boevink et al., Reference Boevink, Kroon, van Vugt, Delespaul and van Os2016; Cook et al., Reference Cook, Copeland, Floyd, Jonikas, Hamilton, Razzano and Boyd2012a; Corrigan et al., Reference Corrigan, Kraus, Pickett, Schmidt, Stellon, Hantke and Lara2017; Davidson et al., Reference Davidson, Shahar, Stayner, Chinman, Rakfeldt and Tebes2004; Johnson et al., Reference Johnson, Lamb, Marston, Osborn, Mason, Henderson and Christoforou2018; Kaplan et al., Reference Kaplan, Salzer, Solomon, Brusilovskiy and Cousounis2011; Mahlke et al., Reference Mahlke, Priebe, Heumann, Daubmann, Wegscheider and Bock2017; O'Connell et al., Reference O'Connell, Sledge, Staeheli, Sells, Costa, Wieland and Davidson2018; Pfeiffer et al., Reference Pfeiffer, King, Ilgen, Ganoczy, Clive, Garlick and Valenstein2019; Rivera et al., Reference Rivera, Sullivan and Valenti2007; Rogers et al., Reference Rogers, Maru, Johnson, Cohee, Hinkel and Hashemi2016; Russinova et al., Reference Russinova, Rogers, Gagne, Bloch, Drake and Mueser2014; Solomon & Draine, Reference Solomon and Draine1995), the effect size was significant, g = 0.18, 95% CI (0.10–0.26) (14 trials), but not for the six trials targeting individuals with depressive symptoms (Dennis, Reference Dennis2003; Dennis et al., Reference Dennis, Hodnett, Kenton, Weston, Zupancic, Stewart and Kiss2009; Gjerdingen et al., Reference Gjerdingen, McGovern, Pratt, Johnson and Crow2013; Griffiths et al., Reference Griffiths, Mackinnon, Crisp, Christensen, Bennett and Farrer2012; Letourneau et al., Reference Letourneau, Stewart, Dennis, Hegadoren, Duffett-Leger and Watson2011; Shorey et al., Reference Shorey, Chee, Ng, Lau, Dennis and Chan2019), g = 0.02, 95% CI (−0.34 to 0.37). No significant effect sizes were observed in any of the examined specific outcomes: for quality of life, g = 0.08, 95% CI (−0.04 to 0.19), social functioning, g = 0.07, 95% CI (−0.05 to 0.18), and loneliness, g = 0.09, 95% CI (−0.05 to 0.23). Conducting subgroup analyses, we found no differences in the effect of PSIs among potential moderators (see Appendix I).
No indications of publication bias were observed, Egger's test, p = 0.74, see Figure J3 in Appendix J. When one outlier was removed (Salzer et al., Reference Salzer, Rogers, Salandra, O'Callaghan, Fulton, Balletta and Brusilovskiy2016), the effect size remained significant, g = 0.06, 95% CI (−0.01 to 0.13). Subgroup analyses showed no differences in effects between RoB levels. Pooling the three trials rated at low risk resulted in a nonsignificant effect of g = 0.19, 95% CI (−0.37 to 0.76).
Long-term effects for all functional recovery outcomes demonstrated a significant effect size at six to nine months follow-up, g = 0.14, 95% CI (0.01-0.27) (17 trials). At 12 to 18 months follow-up, effects were nonsignificant, g = 0.38, 95% CI (−0.21 to 0.98).
Discussion
In this comprehensive meta-analysis of 28 RCTs (n = 4152), PSIs for patients covering a broad spectrum of mental illnesses were associated with superior outcomes compared with control conditions regarding: (a) clinical recovery at post-test, and six to nine months follow-up; (b) personal recovery at post-test; and (c) functional recovery limited to six to nine months follow-up. When examining specific groups, we saw that specifically in the SMI patients – individuals with serious mental disorders – peer support was associated with significant superiority to control conditions at post-intervention across all three recovery categories. For the subgroup of individuals with elevated depressive symptoms – most of them being perinatal women – no significant effects were found in any of the recovery categories. Nonetheless, the number of trials targeting this group was small and nonsignificant results could be due to a lack of power. Also, the analyses for more category-specific outcomes within each main outcome category were exploratory due to the small number of studies. Only the effect size for hope, considered part of personal recovery, was significant.
We found no significant differences in the effect of PSIs among potential moderators (e.g. intervention delivery) for any of the outcomes, which could suggest that common values of peer support exceed disorder-specific needs and the intervention type. However, subgroup analyses should be considered with caution, since the number of trials for some categories was small and these analyses are likely underpowered. Accordingly, we could not analyze differences in effects between internet-based PSIs (2 trials) and traditional face-to-face interventions (16 trials; see Appendix I). Since the evidence-base for eHealth is increasing (Chan et al., Reference Chan, Jiang, Lee, Ramachandran, Teo, Seah and Wang2022; Deady et al., Reference Deady, Choi, Calvo, Glozier, Christensen and Harvey2017; Massoudi, Holvast, Bockting, Burger, & Blanker, Reference Massoudi, Holvast, Bockting, Burger and Blanker2019) and digital PSIs for individuals with SMI seem to be associated with positive changes for both clinical and psychosocial outcomes (Fortuna et al., Reference Fortuna, Naslund, LaCroix, Bianco, Brooks, Zisman-Ilani and Deegan2020), the effectiveness for technology-based PSIs should be further investigated.
The pooled effect sizes, that were confirmed in sensitivity analyses, were small ranging from g = 0.15 for overall personal recovery to g = 0.19 for overall clinical recovery at post-test. A surprising finding was low to moderate heterogeneity, suggesting that the effects were consistent across wide-varying studies. However, due to the relatively large width of the 95% CIs, caution must be applied. Moreover, although the effect size for clinical recovery appeared to be more robust, the prediction intervals for personal and functional recovery suggested that the effects are considerably uncertain. In addition, the risk of bias was high for the majority of included studies and we could not reliably estimate its impact on the results of the meta-analysis.
Operating with a broad scope, including the largest number of trials on peer support to date, we found a significant though small effect size for clinical recovery. This was not detected in previous meta-analyses (Burke et al., Reference Burke, Pyle, Machin, Varese and Morrison2019; Chien et al., Reference Chien, Clifton, Zhao and Lui2019; Fuhr et al., Reference Fuhr, Salisbury, De Silva, Atif, van Ginneken, Rahman and Patel2014; Huang et al., Reference Huang, Yan, Tian, Lei, Yang, Liu and Lei2020; Lloyd-Evans et al., Reference Lloyd-Evans, Mayo-Wilson, Harrison, Istead, Brown, Pilling and Kendall2014; Lyons et al., Reference Lyons, Cooper and Lloyd-Evans2021; White et al., Reference White, Foster, Marks, Morshead, Goldsmith, Barlow and Gillard2020), possibly due to lack power. Considering the efficacy of peer support for personal recovery, we confirmed and extended the results of previous meta-analyses (Bryan & Arkowitz, Reference Bryan and Arkowitz2015; Burke et al., Reference Burke, Pyle, Machin, Varese and Morrison2019; Fuhr et al., Reference Fuhr, Salisbury, De Silva, Atif, van Ginneken, Rahman and Patel2014; Lloyd-Evans et al., Reference Lloyd-Evans, Mayo-Wilson, Harrison, Istead, Brown, Pilling and Kendall2014; Lyons et al., Reference Lyons, Cooper and Lloyd-Evans2021; White et al., Reference White, Foster, Marks, Morshead, Goldsmith, Barlow and Gillard2020). So far, outcomes for functional recovery are scarcely addressed in peer support meta-analyses (Fuhr et al., Reference Fuhr, Salisbury, De Silva, Atif, van Ginneken, Rahman and Patel2014; Lyons et al., Reference Lyons, Cooper and Lloyd-Evans2021). Whilst only valid for the subgroup SMI and long-term analysis, we found significant effect sizes for functional recovery, with quality of life as the most important outcome parameter. Overall, results indicate that peer support is of clinical relevance for individuals with mental illness, and not limited to reinforcing personal recovery following the generally accepted recovery-oriented approach (Leamy, Bird, Le Boutillier, Williams, & Slade, Reference Leamy, Bird, Le Boutillier, Williams and Slade2011; van Weeghel et al., Reference van Weeghel, van Zelst, Boertien and Hasson-Ohayon2019).
Limitations
The results of this study should be considered with caution because of several important limitations. First, measures for clinical, personal, and functional recovery differed considerably across studies. Second, long-term effects were limited to smaller samples of trials up to 12 months follow-up. Third, a major limitation of this study is the high risk of bias for the majority of trials, with limited reporting for many of the risk of bias items. Since peer support has an informal nature, it is difficult to quantitatively analyze these interventions. An established protocol would help to quantify variables that could be evaluated in trials, but this would restrict the open nature of PSIs. Still, since peer support has been increasingly considered an essential element for recovery there have been attempts to structure and professionalize PSIs (Chinman et al., Reference Chinman, McCarthy, Mitchell-Miland, Daniels, Youk and Edelen2016; SAMHSA, 2015). However, doubts remain because the core of peer support is its naturalistic approach (Fortuna, Solomon, & Rivera, Reference Fortuna, Solomon and Rivera2022). The feasibility, acceptability, and benefits of structuring and professionalizing PSIs need further investigation. To improve the quality of studies, future research should implement clinician-rated instruments and prospective registration in clinical trial registries. Finally, though comparing the efficacy of PSIs with clinical psychotherapies seems relevant for implementing or referring to PSIs in mental health care, the number of trials was too small to conduct a meta-analysis for RCTs with a clinician-led comparator.
Conclusions
Engaging in a peer support intervention may be effective for reducing clinical mental illness symptoms, improving overall personal recovery, and more specifically hope. In particular for individuals with SMI, peer support demonstrated probable efficacy across the three recovery categories. Although the effects were small, peer support is a potentially cost-effective and relatively easy-to-implement intervention, and may complement professional treatment. Therapists, general practitioners, and employees of recovery-oriented services may refer their clients to peer support initiatives to expand the individuals' context to work on recovery when coping with mental illness.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/S0033291722002422.
Author contributions
Dr Groeneweg, Dr Cuijpers, and Dr Spijker conceptualized and developed the study design. Smit (MSc) and Miguel (MSc) analyzed and interpreted the data and drafted the manuscript. Smit (MSc) and Miguel (MSc) had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Dr Cuijpers, Dr Spijker, Dr Vrijsen, and Dr Groeneweg supervised the study by providing intellectual content, reviewing data analysis and interpretation and critical revision of the manuscript.
Financial support
Smit, MSc is funded by a PhD Studentship of Pro Persona mental health care, which is partly funded by ZonMw, the Dutch organization for health research and health innovation. Research reported in this publication was supported by the Dutch Depression Association, who received funding from Janssen-Cilag to conduct this project. The funder had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review or approval of the manuscript and decision to submit the manuscript for publication. No other disclosures were reported.
Conflicts of interest
Dr Groeneweg reports having a management function of the Research Committee of the Dutch Depression Association. Dr Cuijpers received expense allowances for his membership of the Board of Directions of ‘Mind.nl,’ for being Chair of the Research Committee of the Dutch Council for military care and research, and for being Chair of the Mental Health Priority Area of the Wellcome Trust in London, England, in 2018. In addition, he received royalties for books he has authored or coauthored and for occasional workshops and invited addresses.