Meta-analytical prognostic accuracy of the Comprehensive Assessment of at Risk Mental States (CAARMS): The need for refined prediction

D. Oliver; M. Kotlicka-Antczak; A. Minichino; G. Spada; P. McGuire; P. Fusar-Poli

doi:10.1016/j.eurpsy.2017.10.001

Meta-analytical prognostic accuracy of the Comprehensive Assessment of at Risk Mental States (CAARMS): The need for refined prediction

Published online by Cambridge University Press: 01 January 2020

D. Oliver ,

M. Kotlicka-Antczak ,

A. Minichino ,

G. Spada ,

P. McGuire and

P. Fusar-Poli

Show author details

D. Oliver*: Affiliation:
aEarly Psychosis: Interventions & Clinical-detection (EPIC) lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King's College London, LondonSE5 8AF, United Kingdom
M. Kotlicka-Antczak: Affiliation:
bMedical University of Lodz, Department of Affective and Psychotic Disorders, Lodz, Poland
A. Minichino: Affiliation:
cDepartment of Neurology and Psychiatry, Sapienza University of Rome, Rome, Italy
G. Spada: Affiliation:
aEarly Psychosis: Interventions & Clinical-detection (EPIC) lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King's College London, LondonSE5 8AF, United Kingdom
P. McGuire: Affiliation:
dDepartment of Psychosis Studies, IoPPN, King's College London, LondonSE5 8AF, United Kingdom eOASIS Service, South London and the Maudsley NHS National Health Service Foundation Trust, London, United Kingdom fNational Institute for Health Research (NIHR) Biomedical Research Centre for Mental Health, IoPPN, King's College London, SE5 8AF, United Kingdom
P. Fusar-Poli: Affiliation:
aEarly Psychosis: Interventions & Clinical-detection (EPIC) lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King's College London, LondonSE5 8AF, United Kingdom eOASIS Service, South London and the Maudsley NHS National Health Service Foundation Trust, London, United Kingdom fNational Institute for Health Research (NIHR) Biomedical Research Centre for Mental Health, IoPPN, King's College London, SE5 8AF, United Kingdom
*: *Corresponding author. E-mail address: dominic.a.oliver@kcl.ac.uk (D. Oliver).

Article contents

Abstract
Introduction
Methods
Results
Discussion
Conclusion
Disclosure of interest
Acknowledgements
Supplementary data
References

Abstract

Primary indicated prevention is reliant on accurate tools to predict the onset of psychosis. The gold standard assessment for detecting individuals at clinical high risk (CHR-P) for psychosis in the UK and many other countries is the Comprehensive Assessment for At Risk Mental States (CAARMS). While the prognostic accuracy of CHR-P instruments has been assessed in general, this is the first study to specifically analyse that of the CAARMS. As such, the CAARMS was used as the index test, with the reference index being psychosis onset within 2 years. Six independent studies were analysed using MIDAS (STATA 14), with a total of 1876 help-seeking subjects referred to high risk services (CHR-P+: n = 892; CHR-P–: n = 984). Area under the curve (AUC), summary receiver operating characteristic curves (SROC), quality assessment, likelihood ratios, and probability modified plots were computed, along with sensitivity analyses and meta-regressions. The current meta-analysis confirmed that the 2-year prognostic accuracy of the CAARMS is only acceptable (AUC = 0.79 95% CI: 0.75–0.83) and not outstanding as previously reported. In particular, specificity was poor. Sensitivity of the CAARMS is inferior compared to the SIPS, while specificity is comparably low. However, due to the difficulties in performing these types of studies, power in this meta-analysis was low. These results indicate that refining and improving the prognostic accuracy of the CAARMS should be the mainstream area of research for the next era. Avenues of prediction improvement are critically discussed and presented to better benefit patients and improve outcomes of first episode psychosis.

Keywords

Psychosis CAARMS Prevention Prognostic accuracy Clinical utility

Type: Review
Information: European Psychiatry , Volume 49 , March 2018 , pp. 62 - 68

DOI: https://doi.org/10.1016/j.eurpsy.2017.10.001 [Opens in a new window]
Creative Commons: This is an open access article under the CC BY license
Copyright: Copyright © 2017 The Authors

1. Introduction

Psychosis is a severe psychiatric condition and there is limited evidence that treatments are successful in improving patients’ functioning once the disorder is established Reference Insel[1]. Intervening in the earlier phases is therefore the only viable possibility to substantially alter the course of the disorder [Reference Fusar-Poli, McGorry and Kane2, Reference Millan, Andrieux, Bartzokis, Cadenhead, Dazzan and Fusar-Poli3]. Within early intervention, a key focus for improving the outcome has been primary indicated prevention [Reference Fusar-Poli, McGorry and Kane2, Reference McGorry4, Reference Fusar-Poli5]. Primary indicated prevention allows for early intervention for those at clinical high risk of developing psychosis (CHR-P), with greater scope for improving outcomes. To do this effectively, the first necessary step is to reach an accurate, robust prognostic identification of individuals meeting CHR-P criteria who will subsequently develop psychosis or not. Ideally, all subjects who will actually develop psychosis should be classified as “at risk” (CHR-P+) while those not developing an established psychosis should be classified as “not at risk” (CHR-P–). These key concepts involved in prognostic reasoning in the CHR-P have been detailed and presented in a recent paper by our group Reference Fusar-Poli and Schultze-Lutter[6].

Prognostic prediction is used in many branches of medicine to identify individuals who may develop a particular disease Reference Arbyn, Verdoodt, Snijders, Verhoef, Suonio and Dillner[7]. For example, fasting glucose, oral glucose tolerance test and glycated haemoglobin are used to detect individuals at high risk for developing diabetes (pre-diabetes or intermediate hyperglycaemia) Reference Tabák, Herder, Rathmann, Brunner and Kivimäki[8] and systolic blood pressure and ratio of total serum cholesterol to high density lipoprotein cholesterol levels are used to detect individuals at high risk for developing cardiovascular disease Reference Hippisley-Cox, Coupland, Vinogradova, Robson, May and Brindle[9]. However, unlike these other fields, there are no biological tests to assess the risk of developing mental disorders Reference Fusar-Poli and Meyer-Lindenberg[10], which is instead reliant on semi-structured CHR-P psychometric interviews, such as the CAARMS (Comprehensive Assessment for At Risk Mental States) Reference Yung, Yuen, McGorry, Phillips, Kelly and Dell’Olio[11]. Recently, the CAARMS has become the mainstream tool to detect CHR-P individuals in the UK, recommended by international bodies, such as NICE [12]. Therefore, understanding its exact psychometric properties is of paramount clinical relevance. The CAARMS shows excellent inter-rater reliability when performed by trained raters (0.85) Reference Yung, Stanford, Cosgrave, Killackey, Phillips and Nelson[13]. However, its prognostic accuracy is uncertain. A recent meta-analysis by our lab Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14] investigated the prognostic accuracy of CHR-P instruments, showing generally excellent prognostic performance of these instruments. However, CHR-P tools were grouped together including the CAARMS Reference Yung, Yuen, McGorry, Phillips, Kelly and Dell’Olio[11], the SIPS (Structured Interview for Prodromal Syndromes) Reference Miller, McGlashan, Rosen, Cadenhead, Cannon and Ventura[15] and the SPI-A (Schizophrenia Proneness Instrument-Adult Version) Reference Schultze-Lutter, Ruhrmann, Picker and Klosterkötter[16]. This was due to the fact that there were not enough studies contributing data to assess the meta-analytical prognostic accuracy of the CAARMS specifically. Given the marked differences between the CAARMS and other CHR-P instruments Reference Fusar-Poli, Cappucciati, Rutigliano, Lee, Beverly and Bonoldi[17], in particular with respect to the functional deterioration criterion Reference Fusar-Poli, Rocchetti, Sardella, Avila, Brandizzi and Caverzasi[18], it is possible that the previously reported meta-analytical prognostic accuracy is not completely accurate. In addition, the previous meta-analysis combined multiple follow-up time points, and even though meta-regressions of this variable found no significant effect, validity of the prognostic accuracy results would be improved by using a more defined and consistent follow-up time Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14].

The current study tackles these caveats and advances knowledge in the psychometric properties of the CAARMS. We capitalize on recently published CAARMS studies reporting useful and innovative meta-analytical data to conduct a meta-analytical prognostic accuracy analysis of the CAARMS at two-year follow-up. This is the period of time during which most transitions to psychosis occur Reference Kempton, Bonoldi, Valmaggia, McGuire and Fusar-Poli[19]. The results will hopefully support the refinement of psychosis prediction and therefore facilitate indicated primary prevention in CHR-P individuals.

2. Methods

2.1. Search strategy

Two investigators (DO, PFP) conducted a two-step literature search. At a first step, the Web of Knowledge database was searched, incorporating both the Web of Science and Medline. The search was extended until August 2017, only including abstracts in English. The electronic research adopted several combinations of the following keywords: “at risk mental state”, “psychosis risk”, “prodrome”, “prodromal psychosis”, “ultra-high risk”, “high risk”, “help-seeking”, “diagnostic accuracy”, “sensitivity”, “specificity”, “psychosis prediction”, “psychosis onset”. The second step involved the use of Scopus to investigate citations of previous systematic reviews on transition outcomes in CHR-P subjects and a manual search of the reference lists of the retrieved articles.

Articles identified through these two steps were then screened for the selection criteria on the basis of abstract reading. The articles surviving this selection were assessed for eligibility on the basis of full text reading. To achieve a high standard of reporting, we adopted the Meta-analysis Of Observational Studies in Epidemiology (MOOSE) checklist Reference Stroup, Berlin, Morton, Olkin, Williamson and Rennie[20].

2.2. Selection criteria

Studies were eligible for inclusion if:

• they were reported in original articles, written in English;
• they had used the CAARMS (index test) in the same pool of referrals;
• they had followed up both CHR-P+ and CHR-P– subjects for psychosis onset (reference index) using established international diagnostic manuals (ICD or DSM);
• they had reported sufficient prognostic accuracy data at 2-year follow-up.

With respect to this last point, when data were not directly presented, they were indirectly extracted from associated data. Additionally, we contacted all corresponding authors to request additional data when needed.

We excluded:

• abstracts, reviews, articles in a language other than English;
• studies in which interviews were not conducted in the same pool of referrals or that used an external CHR-P group of healthy controls;
• studies with overlapping datasets.

In case of multiple publications deriving from the same study population, we selected the article reporting the largest and most recent data set. The literature search was summarized according to PRISMA guidelines Reference Moher, Liberati, Tetzlaff and Altman[21].

2.3. Recorded variables

Data extraction was independently performed by two investigators (DO, PFP). Data included author, year of publication, characteristics of subject samples (baseline sample sizes, mean age and age range, proportion of females), diagnostic criteria used at follow-ups to assess the psychotic outcome, prognostic accuracy data (number of true and false positives, true and false negatives or associated data) and quality assessment conducted with the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) checklist Reference Whiting, Rutjes, Westwood, Mallett, Deeks and Reitsma[22].

2.4. Statistical analysis

The statistical analysis followed the Cochrane Guidelines for Systematic Reviews of Diagnostic Test Accuracy, Version 1.0 Reference Macaskill, Gatsonis, Deeks, Harbord, Takwoingi, Deeks, Bossuyt and Gatsonis[23] and the Methods Guide for Authors of Systematic Reviews of Medical Tests by the Agency for Healthcare Research and Quality (chapter 8) Reference Smetana, Umscheid, Chang and Matchar[24]. Evaluating test accuracy requires knowledge of two quantities: the test's sensitivity (Se) and specificity (Sp). Meta-analysis methods for diagnostic test accuracy thus have to deal with two summary statistics simultaneously rather than one Reference Macaskill, Gatsonis, Deeks, Harbord, Takwoingi, Deeks, Bossuyt and Gatsonis[23]. Methods for undertaking analyses, which account for both Se and Sp, the relationship between them, and the heterogeneity in test accuracy, require fitting advanced hierarchical random effects models Reference Macaskill, Gatsonis, Deeks, Harbord, Takwoingi, Deeks, Bossuyt and Gatsonis[23].

For each study, we constructed a two-by-two table, which included true positive, false positive, true negative, and false negative values. The baseline sample size was conservatively used as the base reference.

Data were then analysed with MIDAS (Meta-analytical Integration of Diagnostic Accuracy Studies) Reference Dwamena[25], a comprehensive program of statistical and graphical routines for undertaking meta-analysis of diagnostic/prognostic test performance in STATA 14 software Reference StataCorp.[26]. The index tests of CHR-P status (CHR-P+ or CHR-P–) and reference tests of transition to psychosis according to international diagnostic manuals (ICD or DSM as gold standard) were dichotomous.

Primary data synthesis was performed within the bivariate mixed-effects regression framework for the logit transforms of Se and Sp. In addition to accounting for study size, the bivariate model estimates and incorporates the intrinsic negative correlation that may arise between Se and Sp within studies (threshold effect) Reference Harbord and Whiting[27], as a result of differences in the test threshold between studies Reference Janda, Shahidi, Gin and Swiston[28]. The bivariate model allows for heterogeneity beyond chance as a result of clinical and methodological differences between studies Reference Janda, Shahidi, Gin and Swiston[28].

We estimated the summary Se and Sp and the hierarchical SROC (summary receiver operator characteristic) curves [Reference Macaskill, Gatsonis, Deeks, Harbord, Takwoingi, Deeks, Bossuyt and Gatsonis23, Reference Higgins, Thompson, Deeks and Altman32]. A SROC graph across each predictor, with the y-axis representing the predictor's Se and the x-axis representing 1–specificity, was used to plot a 95% confidence region and a 95% prediction region around the summary estimates to illustrate the precision with which the summary values were estimated (confidence ellipse of a mean), and to show the amount of between-study variation (prediction ellipse; the likely range of values for a new study). We also estimated the AUC (area under the curve). The AUC serves as a global measure of test performance. Values in the range of 0.9–1 are considered outstanding, between 0.8 and 0.9 are considered excellent, between 0.7 and 0.8 are considered acceptable Reference Hosmer, Lemeshow and May[29].

Heterogeneity across studies was assessed using the I ², with values of 25%, 50% and 75% representing mild, moderate and severe inconsistency, respectively Reference Lipsey and Wilson[30]. Within MIDAS, forest plots and heterogeneity statistics can be created for each test performance parameter individually or may be displayed as paired plots. Meta-regressions were used to examine the influence of mean age, gender (% females), sample size, and quality assessment (QUADAS) on meta-analytical estimates. Furthermore, we investigated the prognostic accuracy difference between CAARMS based studies and studies employing the SIPS, as detected in the previous meta-analysis Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14]. To control for biases associated with imbalanced datasets Reference Bekkar, Djemaa and Alitouche[31], we further tested the impact of the proportion of CHR-P+ subjects in the overall samples. The meta-regressions were used if there was substantial heterogeneity (I ² > 50%) Reference Higgins, Thompson, Deeks and Altman[32] and when more than 10 studies were available.

Sensitivity analyses (i.e., exclusion of outliers and rerunning of the model) were conducted to further explore heterogeneity. We did not test publication bias Reference Deeks, Macaskill and Irwig[33], because no proven statistical method exists for this type of meta-analysis Reference Wanders, East, Uitentuis, Leeflang and Dekker[34].

In a second step, we employed the probability-modifying plot to estimate the clinical or patient-relevant utility of the CAARMS in subjects seeking help at CHR-P services.

The clinical utility was evaluated using the positive and negative likelihood ratios (LR+ and LR–) to calculate post-test probability (post-TP) based on Bayes’ theorem (with pre-test probability, pre-PT, being the prevalence of the condition in the target population), as follows: post-TP = LR × pre-TP/[(1 – pre-TP)+(pre-TP × LR)] Reference Harbord and Whiting[27]. Specifically, the probability-modifying plot Reference Dwamena[25] is a graphical sensitivity analysis of the test's predictive values across a baseline psychosis risk continuum in people seeking help at CHR-P services. It depicts separate curves for positive and negative tests and uses general summary statistics (i.e., unconditional positive and negative predictive values, NPV and PPV, which permit underlying psychosis risk heterogeneity) to evaluate the effect of the CHR-P assessment on predictive values Reference Li, Fine and Safdar[35]. The pre-TP probability of psychosis risk in subjects seeking help at early detection services was computed in the current dataset as the proportion of subjects developing psychosis on the total baseline sample (CHR-P+ plus CHR-P–) Reference Dwamena[25].

Statistical tests were two-sided and statistical significance was defined as P values < 0.05.

3. Results

3.1. Database

The literature search produced 6 independent studies [Reference Fusar-Poli, Rutigliano, Stahl, Davies, De Micheli and Ramella-Cravaro36–Reference Lee, Rekhi, Mitter, Bong, Kraus and Lam41] that met inclusion criteria with a total of 1876 subjects (CHR+: n = 892; CHR–: n = 984) referred to clinical high risk services. The dataset was balanced with CHR+ individuals composing 47.5% of the total subjects. The characteristics of the studies are reported in the Table 1 while the PRISMA diagram is depicted in Fig. 1. The MOOSE checklist is reported in the Table 1. The detailed QUADAS assessment is reported in the eTable 2 and eFig. 1.

3.2. Prognostic accuracy of the CAARMS at 2-years

The summary meta-analytical estimate of Se at 2 years (0.86, 95% CI = 0.76–0.92) was outstanding with the 2-year AUC (0.79, 95% CI = 0.75–0.83) acceptable but the estimate of 2-year Sp (0.55, 95% CI = 0.48–0.63) was poor (Fig. 2). There was severe heterogeneity present in this analysis (I ² = 93.28%, 95% CI = 89.42–97.15%).

3.3. Clinical utility of the CAARMS at 2 years

The 2-year psychosis transition risk in the 1876 subjects was 0.09 (95% CI = 0.05–0.13). On the basis of the prior distribution, the continuous relationship between pre-TP and post-TP probability is summarized in Fig. 3. Being CHR-P+ was associated with a 0.16 (95% CI = 0.10–0.22) risk of developing psychosis within 2 years, yet a small LR+ of just 1.9 (95% CI = 1.5–2.4) while being CHR-P– was associated with a 0.03 (95% CI = 0.02–0.05) risk of transition to psychosis with a moderate LR– of 0.25 (95% CI = 0.13–0.48).

3.4. Meta-regressions and sensitivity analyses

Sensitivity analysis suggested that one study Reference Lee, Rekhi, Mitter, Bong, Kraus and Lam[41] was influential with a Cook's distance > 1. While we hypothesised this was due to the study reporting 0 false negatives, we were unable to test the effect of false negatives through meta-regression due to low number of studies. Similarly, we were unable to perform meta-regressions for age, gender, QUADAS score or sample size as there were fewer than 10 studies contributing data. As indicated in the methods, we were able to perform a meta-regression comparing the prognostic accuracy of the CAARMS vs. that of the SIPS, using the studies reporting SIPS data [Reference Woods, Addington, Cadenhead, Cannon, Cornblatt and Heinssen42–Reference Simon, Grädel, Cattapan-Ludewig, Gruber, Ballinari and Roth46] only as identified in our previous study (n = 5, CHR+: n = 783; CHR–: n = 360). As indicated in Fig. 4, Se was significantly higher (P < 0.001) for the SIPS (n = 5, mean = 0.95, 95% CI 0.91–0.99) compared to the CAARMS (n = 6, mean = 0.87, 95% CI 0.79–0.96), while Sp was comparably (P = 0.27) low in the SIPS (n = 5, mean = 0.45, 95% CI = 0.38–0.53) and in the CAARMS (n = 6, mean = 0.55, 95% CI 0.48–0.62).

4. Discussion

This is the first meta-analysis specifically investigating the prognostic accuracy of the CAARMS for the prediction of psychosis. We found 6 studies that investigated prognostic accuracy of the CAARMS at two-year follow-up, which contributed a relatively large database of 1876 subjects overall, with 892 considered CHR-P+ and 984 CHR-P–. Prognostic accuracy of the CAARMS in terms of AUC was found to be only acceptable (0.79), mostly mediated by its substantial ability to rule out psychosis (i.e. LR– was relatively small and Se high). However, this was at the expense of ruling in psychosis (i.e. LR+ was small and Sp was poor). While prognostic accuracy was overall acceptable, this study indicates that refining the prediction of outcomes should be the key priority of future research in this field.

Table 1 Independent studies included in the meta-analysis (studies n=6; 1876 subjects; CHR+: n=892; CHR–: n=984).

CAARMS: Comprehensive Assessment of At Risk Mental States; DSM-IV: Diagnostic and Statistical Manual of Mental Disorders; ICD-10: International Classification of Diseases Tenth Revision; N/A: Not available.

a Updated follow-up data provided by the authors.

Fig. 1 PRISMA flow chart.

The primary aim of the study was to synthesize available data for the prognostic accuracy of the CAARMS in determining psychosis risk 2 years after young help-seeking subjects presented to CHR-P services. As noted in the introduction, our recent meta-analysis Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14] looked into the prognostic accuracy of CHR-P instruments as a collective. The current study advances knowledge indicating that the exact prognostic accuracy of the CAARMS alone is weaker (0.79) than the overall value previously observed when the CHR-P instruments were pooled together (0.90). Although not as outstanding as before, the AUC value here reported is still considered to be acceptable for a diagnostic test and is comparable to other prognostic tools used in different areas of medicine, such as the AUC = 0.76 attributed to the Cambridge risk score for pre-diabetes Reference Thomas, Hyppönen and Power[47]. In a similar fashion, we found that the Se (0.86) of the CAARMS alone was less impressive than the Se (0.96) of CHR-P instruments assessed in the previous meta-analysis. Interestingly, there was an apparent minor increase in Sp (0.55 for CAARMS alone compared to 0.47 for CHR-P instruments generally) Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14]. The lower AUC compared to the previous general estimate may reflect profound operationalization differences between the CAARMS and the other CHR-P instruments. For example, a comparative analysis between the CAARMS and the SIPS confirmed caseness discrepancies between the two instruments Reference Fusar-Poli, Cappucciati, Rutigliano, Lee, Beverly and Bonoldi[17], mostly due to different definition of brief limited intermittent psychotic cases, ascertainment of comorbidities Reference Fusar-Poli, Cappucciati, Rutigliano, Lee, Beverly and Bonoldi[17] and of functional level at intake Reference Fusar-Poli, Rocchetti, Sardella, Avila, Brandizzi and Caverzasi[18]. To directly test the effect of these differences on the prognostic performance, in the current study, we performed the first meta-analytical comparison of Se and Sp across the CAARMS and SIPS, using previously published SIPS data Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14]. We found that Se was higher in the SIPS compared to the CAARMS, while there were no substantial differences in Sp. Overall, it is unlikely that these differences may account for significant differences in the positive predictive values of the two instruments, as confirmed by previous meta-analyses in CHR-P+ samples Reference Fusar-Poli, Rocchetti, Sardella, Avila, Brandizzi and Caverzasi[18].

Fig. 2 Meta-analytical summary receiver operating characteristic (SROC) curve of CAARMS assessment. SENS – sensitivity, SPEC – specificity, AUC – area under the curve, 1 – Yung et al., 2008 Reference Yung, Nelson, Stanford, Simmons, Cosgrave and Killackey[39], 2 – Lee et al., 2013 [12], 3 – Fusar-Poli et al., 2017 Reference Fusar-Poli, McGorry and Kane[2], 4 – Francesconi et al., 2017 Reference Francesconi, Minichino, Carrión, Delle Chiaie, Bevilacqua and Parisi[37], 5 – Kotlicka-Antczak et al., 2015 Reference Kotlicka-Antczak, Pawełczyk, Rabe-Jabłońska and Pawełczyk[40], 6 – Spada et al., 2016 Reference Spada, Molteni, Pistone, Chiappedi, McGuire and Fusar-Poli[38].

Fig. 3 Meta-analytical probability-modifying plot, illustrating the relationship between the pre-test probability (pre-TP) and post-test probability (post-TP) i.e. psychosis risk in help-seeking subjects following CAARMS assessment, computed as the likelihood of a positive (above diagonal line; LR+) or negative (below diagonal line; LR–) test result over the pre-TP between 0 and 1.

In a second step, we estimated the clinical utility of the CAARMS. As previously reported by our lab, clinical utility is not static, instead reliant on the underlying pre-test risk in any given population [Reference Fusar-Poli, Rutigliano, Stahl, Schmidt, Ramella-Cravaro and Hitesh49–Reference Fusar-Poli, Palombini, Davies, Oliver, Bonoldi and Ramella-Cravaro51]. We found that being classified as CHR+ by the CAARMS is associated with a 16.4% risk of developing psychosis within 2 years, which is lower than the 29.1% 2-year transition risk previously reported Reference Fusar-Poli, Bonoldi, Yung, Borgwardt, Kempton and Valmaggia[52]. This was driven by a small LR+ (1.9), similar to the LR+ seen previously (1.82) Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14]. CHR– individuals had 3.38% 2-year transition rate and this was driven by a moderate LR– (0.25), which was not as large as the LR– for CHR assessments as a whole (0.09) Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14]. These findings taken altogether indicate that the acceptable prognostic accuracy is due to an imbalance between Se and Sp and LR+ and LR–, with the CAARMS being a valuable tool to correctly identify individuals who will develop psychosis however showing only modest ability to identify those who will not.

Fig. 4 Meta-regression analyses comparing at meta-analytical level the 2-year sensitivity and specificity of the CAARMS vs. SIPS for the prediction of psychosis. SIPS data taken from Reference Fusar-Poli, Cappucciati, Rutigliano, Schultze-Lutter, Bonoldi and Borgwardt[14].

On a pragmatic level, the results of this meta-analysis show that the only acceptable prognostic accuracy of the CAARMS needs improving through a refined assessment of psychosis risk. An improved detection of individuals who will transition would lead to improved clinical and research opportunities. For example, a greater proportion of true positives would lead to more efficient primary indicated prevention as well as a more homogenous CHR-P group [Reference Fusar-Poli, Cappucciati, Borgwardt, Woods, Addington and Nelson48, Reference Fusar-Poli, Cappucciati, De Micheli, Rutigliano, Bonoldi and Tognin53] for developing putative treatments. This manuscript has the clinical potential to be the reference point for refining future versions of the CAARMS or for the development of refined prognostic tools and assessments. To improve prediction of psychosis, it seems necessary to tailor it on an individual level. To date, the CAARMS has just considered CHR-P+ individuals as belonging to a whole group. However, it is now clear that such an assumption is incorrect, given the profound difference in level of psychosis risk observed across different CHR-P+ subgroups [Reference Fusar-Poli, Cappucciati, Borgwardt, Woods, Addington and Nelson48, Reference Fusar-Poli, Cappucciati, De Micheli, Rutigliano, Bonoldi and Tognin53]. Furthermore, to date, psychosis prediction has been limited to the assessment and rating of CHR-P symptoms and signs. However, it is evident that these are only epiphenomena of underlying neurobiological and psychological processes that may characterize the onset of psychosis in vulnerable individuals. Research evidence in the field of risk and protective factors associated with an impending vulnerability to psychosis has accumulated over the past few decades and only recently has it been systematically assessed. In a recent large-scale meta-analysis, our lab has stratified the level of evidence for associations of several risk or protective factors and established psychotic disorders Reference Ramella-Cravaro, Radua, Ionnidis, Reichenberg, Phiphopthatsanee and Amir[54]. This study may lay the groundwork for investigating how specific risk or protective factors accumulate in CHR-P+ individuals explaining their increased liability to develop psychosis. In a first attempt by our lab Reference Fusar-Poli, Tantardini, De Simone, Ramella-Cravaro, Oliver and Kingdon[55], we reviewed forty-four studies encompassing 170 independent datasets and 54 risk/protective factors in CHR-P+ individuals. We showed that CHR-P+ individuals were more likely to show obstetric complications, tobacco use, physical inactivity, childhood trauma/emotional abuse/physical neglect, high perceived stress, childhood and adolescent low functioning, affective comorbidities, male gender, single status, unemployment and low educational level as compared to controls. The differential accumulation of these factors in each CHR-P+ individuals are likely to account for the different outcomes observed in these samples, such as psychosis onset, persistence of CHR-P+ features or remission. A refinement of psychosis prediction in these samples would inevitably require a careful investigation of these factors beyond the rating of severity and frequency of CHR-P+ symptoms as currently required by the CAARMS.

4.1. Limitations

Some limitations of this meta-analysis need to be acknowledged. Firstly, only 6 studies were able to be synthesised for this meta-analysis, and although supplying a healthy number of subjects, power could be questioned. Another limitation of our meta-analysis is the small sample size. However, conducting longitudinal studies in individuals assessed for a CHR-P state but not meeting intake criteria is logistically challenging and therefore only a few studies are currently available. Secondly, heterogeneity was very high and this could potentially have been reduced through a greater pool of studies. Thirdly, this heterogeneity remains unexplained as we were unable to perform meta-regressions because there were not enough studies.

5. Conclusion

The 2-year meta-analytical prognostic accuracy of the CAARMS in predicting psychosis is only acceptable. A refined prediction of psychosis risk is necessary to advance clinical research in this area.

Disclosure of interest

The authors declare that they have no competing interest.

Acknowledgements

DO is supported by the Medical Research Council.

Appendix A. Supplementary data

Supplementary data associated with this article can be found, in the online version, at https://doi.org/10.1016/j.eurpsy.2017.10.001.

References

Insel, TRRethinking schizophrenia. Nature 2010;468(7321):187–93.CrossRef Google Scholar PubMed

Fusar-Poli, PMcGorry, PKane, JImproving outcomes of first episode psychosis. World Psychiatry 2017;16(3):251–65.CrossRef Google Scholar PubMed

Millan, MJAndrieux, ABartzokis, GCadenhead, KDazzan, PFusar-Poli, P et al. Altering the course of schizophrenia: progress and perspectives. Nat Rev Drug Discov 2016;15(7):485–15.CrossRef Google Scholar

McGorry, PDEarly clinical phenotypes, clinical staging, and strategic biomarker research: building blocks for personalized psychiatry. Biol Psychiatry 2013;74(6):394–5.CrossRef Google Scholar PubMed

Fusar-Poli, PExtending the benefits of indicated prevention to improve outcomes of first episode psychosis. JAMA Psychiatry 2017;74(7):667–8.CrossRef Google Scholar PubMed

Fusar-Poli, PSchultze-Lutter, FPredicting the onset of psychosis in patients at clinical high risk: practical guide to probabilistic prognostic reasoning. Evid Based Ment Health 2016;19(1):10–5.CrossRef Google Scholar PubMed

Arbyn, MVerdoodt, FSnijders, PJFVerhoef, VMJSuonio, EDillner, L et al. Accuracy of human papillomavirus testing on self-collected versus clinician-collected samples: a meta-analysis. Lancet Oncol 2014;15(2):172–83.CrossRef Google Scholar PubMed

Tabák, AGHerder, CRathmann, WBrunner, EJKivimäki, MPrediabetes: a high-risk state for diabetes development. Lancet 2012;379(9833):2279–90.CrossRef Google Scholar PubMed

Hippisley-Cox, JCoupland, CVinogradova, YRobson, JMay, MBrindle, PDerivation and validation of QRISK, a new cardiovascular disease risk score for the United Kingdom: prospective open cohort study. BMJ 2007;335(7611):136.CrossRef Google Scholar PubMed

Fusar-Poli, PMeyer-Lindenberg, AForty years of structural imaging in psychosis: promises and truth. Acta Psychiatr Scand 2016;134(3):207–24.CrossRef Google Scholar PubMed

Yung, ARYuen, HPMcGorry, PDPhillips, LJKelly, DDell’Olio, M et al. Mapping the onset of psychosis: the Comprehensive Assessment of At-Risk Mental States. Aust N Z J Psychiatry 39(11–12)2005; 964–71.CrossRef Google Scholar PubMed

NHS England Intervention in Psychosis Access and Waiting Time Standard [Internet] 2016, NHS England Publications UKAvailable from: https://www.england.nhs.uk/mentalhealth/wp-content/uploads/sites/29/2016/04/eip-guidance.pdf.Google Scholar

Yung, ARStanford, CCosgrave, EKillackey, EPhillips, LNelson, B et al. Testing the ultra high risk (prodromal) criteria for the prediction of psychosis in a clinical sample of young people. Schizophr Res 2006;84(1):57–66.CrossRef Google Scholar

Fusar-Poli, PCappucciati, MRutigliano, GSchultze-Lutter, FBonoldi, IBorgwardt, S et al. At risk or not at risk? A meta-analysis of the prognostic accuracy of psychometric interviews for psychosis prediction. World Psychiatry 2015;14(3):322–32.CrossRef Google Scholar PubMed

Miller, TJMcGlashan, THRosen, JLCadenhead, KCannon, TVentura, J et al. Prodromal assessment with the structured interview for prodromal syndromes and the scale of prodromal symptoms: predictive validity, interrater reliability, and training to reliability. Schizophr Bull 2003;29(4):703–15.CrossRef Google Scholar PubMed

Schultze-Lutter, FRuhrmann, SPicker, HKlosterkötter, JWC1C development and evaluation of the Schizophrenia Proneness Instrument, Adult version (SPI-A). Schizophr Res 2006; 86:S4–5.CrossRef Google Scholar

Fusar-Poli, PCappucciati, MRutigliano, GLee, TYBeverly, QBonoldi, I et al. Towards a standard psychometric diagnostic interview for subjects at ultra high risk of psychosis: CAARMS versus SIPS. Psychiatry J 2016; 7146–341.Google Scholar PubMed

Fusar-Poli, PRocchetti, MSardella, AAvila, ABrandizzi, MCaverzasi, E et al. Disorder, not just state of risk: meta-analysis of functioning and quality of life in people at high risk of psychosis. Br J Psychiatry 2015;207(3):198–206.CrossRef Google Scholar

Kempton, MJBonoldi, IValmaggia, LMcGuire, PFusar-Poli, PSpeed of psychosis progression in people at ultra-high clinical risk. JAMA Psychiatry 2015;72(6):622.CrossRef Google Scholar PubMed

Stroup, DFBerlin, JAMorton, SCOlkin, IWilliamson, GDRennie, D et al. Meta-analysis of observational studies in epidemiology: a proposal for reporting. Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group. JAMA 2000;283(15):2008–12.CrossRef Google Scholar PubMed

Moher, DLiberati, ATetzlaff, JAltman, DGPRISMA Group Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med 2009;6(7):e1000097.CrossRef Google Scholar PubMed

Whiting, PFRutjes, AWSWestwood, MEMallett, SDeeks, JJReitsma, JB et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 2011;155(8):529–36.CrossRef Google Scholar PubMed

Macaskill, PGatsonis, CDeeks, JJHarbord, RMTakwoingi, YAnalysing and presenting resultsDeeks, JJBossuyt, PMGatsonis, CHandbook for systematic reviews of diagnostic test accuracy 2010, The Cochrane Colloration.Google Scholar

Smetana, GWUmscheid, CAChang, SMatchar, DBMethods guide for authors of systematic reviews of medical tests: a collaboration between the Agency for Healthcare Research and Quality (AHRQ) and the Journal of General Internal Medicine. J Gen Intern Med 27(Suppl. 1)2012; S1–3.CrossRef Google Scholar PubMed

Dwamena, BAMIDAS: computational and graphical routines for meta-analytical integration of diagnostic accuracy studies in Stata 2007, Division of Nuclear Medicine, Department of Radiology, University of Michigan Medical School Ann Arbor.Google Scholar

StataCorp., STATA. College Station 2015, StataCorp LP TX.Google Scholar

Harbord, RWhiting, PMetandi: meta-analysis of diagnostic accuracy using hierarchical logistic regression. Stata Journal [Internet] 2009;9(2):211–29Available from: http://www.stata-journal.com/article.html?article=st0163 [19].CrossRef Google Scholar

Janda, SShahidi, NGin, KSwiston, JDiagnostic accuracy of echocardiography for pulmonary hypertension: a systematic review and meta-analysis. Heart 2011;97(8):612–22.CrossRef Google Scholar PubMed

Hosmer, DWLemeshow, SMay, SApplied survival analysis: regression modeling of time-to-event data. 2nd ed 2008, Wiley-Interscience Hoboken, N.J.CrossRef Google Scholar

Lipsey, MWWilson, DBPractical meta-analysis 2000, Sage Publications Thousand Oaks, Calif.Google Scholar

Bekkar, MDjemaa, HAlitouche, TEvaluation measures for modelsassessment over imbalanced data sets. J Inf Eng Appl 2013; 3:27–38.Google Scholar

Higgins, JPTThompson, SGDeeks, JJAltman, DGMeasuring inconsistency in meta-analyses. BMJ 2003;327(7414):557–60.CrossRef Google Scholar PubMed

Deeks, JJMacaskill, PIrwig, LThe performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. J Clin Epidemiol 2005;58(9):882–93.CrossRef Google Scholar PubMed

Wanders, LKEast, JEUitentuis, SELeeflang, MMGDekker, EDiagnostic performance of narrowed spectrum endoscopy, autofluorescence imaging, and confocal laser endomicroscopy for optical diagnosis of colonic polyps: a meta-analysis. Lancet Oncol 2013;14(13):1337–47.CrossRef Google Scholar PubMed

Li, JFine, JPSafdar, NPrevalence-dependent diagnostic accuracy measures. Stat Med 2007;26(17):3258–73.CrossRef Google Scholar PubMed

Fusar-Poli, PRutigliano, GStahl, DDavies, CDe Micheli, ARamella-Cravaro, V et al. Long-term validity of the At Risk Mental State (ARMS) for predicting psychotic and non-psychotic mental disorders. Eur Psychiatry 2017; 42:49–54.CrossRef Google Scholar PubMed

Francesconi, MMinichino, ACarrión, REDelle Chiaie, RBevilacqua, AParisi, M et al. Psychosis prediction in secondary mental health services. A broad, comprehensive approach to the “at risk mental state” syndrome. Eur Psychiatry 2017; 40:96–104.CrossRef Google Scholar PubMed

Spada, GMolteni, SPistone, CChiappedi, MMcGuire, PFusar-Poli, P et al. Identifying children and adolescents at ultra high risk of psychosis in Italian neuropsychiatry services: a feasibility study. Eur Child Adolesc Psychiatry 2016;25(1):91–106.CrossRef Google Scholar PubMed

Yung, ARNelson, BStanford, CSimmons, MBCosgrave, EMKillackey, E et al. Validation of “prodromal” criteria to detect individuals at ultra high risk of psychosis: 2 year follow-up. Schizophr Res 105(1–3)2008; 10–7.CrossRef Google Scholar PubMed

Kotlicka-Antczak, MPawełczyk, TRabe-Jabłońska, JPawełczyk, APORT (Programme of Recognition and Therapy): the first Polish recognition and treatment programme for patients with an at-risk mental state. Early Interv Psychiatry 2015;9(4):339–42.CrossRef Google Scholar PubMed

Lee, JRekhi, GMitter, NBong, YLKraus, MSLam, M et al. The Longitudinal Youth at Risk Study (LYRIKS) – an Asian UHR perspective. Schizophr Res 151(1–3)2013; 279–83.CrossRef Google Scholar PubMed

Woods, SWAddington, JCadenhead, KSCannon, TDCornblatt, BAHeinssen, R et al. Validity of the prodromal risk syndrome for first psychosis: findings from the North American Prodrome Longitudinal Study. Schizophr Bull 2009;35(5):894–908.CrossRef Google Scholar PubMed

Addington, JPiskulic, DPerkins, DWoods, SWLiu, LPenn, DLAffect recognition in people at clinical high risk of psychosis. Schizophr Res 140(1–3)2012; Error:FPage (87) is higher than LPage (92)!.CrossRef Google Scholar PubMed

Liu, C-CLai, M-CLiu, C-MChiu, Y-NHsieh, MHHwang, T-J et al. Follow-up of subjects with suspected pre-psychotic state in Taiwan. Schizophr Res 126(1–3)2011; 65–70.CrossRef Google Scholar

Schultze-Lutter, FKlosterkötter, JRuhrmann, SImproving the clinical prediction of psychosis by combining ultra-high risk criteria and cognitive basic symptoms. Schizophr Res 154(1–3)2014; 100–6.CrossRef Google Scholar PubMed

Simon, AEGrädel, MCattapan-Ludewig, KGruber, KBallinari, PRoth, B et al. Cognitive functioning in at-risk mental states for psychosis and 2-year clinical outcome. Schizophr Res 142(1–3)2012; 108–15.CrossRef Google Scholar PubMed

Thomas, CHyppönen, EPower, CType 2 diabetes mellitus in midlife estimated from the Cambridge Risk Score and body mass index. Arch Intern Med 2006;166(6):682–8.CrossRef Google Scholar PubMed

Fusar-Poli, PCappucciati, MBorgwardt, SWoods, SWAddington, JNelson, B et al. Heterogeneity of psychosis risk within individuals at clinical high risk: a meta-analytical stratification. JAMA Psychiatry 2016;73(2):113–20.CrossRef Google Scholar PubMed

Fusar-Poli, PRutigliano, GStahl, DSchmidt, ARamella-Cravaro, VHitesh, S et al. Deconstructing pretest risk enrichment to optimize prediction of psychosis in individuals at clinical high risk. JAMA Psychiatry 2016;73(12):1260–7.CrossRef Google Scholar PubMed

Fusar-Poli, PSchultze-Lutter, FCappucciati, MRutigliano, GBonoldi, IStahl, D et al. The dark side of the moon: meta-analytical impact of recruitment strategies on risk enrichment in the clinical high risk state for psychosis. Schizophr Bull 2016;42(3):732–43.CrossRef Google Scholar PubMed

Fusar-Poli, PPalombini, EDavies, COliver, DBonoldi, IRamella-Cravaro, V et al. Why transition risk to psychosis is not declining at the OASIS ultra high risk service: the hidden role of stable pretest risk enrichment. Schizophr Res S0920-9964(17)2017; 30353–5.Google Scholar

Fusar-Poli, PBonoldi, IYung, ARBorgwardt, SKempton, MJValmaggia, L et al. Predicting psychosis: meta-analysis of transition outcomes in individuals at high clinical risk. Arch Gen Psychiatry 2012;69(3):220–9.CrossRef Google Scholar PubMed

Fusar-Poli, PCappucciati, MDe Micheli, ARutigliano, GBonoldi, ITognin, S et al. Diagnostic and prognostic significance of brief limited intermittent psychotic symptoms (BLIPS) in individuals at ultra high risk. Schizophr Bull 2017;43(1):48–56.CrossRef Google Scholar PubMed

Ramella-Cravaro, VRadua, JIonnidis, JPAReichenberg, APhiphopthatsanee, NAmir, T et al. What causes psychosis? Umbrella review of risk factors. World Psychiatry 2017(In Press).Google Scholar

Fusar-Poli, PTantardini, MDe Simone, SRamella-Cravaro, VOliver, DKingdon, J et al. Deconstructing vulnerability for psychosis: meta-analysis of environmental risk factors for psychosis in subjects at ultra high-risk. Eur Psychiatry 2017; 40:65–75.CrossRef Google Scholar PubMed

Table 1 Independent studies included in the meta-analysis (studies n=6; 1876 subjects; CHR+: n=892; CHR–: n=984).

Fig. 1 PRISMA flow chart.

Fig. 2 Meta-analytical summary receiver operating characteristic (SROC) curve of CAARMS assessment. SENS – sensitivity, SPEC – specificity, AUC – area under the curve, 1 – Yung et al., 2008 [39], 2 – Lee et al., 2013 [12], 3 – Fusar-Poli et al., 2017 [2], 4 – Francesconi et al., 2017 [37], 5 – Kotlicka-Antczak et al., 2015 [40], 6 – Spada et al., 2016 [38].

Fig. 4 Meta-regression analyses comparing at meta-analytical level the 2-year sensitivity and specificity of the CAARMS vs. SIPS for the prediction of psychosis. SIPS data taken from [14].

Oliver et al. supplementary material

File 1.2 MB

Submit a response

Comments

No Comments have been published for this article.

Article contents

Meta-analytical prognostic accuracy of the Comprehensive Assessment of at Risk Mental States (CAARMS): The need for refined prediction

Abstract

Keywords

1. Introduction

2. Methods

2.1. Search strategy

2.2. Selection criteria

2.3. Recorded variables

2.4. Statistical analysis

3. Results

3.1. Database

3.2. Prognostic accuracy of the CAARMS at 2-years

3.3. Clinical utility of the CAARMS at 2 years

3.4. Meta-regressions and sensitivity analyses

4. Discussion

Table 1 Independent studies included in the meta-analysis (studies n=6; 1876 subjects; CHR+: n=892; CHR–: n=984).

4.1. Limitations

5. Conclusion

Disclosure of interest

Acknowledgements

Appendix A. Supplementary data

References

Oliver et al. supplementary material

Comments

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests