The internet is a major source of information for the public about health problems, including mental health problems. Reference Morahan-Martin1 However, formal evaluations of information on websites have frequently noted that the quality is low or variable. Reference Ipser, Dewing and Stein2–Reference Kisely, Ong and Takyar7 Standards have been proposed for website quality, 8 but these have not affected the widespread availability of low-quality information. Here we report what we believe to be the first randomised controlled trial of an intervention to improve the quality of websites, using suicide prevention websites as the target. Little information is available on use of the internet to assist someone who is suicidal. However, many members of the Australian public believe that website information is potentially helpful to a suicidal person, Reference Leach, Christensen, Griffiths, Jorm and Mackinnon9 and in the UK the Samaritans report receiving many requests for help through email. Reference Dobson10
Method
Our sample comprised the top 52 English-language websites on suicide prevention identified from a Google Australia search using the terms ‘help suicide’. The search was not meant to be exhaustive, but rather to locate sites that might be found by a member of the public looking for assistance. Website content was captured on 6 November 2008, with the use of TeleportPro software version 1.60 (www.tenmax.com). Websites were included if they were sponsored by any organisation (to exclude those maintained by an individual, such as a blog) and listed the contact details of a website administrator.
Websites were scored for quality of information by two of the authors (J-A.F. and E.O.) using a 26-item checklist, giving one point per suicide prevention action recommended, based on the document Suicidal Thoughts and Behaviours: First Aid Guidelines, which was developed through a Delphi expert consensus study. 11,Reference Kelly, Jorm, Kitchener and Langlands12 Administrators of the websites allocated to receive the intervention were emailed a letter from A.F.J. giving detailed scoring on the checklist and the mean score of all 52 websites for comparison. The letter was headed ‘University of Melbourne Project to Evaluate and Improve Suicide Prevention Websites’ and had a large university logo at the top to add authority. The administrators were also sent a copy of the guidelines in the form of an attractively designed portable file that could be easily posted, an article on how the guidelines were produced, Reference Kelly, Jorm, Kitchener and Langlands12 and an information sheet about the study. This sheet explained how the websites were selected, the nature of the intervention, the random assignment to intervention or control groups, and when the post-test assessment would occur. The control websites were not communicated with at all and received no feedback. Websites were scored at baseline (in months 1 and 2 of the study) and 6 months later (in months 6 and 7). Intervention letters were sent out 2 months after baseline scoring. This allowed a period of 4 months for website administrators to make any changes in response to the feedback.
The outcome was the quality of information about how to assist someone who is suicidal. Quality was measured against a 26-item checklist. Each item was scored as ‘yes’ or ‘no’, depending on whether or not a particular action was recommended on the website. Two of the authors (J-A.F. and E.O.) independently selected content from each website and came to a consensus about which text was relevant to score. They also scored the material independently and came to a consensus about the quality score at both assessment points.
There were 52 websites. This sample size gives 80% power to detect a large effect size (d = 0.8) at α = 0.05. Reference Cohen13 To put this effect size in concrete terms, it would have been achieved if 7 out of the 26 intervention group websites had simply posted the guidelines on their website and hence received the maximum score of 26.
Websites were given an identification number (1–52) and an independent researcher used the Random.org sequence generator (www.random.org) to randomly order these. The first 26 numbers were allocated to the feedback group and the second 26 to the control group. These random allocations were provided to A.F.J. who linked them to a list of websites, their web addresses and quality scores provided by J-A.F. Feedback letters containing the quality scores and guidelines for improvement were emailed to website administrators by A.F.J. Neither of the researchers involved in scoring the websites (J-A.F. and E.O.) knew the condition to which the websites were randomised. The trial was registered with the Australia New Zealand Clinical Trials Registry (ACTRN12609000449235). Ethical approval was provided by the University of Melbourne ethics committee.
Interrater reliability and stability of quality ratings were assessed with Pearson correlations. The effect of the intervention on quality scores was assessed using analysis of variance, where the factors were intervention (feedback or control) and time (pre-test or post-test). The P<0.05 significance level was used.
Results
Interrater reliability was high, with correlations of 0.90 at pre-test and 0.92 at post-test. Quality was also stable over time, with a correlation between the pre-test and post-test consensus ratings of 0.79. The quality of the information provided was highly variable, with pre-test scores ranging from 1 to 19 with a mean of 9.79 (s.d. = 4.18). The two most highly rated websites both before and after the intervention were Suicide.org (http://www.suicide.org) and Suicideline (http://suicideline.org.au).
Online Fig. DS1 shows the CONSORT flow diagram for the trial. There was no significant difference in mean quality scores at pre-test (t = 0.96, d.f. = 50, P = 0.34). The mean scores of the intervention websites changed from 9.23 (s.d. = 4.85, 95% CI 7.58–10.88) at pre-test to 11.62 (s.d. = 4.81, 95% CI 9.84–13.39) at post-test. For the control websites the respective means were 10.35 (s.d. = 3.38, 95% CI 8.70–11.99) and 11.54 (s.d. = 4.16, 95% CI 9.77–13.31). Analysis of variance showed a significant effect of time (F = 21.31, d.f. = 1,50, P<0.001), but no difference between intervention and control websites (F = 0.21, d.f. = 1,50, P = 0.65) or intervention time interaction (F = 2.37, d.f. = 1,50, P = 0.13). A post hoc examination of whether a website improved by 5 points or more (corresponding to roughly a 25% improvement or more than 1 standard deviation unit) showed that five intervention websites improved to this degree compared with one control website, but this difference was not significant (two-tailed P = 0.19, Fisher's exact test). Although no reply was requested, administrators of nine websites wrote personal replies acknowledging the feedback or commenting on it. Post hoc analysis showed that the intervention had no greater impact on these websites than on the other intervention or control websites (P = 0.30 for interaction effect).
Discussion
Our findings are consistent with most previous evaluations of website information in finding many sites of low quality. However, we found no evidence that feedback to website administrators had any effect. One possible reason is that the feedback did not reach an appropriate person: emails to websites might not be read at all or read by an information technology employee who takes no interest in the content. Although a post hoc analysis of the data did not support this interpretation, the number of websites from which we received correspondence was small. This trial can be seen as an effectiveness rather than an efficacy trial, because it evaluated feedback under realistic conditions, but it does not tell us whether or not feedback would work if it was actually received by the relevant person. Another possibility is lack of statistical power: we set the sample size to detect a large effect size on the basis that the intervention would lead to a large improvement in quality score. Furthermore, a feedback intervention resulting in small effects is of little practical relevance. However, a post hoc analysis of power, using the observed pre-test post-test correlation in analysis of covariance, showed that the actual power was greater than planned, with 81% power to detect a medium effect size (α = 0.05, two-tailed). Power to detect a small effect size was low (21%).
Although the intervention had no effect, there was a significant overall improvement from pre-test to post-test. Real improvement might have occurred independently of the intervention, but we believe that the change was mainly due to greater experience of the raters, both of whom felt that they had become more skilled at finding relevant information on the websites, because they had already scored and discussed them. However, given that the raters were masked to allocation, this improvement did not affect the validity of the comparison of intervention and control websites. It may have meant that the pre-test score used for feedback was an underestimate of a website's quality, but even after the intervention all websites had room for improvement, most substantially so, which was the main message the feedback conveyed.
Given the lack of success of this simple intervention, what else could be done to improve website quality? One possibility is to engage in a dialogue with the organisations behind the websites, including exploration of barriers to making changes. This might include personal contact by telephone. There may be useful guidance on how to produce behaviour change from the general literature on computerised health interventions. Reference Lustria, Cortese, Noar and Glueckauf14 Some of the website administrators requested further information, such as examples of better-quality websites; however, the protocol set up and registered for this trial was to investigate the impact of simple feedback alone and we did not deviate from this. We hope that this first attempt at a randomised trial in the area of website quality may influence other researchers to look at more sophisticated interventions.
Acknowledgements
Thanks to Amy Morgan for assistance. Support was provided by the National Health and Medical Research Council and the Colonial Foundation.
eLetters
No eLetters have been published for this article.