I nutrition is important and the World Health Organization( 1 ) recommends monitoring of all populations. Urinary iodine concentration (UIC) is a recommended method to assess I status in a population but it has limitations. First, it portrays the I intake over the preceding hours only( Reference Keating and Albert 2 – Reference Pearce and Caldwell 5 ). Second, it holds uncertainties due to a marked variation in urinary I excretion( Reference Pearce and Caldwell 5 , Reference Andersen, Pedersen and Pedersen 6 ). Hence, a considerable number of individuals or samples are required for a reliable estimate of I nutrition level( Reference Andersen, Karmisholt and Pedersen 7 , Reference Karmisholt, Laurberg and Andersen 8 ). Alternative approaches to assess I nutrition include measurement of thyroglobulin in serum (s-TG)( Reference Pearce and Caldwell 5 , Reference Knudsen, Bülow and Jørgensen 9 – Reference Zimmermann, Aeberli and Andersson 11 ) and thyroid volume by ultrasound( Reference Hegedus, Perrild and Poulsen 12 , Reference Knudsen, Bols and Bulow 13 ). They differ from UIC in that thyroid volume reflects the I status over the preceding years and it correlates positively to s-TG( Reference Rasmussen, Ovesen and Bülow 14 ).
TG is a protein produced exclusively by the thyroid gland. It plays an important role in the synthesis of thyroid hormones and an increased amount of TG is released into the blood in I deficiency( Reference Pearce and Caldwell 5 , Reference Knudsen, Bülow and Jørgensen 9 – Reference Zimmermann, Aeberli and Andersson 11 ). Thus, s-TG is a sensitive marker of I deficiency in a population.
S-TG is used to monitor patients treated for differentiated thyroid cancer and the effectiveness is well established in this group of patients( Reference Schlumberger, Fragu and Gardet 15 , Reference Haugen, Alexander and Bible 16 ). However, even though it is well recognised that s-TG is elevated with I deficiency, data are lacking on the number of samples needed and on the reliability of s-TG as a measure of I deficiency. These measures are well established for urinary I excretion( Reference Andersen, Karmisholt and Pedersen 7 , Reference Karmisholt, Laurberg and Andersen 8 , Reference König, Andersson and Hotz 17 ) and similar estimates are relevant for s-TG.
This led us to conduct a survey with repeated collection of serum and urine for measurement of s-TG and I in urine. Data were used to establish (a) the number of participants needed in surveys of I nutrition and (b) to calculate the reliability of studies of I nutrition by both s-TG and UIC with a given number of participants. Finally, we assessed the potential benefit of the combined use of both methods.
Methods
Participants
We enrolled ninety-seven healthy subjects living in Ilulissat or Saqqaq in the Disco Bay area in North Greenland. A random sample of individuals was drawn from the National Civil Registration System in which every person living in Denmark, the Faeroe Islands and Greenland is registered. Subjects invited to participate showed an interest in supporting the points raised and agreed to contribute. In all, eighty-five out of ninety-seven (88 %) participated in three or four of the data collections. Participants in Ilulissat were stratified by age, sex and place of birth to approximate equal participation in the age groups 30–39 and 40–49 years, men and women, and the three groups consisting of (a) subjects not born in Greenland, (b) subjects born in Greenland living in town and (c) in a settlement. Subjects with both parents born in Greenland are hereafter named Inuit, and subjects whose parents are both born outside Greenland are named Caucasians.
Procedures
Data were collected four times during a full year to include also seasonal differences in the estimate of individual variance (30 March through 10 April, 25 June through 5 July, 25 September through 5 October and 7 January through 19 January).
Participants were invited by letter to participate in the investigation. The investigation took place at the local hospital or nursing station or, at request, as home visits.
A physical examination was performed by one of the doctors collecting data (S. A., P. L.). We measured height and weight in indoor clothing, calculated BMI(weight in kilograms divided by height in metres squared) and recorded if any disability was present. Information on smoking habits (present/past/never), alcohol intake (units/week), use of I containing supplements and medication was obtained by a questionnaire. None took medication containing I or known to influence the thyroid. Information regarding sex and age was obtained from the National Civil Registration System.
Information on dietary habits was obtained by a FFQ as described previously( Reference Andersen, Hvingel and Kleinschmidt 18 ). In brief, the frequency of intake of seven traditional Inuit food items and seven imported food items were given in six categories ranging from never to daily. Inuit food items scored positively and imported food items scored negatively. The sum of food frequency score for all food items consumed by each participant was calculated based on this recording and participants were categorised into groups of intake of <40, 40–60 and >60 % traditional Inuit food items scores on a scale where 100 % was purely Inuit foods and 0 % was purely imported food. Moreover, participants were asked how many days of the week the main meal was of Greenlandic food items and the number of days it was imported foods for cross-validation.
A non-fasting blood sample was drawn from the antecubital vein using minimal tourniquet. Blood was allowed to clot and serum was separated and kept at −20°C until analysis. A spot urine sample was collected in I-free polyethylene containers and stored at −20°C until analysis.
Ethical approval was obtained from the Commission for Scientific Research in Greenland before the commencement of this study (505-63), and all subjects gave informed written consent in Danish or Greenlandic by participant choice.
Assays
TG was measured in serum by the LUMItest (BRAHMS) that had a working range of 1–500 μg/l. All samples from an individual were included in the same assay run. Median values about 9, 10 and 15 μg/l are seen in I replete, mildly deficient and moderately deficient Caucasians, respectively( Reference Knudsen, Bülow and Jørgensen 9 , Reference Vejbjerg, Knudsen and Perrild 10 ). Thyroglobulin antibodies (TGAb) were measured using Dynotest RIA (BRAHMS Diagnostica) with a functional sensitivity of 20 kU/l. TGAb in serum did not influence measurement of s-TG when <100 U/ml and all individuals with TGAb <100 U/ml were included in the analysis. Thus, eighty-one participants were included in the analysis including s-TG.
Urine samples were analysed for I by using the Sandell–Kolthoff reaction modified after Wilson & van Zyl( Reference Wilson and van Zyl 19 ) as described in detail previously( Reference Andersen, Hvingel and Kleinschmidt 18 , Reference Laurberg 20 ).
Statistics and calculations
Population characteristics were compared using Mann–Whitney U test for comparison of two groups. Kendall’s τ was used to describe associations between groups. Frequencies were compared using Fisher’s exact test or χ 2 test with groups of less than five participants included in the adjacent group when appropriate.
The number of samples needed to assess the I status of a population or an individual was calculated from the equation developed to estimate the precision of a set-point, D, in biochemical variables( Reference Fraser and Harris 21 ): n=(Z×CV%/D)2. Similar calculations are recommended for use when estimating the number of specimens required in biochemical measures. This provides an estimate of the certainty or reliability of the results of sample collections. It is described in detail by Fraser & Harris( Reference Fraser and Harris 21 ) and we followed these recommendations in keeping with our previous reports on urinary I in different populations( Reference Andersen, Karmisholt and Pedersen 7 , Reference Karmisholt, Laurberg and Andersen 8 ). The percentiles of standard normal distribution (Z) used were 2·58 for 99 %, 2·33 for 98 %, 1·96 for 95 %, 1·64 for 90 %, 1·28 for 80 %, 1·04 for 70 %, 0·84 for 60 %, 0·67 for 50 %. The precision range (D), a measure of reliability of the assessment of I nutrition level, used in the calculations was set to vary from ±50 to ±1 %. Using the z-statistics may underestimate the sample size for small n by up to 30 % compared with using t-statistics but this was chosen in order to comply with the recommendations as noted above( Reference Andersen, Karmisholt and Pedersen 7 ). Mean within-individual variances were similar whether assessed as the mean variance among individual or using ANOVA techniques. Mean, highest and lowest within-individual CV was used for calculation of number of samples needed to assess the I excretion in an individual. The CV% was the percentage of variance square root divided by the mean. Variances were compared by Levene’s test for homogeneity of variances.
The statistical program for the social sciences version 13.0, Corel Quattro Pro X3 and a Texas Instruments TI-30X IIS calculator were used to process data and perform the calculations.
Results
Totally, ninety-seven participants were enrolled with thirty-three Caucasians, thirty-nine Inuit in the town Ilulissat and twenty-five Inuit in the settlement Saqqaq (Table 1). None of the participants had disease affecting the thyroid, were pregnant or took I-containing supplements. The number of subjects participating in four, three, two and one data collection was seventy-three, twelve, nine and three. Participants who attended less than three times (n 12) and participants who had TGAb that could affect the measurement of s-TG (n 5) were excluded from the calculations. The calculations thus included eighty-one subjects.
TGAb, thyroglobulin antibodies.
* P value for ethnic differences: χ 2 test, except alcohol that was tested using Fisher’s exact test and urinary I using Mann–Whitney test; NS designates P>0·1. Groups with less than five participants were merged with adjacent groups (BMI <25 kg/m2; Inuit diet scores <40 %; alcohol <7 units/week; less than four data collections).
† Of the 97 subjects enrolled, eighty-one were included in the calculations as they participated in three or four data collections and did not have antibodies against thyroglobulin that could influence the measurement of thyroglobulin.
‡ Calculated from a FFQ on the intake of seven Greenlandic and seven imported food items.
§ 2 missing.
|| 4 missing.
¶ One unit equals 2 g of alcohol.
** Number of participations in data collection with questionnaire and collection of specimens.
†† Level for positive thyroglobulin antibody as given by the manufacturer.
‡‡ Serum TG was influenced only when TGAb was above 100.
Caucasians were bigger than Inuit (men/women, height P<0·001/P<0·001; weight P=0·01/P=0·026). Still, BMI was similar (Table 1). Height, weight and BMI did not differ with residence or age (residence, all, P>0·1; age in Caucasians, all, P>0·1; age in Inuit, women’s height, P=0·054, all other P>0·1). Dietary habits differed with ethnicity, and more Inuit than Caucasians were smokers and alcohol abstainers. Inuit had slightly higher urinary I excretion (Table 1).
The frequency of TGAb positive individuals did not differ with ethnicity (Table 1). Median s-TG did not differ with ethnicity (P=0·70). S-TG was higher when UIC was <50 μg/l, and the frequency of s-TG >20 μg/l decreased with rising UIC (Kendall’s τ −0·29, P=0·005) (Fig. 1).
Variation
Table 2 shows mean, variance and CV% for s-TG and UIC among Inuit and Caucasians. Overall, between individual, mean individual and median individual variation was higher in Inuit compared with Caucasians. Overall variation was higher in UIC compared with s-TG (P<0·01) and the two measures of individual variation were markedly higher for UIC compared with s-TG (Table 2).
* Based on three or four samples in each of eighty-one participants.
† Calculated as $$\left( {\Sigma {\rm cv}\,\%\,^{{\rm 2}} _{{\ {\rm 1}{\minus}n}} } \right)^{{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 2}}\right.\kern-\nulldelimiterspace}\!\lower0.7ex\hbox{$2$}}}} $$ . Calculations using ANOVA techniques gave similar results.
‡ Square root of sum of squared CV for each data collection: $$\left( {\Sigma {\rm cv}\,\%\,^{{\rm 2}} _{{{\rm \ 1}{\minus}n}} } \right)^{{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 2}}\right.\kern-\nulldelimiterspace}\!\lower0.7ex\hbox{$2$}}}} $$ .
Number of samples needed
Table 3 lists the number of samples need for a chosen precision range of 1–50 % with a confidence of 95 %. This is shown both for an individual (right columns) and for populations (left column). It can be seen that 982 subjects need to donate one sample each to be 95 % certain of having a UIC within 5 % of the true mean of that population. For s-TG the same precision requires 823 subjects. As can be read from Table 3, 20 % more participants are needed to portray I deficiency in a population using UIC compared with s-TG.
* Calculated from n=(Z×CV/D)2, where Z=1·96 for 95 % CI; D=precision range.
† Number of individuals needed was calculated based on the average CV in the population.
‡ Variation differs between individuals. Number of samples needed to sample in an individual are given for individuals with median, lowest and highest variation.
§ Calculated with a 95 % CI (Z=1·96).
For an individual, the results of two samples gives a precision range of 20 % when using s-TG, whereas the same precision range for UIC requires twenty-one urine samples from that individual. In addition, a single blood sample for s-TG measurement in an individual gives a precision of 30 % CI, whereas nine urine samples are need from that individual to match this precision with 95 % CI. The difference between individuals in variation for s-TG is lower than that found for urinary I.
Table 4 lists the number of samples needed in an individual (right column) and the number of participants needed in population surveys (left column) to detect I deficiency when the CI is chosen to vary in parallel with the precision range. In a population, a 10 % precision with 90 % CI requires 144 participants for s-TG and 172 participants for urinary I excretion. A precision range of 2 % with 98 % CI requires 7268 participants for s-TG and 8669 participants for urinary I analysis. Thus, 20 % more participants are needed to settle I deficiency when using UIC compared with s-TG.
* Calculated from n=(Z×CV/D)2, where Z=CI (Z=2·58 for 99 %, 2·33 for 98 %, 1·96 for 95 %, 1·64 for 90 %, 1·28 for 80 %, 1·04 for 70 %, 0·84 for 60 %, 0·67 for 50 %); D=precision range.
† Number of individuals needed was calculated based on the average CV in the population.
‡ Variation differs between individuals. Number of samples needed to sample in an individual are given for individuals with median, lowest and highest variation.
§ CI set to vary in parallel with the precision range.
For an individual, a precision range of 20 % with 80 % CI requires a single sample for s-TG and nine samples for UIC. A precision range of 10 % with 90 % CI requires six blood samples for s-TG and fifty-eight urine samples for UIC. Thus, just under ten times more urine samples are needed in an individual to depict I deficiency when using urinary I excretion compared with measurement of s-TG.
Reliability of surveys
The reliability of studies of I nutrition can be read from the x-axis in Fig. 2 matching a specified number of participants on the y-axis. The upper panel illustrates that about ten times more urine samples are needed compared with serum to obtain a similar reliability in an individual. The lower panel shows that 20 % more urine samples are needed in a population. The logarithmic scale on the y-axis illustrates the steep rise in the number of participants required when the reliability of I nutrition studies is increased.
Discussion
I deficiency may cause a spectrum of disorders that can be prevented by cheap and simple I supplementation( Reference Laurberg, Bülow Pedersen and Knudsen 22 ). Identification of I-deficient populations and monitoring of I fortification programmes are done by population surveys( Reference Andersen, Hvingel and Kleinschmidt 18 , Reference Teng, Shan and Teng 23 – Reference Laurberg, Jørgensen and Perrild 25 ), and collection of urine samples for measuring I excretion is the standard method. However, this method has two major limitations. First, I is excreted within hours after ingestion and urinary I thus represents I intake only over the preceding hours( Reference Keating and Albert 2 – Reference Pearce and Caldwell 5 ) and spot urine samples may underestimate the true I excretion in populations with I-rich main meals( Reference Andersen, Waagepetersen and Laurberg 26 ). The thyroid has the capacity to store large amounts of I and the urinary I excretion may thus not represent the true I intake level in I-deficient populations( Reference Andersen, Pedersen and Pedersen 6 , Reference Andersen, Karmisholt and Pedersen 7 ). Second, I excretion varies considerably due to both differences in diet and due to dilution depending on the fluid intake, perspiration, ambient temperature and other factors( Reference Pearce and Caldwell 5 , Reference Andersen, Pedersen and Pedersen 6 ). This marked variation causes a considerable number of samples to be required for a reliable estimate of the true I nutrition level of that population or individual( Reference Andersen, Karmisholt and Pedersen 7 , Reference Karmisholt, Laurberg and Andersen 8 , Reference König, Andersson and Hotz 17 ). Both of these issues are addressed by the use of TG in serum.
S-TG is used to monitor patients treated for differentiated thyroid cancer by detection of low levels of s-TG. Elevated s-TG is seen with sustained I deficiency and it has been suggested and used in surveys of I nutrition( Reference Knudsen, Bülow and Jørgensen 9 – Reference Zimmermann, Aeberli and Andersson 11 , Reference Krejbjerg, Bjergved and Bulow Pedersen 27 ). The present study provides the first data to describe the reliability of surveys using s-TG to assess and monitor I nutrition. In addition, we calculated the number of samples needed for a certain level of confidence in the I nutrition level estimated. Similar data have been published for urinary I excretion( Reference Andersen, Karmisholt and Pedersen 7 , Reference Karmisholt, Laurberg and Andersen 8 , Reference König, Andersson and Hotz 17 ) and we included data on UIC in the present study to allow for direct comparisons.
Reliability of iodine nutrition studies
We found a slightly lower between-individual variation for s-TG compared with UIC providing an advantage to s-TG over UIC in population surveys. Thus, 20 % more samples are needed for the same precision when using UIC as compared with s-TG. For example, if a precision of 5 % is aimed for, a survey requires one sample from each of 823 individuals when measuring s-TG and one sample from each of 982 individuals when measuring UIC. Conversely, if 1000 subjects are surveyed then the result is within ±4·5 % of the true value for s-TG and ±5·0 % for UIC. If 100 participants are included then this precision range is 14 % for s-TG and 16 % for UIC with 95 % CI. In other words, a mean value of 100 suggests that the true population mean is between 86 and 114 for s-TG and between 84 and 116 for UIC in the example.
Within-individual variation is much lower for s-TG than for UIC. Consequently, the number of samples needed in an individual is much lower and about ten times more samples are needed for the same precision when using UIC as compared with s-TG. It takes twenty-one urine samples to obtain a precision of ±20 %, whereas this is seen with just two measurements of s-TG. Thus, ten times more samples are needed in an individual for the same reliability of the results when comparing UIC with s-TG.
The risk of error is lower for s-TG compared with UIC. The importance of number of samples for the risk of error can be estimated by comparing Tables 3 and 4. A 90 % confidence of being within 10 % from the true I nutrition level requires 6 (33) less samples for s-TG (UIC) than a 95 % CI of being within 10 % from the actual level.
The variance found for urinary I was high, similar to previous studies of variation in urinary I excretion( Reference Andersen, Karmisholt and Pedersen 7 , Reference Karmisholt, Laurberg and Andersen 8 , Reference König, Andersson and Hotz 17 ). This is comparable across populations( Reference Andersen, Karmisholt and Pedersen 7 , Reference König, Andersson and Hotz 17 ), I excretion levels( Reference Andersen, Karmisholt and Pedersen 7 , Reference Karmisholt, Laurberg and Andersen 8 ) and after initiation of an I fortification programme( Reference Karmisholt, Laurberg and Andersen 8 ). This reinforces the reliability of the results of these studies of variation and reliability that support design of future I nutrition studies.
Design of iodine nutrition studies
Urinary I portrays the I intake during the hours before sampling while s-TG is rather a measure of long-term I nutrition( Reference Vejbjerg, Knudsen and Perrild 10 , Reference Andersen, Petersen and Laurberg 28 ). The two measures of I nutrition thus provide information on different aspects of the I nutrition level of the population surveyed. The two measures combined could provide a more detailed description of the true I nutrition level of a population. Moreover, the use of both measures could be speculated to be a more accurate predictor of I deficiency disorders than the single measure of UIC. Thus, the combined UIC and s-TG could provide a two-dimensional insight into I nutrition level of a population. A description of the importance of variation of both of these measures is available from this study and may guide an intelligent study design.
The intelligent study design would seek to benefit from the advantages of each of the two measures. We suggest first to assess the nutrition level from UIC and subsequently to add details and subgroup analysis within that population by the use of s-TG.
For example, we aimed for a precision of 5 % in an I nutrition survey. First, we need to survey 982 participants to be 95 % sure of this precision for UIC. The confirmation and subgroup analysis by the use of s-TG requires 823, 20 % less participants, and the precision would still be 5 %. Thus, the combined use of UIC and s-TG allows for further analysis with unaltered validity of the estimates.
Populations may be heterogeneous. They may cover different geographical areas with different subsurface geology( Reference Andersen, Petersen and Laurberg 28 , Reference Andersen, Guan and Teng 29 ), differences in water supply, differences in dietary habits( Reference Andersen, Hvingel and Kleinschmidt 18 , Reference Rasmussen, Ovesen and Bülow 30 ) or different ethnic groups that call for subgroup analysis. The I nutrition level should be assessed for each of such groups and our data guide the analysis if s-TG is used.
Level of serum thyroglobulin
A cut-off level for s-TG has to be decided upon. S-TG concentrations were between 5 and 14 μg/l in healthy adults( Reference Knudsen, Bülow and Jørgensen 9 , Reference Vejbjerg, Knudsen and Perrild 10 , Reference Rasmussen, Ovesen and Bülow 14 , Reference Nakamura, Sakata and Minamori 31 , Reference Pacini, Pinchera and Giani 32 ) and between 94 and 208 μg/l in adults in an area with endemic goitre( Reference van Herle, Hershman and Homabrook 33 , Reference Bayram, Beyazyildiz and Gökce 34 ). A s-TG cut-off of 13 μg/l was suggested for children but it should not be concluded that this is a suitable level for I status in adults( Reference Ma and Skeaff 35 ).
The occurrence of s-TG above a cut-off level of 40 μg/l was used to delineate I deficiency in children( Reference Zimmermann, Aeberli and Andersson 11 ) and adults( Reference Vejbjerg, Knudsen and Perrild 10 ). This contrasted the cut-off level of 13 μg/l reported to be the median value that delineated I-deficient adults( Reference Ma and Skeaff 35 ). The authors of the latter review emphasised the need for further investigation to settle a cut-off level( Reference Ma and Skeaff 35 ). A recent randomised trial reported a median s-TG of 16·6 μg/l in adults with mild I deficiency and a decrease in s-TG with I supplementation( Reference Ma, Venn and Manning 36 ). In our population the overall mean level was 13·5 μg/l and the individual median was 11·5 μg/l. The cut-off for I deficiency may be set to detect those with raised values rather than just above the median value. Thus, the 75th-percentile in our population of 19·1 μg/l guided the level of 20 μg/l set to delineate individuals with I deficiency in our investigation. In addition, this was in between previous suggestions( Reference Knudsen, Bülow and Jørgensen 9 , Reference Vejbjerg, Knudsen and Perrild 10 , Reference Rasmussen, Ovesen and Bülow 14 , Reference Andersen, Guan and Teng 29 , Reference Nakamura, Sakata and Minamori 31 , Reference Ma and Skeaff 35 ) and provided a distinct separation of I deficiency groups in our data. It may thus be suggested to use this level to delineate I deficiency in an adult population.
Limitations
Serum concentration of TG has limitations. It is not specific for I deficiency but increases also with excess I intake, increasing thyroid mass, inflammation of the thyroid, cold, if the TSH receptor is stimulated and in pregnancy( Reference Teng, Shan and Teng 23 , Reference Feldt-Rasmussen, Bech and Date 37 – Reference Laurberg, Andersen and Bjarnadottir 41 ). These groups should be identified and excluded or taken into consideration and corrected for in surveys of I deficiency using s-TG.
The use of s-TG should also consider inter-assay differences and possibly detection differences in s-TG between I nutrition levels. Attempts have been made to reduce the consequences thereof by standardisation( Reference Feldt-Rasmussen, Profilis and Colinet 42 ) but differences between assays remain and should be taken into consideration( Reference Vejbjerg, Knudsen and Perrild 10 , Reference Feldt-Rasmussen, Profilis and Colinet 42 ). Hence, we suggest a two dimensional approach that includes I in urine for the overall assessment of I nutrition and s-TG for validation and individual I nutrition assessment. The results will thus be reliable based on UIC and supported by using s-TG.
The spot urine I excretion of our study population suggested borderline to mild I deficiency. Different I intake levels associate with different s-TG levels. Whether variance of s-TG also differs with levels of s-TG remains to be settled. Also, the variance for other groups such as school children needs to be clarified.
Conclusion
Variation was lower for s-TG than for UIC. Thus, more participants are needed for similar reliability of the results for UIC compared with s-TG. Consequently, 80 % of samples are redundant in an individual if s-TG is used to assess I nutrition level, and 20 % less individuals are needed in a population when using s-TG rather than UIC. Consequently, s-TG provides the opportunity for either fewer participants or a higher reliability.
This difference may be used to gain more from I nutrition surveys. The reliability of the study is upheld when UIC is used to assess the overall I nutrition level and s-TG is added to assess I nutrition in subgroups. This is important when planning and evaluating I nutrition surveys in populations and in individuals. We thus suggest using both measures to assess I nutrition in a smart I nutrition survey design.
Acknowledgements
The authors gratefully acknowledge Carla Hame and Ruth Møller Jensen for their support during the data collection in Saqqaq and Eskild Boeskov for support in Ilulissat.
This work was supported by grants from Greenland Government’s Research Council and by Karen Elise Jensen Fond. They had no role in the design, analysis or writing of this article.
S. A. contributed to study design, raising of funds, data collection, analysis and interpretation of data and writing of the manuscript. P. N. contributed substantially to interpretation of data and critical review of the manuscript. L. W. contributed substantially to interpretation of data and critical review of the manuscript. P. L. contributed to study design, raising of funds, data collection, analysis and interpretation of data and reviewing of the manuscript. P. L. tragically passed away but did approve the final version of the manuscript before the initial review.
The authors declare that there are no conflicts of interest.