5Academic Unit of Primary Health Care, Department of Community Based Medicine, University of Bristol, Bristol, United Kingdom

Correspondence to: P Jüni, Institute of Social and Preventive Medicine, University of Bern, Switzerland juni{at}ispm.unibe.ch

Accepted 5 July 2010

Abstract

Objective To determine the effect of glucosamine, chondroitin, or the two in combination on joint pain and on radiological progression of disease in osteoarthritis of the hip or knee.

Design Network meta-analysis. Direct comparisons within trials were combined with indirect evidence from other trials by using a Bayesian model that allowed the synthesis of multiple time points.

Main outcome measure Pain intensity. Secondary outcome was change in minimal width of joint space. The minimal clinically important difference between preparations and placebo was prespecified at −0.9 cm on a 10 cm visual analogue scale.

Eligibility criteria for selecting studies Large scale randomised controlled trials in more than 200 patients with osteoarthritis of the knee or hip that compared glucosamine, chondroitin, or their combination with placebo or head to head.

Results 10 trials in 3803 patients were included. On a 10 cm visual analogue scale the overall difference in pain intensity compared with placebo was −0.4 cm (95% credible interval −0.7 to −0.1 cm) for glucosamine, −0.3 cm (−0.7 to 0.0 cm) for chondroitin, and −0.5 cm (−0.9 to 0.0 cm) for the combination. For none of the estimates did the 95% credible intervals cross the boundary of the minimal clinically important difference. Industry independent trials showed smaller effects than commercially funded trials (P=0.02 for interaction). The differences in changes in minimal width of joint space were all minute, with 95% credible intervals overlapping zero.

Conclusions Compared with placebo, glucosamine, chondroitin, and their combination do not reduce joint pain or have an impact on narrowing of joint space. Health authorities and health insurers should not cover the costs of these preparations, and new prescriptions to patients who have not received treatment should be discouraged.

Introduction

Osteoarthritis of the hip or knee is a chronic condition mostly treated with analgesics and non-steroidal anti-inflammatory drugs, but these drugs can cause serious gastrointestinal and cardiovascular adverse events, especially with long term use.12 Disease modifying agents that not only reduce joint pain but also slow the progression of the condition would be desirable. Throughout the world for the past 10 years, the cartilage constituents chondroitin and glucosamine have been increasingly recommended in guidelines, prescribed by general practitioners and rheumatologists, and used by patients as over the counter medications to modify the clinical and radiological course of the condition.3 Global sales of glucosamine supplements reached almost $2bn (£1.3bn, €0.8bn) in 2008, which represents an increase of about 60% compared with 2003, with a forecasted continued growth through 2013 reaching $2.3bn.4 The oral administration of cartilage constituents in patients with osteoarthritis is thought to make up for the apparent cartilage loss in affected joints. Chondroitin is a highly hydrophilic, gel forming polysaccharide macromolecule. Its hydrocolloid properties convey much of the compressive resistance of cartilage. Glucosamine is an amino sugar that is a building block for the glycosaminoglycans that are part of the structure of cartilage. Ingested chondroitin and glucosamine are both partially absorbed in the intestine, and it has been suggested that some of the ingested amount reaches the joints.567

Results from randomised trials about the effectiveness of chondroitin and glucosamine are conflicting.891011 Trials that have reported large effects on joint pain were often hampered by poor study quality and small sample sizes,9101112 whereas large methodologically sound trials often found only small or no effects.101113

Bayesian approaches towards network meta-analyses allow a unified, coherent analysis of data recorded at multiple time points in randomised trials that compare either of these preparations with placebo or head to head.141516 The approaches fully respect randomisation, account for the correlation of multiple observations within the same trial, and allow the estimation of the relative effectiveness of the different preparations and their combination. We performed a systematic review with network meta-analysis including data from large methodologically sound randomised trials at multiple follow-up times to determine the effect of these preparations on joint pain and on radiological progression of disease.

Methods

Literature search

We searched the Cochrane Controlled Trials Register, Medline, Embase, and CINAHL (from inception to June 2010) using a combination of keywords and text words related to osteoarthritis; these were combined with generic and trade names of the various preparations plus a validated filter for controlled clinical trials.17 We also retrieved reports citing relevant articles via Science Citation Index (1981-2008). In addition, we manually searched conference proceedings and text books, screened reference lists of all obtained papers, and contacted content experts.

Study selection

We included randomised trials with an average of at least 100 patients with knee or hip osteoarthritis per arm.18 Trials compared chondroitin sulphate, glucosamine sulphate, glucosamine hydrochloride, or the combination of any two with placebo or head to head. A sample size of 2×100 patients will yield more than 80% power to detect a small to moderate effect size of −0.40 at a two sided P=0.05, which corresponds to a difference of 1 cm on a 10 cm visual analogue scale between the experimental and control intervention. Two of four reviewers (BT, EN, SR, ST) evaluated reports independently for eligibility. They excluded trial arms with sub-therapeutic doses (<800 mg/day of chondroitin and <1500 mg/day of glucosamine, in accordance with doses licensed in Europe). Disagreements were resolved by consensus.

Outcome measures

The prespecified primary outcome was absolute pain intensity reported in any of nine time windows organised in increments of three months (up to 3 months, 6, 9, 12, 15, 18, 21 months, and 22 months or more). If more than one time point was reported in a window, we extracted data nearest to the longest follow-up time included in that window; for the window covering 22 months or more, we extracted the follow-up closest to 24 months. When an article provided data on more than one pain scale, we referred to a previously described hierarchy of pain related outcomes and extracted the outcome that was highest on this list.9 Global pain took precedence over pain on walking and pain subscores on the Western Ontario and McMaster Universities (WOMAC) arthritis index. If a trial report provided data on both—for example, global pain scores and WOMAC pain subscores—we recorded only data on global pain scores. Secondary outcomes were changes in the minimum radiographic joint space between baseline and the end of treatment, the number of individuals withdrawn or who dropped out because of an adverse event, and the number of patients experiencing any adverse event.

Quality assessment

Two of the four reviewers independently assessed concealment of allocation, blinding, and adequacy of analyses.19 Concealment of allocation was considered adequate if the investigators responsible for the selection of patients did not know before allocation which treatment was next in line (central randomisation, sealed, opaque, sequentially numbered assignment envelopes, coded drug packs, etc). Any procedures based on predictable generation of allocation sequences, and potentially transparent attempts to conceal allocation, such as non-opaque envelopes, were considered inadequate. We extracted the number of patients initially randomised and the number of patients analysed per group at each time point to distinguish between trials that had included all randomised patients in the analysis (intention to treat analysis) and trials that had not. Finally, we determined whether experimental preparations had undergone quality control—that is, if either a formally approved preparation was used or pharmacological laboratory analysis confirmed the content of the preparation. Disagreements were resolved by consensus.

Data collection

Two of the four reviewers used a standardised form to extract in duplicate data on publication status, trial design, patients’ characteristics, treatment regimens, outcome modalities, and funding. Results of pain, joint space narrowing, and adverse events were extracted by one reviewer (ST) and cross checked by another (PJ). When necessary, means and measures of dispersion were approximated from figures in the reports.

Statistical analysis

We used an extension of multivariable Bayesian hierarchical random effects models for mixed multiple treatment comparisons with minimally informative prior distributions.2021 It fully preserves the comparison of randomised treatments within each trial while combining all available comparisons between treatments and accounts for multiple comparisons within a trial when there are more than two treatment arms.22 For the analysis of effect sizes of pain, the model included random effects at the level of trials and time points. It accounted for the correlation of outcome data reported at different time points within a trial and allowed the estimation of the variance of treatment effects between trials (τ2). Effect sizes were calculated by dividing the differences in mean values between treatment groups in a time window by the median pooled standard deviation (SD) observed across all time points in a trial.23 If SDs were not provided, we calculated them from standard errors or confidence intervals as described elsewhere.1024 An effect size of −0.20 SD units suggests an overlap in the distributions of reported pain scores in the experimental group with pain scores in placebo group in 85% and can be considered a small difference between experimental and control group.923 An effect size of −0.50 indicates an overlap in about 67% and can be considered a moderate difference, whereas −0.80 suggests an overlap in 53% and is considered a large difference.923

To allow intuitive interpretation of pooled effects, we back transformed effect sizes to differences on a 10 cm visual analogue scale on the basis of a median pooled SD of 2.5 cm found in large scale osteoarthritis trials that assessed pain on a 10 cm visual analogue scale.12 We prespecified a minimal clinically important difference of 0.37 SD units, corresponding to 0.9 cm on a 10 cm visual analogue scale. This was based on the median minimal clinically important difference found in recent studies in patients with osteoarthritis.25262728 As the analysis of changes of minimum radiographic joint space did not include multiple time points, the model used for this outcome included only a random effect at the level of trials. To achieve comparability of the magnitude of effects on joint space and on pain and distinguish between small, moderate, and large treatment effects, we expressed differences in the width of the joint space as effect sizes, dividing the pooled estimates in millimetres by the median pooled SD of 1.2 mm found in included trials.

Whenever possible, we used results of intention to treat analysis including all randomised patients.12 Pooled effect sizes were estimated from the median of the posterior distribution. A negative effect size indicates a benefit of the experimental intervention. Corresponding 95% credible intervals were estimated from the 2.5th and 97.5th centiles of the posterior distribution.15 In the presence of minimally informative priors, credible intervals can be interpreted in a similar way to conventional confidence intervals. To determine whether the variation of treatment effects over time was over and above what would be expected by chance, we calculated a P value for heterogeneity across time points of follow-up.29 The P value was derived from the proportion of observations of the posterior distribution of the variance observed across time points within trials smaller than or equal to the variance within trials typically found in large osteoarthritis trials (0.01 for an effect size scale, 0.0625 for a 10 cm visual analogue scale).

To explore possible time trends, we included a linear term for time as a covariate in the analyses. We then included characteristics of the trials as covariates in the network meta-analysis to estimate effects according to concealment of allocation; intention to treat analysis; high methodological quality defined as adequate concealment of allocation, adequate blinding of patients, and the presence of an intention to treat analysis; source of funding (industry independent v other); type of glucosamine used (sulphate v hydrochlorides); quality control of preparations; and type of joint affected (knee v hip). P values for interaction between trial characteristics and treatment effect were derived from the posterior distribution of covariates and can be interpreted in the same way as a traditional P value for interaction.30

Heterogeneity between trials was estimated from the median variance between trials (τ²) observed in the posterior distribution with the following prior distributions: a gamma distribution for heterogeneity between trials (1/τ² ∼ gamma(0.001,0.001)I(0,2000)), and a uniform distribution for heterogeneity between time points (τ ∼ unif(0,50)). In a sensitivity analysis we also used a uniform prior for the heterogeneity between trials. The consistency of the network was determined by use of inconsistency factors: the estimated difference between the effect size from direct comparisons within randomised trials and the effect size from indirect comparisons between randomised trials with one intervention in common.31 Estimates of variation and consistency are based on back transformations to differences on a 10 cm visual analogue scale. Goodness of ﬁt was assessed with Q-Q plots.

Finally, we performed pairwise meta-analyses with random effects at the level of trials and time points, as well as a simpler network meta-analysis including only one treatment effect per trial (absolute pain intensity at the longest follow-up available). Convergence of Markov chains was deemed to be achieved if plots of the Gelman-Rubin statistics indicated that widths of pooled runs and individual runs stabilised around the same value and their ratio around one.32 Accordingly, all analyses are based on 150 000 iterations, of which the first 50 000 were discarded as burn-in period. We used Stata (Stata Statistical Software: release 10; StataCorp LP 2005, College Station, TX) and WinBUGS (version 1.4; MRC Biostatistics Unit 2007, Cambridge, UK) for all analyses.

Results

Out of 58 potentially eligible reports, 12 reports describing 10 trials met our inclusion criteria and were included in the network meta-analysis.133334353637383940414243 All trials were published as full journal articles. For one trial two publications were included1342; for another trial43 additional data were provided in an electronic rapid response.40

Study characteristics

The 10 included trials had randomly allocated a total of 3803 patients to either of the experimental interventions or placebo. Figure 1 shows the network of interventions⇓. Five trials (1104 randomised patients) compared glucosamine sulphate with placebo.3334353941 In another placebo controlled trial (205 patients), the investigators were forced to change from glucosamine sulphate to glucosamine hydrochloride after 80% of the patients had been treated with glucosamine sulphate because the manufacturer of glucosamine sulphate declined to supply matching placebos.36 Three trials (1229 patients) compared chondroitin sulphate with placebo,373843 and one trial (1265 patients) compared glucosamine hydrochloride, chondroitin sulphate, and their combination with placebo.13 Tables 1 and 2 show the characteristics of trials⇓⇓.

Table 1

Characteristics of identified randomised trials of glucosamine or chondroitin for osteoarthritis of hip or knee

Six trials described adequate concealment of allocation,133536394143 nine trials reported adequate blinding of patients, and in one trial34 it was unclear. Seven trials performed an intention to treat analysis.13343536374143 Eight trials included patients with osteoarthritis of the knee only,1333343536373943 one trial included patients with osteoarthritis of the hip or knee,38 and one trial included patients with osteoarthritis of the hip only.41 All except three trials133641 were funded by manufacturers of supplements. In eight trials, experimental preparations had undergone quality control to ensure adequate concentrations of glucosamine or chondroitin, and in two trials3841 it was unclear. The average age of patients was 58-66 (median 62), and the percentage of women ranged from 27% to 86% (median 68%). The average duration of symptoms ranged from a minimum of six months to more than 10 years. All treatments were administered on consecutive days in all trials. Duration of follow-up varied substantially between trials, from one month33 to 36 months,3435 and the number of follow-up visits from one13333437 to 1235 (table 1)⇑.

Fig 1 Structure of network formed by interventions and their direct comparisons. Numbers of trials and patients do not add up to numbers reported in table 2 because of multi-arm trial by Clegg et al13

Effects on joint pain

All trials contributed to the network meta-analysis of pain related outcomes (see appendix 1 on bmj.com). Figure 2⇓ presents pooled estimates across different time points. The variation across time points was not over and above what would be expected by chance (τ2=0.04 for variation across time points on a 10 cm visual analogue scale, P=0.93 for interaction between treatment effect and time). The overall difference in pain intensity versus placebo based on a summary of all time points was −0.4 cm (95% credible interval −0.7 to −0.1 cm) on a 10 cm visual analogue scale for glucosamine, −0.3 cm (−0.7 to 0.0 cm) for chondroitin, and −0.5 cm (−0.9 to 0.0 cm) for the combination of glucosamine and chondroitin. Corresponding effect sizes were −0.17 (−0.28 to −0.05) for glucosamine, −0.13 (−0.27 to 0.00) for chondroitin, and −0.19 (−0.37 to 0.00) for the combination. Heterogeneity between trials was low (τ2=0.04 for heterogeneity between trials on a 10 cm visual analogue scale), there was no evidence for inconsistency (inconsistency factor 0.2 cm, −0.7 to 1.1, P=0.63), and the goodness of fit of the model to the data was excellent (data available on request). Results from the primary network meta-analysis were concordant with a model including a linear term for time, conventional meta-analyses of direct comparisons, a network meta-analysis, which included only one time point for pain intensity at the end of follow-up, and an analysis with a different prior distribution for the heterogeneity between trials (see appendix 2 on bmj.com).

Figure 3 shows the results from stratified analyses⇓. Estimates comparing supplements with placebo depended to some extent on the quality of the trials, the presence or absence of quality control measures for preparations, the type of study joint, and the type of glucosamine salt used, but tests for interaction were all negative for these variables (P≥0.20 for interaction). The estimated differences between supplements and placebo, however, were, on average, 0.5 cm (0.1 to 0.9 cm) less pronounced in industry independent trials compared with industry sponsored trials (P=0.02 for interaction).

Effects on radiological joint space

Six trials reported changes in width of joint space.343537414243 The network meta-analysis of differences in changes in minimal joint space narrowing at the end of the treatment period showed minute effects for all preparations compared with placebo. The difference was −0.2 mm (−0.3 to 0.0 mm) in favour of glucosamine, −0.1 mm (−0.3 to 0.1 mm) in favour of chondroitin, and 0.0 mm (−0.2 to 0.2 mm) for the combination, which corresponded to effect sizes of −0.16 (−0.25 to 0.0), −0.08 (−0.25 to 0.08), and 0.00 (−0.16 to 0.16). Heterogeneity between trials was low (τ2=0.02), there was no evidence for inconsistency (inconsistency factor −0.1 mm, −0.6 to 0.4 mm; P=0.54), and the goodness of fit of the model to the data was excellent.

Safety

Five trials reported on adverse events,3334353841 all 10 reported withdrawals or drop-outs because of adverse events, and three reported serious adverse events.333841 The odds ratios of adverse events compared with placebo were 0.94 (0.59 to 1.47) for glucosamine and 0.99 (0.49 to 2.00) for chondroitin; no data were available on adverse events overall for the combination. The odds ratios for withdrawals or drop-outs because of adverse events were 0.99 (0.61 to 1.50) for glucosamine, 0.92 (0.56 to 1.51) for chondroitin, and 0.90 (0.43 to 1.85) for the combination. Heterogeneity between trials was low for both outcomes, with τ2 of 0.02 and 0.03, respectively. We could estimate inconsistency only for drop-outs because of adverse events, with some evidence of inconsistency (ratio of relative risks 0.54, 0.19 to 1.46, P=0.22 for inconsistency).

Discussion

Principal findings

Our network meta-analysis of all 10 available large scale patient blind randomised trials in 3803 patients with knee or hip osteoarthritis showed no clinically relevant effect of chondroitin, glucosamine, or their combination on perceived joint pain. Despite abundant statistical power, none of the pooled estimates crossed the pre-specified boundary of a minimal clinically important difference of −0.9 cm on a 10 cm visual analogue scale at any of the recorded time points. At some time points the 95% credible interval crossed this boundary (see fig 3), which could mean that we cannot exclude a relevant effect at such time points. The overall estimates, which combine effects over different time points, were precise, however, and the lower end of their credible intervals did not cross the pre-specified boundary. These estimates should be considered most valid in view of the negative test of interaction of treatment effects by time (P=0.93), which indicates that the observed variation over different time points is not over and above what would be expected by chance alone.

The upper limit of the 95% credible interval of the overall pooled estimate of glucosamine versus placebo and chondroitin versus placebo did not overlap the line of no effect, which suggests that a traditional P value for this comparison would be significant at the conventional 5% level. Statistical significance should not, however, be confused with clinical relevance. With the observed differences in pain intensity of 0.3 to 0.5 cm between supplements and placebo on a 10 cm visual analogue scale, the range and distribution of pain scores in patients receiving supplements and placebo are near identical,923 and it would be impossible, based on the reported pain intensity at the end of a trial, to determine whether a patient was allocated to a supplement or to placebo.

In stratified analyses, we found that estimates comparing supplements with placebo depended to some extent on the quality of the trials, the presence or absence of quality control measures for preparations, the joint studied, and the type of glucosamine salt used, but tests for interaction were all negative for these variables (P≥0.20 for interaction). On average, the estimated differences between supplements and placebo were 0.5 cm less pronounced in industry independent trials compared with industry sponsored trials, and estimated treatment effects in industry independent trials were minute to zero and by no means clinically relevant (see fig 2). The effects on minimal width of joint space were small, again clinically irrelevant, and—with credible intervals overlapping the line of no effect—non-significant at the conventional α level of 5%.

Strengths and weaknesses

Our network meta-analysis integrated evidence from direct and indirect comparisons while fully preserving randomisation. It enabled us to simultaneously analyse effect sizes reported at different follow-up times in a single model and to estimate the overall effect of preparations irrespective of the duration of follow-up while fully accounting for potential variation across time points and for the correlation of estimates within a trial. Consequently, estimates in our analysis were more precise than the pairwise meta-analyses or the network meta-analysis with only pain intensity at the end of follow-up (see appendix 2 on bmj.com).

We performed an extensive literature search,44 which makes it unlikely that we missed any relevant trial. Trial selection and data extraction including quality assessment were done independently by two authors to minimise bias and transcription errors.45 Components used for quality assessment are validated and reported to be associated with bias.121946 In line with our pre-specified inclusion criteria, the trials in our network were large and of satisfactory methodological quality.

As with conventional meta-analyses, some will argue that we have not compared like with like. Our model, however, was based on relative treatment effects (differences between groups expressed as effect sizes23), and variations in patients’ characteristics between trials are fully accounted for in the analysis by maintaining randomised comparisons within each trial. Network meta-analysis makes similar assumptions to standard meta-analysis of direct comparisons within trials but requires that these assumptions hold over the entire set of trials in the network—that is, for the indirect comparisons also. In addition, our model assumes that relative treatment effects comparing two interventions in different trials are from the same common distribution. The smaller the heterogeneity between trials, and the smaller the inconsistency between direct randomised comparisons and indirect comparisons, the more likely these assumptions hold. The heterogeneity between trials in our analysis was near zero and the upper credible interval for the τ2 estimate was 0.24 on a 10 cm visual analogue scale (the maximum τ of the underlying distribution of treatment effects compatible with the credible interval would be 0.5 cm). In addition, we investigated potential sources of variation in the network by including characteristics of trials as covariates in the analysis of the primary outcome. Taken together, results of these analyses make it likely that relative treatment effects originate from one common distribution and confirm one of our key assumptions. As with heterogeneity between trials, inconsistency between direct and indirect comparisons was also near zero (inconsistency factor 0.2 cm). Although we cannot rule out clinically relevant inconsistency (the upper credible interval for the inconsistency factor crossed the pre-specified threshold for a clinically relevant effect at 0.9 cm), we have no indication that clinical characteristics of included patients or other trial characteristics confounded the indirect comparisons. The use of different instruments to measure joint pain made it necessary to calculate effect sizes as a common measure of effectiveness to ensure comparability between outcomes assessed with different instruments. Poor correlation or differences in responsiveness of different instruments could be a potential threat to the validity of results.47 The scales used in the component trials of our network (10 cm visual analogue scale and WOMAC pain subscales), however, were highly correlated and have comparable responsiveness.48

Relation to other studies

Several systematic reviews and meta-analyses on glucosamine and chondroitin have been published.8101149505152 The three most recent ones were by Vlad et al11 on glucosamine, Reichenbach et al10 on chondroitin, and Lee et al52 on radiographic outcomes of both preparations. Vlad and colleagues analysed 15 trials comparing glucosamine with placebo.11 They found a pooled effect size of −0.35 (95% confidence interval −0.56 to −0.14) in favour of glucosamine, but there was substantial heterogeneity. Trials with adequate concealment of allocation, industry independent trials, and trials evaluating glucosamine hydrochloride showed less beneficial effects and less pronounced heterogeneity between trials than the remainder. The authors concluded that glucosamine hydrochloride is ineffective but could not exclude the possibility of a clinically relevant effect of glucosamine sulphate. Reichenbach and colleagues found large heterogeneity among 20 chondroitin trials, which could be explained by a lack of concealment of allocation, failure to perform an intention to treat analysis, and small sample sizes.10 The initial pooled effect size of −0.75 (−0.99 to −0.50) in favour of chondroitin sulphate diminished to zero when the analysis was restricted to methodologically sound trials of adequate sample size. Both groups had analysed only one time point per trial, which was criticised.40 Lee and colleagues included six trials evaluating the effects of chondroitin or glucosamine on narrowing of joint space (four were included in our analysis and we excluded two because of small sample size).52 They found significant small to moderate protective effects. They did not, however, include the GAIT trial.42 We included methodologically superior large scale patient blinded trials in more than 200 patients in our network meta-analysis and used a statistical model that allowed the simultaneous analysis and summary of treatment effects observed at multiple time points. Addressing earlier concerns about time dependency of effects,40 quality control of preparations,53 and differences between different formulations of glucosamine,11 we conclude that there is no evidence for time dependent effects, that the lack of a clinically relevant effect of these preparations is not related to a lack of quality control, and that the lack of a clinically relevant effect is also apparent for glucosamine sulphate. With the summary of multiple time points and the combination of direct comparisons within trials between preparations with indirect evidence from other trials, these conclusions are based on considerably more high quality evidence than the previous restricted analyses of trials considered least biased by Vlad et al and Reichenbach et al.1011

Implications

We believe it unlikely that future trials will show a clinically relevant benefit of any of the evaluated preparations. Some will argue, however, that many patients included in the trials of our network were too ill in radiological terms to benefit and that their advanced radiological stage meant that the subtotal to total cartilage damage could not be influenced any more by the experimental preparations. Others will argue that many patients were not ill enough in clinical terms and that their small amount of experienced pain meant that they could not benefit from the analgesic effects of the preparations.54 To address these concerns, in addition to the trials by Clegg et al,13 Rozendaal et al,41 and McAlindon et al,36 some might consider the necessity for a fourth industry independent trial, which would exclusively include patients with an experienced pain intensity at baseline of at least 4 cm on a 10 cm visual analogue scale and moderate osteoarthritis, corresponding to a Kellgren and Lawrence score of 2.55 Inclusion of 150 to 200 patients in each comparison group would yield more than 90% power to detect a minimal clinically relevant difference of −0.9 cm on a 10 cm visual analogue scale for any of these preparations compared with placebo at a conventional two sided α level of 5%. The trial should use coded drug packs with preparations and placebos of identical appearance and taste to conceal treatment allocation and ensure blinding of patients and care givers, carefully control and monitor analgesic cointerventions, and fully adhere to the principle of intention to treat by the inclusion of all patients in the analysis in the groups to which they were originally allocated. The evaluated preparations should have undergone thorough quality control to ensure appropriate concentrations of chondroitin and glucosamine sulphate. The industry independent randomised Long Term Evaluation of Glucosamine Sulphate Study (LEGS) will probably satisfy most of these criteria.56 It allocated 600 patients to one of four treatment arms— chondroitin sulphate, glucosamine sulphate, their combination, or matching placebo—and closed recruitment in October 2009. First results will become available at the earliest in November 2011 (M Fransen, personal communication).

Conclusions

Our findings indicate that glucosamine, chondroitin, and their combination do not result in a relevant reduction of joint pain nor affect joint space narrowing compared with placebo. Some patients, however, are convinced that these preparations are beneficial,57 which might be because of the natural course of osteoarthritis, regression to the mean, or the placebo effect.58 We are confident that neither of the preparations is dangerous. Therefore, we see no harm in having patients continue these preparations as long as they perceive a benefit and cover the costs of treatment themselves.57 Coverage of costs by health authorities or health insurers for these preparations and novel prescriptions to patients who have not received other treatments should be discouraged.

What is already known on this topic

Chondroitin and glucosamine have been recommended in guidelines, prescribed by general practitioners and rheumatologists, and used by patients as over the counter medications to modify the clinical and radiological course of osteoarthritis

Results from randomised trials about the effectiveness of chondroitin and glucosamine are conflicting

What this study adds

Chondroitin, glucosamine, and their combination do not have a clinically relevant effect on perceived joint pain or on joint space narrowing

Estimated differences between supplements and placebo were less pronounced on average in industry independent trials, and estimated treatment effects in industry independent trials were small or absent and clinically irrelevant

Notes

Cite this as:BMJ 2010;341:c4675

Footnotes

We thank Bruno da Costa for helpful discussions related to minimal clinically important differences and limitations of effect sizes and Malcolm Sturdy for database development and maintenance.

Contributors: SW and PJ contributed equally. PJ conceived the study. PJ, ST, and SW and were responsible for conception and design of the study. SW, PJ, NJW, and ST did the analysis and interpreted the analysis in collaboration with BT, EN, PMV, and SR. SW, PJ, BT, EN, SR, and ST were responsible for the acquisition of data. PJ and SW wrote the ﬁrst draft of the manuscript. All authors critically revised the manuscript for important intellectual content and approved the ﬁnal version of the manuscript. PJ and SR obtained public funding. PJ and PMV provided administrative, technical, and logistical support. PJ is guarantor.

Funding: The study was funded by grants from the Swiss National Science Foundation’s National Research Program 53 on musculoskeletal health (PJ and SR) (No 4053-0-104762/3). PJ was a senior research fellow in the Program for Social Medicine, Preventive and Epidemiological Research funded by the Swiss National Science Foundation (grant No 3233-066377). SR was a recipient of a research fellowship funded by the Swiss National Science Foundation (grant No PBBEB-115067). SW was a recipient of an individual fellowship of the Janggen-Poehn-Foundation. The study sponsor had no role in study design, data collection, data synthesis, data interpretation, writing the report, or the decision to submit the manuscript for publication. None of the authors is affiliated with or funded by any manufacturer of any of the agents evaluated in this study.

Competing interests: All authors have completed the Unified Competing Interest form at www.icmje.org/coi_disclosure.pdf (available on request from the corresponding author) and declare: no support from any institution for the submitted work; no financial relationships with any institutions that might have an interest in the submitted work in the previous 3 years; no other relationships or activities that could appear to have influenced the submitted work.

Ethical approval: Not required.

Data sharing: Technical details, statistical code, and dataset available from the corresponding author.

Angst F, Aeschlimann A, Stucki G. Smallest detectable and minimal clinically important differences of rehabilitation intervention with their implications for required sample sizes using WOMAC and SF-36 quality of life measurement instruments in patients with osteoarthritis of the lower extremities. Arthritis Rheum2001;45:384-91.