Abstract

Objectives This study sought to perform an indirect comparison analysis of dabigatran etexilate (2 doses), rivaroxaban, and apixaban for their relative efficacy and safety against each other.

Background Data for warfarin compared against the new oral anticoagulants (OACs) in large phase III clinical trials of stroke prevention in atrial fibrillation (AF) are now available for the oral direct thrombin inhibitor, dabigatran etexilate, in 2 doses (150 mg twice daily [BID], 110 mg BID), and the oral Factor Xa inhibitors, rivaroxaban and apixaban. A “head-to-head” direct comparison of drugs is the standard method for comparing different treatments, but in the absence of such head-to-head direct comparisons, another alternative to assess the relative effect of different treatment interventions would be to perform indirect comparisons, using a common comparator. Nonetheless, any inter-trial comparison is always fraught with major difficulties, and an indirect comparison analysis has many limitations, especially with the inter-trial population differences and thus, should not be overinterpreted.

Methods Indirect comparison analysis was performed using data from the published trials.

Results There was a significantly lower risk of stroke and systemic embolism (by 26%) for dabigatran (150 mg BID) compared with rivaroxaban, as well as hemorrhagic stroke and nondisabling stroke. There were no significant differences for apixaban versus dabigatran (both doses) or rivaroxaban; or rivaroxaban versus dabigatran 110 mg BID in preventing stroke and systemic embolism. For ischemic stroke, there were no significant differences between the new OACs. Major bleeding was significantly lower with apixaban compared with dabigatran 150 mg BID (by 26%) and rivaroxaban (by 34%), but not significantly different from dabigatran 110 mg BID. There were no significant differences between apixaban and dabigatran 110 mg BID in safety endpoints. Apixaban also had lower major or clinically relevant bleeding (by 34%) compared with rivaroxaban. When compared with rivaroxaban, dabigatran 110 mg BID was associated with less major bleeding (by 23%) and intracranial bleeding (by 54%). There were no significant differences in myocardial infarction events between the dabigatran (both doses) and apixaban.

Conclusions Notwithstanding the limitations of an indirect comparison study, we found no profound significant differences in efficacy between apixaban and dabigatran etexilate (both doses) or rivaroxaban. Dabigatran 150 mg BID was superior to rivaroxaban for some efficacy endpoints, whereas major bleeding was significantly lower with dabigatran 110 mg BID or apixaban. Only a head-to-head direct comparison of the different new OACs would fully answer the question of efficacy/safety differences between the new drugs for stroke prevention in AF.

Stroke is a devastating complication associated with atrial fibrillation (AF), which is the most common sustained cardiac rhythm disorder. Effective prevention of stroke requires oral anticoagulation (OAC) therapy, and until recently, this was dependent upon the vitamin K antagonist class of drugs, for example, warfarin (1). The limitations of warfarin, including the need for regular monitoring, have led to the quest for new OACs that would be safe and effective alternatives to warfarin (2).

Data for warfarin compared against the new OACs in large phase III clinical trials are now available for the oral direct thrombin inhibitor, dabigatran etexilate, in 2 doses (150 mg twice daily [BID], 110 mg BID) and the oral Factor Xa inhibitors, rivaroxaban and apixaban (3–6). Apixaban also has 1 clinical trial against aspirin, among patients deemed unsuitable for or who have refused warfarin (7).

The RE-LY (Randomized Evaluation of Long-Term Anticoagulant Therapy) trial compared 2 doses of dabigatran etexilate against warfarin in AF patients with 1 or more stroke risk factors, and reported that dabigatran 110 mg BID was noninferior to warfarin for the primary endpoint of stroke and systemic embolism, with 20% less major bleeding events (3). Following additional adjudication, some endpoints from the original 2009 publication were updated in a 2010 letter to the New England Journal of Medicine (4). The ROCKET-AF (Rivaroxaban Once Daily Oral Direct Factor Xa Inhibitor Compared With Vitamin K Antagonism for Prevention of Stroke and Embolism Trial in Atrial Fibrillation) trial studied a high-risk population of patients with AF, and found that rivaroxaban 20 mg once daily (OD) (with a dose adjustment of 15 mg OD for those with moderate renal impairment) was noninferior to warfarin for stroke and systemic embolism, with a similar rate of major bleeding (5). The ARISTOTLE (Apixaban for Reduction in Stroke and Other Thromboembolic Events in Atrial Fibrillation) trial reported that apixaban 5 mg BID (with a dose adjustment to 2.5 mg BID for patients with 2 of 3 criteria: age >80 years, body mass index <60 kg/m2, and serum creatinine >133 μmol/l) was superior to warfarin (by 21%) for reducing stroke and systemic embolism, with 31% fewer major bleeding events (6). All the drugs demonstrated significantly less hemorrhagic stroke and intracranial hemorrhage, compared with warfarin. The outcome data from these trials are summarized in Online Table 1.

Both dabigatran and rivaroxaban already have regulatory approval and a license for stroke prevention in some countries, and apixaban is anticipated to gain regulatory approval in 2012. Thus, clinicians may have a number of new OACs available, but would need information about how these agents directly compare against each other in terms of efficacy and safety. Head-to-head clinical trials of these new OACs would require huge numbers of patients to achieve statistical power for noninferiority, and may not be viable options in the near future. In the absence of such direct comparisons in large prospective randomized controlled trials, another accepted alternative to assess the relative effect of different treatment interventions would be to perform indirect comparisons on the basis of the published trials so far, using a common comparator (8). In the case of the new OACs, warfarin was the comparator used in common for all the main noninferiority trials.

The aim of the present study was to perform an indirect comparison analysis of apixaban against dabigatran etexilate (2 doses) and rivaroxaban, as well as rivaroxaban against dabigatran etexilate (2 doses), for their relative efficacy and safety against each other. Nonetheless, any intertrial comparison is always fraught with major difficulties, and an indirect comparison analysis has many limitations, especially with the intertrial population differences, and thus should not be over-interpreted. However, this method is a well-accepted analysis in the absence of head-to-head trials.

Methods

The main efficacy and safety endpoints from the RE-LY, ROCKET-AF, and ARISTOTLE clinical trials were reviewed for comparability and consistency of definitions (Online Table 1). RE-LY and ARISTOTLE were broadly similar in patient demography (e.g., age, gender mix, etc.) and baseline stroke risk (average CHADS2 score of 2.1). By contrast, ROCKET-AF patients were slightly older (median age: 73 years), were at higher stroke risk (mean CHADS2 score: 3.5), and 55% were a secondary prevention population. The average time in therapeutic range values in the warfarin-treated patients for RE-LY, ROCKET-AF, and ARISTOTLE were 64%, 55%, and 62%, respectively. RE-LY was conducted as an open trial, (prospective, randomized, open-blinded endpoint evaluation, i.e., PROBE design), whereas the other trials were double-blind trials.

All baseline characteristics, except for age and average CHADS2 score, were described with proportions/risk for reported characteristics. The studies were then compared in terms of risk differences and corresponding 95% confidence intervals, which was considered a relevant measure for the quantification of the difference between trials. Comparison of average CHADS2 score between the trials was a standard 2-sample comparison for normal data. Age differences were not compared statistically due to differences in reporting between the trials.

Our endpoints of interest for this mixed treatment comparisons focused on the primary efficacy and safety endpoints. For all trials, the primary efficacy endpoint was “all stroke and systemic embolism,” whereas the primary safety endpoint for all trials, except ROCKET-AF, was major bleeding by International Society on Thrombosis and Haemostasis (ISTH) criteria (3–7). In ROCKET-AF, the primary safety endpoint was the composite of “major and clinically relevant non-major bleeding,” results of which were not reported in RE-LY.

“Life-threatening bleeding” was not reported in the ARISTOTLE trial. Also, “major or clinical relevant non-major bleeding” was not reported in RE-LY, but included in ARISTOTLE. The combined endpoint “ischemic or uncertain type of stroke” was not reported for ROCKET-AF, but the result from ischemic stroke was used because the reported frequencies of uncertain stroke types were very low.

Secondary endpoints of interest were also compared in this study, including individual components of the primary efficacy and safety endpoints. Also, there was reported a numerical increase in myocardial infarction (MI) events in the RE-LY trial (3,4), and some debate has arisen as to whether this is due to dabigatran or reflects a protective effect of warfarin (9). Finally, the endpoint of pulmonary embolism was not reported in ROCKET-AF.

For all endpoints found appropriate for analysis, the reported hazard rate ratio and confidence intervals were extracted from the study publications. In the RE-LY study, the results were reported as risk ratio and confidence intervals. Also, an update was published for the RE-LY study (4), and the data from this update was used for our analysis. Results from intention-to-treat analyses were used throughout.

Statistical methods

We used the so-called Bucher method (10) for indirect comparisons using a common comparator, which is a statistical method for estimating hazard rate ratio and corresponding uncertainty. The method is recommended (8) as a preferred method for indirect comparison, superior to informal methods, as comparison of confidence intervals.

For these comparisons, we let HRAB and HRCB be the reported hazard rate ratio of treatment A versus B and of treatment C versus B. Observing that B is the common reference comparator (warfarin), HRAC may be estimated as HRAC = HRAB/HRCB. A confidence interval was estimated using the reported confidence intervals (CIl, CIu). The standard error (SE) of log(HR) was estimated by: ½(log(CIu) − log(CIl))/1.96, where 1.96 is the 97.5% fractile of the standard normal distribution. The variance of log(HRAC), (seAC)2, is under independence given as (seAB)2 + (seCB)2, and thus a 95% confidence interval of HRAC is: exp(log(HRAC) ± 1.96 × seAC). Reported p values are for the hypothesis H0:log(HRAC) = 0 versus HA:log(HRAC) <> 0, assuming log(HRAC) as normal with variance (seAC)2.

The application of the method relies on a similarity assumption (11) that the hazard rate ratio HRAB was likely also obtained if it was determined on the basis of the population from which HRCB was estimated. Importantly, the most direct way to consider this similarity assumption valid is to require that the clinical studies be absolutely comparable in terms of population characteristics and conduct of experiment (and as highlighted earlier, there are some differences between the trials—see Tables 1 and 2).

Summary of the Main Clinical Trials Involving Novel Anticoagulants for Stroke Prevention in Nonvalvular AF

First, the expected effect of “any new anticoagulant” versus warfarin was estimated as a weighted average using the inverse of the variance of the log(HR) as weights. Only results from the dabigatran 150 mg BID arm of the RE-LY study was used in combination with data from ARISTOTLE and ROCKET-AF to ensure independence between the components of the average. No statistical adjustments were made for multiple comparisons within this study. The second focus in this analysis was the indirect comparisons of apixaban versus dabigatran (both doses) and rivaroxaban, as well as rivaroxaban versus dabigatran (both doses). Direct comparisons of dabigatran 110 mg BID versus dabigatran 150 mg BID were already available from the RE-LY trial (3).

Results

The clinical trials being compared for this analysis are summarized in Table 1, with their primary safety and efficacy endpoints summarized in Online Table 2.

Summary patient characteristics and risk differences (with confidence intervals) are presented for the entire study population in Table 2. As expected, there were important differences between the studies, particularly for the CHADS2 score (>50 percentage point difference between ROCKET-AF and the other trials) and the proportion of secondary prevention (approximately 35 percentage point difference between ROCKET-AF and the other trials). Mean/median age was broadly similar, as were the proportions of female patients. There were more patients with paroxysmal AF in RE-LY (36.4%) compared with ROCKET-AF and ARISTOTLE (approximately 15 to 17 percentage point difference). The prevalence of prior heart failure, diabetes, and hypertension was highest in ROCKET-AF (62.5%, 40%, and 90.5%, respectively) compared with the other 2 trials. Prior warfarin use was 62.4% in ROCKET-AF, compared with 50% in RE-LY (by design) and 57% in ARISTOTLE. The proportions with prior MI and prior aspirin use were broadly similar in the 3 trials.

Risk Differences and Confidence Intervals, in Relation to Differences in the Study Populations at Baseline

When data with dabigatran 150 mg BID were used for the weighted average effects analysis, the new OACs as a whole were associated with lower stroke or systemic embolism (21%, p < 0.001), lower stroke (23% p < 0.001), and lower hemorrhagic stroke (53%, p < 0.001) than warfarin. All-cause mortality was lower for any new OAC (by 12%, p < 0.001). Major and intracranial bleeding were lower for any new OAC by 13% (p < 0.001) and 51% (p < 0.001) respectively. Weighted average hazard ratio and confidence intervals are given in Table 3.

Hazard rate ratio point estimates and 95% confidence intervals for the main trial endpoints for all indirect comparisons using warfarin as a common comparator are shown in Table 4 and Figures 1 and 2.⇓

Indirect Comparison Using Warfarin as Single Common Comparator, on the Basis of the RE-LY, ROCKET-AF, and ARISTOTLE Trials

Relative efficacy of dabigatran, apixaban, and rivaroxaban

There was a significantly lower risk of stroke and systemic embolism (by 26%) for dabigatran (150 mg BID) compared with rivaroxaban, as well as less hemorrhagic stroke (by 56%, p = 0.039) and nondisabling stroke (by 40%, p = 0.038). There were no significant differences for apixaban versus dabigatran (both doses) or rivaroxaban, or rivaroxaban versus dabigatran 110 mg BID, in preventing stroke and systemic embolism. For the ischemic stroke endpoint, there were no significant differences between the new OACs.

Apixaban also had lower major or clinically relevant bleeding (by 34%, p < 0.001) compared with rivaroxaban. There was an increase (278%, p = 0.027) in systemic embolism for apixaban compared with rivaroxaban with lower bound of 95% confidence interval at 16% increase. Gastrointestinal bleeds were only reported numerically in ROCKET-AF and therefore not used in this analysis.

Our indirect comparison analysis did not find any significant differences in MI events between dabigatran (both doses) and apixaban but more MI events were seen with dabigatran (>50%) compared to rivaroxaban (Table 4, Fig. 2).

Discussion

In this indirect comparison analysis, the new OACs as a whole resulted in a lower risk of stroke or systemic embolism, stroke, hemorrhagic stroke, and all-cause mortality compared with warfarin, Also, major and intracranial bleeding were lower for any new OAC.

Notwithstanding the limitations of an indirect comparison study, this analysis shows that there were no significant differences for apixaban versus dabigatran (both doses) or rivaroxaban; or between rivaroxaban versus dabigatran 110 mg BID in preventing stroke and systemic embolism, although dabigatran 150 mg BID was superior to rivaroxaban for preventing stroke and systemic embolism, hemorrhagic stroke, and nondisabling stroke. There were no differences between the new OACs in the ischemic stroke endpoint.

We found no significant differences between dabigatran 110 mg BID and apixaban in the main safety endpoints. When compared with dabigatran 150 mg BID, apixaban had significantly less major bleeding and gastrointestinal bleeding, but there were no significant differences between apixaban and dabigatran 110 mg BID in the safety endpoints. When compared with rivaroxaban, apixaban had lower major bleeding, whereas dabigatran 110 mg BID was associated with less major bleeding and lower intracranial bleeding.

The indirect comparison using a common comparator could be considered the gold standard approach of performing indirect comparisons in the absence of head-to-head clinical trials with these new agents (8). Such a trial would require huge numbers of study subjects to demonstrate noninferiority, especially in a broad group of AF patients with ≥1 stroke risk factors and given that overall stroke rates may be declining (12,13).

In the RE-LY trial, which did directly compare dabigatran 110 mg BID against dabigatran 150 mg BID (and both against warfarin), there was a significant reduction in the primary endpoint of stroke and systemic embolism with dabigatran 150 mg BID over dabigatran 110 mg BID (relative risk: 0.73, 95% confidence interval: 0.58 to 0.91, p < 0.001) (3). Even for ischemic stroke, where this endpoint was significantly reduced compared with warfarin in the RE-LY trial per se, the present analysis does not support any statistically significant differences in this endpoint between apixaban versus dabigatran (both doses) or rivaroxaban, or even rivaroxaban versus dabigatran (both doses). Nonetheless, dabigatran 150 mg BID did appear to have significantly less hemorrhagic stroke and nondisabling stroke compared with rivaroxaban.

Bleeding is clearly an endpoint of concern in anticoagulated AF patients (14). The present analysis clearly shows significantly lower major bleeding rates with apixaban compared with dabigatran 150 mg BID and rivaroxaban, and a nonsignificant difference between apixaban and dabigatran 110 mg BID. In the ARISTOTLE trial, major bleeding was 31% lower with apixaban compared with warfarin, and our observations in the present analysis would be consistent with this. For the endpoint of major and clinically relevant nonmajor bleeding, apixaban also did better than rivaroxaban. Of note, major bleeding with dabigatran 110 mg BID was not significantly different from apixaban, but significantly less (as was intracranial bleeding) when compared with rivaroxaban.

Bleeding risk in AF is multifactorial (14), and indeed, the addition of apixaban to antiplatelet therapy in the setting of acute coronary syndromes resulted in a significant increase in major bleeding with no evidence of efficacy (15). In the setting of acute coronary syndromes, a low-dose (2.5 mg and 5.0 mg), twice-daily regime of rivaroxaban resulted in a significant reduction in the primary endpoint (a composite of death from cardiovascular causes, MI, or stroke) compared with placebo among patients taking dual antiplatelet therapy, at the cost of more major bleeding events and intracranial hemorrhage (16).

Another safety endpoint of concern in the AF trials was a numerical increase in MI events in the RE-LY trial (0.8%/year with dabigatran compared with warfarin, 0.6%/year), and this indirect comparison analysis did not show any statistically significant differences in MI events between dabigatran (both doses) and apixaban. However, our indirect comparison analysis found both doses of dabigatran were associated with >50% more MI events compared to rivaroxaban but this needs to be put in context of the low absolute rates and the positive net clinical benefit overall of using these new drugs. A recent detailed analysis from RE-LY found that whereas MI events were numerically increased, other myocardial ischemic events were not (17). Also, the relative effects of dabigatran versus warfarin on myocardial ischemic events were consistent in patients with or without a baseline history of MI (17). Of note, vascular mortality was significantly reduced with dabigatran 150 mg BID compared with warfarin in the RE-LY trial (3). One meta-analysis was suggestive of a significant increase in MI events with dabigatran use in various different clinical trials for AF, acute coronary syndromes, and venous thromboembolism, although total mortality was significantly less with dabigatran versus comparator (18).

Study limitations

This indirect comparison addresses the main efficacy and safety endpoints reported in the trials, and the fundamental challenges for such an approach are as follows: 1) the differences in patient population; 2) differences in definition of major bleeding; and 3) unblinded versus nonblinded/double-blinded comparisons. Indeed, deviations from similarity are potentially more confounding in this approach than in a traditional meta-analysis. However, major bleeding and major plus clinically relevant nonmajor bleeding were all defined by ISTH criteria. Also, “stroke and systemic embolism” is a standardized primary endpoint for clinical trials of stroke prevention in AF (19). Other adverse effects, such as dyspepsia, were not compared, although this was increased with dabigatran compared with warfarin in the RE-LY trial (3).

Clearly, there are a number of constraints and limitations related to the method used for an indirect comparison analysis, but this is still considered a reasonable statistical tool to qualify a comparison of effects that have not yet been investigated head to head. The method and its usage, pros, and cons have been the subject of many papers in recent years, and is an accepted technique and method (8,10,20).

Another limitation is that this analysis cannot adjust for patient demography and stroke risk of the different trial populations, and some differences are evident (Tables 1 and 2, Online Table 2), with, e.g., ROCKET-AF studying a higher-risk population. Furthermore, we cannot account for differences in warfarin control between the trials, with mean time in therapeutic range in the RE-LY and ARISTOTLE trials being much better than that seen in the ROCKET-AF trial. An alternative technique to the Bucher method used in this paper would be a network-based indirect comparison, which is a Bayesian Markov chain Monte Carlo method. The latter has its strengths in situations where there are many trials and with trials that have several treatment arms. We feel that in this particular setting with only 3 trials, each with its own treatment, that the Bucher method was preferred because it is much more simple and transparent method. Differences in inclusion and exclusion criteria, patient population, data collection methods, and outcome definitions or adjudication may result in residual confounding and persisting heterogeneity. Indeed, the statistical technique used somewhat assumes that the patients entered in the various trials and the level of anticoagulation were comparable.

Also, although all patients had AF, the risk of stroke and perhaps other endpoints differed between trials, and no statistical adjustments were made for multiple comparisons within this study. Finally, we have only used data from the published phase III trials for this analysis, and there is still 1 large ongoing trial with the oral Factor Xa inhibitor edoxaban in AF, which has yet to report (21).

Conclusions

Notwithstanding the limitations of an indirect comparison study, we found no profound significant differences in efficacy between apixaban and dabigatran etexilate (both doses) or rivaroxaban. Dabigatran 150 mg BID was superior to rivaroxaban for efficacy (with less stroke and systemic embolism and hemorrhagic stroke). Major bleeding was significantly lower with dabigatran 110 mg BID or apixaban. Only a head-to-head direct comparison of the different new OACs would fully answer the question of efficacy/safety differences between the new drugs for stroke prevention in AF.

Appendix

For supplemental tables, please see the online version of this article.

Appendix

Online Tables 1 and 2

Indirect Comparisons of New Oral Anticoagulant Drugs for Efficacy and Safety When Used for Stroke Prevention in Atrial Fibrillation

Footnotes

Prof. Lip has served as a consultant for Bayer, Astellas, Merck, AstraZeneca, Sanofi, Bristol-Myers Squibb/Pfizer, Daiichi-Sankyo, Biotronik, Portola, and Boehringer-Ingelheim; and has been on the Speaker's Bureau for Bayer, BMS/Pfizer, Boehringer-Ingelheim, and Sanofi-Aventis. Drs. Larsen and Rasmussen have served as speakers for BMS/Pfizer and Boehringer-Ingelheim. Dr. Skjøth has reported that he has no relationships relevant to the contents of this paper to disclose.

(2011) Bleeding risk assessment and management in atrial fibrillation patients: Executive summary of a position document from the European Heart Rhythm Association [EHRA], endorsed by the European Society of Cardiology [ESC] Working Group on Thrombosis. Thromb Haemost106:997–1011.

(2007) Outcome parameters for trials in atrial fibrillation: recommendations from a consensus conference organized by the German Atrial Fibrillation Competence NETwork and the European Heart Rhythm Association. Europace9:1006–1023.

Toolbox

Thank you for your interest in spreading the word about JACC: Journal of the American College of CardiologyNOTE: We request your email address only as a reference for the recipient. We do not save email addresses.

Your Email *

Your Name *

Send To *

Enter multiple addresses on separate lines or separate them with commas.