Comparative effectiveness of urate lowering with febuxostat versus allopurinol in gout: analyses from large U.S. managed care cohort

Introduction To assess the comparative effectiveness of febuxostat and allopurinol in reducing serum urate (sUA) levels in a real-world U.S. managed care setting. Methods This retrospective study utilized 2009 to 2012 medical and pharmacy claims and laboratory data from a large U.S. commercial and Medicare Advantage health plan. Study patients had at least one medical claim with a diagnosis of gout, at least one filled prescription for febuxostat or allopurinol and at least one sUA measurement post-index prescription. Reduction in sUA was examined using propensity score-matched cohorts, matched on patient demographics (gender, age), baseline sUA, comorbidities, geographic region and insurance type. Results The study sample included 2,015 patients taking febuxostat and 14,025 taking allopurinol. At baseline, febuxostat users had a higher Quan-Charlson comorbidity score (0.78 vs. 0.53; P <0.001), but similar age and gender distribution. Mean (standard deviation (SD)) sUA level following propensity score matching among treatment-naïve febuxostat vs. allopurinol users (n = 873 each) were: pre-index sUA, 8.86 (SD, 1.79) vs. 8.72 (SD, 1.63; P = 0.20); and post-index sUA, 6.53 (SD, 2.01) vs. 6.71 (SD, 1.70; P = 0.04), respectively. A higher proportion of febuxostat users attained sUA goals of <6.0 mg/dl (56.9% vs. 44.8%; P <0.001) and <5.0 mg/dl (35.5% vs. 19.2%; P <0.001), respectively. Time to achieve sUA goals of <6.0 mg/dl (346 vs. 397 days; P <0.001) and <5.0 mg/dl was shorter in febuxostat vs. allopurinol users (431 vs. 478 days; P <0.001), respectively. Similar observations were made for overall propensity score-matched cohorts that included both treatment-naïve and current users (n = 1,932 each). Conclusions Febuxostat was more effective than allopurinol at the currently used doses (40 mg/day for febuxostat in 83% users and 300 mg/day or lower for allopurinol in 97% users) in lowering sUA in gout patients as demonstrated by post-index mean sUA level, the likelihood of and the time to achieving sUA goals. Electronic supplementary material The online version of this article (doi:10.1186/s13075-015-0624-3) contains supplementary material, which is available to authorized users.


Introduction
Gout is the most common inflammatory arthritis in adults, affecting 3.9% of the U.S. population [1]. The prevalence of gout far exceeds that of rheumatoid arthritis (RA) at 1% [2], the prototype of inflammatory arthritis in adults. Gout is associated with significant morbidity, functional limitation and health-related quality of life (HRQOL) deficits [3][4][5] as well as increased cardiovascular morbidity and mortality [6][7][8][9]. Optimal treatment of gout is based on two principles: adequate chronic use of urate-lowering therapies (ULT; a xanthine oxidase (XO) inhibitor and uricosurics) aiming to achieve target serum urate (sUA) levels and anti-inflammatory therapies for acute flares and antiinflammatory prophylaxis [10]. Adequate lowering of sUA to a target level of <6.0 mg/dl is associated with lower risk of acute flares [11] and better function and quality of life [12], and is cost effective in various health care settings [13][14][15][16][17]. Thus, achievement of target sUA <6.0 mg/dl is key to quality management of gout [18][19][20]. With 8.3 million U.S. adults suffering from gout [1], appropriate sUA lowering will likely reduce its public health burden and associated cost.
Comparative effectiveness research (CER) is a high priority area for research, practice and policy-making and recent commitment of $1.1 billion to CER by the American Recovery and Reinvestment Act of 2009 underscores its importance to health policy [21]. For 50 years, a single purine XO inhibitor, allopurinol, was available in the U.S. [22,23]. Since allopurinol is available as a generic medication and is an effective ULT, it is used in >95% of cases, while uricosurics are used infrequently for the treatment of hyperuricemia in gout [24][25][26]. In 2009, a nonpurine XO inhibitor, febuxostat, was approved by the U.S. Food and Drug Administration for the treatment of gout [27]. We aimed to assess the patterns of use of allopurinol and febuxostat in a large managed care organization (MCO) and perform a CER study comparing the ability of allopurinol and febuxostat to lower sUA. Our main study objective was to study the change in sUA with allopurinol versus febuxostat treatment by assessing the proportion of patients achieving a post-index sUA goal of <6.0 mg/dL in the follow-up period, a clinically meaningful and important outcome for patients with gout [11][12][13][14][15][16]. As a secondary objective, we compared serum creatinine (SCr) levels between allopurinol and febuxostat users.

Setting, participants and data sources
Methods and results are described as recommended in the STrengthening of Reporting in OBservational studies in Epidemiology (STROBE) statement [28]. This retrospective study analyzed U.S. patients with a prescription for either febuxostat or allopurinol for the treatment of gout from February 1, 2009 to May 31, 2012. Medical data, pharmacy data, enrollment information and laboratory results from study subjects from both Medicare Advantage plans and a commercial plan were obtained from the Optum Research Database (ORD). Patients in this dataset are similar to U.S. insured population in terms of race, gender, age, and geographic distribution, which has been described previously [29]. All administrative claims data were de-identified and study procedures adhered to the provisions of the Health Insurance Portability and Accountability Act (HIPAA) of 1996. Because this study did not involve the collection, use, or transmittal of individually identifiable data, Institutional Review Board review or approval was not required and no patient consent was needed.
Commercial and Medicare Advantage health plan enrollees were identified between February 1, 2009 and May 31, 2012 (the identification period). Medicare Advantage is a managed Medicare health insurance plan offered by private insurers [30]. To be eligible for study inclusion, patients must have met both of the following criteria during the identification period: 1) had at least one medical claim with an International Classification of Diseases, ninth revision (ICD-9) diagnosis code for gout (274.xx); and 2) had at least one pharmacy claim for allopurinol or febuxostat. This time frame included the febuxostat launch date. The date of the first febuxostat or allopurinol filled prescription was defined as the patient's index date; if a patient had a filled prescription for both febuxostat and allopurinol, then the first filled prescription for febuxostat was defined as the index date. Patients were excluded if they were younger than 18 years as of the year of index date; had evidence of cancer or rheumatoid arthritis during the study period; had no sUA laboratory result 14 or more days after the index date (the primary outcome measure); had less than six months of continuous enrollment prior to their index date (baseline period); or had <90 days follow-up after index prescription of allopurinol or febuxostat. In order to control for confounding, baseline data were obtained during the six-month baseline period prior to the index date. Patients were followed until August 30, 2012 or until the patient was no longer enrolled in the health plan, whichever was earlier.

Independent variable (drug exposure)
Patients were assigned to one of two study cohorts based on whether their index medication filled prescription was for febuxostat or allopurinol. Because febuxostat was approved in 2009, patients who received allopurinol and then switched to febuxostat were assigned to the febuxostat cohort. Patients were not excluded from the study if they had evidence of febuxostat or allopurinol use during the baseline period. Allopurinol-treated patients were allowed to have an index medication dose between 100 and 1,500 mg/day, because allopurinol is available in various tablet strengths. Febuxostat-treated patients were included if they had an index medication dose 40 mg or 80 mg, because febuxostat is only available in these recommended doses. Patient demographics (age, gender and region) and baseline clinical characteristics (utilization and comorbid conditions assessed by Quan-Charlson comorbidity score [31]) were assessed. Gout is frequently associated with comorbidities [32][33][34][35][36].

Study outcome measures
Outcomes were assessed during a variable follow-up period of at least three months following the index date. The main outcome measure was post-index mean sUA levels. The proportions of patients who achieved a target sUA <6 mg/ dl or <5 mg/dl were also assessed. When patients had more than one post-index sUA level, the earliest value that attained goal was selected.

Bias
We anticipated selection bias, that is, confounding by indication, because patient characteristics impact the choice of allopurinol vs. febuxostat. Therefore, we used propensity score matching (PSM) to minimize this bias. In the absence of chart review, some misclassification error due to use of codes for gout may have occurred. We did not think that this led to biased estimates, since there is no evidence that this may have occurred more often with one medication versus the other.

Sample size
No formal sample size calculations were done a priori. All available patients who met the study inclusion and exclusion criteria were included in this analysis.

Patient matching and statistical analysis
We used PSM methodology to account for selection/ channeling bias [37]. Propensity scores were estimated by unconditional logistic regression analyses that incorporate predictors of therapy as independent variables in the regression and treatment cohort as the outcome. The propensity score was the fitted value of the probability of being a member of the febuxostat cohort given membership in the study population and the covariate pattern. To the extent that the clinical decision to use febuxostat in a particular patient depends on the health characteristics of the patient at the time of the decision, the propensity score modeled the clinical decision-making process. The covariates used in the propensity analysis included: age, gender, insurance type, region, baseline medication use (allopurinol, uricosurics), baseline comorbidities (kidney failure, kidney stones, dialysis, angina, diabetes, coronary artery disease, heart failure, myocardial infarction, stroke, peripheral artery disease, osteoarthritis, hypertension, hyperlipidemia, and gout flares), baseline sUA, baseline health care costs and the follow-up duration (post-index prescription period). For each febuxostat patient, an allopurinol patient with the closest propensity score (±0.01 units) was selected. Patients who were not matched were excluded from analysis. Febuxostat and allopurinol patients were matched in a 1:1 ratio. Following the propensity score match, all categorical variables were examined descriptively. Comparisons between the febuxostat and allopurinol cohorts for categorical outcomes of interest (post-index sUA goal <6.0 mg/dl and <5.0 mg/dl) were done using a McNemar's test while continuous measures (pre-and post-index prescription sUA and SCr, time to sUA) were examined using a paired Student's t test; an a priori two-tailed level of significance was set at the 0.05 level.

Demographic and clinical characteristics
Nearly 315,076 commercial and Medicare Advantage patients had a medical claim indicating gout, and 143,237 of these patients also had a pharmacy claim for febuxostat or allopurinol. After application of continuous enrollment, laboratory result, dose and cancer criteria, a final sample of 16,040 patients was available for analysis ( Figure 1). Of these, 2,015 patients were assigned to the febuxostat cohort and 14,025 were assigned to the allopurinol cohort. Table 1 presents characteristics of the study sample before and after matching. The post-index follow-up periods in these unmatched patient populations differed somewhat and were as follows: allopurinol group, 224 days (standard deviation (SD), 256); and febuxostat group, 188 days (SD, 220). Therefore, the cohorts were matched on the postindex follow-up duration, among other variables, to avoid selection bias. More than 80% of patients were male in both cohorts.
Twenty-four percent of febuxostat patients had switched from allopurinol. Mean (SD) allopurinol and febuxostat doses prior to sUA target <6 mg/dl were as follows: 284 mg/day (SD, 123 mg/day) and 54 mg/day (SD, 22 mg/ day). Following PSM, no significant differences were observed in demographic characteristics, or frequencies of comorbidities (Table 1).

Main analysis: serum urate (sUA) in treatment-naïve subjects
There were 873 matched pairs that had no evidence of treatment with allopurinol or febuxostat in the pre-index period ( Table 2). The most common doses were 300 mg/ day or lower dose for allopurinol and 40 mg/day for febuxostat ( Table 2). Patterns of use of anti-inflammatory prophylaxis and index prescription dose change are shown in Table 2. Rates of comorbidities during the post-index period were similar between cohorts in the propensity score-matched analysis (Table 3).
In this subset, febuxostat patients were more likely to have commercial insurance vs. Medicare Advantage (85.5% vs. 78.7%, P <0.001). There were no differences between the febuxostat and allopurinol populations in baseline comorbidities or the overall comorbidity score.
The final mean (SD) allopurinol and febuxostat doses prior to the target sUA achievement of <6 mg/dl were as follows: 276 mg/day (SD, 109 mg/day) and 54 mg/ day (SD, 20 mg/day). Of the treatment-naïve febuxostat patients, 56.9% attained the sUA goal of <6.0 mg/dl compared to 44.8% of the allopurinol patients (P <0.001; Figure 2a). A total of 35.5% of the treatment-naïve febuxostat patients attained the sUA goal of <5.0 mg/dl versus 19.2% of the allopurinol patients (P <0.001; Figure 2a).
Sensitivity analyses: propensity score-matched full study cohort, new and current users Table 4 presents sUA and SCr results in the full propensity score-matched sample of patients, including new and current users (n = 1,932 matched pairs). Pre-index period mean sUA was non-significantly higher among febuxostat users, 8.52 in febuxostat users and 8.36 in allopurinol users (P = 0.29). The average post-index sUA was lower in febuxostat compared to the allopurinol users (6.41 vs. 6.64, P <0.001), a difference that is statistically,   though perhaps not clinically, significant (Table 4). Index allopurinol dose and changes, prophylaxis treatments were similar to the treatment-naïve cohorts (Additional file 1). The pattern of post-index comorbidities was similar between propensity score-matched cohorts (Additional file 2). Post-index target sUA goal attainment differed significantly between the cohorts (Figure 2b). A higher proportion of febuxostat users compared with allopurinol users attained the sUA goal of <6.0 mg/dL (58.5% vs. 47.1%, P <0.001) and the sUA goal <5.0 mg/dL (36.2% vs. 21.8%, P <0.001), findings very similar to those from the treatment-naïve population (Table 4; Figure 2b). Febuxostat users also had a shorter average length of time to target sUA goal attainment than allopurinol users (goal of <6.0 mg/dL: 348 days vs. 410 days, P <0.001; goal of <5.0 mg/dL: 443 days vs. 501 days, P <0.001) (Figure 3).
In the pre-index period, the final SCr result was 1.36 mg/dl among febuxostat users, and 1.28 mg/dl among allopurinol users (P <0.001) ( Table 4). In the post-index period, the average SCr result was 1.31 mg/dl in febuxostat users, and 1.21 mg/dl in allopurinol users (P <0.001). The change from pre-index to mean post-index SCr result (a decrease in both cohorts) did not differ across treatment cohorts (−0.02 mg/dl in both cohorts (P = 0.82)).

Discussion
In this study we found that in most commonly used doses, a significantly higher proportion of patients receiving febuxostat (most common dose, 40 mg/day) achieved a target sUA of <6.0 mg/dl and <5.0 mg/dl compared to those receiving allopurinol (most common dose, 300 mg/ day or lower). Until recently, the treatment of hyperuricemia in gout mostly hinged on the appropriate use of a single drug, that is, allopurinol, since the use of uricosurics is uncommon. Since the U.S. launch of febuxostat in 2009, patients now have two XO inhibitors to choose from, allopurinol and febuxostat. Therefore, a comparative study of these two drugs is needed to help patients, providers and policy makers in making treatment decisions. These findings support the results observed in the pivotal clinical trials for febuxostat [33,38]. The target goals of serum urate <6.0 mg/dl and <5.0 mg/dl were achieved a month sooner in febuxostat-treated compared to allopurinol-treated patients. Several findings from this study deserve further discussion. After matching patients on baseline characteristics, a significantly higher proportion of patients taking febuxostat than allopurinol achieved the target sUA level at commonly prescribed doses, with approximately 66% more patients achieving sUA <5.0 mg/dl (36.2% vs. 21.8%; relative difference) and a quarter more achieving sUA <6.0 mg/dl (58.5% vs. 47.1%). This difference is not only statistically significant, but also clinically meaningful. Patients receiving febuxostat (mostly at doses of 40 mg/day; 19% received 80 mg/day) achieved target sUA in one month less time than allopurinol-treated (most common dose 300 mg daily or lower; 5% received >300 mg/day) patients, a significant difference, both statistically and clinically. However, one must remember that in this effectiveness study, these measurements were not done at the predefined time, but at a time clinically indicated and as a part of routine clinical care. It may have taken a shorter time to achieve these target sUA levels.
These are important findings and indicate that febuxostat is an effective option for treatment of hyperuricemia in patients with gout. The higher rate of sUA testing in the febuxostat vs. allopurinol group (2.15 vs. 1.85 times, respectively) and a higher proportion of febuxostat-treated than allopurinol-treated patients getting a higher dose (19% received febuxostat 80 mg/day vs. 5% received allopurinol >300 mg/day), may have contributed to the ability to achieve target sUA, but it is unclear as to how much this contributed to the success rates. This needs to be examined in future studies. Our study extends similar findings by Kim et al. [39] in their unadjusted comparison of post-index sUA <6 mg/dl to a propensity score-adjusted analysis. Our study also adds new knowledge regarding both sUA goals and the time to achievement of both sUA goals of <6 and <5 mg/dl. Less than one-third of patients had a dose change in allopurinol and less than one-fifth in febuxostat doses after the index prescription, indicating that the dose titration as recommended by the guidelines to achieve target sUA is not a common practice [20].
These data from MCO enrollees reflect the common practices regarding treatment of hyperuricemia prevalent in the U.S. Just under half of patients using allopurinol received 300 mg/day (45%) with 50% getting <300 mg/ day and only 5% getting >300 mg daily dose, as previously reported [24][25][26], and recently confirmed by Kim et al. [39] The demographics of patients in this dataset are similar to the U.S. commercially insured population, with similarities in age distribution and comorbidities to another gout study [39]. For example, 50% of individuals Table 3 Post-index clinical characteristics in the treatment-naïve propensity score-matched cohorts -(rates per 1,000 patient-years) in this database were male, 73% were white, 8% were African American, and 9% were Hispanic. This database has previously been used to study ULT in gout patients [29]. While a higher dose of allopurinol is needed in many gout patients with higher body mass index, the most prescribed allopurinol dose for gout is still ≤300 mg daily [24][25][26]. Thus, the CER presented provides evidence for comparison of these most commonly used doses of allopurinol, not all allopurinol doses. It is likely that allopurinol dose of 300 mg/day is subtherapeutic in many patients with gout; lower allopurinol doses <300 mg/day may be even more likely to be subtherapeutic. There is an emerging trend to increase allopurinol dose to 800 to 1,500 mg/day until target sUA is achieved. SCr decreased in both allopurinol and febuxostat users after treatment. This is an interesting finding, since gout is a metabolic disease with an effect on renal function. ULTs have been hypothesized to improve renal function, but definitive proof is lacking. A multicenter randomized trial assessing the impact of allopurinol on improving renal function in patients with diabetes with normal or moderately impaired kidney function was recently funded by the National Institutes of Health to answer this question more definitively [40]. Our observational study generates this hypothesis that can be tested in future randomized trials.
Our findings must be interpreted considering study limitations. Our study was observational, making it liable to residual confounding. Another limitation is that of selection bias, due to clinicians' likelihood of choosing one or the other medication based on patient and/or disease characteristics. For example, compared to allopurinol  users, febuxostat users may be expected to have had more severe gout and a higher rate of renal failure, since febuxostat was shown to be effective and safe in patients with renal failure. PSM was done to overcome selection bias. Misclassification error is possible, since we used ICD-9 codes to identify our study cohort. However, in a previous validation study at a Veterans Affairs Medical Center, 78% of patients with a database code for gout had evidence of this diagnosis in medical charts [41]. This was a real-world, observational analysis, and patient  compliance with gout medication treatment may have varied over time. This may be one of the reasons that time to target sUA levels were more than one year on average in this analysis, a longer time period than is noted in clinical trials with controlled dosing environments. Quan-Charlson index score, our measure of comorbidity (renal disease, cardiac disease and so on), is a standard validated measure [31], but depends on the presence of ICD-9 codes in medical records, which raises the possibility of misclassification. We were unable to assess comparative safety of the two medications, due to limited resources. A small proportion of patients (5%) receiving allopurinol dose >300 mg/day of allopurinol limited us from comparing higher doses of allopurinol to febuxostat. Another study limitation is that we performed PSM only at baseline that included the duration of the medication exposure. The use of propensity scores at multiple follow-up times may allow for adjustment for other confounders; however, this could not be undertaken due to resource constraints.

Conclusions
We found in this study that at the currently used doses, febuxostat (most common dose of 40 mg/day) was more effective in achieving the target sUA than allopurinol (most common doses of 300 mg/day or lower). The time to achieve target sUA in the febuxostat group is a month shorter than in the group receiving allopurinol. Slight improvements in renal function were noted with both allopurinol and febuxostat. These findings can inform patients and physicians when they are making a choice regarding the treatment of hyperuricemia. Obviously, the cost differences between the two treatments (febuxostat with much higher cost than allopurinol) should be taken into account. It is likely that individualized patient-physician decisionmaking that incorporates these data along with the risk of medication side effects and costs will lead to a more informed decision and a more satisfied patient. Future research with this data source will focus on the impact that the higher comparative effectiveness of febuxostat might have on health care costs.

Additional files
Additional file 1: Patient treatment patterns among full matched study population (N = 3,864).
Additional file 2: Post-index clinical characteristics in the propensity score-matched cohorts (%).
Competing interests JAS has received research grants from Takeda and Savient and consultant fees from Savient, Takeda, Regeneron and Allergan. JAS is a member of the executive committee of OMERACT, an organization that develops outcome measures in rheumatology and receives arms-length funding from 36 companies; a member of the American College of Rheumatology's Guidelines Subcommittee of the Quality of Care Committee; and a member of the Veterans Affairs Rheumatology Field Advisory Committee. AS is an employee of Takeda Pharmaceuticals International, Inc. (TPI) and KSA was employed by TPI at the time the research was conducted. No other conflicts exist for authors, including all financial and nonfinancial conflicts.
Authors' contributions JAS as the project principal investigator (PI) was responsible for the study concept and design, modification of the study design, review and interpretation of analyses, writing the first draft of the manuscript and making revisions to the manuscript. KSA was a Takeda employee at the time work was performed, who helped with development of the study protocol, reviewed and interpreted data analyses, and made revisions to the manuscript. AS is a Takeda employee, who reviewed the study design, reviewed and interpreted data analyses, and made revisions to the manuscript. The decision to submit the manuscript was made by the project PI. Takeda pharmaceuticals provided a scientific review of the manuscript and comments. The decision to or not to incorporate these comments was made by the lead author and study PI (Dr. Singh). All authors read and approved the final manuscript.
Authors' information Kasem S. Akhras was employed by Takeda Pharmaceutical International Inc., Deerfield, IL, USA at the time work was performed.