- Research article
- Open Access
A five-year model to assess the early cost-effectiveness of new diagnostic tests in the early diagnosis of rheumatoid arthritis
Arthritis Research & Therapyvolume 18, Article number: 135 (2016)
There is a lack of information about the sensitivity, specificity and costs new diagnostic tests should have to improve early diagnosis of rheumatoid arthritis (RA). Our objective was to explore the early cost-effectiveness of various new diagnostic test strategies in the workup of patients with inflammatory arthritis (IA) at risk of having RA.
A decision tree followed by a patient-level state transition model, using data from published literature, cohorts and trials, was used to evaluate diagnostic test strategies. Alternative tests were assessed as add-on to or replacement of the ACR/EULAR 2010 RA classification criteria for all patients and for intermediate-risk patients. Tests included B-cell gene expression (sensitivity 0.60, specificity 0.90, costs €150), MRI (sensitivity 0.90, specificity 0.60, costs €756), IL-6 serum level (sensitivity 0.70, specificity 0.53, costs €50) and genetic assay (sensitivity 0.40, specificity 0.85, costs €750). Patients with IA at risk of RA were followed for 5 years using a societal perspective. Guideline treatment was assumed using tight controlled treatment based on DAS28; if patients had a DAS28 >3.2 at 12 months or later patients could be eligible for starting biological drugs. The outcome was expressed in incremental cost-effectiveness ratios (€2014 per quality-adjusted life year (QALY) gained) and headroom.
The B-cell test was the least expensive strategy when used as an add-on and as replacement in intermediate-risk patients, making it the dominant strategy, as it has better health outcomes and lower costs. As add-on for all patients, the B-cell test was also the most cost-effective test strategy. When using a willingness-to-pay threshold of €20,000 per QALY gained, the IL-6 and MRI strategies were not cost-effective, except as replacement. A genetic assay was not cost-effective in any strategy. Probabilistic sensitivity analysis revealed that the B-cell test was consistently superior in all strategies. When performing univariate sensitivity analysis for intermediate-risk patients, specificity and DAS28 in the B-cell add-on strategy, and DAS28 and sensitivity in the MRI add-on strategy had the largest impact on the cost-effectiveness.
This early cost-effectiveness analysis indicated that new tests to diagnose RA are most likely to be cost-effective when the tests are used as an add-on in intermediate-risk patients, and have high specificity, and the test costs should not be higher than €200–€300.
Rheumatoid arthritis (RA) is a chronic inflammatory joint disease characterised by structural irreversible joint damage, leading to severe disability, serious loss of quality of life and premature death if left untreated [1–6]. Disease progression can be slowed down by synthetic and biologic disease-modifying antirheumatic drugs (DMARDs), especially if started early in the disease [7–11]. This requires early detection of RA. However, early diagnosis is complex, because RA-related symptoms early in the disease course resemble those of other musculoskeletal disorders . Early detection would result in DMARDs being started early and improved prognosis. Therefore, several diagnostic tests are currently being developed to improve the early diagnosis of RA in patients with inflammatory arthritis (IA), e.g., B-cell related gene expression, IL-6 serum level test, magnetic resonance imaging (MRI) of hands and feet, and genetic assays with susceptibility single nucleotide polymorphisms (SNPs) for RA. However, little is known about their potential cost-effectiveness.
To guide implementation of new diagnostic tests in the workup of patients at risk of having RA, a decision model could be used to evaluate the test performance (i.e., sensitivity and specificity), test costs, and positioning of the test in terms of the clinical outcomes and societal costs. Evaluating the cost-effectiveness of new drugs before entering the market is well-implemented because reimbursement authorities request this. However, it is less common to evaluate the cost-effectiveness of new medical tests that enter the market while this also affects healthcare spending directly. Given the constraints on healthcare budgets, it is likely that clinicians evaluate the impact of new tests on their departmental budget and consider diagnostic uncertainty. To inform this clinical decision problem, we conducted an early cost-effectiveness analysis (early-CEA). In this analysis, the incremental costs of the new potential and current diagnostic test strategies are weighed against the gain in quality-adjusted life years (QALYs) and potential improved labour force participation . The main assumption is that early diagnosis results in a timely start of effective treatment that reduces disease activity and consequently postpones or prevents treatment with a biologic DMARD.
The aim of this study was twofold. The first objective was to develop an early-CEA model to evaluate the costs and health effects (in terms of QALYs) of new and current diagnostic test strategies from a societal perspective in patients with IA who are suspected of having RA. The second objective was to analyse the costs and health effects of new test strategies compared to the American College of Rheumatology/European League Against Rheumatism (ACR/EULAR) 2010 RA classification criteria (referred to as RA-2010 criteria).
We applied the framework following the general steps used in early-CEA of medical tests as developed by Buisman et al. . This framework is a useful guidance for researchers performing early-CEA of medical tests. Early-CEA evaluates medical tests in development by assessing how much these tests could improve health outcomes and healthcare efficiency. Moreover, early-CEA helps test developers to decide about further development of medical tests, set realistic performance-price goals and design and manage reimbursement strategies [14, 15].
Current diagnostic test strategy
IA patients at risk of having RA undergo a diagnostic workup to establish the presence of RA. This entails affected joint counts, blood testing, and radiographs ordered or established by the rheumatologist. If no other explanation for the symptoms is found (e.g., systemic lupus erythematosus (SLE), gout, or psoriatic arthritis) the patient is classified as having RA if at least 6 out of 10 points on the RA-2010 criteria are scored. Patients who score less than 6 but more than 2 points are regarded as patients at intermediate risk who do not fulfil the RA-2010 criteria. The RA-2010 criteria were the comparator in our early-CEA .
New diagnostic test strategies
The cost-effectiveness was assessed of four diagnostic tests that are currently being developed as part of the TRACER project : B-cell related gene expression , IL-6 serum level test , MRI of the hands and feet [19–24] and genetic assays with susceptibility SNPs for RA . For each of the tests (described subsequently), three different test strategies were modelled: add-on to the RA-2010 criteria for all patients with IA, add-on for intermediate-risk patients only and replacement of all blood tests and radiographs used to classify patients according to the RA-2010 criteria. For the add-on test strategies, the performance of the new test strategy was estimated by combining the sensitivity and specificity of the RA-2010 criteria and the new tests.
B-cell related gene expression
During the development of arthritis, B-cell RNA expression decreases over time . Although the exact mechanism is poorly understood, the marker is useful to predict early arthritis in patients with seropositive arthralgia . Currently no data are available for patients with IA. Therefore, we used the data from the seropositive arthralgia cohort studied by Baarsen et al. . After discussion with the developers of this test, sensitivity of 0.60 and specificity of 0.90 was used. The cost of the test was set at €150.
IL-6 serum level test
IL-6 is a cytokine that is present in inflammation. In a recent evaluation of IL-6 serum level test performance in detecting RA in patients with IA the sensitivity was 0.70 and specificity was 0.53 . We used these sensitivity and specificity values and assumed a cost of €50 per test.
MRI of hands and feet
MRI may reclassify patients to different joint domains of the RA-2010 criteria if there are more swollen joints than are identified on physical examination. MRI also provides additional information on bone marrow edema  and tenosynovitis [20, 21]. Based on the literature [22–24] and discussions with test developers, we set the sensitivity of MRI at 0.90 and the specificity at 0.60. The costs of MRI were assumed to be equal to the unit costs currently used by the Dutch Healthcare Authority, of €189 per MRI examination (€756 for four MRI scans (both hands and both feet)) .
Genetic assay with susceptibility SNPs for RA
RA is a complex disease involving several genes. Heritability for RA is estimated to be around 50–60 % . Expert review of the literature suggests that using genetic risk factors combined with current knowledge would result in sensitivity of 0.40 and specificity between 0.80 and 0.90 to identify patients with RA . We used these estimations and applied sensitivity of 0.40 and specificity of 0.85 with an estimated cost of €750 per test based on expert opinion from test developers.
In the current and new diagnostic test strategies, test-positive patients received methotrexate (MTX) at 25 mg/week orally . Due to the side effects of MTX, patients could switch to other synthetic DMARDs (e.g., Sulfasalazine, Leflunomide). After failure of two synthetic DMARDs, patients could switch to biologic DMARDs (i.e., TNF-inhibitors, IL6-inhibitors, B cell depletion, or T cell inhibition) .
RA patients who were additionally detected by the new diagnostic test strategies as compared to the RA-2010 criteria were assumed to be given early treatment. As a result, we modelled that they had an improvement of 0.2 in the disease activity score in 28 joints (DAS28) at 12 months as compared to patients in the current test strategy. The improvement of 0.2 in the DAS28 was based on sensitivity analysis in which we evaluated the effect of changing this value on the model results (see univariate sensitivity analysis below).
In RA, the diagnosis and subsequent prognosis are complex processes in which various tests and measures of disease activity influence treatment decisions and subsequently outcomes in terms of both costs and effects. As the diagnosis is often reconsidered in the first year of disease, especially in those initially not classified as having RA, we decided to model the first year as a decision tree with chance nodes at 6 and 12 months to classify patients as true positive (TP), false positive (FP), true negative (TN) and false negative (FN) during the first year. Patients were classified as TP if they had a positive test result (≥6 points on the RA-2010 criteria or positive on the new test) at baseline and at 12 months, used MTX or stopped MTX due to side effects. Moreover, the symptoms should not be explained by another classified diagnosis . Patients were considered as TN if they had a negative test result (<6 points on the RA-2010 criteria or negative on the new test) at baseline, did not use MTX at 12 months, and had symptoms explained by another classified diagnosis. Patients were considered FP if they scored ≥6 points on the RA-2010 criteria or were positive on the new test at baseline but did not use MTX at 12 months, and had symptoms explained by another classified diagnosis. Patients were classified as FN if they scored <6 points on the RA-2010 criteria or were negative on the new test at baseline but used MTX or stopped MTX due to side effects at 12 months, and had no symptoms explained by another classified diagnosis. Using a combination of initial RA-2010 criteria scores and the use of DMARDs not explained by any other disease is a common way of dealing with a disease for which no hallmark sign is available [6, 29].
The first year is followed by a four-year individual-level Markov model (i.e., patient-level state transition model) that simulates the change in disease activity (DAS28) over time in 3-month cycles. The cycle time is 3 months because patients are commonly seen by the rheumatologists every 3 months. This 5-year model was used to simulate what would happen if a proportion of FN patients in the current strategy were diagnosed earlier with lower levels of disease activity in the new test strategy. The time horizon was 5 years because the long-term effects of biological drug use are unknown. Patients were categorised into three disease states: remission (DAS28 ≤ 2.6), low disease activity (DAS28 > 2.6 to ≤3.2) and moderate and severe disease activity (DAS28 > 3.2).Footnote 1 This categorization by DAS28 score is common in the field of RA [30–32]. Resource use, costs and utilities were linked to these three categories. The patients who were classified as TP or FN at 12 months entered the patient-level state transition model. A proportion of patients with DAS28 > 3.2 were modelled to start a biologic DMARD in addition to MTX. They were assumed to stay on a biologic DMARD and could switch to another biologic DMARD for the remainder of the 4 years. The patients who were classified as TN or FP at 12 months entered a background model in which they stayed for the remaining 4 years, assuming no change in utilities, biologic DMARD costs for 10 % of FP patients in the first year after diagnosis and otherwise, no RA-related costs. Figure 1 shows the decision model comparing the current diagnostic test strategy with the new add-on diagnostic test strategy for intermediate-risk patients (described subsequently). This 5-year cost-effectiveness model is an extension of our 1-year model published elsewhere .
To populate the model, we mainly used data from three different sources. Additional file 1: Table S1 shows the characteristics of the three sources. First, data from the REACH cohort (usual care) were used to populate the 1-year decision tree with 552 patients with IA who were suspected of having RA (details about the cohort can be found in Additional file 2: Table S2). Patients had to have at least one joint clinically diagnosed as affected by synovitis that could not be classified as another inflammatory joint disease. The prevalence of RA was 54 % at 12 months based on the RA-2010 criteria and MTX use.
Second, data from the tREACH trial were used in our model for RA patients after 1-year follow-up . The tREACH trial includes patients aged 18 year-old or older with arthritis in at least one joint, and symptom duration less than 1 year. Patients were randomized into three initial treatment strategies: triple DMARD therapy (MTX, sulfasalazine and hydroxychloroquine) with intramuscular glucocorticoids, triple DMARD therapy with an oral glucocorticoid tapering scheme and MTX monotherapy with an oral glucocorticoid tapering scheme. See Claessen et al.  for a detailed description of the tREACH trial.
Third, summary data from the DREAM registry as published by Vermeer et al.  were used to inform on the start of biological drugs. This publication describes data from two cohorts, a treat-to-target cohort and a usual-care cohort of patients with clinical RA. The treat-to-target strategy used a standardized treatment step-up protocol . In contrast, the treatment switches were not performed by protocols in the usual care cohort.
Additional file 3: Table S3 gives an overview of all model input parameters, their estimates and distributions for probabilistic sensitivity analysis, and data sources.
Estimation of transition probabilities
During the first 12 months of our model, the probabilities of patients being TP, FN, TN and FP were elicited from the REACH cohort in which patients were classified according to the RA-2010 criteria, use of MTX and use of other synthetic DMARDs at baseline, 6 and 12 months.
At the start of the patient-level state transition model (i.e., at 12 months), the DAS28 of TP and FN patients at 12 months in the REACH cohort resulted in patients entering one of the three disease states. Patients who entered the DAS28 >3.2 state at the start or later in time could be eligible for starting biological drugs. To model this, summary data from the DREAM cohort on the start of biological drugs were used, in which the observed use of biologic DMARDs in clinical practice was 15 % at 24 months. We transformed this 15 % rate into a 3-month transition probability of 2 % to start biologic DMARDs in those patients with a DAS28 >3.2. This 2 % was distributed over the three disease states in a 1–3–6 distribution (state 1– state 2– state3) based on flare rates (DAS28 > 3.2) in the tREACH cohort.
Estimates of resource use and costs
We distinguished two cost categories: direct medical and productivity costs. Direct medical costs include costs of visits to rheumatologists and other health professionals (e.g., physical therapist), laboratory tests including diagnostic tests and those to monitor side effects, and medication use. Productivity costs represent the number of days that a patient with a paid job was absent from work in the past 3 months. Resource use and productivity losses per disease state were obtained from the REACH study in the first year  and from the tREACH study [34, 35] in the second and third year. The latter was extrapolated to 5 years. In the background model TN patients were assumed to incur no RA-related costs, while 10 % of FP patients incurred biologic DMARD costs in the first year after diagnosis due to misdiagnosis, for which the frequency was based on the REACH study.
The unit costs of visits and productivity losses were based on reference prices published in the Dutch Manual of Costing in economic evaluations . Diagnostic test costs were based on tariffs published by the Dutch Healthcare Authority , and medication costs were obtained from the National Health Care Institute . All costs were adjusted to €2014 using the general price index from the Dutch Central Bureau of Statistics . All cost parameters can be found in Additional file 3: Table S3.
Estimation of QALYs
When assessing the impact of a new test or treatment on quality of life over time, the health outcomes are usually measured in terms of quality-adjusted life years (QALYs). The QALY combines the number of life years with the level of health-related quality of life (i.e., utilities) in those years . The EuroQol 5-dimension 3-level questionnaire (EQ-5D-3L) was used to estimate utilities. The baseline utilities of TP, FP, TN, and FN were obtained from the REACH study and were 0.60, 0.65, 0.65 and 0.60, respectively. Based on the literature we assigned an improvement of 0.10 over the first year to the TPs [41–45]. Based on the REACH study we assigned an improvement of 0.05 and 0.10 over the first year to the FPs and TNs, respectively. Based on the placebo group in the STIVEA trial, we assigned a 0.05 reduction over the first year for FNs, assuming that FN patients would receive little therapy .
In the patient-level state transition model, patients were assigned EQ-5D values based on their DAS28 every 3 months, stratified for the start of biologic DMARDs. As observed in the tREACH study, the EQ-5D values for patients not using biologic DMARDs were higher. Furthermore, the EQ-5D values were not normally distributed. About 25 % of patients in the tREACH study had a decrease in EQ-5D at least once in 3 years, which led to a utility score lower than 0.50. Therefore, different distributions of utility values were estimated. One distribution was estimated for patients with at least one EQ-5D decrease below 0.50 over time and another distribution was estimated for patients who always had an EQ-5D higher than 0.50 over time. In the background model, patients were assumed to have an EQ-5D value of 0.75 that remained constant over time.
We performed a base-case analysis with four diagnostic tests that were used in three diagnostic test strategies as described above. We calculated the incremental costs per QALY gained in each new test strategy compared with the current test strategy (i.e., incremental cost-effectiveness ratio (ICER)). Probabilistic sensitivity analyses were performed in which incremental costs and QALYs were calculated as the mean of 1,000 Monte Carlo simulations, where each simulation samples simultaneously from the appropriate distributions of the input parameters (see Additional file 3: Table S3 for the distributions). Cost-effectiveness planes and acceptability curves were constructed from the Monte Carlo simulation. In addition, we used the headroom (i.e., potential profit) method to assess the maximum additional cost for which each new diagnostic test was still likely to be cost-effective at a willingness-to-pay threshold of €20,000 per QALY gained [46, 47].
Furthermore, we explored the impact of our model parameters in univariate sensitivity analyses, varying the sensitivity, specificity, new test costs, improvement in the DAS28 in TP patients in the new test strategy, who were FN in the current test strategy, and costs of biologic DMARDs for FP patients in the first year after diagnosis. The range over which the model parameters were varied are shown between brackets in Fig. 4. We report these analyses for an add-on test for intermediate-risk patients. For each analysis, one model parameter was altered while the other parameters were held constant at the baseline value. In our analyses, differential discounting was applied in accordance with the Dutch guidelines for economic evaluation research, with an annual discount rate of 4.0 % for all costs and 1.5 % for health effects .
The model structure and input parameters were checked for clinical correctness by rheumatologists. We also verified the model for coding and logical correctness by running extreme value scenarios. Furthermore, an independent modeller internally validated our model to check the model structure, all input parameters with distributions and the visual basic code used to programme the model in Excel.
Ethical approval and patient consent
No ethical approval and consent from patients was needed for this study.
In the add-on test strategy, only intermediate-risk patients could be reclassified as high risk or low risk of having RA, and low-risk patients could be reclassified as high-risk patients. Table 1 shows the reclassification for both add-on strategies. Due to the high specificity of the B-cell test and genetic assay, more patients were reclassified as low risk, while due to the high sensitivity of IL-6 and MRI, more patients were reclassified as high risk.
Table 2 shows the results of the RA-2010 criteria and the four new tests used as add-on for all patients, add-on for intermediate-risk patients, and replacement of the RA-2010 criteria.
The B-cell test was the least expensive strategy when used as an add-on in intermediate-risk patients and as a replacement test, making it the dominant strategy, as it has better health outcomes and lower costs. The B-cell test was also the most cost-effective test strategy as an add-on in all patients. When using a willingness-to-pay threshold of €20,000 per QALY gained, the IL-6 and MRI test strategies were not cost-effective, except in the replacement strategy. A genetic assay was not cost-effective in any strategy. When comparing the test strategies, replacement of the RA-2010 criteria was the most cost-effective test strategy, followed by the add-on test in intermediate-risk patients, and the least favourable was the add-on test in all patients.
Table 2 shows the maximum additional cost for which each new test was likely to be cost-effective at a willingness-to-pay threshold of €20,000 per QALY gained (i.e., the headroom). Given the sensitivity and specificity of the different tests, an IL-6 test will only be cost-effective with a unit cost below €59 in the replacement test strategy. The headroom of a genetic assay test (€210 as the add-on in all patients, €173 as the add-on in intermediate-risk patients, and €543 as the replacement) also shows that the current unit costs of this test were too high, because the headroom was lower than the current unit costs of this test (€750).
Probabilistic sensitivity analysis
Figure 2 shows the cost-effectiveness planes with the average costs and effects and the uncertainty around this average for the add-on test strategy in intermediate-risk patients. All estimates lie within the northeast or southeast quadrants, meaning improved health outcomes. In the northeast quadrant, this is accompanied by higher costs as seen for IL-6, MRI, and gene assay. Nearly 100 % of these estimates (99.6 %, 99.7 %, and 99.8 %, respectively) lie within this quadrant. For the B cell test, part of the estimate (57.2 %) lies within the southeast quadrant depicting lower costs and improved health outcomes. Uncertainty for the MRI and IL-6 test were greater than for the B cell test and gene assay as shown by the width and height of the cloud, which is a consequence of the low specificity of 0.60 (MRI) and 0.53 (IL-6).
Figure 3 shows the cost-effectiveness acceptability curves for the add-on test strategy for intermediate-risk patients. If a willingness-to-pay threshold of €20,000 per QALY gained is used, there is a probability of cost-effectiveness for the B-cell test of 100 %.
Univariate sensitivity analysis
The sensitivity of our results was evaluated by varying single model parameters and comparing the ICERs to the base-case ICERs of the most cost-effective test (B cell) and the most effective test (MRI) when used as the add-on in intermediate-risk patients. The base-case ICERs were estimated using an improvement of 0.2 in the DAS28 at 12 months in the FN patients, 10 % of FP patients with biologic DMARD costs in the first year after diagnosis and the test-specific sensitivity, specificity and costs.
For a B-cell test, the change in specificity had the largest impact (see Fig. 4), followed by improvement in the DAS28 in TP patients in the new test strategy, who were FN in the current test strategy. No improvement in the DAS28 resulted in an ICER of about €16,000, while an improvement of 0.6 in the DAS28 resulted in a dominant new test strategy (i.e., better health outcomes and lower costs). Varying the new test costs, the number of FP patients having biological DMARD costs between 0 % and 20 %, and sensitivity had less impact on the ICER, but still caused variation.
For MRI, the change in improvement in the DAS28 in TP patients in the new test strategy, who were FN in the current test strategy, had the largest impact (see Fig. 4). No improvement in DAS28 resulted in an ICER of about €56,000, while an improvement of 0.6 in the DAS28 resulted in an ICER of about €3000. Changing sensitivity to 0.50 or 1.00 had about the same impact as varying the number between 0 % and 20 % of FP patients having biological DMARD costs. Adjustments in specificity and costs had the least impact on the ICER.
Various new medical technologies enable the detection of RA at an increasingly earlier stage. RA is a disease in which early detection has high potential because there are treatments available that effectively reduce disease progression, especially if introduced early in the disease. In our study, we compared four different tests and we found that a B-cell test was the most cost-effective test in all test strategies. This is mainly due to the high specificity (0.90) in combination with moderate sensitivity (0.60), and relatively low test costs (€150). When the specificity of the test increases, the number of FP patients who may receive unnecessary expensive treatment is reduced, which largely increases the likelihood that the test becomes cost-effective. MRI was the second most cost-effective test as an add-on in intermediate-risk patients. Mainly as a result of the higher costs of MRI (€756) compared to a B-cell test (€150), MRI was less cost-effective, even though it had much higher sensitivity than the B-cell test (0.90 versus 0.60). However, the MRI had lower specificity, which had more impact on the ICER than lower sensitivity. The specificity is more important because an add-on test with a specificity below 100 % results in more patients classified as FP, while these patients were TN according to the current test strategy.
This study indicated where it is most likely to position a new diagnostic test given the sensitivity, specificity and costs in relation to the current RA-2010 criteria test strategy. B-cell gene expression as an add-on test in intermediate-risk patients was the most likely cost-effective add-on strategy and could even replace the RA-2010 criteria, given its moderate sensitivity (0.60), high specificity (0.90) and low costs (€150). However, it is unlikely that a replacement test would be discovered that provides the rheumatologists with the same richness of information as the current RA-2010 criteria. To use a new test in addition to the RA-2010 criteria in all patients is the least cost-effective, because it would not alter treatment for the high-risk patients (243 of 552 patients). These patients will have higher test costs without additional health gain.
The discussion about the potential benefits of early detection and treatment of RA is also relevant in the light of the current discussion about biosimilar drugs. Currently, the price of biologic DMARDs is set at €14,000 per patient per year. Due to this high price (which is about the same for all biologic DMARDs), it is an important driver of the cost-effectiveness results. If the price of a biosimilar will be significantly lower, it will have a smaller impact on the results. On the one hand, this could result in starting biological drugs earlier, resulting in more TP patients using biologic DMARDs. On the other hand, a cheaper biosimilar may worsen the cost-effectiveness of new tests because the savings from postponing or preventing treatment with a biologic drug are smaller. The net result is hard to predict, as prescription behaviour may change with lower costs of biological drugs.
The results of our study are likely to be generalizable to other countries and different healthcare systems because the RA-2010 criteria are widely used internationally for classifying RA. The costs of diagnostic tests and treatment patterns obviously differ between countries and healthcare systems (e.g., higher costs in the USA). However, this would not result in differences in the most cost-effective test strategy.
The results of our simulation should be interpreted taking into account that we used fixed values for the sensitivity, specificity and costs of the alternative tests. This was done to show the differences in early cost-effectiveness between tests with different test characteristics. If we had added distributions around these parameters, the differences in cost-effectiveness between the tests would likely decrease due to large uncertainties of these parameters in early-CEA. To overcome this, we performed univariate sensitivity analysis to explore the impact of changing these parameters. We found that multiple factors had an impact on the cost-effectiveness of a new test strategy. The main drivers of the ICER include the sensitivity, specificity and costs of the new test, but also the improvement in the DAS28 for TP patients in the new test strategy, who were FN according to the RA-2010 criteria, and the percentage of FP patients with biological DMARD costs in the first year after diagnosis.
In addition, as RA is a heterogeneous disease, a new test might provide additional diagnostic information in particular subgroups, such as obese patients or those with coexistent osteoarthritis. Furthermore, the choice of a new test might also be guided by additional diagnostic information that is expected in particular subgroups. Further research should investigate the impact of this additional diagnostic information on the cost-effectiveness of these new tests.
Like any early-CEA study, our study had limitations. One was that limited data were available on long-term disease progression in a cohort receiving usual care. Therefore, we had to synthesize data from different data sources and to make assumptions about the new test strategies based on expert opinion. Typically, the improvement in health outcomes of adding a new test without observed data is difficult to estimate. In the first year after diagnosis, we assigned equal improvements in utilities for TP and TN patients, less gain for FP patients, and reduced utilities for FP patients. Whether this reflects clinical practice needs to be proven. Another limitation could be that a death state was not included in our model. However, because of the low (1–2 %) mortality risk observed in patients with RA during 5 years of follow up , and the equal mortality risk with the current and new diagnostic test strategy, the mortality risk would not influence the model results.
We have shown that the B-cell gene expression as an add-on test for intermediate-risk patients was the most likely cost-effective add-on strategy. A new add-on test for intermediate-risk patients should have high specificity and the costs should not be higher than €200–€300.
Moderate and severe DAS28 were categorised in one disease state. As treatment choice is based on DAS28 >3.2, our categorisation does not affect the model results.
disease activity score DAS28, disease activity score in 28 joints
disease-modifying antirheumatic drug
early cost-effectiveness analysis
American College of Rheumatology
EuroQol five dimension 3-level questionnaire
European League Against Rheumatism
incremental cost-effectiveness ratio
magnetic resonance imaging
quality-adjusted life year
- RA-2010 criteria:
American College of Rheumatology/European League Against Rheumatism (ACR/EULAR) 2010 rheumatoid arthritis classification criteria
susceptibility single nucleotide polymorphism
tumour necrosis factor
Pincus T, Callahan LF, Sale WG, et al. Severe functional declines, work disability, and increased mortality in seventy-five rheumatoid arthritis patients studied over nine years. Arthritis Rheum. 1984;27(8):864–72.
Mitchell DM, Spitz PW, Young DY, et al. Survival, prognosis, and causes of death in rheumatoid arthritis. Arthritis Rheum. 1986;29(6):706–14.
Scott DL, Symmons DP, Coulton BL, et al. Long-term outcome of treating rheumatoid arthritis: results after 20 years. Lancet. 1987;1(8542):1108–11.
Isomäki H. Long-term outcome of rheumatoid arthritis. Scand J Rheumatol Suppl. 1992;95:3–8.
Wolfe F. The natural history of rheumatoid arthritis. J Rheumatol Suppl. 1996;44:13–22.
Aletaha D, Neogi T, Silman AJ, et al. Rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative. Ann Rheum Dis. 2010;69(9):1580–8.
Bijlsma JW, Weinblatt ME. Optimal use of methotrexate: the advantages of tight control. Ann Rheum Dis. 2007;66(11):1409–10.
Jacobs JW. Optimal use of non-biologic therapy in the treatment of rheumatoid arthritis. Rheumatology. 2012;51(4):iv3–8.
Chen YF, Jobanputra P, Barton P, et al. A systematic review of the effectiveness of adalimumab, etanercept and infliximab for the treatment of rheumatoid arthritis in adults and an economic evaluation of their cost-effectiveness. Health Technol Assess. 2006;10(42):iii–iv. xi-xiii, 1-229.
Donahue KE, Garthehner G, Jonas DE, et al. Comparative Effectiveness of Drug Therapy for Rheumatoid Arthritis and Psoriatic Arthritis in Adults. Rockville: AHRQ Comparative Effectiveness Reviews; 2007.
Singh JA, Christensen R, Wells GA, et al. A network meta-analysis of randomized controlled trials of biologics for rheumatoid arthritis: a Cochrane overview. CMAJ. 2009;181(11):787–96.
Hazes JM, Luime JJ. The epidemiology of early inflammatory arthritis. Nat Rev Rheumatol. 2011;7(7):381–90.
Drummond MF, Sculpher MJ, Torrance GW, et al. Methods for the Economic Evaluation of Health Care Programmes. 3rd ed. Oxford: Oxford University Press; 2005.
Buisman LR, Rutten-van Mölken MPMH, Postmus D, et al. The early bird catches the worm: early cost-effectiveness analysis of new medical tests. Int J Technol Assess Health Care. 2016;32(1):1–8. doi:10.1017/S0266462316000064.
IJzerman MJ, Steuten LM. Early assessment of medical technologies to inform product development and market access: a review of methods and applications. Appl Health Econ Health Policy. 2011;9(5):331–47.
The Center for Translational Molecular Medicine. Full article on project TRACER. http://www.ctmm.nl/en/downloads/full-article-downloads-pdf/full-article-on-project-tracer. Accessed 27 Jan 2015.
Van Baarsen LG, Bos WH, Rustenburg F, et al. Gene expression profiling in autoantibody-positive patients with arthralgia predicts development of arthritis. Arthritis Rheum. 2010;62(3):694–704.
Gottenberg JE, Dayer JM, Lukas C, et al. Serum IL-6 and IL-21 are associated with markers of B cell activation and structural progression in early rheumatoid arthritis: results from the ESPOIR cohort. Ann Rheum Dis. 2012;71(7):1243–8.
Duer-Jensen A, Hørslev-Petersen K, Hetland ML, et al. Bone edema on magnetic resonance imaging is an independent predictor of rheumatoid arthritis development in patients with early undifferentiated arthritis. Arthritis Rheum. 2011;63(8):2192–202.
Bird P, Conaghan P, Ejbjerg B, et al. The development of the EULAR-OMERACT rheumatoid arthritis MRI reference image atlas. Ann Rheum Dis. 2005;64(1):i8–10.
Nieuwenhuis WP, Krabben A, Stomp W, et al. Evaluation of magnetic resonance imaging-detected tenosynovitis in the hand and wrist in early arthritis. Arthritis Rheumatol. 2015;67(4):869–76.
Sugimoto H, Takeda A, Hyodoh K. Early-stage rheumatoid arthritis: prospective study of the effectiveness of MR imaging for diagnosis. Radiology. 2000;216(2):569–75.
Narváez J, Sirvent E, Narváez JA, et al. Usefulness of magnetic resonance imaging of the hand versus anticyclic citrullinated peptide antibody testing to confirm the diagnosis of clinically suspected early rheumatoid arthritis in the absence of rheumatoid factor and radiographic erosions. Semin Arthritis Rheum. 2008;38(2):101–9.
Machado PM, Koevoets R, Bombardier C, van der Heijde DM. The value of magnetic resonance imaging and ultrasound in undifferentiated arthritis: a systematic review. J Rheumatol Suppl. 2011;87:31–7.
Yarwood A, Huizinga TW, Worthington J. The genetics of rheumatoid arthritis: risk and protection in different stages of the evolution of RA. Rheumatology (Oxford). 2016;55(2):199–209.
Dutch Healthcare Authority (Nederlandse Zorgauthoriteit, NZa). NZa Zorgproducten Tariefapplicatie. http://dbc-zorgproducten-tarieven.nza.nl/nzaZpTarief/ZoekfunctieDbc.aspx. Accessed 1 May 2013.
Smolen JS, Breedveld FC, Burmester GR, et al. Treating rheumatoid arthritis to target: 2014 update of the recommendations of an international task force. Ann Rheum Dis. 2015. doi:10.1136/annrheumdis-2015-207524.
Dutch Institute for Healthcare Improvement CBO. Guideline diagnostics and treatment for rheumatoid arthritis (Richtlijn Diagnostiek en behandeling van reumatoïde artritis). Utrecht: Dutch Institute for Healthcare Improvement CBO; 2008.
Alves C, Luime JJ, Van Zeben D, et al. Diagnostic performance of the ACR/EULAR 2010 criteria for rheumatoid arthritis and two diagnostic algorithms in an early arthritis clinic (REACH). Ann Rheum Dis. 2011;70(9):1645–7.
Van Gestel AM, Haagsma CJ, Van Riel PL. Validation of rheumatoid arthritis improvement criteria that include simplified joint counts. Arthritis Rheum. 1998;41(10):1845–50.
Fransen J, Creemers MC, Van Riel PL. Remission in rheumatoid arthritis: agreement of the disease activity score (DAS28) with the ARA preliminary remission criteria. Rheumatology. 2004;43(10):1252–5.
Prevoo ML, v 't Hof MA, Kuper HH, et al. Modified disease activity scores that include twenty-eight-joint counts. Development and validation in a prospective longitudinal study of patients with rheumatoid arthritis. Arthritis Rheum. 1995;38(1):44–8.
Luime JJ, Buisman LR, Oppe M, et al. A cost-effectiveness model for evaluating new diagnostic tests in the work-up of patients with inflammatory arthritis at risk of having Rheumatoid Arthritis. Arthritis Care Res (Hoboken). 2015. doi:10.1002/acr.22776.
De Jong PH, Hazes JM, Han HK, et al. Randomised comparison of initial triple DMARD therapy with methotrexate monotherapy in combination with low-dose glucocorticoid bridging therapy; 1-year data of the tREACH trial. Ann Rheum Dis. 2014;73(7):1331–9.
Claessen SJ, Hazes JM, Huisman MA, et al. Use of risk stratification to target therapies in patients with recent onset arthritis; design of a prospective randomized multicenter controlled trial. BMC Musculoskelet Disord. 2009;10:71.
Vermeer M, Kievit W, Kuper HH, et al. Treating to the target of remission in early rheumatoid arthritis is cost-effective: results of the DREAM registry. BMC Musculoskelet Disord. 2013;14:350.
Tan SS, Bouwmans CA, Rutten FF, et al. Update of the Dutch Manual for Costing in Economic Evaluations. Int J Technol Assess Health Care. 2012;28(2):152–8.
National Health Care Institute. http://www.medicijnkosten.nl/default.asp. Accessed 2 Feb 2015.
Statistics Netherlands. http://statline.cbs.nl. Accessed 2 Feb 2015.
Pliskin JS, Shepard D, Weinstein MC. Utility functions for life years and health status. Operations Res. 1980;28(1):206–24.
Buitinga L, Braakman-Jansen LM, Taal E, et al. Comparative responsiveness of the EuroQol-5D and Short Form 6D to improvement in patients with rheumatoid arthritis treated with tumor necrosis factor blockers: results of the Dutch Rheumatoid Arthritis Monitoring registry. Arthritis Care Res. 2012;64(6):826–32.
Lillegraven S, Kristiansen IS, Kvien TK. Comparison of utility measures and their relationship with other health status measures in 1041 patients with rheumatoid arthritis. Ann Rheum Dis. 2010;69(10):1762–7.
Linde L, Sørensen J, Østergaard M, et al. Gain in quality-adjusted life-years in patients with rheumatoid arthritis during 1 year of biological therapy: a prospective study in clinical practice. J Rheumatol. 2013;40(9):1479–86.
Sørensen J, Linde L, Ostergaard M, et al. Quality-adjusted life expectancies in patients with rheumatoid arthritis--comparison of index scores from EQ-5D, 15D, and SF-6D. Value Health. 2012;15(2):334–9.
Verstappen SM, McCoy MJ, Roberts C, et al. Beneficial effects of a 3-week course of intramuscular glucocorticoid injections in patients with very early inflammatory polyarthritis: results of the STIVEA trial. Ann Rheum Dis. 2010;69(3):503–9.
Girling A, Young T, Brown C, et al. Early-stage valuation of medical devices: the role of developmental uncertainty. Value Health. 2010;13(5):585–91.
Cosh E, Girling A, Lilford R, et al. Investing in new medical technologies: a decision framework. J Commerc Biotechnol. 2007;13(4):263–71.
College voor zorgverzekeringen (CVZ). Guidelines for pharmacoeconomic research, updated version. Diemen: College voor zorgverzekeringen; 2006.
Toledano E, Candelas G, Rosales Z, et al. A meta-analysis of mortality in rheumatic diseases. Reumatol Clin. 2012;8(6):334–41.
This research was performed within the framework of the Center for Translational Molecular Medicine (CTMM) (www.ctmm.nl) and the Dutch Arthritis Foundation, project TRACER (grant 04I-202). We are grateful to Nasuh Büyükkaramikli for performing an internal validity assessment of the model.
The authors declare that they have no competing interests.
LRB drafted and revised the manuscript for content, including medical writing of the content, contributed to the design of the study, analysed and interpreted the data, developed the early cost-effectiveness model, performed the statistical analyses and handled the study coordination. JJL revised the manuscript for content, including medical writing of the content, contributed to the design of the study, analysed and interpreted the data, performed the statistical analyses and handled the study coordination. MO revised the manuscript for content, including medical writing of the content, advised in the development of the early cost-effectiveness model and interpreted the data. JMWH revised the manuscript for content, including medical writing of the content, acquired and interpreted the data and obtained study funding. MPMHR revised the manuscript for content, including medical writing of the content, contributed to the design of the study, interpreted the data, handled the study supervision and obtained study funding. All authors gave final approval for publication of this manuscript.