Systemic sclerosis-associated pulmonary hypertension: why disease-specific composite endpoints are needed

Pulmonary arterial hypertension (PAH) is a serious complication of systemic sclerosis (SSc). In clinical trials PAH-SSc has been grouped with other forms, including idiopathic PAH. The primary endpoint for most pivotal studies was improvement in exercise capacity. However, composite clinical endpoints that better reflect long-term outcome may be more meaningful. We discuss potential endpoints and consider why the same measures may not be appropriate for both idiopathic PAH and PAH-SSc due to inherent differences in clinical outcome and management strategies of these two forms of PAH. Failure to take this into account may compromise progress in managing PAH in SSc.

patients with PAH related to SSc (PAH-SSc) show blunted responses to therapy when compared with those with idiopathic PAH, including key measures of outcome such as the six-minute walk test, time to clinical worsening and survival [3]. Th e very presence of SSc provides an enriched population at high risk of PAH and should off er the opportunity for early diagnosis, yet registry and centre-based data reveal no improvement in referral intervals. Part of this may refl ect an increased understanding of the lack of sensitivity and specifi city of echocardiography, particularly at the lower end of pulmonary pressures but also because of confounding issues posed by concomitant interstitial lung disease. Finally, patients with SSc tend to be under-represented in modern trials, which are typically rather short in duration (12 to 18 weeks), result ing in inadequacy of data to support defi nitive recom mendations [4].
With this background, there has recently been a systematic eff ort to improve the assessment of PAH occurring in association with SSc. Th e main drivers for this have included a desire for better validated endpoints that could be used as a core set applied to clinical trials, the wish for a clinically meaningful endpoint that would refl ect practice, and the need for less invasive longitudinal assessment tools that might replace right heart catheterisation (RHC) as the perceived gold standard test for PAH. At present, RHC is essential for diagnosis but there are questions about the feasibility of this as a tool to follow patients clinically over time. Th ere is clear need for a non-invasive endpoint as well as for validation and critical analysis of the eff ectiveness of screening modalities. Th ere are particular challenges in addressing this for PAH-SSc, a condition that requires multidisciplinary care and that may present and be followed up by a number of diff erent subspecialists. Each will be an expert in their own fi eld and be familiar with managing and interpreting certain investigations but there will be diff erences of opinion between the diff erent experts as to what the best tests are and how they should be interpreted and used in practice. In addition, the tools that may be validated in other forms of PAH are unlikely to have been formally assessed in PAH-SSc, which has its own unique characteristics that may aff ect standard PAH outcome measures; an example would include the asso ciated musculoskeletal manifestations. Moreover, there may be diff erences for other forms of connective tissue diseaseassociated PAH. Th ere are biases introduced based upon diff erent clinical experiences and also related to the familiarity with clinical trials.
An approach that has been used successfully in rheumatology and has a clearly defi ned framework is covered by the Outcome Measures in Rheuma tology (OMERACT) methodology. Th is uses a standardised frame work to assess potential disease measures for clinical trials to consider utility under the subcategories of the OMERACT fi lter [5]. Th e Expert Panel on Outcomes measures in PAH related to Systemic Sclerosis (EPOSS)-OMERACT group was established to begin to apply this approach to evaluating PAH-SSc. Th is group has integrated expertise in cardiology, pulmonary medicine, rheumatology and biostatistics as well as clinical trial design and outcome development and validation. It has applied the OMERACT fi lter to individual tools that could represent endpoints in trials and has critically reviewed the published literature to explore the extent to which outcomes have been validated. In addition, it has sought to develop consensus about individual outcomes. In particular, the EPOSS group has identifi ed through a Delphi process a series of recommended domains and their assessment tools [6]. Th e data that could validate these tools have been considered systematically and this has led to a series of important observations. One of the goals was to identify a measure or series of measures that could replace RHC as the gold standard of assessment. Th ese are signifi cant achievements and have resulted in a series of relevant publications. So far, six substantive papers [6][7][8][9][10][11] have been published as a direct result of the EPOSS initiative, and more are expected.
Work is now underway and there will be attempts to validate the individual components and to review the available data that provide some validation. Th is is a daunting task as for many tools there are not suffi cient results from research studies to undertake this. First attempts to evaluate the routine clinical tool of Dopplerechocardiography are testament to the challenge that lies ahead [7]. An important output of this exercise has been the defi nition of research agenda to prioritise eff ort in addressing the data that are available and determine the extent to which the available information from national (such as the French Intinair project [12], UK single centre registries and compERA-XL [13]) or from clinical trial datasets might be interrogated. One limitation of most clinical trials is that they usually include a minority, typically around 20%, of cases with connective tissue disease-associated PAH, and fewer with PAH-SSc. Th e EPOSS group provides a template for the type of international multidisciplinary approach that could tackle these important challenges.
In the meantime the clinical arena has moved on and a large number of major clinical trials in PAH include cases of PAH-SSc. Th ere has been the strong suggestion that a composite endpoint that refl ects clinical practice be used. Th is has become defi ned as the time to clinical worsening (TTCW). A formal measure of TTCW has emerged as an attractive composite endpoint that measures progression in PAH. It was originally included as a secondary endpoint in several major clinical trials that led to licensing of PAH therapies based upon a primary endpoint of change in exercise capacity (the six-minute walk test). At face value it makes obvious sense, especially as a clinically meaningful endpoint that may be used in licensing and in post-licensing evaluation of therapies. However, the devil is in the detail. Diff erent studies have used diff erent components in the TTCW defi nition and there are major potential local diff erences in practice that may make a measure unworkable or unreliable in diff erent centres. Th us, some centres have an outpatient ambulatory emphasis whereas others may often hospitalise cases of PAH. In addition, availability of therapies and expertise in procedures such as surgical intervention or transplantation may be relevant. More over, as discussed above, it is likely that diff erent standards may be applicable for PAH-SSc versus idiopathic PAH due to comorbidity and potential diff erences between outcome and progression of PAH and suitability for treatments. In the short term, TTCW is very likely to be adopted as a useful measure and one that is especially relevant in early stage disease where stabilisation can be a very appropriate management goal. However, this should serve as an impetus to further research to validate and understand individual compo nents of TTCW and develop new and potentially better composite tools. In particular, there are self-evident reasons why some of the components of TTCW are likely to be unreliable or incomparable in PAH-SSc. Musculo skeletal involvement and comorbidity, such as lung fi brosis or cardiac complications, are clearly likely to aff ect exercise capacity. One study suggests that musculo skeletal deconditioning is the major determinant of six-minute walk test distance [14], and no relationship with parameters of lung function has been shown [15]. Th ere are multiple causes of diseaserelated mortality in SSc -for example, renal crisis, lung fi brosis and gut disease -and so mortality cannot be taken as a surrogate for PAH outcome. Finally, comorbidity and age make PAH-SSc cases much less likely than idiopathic PAH to be referred for transplantation and even less likely to be transplanted. Th e impact of these individual component diff erences may be cancelled out in a composite score but this cannot be assumed. Th is should be considered especially in subtypes of PAH and so the research work now emerging from the EPOSS initiative is likely to be very relevant. In the meantime, the recent report from the Fourth World Congress in PH in Dana Point, USA, represents current best expert consensus on how to standardise and use TTCW as an outcome measure in PAH clinical trials [16]. It seems likely that TTCW will be a benchmark in future studies. It has already replaced exercise capacity assessed by the six-minute walk test distance, generally now there is a move towards a robust hemodynamic endpoint or the composite clinical measure.
So where does that leave the EPOSS initiative? Th e work could be regarded as done and the TTCW be adopted as a gold standard. RHC would remain for diagnosis but would only be performed later as directed clinically. But this would not be a correct approach for PAH-SSc. Table 1 highlights some specifi c aspects that would be relevant to the domains and measures that have been identifi ed through the EPOSS initiative and that may be incorporated into composite endpoints, including TTCW. Th ere is a strong need for systematic validation and much to be learnt from the defi ned research agendas along the way. Th e two concepts must co-exist but cannot do so without interplay so that both may inform the other. It can be argued that rigorous concerns about validity of endpoints could have severely impeded progress and treatment opportunities but the challenge must now be faced so that there is a real consensus that can be applied and eventuality the clinical needs of patients and the methodological needs of trialists and the exacting standards of the regulatory authorities that license new agents can all be met. For the time being TTCW is probably the most usable endpoint for clinical studies but the component terms need better standardisation and must be clearly defi ned. In the future, through initiatives such as EPOSS, these components can be validated in PAH-SSc and it is imperative that the challenge of this task is not used as justifi cation for not addressing these important questions. Global state by Survival SSc-PAH currently has a substantially worse mortality than iPAH physician a Domains identifi ed through the EPOSS initiative may be integrated into the TTCW composite end-point and whether these may be diff erent for idiopathic pulmonary arterial hypertension and pulmonary arterial hypertension associated with systemic sclerosis (PAH-SSc). 6MWD, 6-minute walking distance; HAQ DI = Health Assessment Questionnaire Disability Index; iPAH, idiopathic pulmonary arterial hypertension; PAH, pulmonary arterial hypertension; SF-36, Short Form 36 score; SSc, systemic sclerosis; VAS, visual analog scale.