Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Mar 28.
Published in final edited form as: Osteoarthr Imaging. 2024 Dec 25;5(1):100256. doi: 10.1016/j.ostima.2024.100256

Which Endpoints Should be Applied in Interventional Trials? –

From Single Uni-dimensional Assessment Tailored to a Drug’s Mechanism of Action to Multi-Component Measures and Multi-Domain Composites

Felix Eckstein 1,2,3, Tanja Stamm 2,4, Jamie Collins 5
PMCID: PMC11951169  NIHMSID: NIHMS2046140  PMID: 40160440

Abstract

Objective:

A vast array of structural/imaging and clinical endpoints/outcomes are available today to osteoarthritis epidemiologists or trialists. Which assessments are best suited for which studies remains unsettled. When several assessments are available, these may be analyzed together (simultaneously or hierarchically), using statistical modeling and adjustment. Or, alternatively, they may be combined to form more complex multi-component or composite (potentially multi-domain) endpoints/outcomes. This review describes such concepts and their challenges, using examples from current osteoarthritis imaging research.

Design:

A narrative, non-systematic literature search (PubMed and others) was conducted, and informal consultations were held with experts in the field. The identified concepts and experimental findings were then organized to present an integrated framework.

Results:

Single imaging assessments can encompass one (uni-dimensional) or more (multi-dimensional) structures. Integration of image assessments of one structure/tissue across anatomical locations provides aggregate measures. This can also be created across heterogeneous (multi-dimensional) types of assessments (multi-component/composite), either within an area (such as imaging - single domain) or across broader areas of health and well-being (multi-domain). Weighting, standardization, and (clinical) usefulness are crucial characteristics of multi-component/composite endpoints. Examples of these concepts are here provided in the context of osteoarthritis imaging.

Conclusions:

Options for multi-component/composite endpoints in osteoarthritis research are virtually infinite. Smart research strategies are required to explore and validate these vast possibilities, with appropriate statistical treatment being paramount. A one-size/endpoint-fits-all approach will likely fail in observational and interventional studies. Imaging assessment needs to be tailored to both the drug’s unique mechanism of action, and to the participants’ morpho-type.

Keywords: Endpoint, Osteoarthritis, Image Assessment, Composite Measure, Multi Component Measure, Mechanism of Action (MOA)

INTRODUCTION

With the advent of more advanced technology for imaging acquisition and analysis in osteoarthritis (OA) research several decades ago[1,2], the number and types of structural assessments available to OA experimentalists, epidemiologists and trialists has risen dramatically. Today, the number by far exceeds that of the well-established clinical measures (of pain, function, and joint-related quality of life). These days researchers can thus select from an overwhelming, but sometimes also confusing, wealth of imaging assessments to characterize joint morpho-types[3], and the impact of interventions on structural pathology. Yet, it is a matter of ongoing debate which (or which combination) of image assessments should preferably be used in various experimental, observational or interventional studies. If more than one type of assessment is desired as study endpoint, these may be looked at together, simultaneously or hierarchically, yet remaining separate entities[4]. In this case, an a-priori defined hierarchical statistical analysis plan, specifying the order of statistical testing (primary and secondary endpoints) as well as adjustment for multiple comparisons, is necessary[4]. Alternatively, multi-component or composite endpoints may be used[5], where these assessments are merged to form novel measures, potentially spanning across broader areas of health (domains).

The current review bases on an invited presentation delivered at the 18th International Workshop of OsteoArthritis Imaging (IWOAI) in June 2024. It aims to provide a summary and framework on single vs. multi-component /composite imaging assessment. We introduce terminology and present theoretical concepts and challenges of this research area, giving examples from OA imaging science.

METHODS

A non-systematic literature search (PubMed) was conducted, starting with a list of terms from the title of the presentation at the IWOAI, and search terms from the OA imaging field. These terms included (non-exclusive list): “imaging”, “quantitative”, “joint”, “whole organ”, “joint tissue”, “joint structure” (cartilage, meniscus, bone, bone marrow, synovium, etc.), “assessment”, “score”, “measure”, “endpoint”, “outcome”, “intervention”, “trial”, “DMOAD”, “uni-dimensional”, “multi-dimensional”, “multi-component”, “composite”, “domain”, “multi-domain”, “drug”, “mechanism of action”, “phenotype”, “morpho-type” and others. Additional conceptional information was obtained using Google and Chat GPT 4.o. Finally, to receive broad and detailed input, informal (non-structured) consultations were held with experts in various research areas, before and during the 18th IWOAI (June 2024; Marrakech). The identified terminology, experimental findings, concepts, and recommendations were then organized into a systematic framework, suggesting a classification of terms, and providing examples from current OA imaging science.

RESULTS

1). Definition of terms

Evaluation of images (“assessment”) in medicine is commonly performed using “scores” or “measures” (Table 1). A “score” thereby represents a discrete numerical value assigned to describe an observation within the context of a standardized system, typically based on evaluation by an expert or algorithm. Although “grades” represent categorical classifications describing the severity of imaging findings (staging) and represent a parameter of “classification” rather than “quantification” (score), both terms are mostly used interchangeably in imaging. They are represented by ranked, categorial, ordinal values. Examples are the Kellgren Lawrence (KL)[6] or joint space narrowing (JSN) grade[7] in the radiographic evaluation of knee OA, where grade 2, for instance, does not at all represent twice the damage observed in grade 1. Another example is the whole-organ MRI Osteoarthritis Knee Score (MOAKS), used in the semi-quantitative evaluation of knee joint structural pathology[1]. A measure, in contrast, represents the product of a fully quantitative process of data collection (measurement), representing a continuous value. Examples covering different dimensionalities are cartilage thickness (mm)[8], total and denuded subchondral bone area (cm2)[8], or cartilage or bone volume (cm3)[2]. Measures may actually be converted into grades, using thresholds or cut-points. The body mass index, for instance (a continuous value) often is converted into ordinal grades of underweight, normal weight, overweight, and obese using thresholds of 18.5, 25, and 30, respectively. In OA trials, a continuous measure of longitudinal change in JSW may be converted to progressors (>0.7mm) or non-progressors (≤0.7mm)[9]. A metric represents a specific variable, related to the performance or quality of a score, grade, or measure (Table 1). Examples are validity/accuracy, reliability/precision[10] (repeatability and reproducibility); the Dice similarity coefficient (segmentation agreement), standardized response means (sensitivity to change), responsiveness to treatment (effect size, for instance by Cohens’ D), and others. For further reviews of definitions and the evaluation of quantitative performance measures by the Quantitative Imaging Biomarker Association (QIBA) see[11,12]. Here, we will use the term “score/grade” when describing discrete and semi-quantitative variables, and “measure” when referring to continuous, fully quantitative measures. The term “assessment” will be used “neutrally” when addressing either semi-quantitative scores or quantitative measures.

Table 1:

Terms frequently used in context with the assessment of medical images in clinical studies

Term Short explanation (for longer explanations see full text)
Assessment Evaluation of images by either scores or measures
Score / Grade Discrete numerical value assigned to an observation (ranked, categorial, ordinal values)
Measure Product of a quantitative process of data collection (continuous value)
Metric Variable related to the performance or quality of an assessment
Biomarker Characteristic indicative of a normal biological or pathological process, or a response to an exposure
Endpoint Biomarkers used to evaluate scientific relationships or the efficacy of an intervention
Outcome Biomarker representing a broader aspect of health (feel, function, survive)
Surrogate outcome Biomarker that has been qualified to substitute a direct measure or an outcome

Image assessments can adopt the role of a “biomarker”, a defined characteristic indicative of a normal biological process, of a pathological process, or of a biological response to an exposure, for instance to a therapeutic intervention[13]. When biomarkers are used to evaluate scientific relationships (i.e. risk factors) in an epidemiological study, or to interrogate the efficacy of an intervention in a clinical trial, they are termed “endpoint” (Table 1). Change in cartilage volume, thickness, bone size, bone shape, and others, have, for instance been used as endpoints in clinical trials of DMOADs [2]. When such a metric represents a broader aspect of a participant’s health, specifically how a patient feels, functions, or survives (or her/his joint survives) it fulfills the requirements of being an “outcome”. Hence imaging assessments can only represent endpoints, but never outcomes. However, if appropriately qualified (validated), an imaging biomarker can assume the role of a “surrogate outcome” and may act as substitute of a direct assessment of a clinically meaningful endpoint (i.e. an outcome)[14] (Table 1). Surrogates are extremely versatile and useful because they are generally more straight-forward, and often more objectively measured; often also they respond earlier to disease progression or to a therapeutic intervention that clinical outcomes. However, even if surrogate outcomes “correlate” with clinical outcomes, they do not necessarily predict these reliably, and therefore must be qualified. Generally, this must occur “de novo” in every specific context and novel application, pertinent to different modes of action (MOA) of new drugs[15]. Although OA imaging assessments commonly serve as endpoints in clinical trials, the FDA has not accepted any of them as a “surrogates” for evaluating efficacy of disease modifying osteoarthritis drugs (DMOADs) in their 2018 Guidance document (https://www.fda.gov/regulatory-information/search-fda-guidance-documents/osteoarthritis-structural-endpoints-development-drugs). This applies neither generally nor in any particular context, and thus represents a great current obstacle to an efficient DMOAD development process.

2). From uni-dimensional to multi-component/composite multi-domain endpoints

Single, uni-dimensional (or uni-variable) assessment represents one structure/tissue or entity only (Table 2). Examples are an anterior cruciate ligament (ACL) “score” (normal vs. complete tear)[16], or a “measure” (continuous scale) of meniscus extrusion distance in a single slice, or extrusion area across several MR images (Table 2)[17]. Yet, they can also be more complex, from either a segmentation point of view (e.g. bone marrow lesion (BML) volume[18,19] or in the way they capture the intricate shape of an anatomical structure such as the bony metaphysis in a single continuous measure, e.g. the B-score[2,20]. Examples of clinical single, uni-dimensional assessments in OA are the numerical rating scale for pain (NRS; a score), or the chair stand test for physical function (CST; a measure), both used, for instance, in the Osteoarthritis Initiative[21] (Table 2).

Table 2:

Classification of different types of assessments (scores or measurements) in osteoarthritis (OA) research: Short explanation of category, and examples given from various domains, including imaging (structure), pain, function, and surgery

Type of Assessment Short explanation Examples
Single uni-variable Single assessment capturing 1 entity (single location) Anterior Cruciate Ligament (ACL) tear #
Meniscus extrusion distance or area
Bone Marrow Lesion (BML) volume
B-Score (Bone shape)
Numerical rating scale (NRS) pain
Chair stand test (functional performance)
Single multi-variable Single assessment capturing > 1 entity (single location) Joint space narrowing (JSN) score
Joint space width (JSW) measurement
Effusion synovitis (ES) volume
Aggregate (uni-variable) Assessment capturing the same type of entity at multiple locations (same joint); various ways of integrating data Osteophyte score
MRI Osteoarthritis Knee Score (MOAKS):
Cartilage damage, or meniscus morphology/position, or bone marrow lesions/cysts, or Hoffa / Effusion synovitis
Cartilage thickness
Cartilage composition:
T2 or T1rho or dGEMRIC or GAGCest, or Sodium, etc.
Synovial thickness
Multi-component: multi-dimensional single-domain Assessment integrating several entities belonging to the same broader area of health/well-being (domain) Kellgren Lawrence Grade (KLG)
MOAKS total Score [or WORMS or BLOKS total score]
Western Ontario & McMaster (WOMAC) pain or function
Multi-component: multi-dimensional multi-domain Assessment integrating several entities belonging to different broader areas of health/well-being (domain) American College of Rheumatology (ACR) definition of OA
Western Ontario & McMaster (WOMAC) total score
Knee Injury and Osteoarthritis Outcome Score (KOOS)
Virtual knee replacement (KR)*
FNIH OA Biomarker Consortium Progressor Definition
Composite: single-domain Assessment marked positive if any single of several entities from the same domain is positive “Any surgery”,
(e.g. meniscectomy or KR)
Composite: multi-domain Assessment marked positive if any single of several entities from different domain is positive Joint “death” defined as virtual KR or KR
OA progression, either structure or pain
#

associated bone marrow lesion/cyst at the site of insertion/origin or repair status may also be evaluated;

*

some definitions only include pain variables: then this is a multi-component; multi-dimensional, single-domain assessment

Minimum (medial) radiographic joint space width (JSW) is considered a simple, one-dimensional measure for assessing OA radiographic disease status and structural progression. Yet, it is multi-dimensional (multi-variable) in nature, as it is governed by at least two structures, i.e. by femoro-tibial cartilage thickness, and by meniscus pathology/extrusion. Both make relevant contributions to JSW cross-sectionally[22] and longitudinally[23] with joint positioning vs. the film and weight-bearing being further influential methodological factors[24]. Similar considerations apply to the radiographic JSN grade for the reasons mentioned above, and to synovitis effusion (SE), because in inter-mediate or proton density weighted spin-echo sequences the inflamed synovium and the (ample) synovial fluid both appear hyper-intense and are therefore visually indistinguishable[25] (Table 2). Single, multi-dimensional assessments are sometimes addressed as “composite” measures, because more than one factor governs the target assessment; yet, this is inviting confusion, because there are stricter definitions of “composites”, which we will address below.

If several assessments are taken of the same structure, e.g. across different regions (or of the same clinical entity), this may be considered an aggregate. An aggregate assessment is always uni-dimensional (Table 2). Different scores or measures are typically combined using sums, maxima, averages/means, medians, standard deviations, etc. Examples of aggregate scores are those used for osteophyte scores around the knee by MRI[26], or those for cartilage lesions[27], meniscus lesions[28], or synovitis (effusion or Hoffa)[29] (Table 2). Examples of aggregate measures are “cartilage thickness” determined by MRI-based cartilage morphometry, typically encompassing 1000 data points per square centimeter of the joint surface [2,30]. Other examples are cartilage transverse relaxation time, T2[31,32] (or other compositional quantitative (q)MRI measures in the deep and superficial cartilage zone), and “synovial thickness” by contrast-enhanced (CE) MRI[25,33]. Within-measure summaries can be complex, for instance when using longitudinal regional variability to guide the integration process[34]. This strategy is useful to determine the overall magnitudes of cartilage loss and cartilage gain, independent of their regional distribution and associated biomechanical determinants/constraints[35]. Local variation in MRI relaxometry signals can be employed to create complex summaries, which potentially relate better to tissue properties and relevant clinical conditions than means or standard deviations. These include texture analysis[3639], statistical parametric mapping[40,41], spatial gradient method[42,43], principal component analysis of Z-score difference maps[44,45], and others.

The terms “multi-component” and “composite” assessment are applicable when the evaluation is multi-dimensional (covering >1 structure/tissue, or clinical entity). In contrast to (the above) single, multi-dimensional measurements, such as JSW or ES, the components are not intricate to another by laws of nature, but represent separate scores or measures that contribute to a common endpoint along the rules defined by a scientist. These components can remain within a single domain (such as pain, or functional disability, or imaging, or surgical intervention, etc.), or may extend across these broader areas of health and well-being (multi-domain). The combination of imaging endpoints obtained from different modalities (such as MRI and CT, or MRI and PET) sometimes is addressed as a “multi-domain” approach; however, we consider this combination “multi-modal”, as it spans across various imaging technologies, but nevertheless remains within the broader scope of medical imaging. A “composite” endpoint is defined in a way that only one (of the several) components needs to reach a specified threshold so that the entire endpoint becomes “positive”, irrespective of the status of the other components[46,47], An analogy from professional sports is the “Tricot jaune” at the Tour de France; this is granted to the fastest rider in the overall standings, independent of how his team mates are ranked. A medical example from the cardiovascular field is the “major adverse cardiac event” (MACE) as composite outcome, typically including either cardiovascular death, or non-fatal acute myocardial infarction, or non-fatal stroke, or revascularization[48]. Yet, if the components are combined to a novel score, and a clear definition is provided above which threshold this score will be “positive”, this may also be considered a composite[46,47]. “Multi component” endpoints, in contrast, become “positive” when certain combinations of components meet some pre-determined thresholds, in certain pre-set combinations[5]. Whilst each component contributes a separate piece of information, the components remain separate and distinguishable throughout. Analogies from sports are the team time trials at the Tour de France, where time counts when the 4th (of 8) cyclists reaches the finish line. The standards by which components in multi-component endpoints (or composites) are aggregated, weighted and standardized are not defined. Therefore, these principles must be established, explored, and validated empirically, best by testing clinical usefulness. The QIBA outlined that when quantitative imaging biomarkers (QIBs) are combined to multi-parametric QIBs (multi-dimensional, multi-component endpoints) obtained from one or several imaging modalities, additional clinical utility may be drawn over single measures for detecting disease, identifying phenotypes, detecting longitudinal change, predicting outcomes, or other potential intended uses[49]. The QIBA proposed that, if carefully selected, each QIB can contribute unique clinical information, with the value of the multi-parametric QIBs exceeding those of their individual components[49].

A good example of a multi-component, multi-dimensional, single domain endpoint is the Kellgren Lawrence grade (KLG), reflecting radiographic bone pathology (osteophytes, subchondral bone sclerosis and attrition) and JSN (cartilage and meniscus pathology)[22,23]. Another example from the radiology field is the whole-organ MRI-based MOAKS reading system that permits evaluating structural pathology in different joint regions, using different imaging/pathology features, and for most joint tissues (multi-dimensionality)[1,16,50]. This provides a plethora of potential different combinations, with the opportunity to characterize various morpho- /pheno-types[3,51] of the disease that may require different medical treatments, but also with challenges in how to integrate in the vast amount of readings available[51]. Multi component endpoints derived from quantitative imaging measures have been probed for identifying morpho-types that relate to incident[52] or progressive[53] knee OA. Based on 12 potential composite qMRI candidates, Harkey et al.[52] defined a “disease activity” multi-component endpoint (sum of BML and effusion synovitis volume) and a “cumulative cartilage damage metric” (obtained from several local cartilage thickness measures) and found that both were associated with accelerated incident radiographic knee OA over 2 years. In a later study[53], the authors tested the construct validity of qMRI composite candidates in predicting disease progression. Whereas the “disease activity metric” was more strongly related to symptomatic progression (WOMAC pain), the” cumulative cartilage damage metric” was showed a stronger relationship with radiographic (KLG and medial JSW) progression[53]. WOMAC pain (or function) scores also represent multi-dimensional, single domain multi component endpoints, each collecting clinical information on the same domain, but for variable conditions, such as pain or functional limitations during different tasks and activities (Table 2). Again, if assessment integrates across broader areas of health and wellbeing, such as imaging (structure) and clinical information (pain or function) these are considered “multi-domain”. Examples for such a construct are the American College of Rheumatology (ACR) definition of OA, which is based on clinical symptoms, physical examination, and radiographic findings[54]. As typical for multi-component endpoints/outcomes, each of these “domains” provides a separate piece of information, contributing to the overall diagnosis[54]. The Foundation for the National Institute of Health (FNIH) Osteoarthritis Initiative (OAI) Biomarker Consortium definition of “disease progression, for instance, is based on meting a certain (>0.7mm) change in radiographic JSW as well as a change in WOMAC pain over year 2–5 follow-up”[9,55,56], a typical characteristic of a multi-domain multiple-component outcome. Had a different definition been chosen, such as labeling participants with pain progression or radiographic progression (or both) as progressors, this would equate to a multi-domain composite outcome[9,55,56]. Another example is virtual knee replacement (vKR), where a definition of “endstage” knee OA that requires replacements is based on pain (function) and radiographic status[57,58].

Examples of composite assessments that remain within a single domain are “any surgery” endpoints, where for instance either meniscectomy or knee replacement count as events rendering the outcome positive[59] (Table 2). Other examples in OA research are a definition of “joint death” indicated by either “real KR” or “virtual KR”[60], or “real KR or severe symptom state”[14].

3). Statistical considerations

When more than one (single or aggregate) assessment is to be included in the overall evaluation, a predefined statistical analysis plan needs to define the endpoint hierarchy (primary, [co-primary], key secondary, secondary, [co-secondary], exploratory, etc.), each with certain statistical considerations for multiple comparisons[4]. Co-primary endpoints require that treatment effects be demonstrated for all selected endpoints, and therefore require adjustment of statistical power for each individual test to ensure the overall power is maintained (Fig. 1). Using a multiple endpoint approach, a treatment effect needs to be demonstrated on at least one endpoint, requiring an adjustment of alpha for each test to ensure overall Type I error is maintained (Fig. 1). For example, to detect a 0.5 effect size with an overall type I error of 0.05 and an overall power of 80%, n=64 participants are required per group using a single primary endpoint, n=78 using two multiple endpoints, and n=83 using two co-primary endpoints.

Fig. 1:

Fig. 1:

Sample size required for a given effect size for a single endpoint (red), multiple endpoints (blue) and a co-primary endpoint (green)

Because multiple-component or composite endpoints combine assessments into one (single) endpoint, there is no statistical penalty for multiplicity, as the “inherent” components are not tested in parallel. However, it has to be borne in mind that the “factors”, which serve as components, remain in a “black box”. If an intervention affects various components of the endpoint differently, there is a risk of masking of the treatment effect[61,62]. While it is possible to disentangle the effects of treatment on individual components of the outcome, these analyses must be considered as exploratory and/or undergo proper adjustments for multiple testing, and require a thoughtful analytic approach[62]. Composite endpoints may be defined as the incidence of any individual component, i.e., KR or vKR[60], which considers each endpoint as being of equal clinical importance; however, it must first be established whether this is appropriate[63]. Yet, algorithms that weigh individual components also require thorough validation[5].

4). Application of these concepts to interventional trials

Attempts are often made to define, qualify and validate a single imaging endpoint or outcome measure as being optimal for all OA trials, and in convincing regulatory agencies to approve such an endpoint universally. Yet, this is neither realistic nor reasonable, given differences in the MOA between drugs and differences in patient cohorts with matching morpho-types.

In a randomized clinical trial (RCT) on a putative anabolic cartilage agent (FORWARD), an aggregate measure of mean cartilage thickness change across the (medial and lateral) weight-bearing femorotibial joint showed a statistically significant difference between participants treated i.a. with 100μg Sprifermin vs. those injected with placebo, whereas the single, multivariable measure of radiographic JSW did not [64]. This is most likely due to a) greater robustness and sensitivity of an aggregate measure that integrates information across the entire joint compared with a single point measure, b) greater specificity of an MRI based measure of cartilage to the MOA of the anabolic cartilage drug than a multi-dimensional measure that depends both on cartilage and meniscus status, and c) the fact that a single (uni-dimensional) measurement is more strongly dependent on the image acquisition conditions (orientation of the film relative to the joint) than one that is three-dimensional. Yet, a measure of cartilage thickness (or volume) is unable to differentiate to what extent disease-related cartilage thinning in the joint has been affected, or just cartilage thickening has been boosted, potentially at anatomical locations where such an increase in cartilage thickness is biomechanically and functionally not useful. Yet, separating the measure of thickness change from its specific anatomical location, and creating aggregate measures of thickening and thinning revealed that not only the cartilage thickening was enhanced by the drug, but that cartilage thinning was substantially reduced, wherever it may have occurred in the joint. Surprisingly, this reduction was almost the level of that observed (methodologically) in healthy reference subjects[35]. Further, in a DMOAD trial the type of assessment also needs to match the patient morpho-/pheno-types[3], and hence the respective patient selection criteria. Post-hoc analyses of the FORWARD study revealed that the effect of Sprifermin on cartilage was similar for patients with thinner baseline cartilage (less JSW) and greater baseline pain than for those with greater cartilage thickness and less pain[65]. This was opposite to the often-made assumption that a therapeutic effect can only be expected at earlier stages the disease. And all the more, in participants with less cartilage thickness and more pain the treatment-related positive effect on structure did in fact translate into a meaningful clinical benefit (vs. placebo), whereas in those with thicker baseline cartilage and less baseline pain in the FORWARD study, a treatment-response to pain was not seen in this cohort[65]. From a patho-physiology standpoint it makes sense that participants who display signs of net cartilage matrix loss at baseline will profit more symptom-wise from anabolic cartilage treatment, because an existing loss of cartilage at baseline suggests that the tissues may be structurally involved in the disease, and an anabolic reversal of that status may improve its condition[65]. Other DMOAD trials also have attempted to match the MOA and the patient inclucion criteria with the measurement endpoints, including those targeting modification of cartilage[6669], synovitis[6971], BMLs[7274], or bone (shape)[75,76]. In cases of synovitis and BMLs, it is an open question on whether demonstrating a positive effect on structure will be sufficient to receive DMOAD approval from a regulatory agency; synovium and bone marrow represent strongly reactive joint tissues and fluctuations of their status are seen in normal disease progression[7779]. Structurally, they are not directly involved in load-bearing and biomechanical functioning, and most of the above trials (therefore) have examined cartilage at least as as a secondary endpoint. Certain degradation products of the cartilage are known to cause synovitis and to have negative downstream effects on other joint tissues that are deleterious to joint health and enhance progression[80]. Hence, DMOAD trials may use the opportunity to assess synovitis or BMLs as primary, and cartilage (or other weight-bearing tissues) as co-primary or secondary structural endpoints. Alternatively, both (or more) structural dimensions may be included in multi-component, multi-dimensional, single domain (imaging) endpoint, or could be combined further with measures of pain and/or function to form a multi-component, multi-domain outcome. Yet, more is not necessarily better, and important considerations are to what extent the components combined a) are correlated (providing redundant information), b) change in the same or different directions and potentially cancel each other out, c) fluctuate in sync or independently, and d) are precise (low test-retest error). As mentioned previously, multi-component endpoints and composites need to be thoroughly validated clinically, best repeatedly and de novo in each new context.

Applied to late stage OA, multi-component endpoints may be created as a composite of knee replacement (KR) or severe clinical (symptom) state, the latter defined by conservative thresholds of patient-reported outcomes (PROs)[14]. Assessments of severe clinical state conditions may be combined to a multi-component outcome of virtual KR (vKR), either including or not including radiographiy[57,58,81]. In terms of validation, they should display clinical utility, i.e. be related to KR[57,58], or should be similarly related to relevant predictors of KR[60]. The attractiveness of such an approach is that regulatory agencies are very focused on outcomes such as KR (or vKR), as these are very closely related to how a patients feels, function or her/his knee survives (i.e. receives surgery)[14]. A multi-domain composite defined as “KR or vKR” may be powerful, as the vKR component may partially compensate for the elective nature of KR. The number of KRs is limited by the availability of surgical intervention in a given healthcare system, the socio-economic status of the patient or country, the educational status of the patient, the unwillingness or un-ability of the patient to undergo surgery, and further constraints. A recent estimate showed that using a composite KR or critical thresholds of pain and function outcomes, rather than KR alone, reduces required sample sizes by 40%[14], albeit the estimated sample sizes for OA confirmatory trials of greater than 2000 participants are still quixotic and barely feasible for industry sponsors.

If a study is performed at earlier disease stages, for instance when testing physiotherapy or DMOADS, which aim to prevent disease progression in patients with meniscus injury, an “any surgery” composite outcome[59] may be used effectively to explore whether the intervention prevents surgery. If the study is targeted at more long-term effects of the therapy, single or multi-component imaging endpoints can be used, but these would have to be previously qualified as surrogate biomarkers that eventually predict KR (or a KR/vKR composite[60]).

CONCLUSION AND OUTLOOK

Multi-component endpoints are seductive, given the level of data integration achievable across different dimensions, modalities, and domains. However, the components remain in a “black box” and attempts to unravel their individual contribution statistically comes with threats to type I error and/or power. Multi-component endpoints may thus be more suitable for interventional studies seeking regulatory approval than for research applications that try to unravel risk factors, proof of (therapeutic) concepts, or other exploratory relationships in OA research. The plethora of available endpoints in OA research is daring, and the number of options almost infinite. Therefore, smart research strategies need to explore and validate these gigantic possibilities, using an appropriate statistical framework. A one-endpoint-fits-all approach must fail, because (imaging) assessment needs to be tailored to the participants’ morpho-type and to the putative DMOAD’s mechanism of action (MOA). Integrative assessments across dimensions (and domains) have great potential, but in each case this needs to be balanced against the loss of information how each component contributes.

Fig. 2:

Fig. 2:

Schematic visualization of multiplicity approaches of endpoints (EP):

A) Multiple independent endpoints that require statistical adjustment when tested in parallel to become positive

B) COMP EP = composite endpoint. Only one of several defined endpoints (here blue) has to pass a previously defined threshold, for the COMP EP to become positive (no statistical adjustment required)

C) MC EP = multiple component endpoint. All endpoints contribute based on previously defined mathematical relationships. If passing the threshold according to this algoithm, the MC EP becomes positive (no statistical adjustment required)

ACKNOWLEDGMENT:

We would like to thank the OSTIMA Journal Editor in Chief, Prof. Frank Roemer, and the Editors of this Special Issue, for the opportunity to publish the content of our presentation at the 18th International Workshop of Osteoarthritis Imaging. And we would like to wholeheartedly thank Professor Ali Guermazi for organizing, once more, such a fantastic event in Marrakech in June 2024.

Role of the funding source:

The authors did not receive funding for writing this article or activities associated with its making.

CONFLICTS:

  • Felix Eckstein is CEO/CMO and co-owner of Chondrometrics GmbH; he has provided consulting services to Merck KGaA, Kolon-Tissuegene (KTG), Servier, Galapagos, Novartis, 4P Pharma, Formation Bio, and Peptinov outside the submitted work.

  • Jamie Collins provides consulting services to Boston Imaging Core Labs, LLC and is on the Scientific Advisory Board of Chondrometrics GmbH.

  • Tanja Stamm has received grants and consulting fees from AbbVie, Roche, Sanofi, Takeda, and Novartis outside the submitted work.

Footnotes

Consent for publication: All authors have consented publication

Ethics approval: No original data is published in this article

REFERENCES

  • [1].Roemer FW, Jarraya M, Hayashi D, Crema MD, Haugen IK, Hunter DJ, et al. A perspective on the evolution of semi-quantitative MRI assessment of osteoarthritis: Past, present and future. Osteoarthr Cartil 2024. 10.1016/j.joca.2024.01.001. [DOI] [PubMed] [Google Scholar]
  • [2].Eckstein F, Wluka AE, Wirth W, Cicuttini F. 30 Years of MRI-based cartilage & bone morphometry in knee osteoarthritis: From correlation to clinical trials. Osteoarthr Cartil 2024;32:439–51. 10.1016/j.joca.2024.02.002. [DOI] [PubMed] [Google Scholar]
  • [3].Mobasheri A, Loeser R. Clinical phenotypes, molecular endotypes and theratypes in OA therapeutic development. Nat Rev Rheumatol 2024. 10.1038/s41584-024-01126-4. [DOI] [PubMed] [Google Scholar]
  • [4].Pocock SJ. Clinical trials with multiple outcomes: a statistical perspective on their design, analysis, and interpretation. Control Clin Trials 1997;18:530–9. 10.1016/s0197-2456(97)00008-1. [DOI] [PubMed] [Google Scholar]
  • [5].Sankoh AJ, Li H, D’Agostino RBS. Composite and multicomponent end points in clinical trials. Stat Med 2017;36:4437–40. 10.1002/sim.7386. [DOI] [PubMed] [Google Scholar]
  • [6].Schiphof D, de Klerk BM, Kerkhof HJM, Hofman A, Koes BW, Boers M, et al. Impact of different descriptions of the Kellgren and Lawrence classification criteria on the diagnosis of knee osteoarthritis. Ann Rheum Dis 2011;70:1422–7. [DOI] [PubMed] [Google Scholar]
  • [7].Altman RD, Gold GE. Atlas of individual radiographic features in osteoarthritis, revised. Osteoarthr Cartil 2007;15:A1−−56. 10.1016/j.joca.2006.11.009. [DOI] [PubMed] [Google Scholar]
  • [8].Buck RJ, Wyman BT, Le Graverand MP, Wirth W, Eckstein F. An efficient subset of morphological measures for articular cartilage in the healthy and diseased human knee. Magn Reson 2010;63:680–90. [DOI] [PubMed] [Google Scholar]
  • [9].Eckstein F, Collins JE, Nevitt MC, Lynch JA, Kraus V, Katz JN, et al. Cartilage thickness change as an imaging biomarker of knee osteoarthritis progression - data from the fnih OA biomarkers consortium. Arthritis Rheumatol (Hoboken, NJ) 2015;67:3184–9. 10.1002/art.39324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Gluer CC, Blake G, Lu Y, Blunt BA, Jergas M, Genant HK, et al. Accurate assessment of precision errors: how to measure the reproducibility of bone densitometry techniques. OsteoporosInt 1995;5:262–70. [DOI] [PubMed] [Google Scholar]
  • [11].Sullivan DC, Obuchowski NA, Kessler LG, Raunig DL, Gatsonis C, Huang EP, et al. Metrology Standards for Quantitative Imaging Biomarkers. Radiology 2015;277:813–25. 10.1148/radiol.2015142202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Raunig DL, McShane LM, Pennello G, Gatsonis C, Carson PL, Voyvodic JT, et al. Quantitative imaging biomarkers: a review of statistical methods for technical performance assessment. Stat Methods Med Res 2015;24:27–67. 10.1177/0962280214537344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Roemer FW, Wirth W, Demehri S, Kijowski R, Jarraya M, Hayashi D, et al. Imaging Biomarkers of Osteoarthritis. Semin Musculoskelet Radiol 2024;28:14–25. 10.1055/s-0043-1776432. [DOI] [PubMed] [Google Scholar]
  • [14].Kim Y, Levin G, Nikolov NP, Abugov R, Rothwell R. Concept End Points Informing Design Considerations for Confirmatory Clinical Trials in Osteoarthritis. Arthritis Care Res (Hoboken) 2022;74:1154–62. 10.1002/acr.24549. [DOI] [PubMed] [Google Scholar]
  • [15].Kraus VB, Simon LS, Katz JN, Neogi T, Hunter D, Guermazi A, et al. Proposed study designs for approval based on a surrogate endpoint and a post-marketing confirmatory study under FDA’s accelerated approval regulations for disease modifying osteoarthritis drugs. Osteoarthr Cartil 2019;27:571–9. 10.1016/j.joca.2018.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Hunter DJ, Guermazi A, Lo GH, Grainger AJ, Conaghan PG, Boudreau RM, et al. Evolution of semi-quantitative whole joint assessment of knee OA: MOAKS (MRI Osteoarthritis Knee Score). Osteoarthr Cartil 2011;19:990–1002. 10.1016/j.joca.2011.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Wirth W, Frobell RB, Souza RB, Li X, Wyman BT, Le Graverand MP, et al. A three-dimensional quantitative method to measure meniscus shape, position, and signal intensity using MR images: a pilot study and preliminary results in knee osteoarthritis. Magn Reson 2010;63:1162–71. [DOI] [PubMed] [Google Scholar]
  • [18].Driban JB, Price LL, Lo GH, Pang J, Hunter DJ, Miller E, et al. Evaluation of bone marrow lesion volume as a knee osteoarthritis biomarker - longitudinal relationships with pain and structural changes: data from the Osteoarthritis Initiative. Arthritis ResTher 2013;15:R112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Pang J, Driban JB, Destenaves G, Miller E, Lo GH, Ward RJ, et al. Quantification of bone marrow lesion volume and volume change using semi-automated segmentation: data from the osteoarthritis initiative. BMCMusculoskeletDisord 2013;14:3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].Bowes MA, Kacena K, Alabas OA, Brett AD, Dube B, Bodick N, et al. Machine-learning, MRI bone shape and important clinical outcomes in osteoarthritis: data from the Osteoarthritis Initiative. Ann Rheum Dis 2021;80:502–8. 10.1136/annrheumdis-2020-217160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [21].Nevitt MC, Felson DT, Lester G. The Osteoarthritis Initiative: Protocol for the Cohort Study. 2006.
  • [22].Roth M, Wirth W, Emmanuel K, Culvenor AG, Eckstein F. The contribution of 3D quantitative meniscal and cartilage measures to variation in normal radiographic joint space width—Data from the Osteoarthritis Initiative healthy reference cohort. Eur J Radiol 2017;87:90–8. 10.1016/j.ejrad.2016.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Roth M, Emmanuel K, Wirth W, Kwoh CK, Hunter DJ, Eckstein F. Sensitivity to change and association of three-dimensional meniscal measures with radiographic joint space width loss in rapid clinical progression of knee osteoarthritis. Eur Radiol 2018;28:1844–53. 10.1007/s00330-017-5140-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Messieh SS, Fowler PJ, Munro T. Anteroposterior radiographs of the osteoarthritic knee. JBone Jt SurgBr 1990;72:639–40. [DOI] [PubMed] [Google Scholar]
  • [25].Hayashi D, Roemer FW, Jarraya M, Guermazi A. Update on recent developments in imaging of inflammation in osteoarthritis: a narrative review. Skeletal Radiol 2023;52:2057–67. 10.1007/s00256-022-04267-3. [DOI] [PubMed] [Google Scholar]
  • [26].Hakky M, Jarraya M, Ratzlaff C, Guermazi A, Duryea J. Validity and responsiveness of a new measure of knee osteophytes for osteoarthritis studies: Data from the osteoarthritis initiative. Osteoarthr Cartil 2015;23:2199–205. 10.1016/j.joca.2015.07.003. [DOI] [PubMed] [Google Scholar]
  • [27].Reichenbach S, Yang M, Eckstein F, Niu J, Hunter DJ, McLennan CE, et al. Do cartilage volume or thickness distinguish knees with and without mild radiographic osteoarthritis? The Framingham study. AnnRheumDis 2010;69:143–9. 10.1136/ard.2008.099200.Do. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Englund M, Guermazi A, Gale D, Hunter DJ, Aliabadi P, Clancy M, et al. Incidental meniscal findings on knee MRI in middle-aged and elderly persons. NEnglJMed 2008;359:1108–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Hagiwara S, Yang A, Takao S, Kaneko Y, Nozaki T, Yoshioka H. New scoring system in assessment of Hoffa’s fat pad synovitis: A comparative study with established scoring systems. World J Radiol 2018;10:162–71. 10.4329/wjr.v10.i11.162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Eckstein F, Sittek H, Gavazzeni A, Schulte E, Milz S, Kiefer B, et al. Magnetic resonance chondro-crassometry (MR CCM): a method for accurate determination of articular cartilage thickness? Magn Reson 1996;35:89–96. [DOI] [PubMed] [Google Scholar]
  • [31].Eckstein F, Brisson NM, Maschek S, Wisser A, Berenbaum F, Duda GN, et al. Clinical validation of fully automated laminar knee cartilage transverse relaxation time (T2) analysis in anterior cruciate ligament (ACL)-injured knees— on behalf of the osteoarthritis (OA)-Bio consortium. Quant Imaging Med Surg 2024;14:4319–32. 10.21037/qims-24-194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Herger S, Wirth W, Eckstein F, Nüesch C, Egloff C, Mündermann A. Anterior cruciate ligament (ACL) injury and age affect knee cartilage T2 but not thickness. Osteoarthr Cartil 2024. 10.1016/j.joca.2024.06.014. [DOI] [PubMed] [Google Scholar]
  • [33].Fotinos-Hoyer AK, Guermazi A, Jara H, Eckstein F, Ozonoff A, Khard H, et al. Assessment of synovitis in the osteoarthritic knee: Comparison between manual segmentation, semiautomated segmentation, and semiquantitative assessment using contrast-enhanced fat-suppressed T1-weighted MRI. Magn Reson 2010;64:604–9. [DOI] [PubMed] [Google Scholar]
  • [34].Eckstein F, Buck R, Wirth W. Location-independent analysis of structural progression of osteoarthritis - taking it all apart, and putting the puzzle back together makes the difference. Semin Arthritis Rheum 2017;46:404–10. 10.1016/j.semarthrit.2016.08.016. [DOI] [PubMed] [Google Scholar]
  • [35].Eckstein F, Wax S, Aydemir A, Wirth W, Maschek S, Hochberg M. Intra-articular sprifermin reduces cartilage loss in addition to increasing cartilage gain independent of femorotibial location: a post-hoc analysis of a randomized, placebo-controlled phase ii clinical trial. Ann Rheum Dis 2020;79:525–8. 10.1016/j.joca.2018.02.590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Carballido-Gamio J, Joseph GB, Lynch JA, Link TM, Majumdar S. Longitudinal analysis of MRI T2 knee cartilage laminar organization in a subset of patients from the osteoarthritis initiative: a texture approach. Magn Reson 2011;65:1184–94. [DOI] [PubMed] [Google Scholar]
  • [37].Peuna A, Thevenot J, Saarakkala S, Nieminen MT, Lammentausta E. Machine learning classification on texture analyzed T2 maps of osteoarthritic cartilage: oulu knee osteoarthritis study. Osteoarthr Cartil 2021;29:859–69. 10.1016/j.joca.2021.02.561. [DOI] [PubMed] [Google Scholar]
  • [38].Joseph GB, Baum T, Carballido-Gamio J, Nardo L, Virayavanich W, Alizai H, et al. Texture analysis of cartilage T2 maps: individuals with risk factors for OA have higher and more heterogeneous knee cartilage MR T2 compared to normal controls - data from the osteoarthritis initiative. Arthritis ResTher 2011;13:R153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [39].Urish KL, Keffalas MG, Durkin JR, Miller DJ, Chu CR, Mosher TJ. T2 texture index of cartilage can predict early symptomatic OA progression: data from the osteoarthritis initiative. OsteoarthritisCartilage 2013;21:1550–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [40].Pedoia V, Li X, Su F, Calixto N, Majumdar S. Fully automatic analysis of the knee articular cartilage T1$ρ$ relaxation time using voxel-based relaxometry. J Magn Reson Imaging 2016;43:970–80. 10.1002/jmri.25065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [41].Liao T-C, Pedoia V, Neumann J, Link TM, Souza RB, Majumdar S. Extracting Voxel-Based Cartilage Relaxometry Features in Hip Osteoarthritis Subjects Using Principal Component Analysis. J Magn Reson Imaging 2020;51:1708–19. 10.1002/jmri.26955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Li X, Pai A, Blumenkrantz G, Carballido-Gamio J, Link T, Ma B, et al. Spatial distribution and relationship of T1rho and T2 relaxation times in knee cartilage with osteoarthritis. Magn Reson 2009;61:1310–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Wilson RL, Emery NC, Pierce DM, Neu CP. Spatial Gradients of Quantitative MRI as Biomarkers for Early Detection of Osteoarthritis: Data From Human Explants and the Osteoarthritis Initiative. J Magn Reson Imaging 2023;58:189–97. 10.1002/jmri.28471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44].Dunn TC, Lu Y, Jin H, Ries MD, Majumdar S. T2 relaxation time of cartilage at MR imaging: comparison with severity of knee osteoarthritis. Radiology 2004;232:592–8. 10.1148/radiol.2322030976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].Roach KE, Souza RB, Majumdar S, Pedoia V. Local Patterns in 2-Year T(1ρ) and T(2) Changes of Hip Cartilage Are Related to Sex and Functional Data: A Prospective Evaluation on Hip Osteoarthritis Participants. J Magn Reson Imaging 2023;57:1042–53. 10.1002/jmri.28347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Sankoh AJ, Li H, D’Agostino RBS. Use of composite endpoints in clinical trials. Stat Med 2014;33:4709–14. 10.1002/sim.6205. [DOI] [PubMed] [Google Scholar]
  • [47].Sankoh AJ, Li H, D’Agostino RBS. Use of composite endpoints in clinical trials. Stat Med 2016;35:319–20. 10.1002/sim.6581. [DOI] [PubMed] [Google Scholar]
  • [48].Marsal JR, Urreta-Barallobre I, Ubeda-Carrillo M, Osorio D, Lumbreras B, Lora D, et al. Sample size requirement in trials that use the composite endpoint major adverse cardiovascular events (MACE): new insights. Trials 2022;23:1037. 10.1186/s13063-022-06977-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [49].Delfino JG, Pennello GA, Barnhart HX, Buckler AJ, Wang X, Huang EP, et al. Multiparametric Quantitative Imaging Biomarkers for Phenotype Classification: A Framework for Development and Validation. Acad Radiol 2023;30:183–95. 10.1016/j.acra.2022.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [50].Guermazi A, Niu J, Hayashi D, Roemer FW, Englund M, Neogi T, et al. Prevalence of abnormalities in knees detected by MRI in adults without knee osteoarthritis: population based observational study (Framingham Osteoarthritis Study). BMJ 2012;345:e5339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [51].Collins JE, Roemer FW, Guermazi A. Approaches to optimize analyses of multidimensional ordinal MRI data in osteoarthritis research: A perspective. Osteoarthr Cartil Open 2024;6:100465. 10.1016/j.ocarto.2024.100465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [52].Harkey MS, Davis JE, Price LL, Ward RJ, MacKay JW, Eaton CB, et al. Composite quantitative knee structure metrics predict the development of accelerated knee osteoarthritis: data from the osteoarthritis initiative. BMC Musculoskelet Disord 2020;21:299. 10.1186/s12891-020-03338-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [53].Driban JB, Price LL, LaValley MP, Lo GH, Zhang M, Harkey MS, et al. Novel Framework for Measuring Whole Knee Osteoarthritis Progression Using Magnetic Resonance Imaging. Arthritis Care Res (Hoboken) 2022;74:799–808. 10.1002/acr.24512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [54].Altman R, Asch E, Bloch D, Bole G, Borenstein D, Brandt K, et al. Development of criteria for the classification and reporting of osteoarthritis. Classification of osteoarthritis of the knee. Diagnostic and Therapeutic Criteria Committee of the American Rheumatism Association. Arthritis Rheum 1986;29:1039–49. [DOI] [PubMed] [Google Scholar]
  • [55].Hunter DJ, Deveza LA, Collins JE, Losina E, Nevitt MC, Roemer FW, et al. Multivariable modeling of biomarker data from the phase 1 Foundation for the NIH Osteoarthritis Biomarkers Consortium. Arthritis Care Res (Hoboken) 2022;74:1142–53. 10.1002/acr.24557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [56].Hunter DJ, Nevitt M, Losina E, Kraus V. Biomarkers for osteoarthritis: current position and steps towards further validation. Best Pract Res Clin Rheumatol 2014;28:61–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [57].Gossec L, Paternotte S, Maillefert JF, Combescure C, Conaghan PG, Davis AM, et al. The role of pain and functional impairment in the decision to recommend total joint replacement in hip and knee osteoarthritis: an international cross-sectional study of 1909 patients. Report of the OARSI-OMERACT Task Force on total joint replacement. OsteoarthritisCartilage 2011;19:147–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [58].Dougados M, Hawker G, Lohmander S, Davis AM, Dieppe P, Maillefert JF, et al. OARSI/OMERACT criteria of being considered a candidate for total joint replacement in knee/hip osteoarthritis as an endpoint in clinical trials evaluating potential disease modifying osteoarthritic drugs. JRheumatol 2009;36:2097–9. [DOI] [PubMed] [Google Scholar]
  • [59].Sullivan JK, Irrgang JJ, Losina E, Safran-Norton C, Collins J, Shrestha S, et al. The TeMPO trial (treatment of meniscal tears in osteoarthritis): rationale and design features for a four arm randomized controlled clinical trial. BMC Musculoskelet Disord 2018;19:429. 10.1186/s12891-018-2327-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [60].Eckstein F, Hannon MJ, Fujii T, Boudreau RM, Wirth W, Maschek S, et al. Quantitative Cartilage Thickness Change Preceding Surgical and Virtual Knee Replacement Status – Comparative Analysis Using Data From the OAI. Osteoarthr. Cartil, vol. 25, 2017, p. S248–9. 10.1016/j.joca.2017.02.420. [DOI] [Google Scholar]
  • [61].Ferreira-Gonzalez I, Permanyer-Miralda G, Busse JW, Devereaux PJ, Guyatt GH, Alonso-Coello P, et al. Composite outcomes can distort the nature and magnitude of treatment benefits in clinical trials. Ann Intern Med 2009;150:566–7. 10.7326/0003-4819-150-8-200904210-00016. [DOI] [PubMed] [Google Scholar]
  • [62].Pogue J, Devereaux PJ, Thabane L, Yusuf S. Designing and analyzing clinical trials with composite outcomes: consideration of possible treatment differences between the individual outcomes. PLoS One 2012;7:e34785. 10.1371/journal.pone.0034785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [63].Lauer MS, Topol EJ. Clinical trials--multiple treatments, multiple end points, and multiple lessons. JAMA 2003;289:2575–7. 10.1001/jama.289.19.2575. [DOI] [PubMed] [Google Scholar]
  • [64].Hochberg MC, Guermazi A, Guehring H, Aydemir A, Wax S, Fleuranceau-Morel P, et al. Effect of Intra-Articular Sprifermin vs Placebo on Femorotibial Joint Cartilage Thickness in Patients With Osteoarthritis. JAMA 2019;322:1360–70. 10.1001/jama.2019.14735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [65].Guehring H, Moreau F, Daelken B, Ladel C, Guenther O, Bihlet AR, et al. The effects of sprifermin on symptoms and structure in a subgroup at risk of progression in the FORWARD knee osteoarthritis trial. Semin Arthritis Rheum 2021;51:450–6. 10.1016/j.semarthrit.2021.03.005. [DOI] [PubMed] [Google Scholar]
  • [66].Imbert O, Chimits D, Guedj M, Lorieau F, Bernard K, Lalande A, et al. A simple inclusion criteria combination increases the rate of cartilage loss in patients with knee osteoarthritis. Osteoarthr Cartil Open 2021;3:100188. 10.1016/j.ocarto.2021.100188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [67].Imbert O, Deckx H, Bernard K, van der Aar E, Pueyo M, Saeed N, et al. The design of a randomized, placebo-controlled, dose-ranging trial to investigate the efficacy and safety of the ADAMTS-5 inhibitor S201086/GLPG1972 in knee osteoarthritis. Osteoarthr Cartil Open 2021;3:100209. 10.1016/j.ocarto.2021.100209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [68].Schnitzer T, Pueyo M, Deckx H, van der Aar E, Bernard K, Hatch S, et al. Evaluation of S201086/GLPG1972, an ADAMTS-5 inhibitor, for the treatment of knee osteoarthritis in ROCCELLA: a phase 2 randomized clinical trial. Osteoarthr Cartil 2023;31:985–94. 10.1016/j.joca.2023.04.001. [DOI] [PubMed] [Google Scholar]
  • [69].Guermazi A, Kalsi G, Niu J, Crema MD, Copeland RO, Orlando A, et al. Structural effects of intra-articular TGF-β1 in moderate to advanced knee osteoarthritis: MRI-based assessment in a randomized controlled trial. BMC Musculoskelet Disord 2017;18:461. 10.1186/s12891-017-1830-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [70].Fleischmann RM, Bliddal H, Blanco FJ, Schnitzer TJ, Peterfy C, Chen S, et al. A Phase II Trial of Lutikizumab, an Anti-Interleukin-1$α$/$β$ Dual Variable Domain Immunoglobulin, in Knee Osteoarthritis Patients With Synovitis. Arthritis Rheumatol (Hoboken, NJ) 2019;71:1056–69. 10.1002/art.40840. [DOI] [PubMed] [Google Scholar]
  • [71].Wang X, Cicuttini F, Jin X, Wluka AE, Han W, Zhu Z, et al. Knee effusion-synovitis volume measurement and effects of vitamin D supplementation in patients with knee osteoarthritis. Osteoarthr Cartil 2017;25:1304–12. 10.1016/j.joca.2017.02.804. [DOI] [PubMed] [Google Scholar]
  • [72].McAlindon T, LaValley M, Schneider E, Nuite M, Lee JY, Price LL, et al. Effect of vitamin D supplementation on progression of knee pain and cartilage volume loss in patients with symptomatic osteoarthritis: a randomized controlled trial. JAMA 2013;309:155–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [73].Cai G, Aitken D, Laslett LL, Pelletier JP, Martel-Pelletier J, Hill C, et al. Effect of Intravenous Zoledronic Acid on Tibiofemoral Cartilage Volume among Patients with Knee Osteoarthritis with Bone Marrow Lesions: A Randomized Clinical Trial. JAMA - J Am Med Assoc 2020;323:1456–66. 10.1001/jama.2020.2938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [74].Seefried L, Genest F, Baumann J, Heidemeier A, Meffert R, Jakob F. Efficacy of Zoledronic Acid in the Treatment of Nonmalignant Painful Bone Marrow Lesions: A Triple-Blind, Randomized, Placebo-Controlled Phase III Clinical Trial (ZoMARS). J Bone Miner Res Off J Am Soc Bone Miner Res 2022;37:420–7. 10.1002/jbmr.4493. [DOI] [PubMed] [Google Scholar]
  • [75].Conaghan PG, Bowes MA, Kingsbury SR, Brett A, Guillard G, Rizoska B, et al. Disease-modifying effects of a novel cathepsin k inhibitor in osteoarthritis: A randomized controlled trial. Ann Intern Med 2020;172:86–95. 10.7326/M19-0675. [DOI] [PubMed] [Google Scholar]
  • [76].Pelletier J-P, Roubille C, Raynauld J-P, Abram F, Dorais M, Delorme P, et al. Disease-modifying effect of strontium ranelate in a subset of patients from the Phase III knee osteoarthritis study SEKOIA using quantitative MRI: reduction in bone marrow lesions protects against cartilage loss. Ann Rheum Dis 2015;74:422–9. 10.1136/annrheumdis-2013-203989. [DOI] [PubMed] [Google Scholar]
  • [77].Hill CL, Hunter DJ, Niu J, Clancy M, Guermazi A, Genant H, et al. Synovitis detected on magnetic resonance imaging and its relation to pain and cartilage loss in knee osteoarthritis. AnnRheum Dis 2007;66:1599–603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [78].Dore D, Quinn S, Ding C, Winzenberg T, Zhai G, Cicuttini F, et al. Natural history and clinical significance of MRI-detected bone marrow lesions at the knee: a prospective study in community dwelling older adults. Arthritis ResTher 2010;12:R223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [79].Zhang Y, Nevitt M, Niu J, Lewis C, Torner J, Guermazi A, et al. Fluctuation of knee pain and changes in bone marrow lesions, effusions, and synovitis on magnetic resonance imaging. Arthritis Rheum 2011;63:691–9. 10.1002/art.30148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [80].Miller RE, Ishihara S, Tran PB, Golub SB, Last K, Miller RJ, et al. An aggrecan fragment drives osteoarthritis pain through Toll-like receptor 2. JCI Insight 2018;3. 10.1172/jci.insight.95704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [81].Boudreau RM, Hunter DJ, Wang Z, Roemer FW, Eckstein F, Hannon MJ, et al. A virtual knee joint replacment clinical endpoint based on longitudinal trends and thresholds in KOOS knee pain and funtion in Osteoarthritis Initiative participants. Arthritis Rheum 2012;64 [Suppl.:S1040 [Abstract]. [Google Scholar]

RESOURCES