Skip to main content
Osteoarthritis and Cartilage Open logoLink to Osteoarthritis and Cartilage Open
. 2024 Feb 24;6(2):100448. doi: 10.1016/j.ocarto.2024.100448

Radiomics analysis of patellofemoral joint improves knee replacement risk prediction: Data from the Multicenter Osteoarthritis Study (MOST)

Jiang Zhang a,f, Tianshu Jiang b,f, Lok-Chun Chan b, Sing-Hin Lau b, Wei Wang b, Xinzhi Teng a, Ping-Keung Chan c, Jing Cai a,d,e, Chunyi Wen b,d,∗,e
PMCID: PMC10910336  PMID: 38440779

Abstract

Objective

Knee replacement (KR) is the last-resort treatment for knee osteoarthritis. Although radiographic evidence of tibiofemoral joint has been widely adopted for prognostication, patellofemoral joint has gained little attention and may hold additional value for further improvements. We aimed to quantitatively analyse patellofemoral joint through radiomics analysis of lateral view radiographs for improved KR risk prediction.

Design

From the Multicenter Osteoarthritis Study dataset, we retrospectively retrieved the initial-visit lateral left knee radiographs of 2943 patients aged 50 to 79. They were split into training and test cohorts at a 2:1 ratio. A comprehensive set of radiomic features were extracted within the best-performing subregion of patellofemoral joint and combined into a radiomics score (RadScore). A KR risk score, derived from Kellgren-Lawrence grade (KLG) of tibiofemoral joint and RadScore of patellofemoral joint, was developed by multivariate Cox regression and assessed using time-dependent area under receiver operating characteristic curve (AUC).

Results

While patellofemoral osteoarthritis (PFOA) was insignificant during multivariate analysis, RadScore was identified as an independent risk factor (multivariate Cox p-value < 0.001) for KR. The subgroup analysis revealed that RadScore was particularly effective in predicting rapid progressor (KR occurrence before 30 months) among early- (KLG < 2) and mid-stage (KLG ​= ​2) patients. Combining two joints radiographic information, the AUC reached 0.89/0.87 for predicting 60-month KR occurrence.

Conclusions

The RadScore of the patellofemoral joint on lateral radiographs emerges as an independent prognostic factor for improving KR prognosis prediction. The KR risk score could be instrumental in managing progressive knee osteoarthritis interventions.

Keywords: Patellofemoral joint, Osteoarthritis, Knee replacement, Lateral knee radiograph, Radiomics

1. Introduction

Knee osteoarthritis (KOA) is a leading cause of chronic pain and disability in older adults [1]. Knee replacement (KR) surgery is often the last resort of treatment for KOA once the joint is rapidly destructed [2,3]. Therefore, early prediction of KOA progression and KR risk is highly desired to enable timely intervention and effective management of disease deterioration. Some studies have predicted KR using demographics and clinical data, highlighting the importance of the Kellgren-Lawrence grading (KLG) in disease prognosis [4,5]. With advancements in medical imaging analysis technology, several studies directly analysed medical images, primarily Magnetic Resonance Imaging (MRI) and X-ray, for KR prediction [[6], [7], [8], [9], [10], [11], [12]]. One study analysed the shape of MRI femur bone from the Osteoarthritis Initiative (OAI) dataset. It proposed a B-score, yielding a modest improvement in KR prediction with an area under receiver operating characteristic curve (AUC) of 0.85 when combined with the KLG compared to 0.83 of KLG alone [6]. Based on the same dataset, a multi-task deep learning-based model has been developed based on posteroanterior (PA) view TF joint radiographs, achieving an AUC of 0.87 for KR prediction [9]. Notably, a recent study comparing MRI and radiographs from both OAI and Multicenter Osteoarthritis Study (MOST) datasets found that, despite the superior soft tissue contrast of MRI, its predictive performance was only marginally better than radiographs (AUC: 0.89 vs. 0.87) [7]. The majority of these studies have exploited the prognostic value of tibiofemoral (TF) joint in KR surgery. However, while advanced algorithms and various imaging modalities have been used for KOA progression and KR risk, performance improvements remain marginal when focusing solely on the TF joint.

In the meantime, a growing body of evidence suggests the prognostic value of the patellofemoral (PF) joint, the other essential compartment of the knee, in KOA progression. PF osteoarthritis (PFOA) is a highly prevalent disease [13,14] that often develops prior to TF osteoarthritis (TFOA) [15] and leads the way in entire knee joint degeneration [16]. Intriguingly, PFOA exhibits a stronger association with KOA symptomatic deterioration than TFOA [17]. Very recently, radiographic PFOA at baseline was associated with a higher likelihood of KR in 10 years [18]. All the evidence suggests that PF joint status may also render high clinical value in early detection and intervention for progressive KOA.

However, technical challenges remain in how to assess PF joint in a reliable and accurate manner for KR risk evaluation. First, there are no standard assessment criteria for PFOA. Most studies directly employed the KLG system, considering both joint space narrowing and the presence of osteophytes on lateral and/or skyline radiographs [14]. Second, a high inter-observer variability on joint spacing narrowing was reported on lateral radiographs due to the interference of overlapping [19]. On the other hand, skyline view radiographs are more effective than lateral X-rays in thoroughly assessing the extent of patellofemoral joint issues. However, they have been less commonly used by surgeons in the past decades, while lateral X-rays are preferred [20]. This preference has probably led to a lack of skyline view radiographs in public datasets such as the Osteoarthritis Initiative (OAI) and the Multicenter Osteoarthritis Study (MOST). MRI provides excellent detail for studying knee degradation, proving valuable in identifying subchondral bone marrow lesions [21] and cartilage defects [22], and thus informing the development of new biomarkers for KOA [23]. However, due to its high cost, limited accessibility, and risk of overdiagnosis, MRI is not deemed suitable for routine clinical diagnosis or as the primary tool for image biomarker development in everyday clinical practice [24,25]. As such, there is a pressing demand to effectively extract radiographic information on PF joint from the widely available lateral knee radiographs and further enhance the KR risk prediction performance.

In this context, radiomics has been identified as a potentially valuable tool in analysing KOA imaging by leveraging high-throughput extraction of quantitative features from medical images [26]. It could capture additional diagnostic or prognostic information that might not be easily perceived through visual inspection, which can serve as a more objective and quantifiable method to forecast disease progression patterns. This methodology has been proven successful in various areas of clinical research, including oncology, neurology, and increasingly in musculoskeletal disorders such as osteoarthritis.

Given the above, the purpose of this study is to characterise the PF joint using radiomics from lateral knee radiographs for KR prognosis and establish a KR risk score by incorporating demographic and two joints’ radiographic information. By leveraging the radiomics approach, we attempted to extract quantitative radiographic information on the PF joint from lateral knee radiographs, which is prognostic and complementary to the existing KLG, thereby enhancing the KR risk prediction performance and improving patient care.

2. Method

2.1. Data collection

A total of 3026 patients aged between 50 and 79 were retrospectively recruited from the MOST dataset [27], with 2943 remaining after applying the exclusion criteria (Fig. 1(a)). Patients were then randomly split (2:1) into one training cohort (n ​= ​1962) and one testing cohort (n ​= ​981). We collected the initial visit left knee weight-bearing lateral radiographs and numerical data, including left knee KLG, KR time and event, and demographic information (age, gender, body mass index (BMI)). Left knee KLG at follow-up visits were also collected. The MOST study was approved by local institutional review boards, and all participants gave written consent at the initial visit. Details about the participants and variable acquisitions can be found in the Supplementary Materials.

Fig. 1.

Fig. 1

(a) Cohort exclusion criteria and (b) the workflow for knee replacement (KR) risk score development and risk stratification. Tibial-femoral joint Kellgren-Lawrence grade (KLG) was assessed on the anteroposterior view of radiographs. Demographic information includes age, gender, and body mass index (BMI).

2.2. Patient and public involvement

It was impossible to involve patients or the public in the design, conduct, reporting, or dissemination plans of our research since it is a secondary analysis of the MOST study.

2.3. Study design

Lateral knee radiographs were analysed quantitatively via radiomics, where a large quantity of pre-defined image features was extracted and correlated with KR occurrence at 60-month follow-up. The resulting Radiomics score (RadScore) of PF joint was combined with KLG of TF joint as well as independent prognostic demographic information to generate a comprehensive KR risk score. A three-class risk stratification system was then established based on the KR risk score by optimising two cutoff thresholds targeting different speeds of disease progression. Workflows of KR risk score development and risk stratification are illustrated in Fig. 1(b). The RadScore, KR risk score, and stratification thresholds were solely developed from the training cohort and validated in both the training and testing cohort.

2.4. Image processing and region-of-interest (ROI) generation

In order to minimise noise and enhance reproducibility, images were preprocessed by patella alignment, signal enhancement, and noise reduction before feature extraction. First, all patella segmentations were manually drawn by professional radiologists with the Computer Vision Annotation Tool (CVAT). To ensure the vertical alignment of the patella, image orientations were adjusted until the maximum ratio between the vertical and horizontal size of the patella bounding box was reached. Images were first resampled into an isotropic resolution of 0.5 ​mm ​× ​0.5 ​mm to harmonise physical spacing and reduce noise. Contrasts within the patella region were enhanced by Z-score normalisation and thresholding by six standard deviations. The gray levels within the patella bounding box were reduced to 32 bins to further compress the noise.

Three equal-sized rectangular regions: ROIsup, ROImid, and ROIinf were automatically generated as ROIs to cover three different areas of the PF joint for feature extraction, as demonstrated by an example in Fig. 2(a). They were defined as equal partitions of one-third of the patella bounding box closest to the trochlear area, with a double-size extension towards the posterior direction. This ROI definition was adopted due to its simplicity in clinical application and reduced susceptibility to different observers. All the image processing and ROI generation procedures were performed by the Python package SimpleITK (version 2.2.1) [28].

Fig. 2.

Fig. 2

(a) Region-of-interest (ROI) segmentation of one example patient. (b) Bar plot of RadScore AUC of each ROI in prediction 60-month KR (right). ROIinf, which is located at the inferior region of the patellofemoral joint and marked by green rectangles, achieved the best performance in both training and testing.

2.5. Feature extraction

A comprehensive set of hand-crafted first-order and texture radiomic features were extracted from each ROI. Specifically, 93 first-order and texture features were calculated following the protocol specified in the Image Biomarker Standardization Initiative (IBSI) [29]. In addition to the original image features, advanced radiomic features were extracted from seven filtered images before gray-level reduction. Both image filtering and radiomic feature extraction were performed by PyRadiomics (version 3.0.0) [30]. In total, 930 radiomics were extracted for each ROI. Detailed settings of the radiomic feature extraction can be found in Supplementary Material Table S1.

2.6. RadScore development

Radiomic features were selected separately for each ROI based on inter-feature redundancy and relevancy to 60-month KR to increase the robustness and generalizability of RadScore. They were ranked using the “minimum Redundancy Maximum Relevance” (mRMR) algorithm [31]. The top five features were chosen for RadScore developments.

RadScores were modelled from the selected radiomic features independently for the three ROIs, and the best-performing one was selected as the final RadScore. Binary classification models based on Ridge regression were fitted on 60-month left KR occurrence using the features normalised by Z-score. An easy-ensemble approach was adopted to account for the highly imbalanced distribution of KR, where 500 sub-models were trained under random subsampling and combined to give the final probability prediction. It was performed by the Python package imbalance-learn (version 0.10.1) [32]. The final RadScore was determined as the one with the highest area under the receiver operating characteristic curve (AUC) on 60-month KR occurrence.

2.7. KR risk score development and risk stratification

A comprehensive KR risk score was constructed by combining RadScore, KLG, and independently prognostic demographic factors using a multivariate Cox regression model. Patients were stratified into three risk levels based on the KR risk score. The stratification system was designed by optimising two thresholds in classifying.

  • Non-progressors: did not receive KR within 84 months (KR- (84 ​m))

  • Slow progressors: received KR between 30 and 84 months (KR+ (30 m – 84 ​m))

  • Fast progressors: received KR on and before 30 months (KR+ (30 ​m))

The threshold optimisation was performed by maximising Youden's index.

2.8. Statistical analyses

In order to evaluate the independent prognostic values of the constructed RadScore, univariate and multivariate Cox regression was used to assess the hazard ratios (HRs) and p-values of each KR risk factor. Subgroup analyses were also performed where the time-dependent receiver operating characteristic curves (ROCs) and their AUCs were evaluated within early-stage (KLG < 2), mid-stage (KLG ​= ​2), and late-stage patients (KLG > 2).

The current clinical model that combines the TF and PF joint information was also constructed by joining baseline KLG and PFOA with Cox regression. Performance of the baseline KLG, RadScore, PFOA ​+ ​KLG, and KR risk score were evaluated by concordance index (C-index) and time-dependent ROCs and their AUCs at 30, 60, and 84 months, followed by statistical comparison with the KR risk score. A 95% confidence interval (95CI) was given for each C-index by 1000-iteration bootstrapping. P-values for C-index comparisons were calculated by permutation test (one-sided), where the labels were randomly permutated between two groups 1000 times. 95CI estimation and permutation test were performed by Ref. [33] in Python.

The KR risk stratification performance was evaluated by Kaplan-Meier (KM) analysis and further compared with KLG by confusion matrix in classifying the non-, slow, and fast progressors. KM analysis was performed by the Python package lifelines (version 0.27.4) [34].

3. Results

3.1. Patient characteristics

Distributions of the KR risk factors used in this study and 84-month left KR occurrence were listed in Table 1. Statistically lower baseline BMI (p-value ​= ​0.020) was recorded from the training patients compared to testing. The rest of the characteristics, including age, gender, KLG, KR event, and follow-up duration had similar distributions between the two patient cohorts. For patients without KR records in training/testing cohorts, 17/8 patients lost follow-up before 30 months, 42/18 before 60 months, and 79/35 before 84 months.

Table 1.

Distributions of the included knee replacement risk factors of the training and testing patients.

Parameter Training Testing p-value
Patient No. 1962 981
Age
 Mean 62.42 62.40 0.951
 Range 50–79 50–79
Gender
 Male 1177 587 0.968
 Female 785 394
BMI
 Mean 30.45 30.99 0.020
 Range 16.72–57.83 18.50–71.91
Baseline KLG
 0 882 417 0.251
 1 347 165
 2 293 143
 3 294 165
 4 146 91
Baseline PFOA
 0 1546 755 0.519
 1 283 155
 Missing 133 71
84-month KR
 0 1753 876 1.000
 1 209 105
KR follow-up time
 Medium 84 84 0.536
 Range 2–97 3–101

Note: p-values were acquired by Student t-test for continuous variables and nominal variables with > 5 levels, including age, BMI, and KR follow-up time. The rest of the nominal and categorical variables were compared by the Chi-square test.

Abbreviations: KR, knee replacement; BMI, body mass index; KLG, Kellgren and Lawrence grade; PFOA: patellofemoral osteoarthritis.

3.2. RadScore composition

Heterogeneous RadScore performances were found among the three ROIs in 60-month KR prediction (Fig. 2(b)). ROIinf, covering the inferior PF joint area, reached the highest 60-month KR prediction performance with training and testing AUCs of 0.81 and 0.82, respectively. Therefore, ROIinf was chosen as the ROI for the final RadScore for KR risk score development. Details of the final RadScore compositions can be found in Supplementary Matderial Table S2.

3.3. RadScore's independence and predictive value

The PF joint RadScore is an independent risk factor (p-value < 0.001) for KR in both univariate and multivariate settings, as reported in Table 2. During the univariate test, all the demographic and radiographic factors were significantly associated with KR in training and testing. However, only KLG and RadScore persisted as independent prognostic factors in both training and testing. Notably, PFOA did not demonstrate independent prognostic value with the presence of RadScore.

Table 2.

Univariate and multivariate survival analysis results of the final RadScore, baseline KLG, and other knee replacement risk factors in training and testing.

Cohort Risk factor Univariate
Multivariate
HR (95CI) p-value HR (95CI) p-value
Training RadScore 2.49 (2.19–2.83) < 0.001 1.45 (1.21–1.73) < 0.001
Age 1.04 (1.02–1.05) < 0.001 1.01 (0.99–1.03) 0.270
Gender 0.74 (0.56–0.99) 0.046 0.81 (0.60–1.09) 0.172
BMI 1.06 (1.04–1.08) < 0.001 0.99 (0.97–1.01) 0.409
KLG 2.53 (2.25–2.84) < 0.001 1.88 (1.63–2.17) < 0.001
PFOA 4.72 (3.55–6.26) < 0.001 1.19 (0.85–1.68) 0.304
Testing RadScore 2.29 (1.91–2.75) < 0.001 1.40(1.09–1.80) 0.009
Age 1.04 (1.01–1.06) 0.002 1.00 (0.98–1.03) 0.745
Gender 0.61 (0.40–0.93) 0.022 0.67 (0.43–1.04) 0.073
BMI 1.05 (1.02–1.07) < 0.001 0.99 (0.96–1.02) 0.682
KLG 2.39 (2.02–2.82) < 0.001 2.02 (1.65–2.47) < 0.001
PFOA 4.17 (2.79–6.23) < 0.001 1.12 (0.69–1.83) 0.635

Note: Univariate and multivariate survival analyses were performed by Cox regression. P-value less than 0.05 (bolded) was considered significant.

Abbreviations: RadScore, radiomics score; HR, hazard ratio; 95CI, 95% confidence interval; BMI, body mass index; KLG, Kellgren and Lawrence grade; PFOA, patellofemoral osteoarthritis.

During subgroup analysis, RadScore demonstrated high predictive values for fast progressors (KR+ (30 ​m)) within early-stage patients (KLG < 2) with AUC ​= ​0.94/0.75 (training/testing) (Fig. 3). It also had high discriminative power for KR at all time points within mid-stage patients (KLG ​= ​2), and the highest AUCs (0.83/0.79) were also achieved in predicting 30-month KR occurrence.

Fig. 3.

Fig. 3

The receiver operating characteristic curves of RadScore in predicting 30-, 60-, and 84-month KR classification in training and testing under different disease stages at baseline visit. High predictive values can be observed within the early-stage (KLG < 2) and mid-stage (KLG ​= ​2) for predicting fast disease progressions (30-/60-month KR).

3.4. Optimal prognostic performance by KR risk score

The KR risk score achieved the best KR prognosis performance by combining RadScore and KLG. Table 3 reports the performance comparison by C-index and time-dependent AUCs among KLG, RadScore, PFOA ​+ ​KLG, and KR risk score (RadScore ​+ ​KLG). KLG itself had a C-index of 0.83 in training and 0.82 in testing, which were significantly improved by the addition of RadScore to 0.85/0.84 (p-value ​= ​0.003/0.002). Similar trends were observed in time-dependent AUCs where the highest was achieved by the KR risk score (30-month: 0.91/0.83, 60-month: 0.89/0.87, 84-month: 0.86/0.86). The KR risk score also achieved significantly higher C-index values than the PFOA ​+ ​KLG model in both the training (p-value ​= ​0.035) and testing set (p-value ​= ​0.011). Hazard ratios and p-values of the covariates of the KR risk score are presented in Table S3.

Table 3.

Training and testing performance of three knee replacement risk prediction models.

Model Training
Testing
C-index p-value C-index p-value

KLG 0.83 (0.81–0.86) 0.003 0.82 (0.78–0.86) 0.002
RadScore 0.78 (0.75–0.81) <0.001 0.78 (0.74–0.82) 0.018
PFOA ​+ ​KLG 0.84 (0.81–0.86) 0.035 0.82 (0.78–0.86) 0.011
KR risk score (RadScore ​+ ​KLG)
0.85 (0.82–0.87)
-
0.84 (0.80–0.87)
-

30 ​m AUC
p-value
30 ​m AUC
p-value
KLG 0.90 (0.86–0.92) 0.007 0.80 (0.73–0.87) 0.020
RadScore 0.83 (0.79–0.88) 0.015 0.81 (0.74–0.87) 0.290
PFOA ​+ ​KLG 0.90 (0.87–0.92) 0.023 0.81 (0.74–0.88) 0.191
KR risk score (RadScore ​+ ​KLG)
0.91 (0.89–0.94)
-
0.83 (0.77–0.89)
-

60 ​m AUC
p-value
60 ​m AUC
p-value
KLG 0.87 (0.85–0.90) <0.001 0.84 (0.79–0.89) <0.001
RadScore 0.82 (0.78–0.85) <0.001 0.83 (0.78–0.87) 0.123
PFOA ​+ ​KLG 0.88 (0.85–0.91) 0.045 0.85 (0.79–0.90) 0.002
KR risk score (RadScore ​+ ​KLG)
0.89 (0.87–0.92)
-
0.87 (0.83–0.91)
-

84 ​m AUC
p-value
84 ​m AUC
p-value
KLG 0.84 (0.81–0.87) 0.008 0.84 (0.80–0.87) 0.003
RadScore 0.78 (0.74–0.81) <0.001 0.79 (0.74–0.83) 0.009
PFOA ​+ ​KLG 0.85 (0.82–0.87) 0.093 0.84 (0.80–0.88) 0.007
KR risk score (RadScore ​+ ​KLG) 0.86 (0.83–0.88) - 0.86 (0.82–0.89) -

Note: One-sided p-values were calculated by permutation test with 1000 iterations. P-value less than 0.05 (bolded) was considered significant. Significant performance improvements can be observed when combining RadScore with KLG, compared to using KLG alone.

Abbreviations: C-index, concordance index; RadScore, radiomics score; KLG, Kellgren and Lawrence grade; 30 ​m, 30-month; 60 ​m, 60-month; 84 ​m, 84-month.

3.5. Risk stratification and survival analysis

Significant KR risk score differences were detected among the three follow-up time points, as shown in Fig. 4(a). The KR risk scores of non-progressive patients at 84 months were the lowest, with an average value of 0.74. They were significantly higher (p-value < 0.001) for slow progressors (KR+ (30m–84 ​m)) with an average value of 2.42. Fast progressors (KR+ (30 ​m)) achieved the highest average KR risk score (2.96), which is significantly higher than slow progressors (p-value < 0.001).

Fig. 4.

Fig. 4

(a) KR risk score (RadScore ​+ ​KLG) distribution comparisons among non-progressors within 84 months (KR- (84 ​m)), slow progressors later than 30 months (KR+ (30 m – 84 ​m)), and fast progressors earlier than 30 months (KR+ (30 ​m)). (b) Knee replacement-free survival (KRFS) curves of the low risk (green), medium risk (orange), and high risk (red) patients from Kaplan-Meier analysis on the entire patient cohort.

Three risk groups were stratified based on the optimised KR risk score thresholds of 1.86 and 3.11 (Fig. 4(a), dashed lines), and distinct survival patterns were observed among the three risk groups Fig. 4(b). Patients with KR risk score less than 1.86 were classified as low risk (n ​= ​2110) with minimum risk of KR progression within 84 months (6%), as drawn by the survival curves in Fig. 4(b) and confusion matrix in Fig. 5. Meanwhile, patients with the score of more than 1.86 but less than 3.11 were classified as medium risk (n ​= ​582), showing a relatively higher risk of KR within 84 months (25%), but the fast progression (KR+ (30 ​m)) rate remained as low as 7%. The high-risk group of patients (n ​= ​251) who had the score greater than 3.11 demonstrated the highest risk of receiving KR within 84 months (48%) and 30 months (19%). In contrast, only 11% and 3% of KLG ​= ​2 patients were slow and fast progressors, respectively. Although similar rates of slow (49%) and fast (20%) progressors were achieved by the KLG of 4, more progressive patients were identified by the proposed high-risk criteria. Specifically, the positive predictive value (PPV) of our RadScore for predicting knee replacement (KR) was 46.41%, significantly outperforming the KLG's PPV of 34.54%. Furthermore, in predicting KR within 30 months, the RadScore's PPV was 27.01%, compared to only 11.61% for the KLG. These findings underscore the enhanced precision of RadScore over KLG in identifying patients at high risk of both overall KR and rapid progression to KR within a shorter timeframe.

Fig. 5.

Fig. 5

Confusion matrix of the proposed stratification system and KLG in predicting the three KR progression speeds.

4. Discussion

This study, for the first time, highlights the importance of quantitative analysis of PF joint from lateral knee radiographs in KR prediction. It also provides a comprehensive tool incorporating TF and PF joints radiographic information for assisting clinicians in stratifying patients based on disease progression speed. The developed PF joint RadScore was validated as an independent prognostic factor for KR and achieved better KR prognostic performance in early- and mid-stage. The comprehensive KR risk score achieved the highest performance based on the combination of two joints’ radiographic information. Distinct KR-free survival patterns were delineated for the three stratified risk groups, which could benefit precise rehabilitation therapy by prioritising higher risk patients with faster disease progressions.

4.1. Clinical implications

Despite the heterogeneous KR prediction performance, all three regional ROIs of PF joint demonstrated certain prognostic values. Those ROIs are located at the surface between the patella and the femoral notch, known as the trochlea, which is a key area of contact between these bones. According to Wolff's Law, bones adapt to the loads under which they are placed. Therefore, changes in this area can reflect the abnormal stresses on the knee, indicating early signs of OA. Previous research by Bayramoglu et al. once emphasised the importance of ROI location [35]. It confirmed the PFOA diagnostic ability of two lateral patella ROIs at the PF joint margin [36], which was consistent with the ROI definitions in our study. The best-performing ROI was located on the inferior region of the PF joint, with a significant area outside the patella bone. Based on the distinctive patella shape differences observed from the three groups of patients, the final RadScore, built mainly from first-order radiomic features, may capture the patella morphological change due to the altered mechanical loading with knee joint deterioration. Similarly, previous studies have suggested that patella shape and alignment strongly correlate with PFOA, PF joint cartilage defect, and physical activity reduction [[37], [38], [39], [40], [41]]. Such visually appreciable changes were effectively captured and quantified by radiomics, which might reduce the inter-observer variability and improve diagnostic consistency.

4.2. Independent predictors of KR

Results from multivariate analysis suggest that our radiomic characterisation of PF joint on lateral radiographs (RadScore) was independently prognostic to the TF joint KLG, and the integration of RadScore to KLG could significantly boost the performance of KR prediction in the MOST dataset. Despite the limited increments in C-index and AUC values in the entire MOST cohort, our model revealed its unique advantages in predicting fast progressors among early- and mid-stage patients in the subgroup analysis. In contrast, the study that primarily focused on the TF joint demonstrated the highest performance for late-stage patients [7]. This is consistent with previous research conclusions indicating that the PFOA manifests before the TFOA [15]. Predicting fast progressors in the early and mid stages is crucial, as early intervention may alter the disease trajectory and lead to improved outcomes.

On the other hand, demographic information had limited independent prognostic values for KR prediction, and the current clinical diagnostic criteria for the PF joint (PFOA) did not achieve an independent prognostic value. This finding further underlies the importance of PF joint as well as its quantitative characterisation compared to the other risk factors. It may also suggest stronger correlations of PF joint with symptomatic presentations, which is consistent with previous clinical observations [17].

4.3. Limitations of this study

Several limitations in this study in data interpretation shall be fully aware, which warrant further improvements in future investigations. First, only the initial visit radiographs were analysed for KR prediction. A dynamic risk assessment method using image sets from a time series may further improve prediction accuracy. Second, although the MOST dataset is combined by several cohorts, a comprehensive assessment of the proposed patella RadScore and KR risk score on various external datasets with different patient distributions is necessary to further demonstrate the model's generalizability. We have investigated the OAI dataset, but it cannot fulfil our purposes. The dataset lacks sufficient subjects with lateral view X-rays; none of these cases had KR surgery records. Future research could explore alternative datasets or await updated data releases. Third, our machine learning analysis of lateral view radiographs requires patella segmentation, which was achieved by manually contouring. In addition, the KLG of the PA view was acquired by manual reading. A fully automated risk assessment pipeline requires automatic lateral view patella segmentation and quantitative TF and PF joint assessments from the PA view radiographs, which will be conducted in the next stage of our research.

5. Conclusion

In summary, we developed a PF joint RadScore on lateral knee radiographs, which was validated as an independent prognostic factor to predict KR risk among KOA patients. The KR risk score that incorporates TF and PF joints radiographic information achieved the best KR prognostic performance. Based on this score, the stratification system could triage KOA patients into three distinct KR-free survival groups to reflect the progress speed. It would serve as a clinical reference to guide exercise or other physical therapy for secondary prevention of KOA deterioration.

Author contributions

Jiang Zhang contributed to the conception and design of the study, analysis and interpretation of the data, and drafting of the article. Tianshu Jiang, Lok-Chun Chan, Sing-Hin Lau, and Wei Wang contributed to the acquisition and preprocessing of the data, logistic support, and critical revision of the article for important intellectual content. Xinzhi Teng provided statistical expertise and critical revision of the article. Ping-Keung Chan, Jing Cai, and Chunyi Wen contributed to the conception and design of the study and critical revision of the article. Chunyi Wen also provided administrative and logistic support and funding for this study. All authors gave final approval of the version to be submitted.

Funding

This study is supported by SZRI start-up (“百城百園”專項啟動基金", I2021A013) and RISports Seed Fund (P0043526), Projects of RISA (P0043001, P0043002), Project of RIAM (P0041372), and Project of Strategic Importance Fund (P0035421) of the Hong Kong Polytechnic University.

Code availability

The code can be downloaded publicly at https://github.com/John136219655/PatellaRadiomics. Other information is available upon request to the corresponding author.

Conflict of interest

The authors have no conflicts of interest relevant to this article.

Acknowledgements

The authors would like to thank Ho-Yu Leung, Jin Yan, and Sing-Hin Lau for the organisation of patient's demographic information, Ziheng Qin and Yilan Lin for segmentation of patella.

Handling Editor: Professor H Madry

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.ocarto.2024.100448.

Appendix A. Supplementary data

The following is the Supplementary data to this article:

Multimedia component 1
mmc1.docx (22.5KB, docx)

References

  • 1.Cross M., Smith E., Hoy D., Nolte S., Ackerman I., Fransen M., et al. The global burden of hip and knee osteoarthritis: estimates from the global burden of disease 2010 study. Ann. Rheum. Dis. 2014;73(7):1323–1330. doi: 10.1136/annrheumdis-2013-204763. [DOI] [PubMed] [Google Scholar]
  • 2.Cheung A., Chan P.K., Fu H., Cheung M.H., Chan V.W.K., Yan C.H., et al. Total knee arthroplasty is safe for patients aged ≥80 years in Hong Kong. Hong Kong Med. J. Xianggang yi xue za zhi. 2021;27(5):350–354. doi: 10.12809/hkmj208942. [DOI] [PubMed] [Google Scholar]
  • 3.Dieppe P., Basler H.D., Chard J., Croft P., Dixon J., Hurley M., et al. Knee replacement surgery for osteoarthritis: effectiveness, practice variations, indications and possible determinants of utilization. Rheumatology. 1999;38(1):73–83. doi: 10.1093/rheumatology/38.1.73. [DOI] [PubMed] [Google Scholar]
  • 4.Heisinger S., Hitzl W., Hobusch G.M., Windhager R., Cotofana S. Predicting total knee replacement from symptomology and radiographic structural change using artificial neural networks—data from the osteoarthritis initiative (OAI) J. Clin. Med. 2020;9(5):1298. doi: 10.3390/jcm9051298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Jafarzadeh S., Felson D.T., Nevitt M.C., Torner J.C., Lewis C.E., Roemer F.W., et al. Use of clinical and imaging features of osteoarthritis to predict knee replacement in persons with and without radiographic osteoarthritis: the most study. Osteoarthritis Cartilage. 2020;28:S308–S309. [Google Scholar]
  • 6.Bowes M.A., Kacena K., Alabas O.A., Brett A.D., Dube B., Bodick N., et al. Machine-learning, MRI bone shape and important clinical outcomes in osteoarthritis: data from the Osteoarthritis Initiative. Ann. Rheum. Dis. 2021;80(4):502–508. doi: 10.1136/annrheumdis-2020-217160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Rajamohan H.R., Wang T., Leung K., Chang G., Cho K., Kijowski R., et al. Prediction of total knee replacement using deep learning analysis of knee MRI. Sci. Rep. 2023;13(1):6922. doi: 10.1038/s41598-023-33934-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Liu Q., Chu H., LaValley M.P., Hunter D.J., Zhang H., Tao L., et al. Prediction models for the risk of total knee replacement: development and validation using data from multicentre cohort studies. Lancet Rheumatol. 2022;4(2):e125–e134. doi: 10.1016/S2665-9913(21)00324-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Leung K., Zhang B., Tan J., Shen Y., Geras K.J., Babb J.S., et al. Prediction of total knee replacement and diagnosis of osteoarthritis by using deep learning on knee radiographs: data from the osteoarthritis initiative. Radiology. 2020;296(3):584–593. doi: 10.1148/radiol.2020192091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tiulpin A., Thevenot J., Rahtu E., Lehenkari P., Saarakkala S. Automatic knee osteoarthritis diagnosis from plain radiographs: a deep learning-based approach. Sci. Rep. 2018;8(1):1727. doi: 10.1038/s41598-018-20132-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Tolpadi A.A., Lee J.J., Pedoia V., Majumdar S. Deep learning predicts total knee replacement from magnetic resonance images. Sci. Rep. 2020;10(1):6371. doi: 10.1038/s41598-020-63395-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Tiulpin A., Saarakkala S., Mathiessen A., Hammer H.B., Furnes O., Nordsletten L., et al. Predicting total knee arthroplasty from ultrasonography using machine learning. Osteoarthritis Cartilage Open. 2022;4(4) doi: 10.1016/j.ocarto.2022.100319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Duncan R.C., Hay E.M., Saklatvala J., Croft P.R. Prevalence of radiographic osteoarthritis—it all depends on your point of view. Rheumatology. 2006;45(6):757–760. doi: 10.1093/rheumatology/kei270. [DOI] [PubMed] [Google Scholar]
  • 14.Hart H.F., Stefanik J.J., Wyndow N., Machotka Z., Crossley K.M. The prevalence of radiographic and MRI-defined patellofemoral osteoarthritis and structural pathology: a systematic review and meta-analysis. Br. J. Sports Med. 2017;51(16):1195–1208. doi: 10.1136/bjsports-2017-097515. [DOI] [PubMed] [Google Scholar]
  • 15.Duncan R., Peat G., Thomas E., Hay E.M., Croft P. Incidence, progression and sequence of development of radiographic knee osteoarthritis in a symptomatic population. Ann. Rheum. Dis. 2011;70(11):1944–1948. doi: 10.1136/ard.2011.151050. [DOI] [PubMed] [Google Scholar]
  • 16.de Lange-Brokaar B.J.E., Bijsterbosch J., Kornaat P.R., Yusuf E., Ioan-Facsinay A., Zuurmond A.M., et al. Radiographic progression of knee osteoarthritis is associated with MRI abnormalities in both the patellofemoral and tibiofemoral joint. Osteoarthritis Cartilage. 2016;24(3):473–479. doi: 10.1016/j.joca.2015.09.021. [DOI] [PubMed] [Google Scholar]
  • 17.Culvenor A.G., Lai C.C.H., Gabbe B.J., Makdissi M., Collins N.J., Vicenzino B., et al. Patellofemoral osteoarthritis is prevalent and associated with worse symptoms and function after hamstring tendon autograft ACL reconstruction. Br. J. Sports Med. 2014;48(6):435–439. doi: 10.1136/bjsports-2013-092975. [DOI] [PubMed] [Google Scholar]
  • 18.Macri E.M., van Middelkoop M., Damen J., Bos P.K., Bierma-Zeinstra S.M.A. Higher risk of knee arthroplasty during ten-year follow-up if baseline radiographic osteoarthritis involves the patellofemoral joint: a CHECK Cohort Study. BMC Muscoskel. Disord. 2022;23(1):1–600. doi: 10.1186/s12891-022-05549-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Cicuttini F.M., Baker J., Hart D.J., Spector T.D. Choosing the best method for radiological assessment of patellofemoral osteoarthritis. Ann. Rheum. Dis. 1996;55(2):134–136. doi: 10.1136/ard.55.2.134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Vince A.S., Singhania A.K., Glasgow M.M.S. What knee X-rays do we need? A survey of orthopaedic surgeons in the United Kingdom. Knee. 2000;7(2):101–104. doi: 10.1016/s0968-0160(00)00036-3. [DOI] [PubMed] [Google Scholar]
  • 21.Yusuf E., Kortekaas M.C., Watt I., Huizinga T.W.J., Kloppenburg M. Do knee abnormalities visualised on MRI explain knee pain in knee osteoarthritis? A systematic review. Ann. Rheum. Dis. 2011;70(1):60–67. doi: 10.1136/ard.2010.131904. [DOI] [PubMed] [Google Scholar]
  • 22.Everhart J.S., Abouljoud M.M., Kirven J.C., Flanigan D.C. Full-thickness cartilage defects are important independent predictive factors for progression to total knee arthroplasty in older adults with minimal to moderate osteoarthritis: data from the osteoarthritis initiative. J. Bone Jt. Surg. Am. Vol. 2019;101(1):56. doi: 10.2106/JBJS.17.01657. [DOI] [PubMed] [Google Scholar]
  • 23.Hunter D.J., Guermazi A., Lo G.H., Grainger A.J., Conaghan P.G., Boudreau R.M., et al. Evolution of semi-quantitative whole joint assessment of knee OA: MOAKS (MRI Osteoarthritis Knee Score) Osteoarthritis Cartilage. 2011;19(8):990–1002. doi: 10.1016/j.joca.2011.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Emery C.A., Whittaker J.L., Mahmoudian A., Lohmander L.S., Roos E.M., Bennell K.L., et al. Establishing outcome measures in early knee osteoarthritis. Nat. Rev. Rheumatol. 2019;15(7):438–448. doi: 10.1038/s41584-019-0237-3. [DOI] [PubMed] [Google Scholar]
  • 25.Im G.-I. The concept of early osteoarthritis and its significance in regenerative medicine. Tissue Eng. Regen. Med. 2022;19(3):431–436. doi: 10.1007/s13770-022-00436-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Lambin P., Leijenaar R.T.H., Deist T.M., Peerlings J., De Jong E.E.C., Van Timmeren J., et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 2017;14(12):749–762. doi: 10.1038/nrclinonc.2017.141. [DOI] [PubMed] [Google Scholar]
  • 27.Segal N.A.M.D., Nevitt M.C.P., Gross K.D.M.D., Hietpas J.M.S.W., Glass N.A.M.A., Lewis C.E.M.D., et al. The multicenter osteoarthritis study: opportunities for rehabilitation research. PM & R. 2013;5(8):647–654. doi: 10.1016/j.pmrj.2013.04.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lowekamp B.C., Chen D.T., Ibáñez L., Blezek D. The design of simpleITK. Front. Neuroinf. 2013;7:45. doi: 10.3389/fninf.2013.00045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Zwanenburg A., Vallières M., Abdalah M.A., Aerts H.J.W.L., Andrearczyk V., Apte A., et al. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology. 2020;295(2):328–338. doi: 10.1148/radiol.2020191145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Van Griethuysen J.J.M., Fedorov A., Parmar C., Hosny A., Aucoin N., Narayan V., et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77(21):e104–e107. doi: 10.1158/0008-5472.CAN-17-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ding C., Peng H. Minimum redundancy feature selection from microarray gene expression data. J. Bioinf. Comput. Biol. 2005;3(2):185–205. doi: 10.1142/s0219720005001004. [DOI] [PubMed] [Google Scholar]
  • 32.Lemaître G., Nogueira F., Aridas C.K. Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res. 2017;18:1–5. [Google Scholar]
  • 33.Virtanen P., Gommers R., Oliphant T.E., Haberland M., Reddy T., Cournapeau D., et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods. 2020;17(3):261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Davidson-Pilon C. lifelines: survival analysis in Python. J. Open Source Softw. 2019;4(40):1317. [Google Scholar]
  • 35.Bayramoglu N., Tiulpin A., Hirvasniemi J., Nieminen M.T., Saarakkala S. Adaptive segmentation of knee radiographs for selecting the optimal ROI in texture analysis. Osteoarthritis Cartilage. 2020;28(7):941–952. doi: 10.1016/j.joca.2020.03.006. [DOI] [PubMed] [Google Scholar]
  • 36.Bayramoglu N., Nieminen M.T., Saarakkala S. Machine learning based texture analysis of patella from X-rays for detecting patellofemoral osteoarthritis. Int. J. Med. Inf. 2022;157 doi: 10.1016/j.ijmedinf.2021.104627. [DOI] [PubMed] [Google Scholar]
  • 37.Kalichman L., Zhang Y., Niu J., Goggins J., Gale D., Felson D.T., et al. The association between patellar alignment and patellofemoral joint osteoarthritis features—an MRI study. Rheumatology. 2007;46(8):1303–1308. doi: 10.1093/rheumatology/kem095. [DOI] [PubMed] [Google Scholar]
  • 38.Eijkenboom J.J., Waarsing J.H., Lankhorst N.E., Bierma-Zeinstra S.M., van Middelkoop M. Statistical shape modelling of the patella: patients with patellofemoral pain and patellofemoral osteoarthritis share similar aberrant shape aspects compared to healthy controls. Osteoarthritis Cartilage. 2016;24:S243–S244. [Google Scholar]
  • 39.Liao T.C., Jergas H., Tibrewala R., Bahroos E., Link T.M., Majumdar S., et al. Longitudinal analysis of the contribution of 3D patella and trochlear bone shape on patellofemoral joint osteoarthritic features. J. Orthop. Res. 2021;39(3):506–515. doi: 10.1002/jor.24836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Dai Y., Yin H., Xu C., Zhang H., Guo A., Diao N. Association of patellofemoral morphology and alignment with the radiographic severity of patellofemoral osteoarthritis. J. Orthop. Surg. Res. 2021;16(1):1–548. doi: 10.1186/s13018-021-02681-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Gudas R., Šiupšinskas L., Gudaitė A., Vansevičius V., Stankevičius E., Smailys A., et al. The patello-femoral joint degeneration and the shape of the patella in the population needing an arthroscopic procedure. Medicina. 2018;54(2):21. doi: 10.3390/medicina54020021. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.docx (22.5KB, docx)

Articles from Osteoarthritis and Cartilage Open are provided here courtesy of Elsevier

RESOURCES