Abstract
The aim of the study was to longitudinally investigate symptomatic and structural factors prior to total knee replacement (TKR) surgery in order to identify influential factors that can predict a patient’s need for TKR surgery. In total, 165 participants (60% females; 64.5 ± 8.4 years; 29.7 ± 4.7 kg/m2) receiving a TKR in any of both knees within a four-year period were analyzed. Radiographic change, knee pain, knee function and quality of life were annually assessed prior to the TKR procedure. Self-learning artificial neural networks were applied to identify driving factors for the surgical procedure. Significant worsening of radiographic structural change was observed prior to TKR (p ≤ 0.0046), whereas knee symptoms (pain, function, quality of life) worsened significantly only in the year prior to the TKR procedure. By using our prediction model, we were able to predict correctly 80% of the classified individuals to undergo TKR surgery with a positive predictive value of 84% and a negative predictive value of 73%. Our prediction model offers the opportunity to assess a patient’s need for TKR surgery two years in advance based on easily available patient data and could therefore be used in a primary care setting.
Keywords: Osteoarthritis, Knee, Total Knee Replacement, Neural Networks, Prediction model
1. Introduction
Osteoarthritis (OA) contributes strongly to one individual’s global disability and has been shown to be the leading cause of immobility and impaired health related quality of life in the elderly as compared to any other chronic disease [1,2,3,4]. End-stage osteoarthritis of the knee can be understood as a total organ failure of the synovial knee joint resulting from damage and subsequent loss of function of the involved structures: bone, meniscus, synovium, synovial fluid and cartilage [5]. To date, there is still no curative treatment available, and thus, the ultimate cure remains the total surgical replacement of the affected knee joint.
Radiographic evidence of end-stage knee OA and consistent pain refractory to treatment have been postulated to be the leading key indicators for the taking the decision for total knee replacement (TKR) surgery [6,7]. Of those two criteria, the x-ray imaging-based assessment of the knee joint is the more objective and reliable method and has been shown to serve as a good parameter when trying to evaluate the patient’s need for TKR [8,9,10,11,12]. However, as it has been shown that the x-ray status of the knee does not fully reflect one patient’s pain and symptom status, there is a need to include symptom parameters such as pain, loss of function and quality of life and to balance each factor on an individual basis when taking the decision for performing TKR surgery [13].
A previous report from 2003 by Escobar et al. tried to identify factors for the appropriateness of the indication for TKR in patients with osteoarthritis by using a modified Delphi panel judgment based on outcome measures and confirmed that symptomatology and radiology explained most of the variability in the appropriateness of indication [14]. A previous study by Hitzl et al. [15] revealed that individuals receiving a TKR within a period of 4 years displayed a significantly reduced cartilage thickness at base-line when compared to controls, indicating a change in structure; here, the magnetic resonance imaging (MRI) parameter is a predicting factor that can guide a surgeon’s decision for performing TKR surgery.
Various studies have determined the radiographic status based on conventional radiographs and MRI to be predictive for TKR [16,17,18,19]. Other studies, conducted by various authors, focused on knee specific symptomology and demographic factors [18,19,20].
To date, no study analyzed in detail the longitudinal change in radiographic structural change as assessed via Kellgren and Lawrence grades and knee specific symptomology, i.e., pain, function and quality of life in a 4-year period prior to the TKR surgery. Therefore, the aims of the present study were to investigate structural, symptomatic, socio-economic and demographic factors in a four-year period prior to TKR surgery in order to identify which of those factors were of influence to a surgeon’s decision to performing TKR surgery. We additionally performed statistical analyses based on self-learning neural networks to design a model based on the factors described above for predicting a patient’s need for TKR.
2. Material and Methods
2.1. The Osteoarthritis Initiative
The Osteoarthritis Initiative (OAI) is a National Institutes of Health (NIH) and privately funded longitudinal cohort study of subjects with, or at risk of developing, symptomatic knee OA [10,11]. Data used in the preparation of this article were obtained from the Osteoarthritis Initiative (OAI) database, which is available for public access at https://nda.nih.gov/oai/. Specific datasets used are OAI, version 3.2.1 clinical data, San Francisco, CA, USA [21]. Individuals of multiple ethnicities, aged 45–75 years, were recruited for study purposes. Subjects were excluded from the OAI if the following conditions were present: rheumatoid arthritis, bilateral knee arthroplasty or pre-existing plans to undergo bilateral (not unilateral) knee arthroplasty in the next 3 years, bilateral Osteoarthritis Research Society International (OARSI) stage 3 (severe) knee OA [12], positive pregnancy test, inability to provide a blood sample, use of ambulatory aids other than a single straight cane for >50% of the time, comorbid conditions that might interfere with 4-year participation, unlikely to reside in the clinic area for at least 3 years, current participation in a double-blind randomized controlled trial and being unwilling to sign an informed consent. Men above 130 kg and women above 114 kg were also excluded, due to specific requirements with the acquisition of magnetic resonance images, not relevant to the current study. In total, 27% (n = 4796) of all subjects screened (n = 17,457) were included in the study. The study was approved by the institutional review boards and was conducted in accordance with the Declaration of Helsinki. The clinical data are available for public use at https://nda.nih.gov/oai/. The study design protocol is available at https://oai.epi-ucsf.org/datarelease/docs/StudyDesignProtocol.pdf and has been previously described [14,22,23]. All variables used in this analysis were sampled from the publicly available OAI data base.
2.2. Sample Selection
Out of the 4796 OAI participants, 165 individuals were selected which received a femoro-tibial total knee replacement (TKR) during the interval between the baseline and the 4-year annual screening visit. The presence of a TKR was confirmed via radiographic assessment during a consecutive annual visit for each knee individually. Additionally, knees were classified as valid cases for the purposes of this study when a TKR was documented in the patient’s medical record.
Of those 165 individuals 24 (15%) received a TKR between the baseline and the 1-year screening visit, 33 (20%) between the 1-year and 2-year visit, 40 (24%) between 2-year and 3-year visit, 38 (23%) between 3-year and 4-year visit and 30 individuals (18%) between the 4-year and the 5-year screening visit. Detailed demographic data are shown in Table 1.
Table 1.
Demographics | |
Females (%) | 100 (60) |
Age (SD) | 64.5 (8.4) |
Body Mass Index (SD) | 29.7 (4.7) |
Ethnicity | |
Other Non-White (%) | 2 (1.2) |
Caucasian (%) | 140 (85) |
African American (%) | 20 (12) |
Asian (%) | 2 (1.2) |
Not Available (%) | 1 (0.6) |
Medication | |
Any Medication Used for Pain, Aching or Stiffness (%) | 98 (59) |
Annual Income in United States Dollar (USD) | |
Less Than 10,000 (%) | 2 (1.3) |
10,000 < 25,000 (%) | 16 (9.7) |
25,000 < 50,000 (%) | 41 (25) |
50,000 < 100,000 (%) | 57 (35) |
>100,000 (%) | 39 (24) |
Not available (%) | 10 (6) |
Education | |
Less Than High School (%) | 6 (3.5) |
High School (%) | 31 (19) |
Any College (%) | 44 (27) |
College Graduate (%) | 29 (18) |
Any Graduate School (%) | 12 (7) |
Graduate Degree (%) | 42 (25) |
Not Available (%) | 1 (0.5) |
Depression | |
At Risk for Clinical Depression (%) | 15 (9) |
Start of Knee Symptoms Prior to Baseline Screening Visit | |
None (%) | 11 (6.6) |
0–1 year (%) | 21 (12.6) |
2–5 years (%) | 47 (28.1) |
> 5 years (%) | 67 (40.1) |
Not Available (%) | 21 (12.6) |
Total Knee Replacement at Which Year | |
Between Baseline and 1-Year Follow-up Screening Visit (%) | 24 (15) |
Between 1-Year and 2-Year Follow-up Screening Visit (%) | 33 (20) |
Between 2-Year and 3-Year Follow-up Screening Visit (%) | 40 (24) |
Between 3-Year and 4-Year Follow-up Screening Visit (%) | 38 (23) |
Between 4-Year and 5-Year Follow-up Screening Visit (%) | 30 (18) |
Ethnicity: distribution of ethnical groups among patients; Medication: proportion of patients taking any medication used for pain, aching or stiffness at baseline screening visit; Annual Income In United States Dollar (USD): distribution of annual income among patients at baseline screening visit; Education: distribution of highest educational levels among patients at baseline screening visit; Depression: proportion of patients who are at risk for clinical depression at baseline screening visit; Start of Knee Symptoms Prior to Baseline Screening Visit: distribution of duration of knee symptoms prior to baseline screening visit among patients; Total Knee Replacement at Which Year: distribution of timepoints at which patients received total knee replacement.
2.3. Clinical Data
Rearrangement of the data revealed that knee-specific clinical and imaging information was available for 140 knees 1 year before the TKR, for 107 knees 2 years before TKR, for 68 knees 3 years before TKR and for 30 knees information was available 4 years before TKR surgery (Table 2).
Table 2.
0 Years Prior TKR | 1 Year Prior TKR | 2 Years Prior TKR | 3 Years Prior TKR | 4 Years Prior TKR | |
---|---|---|---|---|---|
n = 165 | n = 140 | n = 107 | n =68 | n =30 | |
Quality of Life | 43 (25–56.3) | 50 (37.5–62.5) | 50 (31.3–62.5) | 50 (37.5–62.5) | 56.3 (43.8–68.8) |
WOMAC Pain Subscore | 8 (4–11) | 5 (3–8) | 5 (2–7.25) | 4 (3–7) | 4 (0–6) |
WOMAC Total Score | 34.3 (24–48) | 26 (12–36.1) | 21.4 (9.2–37.5) | 22.5 (10.4–34.7) | 19 (4–27) |
Pain Intensity | 7 (5–8) | 5 (4–7) | 5 (4–7) | 5 (3–6) | 4 (2–6) |
Kellgren and Lawrence Grades (%) 1 | *** | ** | ** | * | |
0 | 2 (1) | 4 (3) | 3 (3) | 2 (3) | 3 (10) |
1 | 2 (1) | 2 (1) | 2 (2) | 6 (9) | 3 (10) |
2 | 15 (9) | 21 (15) | 21 (19) | 12 (18) | 4 (13) |
3 | 46 (28) | 44 (31) | 33 (31) | 25 (37) | 14 (47) |
4 | 92 (56) | 64 (46) | 47 (44) | 22 (32) | 6 (20) |
Not Available | 8 (5) | 5 (4) | 1 (1) | 1 (1) | 0 (0) |
1 Significant difference were calculated by comparison to the previous years’ visit using a marginal homogeneity test. Significances were: *** p < 0.001; ** 0.001 < p <0.01; * 0.01 < p < 0.05.
Radiographic classification of each knee relied on central radiographic readings (OAI release 0.5) assigning a Kellgren Lawrence grade (KLG) based on osteophytosis and joint space narrowing with higher grades indicating a worse knee state [24]. A knee showing evidence of a KL grade of more than II was classified as having radiographic evidence of knee osteoarthritis (ROA) [25,26].
The person specific quality of life was assessed via a subscore of the Knee Injury and Osteoarthritis Outcome Score (KOOS), ranging from 100–0, with higher values indicating a better quality [27].
The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) total score and pain subscore was used to assess the functional and pain status of each individual knee and ranged from 0–100 or 0–20, respectively, with higher values indicating worse knee status [28].
Knee pain intensity was assessed via the numeric scale, ranging from 0 “no pain” to 10 “extreme pain” [29]. All variables included were publicly available and downloaded from the Osteoarthritis Initiative (OAI, version 3.2.1 clinical data): https://oai.nih.gov/.
Knees were classified as valid cases when a TKR was documented in the patient’s medical record and was additionally confirmed by radiography in the consecutive screening visits.
2.4. Analysis Strategy
Variables of interest were publicly available and sampled from the OAI database. Information obtained from the screening visits prior to the TKR surgery were re-arranged in order to be able to calculate differences between the years prior to the TKR surgery since the surgery could have occurred at any time point within the 4-year observational period. Differences in values were longitudinally compared between the years 0, 1, 2, 3, and 4 prior to TKR surgery. Variables were additionally included in the artificial neural networks in order to identify factors driving doctor’s decision for performing the surgical procedure in the OAI study population.
2.5. Statistical Methods
Data were analyzed for consistency, outliers and for normality using probability plots. Quality of life, WOMAC pain subscore, WOMAC total score and pain intensity were firstly tested globally by using Friedman’s ANOVA, and secondly, pairwise tests were performed by using Wilcoxon’s signed ranked test (based on the Monte Carlo method). 95% confidence intervals for medians and differences of medians were computed by using Hodge-Lehman intervals. KLG distributions were tested by using the marginal homogeneity test for dependent distributions and McNemar’s test.
Artificial neural networks (ANNs) with linear, radial basis function and three-layer perceptron neural networks architectures were used. A total of 14 variables were included into the model and the most predictive group consisting of pain intensity, Kellgren and Lawrence grades, WOMAC total score, used of medication (for pain, aching, or stiffness in the knee) and body mass index were identified by the integrated neural network variable selection algorithm to have the best outcome performance for predicting that a patient will undergo TKR within the next two years. For preprocessing, minimax coding and various activation functions were applied, i.e., linear, logistic and hyperbolic, whereas the linear function was used as a post synaptic potential function. More than 250 neural networks models with different network architecture were created and tested. Backpropagation method was used to train the network. In order to avoid overlearning, data were randomly split into a training, a verification and a test sample (ratio 2:1:1). A variable selection algorithm was used to find a subset of variables with low dimensions with sufficient prediction power. The process of identifying a proper network was monitored in the training, verification and test sample to avoid overlearning [30]. Negative and positive predictive value, sensitivity, specificity, percentage of unclassified and total percentage of correctly predicted patients were analyzed in each of these three samples. Instead of using one threshold, two prediction thresholds—one for acceptance and one for rejection—were used resulting into a grey area, i.e., if data of a subject fall into this area, no prediction is made. p-values less than 5% indicate a statistically significant difference. All analyses were done using PASW 22 (IBM SPSS Statistics for Windows, Version 19.0., Armonk, NY, USA), StatXact 10 (Cytel Software 2013, Cambridge MA, USA), Mathematica 7 (Wolfram Research, Inc., Mathematica, Version 7.0, Champaign, IL, USA), STATISTICA for neural networks 1.2 and STATISTICA 13 (Hill, T. & Lewicki, P. Statistics: Methods and Applications. StatSoft, Tulsa, OK, USA).
3. Results
3.1. Demographic Data
Demographic data of the 165 participants (60% females) included in this analysis revealed a mean age of 64.5 ± 8.4 years with a mean BMI of 29.7 ± 4.7 kg/m2. 85% of the study sample were of Caucasian ethnicity, 59% had an annual income of more than 50,000 USD and 50% had received during their education a college degree or above. Additional demographic data is shown in Table 1.
3.2. Change in Symptomology
Symptoms related to the knee affected by osteoarthritis increased sparsely in severity between 4 and 1 years prior to the TKR procedure for the WOMAC total score, the pain subscore, the knee related quality of life and knee pain intensity without reaching statistical significant worsening as compared between the annual screening visits year 4 to year 1 (Table 2 and Table 3). However, when compared between the years 1 and 0 prior to TKR surgery, symptom status worsened significantly in all of the measured variables: WOMAC total: 9.7 95% CI (7–12.5), p = < 0.0001; WOMAC pain subscore: 0.5 (1.5–3), p = < 0.0001; quality of life 9.4 (6.3–12.6), p = < 0.0001; and pain intensity 1.5 (1–2), p = < 0.0001 (Figure 1).
Table 3.
0 vs. 1 Years | 1 vs. 2 Years | 2 vs. 3 Years | 3 vs. 4 Years | |
---|---|---|---|---|
Median Difference, 95% CI, p-value | ||||
Quality of Life 1 | 9.4 (6.3–12.6) p < 0.0001 *** |
0 (−3.2–3.2) p = 0.73 |
0 (−6.2–6.2) p = 0.99 |
0 (−6.2–12.5) p = 0.32 |
WOMAC Pain Subscore 1 | 0.5 (1.5–3) p < 0.0001 *** |
0.5 (−0.3–1) p = 0.27 |
0 (−1.0–1.0) p = 0.75 |
1.2 (−0.5–3) p = 0.12 |
WOMAC Total Score 1 | 9.7 (7–12.5) p < 0.0001 *** |
2.6 (−0.1–5.4) p = 0.062 |
1.8 (−1.5–4.9) p = 0.24 |
2.3 (−1.8–7.2) p = 0.23 |
Pain Intensity 1 | 1.5 (1–2) p < 0.0001 *** |
0.5 (0.3–1.5) p = 0.014 * |
0 (−0.5–1.0) p = 0.89 |
0 (−1.0–1.5) p = 0.92 |
Change in KLG ≥ 2 | ||||
0 vs. 1 years | 1 vs. 2 years | 2 vs. 3 years | 3 vs. 4 years | |
Kellgren and Lawrence grades 2 | p = 0.0002 *** | p = 0.008 * | p = 0.002 * | p = 0.045 * |
1 Significant differences were calculated by comparison to the previous years’ visit using a global Friedman test and local using Wilcoxon signed ranked test based on Monte Carlo methods and 2 McNemar Test (one sided). Significances were: *** p < 0.001; * 0.01< p < 0.05.
3.3. Structural Change as Evaluated by Kellgren and Lawrence Grading
Four years prior TKR, 80% of the sample was graded as KL grade 2 or worse, 88% as KLG > 1 three years prior TKR, 96% as KLG > 1 three years prior TKR and 98% in the year prior to the TKR procedure (Table 2). There was a significant worsening in KL grades between each of the annual screening visits: 4–3 years: p = 0.045, 3–2 years: p = 0.002, 2–1 years: p = 0.008 and in the year prior TKR i.e., 1–0 years: p = 0.0002 (Figure 2 and Table 3).
3.4. Prediction of TKR Using Artificial Neural Networks
After applying the variable selection algorithm and testing more than 250 models, KL grades, WOMAC total score, body mass index measures, pain intensity and pain medication revealed the following performance: negative predictive value 73% (52–88%), positive predictive value 84% (71–94%), specificity 30% (19–42%), sensitivity 41% (27–57%), percentage of unclassified knees 54% (46–62%) and total percentage of correctly predicted knees 80% (69–89%) (Figure 3).
4. Discussion
Our data revealed that structural change as evaluated by the x-ray based Kellgren and Lawrence grading a significant worsening occurred between every of the four annual visits prior to the TKR procedure. When analyzing the symptomology of the investigated 165 included participants, as assessed by the WOMAC total score, by the WOMAC pain subscore, by the knee specific quality of life or by the knee pain intensity, a significant worsening was detected only in the year prior to the TKR procedure but in not in the years earlier. When including selected factors into our artificial neural network model, we were able to compute that 80% of all patients with a prediction were correctly predicted with a positive predictive value of 84% and a negative predictive value of 73% for the total model.
One limitation of this study is that we investigated a subpopulation of the OAI, which had specific criteria for including participants into the study that had present or being at risk of developing symptomatic knee osteoarthritis [10,11]. This might not reflect the general population, as knee OA subtypes such as post-traumatic knee OA or individuals at a young age (<45 years) were not considered specifically. On the other hand, the OAI provides the unique opportunity for researchers to sample publicly available datasets with having a specific population that is prone to receiving a TKR within the duration of the study. This would be less feasible in other experimental settings that emphasize the uniqueness of the data used in the present study to obtain representative results in a rationale longitudinal follow-up. Another limitation of the present study is that the outcome of the TKR procedure and the long-term outcome were not considered in our prediction model as previously performed in study by Escobar et al. [14]. In this study, the appropriateness for the indication to TKR surgery in patients with osteoarthritis was re-evaluated also in consideration of the outcome of the surgical procedure. This allows one to include the effects of the taken decision into the decision-making process, but it is also prone to blurring the integrated factors due a good outcome independent of the factors included into the model. Another limitation of this study is that the patients included in this study were treated at different institutions and the decision to perform TKR surgery is an individual one and can vary between centers and even between doctors within one center.
It is widely accepted that radiographic assessments of the knee serve as a good parameter when trying to evaluate a patients’ need for TKR [8,9,10,11,12] and that radiographic end-stage knee OA is the key-factor for taking the decision to perform KKR surgery [6,7]. A limitation to this objective parameter is that a patients’ symptom status, i.e., pain, disability and quality of life, are not reflected. Our results confirm this observation, as 98% of the individuals have definite radiographic knee OA with a Kellgern and Lawrence score of 2 or worse. However, four individuals had a KL score of 0 or 1 and were nevertheless scheduled for TKR surgery. This interesting fact receives support by our observations on the symptomology in the years prior to the procedure. In the years prior to TKR the worsening between the annuals screening visits was not significant as compared to the preceding screening visit in the years four to one prior to TKR. However, in the year prior to TKR, the symptoms, as assessed by the WOMAC total and pain subscore, by the KOOS quality of life or by knee pain intensity, worsened significantly as compared to the year prior to TKR. These factors can be regarded as the major driving factors for the decision making, as during the following year, TKR surgery was performed in these individuals, independent of their radiographic knee status.
In order to have the possibility to predict other patients’ need for TKR surgery, i.e., other than those included in the OAI, we performed analyses using artificial neural networks. Artificial neural networks support the decision making process based on input variables for a defined outcome. As all of the individuals included in our investigation received a TKR during the observational period as a determined outcome, we included 14 variables representing information on structural, symptomatic, socio-economic and demographic influencing factors. The model used a selection algorithm to select six variables that had, in combination, the best performance. As in general such models perform great in a given sample, it has been shown that they fail when they are applied to predict new cases.
Yu et al. established two prediction models for TKR and total hip replacement using Cox proportional hazards models based on two large cohorts [31]. This study yielded well-performing risk prediction equations however they established their model based on epidemiological patient data, whereas we implemented both epidemiological data as well as data on knee functionality, symptomology and radiographical status of the patients [31]. Furthermore, our prediction model is easily applicable using the figure presented in this study. Another study conducted by Zeni et al. used a logistic regression model to identify predictive variables however the patient specific data was not analyzed longitudinally in the course of time prior to TKR and no prediction model was established [20]. Nevertheless, the study by Zeni et al. also emphasizes the crucial role of knee functionality in regard to the risk of TKR [20]. Additionally, various studies have shown the relevance of magnetic resonance imaging (MRI) in regard to TKR prediction but MRI is still not as easily accessible as conventional radiographs [16,17]. Gossec et al. have previously shown that radiological grade, mean patient global assessment and the need for NSAIDs were predictive of total hip replacement, which corresponds well to our results for TKR [18]. Chan et al. created a formula that provides acceptable diagnostic efficacy in retrospect for the prediction of TKR based on similar variables as our prediction model, which suggests that our model focuses on appropriate predictive factors [19]. Concludingly our study takes a broad variety of variables into account, while the prediction model remains easily applicable in a primary care setting.
The uniqueness of our present model is that we used a subsample of our total sample, i.e., out of 165 individuals to train the model, another subsample to verify the training effect and another subsample (not included in the model thus far) to calculate the performance of the applied artificial neural networks. As a result, of those who were able to be classified, 80% were classified correctly. Another unique fact of our applied model is that we used two thresholds, i.e., one for acceptance and one for rejection with leaving a “grey area,” in which a classification, i.e., a TKR procedure that will be performed within the next 2 years, was not possible.
As stated previously by Fei et al., artificial neural networks (ANN) have exceptional information processing abilities learning and generalization capabilities, and fault and noise tolerance; moreover, ANNs were able to assess more variables than logistic regression [32,33,34]. In brief, ANNs can be seen as enhancements of classic regression models [30,35]. As reviewed by Miller et al. ANNs have been successfully applied in terms of diagnostic applications and are likely to play an important role in regard to diagnostics and treatment of chronic diseases in the future [36]. Taking these advantages into account, we decided to establish our prediction model based on ANNs. However, this specific technique is not able to analyze and evaluate individual factors, while logistic regression modelling can provide this kind of information, which can be seen as a limitation of our prediction model [32]. Another limitation of our study and ANNs in general are the “black-box” characteristics of this technique, making it extremely difficult to determine how certain processes are performed [37]. Another limitation to our study is the relatively small sample size; therefore, further studies will be needed to improve and validate our prediction model. However, we believe that our novel approach using ANNs to predict the risk of TKR is of clinical value, and the applied technique is well-suited to address this controversially discussed topic.
The mathematical basis of our model is provided in the Supplementary Material: The Mathematical Basis of this paper and can be easily implemented in an Excel worksheet for the daily use in clinical care.
5. Conclusions
Longitudinal analyses of symptomology and structural change in a four-year period prior to TKR surgery revealed that the significant worsening in pain, function and quality of life in the year prior to the TKR procedures seems to be the more dominant decision driving factor as compared to the radiographic structural progression of the knee. Analyzing a total of 14 different structural, symptomatic, socio-economic and demographic factors via artificial neural networks, we were able to establish a prediction model that predicted correctly 80% of the classified individuals to undergo TKR surgery within the next 2 years with a positive predictive value of 84% and a negative predictive value of 73%. Considering that our prediction model is based on easily accessible patient data, it could be used in a primary care setting to evaluate the need for TKR surgery within the next two years.
Acknowledgments
The OAI is a public-private partnership comprised of five contracts (N01-AR-2-2258; N01-AR-2-2259; N01-AR-2-2260; N01-AR-2-2261; N01-AR-2-2262) funded by the National Institutes of Health, a branch of the Department of Health and Human Services, and conducted by the OAI Study Investigators. Private funding partners include Merck Research Laboratories; Novartis Pharmaceuticals Corporation, GlaxoSmithKline; and Pfizer, Inc. Private sector funding for the OAI is managed by the Foundation for the National Institutes of Health. This manuscript was prepared using an OAI public use data set and does not necessarily reflect the opinions or views of the OAI investigators, the NIH or the private funding partners [6].
Supplementary Materials
The following are available online at https://www.mdpi.com/2077-0383/9/5/1298/s1, The Mathematical Basis.
Author Contributions
S.H. was responsible for the analysis and interpretation of the data and the drafting of the article. W.H. was responsible for the analysis and interpretation of the data and contributed substantially with his statistical expertise. G.M.H. was involved in the critical revision of the manuscript and gave his final approval. R.W. was involved in the critical revision of the manuscript, provided logistic support and gave his final approval. S.C. was responsible for the conception and design of the study, critical revision of the article, provided logistic and technical support and gave his final approval. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- 1.White D.K., Neogi T., Nguyen U.-S., Niu J., Zhang Y. Trajectories of functional decline in knee osteoarthritis: The Osteoarthritis Initiative. Rheumatology. 2015;55:801–808. doi: 10.1093/rheumatology/kev419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Cross M., Smith E., Hoy D.G., Nolte S., Ackerman I.N., Fransen M., Bridgett L., Williams S., Guillemin F., Hill C.L., et al. The global burden of hip and knee osteoarthritis: Estimates from the global burden of disease 2010 study. Ann. Rheum. Dis. 2014;73:1323–1330. doi: 10.1136/annrheumdis-2013-204763. [DOI] [PubMed] [Google Scholar]
- 3.Guccione A., Felson D., Anderson J.J., Anthony J.M., Zhang Y., Wilson P.W., Kelly-Hayes M., Wolf P.A., Kreger B.E., Kannel W.B. The effects of specific medical conditions on the functional limitations of elders in the Framingham Study. Am. J. Public Health. 1994;84:351–358. doi: 10.2105/AJPH.84.3.351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Murray C.J., Richards M.A., Newton J.N., Fenton K.A., Anderson H.R., Atkinson C., Bennett D., Bernabe E., Blencowe H., Bourne R., et al. UK health performance: Findings of the Global Burden of Disease Study 2010. Lancet. 2013;381:997–1020. doi: 10.1016/S0140-6736(13)60355-4. [DOI] [PubMed] [Google Scholar]
- 5.Loeser R.F., Goldring S.R., Scanzello C.R., Goldring M.B. Osteoarthritis: A disease of the joint as an organ. Arthritis Rheum. 2012;64:1697–1707. doi: 10.1002/art.34453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Van Manen M.D., Nace J., Mont M.A. Management of primary knee osteoarthritis and indications for total knee arthroplasty for general practitioners. J. Am. Osteopath. Assoc. 2012;112:709–715. [PubMed] [Google Scholar]
- 7.Feeley B.T., Gallo R.A., Sherman S., Williams R.J. Management of osteoarthritis of the knee in the active patient. J. Am. Acad. Orthop. Surg. 2010;18:406–416. doi: 10.5435/00124635-201007000-00003. [DOI] [PubMed] [Google Scholar]
- 8.Gossec L., Paternotte S., Maillefert J.F., Combescure C., Conaghan P.G., Davis A.M., Gunther K.-P., Hawker G., Hochberg M., Katz J.N., et al. The role of pain and functional impairment in the decision to recommend total joint replacement in hip and knee osteoarthritis: An international cross-sectional study of 1909 patients. Report of the OARSI-OMERACT Task Force on total joint replacement. Osteoarthr. Cartil. 2011;19:147–154. doi: 10.1016/j.joca.2010.10.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Jones C.A., Voaklander D.C., Suarez-Alma M.E. Determinants of function after total knee arthroplasty. Phys. Ther. 2003;83:696–706. doi: 10.1093/ptj/83.8.696. [DOI] [PubMed] [Google Scholar]
- 10.Löfvendahl S., Bizjajeva S., Ranstam J., Lidgren L. Indications for hip and knee replacement in Sweden. J. Eval. Clin. Pract. 2011;17:251–260. doi: 10.1111/j.1365-2753.2010.01430.x. [DOI] [PubMed] [Google Scholar]
- 11.Mancuso C.A., Ranawat C.S., Esdaile J.M., Johanson N.A., Charlson M.E. Indications for total hip and total knee arthroplasties. Results of orthopaedic surveys. J. Arthroplast. 1996;11:34–46. doi: 10.1016/S0883-5403(96)80159-8. [DOI] [PubMed] [Google Scholar]
- 12.Verra W.C., Witteveen K.Q., Maier A.B., Gademan M.G.J., Van Der Linden H.M.J., Nelissen R.G.H.H. The reason why orthopaedic surgeons perform total knee replacement: Results of a randomised study using case vignettes. Knee Surg. Sports Traumatol. Arthrosc. 2016;24:2697–2703. doi: 10.1007/s00167-015-3961-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hunter D.J., Guermazi A., Roemer F., Zhang Y., Neogi T. Structural correlates of pain in joints with osteoarthritis. Osteoarthr. Cartil. 2013;21:1170–1178. doi: 10.1016/j.joca.2013.05.017. [DOI] [PubMed] [Google Scholar]
- 14.Peterfy C.G., Schneider E., Nevitt M. The osteoarthritis initiative: Report on the design rationale for the magnetic resonance imaging protocol for the knee. Osteoarthr. Cartil. 2008;16:1433–1441. doi: 10.1016/j.joca.2008.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hitzl W., Wirth W., Maschek S., Cotofana S., Nevitt M., John M.R., Ladel C., Eckstein F. Greater Lateral Femorotibial Cartilage Loss in Osteoarthritis Initiative Participants With Incident Total Knee Arthroplasty: A Prospective Cohort Study. Arthritis Care Res. 2015;67:1481–1486. doi: 10.1002/acr.22608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Heilmeier U., Wamba J.M., Joseph G.B., Darakananda K., Callan J., Neumann J., Link T.M. Baseline knee joint effusion and medial femoral bone marrow edema, in addition to MRI-based T2 relaxation time and texture measurements of knee cartilage, can help predict incident total knee arthroplasty 4-7 years later: Data from the Osteoarthritis Initiative. Skelet. Radiol. 2019;48:89–101. doi: 10.1007/s00256-018-2995-4. [DOI] [PubMed] [Google Scholar]
- 17.Cicuttini F.M., Jones G., Forbes A.B., Wluka A.E. Rate of cartilage loss at two years predicts subsequent total knee arthroplasty: A prospective study. Ann. Rheum. Dis. 2004;63:1124–1127. doi: 10.1136/ard.2004.021253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gossec L., Tubach F., Baron G., Ravaud P., Logeart I., Dougados M. Predictive factors of total hip replacement due to primary osteoarthritis: A prospective 2 year study of 505 patients. Ann. Rheum. Dis. 2005;64:1028–1032. doi: 10.1136/ard.2004.029546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chan W.P., Hsu S.-M., Huang G.-S., Yao M.-S., Chang Y.-C., Ho W.-P. Creation of a reflecting formula to determine a patient’s indication for undergoing total knee arthroplasty. J. Orthop. Sci. 2010;15:44–50. doi: 10.1007/s00776-009-1418-8. [DOI] [PubMed] [Google Scholar]
- 20.Zeni J.A., Jr., Axe M.J., Snyder-Mackler L. Clinical predictors of elective total joint replacement in persons with end-stage knee osteoarthritis. BMC Musculoskelet. Disord. 2010;11:86. doi: 10.1186/1471-2474-11-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Osteoarthritis Initiative (OAI) Guidelines For Publications Using Public Use Data Sets. [(accessed on 8 March 2020)]; Available online: https://oai.epi-ucsf.org/datarelease/Docs/OAIPubsGuidePublicData.pdf.
- 22.Eckstein F., Boudreau R., Wang Z., Hannon M.J., Duryea J., Wirth W., Cotofana S., Guermazi A., Roemer F., Nevitt M., et al. Comparison of radiographic joint space width and magnetic resonance imaging for prediction of knee replacement: A longitudinal case-control study from the Osteoarthritis Initiative. Eur. Radiol. 2016;26:1942–1951. doi: 10.1007/s00330-015-3977-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Steidle-Kloc E., Wirth W., Glass N., Ruhdorfer A., Cotofana S., Eckstein F., Segal N., Steidle E. Is Pain in One Knee Associated with Isometric Muscle Strength in the Contralateral Limb?: Data From the Osteoarthritis Initiative. Am. J. Phys. Med. Rehabil. 2015;94:792–803. doi: 10.1097/PHM.0000000000000262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Eckstein F., Wirth W., Nevitt M.C. Recent advances in osteoarthritis imaging—The osteoarthritis initiative. Nat. Rev. Rheumatol. 2012;8:622–630. doi: 10.1038/nrrheum.2012.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kellgren J.L.J. Atlas of Standard Radiographs. Blackwell Scientific; Oxford, UK: 1963. [Google Scholar]
- 26.Haugen I.K., Englund M., Aliabadi P., Niu J., Clancy M., Kvien T.K., Felson D. Prevalence, incidence and progression of hand osteoarthritis in the general population: The Framingham Osteoarthritis Study. Ann. Rheum. Dis. 2011;70:1581–1586. doi: 10.1136/ard.2011.150078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Roos E.M., Lohmander L.S. The Knee injury and Osteoarthritis Outcome Score (KOOS): From joint injury to osteoarthritis. Health Qual. Life Outcomes. 2003;1:64. doi: 10.1186/1477-7525-1-64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Bellamy N., Buchanan W.W., Goldsmith C.H., Campbell J., Stitt L.W. Validation study of WOMAC: A health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug therapy in patients with osteoarthritis of the hip or knee. J. Rheumatol. 1988;15:1833–1840. [PubMed] [Google Scholar]
- 29.Zobel I., Erfani T., Bennell K.L., Makovey J., Metcalf B., Chen J.S., March L., Zhang Y., Eckstein F., Hunter D.J., et al. Relationship of Buckling and Knee Injury to Pain Exacerbation in Knee Osteoarthritis: A Web-Based Case-Crossover Study. Interact. J. Med. Res. 2016;5:e17. doi: 10.2196/ijmr.5452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bishop C. Neural Networks for Pattern Recognition. Oxford University Press, Inc.; New York, NY, USA: 1995. [Google Scholar]
- 31.Yu D., Jordan K.P., Snell K.I., Riley R.D., Bedson J., Edwards J., Mallen C.D., Tan V., Ukachukwu V., Prieto-Alhambra D., et al. Development and validation of prediction models to estimate risk of primary total hip and knee replacements using data from the UK: Two prospective open cohorts using the UK Clinical Practice Research Datalink. Ann. Rheum. Dis. 2019;78:91–99. doi: 10.1136/annrheumdis-2018-213894. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Fei Y., Hu J., Li W., Wang W., Zong G. Artificial neural networks predict the incidence of portosplenomesenteric venous thrombosis in patients with acute pancreatitis. J. Thromb. Haemost. 2017;15:439–445. doi: 10.1111/jth.13588. [DOI] [PubMed] [Google Scholar]
- 33.Smith A.E., Nugent C.D., McClean S.I. Evaluation of inherent performance of intelligent medical decision support systems: Utilising neural networks as an example. Artif. Intell. Med. 2003;27:1–27. doi: 10.1016/S0933-3657(02)00088-X. [DOI] [PubMed] [Google Scholar]
- 34.Statnikov A., Aliferis C., Tsamardinos I., Hardin D., Levy S.E. A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics. 2005;21:631–643. doi: 10.1093/bioinformatics/bti033. [DOI] [PubMed] [Google Scholar]
- 35.Kaiser A.K., Hitzl W., Iglseder B. Three-question dementia screening. Development of the Salzburg Dementia Test Prediction (SDTP) Z. Gerontol. Geriatr. 2014;47:577–582. doi: 10.1007/s00391-013-0568-7. [DOI] [PubMed] [Google Scholar]
- 36.Miller D.D., Brown E.W. Artificial Intelligence in Medical Practice: The Question to the Answer? Am. J. Med. 2018;131:129–133. doi: 10.1016/j.amjmed.2017.10.035. [DOI] [PubMed] [Google Scholar]
- 37.Kourou K., Exarchos T.P., Exarchos K.P., Karamouzis M.V., Fotiadis D.I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.