Abstract
Objective
The goal of this study was to test the reliability and validity of a handheld mechanical three-dimensional (3D) ultrasound (US) device for quantifying femoral articular cartilage (FAC) against the current clinical standard of magnetic resonance imaging (MRI).
Design
Bilateral knee images of 25 healthy volunteers were acquired with 3D US and 3.0 T MRI. The trochlear FAC was segmented by two raters who repeated segmentations on five cases during separate sessions. MRI and 3D US segmentations were registered using a semi-automated surface-based registration algorithm, and MRI segmentations were trimmed to match the FAC region from 3D US. Intra- (n = 5) and inter-rater (n = 25) reliabilities were assessed using intraclass correlation coefficients (ICCs) calculated from FAC volumes. Relationships between MRI and 3D US were assessed using Spearman correlation and linear regression (n = 25).
Results
MRI intra-rater ICCs were 0.97 (0.79, 1.00) and 0.90 (0.25, 0.99) for each rater with an inter-rater ICC of 0.83 (0.48, 0.94). 3D US intra-rater ICCs were 1.00 (0.98, 1.00) and 0.98 (0.84, 1.00) for each rater with an inter-rater ICC of 0.96 (0.90, 0.98). Spearman correlation and linear regression revealed a strong correlation ρ = 0.884 (0.746, 0.949) and regression R2 = 0.848 (0.750, 0.950).
Conclusion
These results suggest 3D US demonstrates excellent intra- and inter-rater reliabilities and strong concurrent validity with MRI when quantifying healthy trochlear FAC volume. 3D US may reduce imaging costs and greatly improve feasibility of quantifying knee cartilage volume during knee arthritis clinical trials and patient care.
Keywords: 3D ultrasound, Knee cartilage volume, Osteoarthritis, Segmentation, MRI, Validation
1. Introduction
Knee osteoarthritis (KOA) is a whole-joint disease with a prevalence of 7–17% among adults 45+ years old, and is increasing with rising obesity rates and population aging [1,2]. KOA affects all knee joint tissues, leading to cartilage degradation, subchondral bone remodeling, and muscle atrophy [3]. Cartilage degradation, a hallmark of KOA, has motivated efforts to characterize disease severity through measures of femoral articular cartilage (FAC) loss, where decreases in FAC quality and quantity are interpreted as increased KOA severity. Semi-quantitative scoring systems, such as the Kellgren-Lawrence (KL) grading scale, define the presence of KOA using tibiofemoral (TF) joint space narrowing (JSN) as a surrogate for FAC loss. Most imaging-based KOA scales target TF cartilage because of easy visualization with weight-bearing radiography. Although radiographic JSN may represent FAC loss, radiographic grading has poor sensitivity to detect FAC changes in early stage KOA [4]. Furthermore, radiographic JSN suffers from limited reproducibility for visualizing three-dimensional (3D) features due to variations in knee joint angulation [5]. Additionally, JSN is a composite of meniscal positioning and degeneration, which are not necessarily associated with KOA severity [6,7].
Limitations of radiographic JSN have motivated magnetic resonance imaging (MRI) investigations of FAC as a discriminative and evaluative KOA tool. The MRI Osteoarthritis Knee Score (MOAKS), Boston-Leeds Osteoarthritis Knee Score (BLOKS), Knee Osteoarthritis Scoring System (KOSS), and Whole-Organ MRI Score (WORMS) are all MRI-based semi-quantitative scales that have shown excellent reliabilities in OA populations [[8], [9], [10], [11]]. Furthermore, compositional MRI techniques produce quantitative measurements of cartilage biochemistry and have primarily been developed to investigate early stage KOA. Due to the ability of MRI to assess the status of whole joint cartilage with reasonable spatial resolution, it has been largely accepted as the gold standard for KOA FAC assessments. While MRI has accelerated the scientific and medical communities’ understanding of KOA, it has limitations. MRI is not feasible for point-of-care disease classification due to high manufacturing and operating costs, long acquisition times, and inaccessibility to all patients at all times [12]. However, while other modalities may be less expensive and more accessible than MRI, finding individuals that possess the expertise needed to interpret images in under-served areas of the world is challenging.
Conventional two-dimensional (2D) ultrasound (US) is widely accessible, relatively inexpensive, and overcomes the limitations associated with MRI. 2D US is a high-resolution imaging modality that has been increasingly used for point-of-care assessments of rheumatological diseases [[13], [14], [15], [16]]. 2D US has been implemented in KOA research via Outcome Measures in Rheumatology (OMERACT) US working group's semi-quantitative grading scale [17]. However, this scale has not been formally validated, and conventional 2D US is associated with limitations. Clinicians must cognitively integrate multiple 2D images to mentally reconstruct 3D anatomy, which is inefficient and leads to operator variability [18]. Additionally, 2D US tissue volume calculations require measurements of height, width, and length in two orthogonal views and are associated with low accuracy, high variability, and large operator dependency. Furthermore, sensitivity to change is limited when using ordinal scales with small dynamic range such as 0–3 in the OMERACT scale. Alternatively, 3D US techniques involve translating a 2D US transducer while continually acquiring images that are reconstructed into a 3D image. 3D US imaging overcomes the limitations of 2D US and may fill the clinical need for an objective imaging-based point-of-care tool for assessing KOA status, progression, and response to treatment.
3D US techniques have been applied to neonatal, gynaecological, and vascular applications, among others [[19], [20], [21]]. We have developed a handheld mechanical 3D US device to provide point-of-care assessments of trochlear FAC (tFAC). The objectives of this cross-sectional study were to investigate the intra- and inter-rater reliabilities of our 3D US scanner for measuring tFAC volumes in healthy volunteers, and to assess its concurrent validity compared to the current clinical standard of MRI. We hypothesized that tFAC volumes measured from 3D US would demonstrate excellent reliability (ICC > 0.90) and be strongly correlated (ρ > 0.80) to MRI measurements in the same region-of-interest (ROI).
2. Methods
Twenty-five volunteers over the age of 18 without a recent history of chronic knee joint pathology (healthy knees) in the year prior to the study were recruited for MR and 3D US knee imaging. The imaging protocol was approved by the Research Ethics Board at Western University Canada and all volunteers provided written informed consent prior to imaging. Knees were deemed healthy if volunteers denied experiencing knee pain on most days of the weeks prior to this study and had not been diagnosed with any type of knee arthritis. Volunteers with prior knee injuries and/or surgeries that occurred before the year leading up to the study were not excluded from the cohort if they denied experiencing frequent knee symptoms including pain, aching, or stiffness on most days of the weeks prior to this study.
2.1. Image acquisition
MRI scans were acquired on a 3.0 T MR system (General Electric Healthcare, Milwaukee, WI, USA) using a 3D Multiple Echo Recombined Gradient Echo (MERGE) sequence in accordance with the Osteoarthritis Research Society International (OARSI) recommendations for KOA imaging clinical trials [22]. An HD T/R Knee Array Coil (8 Channels) was used while volunteers were positioned supine with minimal knee flexion. Images were acquired in the sagittal plane with voxel sizes of 0.63 × 0.63 × 0.40 mm3, an average of 250 slices, a reconstructed matrix size of 256 by 256 voxels, and a field-of-view of 16 cm. The excitation flip angle was 5° with a repetition time (TR) of 30 ms and an echo time (TE) of 11.71 ms. The MERGE sequence scan time for one knee was 4 min and 27 s. Total scan time was 45 min including both knees.
3D US images were acquired using an Aplio i800 US machine (Canon Medical Systems Corporation, Ōtawara, Tochigi, Japan) equipped with a 14L5 linear transducer with a 58 mm footprint length and an operating frequency of 10 MHz (3.8 MHz–10.0 MHz). The 2D US transducer was mounted to our 3D US scanner using a custom 3D-printed mold (Fig. 1). Our 3D US device consisted of a motorized drive mechanism that linearly translated the transducer over 4.0 cm along the patient's skin. 2D US images were continually acquired at regular spatial intervals which were reconstructed into a 3D image immediately after scanning via computer software [18]. Our 3D US scanner has previously been validated on tungsten filament phantoms and volumetric agar phantoms, demonstrating the ability to acquire Euclidean distance and volumetric measurements with errors <2% [23]. For 3D US acquisition, volunteers were positioned supine and instructed to flex their knee to the maximum range of motion without eliciting pain. 3D US images of the tFAC were acquired at the distal end of the femur, proximal to the patella during maximum knee flexion. 120 2D US images were acquired in the transverse plane with transducer translation along the perpendicular axis. Reconstructed 3D US image voxel sizes were 0.058 × 0.058 × 0.33 mm3 with 2D US in-plane image dimensions of 968 × 694 voxels. 3D US acquisition time was 15 s for one knee.
2.2. Manual segmentation
MRI voxel resampling was performed to ensure that the segmentation pixel spacings were substantially smaller than the smallest FAC image feature. Voxel resampling was conducted in MATLAB R2019b (MathWorks, Natick, Massachusetts, USA) using the interp2 function with the spline interpolation method. The resampled voxel size was 0.15 × 0.15 × 0.40 mm3 to provide a balance between segmentation sensitivity and computation time.
Manual tFAC segmentations were completed by two raters (SP, RD) on MRI and 3D US after receiving training during three formal calibration sessions with a rheumatologist possessing advanced diagnostic and interventional musculoskeletal ultrasonography training (CTA). One rater had no prior experience with medical image segmentation but possesses a medical physics academic background with courses in medical imaging modalities including US and MRI. The other rater is a registered diagnostic medical sonographer with formal training and clinical experience in medical imaging. Segmentations were performed in the open-source software 3D Slicer (3D Slicer 4.11.0 Preview Release) using the segment editor module and were conducted in the sagittal MRI and transverse 3D US planes [24]. Segmentations were completed using every second 2D image to decrease segmentation time for both MRI and 3D US without a reduction in sensitivity to tFAC volume changes [25]. Segmented 2D images were interpolated using a morphological contour interpolation algorithm in 3D Slicer, resulting in an average of 146 and 92 segmented 2D images per MRI and 3D US image, respectively [26].
During MRI segmentations, the posterior condylar cartilage was excluded by defining the anterior border of the posterior aspect of the lateral and medial menisci as a segmentation border to further reduce segmentation times (Fig. 2a and b). The hyperintense synovial membrane lining Hoffa's fat pad was excluded from MRI segmentations. For 3D US segmentations, the anterior hyperechoic tFAC surface and the hyperechoic border of the cortex were defined as boundaries for the anechoic cartilage (Fig. 2c and d). With these boundaries and definitions, total segmentation times were approximately 45–60 min per knee for MRI and 20–30 min per knee for 3D US. Five knees from separate volunteers were randomly selected by each rater and re-segmented on MRI and 3D US. Repeated segmentations were conducted during sessions separated by a two-week washout period.
2.3. Reliability and validation analysis
MRI and 3D US segmentations were registered via manual initialization followed by automated surface-based registration in 3D Slicer (Fig. 3). Initialization involved manipulating 3D US tFAC models using linear transformations and rotations along the three Cartesian axes to align the segmentations with MRI using the intercondylar notch as an anatomical landmark. An automatic surface-based registration method (Jean-Baptiste & Vinicius Boen, University of Michigan) was applied to the segmentations to complete the registration. Intra- and inter-rater reliabilities were assessed using the same registration procedures. Reliability analysis was conducted using the entire segmented area of MRI and 3D US tFAC models, while validation between modalities involved additional trimming of MRI segmentations. MR images captured a larger FAC field-of-view than 3D US, resulting in segmentations that did not represent identical anatomical ROI when comparing modalities. Therefore, MRI segmentations were manually trimmed using the overlaid 3D US segmentations as guides, ensuring that tFAC models represented the same ROI on both modalities. Registration and trimming were repeated on five knees selected at random during sessions separated by a two-week washout period.
Segmentation volumes were computed by 3D Slicer and the percent differences between MRI and 3D US volumes were calculated. The mean surface distance (MSD), Hausdorff distance (HD), and Dice similarity coefficient (DSC) were computed as these metrics are widely used to compare and evaluate segmentations [27]. MSD represents the mean distance from a point on one surface to the nearest corresponding point on the other surface, while HD is the largest distance from a point on one surface to the closest point on the other surface (Fig. 3e). DSC provides a measure of similarity in terms of overlap between segmentations and ranges from 0% (no overlap) to 100% (identical objects). MSD and HD values were computed using the open-source software CloudCompare (CloudCompare v2.11 beta), and DSC values were computed using the segment comparison module in 3D Slicer (Csaba Pinter, PerkLab, Queen's University, Canada).
2.4. Statistical analysis
Statistical analyses were performed using SPSS (SPSS Statistics v26; IBM, Armonk, NJ). All data were initially tested for normality using the Shapiro-Wilk test. Intra- and inter-rater segmentation reliabilities from MRI and 3D US for both raters were assessed using intraclass correlation coefficients (ICCs). Intra-rater ICCs were based on a single-rating, absolute-agreement, 2-way mixed-effects model, while inter-rater ICCs were based on a single-rating, absolute-agreement, 2-way random-effects model. ICCs were interpreted as less than 0.50 indicating poor reliability, between 0.50 and 0.75 indicating moderate reliability, between 0.75 and 0.90 indicating good reliability, and greater than 0.90 indicating excellent reliability [28]. Bland-Altman plots were used to assess differences between intra- and inter-rater tFAC volumes along with differences between MRI and 3D US segmentations. A cumulative percentile plot was used to observe the relationship of the differences between MRI and 3D US tFAC volumes. Correlations between tFAC volumes calculated as the mean of the two raters from MRI and 3D US segmentations were determined using Spearman Rank-Order Correlation due to the non-normal distribution of data. Linear regression analysis was conducted using MRI segmentation volumes as predictors for 3D US tFAC volumes and the enter method for equation construction.
3. Results
The demographic data of the volunteers is shown in Table 1 and was available from 24 of the 25 participants.
Table 1.
Volunteers with healthy knees | |
---|---|
% Women | 58.3 |
Age [year] (mean ± SD) | 29.9 ± 14.5 |
Height [m] (mean ± SD) | 1.68 ± 0.11 |
Weight [kg] (mean ± SD) | 67.0 ± 14.8 |
BMI [kg/m2] (mean ± SD) | 23.4 ± 3.3 |
3.1. Reliability
Similar mean segmentation volumes and mean absolute volume differences between intra- and inter-rater comparisons were observed using the same modality for each rater (Table 2, Fig. 4). The smallest ICC was 0.83 (0.48, 0.94) and was observed for the inter-rater comparison of MRI, while the largest ICC was 1.00 (0.98, 1.00) and was observed for the intra-rater 3D US comparison for rater 1 (Table 3). Global mean MSD and HD were smaller for 3D US than MRI for intra- and inter-rater comparisons, while DSC was larger for 3D US than MRI during all comparisons (Table 3).
Table 2.
Mean Volume [cm3] | Mean Volume (repeated) [cm3] | Absolute Difference [cm3] | |
---|---|---|---|
Intra-rater (n = 5) | |||
MRI (rater 1) | 4.71 ± 1.18 | 4.76 ± 1.20 | 0.232 ± 0.152 |
MRI (rater 2) | 4.56 ± 1.10 | 4.20 ± 1.04 | 0.366 ± 0.351 |
3D US (rater 1) | 2.52 ± 1.01 | 2.53 ± 0.96 | 0.0516 ± 0.0531 |
3D US (rater 2) | 2.15 ± 0.92 | 2.17 ± 1.08 | 0.167 ± 0.111 |
Inter-rater (n = 25) | |||
MRI | 4.79 ± 1.23 | 4.38 ± 1.03 | 0.494 ± 0.465 |
3D US | 2.29 ± 0.72 | 2.30 ± 0.64 | 0.155 ± 0.134 |
Registration & trimming (n = 5) | |||
Single rater | 2.14 ± 0.56 | 2.13 ± 0.54 | 0.0173 ± 0.0166 |
Table 3.
ICC (95% CI) | P value | MSD [mm] | HD [mm] | DSC [%] | ||
---|---|---|---|---|---|---|
Intra-rater (n = 5) | ||||||
MRI (rater 1) | 0.97 (0.79, 1.00) | 0.001 | 0.218 ± 0.109 | 2.88 ± 1.37 | 87.3 ± 2.8 | |
MRI (rater 2) | 0.90 (0.25, 0.99) | 0.002 | 0.499 ± 0.275 | 6.66 ± 2.76 | 83.5 ± 4.6 | |
3D US (rater 1) | 1.00 (0.98, 1.00) | <0.0001 | 0.126 ± 0.024 | 1.76 ± 0.35 | 92.9 ± 0.2 | |
3D US (rater 2) | 0.98 (0.84, 1.00) | 0.0003 | 0.256 ± 0.143 | 3.70 ± 2.23 | 88.1 ± 2.6 | |
Inter-rater (n = 25) | ||||||
MRI | 0.83 (0.48, 0.94) | <0.0001 | 0.274 ± 0.122 | 3.51 ± 1.77 | 83.1 ± 3.6 | |
3D US | 0.96 (0.90, 0.98) | <0.0001 | 0.243 ± 0.133 | 2.89 ± 1.72 | 86.4 ± 3.1 | |
Registration & trimming (n = 5) | ||||||
Single rater | 1.00 (0.99, 1.00) | <0.0001 | 0.101 ± 0.090 | 1.72 ± 0.94 | 94.3 ± 4.4 |
3.2. 3D US to MRI registration and trimming
The mean percent difference between MRI and 3D US volumes averaged across all comparisons including both raters individually, following registration and trimming, was 16.7 ± 12.9% (n = 50). 3D US tFAC volume measurements were larger than MRI volume measurements in 88% of the comparisons between the two modalities (Fig. 5a and b). Spearman Rank-Order Correlation revealed a strong correlation between MRI and 3D US volumes (ρ = 0.884 (0.746, 0.949), p < 0.0001), and linear regression resulted in R2 = 0.848 (0.750, 0.950), p < 0.0001, and Y = 1.29∗X – 230 (Fig. 5c). Global mean MSD, HD, and DSC between registered segmentations averaged between both raters were 0.375 ± 0.071 mm, 2.85 ± 1.18 mm, and 71.2 ± 6.5%, respectively (n = 25).
4. Discussion
This is the first study investigating the reliability and validation of FAC volume measurements using 3D US in healthy volunteers. This study focused on validation of tFAC volumes, which is important when studying the status and progression of KOA affecting patellofemoral articulation. Since KOA affects the entire joint, these results are pertinent to the study of nearly all KOA phenotypes. Healthy FAC possesses a relatively smooth and continuous surface without distinct anatomical landmarks that can be used for registering segmentations, besides the intercondylar notch. Therefore, tFAC images that included the intercondylar notch enabled registration of MRI and 3D US segmentations. Additionally, the intercondylar notch can be used as an anatomical landmark during longitudinal studies to ensure repeated measures are taken from the same ROI.
3D US imaging is possible in any application involving 2D US since the only modification required is mounting the 2D US transducer to a 3D US scanning device. Several studies have previously investigated the application of 2D US for evaluating femoral condylar cartilage for KOA assessments [[29], [30], [31], [32]]. However, quantitative image analysis of non-invasive knee US has only been reported for cartilage thickness measurements but not entire cartilage volumes [[33], [34], [35]]. Quantitative image analysis may provide more sensitive information regarding early KOA than semi-quantitative grading scales, which are subjective and potentially susceptible to US operator/rater differences. However, semi-quantitative grading scales are potentially faster than manual quantitative image analysis. Therefore, this study builds on previous work and is easily implemented in similar clinical settings.
Many studies have investigated cartilage thickness measurements for assessing KOA severity [36,37]. However, thickness measurements are highly variable and dependent on the FAC ROI being measured, which can vary within subjects due to US transducer placement and angulation at different time points [[38], [39], [40]]. Detecting changes in cartilage loss using thickness measurements requires the ability to sample the same ROI with good test-retest reliability. Volume measurements may overcome these limitations by enabling quantification of cartilage loss in all dimensions and provide a similar metric to average cartilage thickness. Furthermore, 3D US may provide meaningful advantages over MRI for quantifying FAC volume. Our 3D US device is compatible with any commercially available US machine and is associated with low manufacturing and operating costs. Additionally, the portability of our 3D US device enables FAC volume measurements to be acquired at the patient's bedside.
4.1. Reliability
Intra-rater ICCs for MRI and 3D US demonstrated excellent reliabilities (>0.90). Inter-rater ICC for MRI demonstrated good reliability (0.75–0.90) while 3D US ICC demonstrated excellent reliability (>0.90). Intra- and inter-rater Bland-Altman plots displayed smaller volume difference variations for 3D US compared to MRI in all comparisons (Fig. 4). Additionally, global mean MSD and HD were smaller for 3D US than MRI, and mean DSC for 3D US was higher than MRI for intra- and inter-rater comparisons (Table 3). Collectively, these results suggest that our 3D US system can quantify tFAC volume with similar or perhaps superior reliability and precision than MRI.
The higher spatial resolution of 3D US images acquired with the Canon 14L5 linear transducer compared to 3.0 T MRI may partially account for reliability and precision differences. Resolution differences between modalities were most apparent during MRI segmentations when raters attempted to define the interface between tFAC and the synovial lining of Hoffa's fat pad. Differentiating tFAC from slightly hyperintense synovial lining proved extremely difficult or impossible during MRI segmentations despite manipulating image contrast. Additionally, the TF cartilage interface was difficult to identify on MRI as both cartilage structures were equally hyperintense. The synovial lining of Hoffa's fat pad along with the TF contact point were not within the ROI of 3D US acquisitions since images were acquired during maximum knee flexion. Healthy FAC produced ideal US images with excellent differentiation from surrounding tissues. The difficulties in identifying borders on MRI likely also contributed to higher segmentation times compared to 3D US. The MRI and 3D US resolutions were chosen to match what is routinely used in both patient care and clinical trials for OA to enable comparisons in a clinically relevant context.
4.2. Validity
3D US tFAC segmentations possessed larger volumes than MRI segmentations. Considering the higher spatial resolution of US compared to MRI, it is possible that MRI segmentations were not able to capture the true cartilage volume as effectively as 3D US. Medial and lateral portions of the tFAC and condylar cartilage become thin and difficult to delineate from thin adipose tissue and may often not be visible in MRI. Due to the high spatial resolution of 3D US, the thin medial and lateral portions of tFAC were easily identified and therefore included in segmentations. This will be of great importance in clinical studies of joint disease since thinner areas of cartilage are particularly susceptible to damage and loss in KOA. Our 3D US device was able to visualize tFAC and condylar cartilage regions that were difficult or impossible to visualize using MRI, providing a more comprehensive model of the cartilage and improved volume quantifications. Notwithstanding these differences in absolute cartilage volumes, Spearman Rank-Order Correlation and linear regression analyses revealed a strong correlation between MRI and 3D US tFAC measurements, and that MRI tFAC volumes can predict 3D US volumes.
4.3. Limitations and impact
This study was conducted on volunteers with healthy knees rather than KOA patients. Validating our 3D US system on healthy knees prior to testing with KOA patients was a necessary first step for developing image acquisition, segmentation, and analysis protocols. In KOA patients, FAC characteristically develops fissures, abrasions, and other surface irregularities whereas healthy cartilage is smooth and continuous. Therefore, before implementing our 3D US device clinically, the measurement properties of this system should also be evaluated in KOA patients. Results from a KOA patient study will enable us to determine if KOA cartilage pathology impacts measurement properties of our system relative to healthy cartilage. However, given the high resolution and excellent soft tissue contrast of clinical US systems, we anticipate our 3D US system will perform similarly in KOA patients.
Only a portion of FAC was captured in a single pass 3D US acquisition as the FAC cannot be visualized through the patella and tibia using US. Therefore, knees were scanned in maximum flexion proximal to the patella to capture the greatest portion of FAC possible. However, during maximum knee flexion the posterior medial and lateral condylar cartilage are in contact with the tibial cartilage and are not visible with 3D US. This limitation can be overcome if the tFAC is used as a non-invasive imaging “biopsy” of knee cartilage, providing clinicians with an indication of FAC status representative of overall joint health. Additionally, manual trimming of MRI segmentations to match the 3D US ROI was only necessary for validating our system against MRI and would not be required when using 3D US independently in future studies. While this procedure may have introduced variability or bias, repeated registrations, along with repeated trimming, revealed nearly perfect reproducibility (Table 3), indicating that our protocol results in very little variability or bias.
The weight-bearing condylar cartilage was able to be visualized using our 3D US device. However, this required additional acquisitions on the medial and lateral sides of the patella during maximum knee flexion. Since MR images of FAC were acquired during minimal knee flexion, variations in patella positioning relative to the FAC surface in MRI compared to 3D US resulted in difficulties registering 3D US condylar cartilage segmentations to MRI. Therefore, this study focused on the tFAC region for validation with MRI, but 3D US could be used for monitoring condylar cartilage volume changes over time without requiring MRI comparisons. Finally, a small subset of patients with severe KOA may experience limited range of motion, which might interfere with visualization of the most inferior aspects of the tFAC.
The greatest advantage of our 3D US system is the ability to acquire images quickly, easily, and comfortably at the patient's bedside, providing cost-effective and non-invasive assessments of FAC status for reliable longitudinal monitoring. Our 3D US device could alter the workflow of orthopedic, sports medicine, primary care, and arthritis clinics by enabling clinicians and researchers to obtain more information without added complexity or additional stress and discomfort to patients. This technology may be well-suited to longitudinal and interventional clinical studies where detecting changes in cartilage volume is required. In the future, this system will also be useful in a routine clinical care context. Currently, the use of KL grading for assessing KOA progression is insensitive to change and relies on indirect features of FAC thinning. MRI-based measures of cartilage volume are superior to radiographic measures, but are limited by cost, time, accessibility, and patient-related factors, preventing generalized use of quantitative MRI for KOA. Our study demonstrates that cartilage volume measurements acquired using 3D US represent a more feasible method to quantitatively assess tFAC volume with very high reliability and accuracy.
In conclusion, we have demonstrated the reliability and validity of a handheld mechanical 3D US device we developed to quantify tFAC volumes in healthy volunteers. We demonstrated that 3D US segmentations are associated with excellent intra- and inter-rater reliabilities and possess strong agreement with MRI tFAC volume measurements. The tFAC is a vital region of the knee joint for investigating the progression of patellofemoral OA and could also be used as a non-invasive imaging “biopsy” of the FAC to monitor KOA progression and response to treatment. Future work will assess the reliability of our 3D US device in KOA patients and the ability to monitor FAC volume changes over time. Further assessment of measurement properties including sensitivity to change is necessary before its use can be recommended in clinical trials. Future work will also assess the test-retest reliability of 3D US during image acquisitions separated by time. In addition to longitudinal construct validity, future work will also assess the intra- and inter-rater reliability of 3D US cartilage measurements in a longitudinal study to monitor the progression of tFAC change and degradation for early detection of KOA. 3D US is a promising, inexpensive, and widely accessible imaging modality for point-of-care assessments of KOA and will enable clinicians and researchers to obtain additional information without added complexity or discomfort to patients.
Author contributions
SP, CTA, and AF contributed to the conception and design of the study. All authors contributed to the analysis and interpretation of the data. SP and RD performed the acquisition of images, segmentations, and analyses on both imaging modalities. DJG contributed to developing the image processing, segmentation, and analysis workflow. All authors contributed to drafting the article with critical revisions and final approvals before submission. SP (spaperni@uwo.ca) takes full responsibility for the integrity of the study from inception to the finished article.
Role of the funding source
This work is funded by the Canadian Institutes of Health Research (CIHR #154314) and a Collaborative Research Seed Grant from the Schulich School of Medicine and Dentistry (R5727A08). The funding agencies did not have any role in the design, analysis, interpretation of data, writing of the manuscript, or the decision to submit for publication.
Sources of support
Work is funded by the Canadian Institutes of Health Research (#154314) and a Collaborative Research Seed Grant from the Schulich School of Medicine and Dentistry (R5727A08).
Declaration of competing interest
The authors have no competing interests with respect to the work completed in this study.
Acknowledgements
The authors acknowledge Dr. David Tessier for volunteer recruitment and image coordination, and Holly Philpott for 3D US image acquisition. We especially thank all of the volunteers who made this study possible. The authors acknowledge Dr. Trevor Birmingham for providing guidance with experimental design.
Footnotes
Supplementary data related to this article can be found at https://doi.org/10.1016/j.ocarto.2020.100127.
Contributor Information
S. Papernick, Email: spaperni@uwo.ca.
R. Dima, Email: rdima@uwo.ca.
D.J. Gillies, Email: dgillie@uwo.ca.
C.T. Appleton, Email: tom.appleton@sjhc.london.on.ca.
A. Fenster, Email: afenster@robarts.ca.
Appendix A. Supplementary data
The following is the supplementary data related to this article:
References
- 1.Jordan J.M., Helmick C.G., Renner J.B., Luta G., Dragomir A.D., Woodard J., et al. Prevalence of knee symptoms and radiographic and symptomatic knee osteoarthritis in african Americans and caucasians: the johnston county osteoarthritis project. J. Rheumatol. 2007;34(1):172–180. [PubMed] [Google Scholar]
- 2.Felson D.T., Naimark A., Anderson J., Kazis L., Castelli W., Meenan R.F. The prevalence of knee osteoarthritis in the elderly. the framingham osteoarthritis study. Arthritis Rheum. 1987;30(8):914–918. doi: 10.1002/art.1780300811. [DOI] [PubMed] [Google Scholar]
- 3.Felson D.T. Osteoarthritis of the knee. N. Engl. J. Med. 2006;354(8):841–848. doi: 10.1056/NEJMcp051726. [DOI] [PubMed] [Google Scholar]
- 4.Amin S., LaValley M.P., Guermazi A., Grigoryan M., Hunter D.J., Clancy M., et al. The relationship between cartilage loss on magnetic resonance imaging and radiographic progression in men and women with knee osteoarthritis. Arthritis Rheum. 2005;52(10):3152–3159. doi: 10.1002/art.21296. [DOI] [PubMed] [Google Scholar]
- 5.Guermazi A., Roemer F.W., Burstein D., Hayashi D. Why radiography should no longer be considered a surrogate outcome measure for longitudinal assessment of cartilage in knee osteoarthritis. Arthritis Res. Ther. 2011;13(6):247. doi: 10.1186/ar3488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Hunter D.J., Zhang Y.Q., Tu X., Lavalley M., Niu J.B., Amin S., et al. Change in joint space width: hyaline articular cartilage loss or alteration in meniscus? Arthritis Rheum. 2006;54(8):2488–2495. doi: 10.1002/art.22016. [DOI] [PubMed] [Google Scholar]
- 7.Yusuf E., Kortekaas M.C., Watt I., Huizinga T.W.J., Kloppenburg M. Do knee abnormalities visualised on MRI explain knee pain in knee osteoarthritis? A systematic review. Ann. Rheum. Dis. 2011;70(1):60–67. doi: 10.1136/ard.2010.131904. [DOI] [PubMed] [Google Scholar]
- 8.Hunter D.J., Guermazi A., Lo G.H., Grainger A.J., Conaghan P.G., Boudreau R.M., et al. Evolution of semi-quantitative whole joint assessment of knee OA: MOAKS (MRI Osteoarthritis Knee Score) Osteoarthritis Cartilage. 2011;19(8):990–1002. doi: 10.1016/j.joca.2011.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hunter D.J., Lo G.H., Gale D., Grainger A.J., Guermazi A., Conaghan P.G. The reliability of a new scoring system for knee osteoarthritis MRI and the validity of bone marrow lesion assessment: BLOKS (Boston Leeds Osteoarthritis Knee Score) Ann. Rheum. Dis. 2008;67(2):206–211. doi: 10.1136/ard.2006.066183. [DOI] [PubMed] [Google Scholar]
- 10.Kornaat P.R., Ceulemans R.Y.T., Kroon H.M., Riyazi N., Kloppenburg M., Carter W.O., et al. MRI assessment of knee osteoarthritis: knee Osteoarthritis Scoring System (KOSS)--inter-observer and intra-observer reproducibility of a compartment-based scoring system. Skeletal Radiol. 2005;34(2):95–102. doi: 10.1007/s00256-004-0828-0. [DOI] [PubMed] [Google Scholar]
- 11.Peterfy C.G., Guermazi A., Zaim S., Tirman P.F.J., Miaux Y., White D., et al. Whole-organ magnetic resonance imaging Score (WORMS) of the knee in osteoarthritis. Osteoarthritis Cartilage. 2004;12(3):177–190. doi: 10.1016/j.joca.2003.11.003. [DOI] [PubMed] [Google Scholar]
- 12.Schmitz R.J., Wang H.-M., Polprasert D.R., Kraft R.A., Pietrosimone B.G. Evaluation of knee cartilage thickness: a comparison between ultrasound and magnetic resonance imaging methods. Knee. 2017;24(2):217–223. doi: 10.1016/j.knee.2016.10.004. [DOI] [PubMed] [Google Scholar]
- 13.Iagnocco A., Perella C., D'Agostino M.A., Sabatini E., Valesini G., Conaghan P.G. Magnetic resonance and ultrasonography real-time fusion imaging of the hand and wrist in osteoarthritis and rheumatoid arthritis. Rheumatol Oxf Engl. 2011;50(8):1409–1413. doi: 10.1093/rheumatology/ker111. [DOI] [PubMed] [Google Scholar]
- 14.Ohrndorf S., Backhaus M. Pro musculoskeletal ultrasonography in rheumatoid arthritis. Clin. Exp. Rheumatol. 2015;33(4 Suppl 92):S50–S53. PubMed PMID: 26457337. [PubMed] [Google Scholar]
- 15.Okano T., Mamoto K., Di Carlo M., Salaffi F. Clinical utility and potential of ultrasound in osteoarthritis. Radiol Med (Torino) 2019 doi: 10.1007/s11547-019-01013-z. Published online March 4. [DOI] [PubMed] [Google Scholar]
- 16.Ponikowska M., Świerkot J., Nowak B. The importance of ultrasound examination in early arthritis. Reumatol Wars. 2018;56(6):354–361. doi: 10.5114/reum.2018.80712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bruyn G.A., Naredo E., Iagnocco A., Balint P.V., Backhaus M., Gandjbakhch F., et al. The OMERACT ultrasound working group 10 years on: update at OMERACT 12. J. Rheumatol. 2015;42(11):2172–2176. doi: 10.3899/jrheum.141462. [DOI] [PubMed] [Google Scholar]
- 18.Fenster A., Downey D.B., Cardinal H.N. Three-dimensional ultrasound imaging. Phys. Med. Biol. 2001;46(5):R67–R99. doi: 10.1088/0031-9155/46/5/201. [DOI] [PubMed] [Google Scholar]
- 19.Kishimoto J., de Ribaupierre S., Lee D.S.C., Mehta R., St Lawrence K., Fenster A. 3D ultrasound system to investigate intraventricular hemorrhage in preterm neonates. Phys. Med. Biol. 2013;58(21):7513–7526. doi: 10.1088/0031-9155/58/21/7513. [DOI] [PubMed] [Google Scholar]
- 20.Saravelos S.H., Kong G.W.S., Chung J.P.W., Mak J.S.M., Chung C.H.S., Cheung L.P., et al. A prospective randomized controlled trial of 3D versus 2D ultrasound-guided embryo transfer in women undergoing ART treatment. Hum Reprod Oxf Engl. 2016;31(10):2255–2260. doi: 10.1093/humrep/dew206. [DOI] [PubMed] [Google Scholar]
- 21.Zahalka A., Fenster A. An automated segmentation method for three-dimensional carotid ultrasound images. Phys. Med. Biol. 2001;46(4):1321–1342. doi: 10.1088/0031-9155/46/4/327. [DOI] [PubMed] [Google Scholar]
- 22.Hunter D.J., Altman R.D., Cicuttini F., Crema M.D., Duryea J., Eckstein F., et al. OARSI Clinical Trials Recommendations: knee imaging in clinical trials in osteoarthritis. Osteoarthritis Cartilage. 2015;23(5):698–715. doi: 10.1016/j.joca.2015.03.012. [DOI] [PubMed] [Google Scholar]
- 23.Papernick S., Gillies D.J., Appleton T., Fenster A. Three-dimensional ultrasound for monitoring knee inflammation and cartilage damage in osteoarthritis and rheumatoid arthritis. Proc. SPIE. 2020;11315 doi: 10.1117/12.2549624. [DOI] [Google Scholar]
- 24.Fedorov A., Beichel R., Kalpathy-Cramer J., Finet J., Fillion-Robin J.C., Pujol S., et al. 3D slicer as an image computing platform for the quantitative imaging network. Magn. Reson. Imaging. 2012;30(9):1323–1341. doi: 10.1016/j.mri.2012.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wirth W., Nevitt M., Hellio Le Graverand M.-P., Benichou O., Dreher D., Davies R.Y., et al. Sensitivity to change of cartilage morphometry using coronal FLASH, sagittal DESS, and coronal MPR DESS protocols – comparative data from the osteoarthritis initiative (OAI) Osteoarthritis Cartilage. 2010;18(4):547–554. doi: 10.1016/j.joca.2009.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Albu A.B., Beugeling T., Laurendeau D. A morphology-based approach for interslice interpolation of anatomical slices from volumetric images. IEEE Trans. Biomed. Eng. 2008;55(8):2022–2038. doi: 10.1109/TBME.2008.921158. [DOI] [PubMed] [Google Scholar]
- 27.Taha A.A., Hanbury A. Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool. BMC Med. Imag. 2015;15 doi: 10.1186/s12880-015-0068-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Koo T.K., Li M.Y. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–163. doi: 10.1016/j.jcm.2016.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Iagnocco A., Coari G., Zoppini A. Sonographic evaluation of femoral condylar cartilage in osteoarthritis and rheumatoid arthritis. Scand. J. Rheumatol. 1992;21(4):201–203. doi: 10.3109/03009749209099222. [DOI] [PubMed] [Google Scholar]
- 30.McCune W., Dedrick D., Aisen A., MacGuire A. Sonographic evaluation of osteoarthritic femoral condylar cartilage. Correlation with operative findings. Clin. Orthop. 1990;254:230–235. PubMed PMID: 2182254. [PubMed] [Google Scholar]
- 31.Lee C.-L., Huang M.-H., Chai C.-Y., Chen C.-H., Su J.-Y., Tien Y.-C. The validity of in vivo ultrasonographic grading of osteoarthritic femoral condylar cartilage: a comparison with in vitro ultrasonographic and histologic gradings. Osteoarthritis Cartilage. 2008;16(3):352–358. doi: 10.1016/j.joca.2007.07.013. [DOI] [PubMed] [Google Scholar]
- 32.Saarakkala S., Waris P., Waris V., Tarkiainen I., Karvanen E., Aarnio J., et al. Diagnostic performance of knee ultrasonography for detecting degenerative changes of articular cartilage. Osteoarthritis Cartilage. 2012;20(5):376–381. doi: 10.1016/j.joca.2012.01.016. [DOI] [PubMed] [Google Scholar]
- 33.Spannow A.H., Pfeiffer-Jensen M., Andersen N.T., Stenbøg E., Herlin T. Inter -and intraobserver variation of ultrasonographic cartilage thickness assessments in small and large joints in healthy children. Pediatr. Rheumatol. 2009;7(1):12. doi: 10.1186/1546-0096-7-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Faisal A., Ng S.-C., Goh S.-L., Lai K.W. Knee cartilage segmentation and thickness computation from ultrasound images. Med. Biol. Eng. Comput. 2018;56(4):657–669. doi: 10.1007/s11517-017-1710-2. [DOI] [PubMed] [Google Scholar]
- 35.Bedewi M.A., Elsifey A.A., Naguib M.F., Saleh A.K., Nwihadh N.B., Abd-Elghany A.A., et al. Sonographic assessment of femoral cartilage thickness in healthy adults. J. Int. Med. Res. 2020;48(8) doi: 10.1177/0300060520948754. 300060520948754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Yoon C.-H., Kim H.-S., Ju J.H., Jee W.-H., Park S.-H., Kim H.-Y. Validity of the sonographic longitudinal sagittal image for assessment of the cartilage thickness in the knee osteoarthritis. Clin. Rheumatol. 2008;27(12):1507–1516. doi: 10.1007/s10067-008-0956-3. [DOI] [PubMed] [Google Scholar]
- 37.Naredo E., Acebes C., Möller I., Canillas F., de Agustin J.J., de Miguel E., et al. Ultrasound validity in the measurement of knee cartilage thickness. Ann. Rheum. Dis. 2009;68(8):1322–1327. doi: 10.1136/ard.2008.090738. [DOI] [PubMed] [Google Scholar]
- 38.Draper C.E., Besier T.F., Gold G.E., Fredericson M., Fiene A., Beaupre G.S., et al. Is cartilage thickness different in young subjects with and without patellofemoral pain? Osteoarthritis Cartilage. 2006;14(9):931–937. doi: 10.1016/j.joca.2006.03.006. [DOI] [PubMed] [Google Scholar]
- 39.Koo S., Gold G.E., Andriacchi T.P. Considerations in measuring cartilage thickness using MRI: factors influencing reproducibility and accuracy. Osteoarthritis Cartilage. 2005;13(9):782–789. doi: 10.1016/j.joca.2005.04.013. [DOI] [PubMed] [Google Scholar]
- 40.Williams T.G., Holmes A.P., Bowes M., Vincent G., Hutchinson C.E., Waterton J.C., et al. Measurement and visualisation of focal cartilage thickness change by MRI in a study of knee osteoarthritis using a novel image analysis tool. Br. J. Radiol. 2010;83(995):940–948. doi: 10.1259/bjr/68875123. [DOI] [PMC free article] [PubMed] [Google Scholar]