Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Apr 1.
Published in final edited form as: Med Image Anal. 2022 Jan 1;77:102343. doi: 10.1016/j.media.2021.102343

DADP: Dynamic abnormality detection and progression for longitudinal knee magnetic resonance images from the Osteoarthritis Initiative

Chao Huang a, Zhenlin Xu b, Zhengyang Shen b, Tianyou Luo c, Tengfei Li d,g, Daniel Nissman d, Amanda Nelson e, Yvonne Golightly e,f, Marc Niethammer b,g,*, Hongtu Zhu b,c,g,**
PMCID: PMC8901568  NIHMSID: NIHMS1770695  PMID: 35026528

Abstract

Osteoarthritis (OA) is the most common disabling joint disease. Magnetic resonance (MR) imaging has been commonly used to assess knee joint degeneration due to its distinct advantage in detecting morphologic cartilage changes. Although several statistical methods over conventional radiography have been developed to perform quantitative cartilage analyses, little work has been done capturing the development and progression of cartilage lesions (or abnormal regions) and how they naturally progress. There are two major challenges, including (i) the lack of building spatial-temporal correspondences and correlations in cartilage thickness and (ii) the spatio-temporal heterogeneity in abnormal regions. The goal of this work is to propose a dynamic abnormality detection and progression (DADP) framework for quantitative cartilage analysis, while addressing the two challenges. First, spatial correspondences are established on flattened 2D cartilage thickness maps extracted from 3D knee MR images both across time within each subject and across all subjects. Second, a dynamic functional mixed effects model (DFMEM) is proposed to quantify abnormality progression across time points and subjects, while accounting for the spatio-temporal heterogeneity. We systematically evaluate our DADP using simulations and real data from the Osteoarthritis Initiative (OAI). Our results show that DADP not only effectively detects subject-specific dynamic abnormal regions, but also provides population-level statistical disease mapping and subgroup analysis.

Keywords: Osteoarthritis, Abnormal region detection, Dynamic functional mixed effect model, Dynamic conditional random field model

1. Introduction

Osteoarthritis (OA) is a common, chronic disease affecting all joint tissues and characterized by cartilage loss. Over 50 million people have doctor-diagnosed arthritis (Hootman et al., 2016), the majority of which is OA, and even this is likely an underestimate (Jafarzadeh and Felson, 2017), leading to health expenditures totaling over $27 billion annually (Gupta et al., 2005). OA is commonly associated with symptoms including swelling, pain, discomfort, locking, and other problems in mobility, which can be severe (Felson et al., 2000). The disease of OA is also characterized by the progressive loss of joint cartilage, which is believed to be the main factor in OA. Different medical imaging modalities have been employed to investigate and quantitatively measure the knee joint cartilage, such as radiography, Magnetic resonance (MR) imaging, ultrasound (US), Computed Tomography (CT), and optical coherence tomography (OCT) (Braun and Gold, 2012; Kijowski et al., 2020). Among all these available modalities, MR imaging is a non-invasive modality with high spatial resolution. Although MR imaging cannot capture the bone architecture as well as CT, it is useful for capturing some characteristics of the joint, including cartilage morphology and biochemical composition (Guermazi et al., 2003; Roemer et al., 2020). Furthermore, MR imaging is very sensitive to full or partialthickness changes of articular cartilage over time (Huang et al., 2015; Karim et al., 2021). Therefore, MR imaging has been commonly used to detect cartilage morphological changes and understand OA progression (Raynauld, 2003; Cicuttini et al., 2005; Conaghan et al., 2011). Fig. 1 shows MR images of the human knee illustrating cartilage loss.

Fig. 1:

Fig. 1:

MR images of the human knee illustrating cartilage loss. (left) A sagittal slice of a healthy knee. Bones are annotated in blue, femoral cartilage in green and tibial cartilage in orange; (middle) A coronal slice of the same healthy knee; (right) A coronal slice of an OA knee with cartilage loss indicated by the red arrow.

Several large-scale longitudinal multicenter MR imaging databases, such as the Multicenter Osteoarthritis STudy (MOST) (Segal et al., 2013) and the Osteoarthritis Initiative (OAI) (Eckstein et al., 2012), have been acquired to understand the progression of cartilage change and associated risk factors (Felson et al., 2013). Although quantitative MR image-based biomarkers, such as the volume of the articular cartilage (Eckstein et al., 2006; Schaefer et al., 2017), shape information (An et al., 2016; Bowes et al., 2021), and T2 relaxometry patterns (Pedoia et al., 2017, 2019), show potential for OA diagnosis and treatment planning, little work has been done in assessing the development of cartilage lesions (or abnormal regions), such as soft tissue swelling or small cartilage or bone fragments, and their natural progression, due to several challenges, which will be detailed below. From a clinical perspective, it is also of great interest to delineate the location and progression of abnormal cartilage regions for refining patient treatment.

There are at least two major challenges in existing abnormal region detection approaches. First, spatio-temporal correspondences and correlations in cartilage thickness are required for statistical localized analysis. It is difficult to establish such correspondences due to the small cartilage volume in relation to the remainder of the knee (Huang et al., 2015; Pedoia et al., 2017, 2019; Roach et al., 2021). Second, abnormal regions across both subjects and time points exhibit spatio-temporal heterogeneity. For the spatial heterogeneity, the cartilage loss pattern is only consistent within a small set of the entire population of subjects. For the temporal heterogeneity, over time cartilage damage and cartilage loss have been found from longitudinal OA studies (Crema et al., 2013; Glyn-Jones et al., 2015; Chen et al., 2017). Thus, treating all the subjects across different time points equally and applying standard statistical methods (e.g., voxel-based analysis and subregion-based analysis (Wirth and Eckstein, 2008)) would be problematic.

To address the two challenges above, a Gaussian Hidden Markov Model (GHMM) was proposed in Huang et al. (2015) to simultaneously model the progression and abnormality of cartilage thickness. However, there are some drawbacks in GHMM. First, GHMM ignores the temporal heterogeneity and assumes that the subject-specific abnormal region pattern is time-invariant, which would be sub-optimal in practice, especially in characterizing and predicting individual OA progression. Second, for normal control subjects, GHMM applies a voxel-wise linear mixed model and treats all voxels as independent units, which ignores two key functional features of imaging data, including spatial smoothness and spatial correlation. Third, GHMM does not provide the population level statistical disease mapping and the detection of some potential OA-subgroups based on the cartilage thickness abnormality (Liu et al., 2021).

The aim of this work is to propose a Dynamic Abnormality Detection and Progression (DADP) framework. First, longitudinal knee MR images are processed to derive flattened 2D cartilage thickness maps. Second, the subject-specific dynamic abnormal regions are detected based on the thickness maps. Compared to GHMM and other existing approaches, the main contributions of our DADP framework are summarized as follows:

  1. A deep learning based image analysis pipeline is proposed in DADP to extract 2D cartilage thickness maps from 3D knee MR images, in a common coordinate system, establishing correspondences in space and time;

  2. A dynamic functional mixed effects model is proposed in DADP to (i) establish the relationship between the cartilage thickness map and a set of covariates of interest, (ii) capture the spatial smoothness and spatio-temporal correlations within the thickness map through functional data analysis tools (Yuan et al., 2014), and (iii) detect the spatio-temporal heterogeneity in subject-specific dynamic abnormal regions through a dynamic conditional random field model (Sutton et al., 2007);

  3. A series of formal statistical inference procedures are established in DADP to (i) test whether there is any significant effect of some covariates of interest on the cartilage thickness, (ii) detect population level OA risk on the thickness map through the statistical disease mapping approach, and (iii) conduct subgroup analysis through a non-negative matrix factorization method (Ding et al., 2005);

  4. A powerful and user-friendly package is developed in DADP for the entire OAI MR image dataset. Moreover, our package can be straightforwardly applied for analyzing other knee MR image data at scale. The package will be released to the public through GitHub.

2. DADP

Our DADP includes two main components: (i) an image analysis module and (ii) a statistical modeling module. Specifically, in the image analysis module, the 3D knee MR images are preprocessed through a series of image analysis steps to derive flattened 2D cartilage thickness maps in a common coordinate system. Through this common coordinate system, we obtain spatial correspondence across subjects and/or time points. Then, in the statistical modeling module, a dynamic functional mixed effects model is established based on the 2D cartilage thickness maps to characterize the spatio-temporal correlations within the cartilage thickness map, address the spatio-temporal heterogeneity of abnormal regions, and quantify local OA risk at both population-level and subgroup-level. The workflow of our DADP is summarized in Fig. 2. Next, we introduce the detailed steps in the image analysis module and the estimation and inference procedures in the statistical modeling module.

Fig. 2:

Fig. 2:

The DADP framework includes (i) an image analysis module and (ii) a statistical modeling module. Image analysis module extracts 2D cartilage thickness maps in a common coordinate system from the 3D knee MR images through a series of image analysis steps. Statistical modeling module includes (a) association analysis between cartilage thickness and covariates of interest, (b) individual abnormal regions detection, and (c) multi-level statistical disease mapping.

2.1. Image analysis module

The aim of this image analysis module is to extract spatially aligned 2D thickness maps of femoral/tibial cartilage from the 3D MR images. The whole pipeline consists of four image preprocessing & analysis steps, including ① segmentation & meshing, ② 3D thickness map computation, ③ registration, and ④ thickness map projection. Fig. 3 shows the overall preprocessing pipeline. First, the femoral cartilage (FC) and tibial cartilage (TC) are segmented from 3D MR images with a U-Net-like network (Ronneberger et al., 2015; Xu et al., 2018), where the model is trained with two parallel output layers. Each layer predicts a binary segmentation for FC or TC, respectively. Compared to some existing methods for knee cartilage segmentation (e.g., graph model based approach (Yin et al., 2010), 2D U-Net network (Norman et al., 2018), volumetric convolution neural network (Raj et al., 2018), and multi-agent learning (Tan et al., 2019)), our proposed cascaded model of simple U-Nets can match the performance of a complex U-Net, while providing better efficiency in terms of using fewer parameters and requiring less memory (Xu et al., 2018). Based on the resulting (soft) segmentations, we build triangle meshes for each cartilage using marching cubes (Lorensen and Cline, 1987). Second, we compute the cartilage thickness for every vertex on the triangle meshes. The thickness value at a vertex on the cartilage surface mesh is approximated by the distance between the vertex’s position and its closest point on the opposite side of the surface. Therefore, we divide the cartilage surface mesh into two opposite parts. Some local smoothing is applied to improve incorrect divisions caused by mesh artifacts and imperfect segmentations. Third, we align the cartilage meshes to an atlas space using a deep registration network in order to align each MR image to an atlas image. Here the atlas is built via an unbiased atlas-building approach. Specifically, we consider 60 affine-aligned images with segmentations. The initial atlas is simply the average image of these aligned affine images. We then iterate B-Spline registrations (using NiftyReg and NMI as similarity measures) to the current atlas image 10 times. After each round of registrations, the atlas image is updated as the average of the set of deformed images given the current registration transformations. In this way, the atlas gets successively sharper. After the 10 registration rounds, we warp the segmentations given the current transformations from image space to the atlas image and obtain the constructed atlas and a probabilistic segmentation. Compared to some widely-used public registration methods (e.g., SyN (Avants et al., 2008, 2014), Demons (Vercauteren et al., 2008, 2009), NiftyReg (Modat et al., 2010, 2014), and Voxel-Morph variant (Dalca et al., 2018)), our proposed registration approach can achieve comparable and sometimes better performance for longitudinal registration with a dramatically reduced computation time and with excellent deformation regularity and symmetry (Shen et al., 2019). Specifically, the registration involves two phases: an affine registration as pre-alignment followed by a non-parametric registration to capture local deformations. For the non-parametric registration, we use a vector momentum-parameterized stationary velocity field (vSVF) approach (Niethammer et al., 2019). In particular, we use its deep learning version (Shen et al., 2019). We then transfer the cartilage thicknesses to the atlas cartilage mesh. Fourth, we project the 3D atlas points onto the 2D plane and interpolate thickness values to obtain spatially aligned flattened 2D cartilage thickness maps for the FC and TC respectively.

Fig. 3:

Fig. 3:

Image analysis module in DADP: ① segmentation & meshing; ② 3D thickness map computation; ③ registration; and ④ thickness map projection.

2.2. Statistical modeling module

2.2.1. Notations & Assumptions

Suppose that we obtain longitudinal imaging data and clinical variables for n unrelated subjects. Let ni be the total number of observations for the i-th subject for i = 1, …, n. Our images consist of preprocessed 2D cartilage thickness maps calculated from the OAI study, which are registered to a common template S2 with nv grid points {s1,,snv}. For the i-th subject at the j-th time point, we observe (i) 2D imaging data, denoted by yi,j(s), indicating the cartilage thickness at pixel s; (ii) a px-dimensional covariate vector of interest, denoted by xi,j, including the intercept, time-independent covariates (e.g., gender), and time-dependent ones (e.g., age and body mass index (BMI)); and (iii) the Kellgren-Lawrence grade (KLG) (Kellgren and Lawrence, 1957) denoted as gi,j. The KLG score corresponds to 5 diagnostic outcomes, starting from normal stage (gi,j = 0), early stage (gi,j = 1), mild OA (gi,j = 2), moderate OA (gi,j = 3), to severe OA (gi,j = 4). Without loss of generality, we assume that the first n* subjects are OA patients with at least one observation being at OA stage, whereas the rest nn* subjects are normal controls with all observations being at normal stage. In addition, for the i-th subject at the j-th time point, we assume that S can be decomposed into the union of normal regions Ri,j0 and abnormal regions Ri,j1, that is

S=Ri,j0Ri,j1 and Ri,j0Ri,j1=. (1)

Here we also assume that: (i) observations at the normal stage are perfectly healthy and do not exhibit any abnormal regions, i.e., Ri,j1=; (ii) for OA patients, the shape, size, and location of abnormal regions Ri,j1 may vary across subjects and time points; and (iii) the abnormal regions satisfy an inclusion property such that Ri,j1Ri,j+11 for j = 1, …, ni − 1. To further illustrate these assumptions, Fig. 4 shows three different abnormal region patterns. In this example, patients at the normal stage do not have any abnormal regions, while the shape, size, and location of abnormal regions are different across subjects and time points. In particular, once an abnormal region is present it does not disappear and cannot shrinkage at later time points.

Fig. 4:

Fig. 4:

Three different abnormal region patterns. P1: Normal stage for the first two time points and transition to an early stage for the follow-up two time points. One abnormal region (red) occurs at the third time point and becomes larger at the fourth time point; P2: Early stage at baseline and transition to late stage for the follow-up three time points. The abnormal regions (red) grow in size and number; P3: Early stage at the first three time points and transition to late stage at the fourth time point. One abnormal region (red) grows over time.

2.2.2. DFMEM: dynamic functional mixed effects model

We propose a dynamic functional mixed effects model (DFMEM) to simultaneously detect individual abnormal regions and capture the relationship between cartilage thickness and covariates of interest. The path diagram of DFMEM is presented in Fig. 5. Specifically, DFMEM is defined as

yi,j(s)=xi,jβ(s)+bi,j(s)wi,jα+zi,jγi(s)+ηi,j(s)+ϵi,j(s) (2)

for j = 1, …, ni and i = 1, …, n. Here β(s) is a px-dimensional vector representing the fixed effect at pixel s, wi,j is a pw-dimensional vector of covariates (commonly a subset of xi,j) associated with the effect α occurring within the abnormal regions (i.e., {bi,j(s),sS}, bi,j(s) = 0 if sRi,j0 and bi,j(s) = 1 otherwise), and zi,j is a pz-dimensional vector of covariates (commonly a subset of xi,j) associated with the random effect γi(s). In addition, ηi,j(s) is a stochastic individual function and ϵi,j(s) is a measurement error. Furthermore, we assume that γi(s), {ηi,j(s)}, and {ϵi,j(s)} are independent copies of GP(0,Σγ(s,s)), GP(0,ση2(s,s)), and (0,σϵ2(s)), respectively, where GP(μ(s),Σ(s,s)) denotes a Gaussian process with mean function μ(s) and covariance function Σ(s, s′). Therefore, given bi(s)=(bi,1(s),,bi,ni(s)), the covariance structure for yi(s)=(yi,1(s),,yi,ni(s)) can be written as

Σyi(s,s)=ZiΣγ(s,s)Zi+(ση2(s,s)+σϵ2(s)1(s=s))Ini, (3)

where Zi=(zi,1,,zi,ni), 1(·) is an indicator function, and Ini is an ni × ni identity matrix. Detailed derivations of (3) can be found in Appendix A. Model (2) consists of four key components: (i) a fixed effect component xi,jβ(s) to characterize the fixed effect between cartilage thickness and covariates of interest; (ii) a subject-specific random effect component including zi,jγi(s) to characterize the within-subject spatio-temporal correlation structure and ηi,j(s) to capture the spatial correlation structure within S; (iii) measurement errors ϵi,j(s); and (iv) a disease related random effect component bi,j(s)wi,jα to characterize the disease related dynamic effect within abnormal regions.

Fig. 5:

Fig. 5:

Path diagram of DFMEM. The change of cartilage thickness at pixel s includes four parts in the proposed model, including (i) the common dynamic change xi,jβ(s) in both normal controls and OA subjects, (ii) the change caused by random effects {zi,jγi(s),ηi,j(s)}, (iii) measurement errors ϵi,j(s), and (iv) the change wi,jα caused by dynamic abnormal regions (red).

Next, we propose a dynamic conditional random field model (DCRFM) (Wang and Ji, 2005; Wang et al., 2006; Sutton et al., 2007; Yin et al., 2009; Deufemia et al., 2014) to describe the subject-specific latent variables {bi(s),sS}. Some assumptions are introduced here. First, it is assumed that bi(s), γi(s), {ηi,j(s)}, and {ϵi,j(s)} are mutually independent. Second, it is assumed that the pattern of bi,j(s) at pixel s for the i-th OA patient at the j-th time point is affected by three possible factors: (i) population factor b0(Ds)={b0(s),sDs}: patterns of pixels in Ds, including both pixel s and pixels in its neighborhood, at the population level, (ii) spatial factor bi,j(Ns)={bi,j(s),sNs}: patterns of pixels in Ns, the neighborhood of pixel s, for the i-th OA patient at the j-th time point, and (iii) temporal factor bi,j1(Ms)={bi,j1(s),sMs}: patterns of pixels in Ms, including both pixel s and pixels in its neighborhood, for the i-th OA patient at the previous time point. Specifically, the pattern of bi,j(s) at baseline is assumed to be affected by the population and spatial factors, whereas that at follow-up visits is assumed to be affected by all the three factors. Further illustration of the three factors is presented in Fig. 6.

Fig. 6:

Fig. 6:

Illustration of the three factors affecting the pattern of pixel s for one patient at the j-th time point. (i) population factor (yellow): patterns of both pixel s and pixels in its neighborhood at the population level; (ii) spatial factor (red): current patterns of the neighborhood of pixel s at the subject level; and (iii) temporal factor (blue): previous patterns of both pixel s and pixels in its neighborhood at the subject level.

In order to model the population level pattern of b0(s), a Potts model (Green and Richardson, 2002) is considered here:

p0(b0(s)τ)exp{τU(b0(s),b0(Ns))}, (4)

where U(b0(s),b0(Ns))=sNs1(b0(s)b0(s)) and the turning parameter τ is introduced to encourages spatial smoothness in homogeneous regions. Then, given b0(Ds) and bi(Ns), bi(s) is assumed to follow a DCRFM with the conditional probability mass function, p(bi(s)bi(Ns),b0(Ds),τ,κ,ρ) given by

p1(bi,1(s)bi,1(Ns),b0(Ds),τ,κ)×j=2nip2(bi,j(s)bi,j1(Ms),bi,j(Ns),b0(Ds),τ,κ,ρ), (5)

where p1(bi,1(s)bi,1(Ns),b0(Ds),τ,κ) is the conditional probability mass function of bi,j(s) at baseline and proportional to

exp{τU(bi,1(s),bi,1(Ns))κU(bi,1(s),b0(Ds))}, (6)

in which κ is introduced to control the population factor affecting the subject-specific abnormal region pattern at pixel s. In addition, p2(bi,j(s)bi,j1(Ms),bi,j(Ns),b0(Ds),τ,κ,ρ) is the conditional probability mass function of bi,j(s) at follow-up visits. Specifically, in order to make sure that the inclusion property of abnormal region is satisfied, bi,j(s) = 1 with probability 1 when bi,j−1(s) = 1. Otherwise, the conditional probability mass function is proportional to

exp{τU(bi,j(s),bi,j(Ns))κU(bi,j(s),b0(Ds))ρU(bi,j(s),bi,j1(Ms))}. (7)

where ρ influences the strength of temporal dependencies. It should be noted that, Ds, Ns, and Ms may have different sizes. For simplicity, throughout this work we consider Ns is the set of the closest 8 neighbors of pixel s, and Ds=Ms=Ns{s}.

2.2.3. Estimation procedure

Our next task is to detect the individual abnormal regions {bi(s),sS}i=1n, and estimate all unknown parameters decomposed into three parts: (i) θ including β(s),Σγ(s,s), ση2(s,s), and σϵ2(s), (ii) α, and (iii) τ, κ, ρ. The whole estimation procedure can be treated as an iterative approach, in which either the detected abnormal regions or estimated parameters in one of the three parts are updated with others fixed. Specifically, at the (t+1)-th iteration, the detected abnormal regions {b^i(t)(s),sS} and current estimates Θ^(t){θ^(t),α^(t),τ^(t),κ^(t),ρ^(t)} are updated until certain criterion is satisfied. The overall estimation procedure is summarized in Algorithm 1. More detailed updating procedure and initialization procedure can be found in Appendix B and Appendix D, respectively.

2.2.

2.2.4. Inference Procedure

After obtaining all the estimated parameters and detected abnormal regions, we carry out a series of formal statistical inference procedures consisting of three different statistical tools: (i) hypothesis testing on parameters of interest including β(s) and α; (ii) statistical disease mapping; and (iii) subgroup analysis.

Hypothesis testing.

In practice, we are interested in investigating (i) whether there is any significant effect associated with some covariates of interest on the cartilage thickness, and (ii) whether there is any significant effect associated with covariates of interest on the cartilage thickness within the abnormal regions. Both of these two scientific questions can be written as hypothesis testing problems in the following general forms respectively:

0(s):Cββ(s)=0 v.s. 1(s):Cββ(s)0, (8)
0:Cαα=0 v.s. 1:Cαα0, (9)

where Cβ is a 1 × px vector and Cα is a 1 × pw vector. For testing problem (8), a sequence of Wald tests can be constructed as follows:

Tβ(s)=Cββ^(s)[CβΣ^β(s)Cβ]1β^(s)Cβ,sS, (10)

where Σ^β(s)=[i=1nXi{ZiΣ^γ(s,s)Zi+σ^η2(s,s)Ini}1Xi]1 is the estimated covariance matrix of β^(s). Given the estimate α^ and the detected abnormal regions {b^i(s),sS}i=1n, the corresponding p-values can be derived based on the asymptotic properties of the test statistics under 0. In particular, the false discovery rate (FDR) adjustment method (Yekutieli and Benjamini, 1999) can be employed to calculate the adjusted p-values for addressing the multiple comparison issue in (8). In addition to the local test statistics, an integral version of Wald test statistic is established to investigate the global effect on the whole thickness map:

Tβ=STβ(s)ds, (11)

where the wild bootstrap method is considered in calculating the p-values (Zhu et al., 2010, 2011, 2012, 2019). The details of wild bootstrap procedure can be found in Appendix E. For the testing problem in (9), a typical Wald test is also considered here:

Tα=Cαα^[CαΣ^αCα]1α^Cα, (12)

where Σ^α is the estimated covariance matrix of α^, which can be calculated via the bootstrap method as well (detailed procedure can be found in Appendix F).

Statistical disease mapping.

In addition to the subject-specific abnormal region pattern, we can also derive the statistical disease mapping at the population level. Specifically, the posterior probability that the pixel site belongs to the abnormal region is calculated as below:

exp{τ^n*U(1,b^0(Ns))κ^gi,j>0U(1,b^i,j(Ds))}l=01exp{τ^n*U(l,b^0(Ns))κ^gi,j>0U(l,b^i,j(Ds))}. (13)

Therefore, the statistical disease mapping is able to locate OA related regions of interest (ROIs) at the population level and helpful in imaging biomarker development and OA early prevention.

Subgroup analysis.

Besides the abnormal region pattern at both the individual and the population levels, one is also interested in utilizing the abnormal region based imaging markers to identify subgroups with spatially distinct cartilage changes over time. Specifically, standard subgroup outcomes for OA progression (e.g., KLG) have low reliability, are difficult to interpret, and respond poorly to change (Schiphof et al., 2011; Felson et al., 2011; Guermazi et al., 2012). In comparison, our detected individual dynamic abnormal regions are localized cartilage thickness based features and highly sensitive to the OA progression (Balamoody et al., 2010; Wirth et al., 2013). Therefore, the abnormal region based subgroup analysis has advantages in providing improved subgroup measures, which is important in specific OA prevention and clinical treatment management. In order to conduct this clustering analysis, we consider the non-negative matrix factorization (NMF) method which has an inherent clustering property (Ding et al., 2005). Specifically, we first construct a nv × n* matrix Ω, whose i-th column is defined as Ωi=j=1niωi,j1(gi,j>0)/j=1ni1(gi,j>0). Here ωi,j is a nv × 1 vector, whose l-th element is the estimated probability (bi,j(sl)=1). By adopting the NMF method, the approximation of matrix Ω is achieved by minimizing the following loss function:

ΩMCF, subject to 0M1,0C, (14)

where ‖ · ‖F is the Frobenius matrix norm, M is an nv × k matrix, C is a k × n* matrix, and k is the number of clusters. The computed matrix C gives the cluster membership, i.e., the i-th patient belongs to the l-th cluster if the element Cl,i > Cl′,i for all l′ ≠ l, while the computed matrix M gives the subgroup level statistical disease mapping, i.e., the l-th column in M represents the disease mapping of the l-th cluster. In addition, the consensus matrix (Brunet et al., 2004; Kim and Park, 2007) is adopted here to determine the number of clusters for both simulation studies and real data analysis. The consensus matrix Λ is defined as the average connectivity matrix whose entry Λi,i reflects the probability that subjects i and i′ belong to the same cluster. To measure the dispersion of a consensus matrix, the dispersion coefficient is defined as ρΛ=i=1n*i=1n*4(Λi,i0.5)2. After obtaining ρΛ values for various k, we can determine the number of clusters from the maximal ρΛ.

3. Experiments and results

3.1. OAI data description

OAI Data and/or research tools used in the preparation of this manuscript were obtained and analyzed from the controlled access datasets distributed from the Osteoarthritis Initiative (OAI). OAI is a collaborative informatics system created by the National Institute of Mental Health and the National Institute of Arthritis, Musculoskeletal and Skin Diseases (NIAMS) to provide a worldwide resource to quicken the pace of biomarker identification, scientific investigation and OA drug development. Dataset identifier(s): [NIMH Data Archive Collection ID(s) or NIMH Data Archive Digital Object Identifier (DOI)]. If the Research Project involves collaboration with Submitters or NIH staff (as indicated in the DUC), then Recipient will acknowledge Submitters or NIH staff as co-authors, if appropriate, on any presentation, disclosure, or publication.

We consider the longitudinal MR image data from the OAI study, which comprises normal controls and individuals with, or at high risk of, knee OA incidence or progression. The OAI study contains a total of 4,796 men and women at ages 4579 enrolled between 2004 and 2006 (Eckstein et al., 2012). Those subjects consist of a progression sub-cohort (1,389), an incidence sub-cohort (3,285), and a normal control unexposed reference sub-cohort (122) (Peterfy et al., 2008; Nevitt et al., 2006). Further details about OAI can be found on http://www.oai.ucsf.edu/.

For the MR images, four 3.0 Tesla Siemens Trio MRI scanners were dedicated to imaging the OAI participants at baseline, 12, 24, 36, 48, 72 and 96 months. All images have uniform dimension of 384 × 384 × 160 and resolution of 0.36 × 0.36 × 0.7mm3 per voxel. After preprocessing, we have 19, 915 left knee FC/TC thickness maps from 4, 343 subjects and 21, 229 right knee FC/TC thickness maps from 4, 339 subjects. The demographic and KLG information of the OAI dataset used here is presented in Table 1.

Table 1:

Demographic (gender, BMI, and age) and KLG information of patients at baseline and overall visits.

Left knee Right knee
Baseline Overall Baseline Overall
Gender (F/M) 2,517/1,826 - 2,522/1,817 -
BMI (kg/m2) 28.6 ± 4.8 28.3 ± 4.8 28.6 ± 4.8 28.3 ± 4.8
Age (years) 61.1 ± 9.2 63.2 ± 9.2 61.1 ± 9.2 63.3 ± 9.2
KLG=0 1,729 8,339 1,627 8,223
KLG=1 759 3,640 762 3,914
KLG=2 1,111 4,697 1,214 5,404
KLG=3 603 2,491 587 2,889
KLG=4 141 748 149 799
Total 4,343 19,915 4,339 21,229

3.2. Simulation Studies

We examine the finite sample performance of our DADP framework. Here the simulation data was generated based on the preprocessed 2D thickness maps of normal controls in the OAI study. First, we fitted the following model to the left knee FC thickness maps from normal controls,

yi,j(s)=β1(s)+β2(s)× gender +β3(s)× BMI +β4(s)×log( Age )+γ1(s)+γ2(s)×BMI+γ3(s)×log( Age )+ηi,j(s)+ϵi,j(s), (15)

where the assumptions on random effects {γl(s)}l=13, individual stochastic term ηi,j(s), and measurement error ϵi,j(s) are the same as those in model (2). Here both BMI and logarithm transformed age information are normalized through the Z-transformation. Then, the obtained parameter estimators of {β(s),Σγ(s,s),ση2(s,s),σϵ2(s)} were used as the true values in model (2). In addition, the true values of elements in α were independently generated from U(−0.3, 0) and wi,j = (1, normalized BMI, normalized log(Age)).

On the other hand, the individual abnormal regions were pre-specified according to three types of abnormal region patterns based on the statistical disease mapping learned from the real data (see Fig. 7). Specifically, we first selected two seed points with high probability in the population level statistical disease mapping. For each observation with abnormal region pattern 1, one abnormal region is randomly generated with its center around seed point 1; for each observation with abnormal region pattern 2, one abnormal region is randomly generated with its center around seed point 2; and for each observation with abnormal region pattern 3, two abnormal regions are randomly generated with their centers around two seed points respectively. In addition, the sizes of the abnormal region at baseline and follow-up visits vary across subjects. Finally, we generated 624 thickness maps from 160 subjects with 100 normal controls and 60 diseased patients equally divided into 3 groups according to their abnormal region patterns with 3 or 4 observations for each subject, and the average signal-to-noise ratio (SNR) is 11.44.

Fig. 7:

Fig. 7:

Illustrations of three predefined abnormal region patterns (background is annotated in dark blue; normal region in light blue; abnormal region in yellow). Two seed points are pre-selected according to the population level statistical disease mapping learned from the left knee FC 2D thickness maps. Pattern 1: one abnormal region is randomly generated around seed point 1; Pattern 2: one abnormal region is randomly generated around seed point 2; Pattern 3: two abnormal regions are randomly generated from two seed points respectively. Examples are presented for the three predefined abnormal region patterns.

First, we fit DFMEM in our DADP with the simulated data and investigate the estimation accuracy of all the functional coefficients βl(s), l = 1, 2, 3, 4. The mean absolute error (MAE), defined as k=1nv|β^l(sk)βl(sk)|/nv, is calculated and reported in Table 2. For comparison, we consider the spatial random effects model (SREM, Zhu et al. (2007); Huang et al. (2015)) integrated with DCRFM, and the corresponding MAEs are reported in Table 2 as well. The key difference between SREM and our DFMEM is that the smoothness of both the thickness maps and functional coefficients are considered in DFMEM rather than SREM. According to Table 2, the estimation accuracy based on our DADP outperforms that based on SREM in terms of MAE, showing the power of functional data analysis tools used in our DADP.

Table 2:

Estimation accuracy of functional coefficients {βl(s)}l=14 for DADP and SREM in terms of MAE, defined as k=1nv|β^l(sk)βl(sk)|/nv.

Method β1(s) β2(s) β3(s) β4(s)
SREM 0.0452 0.0682 0.0323 0.0322
DADP 0.0134 0.0079 0.0048 0.0051

Second, we apply our DADP to detect the abnormal regions for each subject at each time point. For three randomly selected abnormal subjects, the detection results are presented in Fig. 8. It can be seen that the abnormal regions for subjects at each time point can be successfully detected, while the detection performance at follow-up visits is better than that at baseline. The possible reason is that the abnormal regions for subjects at baseline are small and the additional effect in the abnormal region is relatively weak. We also apply the K-means clustering method, SREM+K (SREM integrated with the K-means), and GHMM Huang et al. (2015) to the simulated data. Specifically, the K-means clustering method is the initialization procedure in the statistical modeling module of DADP, and GHMM is an integration of SREM with a Potts model. Compared to the alternative approaches, our DADP has three unique features including (i) the smoothness property of thickness maps, (ii) the dynamic pattern of individual abnormal regions, and (iii) the connection between the individual abnormal region pattern and population level one. For comparison, some metrics, including adjusted Rand index (Hubert and Arabie, 1985), Dice similarity coefficient (Carass et al., 2020), Sensitivity, and Specificity, are considered to evaluate the detection performance of all the four methods for each thickness map, and the mean performance metrics across all thickness maps are reported in Table 3. It can be found that, our DADP outperforms the other three methods in terms of all the performance metrics, indicating that all three features in DADP enhance its performance in terms of dynamic abnormal region detection.

Fig. 8:

Fig. 8:

Detected abnormal regions for four randomly selected subjects including three patients with different abnormal region patterns (background in dark blue; normal region in light blue; abnormal region in yellow).

Table 3:

Detection performance for DADP and three other methods (K-means, GHMM, and SREM+K) in terms of adjusted Rand index (ARI), Dice similarity coefficient (DSC), Sensitivity (SEN), and Specificity (SPE). Their corresponding standard errors are presented in parentheses.

ARI DSC SEN SPE
K-means 0.46 (0.24) 0.47 (0.19) 0.69 (0.29) 0.70 (0.22)
SREM+K 0.61 (0.19) 0.64 (0.14) 0.77 (0.16) 0.75 (0.15)
GHMM 0.75 (0.22) 0.77 (0.16) 0.88 (0.12) 0.86 (0.10)
DADP 0.80 (0.18) 0.81 (0.10) 0.98 (0.09) 0.95 (0.01)

Next, we investigate the influences of segmentation accuracy on the performance of dynamic abnormal region detection. To mimic the inaccurate segmentation results, we generated the thickness maps from a misspecified model, where the disease related random effect component was constructed as bi,j(s)[wi,jα+δi,j(s)] and δi,j(s)~ iid N(0,0.01). Here δi,j(s) is introduced to capture the potential voxel-wised perturbation caused by the inaccurate segmentation results. The detection performance of all the four methods in terms of ARI is reported in Table 4. It can be found that, although the ARI of our DADP decreases compared to that in Table 3 (i.e., the “perfect segmentation” scenario), our DADP still achieves reasonable detection performance and outperforms the other three methods when the model is misspecified, which means that our DADP is not sensitive to the accuracy of cartilage segmentation results.

Table 4:

Detection performance for DADP with model misspecification and three other methods (K-means, GHMM, and SREM+K) in terms of adjusted Rand index (ARI). Their corresponding standard errors are presented in parentheses.

K-means SREM+K GHMM DADP
0.45 (0.28) 0.58 (0.18) 0.71 (0.24) 0.77 (0.20)

Finally, the statistical disease mapping at the population level is presented in Fig. 9. It can be found that the regions around these two selected seed points are much more likely to be affected by the disease, which is in agreement with the simulation mechanism of abnormal regions. In addition, based on the detected abnormal regions, we conduct the subgroup analysis and present the clustering results in Table 5. It can be found that subjects with abnormal region pattern 2 are successfully identified by subgroup 2, whereas all patients with abnormal region pattern 1 are clustered into the other two subgroups. However, the subjects with abnormal region pattern 3 is not well detected due to two reasons. First, pattern 3 has some overlaps with other two patterns in terms of the design mechanism as shown in Fig. 7. Second, although the mechanisms for the generation of the three patterns are not exactly the same, the simulated abnormal region with pattern 3 may be similar to that with other patterns at baseline or some early time points. For example, the simulated disease region with pattern 3 is almost the same as that with pattern 2 at baseline in Fig. 7. Moreover, the corresponding statistical disease mapping at the subgroup level is also presented in Fig. 9, which successfully recovers the predefined abnormal region patterns. Thus, our DADP provides reasonable subgroup analysis in terms of the abnormal region patterns.

Fig. 9:

Fig. 9:

Statistical disease mapping at both population level and subgroup level.

Table 5:

Subgroup analysis based on DADP: relationship between clustering results and abnormal region patterns.

Subgroup 1 Subgroup 2 Subgroup 3
Pattern 1 11 0 6
Pattern 2 0 24 0
Pattern 3 7 3 9

3.3. Real data analysis

First, we consider DFMEM (2) with xi,j = (1, gender, normalized BMI, normalized log(Age)), zi,j = (1, normalized BMI, normalized log(Age)), and wi,j = zi,j. Then, we fit the proposed model on the thickness maps extracted from the left knee FC/TC and the right knee FC/TC, respectively. For the left knee FC, the estimates of coefficient functions associated to the covariates including gender, normalized BMI and normalized log(age) are presented in Fig. 10. In order to test how different covariates locally affect the cartilage thickness, the pixel-wise Wald test statistics in (10) are calculated, and the FDR adjusted − log10 p-values across all pixels are shown in Fig. 10. For the other three cartilages, the estimates of coefficient functions and corresponding FDR adjusted − log10 (p)-values can be found in Appendix G. According to the adjusted p-value maps, compared to the gender effect, both BMI and age effects are more significant on both medial and lateral regions in all four different cartilages, which is consistent with findings in the existing literature (Cai et al., 2019). Meanwhile, the global effects of these covariates on thickness maps are also investigated. The global test statistics in (11) along with the p-values are calculated and reported in Table 6. According to the global p-values, the overall BMI effect is found to be more significant than the age effect on both left knee FC and right knee FC, while the overall age effect is more significant than the BMI effect on both left knee TC and right knee TC. In addition to the coefficient functions, the inference results of α are presented in Table 7. Both BMI and age effects are found significant on the abnormal regions for all the four cartilages.

Fig. 10:

Fig. 10:

Left knee FC: estimates of functional coefficients associated to the covariates including gender, normalized BMI, and normalized log(Age) (top); corresponding FDR adjusted − log10 p-value maps (bottom).

Table 6:

Inference on coefficient functions β(s): global test statistics and p-values (in parentheses) of three covariates including gender, BMI, and log(Age).

Cartilage Gender BMI (kg/m2) log(Age) (years)
Left knee FC 1.02 (0.45) 31.06 (2.3E-3) 11.43 (2.1E-2)
Left knee TC 2.13 (0.17) 20.01 (8.9E-3) 30.12 (9.8E-4)
Right knee FC 0.85 (0.66) 28.41 (3.6E-3) 13.07 (1.0E-2)
Right knee TC 1.69 (0.33) 20.59 (6.2E-3) 31.21 (1.0E-3)

Table 7:

Inference on coefficients α: estimates associated to three covariates including gender, BMI, and log(Age); and p-values (in parentheses).

Cartilage BMI (kg/m2) log(Age) (years)
Left knee FC −0.15 (0.044) −0.24 (0.012)
Left knee TC −0.15 (0.046) −0.26 (0.009)
Right knee FC −0.18 (0.032) −0.30 (0.004)
Right knee TC −0.17 (0.038) −0.27 (0.009)

Next, the detected abnormal regions on left knee FC for randomly selected two patients are plotted in Fig. 11. Detection results on other cartilages can be found in Appendix H. The corresponding baseline age, baseline BMI, KLG scores, and time points are also presented. It can be found that the abnormal regions vary across subjects and time points and the inclusion assumption is satisfied in the detection results. Besides the detected individual abnormal regions, the population level statistical disease mapping for different cartilages are calculated and reported in Fig. 12. It can be found that, regions with high probability affected by OA are similar between the left knee and the right knee. This symmetry has also been discovered in the existing studies (Metcalfe et al., 2012).

Fig. 11:

Fig. 11:

Left knee FC: detected abnormal regions for two randomly selected patients with observations at baseline, 12 months, 24 months, and 48 months: one female (baseline age: 57 years, baseline BMI: 38.6 kg/m2, KLG score at baseline is 2, and 3 at follow-up visits) and one male (baseline age: 74 years, baseline BMI: 31.1 kg/m2, KLG score is 2 at all four time points).

Fig. 12:

Fig. 12:

Population level statistical disease mapping for different cartilages: left knee FC (top left), left knee TC (top right), right knee FC (bottom left), and right knee TC (bottom right).

Finally, the subgroup analysis is conducted based on the detected individual abnormal regions. The clustering results are reported in Table 8, in which the KLG information is reported as well to build up the relationship between clustering results and clinical outcomes. According to the clustering results, 78.9% observations at early stage (KLG score 1) are clustered into subgroup 3, 85.5% observations at mild stage (KLG score 2) are clustered into subgroup 1, and 89.0% observations at moderate and severe stages (KLG score 3 or 4) are clustered into subgroups 1 and 2. Similar clustering results can be found on the right knee FC. Specifically, 80.8% observations at early stage (KLG score 1) are clustered into subgroup 3, 84.3% observations at mild stage (KLG score 2) are clustered into subgroup 1, and 73.7% observations at moderate and severe stages (KLG score 3 or 4) are clustered into subgroups 1 and 2. Therefore, the individual abnormal regions on both left knee FC and right knee FC can be treated as useful biomarkers in identifying OA patients at early stage. Moreover, the statistical disease maps at the subgroup level are presented in Fig. 13. It can be found that, for all the different cartilages, each subgroup has its unique statistical disease map, which may be treated as a new imaging biomarker for OA early prevention and treatment development.

Table 8:

Subgroup analysis based on the detected individual abnormal regions: clustering results are summarized according to the KLG scores. Three subgroups are detected for left knee FC and right knee FC, while two subgroups are detected for left knee TC and right knee TC.

Cartilage Subgroup KLG
1 2 3 4
Left knee FC 1 484 3931 713 361
2 256 326 1449 286
3 2777 342 179 168
Left knee TC 1 2833 2351 514 405
2 766 2266 1879 401
Right knee FC 1 597 4594 1220 334
2 207 441 700 188
3 3387 415 692 181
Right knee TC 1 2981 2808 701 405
2 1003 2524 2109 354

Fig. 13:

Fig. 13:

Subgroup level statistical disease mapping for different cartilages: left knee FC (top left), left knee TC (top right), right knee FC (bottom left), and right knee TC (bottom right).

4. Conclusions & Discussions

In this work, a dynamic abnormal region detection framework, including an image analysis module and a statistical modeling module, has been proposed for quantitative cartilage analysis. In the image analysis module, the spatial correspondence has been well established on 2D cartilage thickness maps extracted from 3D knee MR images through a series of image analysis steps. In the statistical modeling module, a dynamic functional mixed effects model has been proposed. Specifically, the relationship between cartilage thicknesses and covariates of interest has been characterized through functional regression analysis tools, the spatial smoothness and spatial-temporal correlation within the thickness map has been established through functional data analysis tools, and the spatial-temporal heterogeneity in abnormal regions has been detected through dynamic conditional random field models. In addition to the subject-specific dynamic abnormal regions, the population-level statistical disease mapping has been derived, and subgroup analysis has also been conducted through the non-negative matrix factorization method. The proposed framework has been assessed through both simulation studies and real data analysis from the OAI study. It shows that our method outperforms other existing approaches in detecting the subject-specific dynamic abnormal regions and in providing population-level statistical disease mapping and subgroup analysis. All the results derived from our method bring new insights in OA prediction and clinical practice.

Several important issues need to be addressed in our future research. First, the estimation procedure in our DADP can be treated as an iterative approach, and the performance strongly depends on the initialization of abnormal regions and parameter estimation. “Bad” initial values could make the whole algorithm converge to some local maxima or even collapse. To address this issue, a Bayesian framework (Jin et al., 2016; Guo et al., 2020) can be considered here instead, where the estimated parameters can be derived from the posterior distribution via introducing proper prior information, and the latent abnormal regions can be detected via the Monte Carlo Markov Chain (MCMC) sampling method.

Second, in order to detect the subject-specific dynamic abnormal regions, a DCRFM is adopted in this manuscript. Although it works well in our simulated dataset, the simple one-layer latent structure in DCRFM would be inefficient once the spatio-temporal heterogeneity is more complicated within the imaging dataset. To address this issue, the idea of attention-based deep multiple instance learning (Ilse et al., 2018) can be considered and extended to longitudinal datasets. Specifically, both the individual abnormal regions and statistical disease mapping at the subgroup level can be learned through the estimated attention weights.

Third, our current subgroup analysis is conducted via applying the NMF method on the average probability that each pixel belongs to the abnormal region across time, where the dynamic abnormality pattern could not be perfectly characterized. It is of great interest to develop some cartilage thickness progression based features that can integrate the abnormality information to better capture the dynamic abnormality pattern.

Highlights.

  • We propose a DADP framework for longitudinal knee MR image analysis

  • A DL pipeline is proposed to extract 2D cartilage thickness maps from 3D images

  • A DFMEM is proposed for dynamic abnormality detection and progression

  • Statistical inferences are introduced for both association and subgroup analysis

  • Statistical disease mapping is conducted at both population and subgroup level

Appendix A. Covariance function Σyi(s,s)

Define ηi(s)=(ηi,1(s),,ηi,ni(s)) and ϵi(s)=(ϵi,1(s),,ϵi,ni(s)). Then the covariance function Σyi(s,s), s′) given bi(s) can be derived as

ZiCov[γi(s),γi(s)]Zi+Cov[ηi(s),ηi(s)]+Cov[ϵi(s),ϵi(s)]. (A.1)

According to the assumptions on γi(s), ηi,j(s), and ϵi,j(s), we have Cov[γi(s), γi(s′)] = Σγ(s, s′), Cov[ηi(s),ηi(s)]=ση2(s,s)Ini, and Cov[ϵi(s),ϵi(s)]=σϵ2(s)1(s=s)Ini. Therefore, the covariance structure of yi(s) given bi(s) in (3) holds.

Appendix B. Key steps in estimation procedure

Update θ^(t).

Given the current detected abnormal regions and estimated parameters in part (ii), model (2) can be written as

y˜i,ja(s)=xi,jβ(s)+zi,jγi(s)+ηi,j(s)+ϵi,j(s), (B.1)

where y˜i,ja(s)=yi,j(s)b^i,j(t)(s)wi,jα^(t). Then, model (B.1) can be treated as a spatiotemporal (or functional) linear mixed effects model (Bernal-Rusiel et al., 2013; Yuan et al., 2014; Hyun et al., 2016; Zhu et al., 2019). The updating procedure for θ can be summarized in three key steps: (i) derive an initial estimate of β(s) by the local linear regression method (Fan and Gijbels, 1996), while ignoring the temporal structure among repeated observations from one single subject; (ii) derive the updated estimates, Σ^γ(t+1)(s,s), σ^η2(t+1)(s,s), and σ^η2(t+1)(s,s), respectively, by the local constant method (Ramsay and Silverman, 2006) given the updated estimate β^(t+1)(s); and (iii) refine the estimated β^(t+1)(s) by incorporating the temporal-spatial structure obtained from the estimated covariance functions. The detailed updating procedure can be found in Appendix C.

Update α^(t).

Let Xi=(xi,1,,xi,ni) and W^i(sk)=Widiag(b^i(t)(sk)), where Wi=(wi,1,,wi,ni) and diag(b^i(t)(sk)) returns a diagonal matrix with b^i(t)(sk)s' on the main diagonal. Then, the estimate of α can be updated via minimizing the weighted least squares function

i=1nk=1nvyi(sk)Xiβ^(t+1)(sk)W^i(sk)αΣ^i,sk2, (B.2)

where Σ^i,sk=ZiΣ^γ(t+1)(sk,sk)Zi+σ^η2(t+1)(sk,sk) and the norm operator αΣ^i,sk is defined as αΣ^i,sk1α.

Update b^i(t)(s).

Here the detected abnormal region pattern at both individual level and population level are updated based on the MRF-MAP approach, which is an efficient method in latent variable recovering and adopted in many practical applications such as image segmentation (Zhang et al., 2001; Nie et al., 2009; Ahmadvand et al., 2017). Specifically, the population level abnormal region pattern b^0(t)(s) is updated via minimizing the following loss function

κgi,j>0U(b0(s),b^i,j(t)(Ds))+τn*U(b0(s),b^0(t)(Ns)), (B.3)

while the individual abnormal region pattern is updated via minimizing the following objective function

Ei(s)ln p(bi(s)b^i(t)(Ns),b^0(t)(Ds),τ^(t),κ^(t),ρ^(t)), (B.4)

where Ei(s)=12yi(s)Xiβ^(t+1)(s)Widiag(bi(s))α^(t+1)Σˇi,s2 and Σˇi,s=Σ^i,s+σ^ϵ2(t+1)(s)Ini. Furthermore, the iterated conditional modes (ICM) algorithm (Besag, 1986; Zhang et al., 2001), which uses a greedy iterative strategy for optimization, is adopted to obtain the optimal solutions to (B.3) and (B.4).

Update τ^(t), κ^(t), and ρ^(t).

Since τ, κ, and ρ are not the primary parameters of interest, we use an approximate but computationally efficient method based on a pseudo-likelihood function (Huang et al., 2015). A key advantage of using the pseudo-likelihood function is that it does not involve the intractable partition function. The pseudo-likelihood at the (t +1)-th iteration, denoted as PL(κ, τ, ρ), is a simple product of the conditional likelihood

i=1n*sSSp(b^i(t+1)(s)b^i(t+1)(Ns),b^0(t+1)(Ds),τ,κ,ρ)bi(s)p(bi(s)b^i(t+1)(Ns),b^0(t+1)(Ds),τ,κ,ρ), (B.5)

where S denotes the set of points at the boundaries of S. Thus, the estimates {τ^(t+1),κ^(t+1),ρ^(t+1)} can be obtained by maximizing the log-pseudo-likelihood function ln PL(τ, κ, ρ) through certain numerical algorithm, e.g., Newton-Raphson method.

Appendix C. Key steps in updating the estimate of θ

Step I.

We first ignore the temporal structure among repeated observations from one single subject and then derive an initial estimate of β(s). Specifically, based on the local linear regression method (Fan and Gijbels, 1996), the initial estimate is obtained via minimizing the following weighted least squares function

i,j,k(y˜i,ja(sk)xi,j(β(s)+β.(s)(sks)))2KH(sks), (C.1)

where KH(s) = |H|−1K(H−1s), K(s) is the Gaussian kernel function, and H is the positive definite bandwidth matrix.

Step II.

Given the updated estimate β^(t+1)(s) from (C.1), we first calculate the residuals

v^i(s)=yi(s)Xiβ^(t+1)(s),i=1,,n. (C.2)

Then, let σ02(s,s)ση2(s,s)+σϵ2(s)1(s=s), and the estimates Σ˜γ(s,s) and σ˜02(s,s) can be derived by minimizing the following least squares function:

i=1nv^i(s)v^i(s)ZiΣγ(s,s)Ziσ02(s,s)IniF2. (C.3)

Next, by adopting the local constant method (Ramsay and Silverman, 2006), the updated estimates Σ^γ(t+1)(s,s) and σ^η2(t+1)(s,s) can be, respectively, derived by minimizing the following weighted least squares functions,

k,knvΣ˜γ(sk,sk)Σγ(s,s)F2KH(sks)KH(sks), (C.4)
k,knvσ˜02(sk,sk)ση2(s,s)F2KH(sks)KH(sks), (C.5)

and the estimate σϵ2(t+1) can be consequently written as

σϵ2(t+1)(s)=(σ˜02(s,s)σ^η2(t+1)(s,s))+, (C.6)

where the operator (a)+ is defined as a · 1(a > 0).

Step III.

Let y˜ia(sk)=(y˜i,1a(sk),,y˜i,nia(sk)) and Xi=(xi,1,,xi,ni). Given the estimated covariance functions, we refine the estimate β^(t+1)(s) by minimizing the updated weighted least squares function:

i=1nk=1nvy˜ia(sk)Xi[β(s)+β.(s)(sks)]Σ^i,sk2KH(sks), (C.7)

where the norm operator αΣ^i,sk is defined as αΣ^i,sk1α, and Σ^i,sk=ZiΣ^γ(t+1)(sk,sk)Zi+σ^η2(t+1)(sk,sk). In particular, the bandwidth matrices in (C.1), (C.4), (C.5), and (C.7) can be selected by using a leave-one-curve-out cross-validation method (Zhang and Chen, 2007; Zhu et al., 2019).

Appendix D. Initialization procedure

The initialization of abnormal regions and parameter estimation plays a critical role in our iterative estimation procedure. Here, an initial estimation of θ is first derived based on observations at the normal stage. Then, given the estimated θ^(0), an initial detection of abnormal regions {b^i(0)(s),sS}i=1n is determined based on the residual maps through a conditional clustering analysis approach. Specifically, the abnormal regions for the i-th OA patient at the disease baseline are initialized via clustering analysis, such as K-means, on the residual map, whereas the abnormal regions at follow-up visits are initialized within the detected abnormal regions at the previous time point. Furthermore, the initial estimate of the population level abnormal region pattern is derived from the specified abnormal regions at disease baseline according to the majority rule. In addition to the initialization of abnormal regions, the initial estimator of α can be derived via minimizing the loss function in (B.2). Finally, for the three tuning parameters {τ, κ, ρ}, we set all of them to 0.5 as initial values, which have been found reasonable according to existing studies (Ahmadvand et al., 2017).

Appendix E. Wild bootstrap procedure for approximating the null distribution of Tβ

  1. Fit model (2) under the null hypothesis with the original data and retain the fitted values y^i,j(s)=xi,jβ^(s)+b^i,j(s)wi,jα^, individual functions f^i.j(s)=zi,jγ^i(s)+η^i,j(s), and residuals ϵ^i,j(s)=yi,j(s)y^i,j(s)f^i.j(s);

  2. Create synthetic response variables yi,j(l)(s)=y^i,j(s)+ζi(l)f^i.j(s)+ψi,j(l)(s)ϵ^i(s), where ζi(l) and {ψi,j(l)(s),sS} are random variables following standard normal distribution;

  3. Given {b^i,j(s),sS}, refit the model using the synthetic response variables and calculate the test statistic Tβ(l);

  4. Aggregate the results of Steps 2 and 3 over l = 1, …, K (K = 500) to obtain {Tβ(l)}l=1K and calculate the approximated p-value p=K1l=1K1(Tβ(l)>Tβ).

Appendix F. Wild bootstrap method in estimating the covariance of α^

  1. Fit model (2) under the null hypothesis with the original data and retain the fitted values y^i,j(s)=xi,jβ^(s)+b^i,j(s)wi,jα^, individual functions f^i.j(s)=zi,jγ^i(s)+η^i,j(s), and residuals ϵ^i,j(s)=yi,j(s)y^i,j(s)f^i.j(s);

  2. Create synthetic response variables yi,j(l)(s)=y^i,j(s)+ζi(l)f^i.j(s)+ψi,j(l)(s)ϵ^i(s), where ζi(l) and {ψi,j(l)(s),sS} are random variables following standard normal distribution;

  3. Given {b^i,j(s),sS}, refit the model using the synthetic response variables and derive the estimate α^(l);

  4. Aggregate the results of Steps 2 and 3 over l = 1, …, K (K = 500) to obtain {α^(l)}l=1K, where the empirical covariance is calculated to represent the covariance matrix of α^.

Appendix G. Inference on coefficient functions for left knee TC and right knee FC/TC

Fig. G.14:

Fig. G.14:

Left knee TC: estimates of functional coefficients associated to the covariates including gender, normalized BMI, and normalized log(age) (top); and their corresponding FDR adjusted − log10 p-value maps (bottom).

Fig. G.15:

Fig. G.15:

Right knee FC: estimates of functional coefficients associated to the covariates including gender, normalized BMI, and normalized log(age) (top); and their corresponding FDR adjusted − log10 p-value maps (bottom).

Fig. G.16:

Fig. G.16:

Right knee TC: estimates of functional coefficients associated to the covariates including gender, normalized BMI, and normalized log(age) (top); and their corresponding FDR adjusted − log10 p-value maps (bottom).

Appendix H. Detected abnormal regions for left knee TC nd right knee FC/TC

Fig. H.17:

Fig. H.17:

Left knee TC: detected abnormal regions for two randomly selected patients with observations at five time points: one male (baseline age: 57 years, baseline BMI: 23.4 kg/m2, KLG score is 2 at baseline and 12 months, and 3 at 24 months, 36 months, and 48 months) and one female (baseline age: 52 years, baseline BMI: 29.5 kg/m2, KLG score is 1 at all five time points, i.e., baseline, 24 months, 48 months, 72 months, and 96 months).

Fig. H.18:

Fig. H.18:

Right knee FC: detected abnormal regions for two randomly selected patients with observations at four time points: one male (baseline age: 64 years, baseline BMI: 25.1 kg/m2, KLG score is 1 at all the time points including baseline, 48 months, 72 months, and 96 months) and one female (baseline age: 66 years, baseline BMI: 32.5 kg/m2, KLG score is 2 at baseline, and 3 at 24 months, 36 months, and 48 months).

Fig. H.19:

Fig. H.19:

Right knee TC: detected abnormal regions for two randomly selected patients with observations at four time points including baseline, 24 months, 36 months, and 48 months: one female (baseline age: 79 years, baseline BMI: 30.5 kg/m2, KLG score is 1 at baseline, and 2 at follow-up visits) and one male (baseline age: 73 years, baseline BMI: 21.6 kg/m2, KLG score is 2 at all four time points).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Ahmadvand A, Yousefi S, Manzuri Shalmani M, 2017. A novel markov random field model based on region adjacency graph for T1 magnetic resonance imaging brain segmentation. Int. J. Imaging Syst. Techno 27, 78–88. [Google Scholar]
  2. An H, Marron J, Schwartz TA, Renner JB, Liu F, Lynch JA, Lane NE, Jordan JM, Nelson AE, 2016. Novel statistical methodology reveals that hip shape is associated with incident radiographic hip osteoarthritis among african american women. Osteoarthr. Cartil 24, 640–646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Avants BB, Epstein CL, Grossman M, Gee JC, 2008. Symmetric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain. Med. Image Anal 12, 26–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Avants BB, Tustison NJ, Stauffer M, Song G, Wu B, Gee JC, 2014. The insight toolkit image registration framework. Front. Neuroinform 8, 44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Balamoody S, Williams TG, Waterton JC, Bowes M, Hodgson R, Taylor CJ, Hutchinson CE, 2010. Comparison of 3T MR scanners in regional cartilage-thickness analysis in osteoarthritis: a cross-sectional multicenter, multivendor study. Arthritis Res. Ther 12, 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bernal-Rusiel JL, Reuter M, Greve DN, Fischl B, Sabuncu MR, 2013. Spatiotemporal linear mixed effects modeling for the mass-univariate analysis of longitudinal neuroimage data. NeuroImage 81, 358 – 370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Besag J, 1986. On the statistical analysis of dirty pictures. J. R. Stat. Soc. Ser. B Methodol 48, 259–302. [Google Scholar]
  8. Bowes MA, Kacena K, Alabas OA, Brett AD, Dube B, Bodick N, Conaghan PG, 2021. Machine-learning, mri bone shape and important clinical outcomes in osteoarthritis: data from the osteoarthritis initiative. Ann. Rheum. Dis 80, 502–508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Braun HJ, Gold GE, 2012. Diagnosis of osteoarthritis: imaging. Bone 51, 278–288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Brunet JP, Tamayo P, Golub TR, Mesirov JP, 2004. Metagenes and molecular pattern discovery using matrix factorization. Proc. Natl. Acad. Sci. U.S.A 101, 4164–4169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cai G, Jiang M, Cicuttini F, Jones G, 2019. Association of age, sex and bmi with the rate of change in tibial cartilage volume: a 10.7-year longitudinal cohort study. Arthritis Res. Ther 21, 273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Carass A, Roy S, Gherman A, Reinhold JC, Jesson A, Arbel T, Maier O, Handels H, Ghafoorian M, Platel B, et al. , 2020. evaluating white matter lesion segmentations with refined sørensen-dice analysis. Sci. Rep 10, 1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chen D, Shen J, Zhao W, Wang T, Han L, Hamilton JL, Im HJ, 2017. Osteoarthritis: toward a comprehensive understanding of pathological mechanism. Bone Res. 5, 1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cicuttini F, Hankin J, Jones G, Wluka A, 2005. Comparison of conventional standing knee radiographs and magnetic resonance imaging in assessing progression of tibiofemoral joint osteoarthritis. Osteoarthr. Cartil 13, 722–727. [DOI] [PubMed] [Google Scholar]
  15. Conaghan P, Hunter D, Maillefert JF, Reichmann W, Losina E, 2011. Summary and recommendations of the oarsi fda osteoarthritis assessment of structural change working group. Osteoarthr. Cartil 19, 606–610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Crema MD, Felson DT, Roemer FW, Wang K, Marra MD, Nevitt MC, Lynch JA, Torner J, Lewis CE, Guermazi A, 2013. Prevalent cartilage damage and cartilage loss over time are associated with incident bone marrow lesions in the tibiofemoral compartments: the most study. Osteoarthr. Cartil 21, 306–313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dalca AV, Balakrishnan G, Guttag J, Sabuncu MR, 2018. Unsupervised learning for fast probabilistic diffeomorphic registration, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer. pp. 729–738. [Google Scholar]
  18. Deufemia V, Risi M, Tortora G, 2014. Sketched symbol recognition using latent-dynamic conditional random fields and distance-based clustering. Pattern Recognit. 47, 1159–1171. [Google Scholar]
  19. Ding C, He X, Simon HD, 2005. On the equivalence of nonnegative matrix factorization and spectral clustering, in: Proceedings of the 2005 SIAM International Conference on Data Mining, SIAM. pp. 606–610. [Google Scholar]
  20. Eckstein F, Cicuttini F, Raynauld JP, Waterton JC, Peterfy C, 2006. Magnetic resonance imaging (mri) of articular cartilage in knee osteoarthritis (oa): morphological assessment. Osteoarthr. Cartil 14, 46–75. [DOI] [PubMed] [Google Scholar]
  21. Eckstein F, Wirth W, Nevitt MC, 2012. Recent advances in osteoarthritis imagingthe osteoarthritis initiative. Nat. Rev. Rheumatol 8, 622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Fan J, Gijbels I, 1996. Local Polynomial Modelling and Its Applications. Chapman and Hall, London. [Google Scholar]
  23. Felson DT, Lawrence RC, Dieppe PA, Hirsch R, Helmick CG, Jordan JM, Kington RS, Lane NE, Nevitt MC, Zhang Y, et al. , 2000. Osteoarthritis: new insights. part 1: the disease and its risk factors. Ann. Intern. Med 133, 635–646. [DOI] [PubMed] [Google Scholar]
  24. Felson DT, Niu J, Guermazi A, Sack B, Aliabadi P, 2011. Defining radiographic incidence and progression of knee osteoarthritis: suggested modifications of the kellgren and lawrence scale. Ann. Rheum. Dis 70, 1884–1886. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Felson DT, Niu J, Yang T, Torner J, Lewis CE, Aliabadi P, Sack B, Sharma L, Guermazi A, Goggins J, et al. , 2013. Physical activity, alignment and knee osteoarthritis: data from most and the oai. Osteoarthr. Cartil 21, 789–795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Glyn-Jones S, Palmer A, Agricola R, Price A, Vincent T, Weinans H, Carr A, 2015. Osteoarthritis. Lancet 386, 376–387. [DOI] [PubMed] [Google Scholar]
  27. Green PJ, Richardson S, 2002. Hidden markov models and disease mapping. J. Am. Stat. Assoc 97, 1055–1070. [Google Scholar]
  28. Guermazi A, Hunter DJ, Li L, Benichou O, Eckstein F, Kwoh CK, Nevitt M, Hayashi D, 2012. Different thresholds for detecting osteophytes and joint space narrowing exist between the site investigators and the centralized reader in a multicenter knee osteoarthritis studydata from the osteoarthritis initiative. Skelet. Radiol 41, 179–186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Guermazi A, Zaim S, Taouli B, Miaux Y, Peterfy CG, Genant HK, 2003. Mr findings in knee osteoarthritis. Eur. Radiol 13, 1370–1386. [DOI] [PubMed] [Google Scholar]
  30. Guo C, Kang J, Johnson TD, 2020. A spatial bayesian latent factor model for image-on-image regression. Biometrics. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gupta S, Hawker G, Laporte A, Croxford R, Coyte P, 2005. The economic burden of disabling hip and knee osteoarthritis (oa) from the perspective of individuals living with this condition. Rheumatology 44, 1531–1537. [DOI] [PubMed] [Google Scholar]
  32. Hootman JM, Helmick CG, Barbour KE, Theis KA, Boring MA, 2016. Updated projected prevalence of self-reported doctor-diagnosed arthritis and arthritis-attributable activity limitation among us adults, 2015–2040. Arthritis Rheumatol. 68, 1582–1587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Huang C, Shan L, Charles HC, Wirth W, Niethammer M, Zhu H, 2015. Diseased region detection of longitudinal knee magnetic resonance imaging data. IEEE Trans. Med. Imag 34, 1914–1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Hubert L, Arabie P, 1985. Comparing partitions. J. Classif 2, 193–218. [Google Scholar]
  35. Hyun JW, Li Y, Huang C, Styner M, Lin W, Zhu H, Initiative ADN, et al. , 2016. Stgp: Spatio-temporal gaussian process models for longitudinal neuroimaging data. NeuroImage 134, 550–562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Ilse M, Tomczak J, Welling M, 2018. Attention-based deep multiple instance learning, in: International Conference on Machine Learning, PMLR. pp. 2127–2136. [Google Scholar]
  37. Jafarzadeh SR, Felson DT, 2017. Corrected estimates for the prevalence of self-reported doctor-diagnosed arthritis among us adults. Arthritis Rheumatol. 69, 1701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Jin IH, Yuan Y, Bandyopadhyay D, 2016. A bayesian hierarchical spatial model for dental caries assessment using non-gaussian markov random fields. Ann. Appl. Stat 10, 884–905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Karim MR, Jiao J, Döhmen T, Cochez M, Beyan O, Rebholz-Schuhmann D, Decker S, 2021. Deepkneeexplainer: Explainable knee osteoarthritis diagnosis from radiographs and magnetic resonance imaging. IEEE Access 9, 39757–39780. [Google Scholar]
  40. Kellgren J, Lawrence J, 1957. Radiological assessment of osteo-arthrosis. Ann. Rheum. Dis 16, 494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kijowski R, Demehri S, Roemer F, Guermazi A, 2020. Osteoarthritis year in review 2019: imaging. Osteoarthr. Cartil 28, 285–295. [DOI] [PubMed] [Google Scholar]
  42. Kim H, Park H, 2007. Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis. Bioinformatics 23, 1495–1502. [DOI] [PubMed] [Google Scholar]
  43. Liu R, Zhu H, for the Alzheimer’s Disease Neuroimaging Initiative, 2021. Statistical disease mapping for heterogeneous neuroimaging studies. Can. J. Stat 49, 10–34. [Google Scholar]
  44. Lorensen WE, Cline HE, 1987. Marching cubes: A high resolution 3d surface construction algorithm. Comput. Graph. (ACM) 21, 163–169. [Google Scholar]
  45. Metcalfe AJ, Andersson ML, Goodfellow R, Thorstensson CA, 2012. Is knee osteoarthritis a symmetrical disease? analysis of a 12 year prospective cohort study. BMC Musculoskelet. Disord 13, 153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Modat M, Cash DM, Daga P, Winston GP, Duncan JS, Ourselin S, 2014. Global image registration using a symmetric block-matching approach. J. Med. Imaging 1, 024003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Modat M, Ridgway GR, Taylor ZA, Lehmann M, Barnes J, Hawkes DJ, Fox NC, Ourselin S, 2010. Fast free-form deformation using graphics processing units. Comput. Methods Programs Biomed 98, 278–284. [DOI] [PubMed] [Google Scholar]
  48. Nevitt M, Felson D, Lester G, 2006. The osteoarthritis initiative. Protocol for the Cohort Study 1. [Google Scholar]
  49. Nie J, Xue Z, Liu T, Young GS, Setayesh K, Guo L, Wong ST, 2009. Automated brain tumor segmentation using spatial accuracy-weighted hidden markov random field. Comput. Med. Imaging Graph 33, 431–441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Niethammer M, Kwitt R, Vialard FX, 2019. Metric learning for image registration, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8463–8472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Norman B, Pedoia V, Majumdar S, 2018. Use of 2d u-net convolutional neural networks for automated cartilage and meniscus segmentation of knee mr imaging data to determine relaxometry and morphometry. Radiology 288, 177–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Pedoia V, Gallo MC, Souza RB, Majumdar S, 2017. Longitudinal study using voxel-based relaxometry: Association between cartilage t1ρ and t2 and patient reported outcome changes in hip osteoarthritis. J. Magn. Reson. Imaging 45, 1523–1533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Pedoia V, Lee J, Norman B, Link TM, Majumdar S, 2019. Diagnosing osteoarthritis from t2 maps using deep learning: an analysis of the entire osteoarthritis initiative baseline cohort. Osteoarthr. Cartil 27, 1002–1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Peterfy CG, Schneider E, Nevitt M, 2008. The osteoarthritis initiative: report on the design rationale for the magnetic resonance imaging protocol for the knee. Osteoarthr. Cartil 16, 1433–1441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Raj A, Vishwanathan S, Ajani B, Krishnan K, Agarwal H, 2018. Automatic knee cartilage segmentation using fully volumetric convolutional neural networks for evaluation of osteoarthritis, in: 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), IEEE. pp. 851–854. [Google Scholar]
  56. Ramsay J, Silverman B, 2006. Functional Data Analysis. Springer; New York. [Google Scholar]
  57. Raynauld JP, 2003. Quantitative magnetic resonance imaging of articular cartilage in knee osteoarthritis. Curr. Opin. Rheumatol 15, 647–650. [DOI] [PubMed] [Google Scholar]
  58. Roach KE, Pedoia V, Lee JJ, Popovic T, Link TM, Majumdar S, Souza RB, 2021. Multivariate functional principal component analysis identifies waveform features of gait biomechanics related to early-to-moderate hip osteoarthritis. J. Orthop. Res 39, 1722–1731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Roemer FW, Demehri S, Omoumi P, Link TM, Kijowski R, Saarakkala S, Crema MD, Guermazi A, 2020. State of the art: Imaging of osteoarthritisrevisited 2020. Radiology 296, 5–21. [DOI] [PubMed] [Google Scholar]
  60. Ronneberger O, Fischer P, Brox T, 2015. U-net: Convolutional networks for biomedical image segmentation, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer. pp. 234–241. [Google Scholar]
  61. Schaefer LF, Sury M, Yin M, Jamieson S, Donnell I, Smith SE, Lynch JA, Nevitt MC, Duryea J, 2017. Quantitative measurement of medial femoral knee cartilage volume–analysis of the oa biomarkers consortium fnih study cohort. Osteoarthr. Cartil 25, 1107–1113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Schiphof D, de Klerk BM, Kerkhof HJ, Hofman A, Koes BW, Boers M, Bierma-Zeinstra SM, 2011. Impact of different descriptions of the kellgren and lawrence classification criteria on the diagnosis of knee osteoarthritis. Ann. Rheum. Dis 70, 1422–1427. [DOI] [PubMed] [Google Scholar]
  63. Segal NA, Nevitt MC, Gross KD, Hietpas J, Glass NA, Lewis CE, Torner JC, 2013. The multicenter osteoarthritis study: Opportunities for rehabilitation research. PM&R 5, 647–654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Shen Z, Han X, Xu Z, Niethammer M, 2019. Networks for joint affine and non-parametric image registration, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4224–4233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Sutton C, McCallum A, Rohanimanesh K, 2007. Dynamic conditional random fields: Factorized probabilistic models for labeling and segmenting sequence data. J. Mach. Learn. Res 8, 693–723. [Google Scholar]
  66. Tan C, Yan Z, Zhang S, Li K, Metaxas DN, 2019. Collaborative multi-agent learning for mr knee articular cartilage segmentation, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer. pp. 282–290. [Google Scholar]
  67. Vercauteren T, Pennec X, Perchant A, Ayache N, 2008. Symmetric log-domain diffeomorphic registration: A demons-based approach, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer. pp. 754–761. [DOI] [PubMed] [Google Scholar]
  68. Vercauteren T, Pennec X, Perchant A, Ayache N, 2009. Diffeomorphic demons: Efficient non-parametric image registration. NeuroImage 45, S61–S72. [DOI] [PubMed] [Google Scholar]
  69. Wang Y, Ji Q, 2005. A dynamic conditional random field model for object segmentation in image sequences, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 264–270. [Google Scholar]
  70. Wang Y, Loe KF, Wu JK, 2006. A dynamic conditional random field model for foreground and shadow segmentation. IEEE Trans. Pattern Anal. Mach. Intell 28, 279–289. [DOI] [PubMed] [Google Scholar]
  71. Wirth W, Duryea J, Le Graverand MPH, John MR, Nevitt M, Buck R, Eckstein F, Group OII, et al. , 2013. Direct comparison of fixed flexion, radiography and mri in knee osteoarthritis: responsiveness data from the osteoarthritis initiative. Osteoarthr. Cartil 21, 117–125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Wirth W, Eckstein F, 2008. A technique for regional analysis of femorotibial cartilage thickness based on quantitative magnetic resonance imaging. IEEE Trans. Med. Imag 27, 737–744. [DOI] [PubMed] [Google Scholar]
  73. Xu Z, Shen Z, Niethammer M, 2018. Contextual additive networks to efficiently boost 3d image segmentations, in: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Springer, pp. 92–100. [PMC free article] [PubMed] [Google Scholar]
  74. Yekutieli D, Benjamini Y, 1999. Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics. J. Stat. Plan. Inference 82, 171–196. [Google Scholar]
  75. Yin J, Hu DH, Yang Q, 2009. Spatio-temporal event detection using dynamic conditional random fields, in: Proceedings of the 21st International Jont Conference on Artifical Intelligence, Morgan Kaufmann Publishers Inc.. pp. 1321–1326. [Google Scholar]
  76. Yin Y, Zhang X, Williams R, Wu X, Anderson DD, Sonka M, 2010. Logismoslayered optimal graph image segmentation of multiple objects and surfaces: cartilage segmentation in the knee joint. IEEE Trans. Med. Imag 29, 2023–2037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Yuan Y, Gilmore JH, Geng X, Martin S, Chen K, Wang J.l., Zhu H, 2014. Fmem: Functional mixed effects modeling for the analysis of longitudinal white matter tract data. NeuroImage 84, 753–764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Zhang J, Chen J, 2007. Statistical inference for functional data. Ann. Stat 35, 1052–1079. [Google Scholar]
  79. Zhang Y, Brady M, Smith S, 2001. Segmentation of brain mr images through a hidden markov random field model and the expectation-maximization algorithm. IEEE Trans. Med. Imag 20, 45–57. [DOI] [PubMed] [Google Scholar]
  80. Zhu H, Chen K, Luo X, Yuan Y, Wang JL, 2019. Fmem: Functional mixed effects models for longitudinal functional responses. Stat. Sin 29, 2007–2033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Zhu H, Gu M, Peterson B, 2007. Maximum likelihood from spatial random effects models via the stochastic approximation expectation maximization algorithm. Stat. Comput 17, 163–177. [Google Scholar]
  82. Zhu H, Kong L, Li R, Styner M, Gerig G, Lin W, Gilmore JH, 2011. Fadtts: functional analysis of diffusion tensor tract statistics. NeuroImage 56, 1412–1425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Zhu H, Li R, Kong L, 2012. Multivariate varying coefficient model for functional responses. Ann. Stat 40, 2634–2666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Zhu H, Styner M, Li Y, Kong L, Shi Y, Lin W, Coe C, Gilmore JH, 2010. Multivariate varying coefficient models for dti tract statistics, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer. pp. 690–697. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES