Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Feb 3.
Published in final edited form as: J Neurosci Methods. 2013 Oct 9;221:78–84. doi: 10.1016/j.jneumeth.2013.09.017

Application of fused lasso logistic regression to the study of corpus callosum thickness in early Alzheimer's disease

Sang H Lee a,*, Donghyeon Yu b, Alvin H Bachman a, Johan Lim b, Babak A Ardekani a
PMCID: PMC4314964  NIHMSID: NIHMS658294  PMID: 24121089

Abstract

We propose a fused lasso logistic regression to analyze callosal thickness profiles. The fused lasso regression imposes penalties on both the l1-norm of the model coefficients and their successive differences, and finds only a small number of non-zero coefficients which are locally constant. An iterative method of solving logistic regression with fused lasso regularization is proposed to make this a practical procedure. In this study we analyzed callosal thickness profiles sampled at 100 equal intervals between the rostrum and the splenium. The method was applied to corpora callosa of elderly normal controls (NCs) and patients with very mild or mild Alzheimer’s disease (AD) from the Open Access Series of Imaging Studies (OASIS) database. We found specific locations in the genu and splenium of AD patients that are proportionally thinner than those of NCs. Callosal thickness in these regions combined with the Mini Mental State Examination scores differentiated AD from NC with 84% accuracy.

Keywords: Alzheimer’s disease, brain, corpus callosum, fused lasso, logistic regression, MRI

1. Introduction

Alzheimer’s disease (AD) is an age-related, irreversible neurodegenerative disease that develops gradually and results in episodic memory loss and behavioral and personality changes. There is great interest in finding neuroimaging based markers to predict the progression of AD, which could be very helpful in managing care for individual patients, and in AD research to develop interventions for altering the course of disease. Gray matter, especially the medial temporal lobe (MTL) and hippocampus, is a major focus in these studies (see Risacher and Saykin (2013) for a comprehensive review), but it is hard to reliably measure the MTL and hippocampus automatically. On the other hand, a number of studies have reported structural changes in the corpus callosum (CC) in AD (Di Paola et al., 2010; Frederiksen et al., 2011; Teipel et al., 2002; Wang et al., 2006), and there are fully automated procedures to measure the CC with good accuracy (Ardekani et al., 2012a). In this paper, we attempt to identify the regions of the CC whose thickness could differentiate patients with early AD from normal controls (NCs) and use them in a fused lasso logistic regression classifier for early AD.

Two major approaches have been used to localize the differences in the CC between two populations: (1) defining a small number of geometric regions and (2) measuring the thickness of the CC at a large number (50–100) of evenly spaced points along the midline and doing regression or factor analysis to determine locations of significant difference. In (1) there are a number of different geometric parcellations in use (Hampel et al., 2002; Weis et al., 1991; Witelson, 1989) and no general agreement on how many divisions are meaningful. Method (2) does not require a priori parcellations and can give much finer detail, but suffers from multiple comparisons and does not exploit the correlations expected between neighboring thickness. Factor analysis has been used to divide the CC into anatomically “meaningful” subdivisions and a post-hoc ANOVA-type analysis has been applied to these subdivisions (Denenberg et al., 1991b; Hallam et al., 2008; Kilian et al., 2007). However, factor analysis can be subjective in determining the number of regions and it is not reliable in determining subdivisions.

We explore the second approach to corpus callosum analysis by measuring its thickness profile from rostrum to splenium. We introduce a new statistical approach to analyze the thickness profile to find regional differences in the CC between two populations. To this end, we extend a logistic regression model by incorporating fused lasso regularization to select contiguous regions rather than individual thicknesses. This approach does not require a priori divisions of the CC, but is able to identify regionally specific differences in CC thickness between two populations. We demonstrate the proposed method to determine regional specificity of the CC thickness differences between NCs and patients with very mild or mild AD. For this purpose, we utilize images from the Open Access Series of Imaging Studies (OASIS) MRI database (Marcus et al., 2007).

2. Method

2.1. MRI data

The OASIS cross-sectional data set (Marcus et al., 2007) contains 3D MPRAGE MRI brain scans from 416 right-handed subjects. Of these, we include in this study the 98 healthy normal subjects aged 60 or above without dementia (CDR=0) and all 98 subjects aged 60 or above with very mild/mild AD (CDR=0.5 or 1). See Marcus et al. (2007) for details of subject demographics, inclusion/exclusion criteria, MRI acquisition protocol, and preprocessing steps. Table 1 summarizes the demographics of the subjects included in this paper.

Table 1.

Sample characteristics of the data set as given by Marcus et al. (2007). Values are mean ± SD (minimum - maximum). The range of MMSE score is between 0 (worst) and 30 (best).

CDR=0 CDR=0.5 CDR=1

Number 98 70 28
Sex (female/male) 72/26 39/31 19/9
Age (years) 75.9 ± 9.0 (60–94) 76.4 ± 7.0 (63–92) 82 ± 5.7 (78–86)
MMSE score 29.0 ± 1.2 (25–30) 25.6 ± 3.5 (14–30) 21.7 ± 3.8 (15–29)

2.2. Corpus callosum segmentation

A fully automated method (Ardekani et al., 1997) is applied to find the mid-sagittal plane (MSP) of the MRI volumes to bring the head yaw and roll angles as close as possible to zero. In addition, the anterior commissure (AC) and posterior commissure (PC) were located on the MSP using a fully automated model-based method (Ardekani and Bachman, 2009). Using this information, the original MRI volume was re-sliced to obtain a single image of matrix size: 512 × 512 and pixel size: 0.5 × 0.5 mm2 representing the true AC-PC aligned MSP. Using a priori information obtained from manually traced corpora callosa on this type of image, a rectangular CC search region was identified on the MSP. Finally, a multi-atlas model-based segmentation method (Aljabar et al., 2009; Ardekani et al., 2012a,b; Cabezas et al., 2011) using the Automatic Registration Toolbox (ART) non-linear registration algorithm (Ardekani et al., 2005; Klein et al., 2009) was used to locate the CC within the search region as shown in Figure 1(a). If necessary, small corrections to the detected CC were made manually using the ITK-SNAP software (Yushkevich et al., 2006) by an operator blinded to subject status. The final segmentation was represented as a binary image where pixels were assigned a value of 1 if they belonged to the CC and a value of 0 otherwise. The CC segmentation algorithm described above has been implemented in C++ and using message passing interface (MPI) parallel programming library. The software, yuki, is available publicly at www.nitrc.org/projects/art. The average processing time for a CC segmentation running yuki in parallel on 7 of the 8 existing Cores in a 2.4 GHz Dual Quad-Core Linux computer was less than 7 seconds. Running the program on a single Core in serial mode required approximately 28 seconds.

Figure 1.

Figure 1

(a) The outline shows the detected corpus callosum. The automatically detected anterior commissure (AC) and posterior commissure (PC) are shown by the plus signs. (b) An example CC thickness profile. Every tenth medial axis position mi* is indicated by a diamond. Numbering is 0 at the rostrum tip (r) and 100 at the inferior point of the splenium (s).

It must be emphasized that in the multi-atlas based approach that we have used, the non-linear registrations are applied to the CC atlases only, not to the test image being measured. The test image is treated as a fixed target to which floating atlases are matched. Therefore, the CC of the test image does not undergo any shape or size alternation due to the non-linear registration processes involved in the algorithm. The only alignment that is applied to the test image is a non-shape/size altering rigid-body transformation based on the detected MSP and AC and PC landmarks that brings the test image to a standardized orientation, as described above, before multi-atlas based segmentation is applied.

2.3. Corpus callosum thickness profile

The CC thickness profile is specified in terms of 99 non-zero thickness values at equally spaced intervals along the length of the CC. Our method for finding these values is similar to those of Clarke et al. (1989) and Denenberg et al. (1991a). More specifically, the thickness values are lengths of line segments that connect pairs of points on the upper and lower boundaries of the CC. The line segments are perpendicular to the medial axis of the CC and intersect the medial axis at equal intervals. The steps for finding the thickness values are as follows.

  • (a)

    The tip of the rostrum r (point 0 in Figure 1(b)) and the inferior point of the splenium s (point 100 in Figure 1(b)) are located automatically in the segmented binary CC image. For this purpose, we use information based on the known CC orientation with respect the AC and PC landmarks.

  • (b)

    Using the landmarks (rostrum tip and splenium bottom) found in step (a), the CC boundary is divided into two parts, an upper curve and a lower curve, 101 equally spaced points ui are placed on the upper boundary and 101 equally spaced points li are located on the lower boundary, where i = 0, 1,…, 100, u0 = l0 = r, and u100 = l100 = s.

  • (c)

    The mid-points mi = (ui + li)/2 between the corresponding upper and lower CC boundary pairs ui and li are found to define 101 points on a medial axis of the CC.

  • (d)

    A medial axis is reconstructed by linear interpolation between the mi and smoothed. Then, a new set of 101 equally spaced points mi* are placed on the smoothed medial axis, such that m0*=r and m100*=s.

  • (e)

    The normal lines to the medial axis at points mi* (i = 1, 2,…, 99) are determined and their intersection points with the upper and lower boundaries of the CC are detected and denoted by ui* and li*, respectively.

  • (f)

    Finally, the CC thickness profile is defined as x = {xi} (i = 1, 2,…, 99) where xi are the distances between points ui* and li*, that is, xi=ui*li*.

The above algorithm makes two assumptions that may not be necessarily true. The first is that, as shown in Figure 1(b), the perpendicular line segments to the medial axis do not cross each other within the CC. The second assumption is that a perpendicular to the medial axis intersects each of the upper and lower boundaries exactly once. In occasional cases, these assumptions are violated especially in the genu-rostral area. To deal with this situation, the following two steps are added to the above algorithm.

  • (g)

    If a line segment perpendicular to the medial axis at a point mi* crosses another line segment within the CC or if a line segment does not cross the upper and lower boundaries exactly once, delete the corresponding points ui* and li*.

  • (h)

    Replace the missing ui* and li* points by placing equi-distance points between the closest existing neighboring points. For example, if ui* were deleted in step (g), replace it with a point that bisects the arc on the upper boundary that joints ui1* and ui+1*. Extension to cases where more than two adjacent boundary points are missing is trivial. Once the boundary points are corrected, compute the thicknesses xi=ui*li* as before.

2.4. Logistics regression with fused lasso regularization

The logistic regression models the probability of an event of interest, denoted by pi, using a univariate score of many predictive variables as pi = exp(si)/{1 + exp(si)} where si is a weighted linear sum of predictive variables. The relationship between si and pi is the well-known s-shaped curve. Often si is used as a classifier in the sense that we classify the i-th subject into the positive class (the class of an event of interest) if si ≥ 0, which is equivalent to pi ≥ 1/2. Otherwise, the subject is classified into the normal group. The 1 regularization (lasso: least absolute shrinkage and selection operator) was introduced by Tibshirani (1996) to simultaneously select and estimate the model. The lasso method finds only a few non-zero coefficients (i.e., weights of the predictive variables that form si) and becomes a useful tool when the number of predictors is large but only a few of them are actually effective.

The classical lasso regression by Tibshirani (1996), which penalizes the 1 norm of the coefficients vector (the sum of absolute values of coefficients) of ordinary least squares, has many variants to incorporate additional structural information on covariates. For example, the elastic net by Zou and Hastie (2005) is designed to simultaneously select strongly correlated variables in the model. The group lasso by Yuan and Lin (2007) again jointly selects or de-selects a group of variables to explain a single categorical variable. The fused lasso regression by Tibshirani et al. (2005) assumes that the predictors are observed in an order and penalizes a convex combination of the 1 norm of differences of adjacent coefficients and the 1 norm of the coefficient vector itself. It is also known that, like the elastic net, the fused lasso regression selects correlated variables together but it further makes the coefficients to be locally equal to each other.

The callosal thicknesses are spatially correlated in the sense that the thickness at one point is correlated to thicknesses of its neighboring point, and we are interested in finding regions rather than individual thickness points that would differentiate between groups. For this reason, the fused lasso logistic regression (FLLR) is a suitable choice for the analysis of callosal thickness profiles, and results in locally homogeneous and spatially contiguous regions that would indicate the existence of the event of interest (e.g., a subject belonging to the AD group).

Suppose that we have n samples {(yi, di, xi), i = 1, 2,…, n} (n = 196 in the current study), where di = (di1, di2,…,dis) is an s-dimensional covariate vector such as demographic information, xi = (xi1, xi2,…,xit) is a t-dimensional set of callosal thicknesses (t = 99 in the current study), and yi is a binary variable indicating status of the i-th subject. In this paper, yi = 1 if the i-th subject is in the AD group, otherwise 0. In order to find regional differences of the CC thickness between two populations controlling for covariates, we consider the logistic regression model:

g(pi)=logpi1pi=diγ+xiβ,

where pi = Pr(yi = 1), γ = (γ1,…, γs)T and β = (β1,…, βt)T. The likelihood function of (γ, β) is given by

L(γ,β;y,d,x)=i=1npiyi(1pi)1yi=i=1nexp{yilogpi1pi+log(1pi)}=i=1n{yi(diγ+xiβ)log(1+ediγ+xiβ)},

where y = (y1,…,yn)T. Subsequently, the log-likelihood function of (γ, β) becomes

(β,γ;y,d,x)=logL(γ,β;y,d,x)=i=1n{yi(diγ+xiβ)log(1+ediγ+xiβ)}. (1)

Sparse regression using 1-regularization is one of the most popular tools for high-dimensional data. The FLLR applied in this paper proposes minimizing

i=1n{yi(diγ+xiβ)+log(1+exp(diγ+xiβ))}+Ω(β;λ1,λ2), (2)

where

Ω(β;λ1,λ2)=λ1j=1t|βj|+λ2j=2t|βjβj1|.

Note that penalties are only on β.

Lee et al. (2006) show that the problem of maximizing the log-likelihood function (1) can be reformulated as a weighted ordinary least squares problem by taking the 2nd order Taylor expansion. Following the steps described in Lee et al. (2006), the objective function (2) can be quadratically approximated. If the current estimate of γ and β are denoted γ̂(r) and β̂(r), respectively, the objective function is given by:

Q(γ,β|γ^(r),β^(r))=i=1nwi(zidiγxiβ)2+Ω(β;λ1,λ2), (3)

by letting

wi=p^i(1p^i),zi=diγ^(r)+xiβ^(r)+yip^ip^i(1p^i):adjusted responsep^i=g1(diγ^(r)+xiβ^(r)).

Algorithms to solve the original fused lasso problem using a squared loss have been widely discussed in the literature. However, algorithms for logistic loss have rarely been discussed. In this paper, we consider a modified coordinate descent (CD) algorithm for the FLLR. The proposed algorithm can be considered as a logistic modification of the pathwise coordinate descent algorithm of Friedman et al. (2007). However, our procedure does not utilize the monotonicity (in fusion penalty) of the solution, which implies that, given λ1, if coefficients of two variables are fused at λ2 = c, then they are also fused for all λ2 > c. As they have proved, monotonicity is true only for a few special cases of the covariate matrix.

Our CD algorithm also has “descent” and “fusion” steps. The algorithm iterates these two steps. The descent step is applied for βk with β̂j, jk, for each k. When the descent step fails to improve the objective function, the fusion step is applied, that is, we set βk = βk−1 = η (or βk = βk+1 = η) and do the descent step with respect to η. In other words, we move two parameters βk and βk−1 (or βk+1) together. The detailed steps are as follows:

  1. Let γ̂j = 0 for j = 1, 2,…, s, β̂k = 0, for k = 1, 2,…, t, and, for i = 1, 2,…, n, i = 1 /{ 1+exp (−diγ̂− xi β̂)}, wi = i (1−i), zi = diγ̂+xi β̂+ (yii)/{i (1− i)}, zi*=wizi,di*=widi and xi*=wixi.

  2. (update of γ) We consider the derivative of (3) for γj, for j = 1, 2,…, s, given β = β̂, which is
    Q(γ,β^|γ^,β^)γj=i=1n{2(dij*)2γjdij*{zi*ljdil*γlk=1txik*β^k}},forj=1,2,,s. (4)

    We update γ̂ by solving the above set of normal equations by setting (4) equal to 0.

  3. (descent step) In this step, we consider the derivative of (3) for βk, given βl = β̂l, lk, which is
    Q(γ^,β|γ^,β^)βk=i=1n{2(xik*)2βkxik*(zi*j=1sdij*γ^jlkxil*β^l)}+λ1sgn(βk)λ2sgn(β^k+1βk)+λ2sgn(βkβ^k1), (5)
    where sgn(x) is the subgradient of |x|. The function (5) is piecewise linear with breaks at {0, β̂k−1, β̂k+1} and let β̃k be the solution to ∂Qk) / ∂βk = 0. If Q(β̃k) ≤ Q(β̂k), then let β̂k = β̃k and do the descent step to update βk+1. If not, we do the fusion step below to update βk and βk−1, simultaneously.
  4. (fusion step) In this step, we move βk and βk−1 together. We set η = βk = βk−1. The derivative of (3) with respect to η for given βj = β̂j, jk, (k − 1) is
    i=1n{2(xi(k1)*+xik*)2η(xi(k1)*+xik*)(zi*j=1sdij*γ^jlk,(k1)xil*β^l)}+2λ1sgn(η)λ2sgn(β^k+1η)+λ2sgn(ηβ^k2). (6)

    The same procedure in the descent step finds the solution, say η̃, to (6) = 0. Update βk−1 and βk with η̃, and go to the descent step for βk+1.

  5. We repeat the descent and fusion steps until convergence.

  6. For i = 1,…, n, update i, wi, zi*, di*, and xi*.

  7. Repeat steps 2–6 until convergence.

Generally, difficulties in the fused lasso regularized model arise in the computation speed and finding the optimal tuning parameters. When the number of features and samples are very large, say, over 100,000 features with 200 samples, the algorithm needs to be improved to solve the problem efficiently. The choice of tuning parameters is critical since tuning parameters determine the number of non-zero coefficients. For example, if tuning parameters are small enough, then many coefficients will be zero. Several criteria such as the AIC and the BIC are suggested for finding the optimal tuning parameters. In this study, since we are interested in building a predictive model of logistic regression, tuning parameters (λ1, λ2) are chosen to minimize the predictive (Pearson) residual sum of squares, that is, for a new observation (y, d, x),

PR(λ)=E{(yp^λ))2p^λ(1p^λ)}, (7)

where λ = 1 /{1 + exp (− dγ̂λxβ̂λ)}, and γ̂λ and β̂λ are the estimates of γ and β with tuning parameters λ = (λ1, λ2).

We estimate the above predictive residual using a q-fold cross-validation (CV) method in order to choose the tuning parameters in an adaptive way. To be specific, let {Sk : k = 1,…, q} be a partition of all samples {(y1, d1, x1), (y2, d2, x2),…, (yn, dn, xn)}, whose size of Sk is approximately equal to [n/q]. For each k = 1,…, q, let γ^λ(k) and β^λ(k) be estimates from a training set {Sj : jk} with λ = (λ1, λ2). Then, the predictive residual is estimated by

PR^(λ)=k=1qiSk(yip^λ,i(k))2p^λ,i(k)(1p^λ,i(k)),

where p^λ,i(k)=1/{1+exp(diγ^λ(k)xiβ^λ(k))}. We perform a grid search to find the minimum point of PR^(λ).

3. Results

3.1. Preliminary analysis

It is known that both intra-cranial volume (ICV) and CC cross-sectional areas decrease with age (Holloway et al., 1993). Larger ICV also correlates with larger CC (Bruner et al., 2012). Also, there is a potential of sexual dimorphism in the CC (Ardekani et al., 2012b). Therefore, it is necessary to remove the effects of age, ICV and sex, which are considered as potential confounding effects, in examining the callosal differences for AD. In OASIS, the ICV is estimated by the eTIV (estimated total intra-cranial volume) variable and distributed along with the imaging data. Figure 2 shows the effects of age, ICV (represented by eTIV1/3) and sex on the CC thickness. Figure 2(a) shows the mean curves of raw CC thickness for each group while Figure 2(b) shows the callosal differences of NC and AD from the mean after removing the effects of those variables. To obtain the thickness profiles of Figure 2(b), we take the residuals from the regression model of callosal thickness against age, ICV and sex at each location, then take the average of the residuals for each group at each location. Figure 2(b) shows a general thinning of the CC in AD as well as greater than average thinning in the areas of the genu and splenium.

Figure 2.

Figure 2

(a) Mean of raw callosal thickness profiles in NC (solid line) and AD (dotted line). (b) Mean adjusted callosal thickness profiles for each group after removing the effects of age, ICV and sex.

It is natural to consider the ANCOVA model with covariates of age, sex and ICV to test the callosal difference between NC and AD at each location, i.e., for each j = 1, …, 99,

xij=α0+α1AGEi+α2SEXi+α3eTIVi1/3+α4ADi+εi

where SEX and AD are the indicator functions for female and AD, respectively, and εi follows the standard normal distribution. Figure 3(a) presents the p-values from the ANCOVA indicating that most locations show significance at the level of 0.05 without any correction for multiple testing, which is not very informative in showing regional specific differences for predictive differentiation between AD and NC.

Figure 3.

Figure 3

(a) The p-values obtained from ANCOVA at each location. The p-values are below 0.05 at most locations on CC. (b) Estimated coefficients from the proposed FLLR method. Non-zero coefficients are found at the locations of 1–9, 18–23, 34–36, 88–90.

3.2. FLLR analysis

Next we applied our proposed model to callosal thickness profiles after removing the potential confounding effects of age, sex and ICV. OASIS also provides a measure for cognitive impairment, the Mini-Mental State Examination (MMSE) score. MMSE is a simple questionnaire test that is widely used by clinicians to screen for dementia. The range of MMSE score is from 0 (worst) to 30 (best). To utilize all the information available, we include MMSE in our model, which might increase the prediction power of the model. If other biomarkers such as CSF beta-amyloid and tau protein levels are available, they can also be incorporated in the model. Thus, we generate a classifier by combining our callosal thicknesses with subject’s MMSE scores as follows:

logpi1pi=γ0+γ1MMSE+j=199βjx˜ij, (8)

with the fused lasso penalty of λ1j=199|βj|+λ2j=299|βjβj1| where ijs are residuals from the regression model, i.e., x˜ij=xij(α^0+α^1AGEi+α^2eTIVi1/3+α^3SEXi), and α̂0, α̂1, α̂2, α̂3 are estimates by the ordinary least squares method. The tuning parameters (λ1, λ2) are chosen using 5-fold CV as described in Section 2.4 in the range of (λ1, λ2) ∈ [1.0, 1.0] × [5.0, 5.0] by interval 0.1. The computation time for the grid search was approximately two hours on a 2.9 GHz Quad-core Linux computer. In practice, one can start with arbitrarily large λ1 to preserve most coefficients, followed by finding the optimal λ2. Then, decrease λ1 slightly and repeat the search for λ2 in reasonable range around the previous optimal value of λ2, and repeat the procedure until the minimum is reached.

Figure 3(b) shows the estimated coefficients β̂is of the proposed model. We found sparse but contiguous regions that indicate possible callosal differences where β̂i ≠ 0. Regions [1–9], [18–23], [36–40] and [88–90] have negative coefficients suggesting that these regions are narrower in AD than in NC, and differentiate patients with early AD from NC. Table 2 reports the classification error rate based on our proposed method using 5-fold CV method. The accuracy of our classifier was 0.84 (165/196) with 78% sensitivity and 91% specificity.

Table 2.

Classification error rate by 5-fold cross-validation: accuracy=0.84(=165/196) with 78% sensitivity and 91% specificity. Out of 22 false negatives, 20 subjects with CDR=0.5 and 2 subjects with CDR=1 were misclassified as normal.

True NC AD

Est.
NC 89 22
AD 9 76

Total 98 98

4. Discussion and conclusion

In this paper, we have proposed using FLLR (fused lasso logistic regression) to find spatially correlated regions of the CC where thickness may be predictive of AD. Analysis of callosal thickness profiles provide a more detailed view of the differences between AD and NC groups than can be provided by using CC subdivisions. The proposed approach has several advantages in comparison with subdivision procedures. Subdivision procedures are usually done in two steps: subdivisions are determined either geometrically (e.g., splenium as the posterior 1/5 of the CC) or by factor analysis; and equality of the subdivision areas between the two populations is tested. In this approach, CC subdivisions are chosen subjectively or by noisy factor loadings which can be unreliable. The proposed FLLR method reformulates the problem as a feature selection problem in the framework of a prediction model, and finds differential subregions adaptively. For this reason, the FLLR method finds differential subregions between two populations more objectively and reliably. Our method is more specific than analysis based on subdivisions of the CC since we consider entire callosal thickness profiles and take neighborhoods into account in our model. In addition, it simultaneously provides a predictive model which may be valuable for diagnosis.

Finally, we have applied the proposed FLLR to find subregions of the CC where thicknesses differentiate early AD from NC. Our proposed method found that sections of the genu and splenium of AD are proportionally thinner than those of NC with 84% prediction accuracy. Nine normal subjects were misclassified as patients. In clinical practice, such subjects could be candidates for further evaluation. Twenty-two patients with AD were misclassified as normal. Of the 22 false negatives, 20 subjects with CDR=0.5 and 2 subjects with CDR=1 were misclassified as normal. A possibility of misdiagnosis should be considered further in these cases since thinning of the CC appears in the normal range incongruent with there clinical symptoms. A careful study with more extensive follow up is required to determine if these misclassified subjects of the CDR=0.5 group have clinical significance.

The regions identified by this method include parts of the genu that are connected to the frontal lobes, and a region in the splenium that seems to contain fibers that run to the medial temporal lobes (Hofer and Frahm, 2006). Both are regions that are implicated in AD. The region in the splenium seems to have a lesser weight, possibly due to its being a narrow tract whose position suffers from variability.

The classifier we have described is based on two commonly used diagnostic procedures, the MMSE and a structural MRI. Our automatic analysis programs require little supervision or intervention. The MRI protocol involved does not require long or complex acquisitions. The CC can be segmented in a fast, automatic, and reliable way. The FLLR model can be easily extended to include other brain or chemical biomarkers of AD.

Acknowledgments

J.Lim’s research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No.2011-0029104). We appreciate the data made available by the OASIS project. OASIS was supported under grants P50 AG05681, P01 AG03991, R01 AG021910, P20 MH071616, and U24 RR021382.

References

  1. Aljabar P, Heckemann RA, Hammers A, Hajnal JV, Rueckert D. Multi-atlas based segmentation of brain images: Atlas selection and its effect on accuracy. Neuroimage. 2009;46:726–738. doi: 10.1016/j.neuroimage.2009.02.018. [DOI] [PubMed] [Google Scholar]
  2. Ardekani BA, Kershaw J, Braun M, Kanno I. Automatic detection of the mid-sagittal plane in 3-D brain images. IEEE Trans. Med. Imaging. 1997;16:947–952. doi: 10.1109/42.650892. [DOI] [PubMed] [Google Scholar]
  3. Ardekani BA, Guckemus S, Bachman AH, Hoptman MJ, Wojtaszek M, Nierenberg J. Quantitative comparison of algorithms for inter-subject registration of 3D volumetric brain MRI scans. J. Neurosci. Methods. 2005;142:67–76. doi: 10.1016/j.jneumeth.2004.07.014. [DOI] [PubMed] [Google Scholar]
  4. Ardekani BA, Bachman AH. Model-based automatic detection of the anterior and posterior commissures on MRI scans. Neuroimage. 2009;46(3):677–682. doi: 10.1016/j.neuroimage.2009.02.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ardekani BA, Toshikazu I, Bachman AH, Szeszko PR. Multi-atlas corpus callosum segmentation with adaptive atlas selection. Proceedings of the ISMRM; Melbourne, Australia. 2012a. [Google Scholar]
  6. Ardekani BA, Figarsky K, Sidtis JJ. Sexual Dimorphism in the Human Corpus Callosum: An MRI Study Using the OASIS Brain Database. Crib. Cortex. 2012b doi: 10.1093/cercor/bhs253. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bruner E, de la Cuetara JM, Colom R, Martin-Loeches M. Gender-based differences in the shape of the human corpus callosum are associated with allometric variations. J. Anat. 2012;220:417–421. doi: 10.1111/j.1469-7580.2012.01476.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cabezas M, Oliver A, Lladó X, Freixenet J, Cuadra MB. A review of atlas-based segmentation for magnetic resonance brain images. Comput. Methods Programs Biomed. 2011;104:e158–e177. doi: 10.1016/j.cmpb.2011.07.015. [DOI] [PubMed] [Google Scholar]
  9. Clarke M, Kraftsik R, van Derk Loos H, Innocenti GM. Forms and Measures of Adult and Developing Human Corpus Callosum: Is There Sexual Dimorphism? J. Comp. Neurol. 1989;280:213–230. doi: 10.1002/cne.902800205. [DOI] [PubMed] [Google Scholar]
  10. Denenberg VH, Cowell PE, Fitch RH, Kertesz A, Kenner GH. Corpus Callosum: Multiple Parameter Measurements in Rodents and Humans. Phys. & Behav. 1991a;49:433–437. doi: 10.1016/0031-9384(91)90261-l. [DOI] [PubMed] [Google Scholar]
  11. Denenberg VH, Kertesz A, Cowell PE. A factor analysis of the human’s corpus callosum. Brain Res. 1991b;548:126–132. doi: 10.1016/0006-8993(91)91113-f. [DOI] [PubMed] [Google Scholar]
  12. Di Paola M, Luders E, Di Iulio F, Cherubini A, Passafiume D, Thompson PM, Caltagirone C, Toga AW, Spalletta G. Callosal atrophy in mild cognitive impairment and Alzheimer’s disease: different effects in different stages. Neuroimage. 2010;49(1):141–149. doi: 10.1016/j.neuroimage.2009.07.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Frederiksen KS, Garde E, Skimminge A, Ryberg C, Rostrup E, Baaré WFC, Siebner HR, Hejl A-M, Leffers A-M, Waldemar G. Corpus callosum atrophy in patients with mild Alzheimer’s disease. Neurodegener. Dis. 2011;8:476–482. doi: 10.1159/000327753. [DOI] [PubMed] [Google Scholar]
  14. Friedman J, Hastie T, Hofling H, Tibshirani R. Pathwise coordinate optimization. Ann. Appl. Stat. 2007;1(2):302–332. [Google Scholar]
  15. Hallam BJ, Brown WS, Ross C, Buckwalter JG, Bigler ED, Tschanz JT, Norton MC, Welsh-Bohmer KA, Breitner JCS. Regional atrophy of the corpus callosum in dementia. J. Int. Neuropsychol. Soc. 2008;14:414–423. doi: 10.1017/S1355617708080533. [DOI] [PubMed] [Google Scholar]
  16. Hampel H, Teipel SJ, Alexander GE, Pogarell O, Rapoport SI, Möller H-I. In vivo imaging of regions and cell type specific neocortical neurodegeneration in Alzheimer’s disease. J. Neural. Transm. 2002;109:837–855. doi: 10.1007/s007020200069. [DOI] [PubMed] [Google Scholar]
  17. Hofer S, Frahm J. Topography of the human corpus callosum revisited - Comprehensive fiber tractography using diffusion twnsor magnetic resonance imaging. Neuroimage. 2006;32:989–994. doi: 10.1016/j.neuroimage.2006.05.044. [DOI] [PubMed] [Google Scholar]
  18. Holloway RL, Anderson PJ, Defendini R, Harper C. Sexual dimorphism of the human corpus callosum from three independent samples: relative size of the corpus callosum. Am. J. Phys. Anthropol. 1993;92(4):481–498. doi: 10.1002/ajpa.1330920407. [DOI] [PubMed] [Google Scholar]
  19. Jain R, Kasturi R, Schunck BG. Machine vision. New York: McGraw-Hill; 1995. [Google Scholar]
  20. Kilian S, Brown WS, Hallam BJ, McMahon W, Lu J, Johnson M, Bigler ED, Lainhart J. Regional Callosal Morphology in Autism and Macrocephaly. Dev. Neuropsychol. 2007;33(1):74–99. doi: 10.1080/87565640701729821. [DOI] [PubMed] [Google Scholar]
  21. Klein A, Andersson J, Ardekani BA, Ashburner J, Avants B, Chiang M-C, Christensen GE, Collins DL, Gee J, Hellier P, Song JH, Jenkinson M, Lepage C, Rueckert D, Thompson P, Vercauteren T, Woods RP, Mann JJ, Parsey RV. Evaluation of 14 nonlinear deformation algorithms applied to human brain MRI registration. Neuroimage. 2009;46(3):786–802. doi: 10.1016/j.neuroimage.2008.12.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lee S, Lee H, Abbeel P, Ng A. Efficient L1 regularized logistic regression; Proceedings of the 21st National Conference on Artificial Intelligence (AAAI-06), 2006; 2006. [Google Scholar]
  23. Marcus DS, Wang TH, Parker J, Csernansky JG, Morris JC, Buckner RL. Open Access Series of Imaging Studies (OASIS): cross-sectional MRI data in young, middle aged, nondemented, and demented older adults. J. Cogn. Neurosci. 2007;19:1498–1507. doi: 10.1162/jocn.2007.19.9.1498. [DOI] [PubMed] [Google Scholar]
  24. Risacher SL, Saykin AJ. Neuroimaging and Other Biomarkers for Alzheimer’s Disease: The Changing Landscape of Early Detection. Annu. Rev. Clin. Psychol. 2013;9:621–648. doi: 10.1146/annurev-clinpsy-050212-185535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Teipel SJ, Bayer W, Alexander GE, Zebuhr Y, Teichberg D, Kulic L, Schapiro MB, Möller HJ, Rapoport SI, Hampel H. Progression of corpus callosum atrophy in Alzheimer disease. Arch. Neurol. 2002;59(2):243–248. doi: 10.1001/archneur.59.2.243. [DOI] [PubMed] [Google Scholar]
  26. Tibshirani R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Series B Stat. Methodol. 1996;58(1):267–288. [Google Scholar]
  27. Tibshirani R, Saunders M, Rosset S, Zhu J, Knight K. Sparsity and smoothness via the fused lasso. J. R. Stat. Soc. Series B Stat. Methodol. 2005;67(1):267–288. [Google Scholar]
  28. Wang PJ, Saykin AJ, Flashman LA, Wishart HA, Rabin LA, Santulli RB, McHugh TL, MacDonald JW, Mamourian AC. Regionally specific atrophy of the corpus callosum in AD, MCI and cognitive complaints. Neurobiol. Aging. 2006;27(11):1613–1617. doi: 10.1016/j.neurobiolaging.2005.09.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Weis S, Jellinger K, Wenger E. Morphometry of the corpus callosum in normal aging and Alzheimer’s disease. J. Neural. Transm. Suppl. 1991;33:35–38. doi: 10.1007/978-3-7091-9135-4_6. [DOI] [PubMed] [Google Scholar]
  30. Witelson SF. Hand and sex differences in the isthmus and genu of the human corpus callosum. A post mortem morphological study. Brain. 1989;112(3):799–835. doi: 10.1093/brain/112.3.799. [DOI] [PubMed] [Google Scholar]
  31. Yuan M, Lin Y. Model selection and estimation in regression with grouped variables. J. R. Stat. Soc. Series B Stat. Methodol. 2007;68(1):49–67. [Google Scholar]
  32. Yushkevich PA, Piven J, Hazlett HC, Smith RG, Ho S, Gee JC, Gerig G. User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage. 2006;31(3):1116–1128. doi: 10.1016/j.neuroimage.2006.01.015. [DOI] [PubMed] [Google Scholar]
  33. Zou H, Hastie T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Series B Stat. Methodol. 2005;67(2):301–320. [Google Scholar]

RESOURCES