Skip to main content
NeuroImage : Clinical logoLink to NeuroImage : Clinical
. 2015 Jan 15;7:367–376. doi: 10.1016/j.nicl.2015.01.008

Content-based image retrieval for brain MRI: An image-searching engine and population-based analysis to utilize past clinical data for future diagnosis

Andreia V Faria a,, Kenichi Oishi a, Shoko Yoshida a, Argye Hillis b,c,d, Michael I Miller e, Susumu Mori a,f
PMCID: PMC4309952  PMID: 25685706

Abstract

Radiological diagnosis is based on subjective judgment by radiologists. The reasoning behind this process is difficult to document and share, which is a major obstacle in adopting evidence-based medicine in radiology. We report our attempt to use a comprehensive brain parcellation tool to systematically capture image features and use them to record, search, and evaluate anatomical phenotypes. Anatomical images (T1-weighted MRI) were converted to a standardized index by using a high-dimensional image transformation method followed by atlas-based parcellation of the entire brain. We investigated how the indexed anatomical data captured the anatomical features of healthy controls and a population with Primary Progressive Aphasia (PPA). PPA was chosen because patients have apparent atrophy at different degrees and locations, thus the automated quantitative results can be compared with trained clinicians' qualitative evaluations. We explored and tested the power of individual classifications and of performing a search for images with similar anatomical features in a database using partial least squares-discriminant analysis (PLS-DA) and principal component analysis (PCA). The agreement between the automated z-score and the averaged visual scores for atrophy (r = 0.8) was virtually the same as the inter-evaluator agreement. The PCA plot distribution correlated with the anatomical phenotypes and the PLS-DA resulted in a model with an accuracy of 88% for distinguishing PPA variants. The quantitative indices captured the main anatomical features. The indexing of image data has a potential to be an effective, comprehensive, and easily translatable tool for clinical practice, providing new opportunities to mine clinical databases for medical decision support.

Keywords: Automated parcellation, Brain, MRI, Content-based image retrieval, Atlas-based analysis

Highlights

  • Brain parcellation tools define structures automatically and convert images into standardized and quantitative matrices.

  • We tested if an automated tool and the resultant vector of structural volumes can accurately capture anatomical phenotypes.

  • The agreement between visual and automated atrophy detection was virtually the same as the inter-evaluator agreement.

  • The quantitative indices captured the main anatomical features in brains with atrophy in different degrees and location.

  • The image quantification has potential to be an effective, comprehensive, and easily translatable tool for clinical practice.

1. Introduction

Currently, radiological diagnosis is based on subjective judgment by radiologists, in which image-based findings and text-based clinical information are integrated to reach a decision. This ability is acquired by education and personal experience, and the exact reasoning leading to the final conclusion is difficult to document and share. This is a major obstacle to adopting evidence-based medicine in radiology, which calls for a systematic integration of past evidence for medical decision making (Hsu, 2012). Presently, radiological data are stored in PACS (Picture Archive and Communication System). Once a diagnosis is completed, this rich data source is rarely used to enrich future daily diagnostic capability. The heart of the problem is that the data in PACS are not fully structurized, which makes information retrieval impractical.

The PACS contains two types of information: text and images. The text is usually composed of a few structured and searchable fields, such as name, gender, and age, as well as free-text fields with image descriptions and diagnosis. There have been many ongoing attempts to structurize the PACS free-text fields (Radiological Society of North America, 2014a,b; Hussein, 2004). If successful, physicians would be able to search, for example, for “past cases with a diagnosis of Alzheimer's disease (AD),” and, in return, say, they would retrieve 100 images of AD patients. This is very useful for research purposes. However, the retrieved 100 images may not be directly useful in supporting routine clinical decisions.

To unlock the rich information stored in PACS, we also need to structurize the image data to support query, search, and relational analysis of anatomical information in the images. For example, physicians would submit an image of a new patient and search 100 images with a similar anatomical phenotype. The clinical reports related to these images would be retrieved and a statistical report of the diagnosis and prognosis could be generated. The technology for direct image searching, called content-based image retrieval (CBIR), is similar to that involved in face recognition (for review, see Müller, 2004), and is highly anticipated advancement in medical imaging for CT and MRI (Greenspan and Pinhas, 2007; Rahman et al., 2007; Robinson, 1996; Orphanoudakis et al., 1996; El-Kwae et al., 2000; Sinha et al., 2001; Unay et al., 2010). While CBIR is certainly promising, to date, very few image retrieval systems have been used for education or for routine clinical practice. Some difficulties in applying these established CBIR systems to the human brain are the complexity of the structures and the importance of location information.

We report our initial attempt to extend CBIR to three-dimensional feature extraction and structuration of the brain anatomy. The most obvious approach would be to align all the images in a database into a common template space. With a resolution of 1 mm3, each image would be converted into a vector with about 1 million-voxel elements and vector-to-vector correlation analysis could be performed to find past similar cases. Alternatively, subtraction between a pair of images could be used to evaluate the similarity. These approaches, however, have several challenges. First, the number of observation (voxels) is too large and the information is too noisy. Second, each voxel does not carry anatomic notion by itself. Here, we assume that the conversion of the 1 million-voxel image representation needs to be contracted to a far smaller number of anatomical representation for subsequent information storage, searching, and correlation analyses. The recent advances in the structural parcellation technique of the entire brain (Shen and Davatzikos, 2002; Shi, 2010; Wang, 2013) now provides tools to structure raw MR images into a standardized and quantitative indices.

In this paper, we test the efficacy of the parcellation-based image structuration approach using T1-weighted images of a population with primary progressive aphasia (PPA) and age-matched controls. PPA is an ideal model because, while brain atrophy is a common feature, its extension and location vary. In addition, the atrophy is visible and the results can be compared with subjective human judgment. We tested whether the structured anatomical data actually captured the anatomical features that can be perceived by trained clinicians. Using principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA), we characterized the anatomical variability in PPA, tested our image search engine, and performed individual evaluations and integrative analyses of anatomical (image) and clinical (diagnosis) phenotypes.

2. Methods and materials

2.1. Participants

We enrolled 30 participants with PPA who were seen in one author's (AH's) outpatient clinic. Table 1 summarizes demographic and clinical information; the Appendix table contains the complete clinical information. These patients gave written informed consent and this study was approved by the local IRB. They were diagnosed with PPA on the basis of having a predominant and progressive deterioration in language in the absence of a major change in personality, behavior, or cognition (other than praxis) for at least 2 years. They were classified as Logopenic variant (Lv; n = 19, 4 male, age: 67.4 ± 6.4) or Semantic variant (Sv; 7 male, age: 65 ± 6.5), according to recent guidelines (Gorno-Tempini, 2011). Patients with nonfluent/agrammatic variant PPA were not included. Axial MPRAGE T1 − WIs (TR/TE = 8.4/3.9 ms) were acquired using a 3 T MRI scanner, with a 256 × 256 matrix. Sixteen participants were scanned with a field of view (FOV) of 230 × 230 mm and 120 slices of 1 mm thickness; fourteen were scanned with a FOV of 212 × 212 mm and 140 slices of 1.1 mm thickness. Healthy controls (n = 24) with same age, gender, and image protocol distribution were obtained from our normal database.

Table 1.

Demographic and clinical information of the PPA participants.

Participant PPA variant Age Gender Years from the first symptom, approximated Classifier group
1 Lv 67 F 3 Train
2 Lv 71 F 5.5 Train
3 Lv 70 M 1 Train
4 Lv 62 F 5 Train
5 Lv 58 M 2 Train
6 Lv 67 F 1 Train
7 Lv 72 F 5 Train
8 Lv 73 F 2 Train
9 Lv 64 F 2 Test
10 Lv 79 F 7 Test
11 Lv 71 F 3 Test
12 Lv 66 F 10 Test
13 Lv 65 F 5.2 Test
14 Lv 70 F 4 Test
15 Lv 73 F 1.25 Test
16 Lv 51 M 2 Test
17 Lv 63 F 4.7 Test
18 Lv 71 M 2.2 Test
19 Sv 62 M 2 Train
20 Sv 73 M 10 Train
21 Sv 56 F 7.5 Train
22 Sv 62 M 1 Train
23 Sv 68 M 6 Train
24 Sv 60 F 1.5 Train
25 Sv 71 M 4 Test
26 Sv 68 F 1 Test
27 Sv 58 F 1 Test
28 Sv 74 F 9 Test
29 Sv 70 M 2 Test
30 Sv 57 M 1 Test
Summary
Lv train 67.5 ± 5.2 2M, 6F 3.1 ± 1.8
Lv test 67.3 ± 7.4 2M, 8F 4.1 ± 2.7
Sv train 63.5 ± 6.0 4M, 2F 4.6 ± 3.7
Sv test 66.3 ± 7.1 3M, 3F 3 ± 3.1

2.2. Automated parcellation

To measure the volume of each anatomical region, we performed an atlas-based analysis (ABA), in which an atlas image (Mori, 2008) with a pre-defined anatomical parcellation was warped to each participant's brain, thereby automatically parcellating each brain (Fig. 1). The process was performed by DiffeoMap and RoiEditor (Laboratory of Brain Anatomical MRI and Center for Imaging Science at Johns Hopkins University, 2014) and was made possible due to the high accuracy of the mapping algorithm, the large deformation diffeomorphic metric mapping, LDDMM (Miller, 2005; Wang, 2007). The dual-contrast LDDMM (Ceritoglu, 2009) was based on T1-WIs and cerebrospinal fluid (CSF) maps (Djamanakova, 2013). The template was the JHU-MNI “Eve,” a single-subject atlas in the ICBM-152 space, extensively parcelled and labeled to 211 regions (Oishi, 2009). The 52 “coarse” peripheral regions (Oishi, 2008) were subsegmented in each native individual space to separate the cortex, the white matter beneath the cortex, and the cerebrospinal fluid (CSF), using tissue maps from SPM8, which resulted in the final 211 ROIs for each participant. This parcellation–segmentation combined approach circumvents the mis-registration problems in the cortical areas, although mis-registration may exist between the parcels (e.g., in the boundary between adjacent gyri or between peripheral and deep white matter parcels).

Fig. 1.

Fig. 1

Schematic representation of the normalization procedure. Each subject's brain is accurately normalized to the template, which was previously parcellated into regions that were anatomically predefined. Due to the reversible nature of the normalization algorithm, this parcellation map (PM) can be warped to each subject's brain, thus enabling automatic measurement of the volume (and contrast) of hundreds of areas in each individual. Thus, voxel-based analysis (VBA) and atlas-based analysis (ABA) can be performed on the same framework. The segmentation according to tissue type allows the segregation of CSF, white, and gray matter in the peripheral areas in the native space of each subject.

In order to compare different degrees of granularity, we combined lobar parcels and deep white matter areas, and created a “low granularity” parcellation that contains 14 regions. Both the high and low granularity parcellations were tested against the visual analysis by 3 experienced clinicians (two neuroradiologists — AVF and SY — and one neurologist — KO).

2.3. Data analysis

For the voxel-based analysis (VBA, Fig. 1) in the template space, we used the Jacobian determinant (i.e., the local expansion factor) of the LDDMM deformation fields as a quantitative metric of local volume changes (Chung, 2001; Riddle, 2004; Thompson, 2000). For the ABA, we used the volume of each parcel in the native space, normalized by the intracranial volume. For each PPA patient, regional z-scores were calculated using the average and standard deviations of the controls.

The principal component analysis (PCA) was used to explore the anatomical characterization. For image classification (Lv vs. Sv), and to exemplify how the image search engine works, we used PLS-DA. To guarantee classifier stability, we trained the model using participants scanned with one protocol (n = 14) and tested it with participants scanned with another protocol (n = 16). These groups had no other significant clinical or demographic differences other than the image protocol. We report the variables (i.e., the anatomic regions) with the highest weight to explain the population variance (in the PCA case) and characterize and segregate groups (in the PLS-DA case).

2.4. Correlation with visual assessments

To determine to what extent the visual analysis and the ABA agreed, we asked the neurologist / neuroradiologists to visit each parcel automatically defined in the native space of all the subjects and to classify the atrophy in degrees of severity (0 = no atrophy, 1 = mild, 2 = moderate, 3 = severe atrophy). The average scores of visual analysis in each region were correlated with the z-scores of volume obtained from the automated ABA, using a non-parametric Spearman test. In the same way, we calculated the inter-rater agreement. We also “dichotomized” the data (visual scores: 0 and 1 in class 1, and scores 2 and 3 in class 2; ABA z-scores: ≤2 were class 1 and >2 were class 2) and performed an accuracy analysis to measure the sensitivity and specificity of the ABA compared to the subjective visual impression.

3. Results

3.1. Population-based VBA and ABA comparison

Fig. 2 is a volumetric comparison of controls and PPA participants with VBA and ABA, by t-tests at a p-value < 0.05, corrected for multiple comparisons using the false discovery rate (FDR). Both methods detected atrophy in the white matter and the deep gray matter, as well as enlargement of the ventricles. However, there were several noticeable differences. First, the ABA examines the volumes structure-by-structure, while the VBA examines each voxel independently and, therefore, the report does not follow anatomical boundaries. Second, VBA results are largely silent in the cortical areas, probably due to mis-registration of voxels in the CSF, the cortex, and the peripheral white matter across patients. Isotropic filters do not ameliorate this fact. This is less of an issue for ABA because these three compartments are measured and averaged in the native space.

Fig. 2.

Fig. 2

Volumetric comparison between PPA patients and controls using ABA and VBA with different sizes of isotropic filters. Colored areas were significantly different between groups (t-test at p-value < 0.05 with correction for multiple comparison using FDR). The colors code the ratio of volume in patients/controls.

3.2. Individual-based analysis and comparison with visual examinations

Fig. 2 reports the results of a population-based analysis, in which data within a group are congregated and atrophic locations that are common among the patients are characterized. This analysis is, however, not compatible with clinical diagnosis, which requires information for each patient. A simple approach to report quantitative individual features is to use a z-score map of each patient (Fig. 3). The individual analysis based on VBA is, again, mostly silent in peripheral areas. Fig. 3 shows examples of individual ABA results for three PPA patients. ABA converts the native T1-WIs images into a 211- (high granularity) or 14- (low granularity) element z-score vector, which is visualized by a bar-code-type graph. The agreement of the subjective analysis (inter-evaluators) was higher using the low-granularity parcellation (r = 0.81) than with the high-granularity (r = 0.54). The agreement between the automated z-score and the averaged visual scores (r = 0.8) was virtually the same as the inter-evaluator agreement, showing that the automated method’s performance can be considered as good as that of one of the raters. The agreement between visual and quantitative analysis improved at moderate and severe degrees of atrophy (r = 0.68 for visual scores >1, against r = 0.22 for visual scores <1, at high granularity). The sensitivity, specificity, and predictive values of evaluators and quantitative analysis were comparable (Table 2).

Fig. 3.

Fig. 3

Representation of degrees of regional atrophy as z-scores measured by ABA in patients with different anatomical phenotypes. The intensity plots of the regional degree of atrophy are represented at the bottom of the respective image, in two different degrees of granularity (high: 211 areas; low: 14 areas). The bottom row is shown for comparison with the z-scores obtained from VBA in one of the subjects, brought to this subject space. sT, mT, iT: superior, middle, and inferior temporal gyri, mO: middle occipital gyrus, Ag: angular gyrus, sM: supra marginal gyrus, ins: insula, sF: superior frontal gyrus, lv-b, lv-f, lv-o, lv-t: body, frontal horn, occipital horn, and temporal horn of the lateral ventricles.

Table 2.

Inter-evaluator agreement in describing atrophy and agreement between the average evaluators' scores and automated qualitative z-scores.

Inter-evaluators Quantitative vs. evaluators
Sensitivity 0.74 0.87
Specificity 0.72 0.67
Positive predictive value 0.54 0.66
Negative predictive value 0.87 0.88

3.3. PCA-based analysis for population-based interpretation of the anatomy of individual patients

Fig. 4 shows the tridimensional PCA plot, where the principal components (PCs) were linear combinations of the volume of the parcels. The three PCs explained 47.5% of all anatomical variability in this cohort. The definition of the three groups (controls and two PPA variants) was based on clinical information, not on anatomical information. Although there is a natural segregation among the groups, it is not perfect. For example, a patient clinically diagnosed as Sv (indicated as “patient nearby controls”) is “anatomically” close to the controls on the PCA plot. This is understandable if we examine the MRI, which seems to demonstrate a normal appearance. Similarly, controls “anatomically” located near the PPA patients (such as “control nearby patients”) had enlarged ventricles and parenchyma atrophy. The images of six subjects numbered in Fig. 4 and shown in Fig. 5 were chosen based on their locations in the PCA space. Two neighboring cases in the PCA space (#1 and 2, #3 and 4, or #5 and 6) had visually similar anatomical features, while the cases distant from each other (#1, 3, and 5 or #2, 4, and 6) had markedly different anatomical features, indicating that the structured anatomical vectors captured the gross anatomical features.

Fig. 4.

Fig. 4

PCA plot of the volume vectors. The colors represent clinical classifications. In detail, the images of two individuals that “fall,” at the PCA space, far from the center of the group of his/her “clinical” phenotype, but close to individuals that share his/her “anatomical” phenotype.

Fig. 5.

Fig. 5

These images correspond to the individuals numbered in Fig. 4. “Neighbors” in the PCA plot share a similar anatomical phenotype (#1 and #2 have normal appearance; #3 and #4 [Lv] have marked frontal and parietal atrophy (arrows); #5 and #6 [Sv] have global severe atrophy, particularly at the left temporal lobe (circles)), while individuals that are distant from each other have very distinctive anatomical characteristics.

Fig. 6 shows the areas with highest absolute weight in the first three PCs. The highest loading weights in the first component were widespread in tissue (negative), particularly at the left, and at the ventricles (positive), characterizing global tissue atrophy and ventricle enlargement. This component contributed to the segregation of controls from PPA patients. The other components were marked by high (absolute) weights at the deep gray matter and the core white matter (second), and at the frontal, parietal, and temporal lobes, particularly at right, in the latter (third). This reflected the common phenotype of PPA, as well as particular characteristics of each variant, such as the predominance of atrophy in the frontal and parietal areas in Lv, and at the left temporal in Sv (Wilson, 2009).

Fig. 6.

Fig. 6

Highest loading weights of the three first principal components. The first component characterizes global tissue atrophy and ventricle enlargement, and contributed most to the segregation of controls from PPA patients. The second and third components reveal, respectively, general and particular characteristics of PPA variants, such as global deep gray matter atrophy and atrophy at the fronto-parietal regions (Lv) and temporal left (Sv).

Differences in protocol contributed to 5% of the data variance, suggesting that the anatomical effect size in this population was large enough to be delineated beyond the protocol. Age contributed to 1% of the variance and gender, to 0.73%.

3.4. Evaluation of the searching results

The PLS-DA resulted in a model with an accuracy of 0.875 in distinguishing Lv from Sv (Fig. 7). In the first component, which had the greatest power to segregate the groups, the loading (absolute) weights were higher in the frontal and parietal areas in Lv, and in the left temporal areas in Sv, which agreed with the pattern of atrophy described previously (Wilson, 2009). The second component, much less powerful in segregating the groups, revealed high loading weights in the frontal areas, the deep gray and white matter, in some temporal, parietal, and occipital left areas, and in the frontal horn of left ventricle, which is a common pattern of atrophy in this population.

Fig. 7.

Fig. 7

PLS-DA. The classifier selected and combined anatomical features characteristics of each variant (components) in a model that achieved reasonable classification accuracy. sT, mT, iT: superior, middle, and inferior temporal gyri; iO, mO: inferior and middle occipital gyrus; Ag: angular gyrus; sM: supra marginal gyrus; ins: insula; L: lingual gyrus; Cu: cuneus; iF, sF: inferior and superior frontal gyrus, pC: pre-Central gyrus; CSO: centrum semi ovalle.

4. Discussion

4.1. Advantages and disadvantages of the contraction of spatial information by ABA

We demonstrated the concept of converting images to a standardized vector to be used for the quantitative analysis of atrophy at population and individual levels. The accuracy of this conversion hinges on the accuracy of the image transformation and subsequent atlas warping. Our past publications reported a high level of accuracy using the LDDMM algorithm for populations with marked atrophy (Oishi, 2009; Djamanakova, 2013), although it depends on the structure in question. In addition, our healthy young adult atlas may not be the best template to achieve high transformation accuracy. More sophisticated approaches, such as multi-atlases (Aljabar, 2009; Heckemann, 2006; Iosifescu, 1997; Jia et al., 2012; Klein and Hirsch, 2005; Lao, 2004; Liu et al., 2004; Rohlfing et al., 2004; Wang, 2010; Warfield et al., 2004; Wu et al., 2007), may increase the accuracy in the future.

For the image-vector conversion, ABA plays an important role in dimensional reduction. The comparison between ABA and VBA shown in Fig. 2 has several relevant points for CBIR. Both approaches share the same image transformation scheme and, therefore, they also share the same level of transformation accuracy. The difference is the spatial filtering: in this study no filter was employed for VBA while one extra step to group voxels within pre-defined structures was employed for ABA. This makes the impact of filtering comparable. Assuming the brain volume is 1300 ml3, there are 1.3 million pixels in an image with 1 × 1 × 1 mm resolution. If these pixels are treated independently, the subsequent image structuration (Fig. 3) and PCA (Fig. 4) would become prohibitively difficult due to the sheer amount of information. Another important difference is the way peripheral areas (the CSF, the cortex, and the white matter beneath the cortex) are defined. Because of their large anatomical variability, a problem that is aggravated by the atrophy, we cannot expect accurate cross-subject registration (Diaz-de-Grenu, 2014), even with highly elastic transformation by LDDMM. The mis-registration leads to lower sensitivity for atrophy detection. In the ABA, the separation of these three compartments is performed in the native space of each subject (Fig. 1), reducing the effect of voxel-by-voxel mis-registration. Averaging of a large number of voxels in ABA may also increase the sensitivity to detect diffuse atrophy patterns, although it could be insensitive to atrophy confined in a region significantly smaller than a parcel.

A drawback of ABA is that it requires pre-defined structural definitions. In this study, we used a parcellation based on classical anatomical definitions (Mori, 2008; Oishi, 2009). There are multiple other ways to parcellate the brain based on different criteria, for example, vascular, cytoarchitectonic, or functional maps. The employed parcellation may not optimally capture important pathological features; the best parcellation scheme depends on many factors, including the pathology in question.

Another difficulty is to define the ideal level of granularity (the number of parcels). If an abnormality exists in a confined area, the granularity must be high enough to detect it. On the other hand, if the abnormality is widespread and subtle, to use lower granularity is a better choice. It is known that the brain structures are based on hierarchical relationships and brain pathology could affect at the different levels of the hierarchy. For example, in a cohort of neurodegenerative diseases, various types of brain atrophy are known to exist depending on the patient pathology, such as left-dominant brain atrophy, focal temporal lobe atrophy, or hippocampal atrophy. As shown in Fig. 3, evaluation of the brain anatomy with multiple granularity levels could be an interesting approach to characterize atrophy patterns of neurodegenerative patients.

In this study, we were interested in the level of granularity in which trained radiologists evaluate images; it is certainly not voxel-by-voxel. Judged from the level of inter-rater agreement, the granularity level seems to be less than 211 parcels. At level of 14 lobar-based parcels, we found good agreement among raters as well as between the human and automated analysis. More comprehensive study with more granularity levels may be needed to further investigate how human examines images.

4.2. Evaluation of structured anatomical information by PCA

For conditions like PPA, which is known to contain heterogeneous pathological subtracts, the population-based analysis such as what shown in Fig. 2 has a limited value. If we want to incorporate anatomical phenotype information for clinical use, alternative analytical frameworks, in which individual data are retained and not population-averaged, are needed. Using PCA, we explored the interpretation of individual images within a context of the population data. In addition, a concept of similarity search based on the feature vectors was demonstrated (Figs. 4). The distances among individuals could be used for similarity search. For example, suppose that patient # 5 in Fig. 2 is a patient of interest. Based on the nearest 10 cases in the PCA plot, one can conclude that this is probably an SvPPA patient because 70% of his “neighbors” are Sv, 20% are Lv, and 10% are controls. This type of image search can provide evidence-based clinical reports (Sinha et al., 2001); the PACS data can be searched purely on anatomical features, and statistical reports of associated clinical phenotypes can be provided, aiding the diagnosis and prognosis.

Of course, the searching approach described above considers only the anatomical phenotypes, which may not encode complete information about the pathology. One potential extension of the PCA is to add other dimensions of information, such as clinical data, to create classifiers. For example, using PLS-DA, we achieved a reasonable accuracy for classifying individuals according the PPA variants (Fig. 7). The idea of creating classifiers based on image features is not new and was successfully applied before to various neurodegenerative diseases, such as Alzheimer's, achieving a level of classification accuracy that suppress the human analysis (Cuingnet, 2011; Eskildsen, 2013; Fritzsche, 2010; Gray, 2013; Klöppel, 2008a,b). While the level of the classification accuracy is one of the hallmarks of the success, our main interest in this paper is to examine if the resultant anatomical features that would discriminate the patient populations actually make sense in the light of clinicians' accumulated knowledge, which they did (Gorno-Tempini, 2011, 2004; Mesulam, 2009, 2012). This indicates that the classifier extracts and smartly combines important anatomical features. These features can be easily translated, sought, and confirmed by the qualitative analysis, which is not always possible using VBA.

4.3. Limitations and perspectives

In the present study, we chose to use just the regional volumes, obtained from T1-WI analysis, as the image feature because the pathology in question causes brain atrophy primarily. However, this methodology is extensible to any other image contrast, such as T2-WIs, DTI derived-contrasts, SWIs, MRSI, or even to correlation matrices derived from resting state fMRI, since they share the same anatomical space as shown in a previous publication (Faria, 2012). The different image features, that can be contrasts such as FA, MD, or metabolite concentrations, or connectivity maps, can be used isolated or combined as a matrix of structures by image features that would characterize each individual. Then the same type of analysis can be performed and would be useful in cases where different domains of the anatomical phenotype affected (Zhang, 2011).

Finally, we do not expect perfect diagnostic accuracy with the discriminant analysis; first, as described above, we cannot assume that the anatomy contains enough information about the pathology. Second, our analysis does not include several critical information, such as clinical severity or distance from the onset. In addition, the diagnosis by clinicians, which is used as the gold standard for the classification, is known to contain errors and it is reasonable that the diagnoses change over time. In this context, the goal of this study was to maximize the usefulness of the anatomical information delineated by MRI by systematically integrating the anatomical and clinical information from past cases. We achieved this goal by demonstrating effective structuration of image data through image-vector conversion, which provides new opportunities to mine existing clinical databases for medical decision support.

This study is, however, still preliminary to judge the usefulness of the proposed approach in real clinical settings. To replicate the situations physicians routinely encounter in daily practices, a larger number of cases that contain more heterogeneous pathology are required. Another practical, yet, highly important question is if automated image analysis tools are robust enough to accurately analyze clinical data in PACS, which are often suboptimal in terms of image resolutions and contrasts. Although our tool has been tested for pediatric (Faria, 2010) and geriatric populations (Oishi, 2009; Djamanakova, 2013), protocol impacts have not been rigorously tested. Also, in order to determine the regional pattern of failure we need extensive manual delineation of hundreds of tridimensional structures in a large database. It is in our future scope to improve and test the robustness through technical advancement such as multiple atlas segmentation.

The following is the supplementary data related to this article.

Appendix table

Clinical Information of PPA participants.

mmc1.docx (27.5KB, docx)

Acknowledgments

We thank all the patients and volunteers that agree to participate. We thank NIH (grants P41RR15241, RO1AG20012, and RO1NS058299 (SM), RO1HD065955 (KO), RO3EB014357 (AVF), R01 DC011317 and R01 DC 03681 (AH)) for financial support.

References

  1. Aljabar P. Multi-atlas based segmentation of brain images: atlas selection and its effect on accuracy. Neuroimage. 2009;46(3):726–738. doi: 10.1016/j.neuroimage.2009.02.018. 19245840 [DOI] [PubMed] [Google Scholar]
  2. Ceritoglu C. Multi-contrast large deformation diffeomorphic metric mapping for diffusion tensor imaging. Neuroimage. 2009;47(2):618–627. doi: 10.1016/j.neuroimage.2009.04.057. 19398016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Chung M.K. A unified statistical approach to deformation-based morphometry. NeuroImage. 2001;14(3):595–606. doi: 10.1006/nimg.2001.0862. 11506533 [DOI] [PubMed] [Google Scholar]
  4. Cuingnet R. Automatic classification of patients with Alzheimer's disease from structural MRI: a comparison of ten methods using the ADNI database. Neuroimage. 2011;56(2):766–781. doi: 10.1016/j.neuroimage.2010.06.013. 20542124 [DOI] [PubMed] [Google Scholar]
  5. Diaz-de-Grenu L.Z. A brief history of voxel-based grey matter analysis in Alzheimer's disease. J. Alzheimers Dis. 2014;38(3):647–659. doi: 10.3233/JAD-130362. 24037033 [DOI] [PubMed] [Google Scholar]
  6. Djamanakova A. Diffeomorphic brain mapping based on T1-weighted images: improvement of registration accuracy by multichannel mapping. J. Magn. Reson. Imaging. 2013;37(1):76–84. doi: 10.1002/jmri.23790. 22972747 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. El-Kwae E.A., Xu H., Kabuka M.R. Content-based retrieval in picture archiving and communication systems. J. Digit. Imaging. 2000;13(2):70–81. doi: 10.1007/BF03168371. 10843252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Eskildsen S.F. Prediction of Alzheimer's disease in subjects with mild cognitive impairment from the ADNI cohort using patterns of cortical thinning. Neuroimage. 2013;65:511–521. doi: 10.1016/j.neuroimage.2012.09.058. 23036450 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Faria A.V. Atlas-based analysis of neurodevelopment from infancy to adulthood using diffusion tensor imaging and applications for automated abnormality detection. NeuroImage. 2010;52(2):415–428. doi: 10.1016/j.neuroimage.2010.04.238. 20420929 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Faria A.V. Atlas-based analysis of resting-state functional connectivity: evaluation for reproducibility and multi-modal anatomy-function correlation studies. Neuroimage. 2012;61(3):613–621. doi: 10.1016/j.neuroimage.2012.03.078. 22498656 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Fritzsche K.H. Automated MR morphometry to predict Alzheimer's disease in mild cognitive impairment. Int. J. Comput. Assist. Radiol. Surg. 2010;5(6):623–632. doi: 10.1007/s11548-010-0412-0. 20440655 [DOI] [PubMed] [Google Scholar]
  12. Gorno-Tempini M.L. Cognition and anatomy in three variants of primary progressive aphasia. Ann. Neurol. 2004;55(3):335–346. doi: 10.1002/ana.10825. 14991811 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gorno-Tempini M.L. Classification of primary progressive aphasia and its variants. Neurology. 2011;76(11):1006–1014. doi: 10.1212/WNL.0b013e31821103e6. 21325651 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gray K.R. Random forest-based similarity measures for multi-modal classification of Alzheimer's disease. Neuroimage. 2013;65:167–175. doi: 10.1016/j.neuroimage.2012.09.065. 23041336 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Greenspan H., Pinhas A.T. Medical image categorization and retrieval for PACS using the GMM-KL framework. IEEE Trans. Inf. Technol. Biomed. 2007;11(2):190–202. doi: 10.1109/titb.2006.874191. 17390989 [DOI] [PubMed] [Google Scholar]
  16. Heckemann R.A. Automatic anatomical brain MRI segmentation combining label propagation and decision fusion. NeuroImage. 2006;33(1):115–126. doi: 10.1016/j.neuroimage.2006.05.061. 16860573 [DOI] [PubMed] [Google Scholar]
  17. Hsu W. Context-based electronic health record: toward patient specific healthcare. IEEE Trans. Inf. Technol. Biomed. 2012;16(2):228–234. doi: 10.1109/TITB.2012.2186149. 22395637 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hussein R. DICOM structured reporting: part 2. Problems and challenges in implementation for PACS workstations. RadioGraphics. 2004;24(3):897–909. doi: 10.1148/rg.243035722. 15143239 [DOI] [PubMed] [Google Scholar]
  19. Iosifescu D.V. An automated registration algorithm for measuring MRI subcortical brain structures. NeuroImage. 1997;6(1):13–25. doi: 10.1006/nimg.1997.0274. 9245652 [DOI] [PubMed] [Google Scholar]
  20. Jia H., Yap P.T., Shen D. Iterative multi-atlas-based multi-image segmentation with tree-based registration. Neuroimage. 2012;59(1):422–430. doi: 10.1016/j.neuroimage.2011.07.036. 21807102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Klein A., Hirsch J. Mindboggle: a scatterbrained approach to automate brain labeling. NeuroImage. 2005;24(2):261–280. doi: 10.1016/j.neuroimage.2004.09.016. 15627570 [DOI] [PubMed] [Google Scholar]
  22. Klöppel S. Accuracy of dementia diagnosis: a direct comparison between radiologists and a computerized method. Brain. 2008;131(11):2969–2974. doi: 10.1093/brain/awn239. 18835868 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Klöppel S. Automatic classification of MR scans in Alzheimer's disease. Brain. 2008;131(3):681–689. doi: 10.1093/brain/awm319. 18202106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Laboratory of Brain Anatomical MRI Center for Imaging Science at Johns Hopkins University . 2014. MRI Studio: an image processing program. Available from: https://http://www.mristudio.org. [Google Scholar]
  25. Lao Z. Morphological classification of brains via high-dimensional shape transformations and machine learning methods. NeuroImage. 2004;21(1):46–57. doi: 10.1016/j.neuroimage.2003.09.027. 14741641 [DOI] [PubMed] [Google Scholar]
  26. Liu T., Shen D., Davatzikos C. Deformable registration of cortical structures via hybrid volumetric and surface warping. NeuroImage. 2004;22(4):1790–1801. doi: 10.1016/j.neuroimage.2004.04.020. 15275935 [DOI] [PubMed] [Google Scholar]
  27. Mesulam M. Quantitative template for subtyping primary progressive aphasia. Arch. Neurol. 2009;66(12):1545–1551. doi: 10.1001/archneurol.2009.288. 20008661 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Mesulam M.M. Quantitative classification of primary progressive aphasia at early and mild impairment stages. Brain. 2012;135(5):1537–1553. doi: 10.1093/brain/aws080. 22525158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Miller M.I. Increasing the power of functional maps of the medial temporal lobe by using large deformation diffeomorphic metric mapping. Proc. Natl. Acad. Sci. U S A. 2005;102(27):9685–9690. doi: 10.1073/pnas.0503892102. 15980148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Mori S. Stereotaxic white matter atlas based on diffusion tensor imaging in an ICBM template. NeuroImage. 2008;40(2):570–582. doi: 10.1016/j.neuroimage.2007.12.035. 18255316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Müller H. A review of content-based image retrieval systems in medical applications—clinical benefits and future directions. Int. J. Med. Inform. 2004;73(1):1–23. doi: 10.1016/j.ijmedinf.2003.11.024. 15036075 [DOI] [PubMed] [Google Scholar]
  32. Oishi K. Human brain white matter atlas: identification and assignment of common anatomical structures in superficial white matter. NeuroImage. 2008;43(3):447–457. doi: 10.1016/j.neuroimage.2008.07.009. 18692144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Oishi K. Atlas-based whole brain white matter analysis using large deformation diffeomorphic metric mapping: application to normal elderly and Alzheimer's disease participants. Neuroimage. 2009;46(2):486–499. doi: 10.1016/j.neuroimage.2009.01.002. 19385016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Orphanoudakis S.C., Chronaki C.E., Vamvaka D. I2Cnet: content-based similarity search in geographically distributed repositories of medical images. Comput. Med. Imaging Graph. 1996;20(4):193–207. doi: 10.1016/s0895-6111(96)00013-4. 8954228 [DOI] [PubMed] [Google Scholar]
  35. Radiological Society of North America . 2014. Radiological lexicon. Available from: https://http://www.rsna.org/RadLex.aspx. [Google Scholar]
  36. Radiological Society of North America. Medical Image Research Center . 2014. Available from: https:// http://www.rsna.org/MIRC.aspx. [Google Scholar]
  37. Rahman M.M., Bhattacharya P., Desai B.C. A framework for medical image retrieval using machine learning and statistical similarity matching techniques with relevance feedback. IEEE Trans. Inf. Technol. Biomed. 2007;11(1):58–69. doi: 10.1109/titb.2006.884364. 17249404 [DOI] [PubMed] [Google Scholar]
  38. Riddle W.R. Characterizing changes in MR images with color-coded Jacobians. Magn. Reson. Imaging. 2004;22(6):769–777. doi: 10.1016/j.mri.2004.01.078. 15234445 [DOI] [PubMed] [Google Scholar]
  39. Robinson G.P. Medical image collection indexing: shape-based retrieval using KD-trees. Comput. Med. Imaging Graph. 1996;20(4):209–217. doi: 10.1016/s0895-6111(96)00014-6. 8954229 [DOI] [PubMed] [Google Scholar]
  40. Rohlfing T., Russakoff D.B., Maurer C.R., Jr. Performance-based classifier combination in atlas-based image segmentation using expectation-maximization parameter estimation. IEEE Trans. Med. Imaging. 2004;23(8):983–994. doi: 10.1109/TMI.2004.830803. 15338732 [DOI] [PubMed] [Google Scholar]
  41. Shen D., Davatzikos C. HAMMER: hierarchical attribute matching mechanism for elastic registration. IEEE Trans. Med. Imaging. 2002;21(11):1421–1439. doi: 10.1109/TMI.2002.803111. 12575879 [DOI] [PubMed] [Google Scholar]
  42. Shi F. Construction of multi-region-multi-reference atlases for neonatal brain MRI segmentation. NeuroImage. 2010;51(2):684–693. doi: 10.1016/j.neuroimage.2010.02.025. 20171290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Sinha U., Taira R., Kangarloo H. Structure localization in brain images: application to relevant image selection. Proc AMIA Symp. 2001:622–626. 11837219 [PMC free article] [PubMed] [Google Scholar]
  44. Thompson P.M. Growth patterns in the developing brain detected by using continuum mechanical tensor maps. Nature. 2000;404(6774):190–193. doi: 10.1038/35004593. 10724172 [DOI] [PubMed] [Google Scholar]
  45. Unay D., Ekin A., Jasinschi R.S. Local structure-based region-of-interest retrieval in brain MR images. IEEE Trans. Inf. Technol. Biomed. 2010;14(4):897–903. doi: 10.1109/TITB.2009.2038152. 20064763 [DOI] [PubMed] [Google Scholar]
  46. Wang L. Large deformation diffeomorphism and momentum based hippocampal shape discrimination in dementia of the Alzheimer type. IEEE Trans. Med. Imaging. 2007;26(4):462–470. doi: 10.1109/TMI.2005.853923. 17427733 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Wang L. 4D segmentation of brain MR images with constrained cortical thickness variation. PLOS One. 2013;8(7):e64207. doi: 10.1371/journal.pone.0064207. 23843934 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Wang Q. Groupwise registration based on hierarchical image clustering and atlas synthesis. Hum. Brain Mapp. 2010;31(8):1128–1140. doi: 10.1002/hbm.20923. 20063349 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Warfield S.K., Zou K.H., Wells W.M. Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation. IEEE Trans. Med. Imaging. 2004;23(7):903–921. doi: 10.1109/TMI.2004.828354. 15250643 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Wilson S.M. Automated MRI-based classification of primary progressive aphasia variants. Neuroimage. 2009;47(4):1558–1567. doi: 10.1016/j.neuroimage.2009.05.085. 19501654 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Wu G., Qi F., Shen D. Learning best features and deformation statistics for hierarchical registration of MR brain images. Inf Process Med Imaging. 2007;20:160–171. doi: 10.1007/978-3-540-73273-0_14. 17633697 [DOI] [PubMed] [Google Scholar]
  52. Zhang D. Multimodal classification of Alzheimer's disease and mild cognitive impairment. Neuroimage. 2011;55(3):856–867. doi: 10.1016/j.neuroimage.2011.01.008. 21236349 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix table

Clinical Information of PPA participants.

mmc1.docx (27.5KB, docx)

Articles from NeuroImage : Clinical are provided here courtesy of Elsevier

RESOURCES