Abstract
Background
Dysfunction in the endolysosome, a late endosomal to lysosomal degradative intracellular compartment, is an early hallmark of some neurodegenerative diseases, in particular Alzheimer's disease. However, the subtle morphological changes in compartments of affected neurons are difficult to quantify quickly and reliably, making this phenotype inaccessible as either an early diagnostic marker, or as a read-out for drug screening.
Methods
We present a method for automatic detection of fluorescently labeled endolysosomes in degenerative neurons in situ. The Drosophila blue cheese (bchs) mutant was taken as a genetic neurodegenerative model for direct in situ visualization and quantification of endolysosomal compartments in affected neurons. Endolysosomal compartments were first detected automatically from 2-D image sections using a combination of point-wise multi-scale correlation and normalized correlation operations. This detection algorithm performed well at recognizing fluorescent endolysosomes, unlike conventional convolution methods, which are confounded by variable intensity levels and background noise. Morphological feature differences between endolysosomes from wild type vs. degenerative neurons were then quantified by multivariate profiling and support vector machine (SVM) classification based on compartment density, size and contrast distribution. Finally, we ranked these distributions according to their profiling accuracy, based on the backward elimination method.
Results
This analysis revealed a statistically significant difference between the neurodegenerative phenotype and the wild type up to a 99.9% confidence interval. Differences between the wild type and phenotypes resulting from overexpression of the Bchs protein are detectable by contrast variations, whereas both size and contrast variations distinguish the wild type from either of the loss of function alleles bchs1 or bchs58. In contrast, the density measurement differentiates all three bchs phenotypes (loss of function as well as overexpression) from the wild type.
Conclusion
Our model demonstrates that neurodegeneration-associated endolysosomal defects can be detected, analyzed, and classified rapidly and accurately as a diagnostic imaging-based screening tool.
Background
Major efforts are underway to identify drug candidates for the treatment of Alzheimer's disease. Most of these are aimed at interference with the production or activity of the amyloid peptide Aβ, since this is the most likely causative agent of the disease [1]. However, other cell biological phenomena such as degradative trafficking to the lysosome have been identified as playing an important role in the disease progression [2].
The endolysosome refers to a vesicular degradative organelle spanning the late endosomal and lysosomal compartments [3]. Endolysosomal compartments are found in neuronal cell bodies, and are transported along axons to and from synaptic termini in both mammals and flies [4-6]. The involvement of the endolysosomal system in neurodegeneration is suggested by defects in the morphology, enzymatic sorting, and function of these compartments which accompany early stages of diseases such as Alzheimer's [7,8]. In fact, Cataldo et al. [9] have shown that there are morphometric differences in the endolysosomal compartments of neurons in the central nervous system of normal and Alzheimer's disease afflicted human subjects. Large-scale and unbiased assessment of morphological abnormalities in the endolysosomal compartment of degenerating neurons either in vivo or in cell culture would be valuable both to define a recognizable neurodegenerative state, and to provide a readout for high-throughput cell-based screening paradigms.
Neuronal loss- and gain-of-function in blue cheese (bchs), a putative lysosomal transport protein in Drosophila [10,11] lead to a phenotype that displays features associated with human neurodegenerative diseases. These include neuronal death, ubiquitin-rich brain inclusions, and shortened lifespan in adults [10] and disintegration of axonal processes, cytoskeletal defects, slowed axonal transport and motorneuronal death in third instar larvae (Lim and Kraut, manuscript under revision). Here, we describe alterations in endolysosomal size and density accompanying the neurodegenerative phenotype in motorneurons of blue cheese mutant larvae. This system was chosen for examining neurodegenerative endolysosomal changes because of several advantages over other neurodegeneration models. First, the neurodegenerative phenotype in blue cheese comes about due to a defect in a presumptive lysosomal processing pathway [11], as opposed to other commonly used mouse and fly models, which rely on overexpression of a human gene [10,12,13]. Secondly the existence of identified motorneurons in which degeneration takes place makes this model unique (Lim & Kraut, submitted). Both of these features remove ambiguity in interpreting the data. Additionally, the ease of preparation of the neurodegenerative mutant animals, and the fact that motorneurons in the fly larva are close to the surface, provide a practical advantage in collecting large numbers of high-quality images.
Previously, changes in endolysosomal morphology and/or number have been noted under various neurodegenerative conditions [8,9]. However, these were described in a labor-intensive and not easily quantifiable manner, and therefore did not yield parameters that could be applied to rapid analysis of either in vivo animal models or drug screening platforms. Since manual quantification of endolysosomal features is highly subjective and time consuming, a rapid and efficient method of assessing these features would be desirable as the basis for a novel screening paradigm. Loo et al. [14] have implemented a multivariate drug screening approach on human cancer cells, where the phenotypic effects of various drugs were classified based on 300 feature measurements. Here, we test the efficacy of using in situ profiling of neuronal endolysosomes as a potential platform for morphogenetic screening. In our method, the endolysosomes are visualized by transgenic expression in isolated motorneurons of a green fluorescent protein (GFP)-fusion of an endolysosomal protein, Spinster[15]. This marker has the advantage that it can be expressed in isolated neurons of choice using the Gal4-UAS system [16] and that it is endolysosomally localized. Fluorescently labeled compartments appear as fluorescent spots in 2-D image sections of neuron termini, which synapse onto muscles at the neuromuscular junction (NMJ)[17].
The analysis presented here first uses a novel image filtering method to detect fluorescently labeled endolysosomal compartments, which appear as spots, in the NMJs of larval motorneurons. The detection method relies in its ability to segment images of neurons robustly and accurately in situ to yield endolysosomal compartments, or spots, as regions of interest (ROIs) that can be analyzed objectively. Standard segmentation methods based on edge detection [18] and intensity thresholding [19] performed poorly in recognizing spots in our images, since some spots have very weak edges and/or intensities and therefore fall below the detection threshold. The presence of background fluorescence or noise also exacerbated the problem of accurately detecting spots. Here, we are able to detect both strong and weak spots while suppressing background noise by identifying regions in the image with a high point-wise multi-scale correlation and normalized cross-correlation [20,21] to that of a Laplacian of Gaussian (LoG) filter [22,23].
The second step of our method uses a support vector machine (SVM) classifier [24] with a radial basis function to determine the differences in the contrast, size and density features between the wild type and the neurodegenerative image sets. We also rank the features, based on their profiling accuracy, via the backward elimination scheme [25]. Univariate and multivariate analyses were used to rank various features of the endolysosomal compartments (e.g. size, density, intensity) in terms of discriminating power in order to assess the importance of each feature in determining the ability of the SVM to predict genotypic origin. This identified the feature or features that could be used most efficiently to assign a class, either neurodegenerative or WT, to a given sample, and also allowed the visualization of differences between each wild type vs. mutant pair. Finally, Fisher's Linear Discriminant analysis (FLD) [26,27] was used to visualize the underlying differences between the wild type and the various mutant classes.
The novelty of this study lies in the application of standard machine learning and statistical analytical methods to an in-vivo neurodegenerative model. It has previously been noted that the endolysosomal compartments of degenerating neurons are affected not only functionally, but also in terms of appearance, e.g. size, shape, and density [9,28]. Our study now shows that rapid in situ profiling of morphological features of endolysosomal compartments is possible, and could therefore serve as a potential platform for morphogenetic screening of neurodegenerative diseases either in animal or cell culture models. The method consists of imaging a large number of motorneuronal endolysosomal compartments of diverse contrast levels, whose features are not visually obvious, and are not accessible by conventional measurement methods. The SVM classifier evaluates changes in contrast, size and density features of these compartments in a rapid, unbiased manner, and on a far larger quantity of image data than would be possible manually, e.g. by analysis of electron micrographs, or fluorescence images.
To our knowledge, this is the first reported image-based multivariate analysis of a neurodegeneration-related feature at the cell biological level. The strength of the approach is that it can identify and quantitatively describe a diagnostic aspect of the neurodegenerative phenotype, namely alterations in key endolysosomal features. Because of its speed, robustness, and accuracy, the proposed method has potential applications in screening the palliative effects of anti-neurodegenerative drugs on an in vivo model.
Methods
Preparation of Larvae for Imaging
Third instar Drosophila larvae carrying a transgene for the green fluorescent fusion protein spinster-GFP[15] in wild type or in a genetic background mutant in the blue cheese gene [10] or in a background carrying an overexpression construct in blue cheese EP2299 (Enhancer-Promoter line 2299) [29,30], were analyzed. Expression of the endolysosomal marker spinster-GFP was driven in two identified motorneurons, which are referred to using the abbreviations aCC and RP2 (see reference [17]) of each hemisegment using the GAL4-UAS system [16], with an even skipped driver line (designated RRa > spinster-GFP, where the symbol ">" refers to a GAL4 line driving a reporter. RRa was the abbreviated name given to the even skipped driver line described in [31]). Genotypes were as follows:
i.
ii.
iii.
iv.
The aforementioned genotypes will henceforth be referred to as i. Wild type ii. bchs1 iii. bchs58 and iv. EP+ respectively. Df(2L)c17 and bchs1 are genetic deletions and strong loss of function alleles of blue cheese [10] while bchs58 carries a stop mutation in the blue cheese coding region, but does not produce detectable protein (own observations and [32]). Larval fillets were prepared, immunostained for GFP, and mounted as described previously [30]. Neuromuscular junctions (NMJs) of the aCC motorneuron in abdominal hemisegments number 2–7 (A2-A7; 12 hemisegments per animal) were imaged and NMJs for all animals of a given genotype were pooled.
Image Acquisition
The images were acquired with a 60×/1.42NA lens at 490 nm excitation/528 nm emission wavelengths using the Deltavision Fluorescence deconvolution imaging system (SoftWorx; Applied Precision, Seattle) equipped with a motorized Olympus IX70 inverted wide-field fluorescence microscope, a 12 bit Coolsnap HQ CCD camera and a 100 W mercury lamp. A stack of 512 × 1024 pixel optical sections spaced at 0.3 μm was acquired for every NMJ in the larvae samples with no binning. The sampling pixel size was 0.1 μm × 0.1 μm along the x- and y- axes which provided sufficient resolution for imaging the endolysosomal compartments.
Three morphological parameters were used to characterize the differences between endolysosomal compartments in wild type and in each of the different mutant genotypes as shown in Table 1. The exposure time and illumination intensity were fixed for each pair-wise comparative study (i.e. WT vs. a particular blue cheese allele) since we determined that certain measurements were sensitive to variations in exposure conditions (not shown). However, it was not possible to use a common acquisition setting for all three studies due to the widely varying fluorescence of the spots between the different groups. The acquisition setting was selected such that the dynamic range of the CCD chip was not exceeded, and an optimal contrast of the endolysosomal compartments with respect to the background was achieved. Images were deconvolved using the constrained iterative algorithm [33] provided with SoftWorx.
Table 1.
Study | Exposure time (ms) | Illumination intensity (% transmission) |
Wild type vs. EP+ | 0.5 | 10 |
Wild type vs. bchs1 | 0.3 | 10 |
Wild type vs. bchs58 | 0.3 | 10 |
Image Processing
Endolysosomal compartments within NMJs were imaged in 2-D optical sections and their contrast, size and density were quantified via the following three sequential steps: (i) enhancement of NMJ spots, (ii) detection and labelling of NMJ spots, and (iii) extraction of spot size, density and contrast.
Enhancement of NMJ spots
The normalized cross-correlation (NCC) as a measure of the similarity between the local spot intensity profile and a predefined Laplacian of Gaussian (LoG) filter was calculated as follows:
(1) |
where bx,y and g7 × 7 are column-wise vector representations of the local image neighbourhood centred at pixel coordinates (x, y) and the 7 × 7 LoG spot filter respectively whereas and are the mean values of bx, y and g7 × 7 respectively.
The point-wise multi-scale correlation (MSC) is given by eqn. (2). The MSC measure is based on the degree of correlation of a localized region at (x, y) to a total of J "spot-like" LoG filters
(2) |
where Ci is a correlation measure formally expressed as Ci(x, y) = ⟨bx,y, g2i+1 × 2i+ 1⟩, 2i+1 denotes the filter dimension where i = 1, 2, ..., J and J = 5 in this case.
Detection and Labeling of NMJ Spots
Combined NCC and MSC detection was done by first detecting regions whose normalized cross correlation magnitude exceeded a predefined threshold T1 i.e. |NCC(x, y)| > T1.
Spots detected by the NCC were also filtered to assure that the magnitudes of the point-wise MSC measures exceeded another predefined threshold T2 i.e. |MSC(x, y)| > T2. A sufficiently low threshold value was determined for T2, via a trial and error approach on the available set of image sections, such that the spot regions were effectively distinguished from noise. Appropriate T1 and T2 levels (T1 = 0.4 and T2 = 23) were chosen from a range of threshold T1 and T2 values that were empirically tested for their classification accuracy (see fig. 5). These threshold values were used for the measurements of all genotypes. After detection, a unique label was assigned to each spot using 2-D connected components labelling [34] so that the contrast and size information of each spot could then be extracted.
Extraction of Size, Density and Contrast
For feature measurements, size information was encoded as a histogram distribution with 10 size categories (S1, S2, ..., S10) ranging from 0.01 μm2 to 0.1 μm2 (1–10 pixels). Contrast information was also encoded as a histogram distribution with 14 categories (C1, C2, ..., C14) uniformly divided within the 0–255 gray-scale where the contrast measurement of a spot is defined as the difference between the peak intensity of the spot and the background intensity surrounding it. The background intensity follows the minimum pixel value within a 15 × 15 window centered around the peak intensity location. A 15 × 15 window size was chosen since it is sufficiently large to contain any given spot.
The feature "spot density" was obtained by determining the centroid of each spot, corresponding to the location of its peak intensity. Subsequently a circular area with a fixed radius r was defined around each spot so that the spot density could be translated as the degree of overlap between these circles. The radius r was selected such that the circles were larger than the spots, and the diameter (2r) typically resembled the average distance between adjacent spots for the wild type case. We formally define spot density D as
(3) |
where N is the number of spots, r is the fixed radius and A is the total area occupied by the spots. D is normalized such that it lies between 0 and 1. Smaller areas of overlap give a lower spot density. The accuracy of the density measure is adversely affected if there are too few spots or if the spot distribution in an NMJ section is nonuniform. To ensure its robustness, we compute the density measure for cases where the spot population is sufficiently large (N ≥ 50). We observe that the spot distribution in the NMJs is generally uniform. However, in any given NMJ section, there may be two or more clusters of spots that are disjoint due to the intervening regions being out of focus at that focal section. In such cases, the density measure is based on the largest cluster of spots in that section.
SVM Classification
Acquired feature data was analyzed using a radial basis function Support Vector Machine (R-SVM) classifier [24]. A leave-one-out cross validation technique was applied to determine the classification accuracy of a dedicated R-SVM for each pair of classes. The data set in each pair was normalized such that each entry had a zero mean and unit standard deviation. Since the R-SVM requires the penalty factor (φ) and kernel width of the radial function (σ) as input arguments, we selected φ' and σ' which gave the highest classification accuracy from the following range of input values φ = {0.1, 1, 10, 100, 1000} and 0.2 ≤ σ ≤ 40 where the step size Δσ = 0.2. Once implemented, R-SVM(φ', σ') outputs a decision value di for every input feature vector xi where the sign of di is used to predict the class to which xi belongs and the magnitude is a measure of the distance of xi from the decision boundary.
Statistical Data Analysis
Feature Ranking – Univariate Analysis
The discriminating power of individual features was quantified based on the Student's t-test [27] value where discriminability is measured by the magnitude of the t2 value which is expressed as follows
(4) |
where m1 and m2 are the mean values of a given feature in the wild type and mutant class respectively, σ1 and σ2 are the corresponding standard deviation values and n1 and n2 are the sample sizes of the two classes.
Feature Ranking – Multivariate Analysis
The Hotelling t-test [27] in eqn. 5, which is the generalized form of the t-test in eqn.4, is given by:
T = (m1 - m2)'S-1(m1 - m2) | (5) |
The notations m1 and m2 denote the mean distribution of feature vectors in the two classes whereas S = S1/n1 + S2/n2 given that S1 and S2 are the corresponding covariance matrices.
Data Visualization – Univariate Analysis
The differences between the wild type and mutant mean distribution profiles for contrast and size were plotted in histograms. The scalar value for spot density was plotted as a probability density function with a Gaussian distribution characterized by the mean and standard deviation in spot density values from a given genotype.
Data Visualization – Multivariate Analysis
Fisher's linear discriminant [26] is applied to visualize the endolysosomal differences between the wild type and mutants in 3-D feature space defined by the axes ω12, ω13 and ω14 which are projection vectors that optimally separate wild type vs. bchs1, wild type vs. bchs58 and wild type vs. EP+ respectively. They are formally defined as follows:
(6) |
Results
Detection of endolysosomal compartments in a neurodegenerative model
The Drosophila neurodegenerative mutant bchs was chosen as a model system in which to analyze changes in the morphology of endolysosomal compartments in degenerating neurons in situ. Neuronal degeneration in bchs mutant animals has been well-described and is believed to affect endolysosomal function and trafficking [10,11](Lim and Kraut, manuscript under revision). Drosophila strains were created that were mutant for bchs or overexpressing Bchs in motorneurons and that also expressed the endolysosomal marker spinster-GFP in motorneurons known to be affected by the neurodegenerative phenotype (A. Lim and R. Kraut, manuscript under revision). Third instar larvae of the relevant genetic backgrounds were filleted and neuromuscular termini at the body wall were imaged with high-resolution wide-field microscopy (see methods, and fig. 1).
The structure of interest was the Drosophila larval neuromuscular junction (NMJ) (shown schematically in fig. 1). The NMJ refers to the structure at the terminus of the motorneuron that extends its axon into the peripheral muscle field in the body wall of the larva, and makes numerous synaptic contacts with muscle targets. Analysis of image stacks of labelled NMJs after filleting, immunofluorescent labelling of endolysosomal compartments, wide-field imaging, and deconvolution were carried out as follows:
A) Differences in endolysosomal compartment features, including spot contrast, size, and density of spots, were quantified and compared between the wild type and the three mutant classes using a radial basis support vector machine (R-SVM).
B) 25 contrast, size and spot density features were ranked in terms of their discriminating abilities using the backward elimination scheme [25].
C) The differences in features between the wild type and the mutants were visualized from both univariate feature analysis and Fisher's linear discriminant [26,27].
Fig. 2 shows a typical maximum projection view of an NMJ from the wild type and mutant genotypes, expressing the endolysosomal marker spinster-GFP in two motorneurons (aCC and RP2) terminating at the NMJ on dorsal muscles. There are fewer compartments per area in the bchs mutants (Fig. 2b and 2c) compared to the wild type although most bchs1 spots appear more prominent. The EP+ NMJ (Fig. 2d) has brighter spots than the wild type.
In order to overcome the problem of differing intensities and contrast in fluorescently labelled endolysosomal compartments, which make detection by standard convolution algorithms difficult [35], we devised a spot enhancement scheme employing two correlation techniques: normalized cross correlation (NCC) [21] and point-wise multi-scale correlation (MSC). NCC has been widely used in frame averaging [20] and edge detection [36]. Here we used it to identify both bright and faint spots accurately based on the similarity of their local spot intensity profiles to a predefined Laplacian of Gaussian (LoG) filter. The normalized NCC measure thus resulted in accurate spot localization regardless of intensity, but was sensitive to background noise. Application of the MSC, on the other hand (see Methods, eqn. (2)) gave significantly better SNR but was sensitive to spot intensity, i.e. weaker spots were not always detected using MSC alone. However in combination with NCC, MSC was able to assign higher coefficient values to bona fide spots compared to background noise. Thus, spots were detected first by including regions whose NCC magnitude exceeded a predefined threshold T1 (see Methods, detection and labelling of NMJ spots), and potential artefacts or noise were eliminated by ensuring that the detected regions also had corresponding point-wise MSC measures whose magnitudes exceeded another predefined threshold T2 (see Methods, detection and labelling of NMJ spots).
Fig. 3 illustrates the effects of combining the two correlation measures, normalized cross correlation (NCC) and multiscale correlation (MSC), on a sample 2-D image section of an NMJ, whereas Fig. 4 shows that combining both NCC and MSC spot detection algorithms detects faint spots while suppressing artefacts arising from the background. The NCC measure enables the accurate detection of the individual spots although they may be clustered together. In Fig. 5 we show that the proposed spot detection method is robust in distinguishing wild type from Bchs overexpressing ('EP+') for threshold values of T1 and T2 ranging from 0.3–0.7 and 15–30 respectively. A relatively small standard deviation of ~2% is observed for the classification accuracy.
R-SVM Classification Results
After detection of spots by NCC and MSC, size, contrast and spot density feature measurements were extracted from the detected spots of a given NMJ image section (see Methods, Extraction of size, density and contrast). A stack of images was captured for every NMJ in every class, but the number of images in each stack varied depending on sample thickness. Each 2-D image section of an NMJ constitutes a data point that is represented by a feature vector with a total of 25 entries from which contrast, size and spot density measurements have 14, 10 and 1 entry respectively. Table 2 summarizes the number of NMJs and the corresponding data points used for comparing each pair of classes.
Table 2.
Class 1 | Class 2 | ||||
Genotype | No. of NMJ | Data points | Genotype | No. of NMJ | Data points |
Wild type | 13 | 135 | EP+ | 19 | 176 |
Wild type | 21 | 217 | bchs1 | 25 | 83 |
Wild type | 21 | 217 | bchs58 | 20 | 79 |
A radial basis function Support Vector Machine (R-SVM) classifier [24] was devised, into which the feature measurements were fed (see Methods, SVM classification). The R-SVM classifier quantified the differences in endolysosomal phenotypes between the wild type and each of the other three mutant genotypes. The R-SVM achieved good classification performance on previously unseen data by finding decision boundaries that optimally separate the wild type data points from those of the other genotypes. The classification scheme is expressed as three two-class problems i.e. wild type vs. bchs1, wild type vs. bchs58 and wild type vs. EP+.
The R-SVM was trained with data sets for each parameter from pairs of genotypes (i.e. bchs1 vs. WT or bchs58 vs. WT), and then challenged to classify images of unknown origin. Optimal classification accuracies of 78.1%, 72.2% and 82.0% were obtained for wild type vs. bchs1, bchs58 and EP+, respectively. Their corresponding SVM parameters representing the standard deviation of the Gaussian kernel σ and the penalty factor φ were as follows: (4, 1000), (40, 1000) and (8, 10) respectively for the three genotypes. Table 3 shows the confusion matrices for the three pairs. This is a significant achievement since, by simply analyzing an arbitrary 2-D section of an NMJ, we can accurately predict 78.1%, 72.2% and 82.0% of the time if the NMJ is a wild type or conversely a bchs1, bchs58 and EP+ mutant class respectively.
Table 3.
Predicted Class | Predicted Class | ||||||
Wild type | bchs1 | Wild type | bchs58 | ||||
Actual Class | Wild type | 77.0 | 23.0 | Actual Class | Wild type | 71.9 | 28.1 |
bchs1 | 27.3 | 72.7 | bchs58 | 27.0 | 73.0 | ||
(a) | (b) | ||||||
Predicted Class | |||||||
Wild type | EP+ | ||||||
Actual Class | Wild type | 78.5 | 21.5 | ||||
EP+ | 20.5 | 79.5 | |||||
(c) |
In order to assess the ability of the SVM to predict the genotypic origin of a given NMJ image, mean decision values (reflecting degree of reliability of the assignment of mutant vs. wild type) were calculated. The mean decision values for all the individual decision values obtained by the SVM for each NMJ section of a particular category of mutant vs. wild type were averaged and plotted with standard errors (Fig. 6). The differences in the mean decision value between the wild type and mutant classes are statistically significant with a high confidence interval of 99.9%.
Multivariate and Univariate Feature ranking
Using univariate and multivariate analyses, we assessed the value of various features to the SVM in determining its ability to predict genotypic origin (i.e., feature ranking). The purpose of feature ranking is to determine which features have the greatest discriminating power, and can thus be used most efficiently to assign a class (neurodegenerative vs. WT) to a given sample. This analysis also enables us to visualize the differences within each wild type vs. mutant pair.
The different feature categories, including 10 different size bins, 14 contrast bins, and the single density measurement were taken as individual features (see Methods, Extraction of size, density, and contrast). The Hotelling t-test [27] (see eqn. 5), was used to rank each of the features via the multivariate analysis approach (see Methods, Statistical data anlysis). A Hotelling multivariate T2 statistic was applied in a sequential backward elimination scheme [37] to rank the contrast (C1,...,14), size (S1,...,10) and density (D) features based on their discriminating power. We start with a complete set of features and sequentially remove those features that reduce the T2 value the least. Thus the feature of least importance is removed first and the most important is removed last. In the event that two or more features have the same T2 value, we remove the feature that corresponds to the lowest univariate T2. The use of T2 statistics is desirable due to its strictly monotonic function where its value for a subset of features is always less than or equal to that of the full set. Fig. 6 shows the monotonic decrease in the T2 value where, at every stage, the least important feature is sequentially removed for each wild type vs. mutant pair.
The feature describing spot density, D, appears as one of the top six most discriminating features for all three genotypes (Fig. 7). The smaller spots of sizes S2, S3 and S4 play an important role in discriminating wild type from bchs1 and bchs58 as shown in Fig. 7b, whereas spots of contrast levels C5, C6, etc. are crucial in discriminating wild type from EP+ as shown in Fig. 7c. Spots of mid level contrast C8 and C9 more effectively discriminate wild type from bchs1 whereas those of lower contrast C3 and C5 aid the discrimination of the bchs58 case.
Visualizing the Differences in Genotypes – Univariate and Multivariate Analysis
The differences between the wild type and mutant feature data were visualized by plotting on histograms the mean distribution profiles of endolysosomes from the different genotypes among the various contrast, size, and density bins (Fig. 8). In this way, the differences in the profiles of the mean spot size and contrast as well as variations in the spot density values could be compared visually between the wild type and the three mutant classes. A significant difference is observed in the mean spot size profiles (Fig. 8b, e) where the endolysosomal compartments are larger in the bchs mutants compared to the wild type. The mean contrast profiles of the bchs mutants in (Fig. 8a, d) are however very similar to the wild type. The converse is true for EP+ where the mean contrast profiles (Fig. 8g) show that, in general, the spots in EP+ have higher contrast levels than those in the wild type. Their mean spot size profiles, on the other hand, (Fig. 8h) are very similar.
Spot density is a scalar value and as such the differences in spot density values between the two groups are visualized from the corresponding probability density function (PDF) with a Gaussian distribution characterized by their respective mean and standard deviation in spot density. The spot density of the wild type endolysosomes is, in general, higher than those of any of the mutant classes (Fig. 8c, f, i). This is not an artifact of spot clustering in the mutants, since individual spots are resolvable from aggregations of spots. The filter profile was detectable within these more clustered regions.
Fisher's linear discriminant [26,27], shown in fig. 9, was used to compute the projection vectors ω12, ω13 and ω14 which optimally separate wild type vs. bchs1, wild type vs. bchs58 and wild type vs. EP+ respectively. These eigenvectors are then used to visualize the differences in distribution between the wild type and the mutant phenotypes. The discrimination between the wild type and mutant classes are visualized (Fig. 9b, c). As observed, wild type and EP+ were optimally discriminated along the axis ω14 (Fig. 9a) but axes ω12 and ω13 (not shown) do not contribute to this discrimination. Interestingly, the same two axes (ω12 and ω13) are both the best at discriminating wild type from mutants bchs1 and bchs58 (Fig. 9b, c). This implies that both bchs1 and bchs58 share common properties which set them apart from the wild type. This is consistent with these two genotypes both being strong loss of function alleles of bchs.
Conclusion
We have developed a novel imaging-based method for quantifying neurodegeneration-associated changes in the endolysosomal compartment, an organelle whose function is strongly associated with the pathology of neurodegenerative diseases, in particular Alzheimer's disease[2]. This method may be useful as a potential tool for diagnosis and screening of the neurodegenerative pathology at the cellular level.
Image analysis was carried out on individual affected neurons in a Drosophila neurodegenerative model, blue cheese, a mutant which is thought to disrupt lysosomal function[11]. We were able to accurately detect and analyze GFP-labelled endolysosomes at the termini of motorneurons in situ, using novel automated segmentation and feature extraction algorithms. The key steps of the method are as follows: (1) A combination of normalized and standard cross correlation techniques is used to give accurate segmentation of compartments from image sections, even those with poor SNR; (2) Contrast, size and spot density measurements are extracted from individual endolysosomal compartments; (3) Impartial and automated diagnosis of the phenotypes via the R-SVM classifier is carried out, and (4) A backward elimination scheme is used to rank the various feature measurements in terms of their power to discriminate the mutant phenotypes from the wild type.
Fluorescently labelled endolysosomes in motorneuronal endings appear as spots with diverse contrast and intensity in 2-D image sections. Due to this variability, conventional image segmentation fails to detect these compartments. In contrast, our segmentation approach, which uses normalized cross-correlation and multi-scale correlation algorithms in succession, is sensitive yet selective to these compartments regardless of their contrast levels. Based on measurement of relatively few key features (spot density, intensity, and size) endolysosomes from neurodegenerative motorneurons can be accurately recognized and differentiated from the wild type. These differences are not obvious by inspection, and manual collection of such quantitative data from a large number of image samples is prohibitively labour-intensive. As a solution to this problem, the automated algorithm detects differences between the neurodegenerative mutant compartments and normal compartments with a highly statistically significant confidence interval of 99.9%.
The ranking of the different feature measurements against each other showed that the relative importance of a particular feature in distinguishing the origin of a sample varied between genotypes. For example, the differences between the wild type and phenotypes resulting from overexpression of the Bchs protein (EP+) are largely attributable to contrast variations. On the other hand, both size and contrast variations differentiated the wild type from either of the loss of function alleles bchs1 and bchs58. Density measurements were different in all three bchs genotypes (loss of function as well as overexpression) from the wild type. Ranking of feature importance makes it in principle possible to select the features most relevant to an assessment of the neurodegenerative phenotype, and increase the speed and accuracy of a diagnostic readout.
While the contrast, size, and density feature differences between genotypes are quite subtle, as is evident from Fig. 2, clearly the R-SVM classifier was able to distinguish the neurodegenerative (mutant) case from wild type, in all three genetic classes. Surprisingly, the application of the feature detection algorithm to the fluorescence images in some cases yielded results that were counter-intuitive to observations made by eye: strikingly, images of NMJs from neurons overexpressing Bchs appeared in general to have larger compartments, whereas the algorithm revealed that the compartments were actually of similar or even smaller size, but higher contrast.
Interestingly, both the neurodegenerative bchs loss of function alleles gave endolysosomal spot size profiles that were bigger than wild type, and this feature was prominent in determining genotypic origin. This may be related to the observation of enlarged lysosomal compartments in the neurodegenerative lysosomal storage diseases [38,39] and in the human Chediak-Higashi syndrome, which results from mutation in a gene related to bchs [40]. Another decisive feature difference, the lower density of the compartments at the NMJ in all three degenerative genotypes, is also consistent with the inefficient axonal transport of endolysosomes from neuronal cell bodies toward the termini (where images were obtained) that was observed in these same genotypes (A. Lim and R. Kraut, manuscript under revision).
An important finding of this study was that Fisher's linear Discriminant categorized both loss of function alleles according to the same eigenvectors, even given the subtlety of the differences in features. This demonstrates the power of the method to identify essential feature information and classify phenotype ("degenerative" vs. "normal") based on relatively small data sets. Moreover, subclasses of phenotype (e.g. Bchs overexpressor vs. bchs mutant) can be defined, since particular eigenvectors and distributions along these vectors appear to be typical for certain genotypic classes, as is seen in Fig. 9.
In conclusion, as one of the key early pathological features of neurodegenerative disease, it is significant that the endolysosomal phenotype is now open to analysis by an automated, potentially high throughput process. The technique described here may be useful in rapid screening of drug candidates, since it can easily be applied to images in cell culture models of neurodegeneration. This type of analysis should also allow rapid, unbiased evaluation of degenerative changes in different in vivo models, which are undetectable by traditional manual methods, and may ultimately be useful as a means of reliably diagnosing a systemic cellular condition that affects the endolysosomal compartment and predisposes neurons to degeneration.
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
SK developed and implemented the algorithms for image processing, feature extraction and feature classification. SK also carried out the statistical analysis and drafted the manuscript. YW was responsible for the genetics, preparation of larvae and the acquisition of images. VV assisted in the implementation of the algorithms and the acquisition of images. RK conceived of the study, participated in its design and coordination, and helped to draft the manuscript.
Acknowledgments
Acknowledgements
This research is supported by the Agency for Science, Technology and Research (A-Star) and the Biomedical Research Council (BMRC), Singapore. We are grateful to Dr. Sean Sweeney for the spinster-GFP transgenic line, to Drs. Paul Garrity, Kim Finley and the Bloomington Stock Center for mutant strains, and to Ashley Wright for making bchs, RRa > spinGFP recombinant fly strains.
Contributor Information
Saravana K Kumarasamy, Email: skumar@ibn.a-star.edu.sg.
Yunshi Wang, Email: yunshi.wang@gmail.com.
Vignesh Viswanathan, Email: vignesh.viswanathan@nus.edu.sg.
Rachel S Kraut, Email: rkraut@ibn.a-star.edu.sg.
References
- Selkoe DJ. Developing preventive therapies for chronic diseases: lessons learned from Alzheimer's disease. Nutr Rev. 2007;65:S239–43. doi: 10.1301/nr.2007.dec.S239-S243. [DOI] [PubMed] [Google Scholar]
- Nixon RA, Cataldo AM. Lysosomal system pathways: genes to neurodegeneration in Alzheimer's disease. J Alzheimers Dis. 2006;9:277–89. doi: 10.3233/jad-2006-9s331. [DOI] [PubMed] [Google Scholar]
- Luzio JP, Pryor PR, Bright NA. Lysosomes: fusion and function. 2007;8:622–632. doi: 10.1038/nrm2217. [DOI] [PubMed] [Google Scholar]
- Hollenbeck PJ. Products of endocytosis and autophagy are retrieved from axons by regulated retrograde organelle transport. J Cell Biol. 1993;121:305–15. doi: 10.1083/jcb.121.2.305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dermaut B, Norga KK, Kania A, Verstreken P, Pan H, Zhou Y, Callaerts P, Bellen HJ. Aberrant lysosomal carbohydrate storage accompanies endocytic defects and neurodegeneration in Drosophila benchwarmer. J Cell Biol. 2005;170:127–39. doi: 10.1083/jcb.200412001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sanyal S, Ramaswami M. Spinsters, synaptic defects, and amaurotic idiocy. Neuron. 2002;36:335–8. doi: 10.1016/S0896-6273(02)01015-2. [DOI] [PubMed] [Google Scholar]
- Nixon RA. Endosome function and dysfunction in Alzheimer's disease and other neurodegenerative diseases. Neurobiol Aging. 2005;26:373–82. doi: 10.1016/j.neurobiolaging.2004.09.018. [DOI] [PubMed] [Google Scholar]
- Bahr BA, Bendiske J. The neuropathogenic contributions of lysosomal dysfunction. J Neurochem. 2002;83:481–9. doi: 10.1046/j.1471-4159.2002.01192.x. [DOI] [PubMed] [Google Scholar]
- Cataldo AM, Hamilton DJ, Barnett JL, Paskevich PA, Nixon RA. Properties of the endosomal-lysosomal system in the human central nervous system: disturbances mark most neurons in populations at risk to degenerate in Alzheimer's disease. J Neurosci. 1996;16:186–99. doi: 10.1523/JNEUROSCI.16-01-00186.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Finley KD, Edeen PT, Cumming RC, Mardahl-Dumesnil MD, Taylor BJ, Rodriguez MH, Hwang CE, Benedetti M, McKeown M. blue cheese mutations define a novel, conserved gene involved in progressive neural degeneration. J Neurosci. 2003;23:1254–64. doi: 10.1523/JNEUROSCI.23-04-01254.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simonsen A, Cumming RC, Lindmo K, Galaviz V, Cheng S, Rusten TE, Finley KD. Genetic Modifiers of the Drosophila Blue Cheese Gene Link Defects in Lysosomal Transport With Decreased Life Span and Altered Ubiquitinated-Protein Profiles. Genetics. 2007;176:1283–1297. doi: 10.1534/genetics.106.065011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bilen J, Bonini NM. Drosophila as a model for human neurodegenerative disease. Annu Rev Genet. 2005;39:153–71. doi: 10.1146/annurev.genet.39.110304.095804. [DOI] [PubMed] [Google Scholar]
- McGowan E, Eriksen J, Hutton M. A decade of modeling Alzheimer's disease in transgenic mice. Trends Genet. 2006;22:281–9. doi: 10.1016/j.tig.2006.03.007. [DOI] [PubMed] [Google Scholar]
- Loo LH, Wu LF, Altschuler SJ. Image-based multivariate profiling of drug responses from single cells. Nat Methods. 2007;4:445–53. doi: 10.1038/nmeth1032. [DOI] [PubMed] [Google Scholar]
- Sweeney ST, Davis GW. Unrestricted synaptic growth in spinster-a late endosomal protein implicated in TGF-beta-mediated synaptic growth regulation. Neuron. 2002;36:403–16. doi: 10.1016/S0896-6273(02)01014-0. [DOI] [PubMed] [Google Scholar]
- Brand AH, Perrimon N. Targeted gene expression as a means of altering cell fates and generating dominant phenotypes. Development. 1993;118:401–15. doi: 10.1242/dev.118.2.401. [DOI] [PubMed] [Google Scholar]
- Keshishian H, Broadie K, Chiba A, Bate M. The drosophila neuromuscular junction: a model system for studying synaptic development and function. Annu Rev Neurosci. 1996;19:545–75. doi: 10.1146/annurev.ne.19.030196.002553. [DOI] [PubMed] [Google Scholar]
- Canny J. A computational approach to edge detection. IEEE Trans Pattern Analysis and Machine Intelligence. 1986;8:679–698. [PubMed] [Google Scholar]
- Otsu N. A thresholding selection method from gray level histogram. IEEE Trans System, Man, Cybernetics. 1979;9:62–66. doi: 10.1109/TSMC.1979.4310076. [DOI] [Google Scholar]
- Sussman MS, Wright GA. Factors affecting the correlation coefficient template matching algorithm with application to real-time 2-D coronary artery MR imaging. IEEE Trans Medical Imaging. 2003;22:206–216. doi: 10.1109/TMI.2002.808363. [DOI] [PubMed] [Google Scholar]
- Lewis JP. Fast normalized cross-correlation. Vision Interface: Canadian Image Processing and Pattern Recognition Society. 1995;1995:120–123. [Google Scholar]
- Lindeberg T. Feature detection with automatic scale selection. International Journal of Computer Vision. 1998;30:77–116. [Google Scholar]
- Sage D, Neumann FR, Hediger F, Gasser SM, Unser M. Automatic Tracking of Individual Fluorescence Particles: Application to the Study of Chromosome Dynamics. IEEE Transactions on Image Processing. 2005;14:1372–1383. doi: 10.1109/TIP.2005.852787. [DOI] [PubMed] [Google Scholar]
- Duda RO, PE Hart, DG Stork. Pattern Classification. New York: John Wiley & Sons; 2001. [Google Scholar]
- Hand DJ. Wiley Series in Probability and Mathematical Statistics. Chichester: Wiley; 1981. Discrimination and classification. [Google Scholar]
- Weszka J, Dyer C, Rosenfeld A. A comparative study of texture measure for terrain classification. IEEE Trans Sys Man Cybernetics. 1976;SMC-6:269–285. [Google Scholar]
- Fisher RA. Review of Statistical Methods for Research Workers. Eugenics Review. 1926;18:148–150. [Google Scholar]
- Cataldo AM, Hamilton DJ, Barnett JL, Paskevich PA, Nixon RA. Abnormalities of the endosomal-lysosomal system in Alzheimer's disease: relationship to disease pathogenesis. Adv Exp Med Biol. 1996;389:271–80. [PubMed] [Google Scholar]
- Rørth P. A modular misexpression screen in Drosophila detecting tissue-specific phenotypes. PNAS. 1996;93:12418–12422. doi: 10.1073/pnas.93.22.12418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kraut R, Menon K, Zinn K. A gain-of-function screen for genes controlling motor axon guidance and synaptogenesis in Drosophila. Curr Biol. 2001;11:417–30. doi: 10.1016/S0960-9822(01)00124-5. [DOI] [PubMed] [Google Scholar]
- Fujioka M, Lear BC, Landgraf M, Yusibova GL, Zhou J, Riley KM, Patel NH, Jaynes JB. Even-skipped, acting as a repressor, regulates axonal projections in Drosophila. Development. 2003;130:5385–5400. doi: 10.1242/dev.00770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khodosh R, Augsburger A, Schwarz TL, Garrity PA. Bchs, a BEACH domain protein, antagonizes Rab11 in synapse morphogenesis and other developmental events. Development. 2006;133:4655–65. doi: 10.1242/dev.02650. [DOI] [PubMed] [Google Scholar]
- Scalettar BA, Swedlow JR, Sedat JW, Agard DA. Dispersion, aberration and deconvolution in multi-wavelength fluorescence images. J Microsc. 1996;182:50–60. doi: 10.1046/j.1365-2818.1996.122402.x. [DOI] [PubMed] [Google Scholar]
- Haralick RM, Shapiro LG. Computer and robot vision: Addison-Wesley; 1992. [Google Scholar]
- Gonzales RC, Woods RE. Digital Image Processing. 3. Prentice Hall; 2008. [Google Scholar]
- Kumar S, Ranganath S, Ong TC, Chew FT. A luminance- and contrast-invariant edge-similarity measure. IEEE Pattern Analysis Mach Intelligence. 2006;28:2042–2048. doi: 10.1109/TPAMI.2006.236. [DOI] [PubMed] [Google Scholar]
- Treloar WJ. Digital image processing and their application to the automation of palynology. Hull University; 1992. [Google Scholar]
- Simons K, Gruenberg J. Jamming the endosomal system: lipid rafts and lysosomal storage diseases. Trends Cell Biol. 2000;10:459–62. doi: 10.1016/S0962-8924(00)01847-X. [DOI] [PubMed] [Google Scholar]
- Nixon RA. Niemann-Pick Type C disease and Alzheimer's disease: the APP-endosome connection fattens up. Am J Pathol. 2004;164:757–61. doi: 10.1016/S0002-9440(10)63163-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ward DM, Griffiths GM, Stinchcombe JC, Kaplan J. Analysis of the lysosomal storage disease Chediak-Higashi syndrome. Traffic. 2000;1:816–22. doi: 10.1034/j.1600-0854.2000.011102.x. [DOI] [PubMed] [Google Scholar]