Abstract
We present a methodology for extracting the vascular network in the human retina using Dijkstra’s shortest-path algorithm. Our method preserves vessel thickness, requires no manual intervention, and follows vessel branching naturally and efficiently. To test our method, we constructed a retinal video indirect ophthalmoscopy (VIO) image database from pediatric patients and compared the segmentations achieved by our method and state-of-the-art approaches to a human-drawn gold standard. Our experimental results show that our algorithm outperforms prior state-of-the-art methods, for both single VIO frames and automatically generated, large field-of-view enhanced mosaics. We have made the corresponding dataset and source code freely available online.
OCIS codes: (100.0100) Image processing, (100.2960) Image analysis, (170.4470) Ophthalmology
1. Introduction
Accurate segmentation and evaluation of the anatomical and pathological features of retinal vessels are critical for the diagnosis and study of many ocular diseases. These include retinopathy of prematurity (ROP). ROP is a disorder of the retinal blood vessels that is a major cause of vision loss in premature neonates [1]. Important features of the disease include increased diameter (dilation) as well as increased tortuosity (wiggliness) of the retinal blood vessels in the portion of the retina centered on the optic nerve (the posterior pole). Increased dilation and tortuosity of the blood vessels in the posterior pole (called pre-plus in intermediate, and plus in severe circumstances) is an important indicator of ROP severity. [2]. Subjective assessment of plus and pre-plus disease leads to poor agreement between examiners [3]. Manual segmentation of retinal images is not only demanding for experts and excessively time-consuming for clinical use, but is also inherently subjective, and different annotators often yield different results [4]. To address these difficulties, different approaches for automated segmentation of retinal vessels have been tried, with varying levels of success.
Prior methods can be roughly classified into region- and path-based methods. Region-based methods [5, 6, 7, 8, 9, 10, 11, 12, 13] classify image pixels directly into vessel and non-vessel pixels. Classification relies on local appearance, as measured by the responses of suitable filter banks at various scales and orientations. In unsupervised region-based approaches, these filter responses are combined into a new image, which is then appropriately thresholded to yield the final classification. Methods in this category employ matched filters [5], piecewise thresholding [14], local entropy [15], and quadrature filters [12]. Supervised region-based methods, on the other hand, assemble the filter responses into feature vectors that are fed to a classifier, which is trained on hand-labeled data. Techniques used within this framework include ridge detection [10], Gabor wavelet filtering [9], line operators [8], and moment invariants [13]. Other region-based approaches have used region growing [16], mathematical morphology [17], and multiconcavity modeling [11].
The goal of path-based methods [18,19,20,21,22,23,24,25], on the other hand, is primarily to trace the centerline of individual vessels, rather than classifying every pixel in the image. Many path-based approaches also estimate vessel thickness as they track each branch, generally by determining the width of the cross-section perpendicular to the current path. Prior work on two-dimensional branch extraction has addressed this topological ambiguity semi-automatically by relying on user-supplied points, requiring either a single seed point [24] or a pair of start-and end-points [22]. User-supplied one-point methods generally employ ridge detection based on differential geometry [26], while two-point methods find a path between the points that minimizes a cost measure designed to penalize paths that stray from the middle of a vessel. Several of these methods rely on front propagation algorithms, such as the fast marching method [27]. In contrast, as described in Section 2, our tracking methodology forgoes the need for external seed points by being robust to a particular tracker’s initial position.
Existing methods in both categories have been developed primarily for use on high quality retinal fundus images, such as those obtained with the RetCam imaging system (Clarity Medical Systems, Inc., Pleasanton, CA). However, the usual method for diagnosing ROP is the indirect ophthalmoscope (IO). More recently, Video Indirect Ophthalmoscopy (VIO), in which the physician wears a head-mounted video camera during IO evaluations, has emerged as an economical and convenient method for capturing digital retinal images during ROP examinations. In contrast to RetCam, however, VIO data is often of low quality, fraught with reflections from the IO lens, motion blur, low resolution, and sensor noise. A previous study reported that only 24% of randomly selected video sequences can be utilized for semi-automated evaluation of retinal vessel morphology in ROP [24].
In this paper, we propose a hybrid method that extends the path-based methodology into a region-based segmentation scheme for detecting retinal vessels. Our complete approach works in two stages, as illustrated in Fig. 1. The first stage pre-processes the input image to remove both lens and motion artifacts, and to construct a high-contrast vessel map. The second stage builds a forest of tree-like vessel regions through a sequence of exploration waves on the vessel map: the most vessel-like pixel s0 in the image is used as the starting point for an exploration wave that searches for the best tree-like vessel region in the image around s0 by means of the single-source, multi-destination version of Dijkstra’s shortest path algorithm [28]. This exploration returns an entire tree region for part of the vessel system, that is, it handles branching naturally and efficiently, and preserves vessel thickness. When this exploration ends, a new exploration begins at the best remaining starting point s1 in the unexplored part of the image, which yields a new vessel tree region. Our method stops constructing new regions when the best unexplored starting point is no longer likely to be part of the vessel system. Unlike existing single-source, single-destination vessel analysis methods [20, 21, 22, 23], our single-source, multiple-destinations approach automatically explores the complete vasculature in a retinal image, and requires no user intervention whatsoever.
Fig. 1.
Proposed VIO vessel segmentation: In the first stage, VIO images are pre-processed with directional local-contrast filters (DLCF) and LoG-Gabor filters to eliminate artifacts and increase contrast. In the second stage, the best, unvisited vessel pixel in the image is repeatedly chosen as a starting point for a dynamic-programming exploration of the unvisited part of the image. The result of each exploration yields a new tree in the growing forest of vessels. Forest growth stops when the best, unvisited vessel pixel is worse than a predefined threshold.
Furthermore, the initial single-frame image enhancement step can be optionally replaced by a multi-frame image mosaicing technique. We have recently developed such a technique to combine several low-quality VIO frames into a high-quality, large field-of-view (FOV) composite [29]. As our results in Section 3 show, our approach obtains superior segmentation results on both types of –raw and composite– VIO images compared to current state-of-the-art segmentation methods.
The rest of this paper is organized as follows: we first detail our automated dynamic-programming segmentation method in Section 2 and then describe our experiments in Section 3. We present the experimental results in Section 4 and discuss their significance and explore future directions in Section 5.
2. Exploratory Dijkstra forest based vessel segmentation method
We represent each VIO image (or composite) as a graph of nodes, G = (V,E), where each node corresponds to a pixel and the links connecting the nodes are called arcs. In this formulation, the ordered pair of node and arc sets are represented by V and E, respectively. Path-based methods for vessel extraction define the cost of traversing the arc that connects any two neighboring pixels in the image in such a way that arcs between vessel pixels are more likely to have lower cost. Vessel extraction then looks for paths that traverse the image from neighbor to neighbor and have minimum aggregate cost, and are thereby likely to follow vessels. If the cost aggregation rule is associative, minimum-cost paths can be found efficiently [30].
We depart from previous work within this framework in two major ways. First, we find vessel regions, rather than simply vessel paths. In other words, we preserve vessel thickness, rather than merely finding the skeleton, or centerline, of each vessel. This is important, because eye disease diagnosis often requires consideration of vessel thickness. Second, we employ a sequence of searches for vessel regions that start at source points s0, s1, ... automatically selected in decreasing order of their likelihood to be part of a vessel, as detailed in Subsection 2.4. This novelty eliminates the need for a user to select vessel starting points by hand.
Thus, we use the single-source, multiple-destination version of Dijkstra’s shortest path algorithm [28], rather than the single-source, single-destination version used in prior work. In other words, rather than connecting a start point with a destination point, our method explores the image outward from an (automatically selected) source point. This exploratory strategy has two advantages: it eliminates the need for selecting a destination point manually, and it finds vessels as tree-like image regions, thereby accounting for vessel branching naturally and efficiently.
The computational cost of this important change of perspective is trivial, as the only difference between the single-destination and multi-destination algorithms is when they stop: the single-destination algorithm stops when it reaches the designated vertex, while the multidestination algorithm stops when a target threshold on the path cost has been reached. Both versions of Dijkstra’s algorithm have the same computational complexity of O(|E| + |V|log |V|), where | · | indicates the cardinality or size of a set. This complexity is achievable with a heap-based priority queue implementation [31].
2.1. Arcs and Arc Costs
We view each VIO color image as an X × Y × 3 matrix I. Prior to processing, we first remove the image’s artifacts by using directional local contrast filtering (DLCF) as defined in [29]. Figure 2 illustrates the effect of this image enhancement step. A pixel position in I is given by a two-dimensional vector of integers, p = [x,y]T. The value at each pixel position is given by a three-dimensional vector I(p) of red, green, blue values normalized between 0 and 1.
Fig. 2.
DLCF exudate removal: (a) An image from the STARE dataset [14]. (b) The image after DLCF. (c) Matched filtering [5] applied to (a). (d) Matched filtering applied to (b). The non-vascular filter responses around the exudates have been eliminated in (d) without affecting the true vessel responses.
We define two features that determine the arc costs at each pixel p: the green channel intensity Ig(p) and the inverted response F(p) to a Laplacian-of-Gaussian filter followed by a Gabor filter bank, or Laplace-Gabor filtering, as detailed in [29]. The vessel map F maximizes the discriminability of vessels, as illustrated in Fig. 3.
Fig 3.
LoG-Gabor filtering: (a) A sample VIO frame. (b) Frame after LoG-Gabor filtering. (c) A sample mosaic. (d) Mosaic after LoG-Gabor filtering. The isotropic LoG filtering enhances vessel contrast, while the anisotropic Gabor wavelets selectively enhance elongated structures.
To apply Dijkstra’s algorithm to I, we define a weighted lattice graph on the set V = {p} of all pixel locations in the image. There is an arc e = (v,v′) in the arc set E for this directed graph G = (V,E) for any ordered pair of 8-neighbors, that is, whenever
| (1) | 
A non-negative cost is defined on each arc, with the intent that arcs inside and along vessels cost less than arcs that have one or both endpoints outside any vessel. Specifically, we define the cost of arc e as the following convex linear combination:
| (2) | 
and zm(e) indicates the m-th element in the following four-dimensional feature vector:
| (3) | 
Therefore, a low-cost arc is an arc whose destination point v′ is dark (Ig(v′) << 1) and has a low inverted Laplace-Gabor response (F(v′) << 1), and such that the two arc endpoints are similar in both brightness (|Ig(v)–Ig(v′)| << 1) and Laplace-Gabor response (|F(v)–F(v′)|).
The exponential in Eq. 2 provides a non-linear scaling of the arc’s features that emphasizes the divide between vascular and non-vascular feature values, and the scalar α controls the growth rate of the exponential term. In our experiments, we set the values of both α and the coefficients wm based on training images from our dataset, as explained in Section 3.
2.2. Path Costs
A path γ between any two nodes v, v′ in V is composed of a sequence of neighboring lattice locations:
| (4) | 
subject to the constraint that (vi, vi+1) ∈ E, for i ∈ [1,k – 1]. In short, γ is a curve discretized as a sequence of neighboring pixels. The cost of γ is defined as the sum of the costs of its arcs:
| (5) | 
The associative nature of this definition allows splitting a path’s total cost into disjoint sub-path costs at any point along γ:
| (6) | 
with which we can efficiently determine the minimum cost path between any two nodes v and v′. That is, we use Dijkstra’s algorithm to compute:
| (7) | 
where Γ(v,v′) is the set of all possible paths between the two nodes.
2.3. Exploratory Dijkstra Segmentation
Dijkstra’ minimum cost algorithm solves Eq. 7 for any graph with non-negative arc costs [28]. More generally, it finds a minimum cost path γ̃(s,v) between a single source vertex s and (potentially) every other vertex v in the graph.
As discussed earlier, instead of simply connecting user-defined points, we employ an exploratory strategy by using the single-source, many-destinations version of Dijkstra’s method. Starting from a single position s on a major vessel, this strategy enables us to segment this major vessel and all the less prominent vessels that branch out of it, without any need for setting any destination point. Instead, we set an exploration threshold τ on the cost of any path, and find all the minimum-cost paths γ̃ from s in G such that c(γ̃) < τ. Algorithm 1 outlines our exploratory Dijkstra vessel segmentation method.
Algorithm 1.
Exploratory Dijkstra vessel segmentation: starting from a single pixel, the algorithm progressively explores the rest of the image such that every unvisited pixel has a higher minimum path cost than every visited pixel. The algorithm keeps adding pixels until a cost boundary is reached.
With the lattice arc costs defined in Eq. 2, the exploratory Dijkstra algorithm will preferentially visit vascular pixels before exploring non-vascular ones, since the cost to reach the latter is generally much higher. When it stops, it will have visited the Dijkstra region:
| (8) | 
The segmentation’s accuracy is thus dependent on the value of τ. However, our choice of τ is made less sensitive by the exponential in Eq. 2, which increases the separation between the vascular and non-vascular pixel classes. This lower sensitivity reduces both the problem of “leakage”, in which a segmentation goes beyond the correct vessel boundary and the problem of stopping too soon. For our experiments, we set τ based on the training set images, as explained in Section 3.
2.4. Dijkstra Forest
The exploratory Dijkstra method outlined in Subsection 2.3 efficiently segments a Dijkstra region Rτ(s) given a single source vertex s. As Fig. 3 (d) exemplifies, however, the vasculature in the retina extends from more than one primary vessel. Furthermore, the low quality and blur of VIO frames can obscure large sections of the vascular network, and break up the vasculature into several disconnected regions. Therefore, in order to segment all visible vessels better, we extend the single source method to multiple sources.
To this end, we first generate the initial region R0 = Rτ(s0) from a first source point s0 as described above. We then select a new source vertex s1 from those vertices in V that are not part of R0, and generate a new region R1 from it, such that R0 ∩ R1 = ∅. By repeating, we thus form a Dijkstra forest:
| (9) | 
Here, ψ is a threshold on the highest allowable inverted Laplace-Gabor response. We stop adding new regions to the forest when the highest response outside R is higher than ψ. Algorithm 2 outlines the complete Dijkstra forest computation. As with τ, we determine ψ in our experiments using the training set of images in our database (Section 3). In our experiments, each image requires around 10 source vertices.
Algorithm 2.
Dijkstra forest vessel segmentation: The algorithm adds disjoint Dijkstra regions until the minimum inverted Laplace-Gabor response at the source pixel exceeds ψ. The operation V \ R represents {x ∈ V | x ∉ R}.
3. Experiments
To validate the effectiveness of our proposed segmentation method, we collected a new VIO retinal vessel dataset from pediatric patients and manually segmented the corresponding vascular system to produce the associated ground truth. In this section, we outline the dataset construction process and our methodology for comparing the various segmentation methods to the ground truth.
3.1. Benchmark dataset
Existing benchmark retinal vessel segmentation datasets such as the DRIVE [32], STARE [14] and REVIEW [33] databases do not include VIO images. The relatively lower quality and artifacts in VIO images present a number of unique challenges for automated analysis methods. Thus, there is a need for a benchmark VIO dataset. To address this issue, we constructed a thirty-two image database of VIO images, the Vessel Extraction in Video Indirect Ophthalmoscopy (VEVIO) dataset. VEVIO consists of sixteen manually selected frames and sixteen corresponding enhanced large FOV mosaics from sixteen different premature infants imaged. All images are of each patient’s right eye. Figure 4 showcases some of the frames and mosaics in the dataset. Four steps were needed to construct the VEVIO dataset: video recording, manual frame selection, automatic mosaicing and manual vessel segmentation.
Fig. 4.
VEVIO images: Two pairs of manually selected frames (a), (c) and two automatically generated mosaics (b), (d). Although each pair was obtained from the same video, the manual frames were not used to generate the mosaics. The mosaics were constructed using selected source frames as described in [29].
3.1.1. VIO recording
This study was approved by the Duke University Institutional Review Board. Informed consent was obtained from parents or legal guardians of all participating infants. All VIO videos were acquired during ROP clinical bedside examinations at the Duke Medical Center, Durham, NC, USA. Each examination was carried out between August and October 2010. The videos were recorded using a Keeler Wireless Digital Indirect Ophthalmoscope (Keeler Instruments Inc, Broomall, PA, USA). An assistant operated the video recording software provided by Keeler on a computer at the bedside. Each video was recorded at a resolution of 720 × 576 pixels in 24-bit color and saved as an interlaced, compressed Audio Video Interleaved (AVI) file.
3.1.2. Manually selected frames
During the recording of the bedside examination, the assistant viewed a real-time feed of the video being recorded and manually screen-captured a number of frames, using Keeler’s recording software, when she considered that the video feed was well-centered and in focus. One of the authors (MTC) later examined each set of manually captured frames and selected the highest quality image of each right eye.
3.1.3. Automatic mosaics
To generate the corresponding ten mosaics, we applied our automatic mosaicing pipeline [29] to each video. The set of frames suitable for mosaicing into a single image were automatically selected by our method and did not rely on the manually captured frames. From the thousands of frames in each video, our method retained the twenty frames with the highest frame-quality scores. Each mosaic was constructed from five of those twenty frames. While it is possible to construct the mosaic from the highest five scoring frames directly, to ensure that the mosaics had the widest possible field of view we manually selected the final five frames.
3.1.4. Manual vessel segmentation
In order to provide a quantitative assessment of the various automated methods’ performance, we produced a gold standard segmentation by manually tracing all the visible retinal vessels in each of the twenty VIO images. MTC, a practicing ophthalmologist, traced each image in Adobe Photoshop CS3 (Adobe Systems Inc., San Jose, CA) using a Wacom Intuous3 graphics tablet (Wacom Co. Ltd, Kazo-shi, Saitama, Japan). This tablet uses a pressure-sensitive pen that mimics a real brush, thus allowing the user to dynamically alter the thickness of a pen stroke. The set of vessel tracings for each VIO image were then saved as a separate binary image mask.
3.2. Comparison to other methods
We divided the VEVIO dataset into a training set of ten images and a test set of twenty-two images. Each set included frame/mosaic pairs taken from the same videos, so that there was no overlap between the training and test patients. In order to compare our method to existing methods fairly, we contacted a large number of research groups who had developed methods for retinal vessel segmentation. The results presented here were all obtained using the source code of the groups that kindly made their methods available to us.
In this work, we were able to test both supervised and unsupervised state-of-the-art approaches. We obtained source code for the unsupervised methods of Chaudhuri et al. (matched filters) [5] and Chanwimaluang and Fang (local entropy) [15]. We also obtained code for the supervised classification based on Gabor responses of Soares et al. [9]. For the latter, we tested two types of classifiers: Gaussian mixture models (GMM) and K-nearest neighbors (KNN).
For the supervised methods, we trained the different classifiers on the training data using the learning code made available by Soares et al. [9]. For the unsupervised methods, we optimized their parameters by exhaustively determining the values which resulted in the best possible F-measure for the training set. We then kept the parameters fixed for the testing stage. The optimal thresholds for each method are summarized in Table 4. We tested each existing method on the manually selected frames in two ways: (1) using the raw frames directly captured from the video and (2) using the frames after DLCF pre-processing. The raw frames capture how existing methods fare on VIO data as is, while the pre-processed images allowed us to gauge how our Dijkstra forest segmentation itself compared to other methods on the same source data.
4. Results
Our experimental results are summarized in Tables 1, 2 and 3. Each table is ranked according to the F-measure (Appendix A) in the first column. Table 1 includes all twenty-two testing set images (eleven frames and eleven mosaics). As noted above, the testing data includes only data from new patients that were not part of the training data. This table illustrates how a method generalizes to novel data, regardless of image type. The subscripts next to each state-of-the-art method indicate whether we used the raw frames or the frames after DLCF filtering. For all methods, the filtered frames allowed significantly better results. Table 2 includes the segmentation results for the test frames, while Table 3 tallies the results for the test mosaics.
Table 1.
Segmentation results on the test set *
| Method | F-measure | Kappa | Accuracy | Az | 
|---|---|---|---|---|
| The proposed method | 0.5228 (± 0.07) | 0.4987 (± 0.07) | 0.9337 (± 0.05) | 0.8647 (± 0.06) | 
| Matched filters b | 0.489 (± 0.09) | 0.4646 (± 0.09) | 0.9322 (± 0.05) | 0.7977 (± 0.08) | 
| Local entropy b,† | 0.4504 (± 0.11) | 0.4049 (± 0.16) | 0.8839 (± 0.19) | 0.7104 (± 0.1) | 
| Matched filters a,† | 0.3847 (± 0.17) | 0.3313 (± 0.2) | 0.7481 (± 0.34) | 0.7682 (± 0.1) | 
| GMM Gabor b,† | 0.3234 (± 0.19) | 0.3046 (± 0.19) | 0.9341 (± 0.04) | 0.7921 (± 0.17) | 
| GMM Gabor a,† | 0.2861 (± 0.2) | 0.2652 (± 0.19) | 0.9304 (± 0.04) | 0.7716 (± 0.18) | 
| Local entropy a,† | 0.2808 (± 0.23) | 0.2545 (± 0.22) | 0.892 (± 0.17) | 0.7106 (± 0.1) | 
| K-means Gabor b,† | 0.1777 (± 0.12) | 0.1667 (± 0.12) | 0.9328 (± 0.04) | 0.7727 (± 0.16) | 
| K-means Gabor a,† | 0.1536 (± 0.12) | 0.1411 (± 0.12) | 0.9308 (± 0.04) | 0.7599 (± 0.17) | 
The results include the twenty-two test images: eleven manual frames and eleven automatic mosaics. Existing methods were applied to both the raw frames and the frames pre-processed with DLCF.
Raw frames
Pre-processed frames
F-measure: p < 0.05
Table 2.
Segmentation results on the single (not mosaiced) test frames *
| Method | F-measure | Kappa | Accuracy | Az | 
|---|---|---|---|---|
| The proposed method | 0.5403 (± 0.06) | 0.5127 (± 0.06) | 0.9101 (± 0.06) | 0.8773 (± 0.06) | 
| Matched filters b | 0.5025 (± 0.07) | 0.4745 (± 0.07) | 0.9086 (± 0.06) | 0.7735 (± 0.1) | 
| Local entropy b,† | 0.4347 (± 0.1) | 0.3646 (± 0.2) | 0.8123 (± 0.25) | 0.7092 (± 0.05) | 
| Matched filters a,† | 0.2938 (± 0.18) | 0.2078 (± 0.19) | 0.5402 (± 0.4) | 0.7144 (± 0.11) | 
| GMM Gabor b,† | 0.2297 (± 0.19) | 0.2128 (± 0.18) | 0.9153 (± 0.05) | 0.7182 (± 0.18) | 
| GMM Gabor a,† | 0.1549 (± 0.15) | 0.134 (± 0.13) | 0.908 (± 0.05) | 0.6771 (± 0.17) | 
| K-means Gabor b,† | 0.1348 (± 0.12) | 0.1258 (± 0.12) | 0.916 (± 0.05) | 0.7085 (± 0.17) | 
| Local entropy a,† | 0.0955 (± 0.15) | 0.0638 (± 0.15) | 0.8284 (± 0.22) | 0.7097 (± 0.05) | 
| K-means Gabor a,† | 0.0866 (± 0.1) | 0.0746 (± 0.09) | 0.9121 (± 0.05) | 0.6761 (± 0.17) | 
Each method was trained or optimized using the frames in the training set and the parameters where then kept fixed for the testing stage. Existing methods were applied to both the raw frames and the frames pre-processed with DLCF.
Raw frames.
Pre-processed frames
F-measure: p < 0.05
Table 3.
Segmentation results on the test mosaics *
| Method | F-measure | Kappa | Accuracy | Az | 
|---|---|---|---|---|
| The proposed method | 0.5053 (± 0.08) | 0.4847 (± 0.08) | 0.9573 (± 0.01) | 0.8522 (± 0.05) | 
| Matched filters | 0.4755 (± 0.1) | 0.4547 (± 0.1) | 0.9559 (± 0.01) | 0.8219 (± 0.04) | 
| Local entropy | 0.466 (± 0.1) | 0.4453 (± 0.1) | 0.9556 (± 0.01) | 0.7115 (± 0.14) | 
| GMM Gabor | 0.4172 (± 0.15) | 0.3964 (± 0.15) | 0.9529 (± 0.01) | 0.8461 (± 0.12) | 
| K-means Gabor † | 0.2205 (± 0.11) | 0.2076 (± 0.1) | 0.9496 (± 0.01) | 0.8368 (± 0.12) | 
Each method was trained or optimized using the mosaics in the training set and the parameters were then kept fixed for the testing stage.
F-measure: p < 0.05.
Each table includes the mean F-measure, Cohen’s Kappa [34], accuracy (Appendix A), and area under the ROC curve (Az) for each method with the corresponding standard deviation in parentheses. For each image, each metric was calculated inside a region-of-interest (ROI) that only includes the image’s retinal pixels. We obtained each image’s ROI by applying our hue masking method outlined in [29]. In short, hue masking retains only those pixels that match the color profile of the current retina. Figure 5 illustrates the ROI mask for a particular mosaic.
Fig. 5.
VEVIO ROI: (a) The original mosaic. (b) The binary mask outlining the ROI for the mosaic in (a). (c) The corresponding manual gold standard. Only pixels that appear white in (b) are taken into account for the metrics tallied in our results.
Each metric was determined on a pixel-by-pixel basis. For a given automatic segmentation, a pixel is considered a true positive if both it and the matching pixel in the ground truth image are ones. If both are zero, it corresponds to a true negative. A mismatch in which the automatic segmentation produced a one and the ground truth had a zero is a false positive. The converse mismatch is a false negative.
Each of the four metrics captures some form of similarity between a method’s output and the corresponding ground truth. The retinal vessel segmentation literature has traditionally favored accuracy and area under the ROC curve, Az, as the primary metrics [5,6,7,8,9,10,11,13,14,32]. While Az is an adequate measure of classifier robustness, as we argue in Appendix A, we believe the F-measure is a much more appropriate measure than accuracy for analyzing segmentation results in this type of data. Due to the very low prior probability of a pixel being part of a vessel, methods that only segment a small fraction of each image will still obtain competitive accuracy scores. The F-measure, on the other hand, provides a ratio-independent summary of the overlap between two segmentation’s pixel labels. Therefore, the approach of labeling very few pixels as vascular will yield a very low F-measure score due to the large number of false negatives.
We applied a Wilcox signed-rank test between our proposed method’s F-measure distribution and the F-measures of every other method [35]. Methods for which the difference was statistically significant (p < 0.05) are marked with an * in each table.
5. Discussion
As Tables 1, 2 and 3 show, our proposed method compares favorably to existing supervised and unsupervised methods. Regardless of metric, our method consistently outperformed existing state-of-the-art approaches in our experiments by better balancing the likelihood of false positives and negatives. In contrast, Fig. 6 illustrates how a method such as the GMM classifier has good recall, but poor precision, while a more conservative method such as the KNN classifier has better precision, but worse recall. In the first case, the segmentation has too many non-vascular pixels, while the latter segmentation misses a significant portion of the vasculature. Our method’s connectivity contraints allow us to strike a good balance between these two objectives by better disambiguating between similarly valued pixels. In other words, our method is more likely to label a pixel as vascular if it can be directly connected to a large vascular region than if it is isolated, since the latter case is more indicative of noise rather than an actual vessel.
Fig. 6.
Vessel segmentation on a mosaic: (a) Original image (b) Manual segmentation (c) Dijkstra forest (d) Matched filters (e) Local entropy (f) GMM classifier (g) KNN classifier
Finally, it is also worth noting how DLCF pre-processing has a sizable impact on the segmentation results of existing methods. All state-of-the-art methods performed significantly better on pre-processed frames than raw frames. As an extreme example, note in Table 2 the four-fold improvement in the F-measure of the local entropy method when using pre-processed frames.
In the future, we wish to expand our VEVIO database with more images from more patients. The presented experimental results highlight the challenges that VIO data present for vessel segmentation methods. The F-measures reported in this paper indicate significant room for further improvement. To encourage further research in this area, we have made the VEVIO dataset and the MATLAB code that we have developed for this project publically available at http://www.duke.edu/~sf59/Estrada_BOE_2012.htm
Acknowledgments
This work was supported in part by the Knights Templar Eye Foundation, Inc. pediatric ophthalmology research grant, Research to Prevent Blindness 2011 Duke’s Unrestricted Grant award, and a fellowship from the Duke University Center for Theoretical & Mathematical Sciences. We would like to thanks Prof. Yuliya Lokhnygina for her invaluable help with the biostatistical analysis. We would also like to thank the research groups that kindly provided their source code to us, especially Dr. João V. B. Soares.
Appendix A. F-measure vs. accuracy
Traditionally, accuracy has been one of the key metrics for evaluating vessel segmentation results. For a binary classification, this metric is defined thus:
| (A.1) | 
where tp and fp indicate true and false positives respectively, while tn and fn tally true and false negatives. The unbiased F-measure, on the other hand, is given by:
| (A.2) | 
where precision and recall are defined as:
| (A.3) | 
Accuracy becomes less informative when one of the two classes if far more likely than the other, as is the case for vascular vs. non-vascular pixels. On average, vascular pixels only comprise about 5–10% of an image. This means that a classifier that labels all pixels as non-vascular can already boast a 90–95% accuracy. The F-measure, on the other hand, provides a better balance between labeling pixels correctly or incorrectly, since it is not affected by class sizes.
Appendix B. Parameter values
Table 4.
Parameter values
| The proposed method | Exploratory threshold: 5 ×10−5 | Filtering threshold: 0.7 | 
| Matched filters | Raw threshold: 0.5 | Pre-processed threshold: 0.2254 | 
| Local entropy | Raw threshold: 0.5253 | Pre-processed threshold: 0.7677 | 
| GMM Gabor | Raw threshold: 0.5 | Pre-processed threshold: 0.5 | 
| K-means Gabor | Raw threshold: 0.77 | Pre-processed threshold: 0.82 | 
References and links
- 1.Tasman W., Patz A., McNamara J. A., Kaiser R. S., Trese M. T., Smith B. T., “Retinopathy of prematurity: The life of a lifetime disease,” Am. J. Ophthalmol. 141, 167 – 174 (2006). 10.1016/j.ajo.2005.07.034 [DOI] [PubMed] [Google Scholar]
- 2.Gole G. A., Ells A. L., Katz X., Holmstrom G., Fielder A. R., Capone A., Jr, Flynn J. T., Good W. G., Holmes J. M., McNamara J. A., Palmer E. A., Quinn G., E, Shapiro M. J., Trese M. G. J., Wallace D. K., “The international classification of retinopathy of prematurity revisited,” Arch. Ophthalmol. 123, 991–999 (2011). [DOI] [PubMed] [Google Scholar]
- 3.Wallace D. K., Quinn G. E., Freedman S. F., Chiang M. F., “Agreement among pediatric ophthalmologists in diagnosing plus and pre-plus disease in retinopathy of prematurity,” J. Am. Assoc. Pediatric Ophthalmol. Strabismus 12, 352 – 356 (2008). 10.1016/j.jaapos.2007.11.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Chiu S. J., Li X. T., Nicholas P., Toth C. A., Izatt J. A., Farsiu S., “Automatic segmentation of seven retinal layers in SDOCT images congruent with expert manual segmentation,” Opt. Express 18, 19413–19428 (2010). 10.1364/OE.18.019413 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chaudhuri S., Chatterjee S., Katz N., Nelson M., Goldbaum M., “Detection of blood vessels in retinal images using two-dimensional matched filters,” IEEE Trans. Med. Imag. 8, 263–269 (1989). 10.1109/42.34715 [DOI] [PubMed] [Google Scholar]
- 6.Kirbas C., Quek F., “A review of vessel extraction techniques and algorithms,” ACM Comput. Surv. 36, 81–121 (2004). 10.1145/1031120.1031121 [DOI] [Google Scholar]
- 7.Li Q., You J., Zhang L., Bhattacharya P., “Automated retinal vessel segmentation using Gabor filters and scale multiplication,” in Proceedings of System, Man and Cybernetics (IEEE, 2006), pp. 3521–3527 [Google Scholar]
- 8.Ricci E., Perfetti R., “Retinal blood vessel segmentation using line operators and support vector classification,” IEEE Trans. Med. Imag. 26, 1357–1365 (2007). 10.1109/TMI.2007.898551 [DOI] [PubMed] [Google Scholar]
- 9.Soares J., Leandro J., Cesar R., Jr, Jelinek H., Cree M., “Retinal vessel segmentation using the 2-D Gabor wavelet and supervised classification,” IEEE Trans. Med. Imag. 25, 1214–1222 (2006). 10.1109/TMI.2006.879967 [DOI] [PubMed] [Google Scholar]
- 10.Staal J., Abràmoff M., Niemeijer M., Viergever M., van Ginneken B., “Ridge-based vessel segmentation in color images of the retina,” IEEE Trans. Med. Imag. 23, 501–509 (2004). 10.1109/TMI.2004.825627 [DOI] [PubMed] [Google Scholar]
- 11.Lam B., Gao Y., Liew A., “General retinal vessel segmentation using regularization-based multiconcavity modeling,” IEEE Trans. Med. Imag. 29, 1369–1381 (2010). 10.1109/TMI.2010.2043259 [DOI] [PubMed] [Google Scholar]
- 12.Lathen G., Jonasson J., Borga M., “Blood vessel segmentation using multi-scale quadrature filtering,” Pattern Recogn. Lett. 31, 762–767 (2010). 10.1016/j.patrec.2009.09.020 [DOI] [Google Scholar]
- 13.Marín D., Aquino A., Gegúndez-Arias M., Bravo J., “A new supervised method for blood vessel segmentation in retinal images by using gray-level and moment invariants-based features,” IEEE Trans. Med. Imag. 30, 146–158 (2011). 10.1109/TMI.2010.2064333 [DOI] [PubMed] [Google Scholar]
- 14.Hoover A., Kouznetsova V., Goldbaum M., “Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response,” IEEE Trans. Med. Imag. 19, 203–210 (2002). 10.1109/42.845178 [DOI] [PubMed] [Google Scholar]
- 15.Chanwimaluang T., Fan G., “An efficient blood vessel detection algorithm for retinal images using local entropy thresholding,” in Proceedings of the International Symposium on Circuits and Systems (IEEE; 2003), pp. 21–24 [Google Scholar]
- 16.Martínez-Pérez M., Hughes A., Stanton A., Thom S., Bharath A., Parker K., “Retinal blood vessel segmentation by means of scale-space analysis and region growing,” in Proceedings of Medical Image Computing and Computer-Assisted Intervention (Springer; 1999), pp. 90–97 10.1007/10704282_10 [DOI] [Google Scholar]
- 17.Zana F., Klein J., “Segmentation of vessel-like patterns using mathematical morphology and curvature evaluation,” IEEE Trans. Image Process. 10, 1010–1019 (2002). 10.1109/83.931095 [DOI] [PubMed] [Google Scholar]
- 18.Pedersen L., Grunkin M., Ersboll B., Madsen K., Larsen M., Christoffersen N., Skands U., “Quantitative measurement of changes in retinal vessel diameter in ocular fundus images,” Pattern Recogn. Lett. (21), 1215–1223 (2000). 10.1016/S0167-8655(00)00084-2 [DOI] [Google Scholar]
- 19.Cree M., Cornforth D., Jelinek HF., “Vessel segmentation and tracking using a two-dimensional model,” in Proceedings of Image and Vision Computing New Zealand (IVCNZ, 2005), pp. 345–350 [Google Scholar]
- 20.Benmansour F., Cohen L., “Tubular structure segmentation based on minimal path method and anisotropic enhancement,” Int. J. Comput. Vision 92, 192–210 (2011). 10.1007/s11263-010-0331-0 [DOI] [Google Scholar]
- 21.Li H., Yezzi A., “Vessels as 4-D curves: Global minimal 4-D paths to extract 3-D tubular surfaces and centerlines,” IEEE Trans. Med. Imag. 26, 1213–1223 (2007). 10.1109/TMI.2007.903696 [DOI] [PubMed] [Google Scholar]
- 22.Pechaud M., Keriven R., Peyre G., “Extraction of tubular structures over an orientation domain,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (IEEE Computer Society, 2009), pp. 336–342 [Google Scholar]
- 23.Wink O., Niessen W., Viergever M., “Multiscale vessel tracking,” IEEE Trans. Med. Imag. 23, 130–133 (2004). 10.1109/TMI.2003.819920 [DOI] [PubMed] [Google Scholar]
- 24.Ahmad S., Wallace D., Freedman S., Zhao Z., “Computer-assisted assessment of plus disease in retinopathy of prematurity using video indirect ophthalmoscopy images,” Retina 28, 1458–1462 (2008). 10.1097/IAE.0b013e3181803c14 [DOI] [PubMed] [Google Scholar]
- 25.Kiely A., Wallace D., Freedman S., Zhao Z., “Computer-assisted measurement of retinal vascular width and tortuosity in retinopathy of prematurity,” Arch. Ophthalmol. 128, 847–852 (2010). 10.1001/archophthalmol.2010.133 [DOI] [PubMed] [Google Scholar]
- 26.Lindeberg T., “Edge detection and ridge detection with automatic scale selection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (IEEE Computer Society, 1996), pp. 465–470 [Google Scholar]
- 27.Sethian J., Level Set Methods and Fast Marching Methods (Cambridge University Press, 1999). [Google Scholar]
- 28.Dijkstra E., “A note on two problems in connexion with graphs,” Numer. Math. 1, 269–271 (1959). 10.1007/BF01386390 [DOI] [Google Scholar]
- 29.Estrada R., Tomasi C., Cabrera M., Wallace D., Freedman S., Farsiu S., “Enhanced video indirect ophthalmoscopy (VIO) via robust mosaicing,” Biomed. Opt. Express 2, 2871–2887 (2011). 10.1364/BOE.2.002871 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bellman R., Dynamic Programming (Dover, 2003). [Google Scholar]
- 31.Cormen T., Leiserson C., Rivest R., Stein C., Introduction to Algorithms (MIT Press, 2001). [Google Scholar]
- 32.Niemeijer M., Staal J., van Ginneken B., Loog M., Abramoff M., “Comparative study of retinal vessel segmentation methods on a new publicly available database,” Proc. SPIE 5370, 648–656 (2004). 10.1117/12.535349 [DOI] [Google Scholar]
- 33.Al-Diri B., Hunter A., Steel D., Habib M., Hudaib T., Berry S., “REVIEW - A reference data set for retinal vessel profiles,” in Proceedings of the IEEE Conference on Engineering in Medicine and Biology Society (IEEE, 2008), pp. 2262–2265 [DOI] [PubMed] [Google Scholar]
- 34.Cohen J., “A Coefficient of agreement for nominal scales,” Educ. Psychol. Meas. 20, 37–46 (1960). 10.1177/001316446002000104 [DOI] [Google Scholar]
- 35.Gibbons J., Chakraborti S., Nonparametric Statistical Inference (CRC Press, 2003). [Google Scholar]








